Picture for Dimitri Coelho Mollo

Dimitri Coelho Mollo

Shammie

AI Alignment through Reinforcement Learning from Human Feedback? Contradictions and Limitations

Add code
Jun 26, 2024
Viaarxiv icon

AI-as-exploration: Navigating intelligence space

Add code
Feb 05, 2024
Viaarxiv icon

ACROCPoLis: A Descriptive Framework for Making Sense of Fairness

Add code
Apr 19, 2023
Figure 1 for ACROCPoLis: A Descriptive Framework for Making Sense of Fairness
Figure 2 for ACROCPoLis: A Descriptive Framework for Making Sense of Fairness
Figure 3 for ACROCPoLis: A Descriptive Framework for Making Sense of Fairness
Figure 4 for ACROCPoLis: A Descriptive Framework for Making Sense of Fairness
Viaarxiv icon

The Vector Grounding Problem

Add code
Apr 04, 2023
Viaarxiv icon

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Add code
Jun 10, 2022
Viaarxiv icon