Íñigo Martínez de Rituerto de Troya

AI Alignment through Reinforcement Learning from Human Feedback? Contradictions and Limitations

Jun 26, 2024
Via arXiv