Picture for Stephane Hatgis-Kessell

Stephane Hatgis-Kessell

Learning Optimal Advantage from Preferences and Mistaking it for Reward

Add code
Oct 03, 2023
Viaarxiv icon

Models of human preference for learning reward functions

Add code
Jun 05, 2022
Figure 1 for Models of human preference for learning reward functions
Figure 2 for Models of human preference for learning reward functions
Figure 3 for Models of human preference for learning reward functions
Figure 4 for Models of human preference for learning reward functions
Viaarxiv icon