Picture for Antoine Scheid

Antoine Scheid

Optimal Design for Reward Modeling in RLHF

Add code
Oct 23, 2024
Viaarxiv icon

Learning to Mitigate Externalities: the Coase Theorem with Hindsight Rationality

Add code
Jul 03, 2024
Viaarxiv icon

Incentivized Learning in Principal-Agent Bandit Games

Add code
Mar 06, 2024
Figure 1 for Incentivized Learning in Principal-Agent Bandit Games
Figure 2 for Incentivized Learning in Principal-Agent Bandit Games
Figure 3 for Incentivized Learning in Principal-Agent Bandit Games
Figure 4 for Incentivized Learning in Principal-Agent Bandit Games
Viaarxiv icon