Picture for Jiuding Duan

Jiuding Duan

A Generalized Model for Multidimensional Intransitivity

Add code
Sep 28, 2024
Viaarxiv icon

VickreyFeedback: Cost-efficient Data Construction for Reinforcement Learning from Human Feedback

Add code
Sep 27, 2024
Viaarxiv icon