Yijia Luo

Adaptive Dense Reward: Understanding the Gap Between Action and Reward Space in Alignment

Oct 23, 2024
Via arXiv