Picture for Xiaoyuan Chen

Xiaoyuan Chen

HRLAIF: Improvements in Helpfulness and Harmlessness in Open-domain Reinforcement Learning From AI Feedback

Add code
Mar 14, 2024
Viaarxiv icon