Qiugen Xiao

HRLAIF: Improvements in Helpfulness and Harmlessness in Open-domain Reinforcement Learning From AI Feedback

Mar 14, 2024
Via arXiv