Su Fong

SuperHF: Supervised Iterative Learning from Human Feedback

Oct 25, 2023
Via arXiv