Hao Yi

TSO: Self-Training with Scaled Preference Optimization

Aug 31, 2024
Via arXiv