Picture for Qianen Zhang

Qianen Zhang

DPO-Shift: Shifting the Distribution of Direct Preference Optimization

Add code
Feb 11, 2025
Viaarxiv icon