Shivanshu Verma

Triple Preference Optimization: Achieving Better Alignment with Less Data in a Single Step Optimization

May 26, 2024
Via arXiv

Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks

Apr 23, 2024
Via arXiv