Fangzhi Zhu

MPPO: Multi Pair-wise Preference Optimization for LLMs with Arbitrary Negative Samples

Dec 13, 2024
Via arXiv