Title: $f$-PO: Generalizing Preference Optimization with $f$-divergence Minimization

Oct 29, 2024

Jiaqi Han, Mingjian Jiang, Yuxuan Song, Jure Leskovec, Stefano Ermon, Minkai Xu

Abstract: Preference optimization has made significant progress recently, with numerous methods developed to align language models with human preferences. This paper introduces $f$-divergence Preference Optimization ($f$-PO), a novel framework that generalizes and extends existing approaches. $f$-PO minimizes $f$-divergences between the optimized policy and the optimal policy, encompassing a broad family of alignment methods using various divergences. Our approach unifies previous algorithms like DPO and EXO, while offering new variants through different choices of $f$-divergences. We provide theoretical analysis of $f$-PO's properties and conduct extensive experiments on state-of-the-art language models using benchmark datasets. Results demonstrate $f$-PO's effectiveness across various tasks, achieving superior performance compared to existing methods on popular benchmarks such as AlpacaEval 2, Arena-Hard, and MT-Bench. Additionally, we present ablation studies exploring the impact of different $f$-divergences, offering insights into the trade-offs between regularization and performance in offline preference optimization. Our work contributes both practical algorithms and theoretical understanding to the field of language model alignment. Code is available at https://github.com/MinkaiXu/fPO.
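The abstract refers to "different choices of $f$-divergences" without giving formulas. As a generic illustration of that idea only, the sketch below evaluates the textbook definition $D_f(p \| q) = \sum_x q(x)\, f\big(p(x)/q(x)\big)$ on small discrete distributions for three standard generator functions. It is not the $f$-PO training objective and is not taken from the authors' repository; the function names (`f_divergence`, `kl_generator`, and so on) are hypothetical, and the example assumes $q(x) > 0$ everywhere.

```python
import math


def f_divergence(p, q, f):
    """Textbook f-divergence D_f(p || q) = sum_x q(x) * f(p(x) / q(x)).

    Assumes p and q are discrete distributions over the same support
    and that every q(x) is strictly positive.
    """
    return sum(qi * f(pi / qi) for pi, qi in zip(p, q))


def kl_generator(t):
    # f(t) = t log t yields the forward KL divergence KL(p || q).
    # The convention 0 log 0 = 0 is used at t = 0.
    return t * math.log(t) if t > 0 else 0.0


def reverse_kl_generator(t):
    # f(t) = -log t yields the reverse KL divergence KL(q || p).
    # Requires t > 0, i.e. p(x) > 0 wherever q(x) > 0.
    return -math.log(t)


def js_generator(t):
    # f(t) = 0.5 * (t log t - (t + 1) log((t + 1) / 2)) yields the
    # Jensen-Shannon divergence between p and q.
    t_log_t = t * math.log(t) if t > 0 else 0.0
    return 0.5 * (t_log_t - (t + 1.0) * math.log((t + 1.0) / 2.0))


if __name__ == "__main__":
    # Hypothetical toy distributions, chosen only for illustration.
    p = [0.5, 0.5]
    q = [0.25, 0.75]
    for name, f in [
        ("forward KL", kl_generator),
        ("reverse KL", reverse_kl_generator),
        ("Jensen-Shannon", js_generator),
    ]:
        print(f"{name}: {f_divergence(p, q, f):.4f}")
```

Each generator is convex with $f(1) = 0$, so every divergence here is zero when the two distributions coincide and positive otherwise. Swapping the generator changes how strongly mismatches in different regions of the support are penalized, which is the kind of trade-off the abstract's ablation over $f$-divergences refers to.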
