Direct Preference Optimization (DPO) has been proposed as a promising alternative to Proximal Policy Optimization (PPO)-based Reinforcement Learning from Human Feedback (RLHF). However, empirical evaluations consistently show that DPO underperforms standard RLHF pipelines. In this work, we conduct a systematic analysis of DPO's training dynamics and identify gradient imbalance as a critical limitation. We demonstrate theoretically and empirically that this imbalance perturbs optimization trajectories, destabilizes learning, and induces suboptimal convergence. To address this issue, we propose Balanced-DPO, a simple yet effective modification of the DPO objective that introduces a computationally efficient gradient reweighting mechanism. Our experiments demonstrate the effectiveness of Balanced-DPO, validating the theoretical findings and confirming that addressing gradient imbalance is key to improving DPO's performance. These results highlight a promising direction for future research.
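
For concreteness, the sketch below contrasts the standard DPO loss with a hypothetical gradient-reweighted variant, assuming PyTorch and per-sequence summed log-probabilities as inputs. The function names, the hyperparameters `beta` and `eps`, and in particular the detached inverse-magnitude weights `w_c`/`w_r` are illustrative assumptions standing in for the paper's actual reweighting rule, which is not specified in this section.

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    """Standard DPO: -log sigmoid(beta * (chosen logratio - rejected logratio))."""
    chosen_logratios = policy_chosen_logps - ref_chosen_logps
    rejected_logratios = policy_rejected_logps - ref_rejected_logps
    return -F.logsigmoid(beta * (chosen_logratios - rejected_logratios)).mean()

def balanced_dpo_loss(policy_chosen_logps, policy_rejected_logps,
                      ref_chosen_logps, ref_rejected_logps,
                      beta=0.1, eps=1e-6):
    """Hypothetical reweighted variant: detached per-batch weights rescale the
    gradient contribution of each branch of the implicit reward margin."""
    chosen_logratios = policy_chosen_logps - ref_chosen_logps
    rejected_logratios = policy_rejected_logps - ref_rejected_logps
    # Detached weights: no gradient flows through w_c / w_r themselves, so they
    # act purely as gradient-rescaling factors that balance the two branches.
    w_c = 1.0 / (chosen_logratios.detach().abs().mean() + eps)
    w_r = 1.0 / (rejected_logratios.detach().abs().mean() + eps)
    return -F.logsigmoid(beta * (w_c * chosen_logratios
                                 - w_r * rejected_logratios)).mean()
```

Because the weights are detached, the modification changes only how gradient mass is distributed between the chosen and rejected terms, at negligible extra compute over standard DPO.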