Title: Accelerating Direct Preference Optimization with Prefix Sharing

Oct 27, 2024

Franklin Wang, Sumanth Hegde

Abstract: Offline paired preference optimization algorithms have become a popular approach for fine-tuning on preference data, outperforming traditional supervised fine-tuning in various tasks. However, traditional implementations often involve redundant computations, especially for tasks with long shared prompts. We introduce prefix sharing for preference tuning, a novel technique that processes chosen and rejected responses as one sequence with a shared prefix. To prevent cross-response contamination, we use a custom block-sparse attention mask. Our method achieves $1.1$-$1.5\times$ improvement in training throughput on popular DPO datasets, without any effect on convergence. When combined with sequence packing, we observe consistent $1.3$-$1.6\times$ speedups, benefiting even datasets with smaller sequence lengths. While we focus on Direct Preference Optimization (DPO), our approach is applicable to other paired preference tuning methods. By enhancing computational efficiency, our work contributes to making preference-based fine-tuning more accessible for a wider range of applications and model sizes. We open-source our code at https://github.com/frankxwang/dpo-prefix-sharing.

* To appear in NeurIPS 2024 in the Fine-Tuning in Machine Learning Workshop
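
The masking idea described in the abstract can be illustrated with a small sketch. It is not the authors' implementation (the repository linked in the abstract is the authoritative reference); it assumes a single sequence laid out as [prefix | chosen | rejected], and the function name `prefix_sharing_mask` and its parameters are hypothetical. The sketch builds a dense boolean mask for clarity, whereas the abstract describes a block-sparse mask.

```python
def prefix_sharing_mask(prefix_len, chosen_len, rejected_len):
    """Build a dense boolean attention mask for one prefix-shared sequence.

    Assumed layout (an illustration, not the paper's code):
        [prefix tokens | chosen response tokens | rejected response tokens]

    mask[q][k] is True when query position q may attend to key position k.
    """
    total = prefix_len + chosen_len + rejected_len
    chosen_end = prefix_len + chosen_len
    mask = [[False] * total for _ in range(total)]
    for q in range(total):
        for k in range(q + 1):  # causal: attend only to current and earlier positions
            q_in_rejected = q >= chosen_end
            k_in_chosen = prefix_len <= k < chosen_end
            # Block rejected -> chosen attention so the two responses
            # cannot contaminate each other; both still see the prefix.
            if q_in_rejected and k_in_chosen:
                continue
            mask[q][k] = True
    return mask


# Small demo: 2 prefix tokens, 2 chosen tokens, 2 rejected tokens.
for row in prefix_sharing_mask(2, 2, 2):
    print("".join("x" if allowed else "." for allowed in row))
# x.....
# xx....
# xxx...
# xxxx..
# xx..x.
# xx..xx
```

In the printed mask, the last two rows (the rejected response) attend to the prefix and to themselves but skip the two chosen-response columns. Under this layout the prefix is processed once instead of twice, which is the source of the redundant computation the abstract refers to.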
