Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of RLHF

Sep 16, 2023

Simeng Sun, Dhawal Gupta, Mohit Iyyer

Figure 1 for Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of RLHF

Figure 2 for Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of RLHF

Figure 3 for Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of RLHF

Figure 4 for Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of RLHF

Share this with someone who'll enjoy it:

Abstract:During the last stage of RLHF, a large language model is aligned to human intents via PPO training, a process that generally requires large-scale computational resources. In this technical report, we empirically investigate an efficient implementation of RLHF using low-rank adaptation (LoRA), which allows us to align the LLaMA 7B checkpoint on the Alpaca dataset using only two A100 GPUs instead of the eight required for full model fine-tuning. Despite tuning only 0.2% of LLaMA 7B's parameters, our implementation achieves better performance than the publicly-released AlpacaFarm checkpoint with full model fine-tuning. Next, we analyze several configurations of our LoRA-based PPO implementation, varying the form of the KL regularization term in the training objective. We find that (1) removing this penalty term does not harm performance on the AlpacaFarm evaluation set under our LoRA setup; (2) other regularizers, such as Jensen-Shannon divergence, lead to improved performance; and (3) while PPO training negatively impacts the factuality of model-generated responses, training with LoRA largely mitigates this effect. We release our code and pretrained checkpoints to facilitate future research on more efficient RLHF.

View paper on

Share this with someone who'll enjoy it:

Title:Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of RLHF

Paper and Code