Title: Does RLHF Scale? Exploring the Impacts From Data, Model, and Method

Dec 08, 2024

Zhenyu Hou, Pengfan Du, Yilin Niu, Zhengxiao Du, Aohan Zeng, Xiao Liu, Minlie Huang, Hongning Wang, Jie Tang, Yuxiao Dong

Figures 1-4 for Does RLHF Scale? Exploring the Impacts From Data, Model, and Method

Abstract: This study explores the scaling properties of Reinforcement Learning from Human Feedback (RLHF) in Large Language Models (LLMs). Although RLHF is considered an important step in the post-training of LLMs, its scaling potential is still largely unknown. We systematically analyze key components in the RLHF framework (model size, data composition, and inference budget) and their impacts on performance. Our findings show that increasing data diversity and volume improves reward model performance, helping process-supervision models scale better. For policy training, more response samples per prompt boost performance initially but quickly plateau. Larger reward models offer modest gains in policy training, and larger policy models benefit less from RLHF with a fixed reward model. Overall, RLHF scales less efficiently than pretraining, with diminishing returns from additional computational resources. Based on these observations, we propose strategies to optimize RLHF performance within computational limits.
