Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yinzhuo Chen

PMoL: Parameter Efficient MoE for Preference Mixing of LLM Alignment

Nov 02, 2024

Dongxu Liu, Bing Xu, Yinzhuo Chen, Bufan Xu, Wenpeng Lu, Muyun Yang, Tiejun Zhao

Figure 1 for PMoL: Parameter Efficient MoE for Preference Mixing of LLM Alignment

Figure 2 for PMoL: Parameter Efficient MoE for Preference Mixing of LLM Alignment

Figure 3 for PMoL: Parameter Efficient MoE for Preference Mixing of LLM Alignment

Figure 4 for PMoL: Parameter Efficient MoE for Preference Mixing of LLM Alignment

Abstract:Reinforcement Learning from Human Feedback (RLHF) has been proven to be an effective method for preference alignment of large language models (LLMs) and is widely used in the post-training process of LLMs. However, RLHF struggles with handling multiple competing preferences. This leads to a decrease in the alignment of LLMs with human preferences. To address this issue, we propose Preference Mixture of LoRAs (PMoL) from the perspective of model architecture, which can adapt to any number of preferences to mix. PMoL combines Mixture of Experts (MoE) and Low Rank Adaptor (LoRA). This architecture is innovatively applied to the research of preference alignment and has achieved significant performance improvement. The expert group soft loss is used to enable MoE with the ability to mix preferences. Through comprehensive evaluation by the reward model and GPT-4o, the experiment results show that PMoL has superior preference mixing capabilities compared to baseline methods. PMoL achieves better preference alignment with lower training costs.

Via

Access Paper or Ask Questions