Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:R-LoRA: Random Initialization of Multi-Head LoRA for Multi-Task Learning

Feb 21, 2025

Jinda Liu, Yi Chang, Yuan Wu

Figure 1 for R-LoRA: Random Initialization of Multi-Head LoRA for Multi-Task Learning

Figure 2 for R-LoRA: Random Initialization of Multi-Head LoRA for Multi-Task Learning

Figure 3 for R-LoRA: Random Initialization of Multi-Head LoRA for Multi-Task Learning

Figure 4 for R-LoRA: Random Initialization of Multi-Head LoRA for Multi-Task Learning

Share this with someone who'll enjoy it:

Abstract:Fine-tuning large language models (LLMs) is prohibitively expensive in terms of computational and memory costs. Low-rank Adaptation (LoRA), as one of the most popular parameter-efficient fine-tuning (PEFT) methods, offers a cost-effective alternative by approximating the model changes $\Delta W \in \mathbb{R}^{m \times n}$ through the product of down-projection matrix $A \in \mathbb{R}^{m \times r}$ and head matrix $B \in \mathbb{R}^{r \times n}$, where $r \ll \min(m, n)$. In real-world scenarios, LLMs are fine-tuned on data from multiple domains to perform tasks across various fields, embodying multi-task learning (MTL). LoRA often underperforms in such complex scenarios. To enhance LoRA's capability in multi-task learning, we propose R-LoRA, which incorporates Multi-Head Randomization. Multi-Head Randomization diversifies the head matrices through Multi-Head Random Initialization and Multi-Head Dropout, enabling more efficient learning of task-specific features while maintaining shared knowledge representation. Extensive experiments demonstrate that R-LoRA is better at capturing task-specific knowledge, thereby improving performance in multi-task scenarios. The code is available at https://github.com/jinda-liu/R-LoRA.

* 9 pages, 10 figures

View paper on

Share this with someone who'll enjoy it:

Title:R-LoRA: Random Initialization of Multi-Head LoRA for Multi-Task Learning

Paper and Code