Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Feiyu Zhang

IncreLoRA: Incremental Parameter Allocation Method for Parameter-Efficient Fine-tuning

Aug 23, 2023

Feiyu Zhang, Liangzhi Li, Junhao Chen, Zhouqiang Jiang, Bowen Wang, Yiming Qian

Figure 1 for IncreLoRA: Incremental Parameter Allocation Method for Parameter-Efficient Fine-tuning

Figure 2 for IncreLoRA: Incremental Parameter Allocation Method for Parameter-Efficient Fine-tuning

Figure 3 for IncreLoRA: Incremental Parameter Allocation Method for Parameter-Efficient Fine-tuning

Figure 4 for IncreLoRA: Incremental Parameter Allocation Method for Parameter-Efficient Fine-tuning

Abstract:With the increasing size of pre-trained language models (PLMs), fine-tuning all the parameters in the model is not efficient, especially when there are a large number of downstream tasks, which incur significant training and storage costs. Many parameter-efficient fine-tuning (PEFT) approaches have been proposed, among which, Low-Rank Adaptation (LoRA) is a representative approach that injects trainable rank decomposition matrices into every target module. Yet LoRA ignores the importance of parameters in different modules. To address this problem, many works have been proposed to prune the parameters of LoRA. However, under limited training conditions, the upper bound of the rank of the pruned parameter matrix is still affected by the preset values. We, therefore, propose IncreLoRA, an incremental parameter allocation method that adaptively adds trainable parameters during training based on the importance scores of each module. This approach is different from the pruning method as it is not limited by the initial number of training parameters, and each parameter matrix has a higher rank upper bound for the same training overhead. We conduct extensive experiments on GLUE to demonstrate the effectiveness of IncreLoRA. The results show that our method owns higher parameter efficiency, especially when under the low-resource settings where our method significantly outperforms the baselines. Our code is publicly available.

Via

Access Paper or Ask Questions