Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Dawid Jan Kopiczko

VeRA: Vector-based Random Matrix Adaptation

Oct 17, 2023

Dawid Jan Kopiczko, Tijmen Blankevoort, Yuki Markus Asano

Figure 1 for VeRA: Vector-based Random Matrix Adaptation

Figure 2 for VeRA: Vector-based Random Matrix Adaptation

Figure 3 for VeRA: Vector-based Random Matrix Adaptation

Figure 4 for VeRA: Vector-based Random Matrix Adaptation

Abstract:Low-rank adapation (LoRA) is a popular method that reduces the number of trainable parameters when finetuning large language models, but still faces acute storage challenges when scaling to even larger models or deploying numerous per-user or per-task adapted models. In this work, we present Vector-based Random Matrix Adaptation (VeRA), which reduces the number of trainable parameters by 10x compared to LoRA, yet maintains the same performance. It achieves this by using a single pair of low-rank matrices shared across all layers and learning small scaling vectors instead. We demonstrate its effectiveness on the GLUE and E2E benchmarks, and show its application in instruction-following with just 1.4M parameters using the Llama2 7B model.

Via

Access Paper or Ask Questions