Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shufan Shen

Expanding Sparse Tuning for Low Memory Usage

Nov 04, 2024

Shufan Shen, Junshu Sun, Xiangyang Ji, Qingming Huang, Shuhui Wang

Figure 1 for Expanding Sparse Tuning for Low Memory Usage

Figure 2 for Expanding Sparse Tuning for Low Memory Usage

Figure 3 for Expanding Sparse Tuning for Low Memory Usage

Figure 4 for Expanding Sparse Tuning for Low Memory Usage

Abstract:Parameter-efficient fine-tuning (PEFT) is an effective method for adapting pre-trained vision models to downstream tasks by tuning a small subset of parameters. Among PEFT methods, sparse tuning achieves superior performance by only adjusting the weights most relevant to downstream tasks, rather than densely tuning the whole weight matrix. However, this performance improvement has been accompanied by increases in memory usage, which stems from two factors, i.e., the storage of the whole weight matrix as learnable parameters in the optimizer and the additional storage of tunable weight indexes. In this paper, we propose a method named SNELL (Sparse tuning with kerNELized LoRA) for sparse tuning with low memory usage. To achieve low memory usage, SNELL decomposes the tunable matrix for sparsification into two learnable low-rank matrices, saving from the costly storage of the whole original matrix. A competition-based sparsification mechanism is further proposed to avoid the storage of tunable weight indexes. To maintain the effectiveness of sparse tuning with low-rank matrices, we extend the low-rank decomposition by applying nonlinear kernel functions to the whole-matrix merging. Consequently, we gain an increase in the rank of the merged matrix, enhancing the ability of SNELL in adapting the pre-trained models to downstream tasks. Extensive experiments on multiple downstream tasks show that SNELL achieves state-of-the-art performance with low memory usage, endowing PEFT with sparse tuning to large-scale models. Codes are available at https://github.com/ssfgunner/SNELL.

* Accepted by NeurIPS 2024

Via

Access Paper or Ask Questions

Tongji University Undergraduate Team for the VoxCeleb Speaker Recognition Challenge2020

Oct 20, 2020

Shufan Shen, Ran Miao, Yi Wang, Zhihua Wei

Figure 1 for Tongji University Undergraduate Team for the VoxCeleb Speaker Recognition Challenge2020

Figure 2 for Tongji University Undergraduate Team for the VoxCeleb Speaker Recognition Challenge2020

Figure 3 for Tongji University Undergraduate Team for the VoxCeleb Speaker Recognition Challenge2020

Figure 4 for Tongji University Undergraduate Team for the VoxCeleb Speaker Recognition Challenge2020

Abstract:In this report, we discribe the submission of Tongji University undergraduate team to the CLOSE track of the VoxCeleb Speaker Recognition Challenge (VoxSRC) 2020 at Interspeech 2020. We applied the RSBU-CW module to the ResNet34 framework to improve the denoising ability of the network and better complete the speaker verification task in a complex environment.We trained two variants of ResNet,used score fusion and data-augmentation methods to improve the performance of the model. Our fusion of two selected systems for the CLOSE track achieves 0.2973 DCF and 4.9700\% EER on the challenge evaluation set.

Via

Access Paper or Ask Questions