Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Pratik Ramesh

Model merging with SVD to tie the Knots

Oct 25, 2024

George Stoica, Pratik Ramesh, Boglarka Ecsedi, Leshem Choshen, Judy Hoffman

Figure 1 for Model merging with SVD to tie the Knots

Figure 2 for Model merging with SVD to tie the Knots

Figure 3 for Model merging with SVD to tie the Knots

Figure 4 for Model merging with SVD to tie the Knots

Abstract:Recent model merging methods demonstrate that the parameters of fully-finetuned models specializing in distinct tasks can be combined into one model capable of solving all tasks without retraining. Yet, this success does not transfer well when merging LoRA finetuned models. We study this phenomenon and observe that the weights of LoRA finetuned models showcase a lower degree of alignment compared to their fully-finetuned counterparts. We hypothesize that improving this alignment is key to obtaining better LoRA model merges, and propose KnOTS to address this problem. KnOTS uses the SVD to jointly transform the weights of different LoRA models into an aligned space, where existing merging methods can be applied. In addition, we introduce a new benchmark that explicitly evaluates whether merged models are general models. Notably, KnOTS consistently improves LoRA merging by up to 4.3% across several vision and language benchmarks, including our new setting. We release our code at: https://github.com/gstoica27/KnOTS.

Via

Access Paper or Ask Questions

FACTS: First Amplify Correlations and Then Slice to Discover Bias

Sep 29, 2023

Sriram Yenamandra, Pratik Ramesh, Viraj Prabhu, Judy Hoffman

Figure 1 for FACTS: First Amplify Correlations and Then Slice to Discover Bias

Figure 2 for FACTS: First Amplify Correlations and Then Slice to Discover Bias

Figure 3 for FACTS: First Amplify Correlations and Then Slice to Discover Bias

Figure 4 for FACTS: First Amplify Correlations and Then Slice to Discover Bias

Abstract:Computer vision datasets frequently contain spurious correlations between task-relevant labels and (easy to learn) latent task-irrelevant attributes (e.g. context). Models trained on such datasets learn "shortcuts" and underperform on bias-conflicting slices of data where the correlation does not hold. In this work, we study the problem of identifying such slices to inform downstream bias mitigation strategies. We propose First Amplify Correlations and Then Slice to Discover Bias (FACTS), wherein we first amplify correlations to fit a simple bias-aligned hypothesis via strongly regularized empirical risk minimization. Next, we perform correlation-aware slicing via mixture modeling in bias-aligned feature space to discover underperforming data slices that capture distinct correlations. Despite its simplicity, our method considerably improves over prior work (by as much as 35% precision@10) in correlation bias identification across a range of diverse evaluation settings. Our code is available at: https://github.com/yvsriram/FACTS.

* Accepted to ICCV 2023

Via

Access Paper or Ask Questions