Title: Data Selection for Fine-tuning Large Language Models Using Transferred Shapley Values

Jun 16, 2023

Stephanie Schoch, Ritwick Mishra, Yangfeng Ji

Abstract: Although Shapley values have been shown to be highly effective for identifying harmful training instances, dataset size and model complexity constraints limit the ability to apply Shapley-based data valuation to fine-tuning large pre-trained language models. To address this, we propose TS-DShapley, an algorithm that reduces the computational cost of Shapley-based data valuation through: 1) an efficient sampling-based method that aggregates Shapley values computed from subsets for valuation of the entire training set, and 2) a value transfer method that leverages value information extracted from a simple classifier trained using representations from the target language model. Our experiments applying TS-DShapley to select data for fine-tuning BERT-based language models on benchmark natural language understanding (NLU) datasets show that TS-DShapley outperforms existing data selection methods. Further, TS-DShapley can filter fine-tuning data to increase language model performance compared to training with the full fine-tuning dataset.
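
The two ideas in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the nearest-centroid classifier, the uniform subset sampling, and the simple averaging across subsets are all assumptions chosen for brevity. The feature vectors stand in for representations extracted from the target language model, and the resulting values would then be used to filter that model's fine-tuning data.

```python
import random


def centroid_accuracy(train, val):
    """Accuracy of a nearest-centroid classifier (a stand-in 'simple classifier').

    train and val are lists of (feature_vector, label) pairs.
    """
    if not train:
        return 0.0
    sums, counts = {}, {}
    for x, y in train:
        acc = sums.setdefault(y, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
        counts[y] = counts.get(y, 0) + 1
    centroids = {y: [v / counts[y] for v in s] for y, s in sums.items()}
    correct = 0
    for x, y in val:
        pred = min(
            centroids,
            key=lambda c: sum((a - b) ** 2 for a, b in zip(x, centroids[c])),
        )
        correct += pred == y
    return correct / len(val)


def subset_shapley(indices, data, val, num_perms, rng):
    """Monte Carlo permutation estimate of Shapley values within one subset."""
    values = {i: 0.0 for i in indices}
    for _ in range(num_perms):
        perm = indices[:]
        rng.shuffle(perm)
        prefix, prev = [], 0.0  # the empty training set scores 0.0 here
        for i in perm:
            prefix.append(data[i])
            score = centroid_accuracy(prefix, val)
            values[i] += (score - prev) / num_perms  # marginal contribution
            prev = score
    return values


def aggregated_values(data, val, num_subsets=20, subset_size=6, num_perms=10, seed=0):
    """Average per-subset Shapley estimates into one value per training point.

    Assumption: subsets are drawn uniformly and aggregated by a plain mean;
    points that are never sampled receive 0.0.
    """
    rng = random.Random(seed)
    totals = [0.0] * len(data)
    counts = [0] * len(data)
    for _ in range(num_subsets):
        subset = rng.sample(range(len(data)), subset_size)
        for i, v in subset_shapley(subset, data, val, num_perms, rng).items():
            totals[i] += v
            counts[i] += 1
    return [t / c if c else 0.0 for t, c in zip(totals, counts)]
```

Under these assumptions, data selection amounts to ranking training points by the returned values and dropping the lowest-ranked ones before fine-tuning the target model. Because each subset is small and the classifier is cheap to retrain, the cost scales with the number and size of subsets rather than with repeated fine-tuning of the language model.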

* Accepted to ACL SRW 2023
