Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation

May 25, 2022

Yanrui Du, Jing Yan, Yan Chen, Jing Liu, Sendong Zhao, Hua Wu, Haifeng Wang, Bing Qin

Figure 1 for Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation

Figure 2 for Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation

Figure 3 for Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation

Figure 4 for Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation

Share this with someone who'll enjoy it:

Abstract:Many recent works indicate that the deep neural networks tend to take dataset biases as shortcuts to make decision, rather than understand the tasks, which results in failures on the real-world applications. In this work, we focus on the spurious correlation between feature and label, which derive from the biased data distribution in the training data, and analyze it concretely. In particular, we define the word highly co-occurring with a specific label as biased word, and the example containing biased word as biased example. Our analysis reveals that the biased examples with spurious correlations are easier for models to learn, and when predicting, the biased words make significantly higher contributions to models' predictions than other words, and the models tend to assign the labels over-relying on the spurious correlation between words and labels. To mitigate the model's over-reliance on the shortcut, we propose a training strategy Less-Learn-Shortcut (LLS): we quantify the biased degree of the biased examples, and down-weight them with the biased degree. Experimental results on QM and NLI tasks show that the models improve the performances both on in-domain and adversarial data (1.57% on DuQM and 2.12% on HANS) with our LLS.

View paper on

Share this with someone who'll enjoy it:

Title:Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation

Paper and Code