Fine-tuning the contextualized representations learned by pre-trained language models has become standard practice in NLP. However, pre-trained representations are prone to degradation (also known as representation collapse) during fine-tuning, which leads to instability, suboptimal performance, and weak generalization. In this paper, we propose a novel fine-tuning method that avoids representation collapse by discouraging undesirable changes to the representations during fine-tuning. We show that our approach matches or exceeds the performance of existing regularization-based fine-tuning methods across 13 language understanding tasks (the GLUE benchmark and six additional datasets). We also demonstrate its effectiveness in low-data settings and its robustness to label perturbation. Furthermore, we extend previous studies of representation collapse and propose several metrics to quantify it. Using these metrics and previously proposed experiments, we show that our approach is significantly better at retaining the expressive power of the representations.
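
To make the general idea concrete, the sketch below shows one common form of regularization-based fine-tuning: the task loss is combined with a penalty on drift of the current representations away from those of a frozen pre-trained copy of the encoder. This is a minimal illustration only, assuming a PyTorch-style training loop; the function names, the squared-L2 penalty, and the `reg_weight` coefficient are assumptions for exposition and not this paper's exact formulation.

```python
import copy
import torch
import torch.nn.functional as F

def regularized_step(encoder, classifier, frozen_encoder, inputs, labels,
                     optimizer, reg_weight=0.1):
    """One fine-tuning step with a representation-anchoring penalty.

    `encoder(inputs)` is assumed to return a tensor of sentence
    representations; `frozen_encoder` is a frozen copy of the pre-trained
    model. The squared-L2 penalty and `reg_weight` are illustrative
    choices, not this paper's exact method.
    """
    optimizer.zero_grad()
    reps = encoder(inputs)                  # fine-tuned representations
    with torch.no_grad():
        anchor = frozen_encoder(inputs)     # pre-trained reference representations
    task_loss = F.cross_entropy(classifier(reps), labels)
    drift = F.mse_loss(reps, anchor)        # discourage changes to the representations
    loss = task_loss + reg_weight * drift
    loss.backward()
    optimizer.step()
    return loss.item()

# The frozen reference can be a copy of the encoder taken before fine-tuning:
# frozen_encoder = copy.deepcopy(encoder).eval()
# for p in frozen_encoder.parameters():
#     p.requires_grad_(False)
```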