Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Emergent Properties of Finetuned Language Representation Models

Oct 23, 2019

Alexandre Matton, Luke de Oliveira

Figure 1 for Emergent Properties of Finetuned Language Representation Models

Figure 2 for Emergent Properties of Finetuned Language Representation Models

Figure 3 for Emergent Properties of Finetuned Language Representation Models

Figure 4 for Emergent Properties of Finetuned Language Representation Models

Share this with someone who'll enjoy it:

Abstract:Large, self-supervised transformer-based language representation models have recently received significant amounts of attention, and have produced state-of-the-art results across a variety of tasks simply by scaling up pre-training on larger and larger corpora. Such models usually produce high dimensional vectors, on top of which additional task-specific layers and architectural modifications are added to adapt them to specific downstream tasks. Though there exists ample evidence that such models work well, we aim to understand what happens when they work well. We analyze the redundancy and location of information contained in output vectors for one such language representation model -- BERT. We show empirical evidence that the [CLS] embedding in BERT contains highly redundant information, and can be compressed with minimal loss of accuracy, especially for finetuned models, dovetailing into open threads in the field about the role of over-parameterization in learning. We also shed light on the existence of specific output dimensions which alone give very competitive results when compared to using all dimensions of output vectors.

* 7 pages

View paper on

Share this with someone who'll enjoy it:

Title:Emergent Properties of Finetuned Language Representation Models

Paper and Code