Title: Adapting Monolingual Models: Data can be Scarce when Language Similarity is High

May 22, 2021

Wietse de Vries, Martijn Bartelds, Malvina Nissim, Martijn Wieling

Abstract: For many (minority) languages, the resources needed to train large models are not available. We investigate the performance of zero-shot transfer learning with as little data as possible, and the influence of language similarity in this process. We retrain the lexical layers of four BERT-based models using data from two low-resource target language varieties, while the Transformer layers are independently fine-tuned on a POS-tagging task in the model's source language. By combining the new lexical layers and fine-tuned Transformer layers, we achieve high task performance for both target languages. With high language similarity, 10MB of data appears sufficient to achieve substantial monolingual transfer performance. Monolingual BERT-based models generally achieve higher downstream task performance after retraining the lexical layer than multilingual BERT, even when the target language is included in the multilingual model.

* Findings of ACL 2021 Camera Ready
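
The combination step described in the abstract (lexical layers retrained on the target language, Transformer layers fine-tuned on the source language) can be pictured with a toy sketch. This is not the authors' code: parameters are represented as plain dictionaries, and the `embeddings.` prefix, the parameter names, the numeric values, and the `combine_layers` helper are all assumptions made purely for illustration.

```python
# Toy sketch: model parameters as plain dicts (name -> list of floats).
# The "embeddings." prefix marking the lexical layer is an assumption.
LEXICAL_PREFIX = "embeddings."


def combine_layers(target_lexical, source_finetuned, lexical_prefix=LEXICAL_PREFIX):
    """Take lexical-layer parameters from `target_lexical` and all
    remaining parameters from `source_finetuned`."""
    combined = {}
    # Keep every non-lexical parameter from the task-fine-tuned model.
    for name, value in source_finetuned.items():
        if not name.startswith(lexical_prefix):
            combined[name] = value
    # Take only the lexical layer from the target-language model.
    for name, value in target_lexical.items():
        if name.startswith(lexical_prefix):
            combined[name] = value
    return combined


# Hypothetical parameter names and values, for illustration only.
retrained_on_target = {
    "embeddings.word": [0.1, 0.2],  # lexical layer retrained on target-language text
    "encoder.layer0": [1.0, 1.0],   # Transformer layer, not used in the result
}
finetuned_on_source = {
    "embeddings.word": [0.9, 0.8],  # source-language lexical layer, discarded
    "encoder.layer0": [2.0, 2.0],   # fine-tuned on source-language POS tagging
    "classifier": [3.0],            # POS-tagging head
}

combined = combine_layers(retrained_on_target, finetuned_on_source)
```

In this sketch the combined model keeps the POS-tagging head and Transformer weights from the source-language model while reading tokens through the target-language lexical layer, which is the zero-shot setup the abstract outlines.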
