Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Refining Low-Resource Unsupervised Translation by Language Disentanglement of Multilingual Model

May 31, 2022

Xuan-Phi Nguyen, Shafiq Joty, Wu Kui, Ai Ti Aw

Figure 1 for Refining Low-Resource Unsupervised Translation by Language Disentanglement of Multilingual Model

Figure 2 for Refining Low-Resource Unsupervised Translation by Language Disentanglement of Multilingual Model

Figure 3 for Refining Low-Resource Unsupervised Translation by Language Disentanglement of Multilingual Model

Figure 4 for Refining Low-Resource Unsupervised Translation by Language Disentanglement of Multilingual Model

Share this with someone who'll enjoy it:

Abstract:Numerous recent work on unsupervised machine translation (UMT) implies that competent unsupervised translations of low-resource and unrelated languages, such as Nepali or Sinhala, are only possible if the model is trained in a massive multilingual environment, where theses low-resource languages are mixed with high-resource counterparts. Nonetheless, while the high-resource languages greatly help kick-start the target low-resource translation tasks, the language discrepancy between them may hinder their further improvement. In this work, we propose a simple refinement procedure to disentangle languages from a pre-trained multilingual UMT model for it to focus on only the target low-resource task. Our method achieves the state of the art in the fully unsupervised translation tasks of English to Nepali, Sinhala, Gujarati, Latvian, Estonian and Kazakh, with BLEU score gains of 3.5, 3.5, 3.3, 4.1, 4.2, and 3.3, respectively. Our codebase is available at https://github.com/nxphi47/refine_unsup_multilingual_mt

View paper on

Share this with someone who'll enjoy it:

Title:Refining Low-Resource Unsupervised Translation by Language Disentanglement of Multilingual Model

Paper and Code