Title: Universal Neural Machine Translation for Extremely Low Resource Languages

Apr 17, 2018

Jiatao Gu, Hany Hassan, Jacob Devlin, Victor O. K. Li

Abstract: In this paper, we propose a new universal machine translation approach focusing on languages with a limited amount of parallel data. Our approach uses transfer learning to share lexical and sentence-level representations across multiple source languages translating into one target language. The lexical part is shared through a Universal Lexical Representation to support multilingual word-level sharing. The sentence-level sharing is represented by a model of experts from all source languages that share the source encoders with all other languages. This enables the low-resource language to utilize the lexical and sentence representations of the higher-resource languages. Our approach achieves 23 BLEU on Romanian-English WMT2016 using a tiny parallel corpus of 6k sentences, compared to the 18 BLEU of a strong baseline system that uses multilingual training and back-translation. Furthermore, we show that the proposed approach can achieve almost 20 BLEU on the same dataset by fine-tuning a pre-trained multilingual system in a zero-shot setting.
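The abstract does not spell out how the Universal Lexical Representation is computed, so the following is only a minimal sketch of one plausible reading: a source word is embedded as a softmax-weighted mixture over a table of embeddings shared by all source languages. The function name `universal_embedding`, the `temperature` parameter, and the toy vectors are all illustrative assumptions, not the authors' implementation.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of floats.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def universal_embedding(query, keys, values, temperature=1.0):
    """Hypothetical soft lookup into a shared embedding table.

    query  : vector for a source-language word
    keys   : one key vector per shared ("universal") token
    values : one value vector per shared token, used by the translation model
    """
    # Similarity of the query to every shared token, sharpened by temperature.
    weights = softmax([dot(query, k) / temperature for k in keys])
    dim = len(values[0])
    # Weighted average of the shared value vectors.
    mixed = [sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)]
    return mixed, weights

# Toy example: the query matches the first shared token.
keys = [[1.0, 0.0], [0.0, 1.0]]
values = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]
mixed, weights = universal_embedding([1.0, 0.0], keys, values, temperature=0.05)
```

With a low temperature the mixture collapses onto the closest shared token, so a word from a low-resource language can reuse an embedding learned mostly from higher-resource data; a higher temperature would spread the weight across several shared tokens.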

* NAACL-HLT 2018
