Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Data Diversification: An Elegant Strategy For Neural Machine Translation

Nov 05, 2019

Xuan-Phi Nguyen, Shafiq Joty, Wu Kui, Ai Ti Aw

Figure 1 for Data Diversification: An Elegant Strategy For Neural Machine Translation

Figure 2 for Data Diversification: An Elegant Strategy For Neural Machine Translation

Figure 3 for Data Diversification: An Elegant Strategy For Neural Machine Translation

Figure 4 for Data Diversification: An Elegant Strategy For Neural Machine Translation

Share this with someone who'll enjoy it:

Abstract:A common approach to improve neural machine translation is to invent new architectures. However, the research process of designing and refining such new models is often exhausting. Another approach is to resort to huge extra monolingual data to conduct semi-supervised training, like back-translation. But extra monolingual data is not always available, especially for low resource languages. In this paper, we propose to diversify the available training data by using multiple forward and backward peer models to augment the original training dataset. Our method does not require extra data like back-translation, nor additional computations and parameters like using pretrained models. Our data diversification method achieves state-of-the-art BLEU score of 30.7 in the WMT'14 English-German task. It also consistently and substantially improves translation quality in 8 other translation tasks: 4 IWSLT tasks (English-German and English-French) and 4 low-resource translation tasks (English-Nepali and English-Sinhala).

View paper on

Share this with someone who'll enjoy it:

Title:Data Diversification: An Elegant Strategy For Neural Machine Translation

Paper and Code