Abubakar Isa

Using Self-Training to Improve Back-Translation in Low Resource Neural Machine Translation

Jun 04, 2020

Idris Abdulmumin, Bashir Shehu Galadanci, Abubakar Isa

Figures 1-4 for Using Self-Training to Improve Back-Translation in Low Resource Neural Machine Translation

Abstract: Improving neural machine translation (NMT) models using back-translations of monolingual target data (synthetic parallel data) is currently the state-of-the-art approach for training improved translation systems. The quality of the backward system, which is trained on the available parallel data and used for the back-translation, has been shown in many studies to affect the performance of the final NMT model. In low resource conditions, the available parallel data is usually not enough to train a backward model that can produce the high-quality synthetic data needed to train a standard translation model. This work proposes a self-training strategy in which the output of the backward model is used to improve the model itself through the forward translation technique. The technique was shown to improve baseline low resource IWSLT'14 English-German and IWSLT'15 English-Vietnamese backward translation models by 11.06 and 1.5 BLEU, respectively. The synthetic data generated by the improved English-German backward model was used to train a forward model that outperformed another forward model trained using standard back-translation by 2.7 BLEU.
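The data flow described in the abstract can be sketched roughly as follows. This is a minimal illustration only, not the authors' implementation: the `train` and `translate` helpers are hypothetical stand-ins (a toy word-for-word lookup instead of an NMT system), and the function name `self_training_pipeline` is an assumption made for this example. What the sketch is meant to show is the order of the steps: the backward model labels monolingual target sentences, is retrained on its own output, and only then produces the back-translations used to train the forward model.

```python
from collections import Counter, defaultdict


def train(pairs):
    """Hypothetical stand-in for NMT training: maps each input word to the
    output word that most often appears at the same position."""
    counts = defaultdict(Counter)
    for inp, out in pairs:
        for i, o in zip(inp.split(), out.split()):
            counts[i][o] += 1
    return {i: c.most_common(1)[0][0] for i, c in counts.items()}


def translate(model, sentence):
    """Hypothetical stand-in for decoding: word-for-word lookup,
    copying unknown words unchanged."""
    return " ".join(model.get(w, w) for w in sentence.split())


def self_training_pipeline(parallel, mono_target):
    """parallel: list of (source, target) pairs; mono_target: target sentences."""
    # 1. Train the backward (target -> source) model on the authentic data.
    backward_data = [(tgt, src) for src, tgt in parallel]
    backward = train(backward_data)

    # 2. Self-training: the backward model translates the monolingual target
    #    data, and its own output is added to its training set. From the
    #    backward model's point of view this is forward translation.
    self_labeled = [(tgt, translate(backward, tgt)) for tgt in mono_target]
    improved_backward = train(backward_data + self_labeled)

    # 3. Back-translation with the improved backward model.
    synthetic = [(translate(improved_backward, tgt), tgt) for tgt in mono_target]

    # 4. Train the forward (source -> target) model on authentic + synthetic pairs.
    forward = train(parallel + synthetic)
    return improved_backward, forward
```

In a real setting, each `train` call would be a full NMT training run and each `translate` call a decoding pass over the monolingual corpus; the toy helpers only make the pipeline executable end to end.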

* 8 pages, 5 figures, 4 tables
