Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!
Abstract:We examine how various types of noise in the parallel training data impact the quality of neural machine translation systems. We create five types of artificial noise and analyze how they degrade performance in neural and statistical machine translation. We find that neural models are generally more harmed by noise than statistical models. For one especially egregious type of noise they learn to just copy the input sentence.
* Please cite as: @InProceedings{khayrallah-koehn:2018:WNMT, author =
{Khayrallah, Huda and Koehn, Philipp}, title = {On the Impact of Various
Types of Noise on Neural Machine Translation}, booktitle = {Proceedings of
the Second Workshop on Neural Machine Translation and Generation}, year =
{2018}, address = {Melbourne}, publisher = {Association for Computational
Linguistics} }