Title: Domain Robustness in Neural Machine Translation

Nov 08, 2019

Mathias Müller, Annette Rios, Rico Sennrich

Abstract: Translating text that diverges from the training domain is a key challenge for neural machine translation (NMT). Domain robustness, the generalization of models to unseen test domains, is low compared to statistical machine translation. In this paper, we investigate the performance of NMT on out-of-domain test sets, and ways to improve it. We observe that hallucination (translations that are fluent but unrelated to the source) is common in out-of-domain settings, and we empirically compare methods that improve adequacy (reconstruction), out-of-domain translation (subword regularization), or robustness against adversarial examples (defensive distillation), as well as noisy channel models. In experiments on German to English OPUS data, and German to Romansh, a low-resource scenario, we find that several methods improve domain robustness, with reconstruction standing out as a method that not only improves automatic scores, but also shows improvements in a manual assessment of adequacy, albeit at some loss in fluency. However, out-of-domain performance is still relatively low and domain robustness remains an open problem.
