Aleš Tamchyna

Neural Machine Translation Quality and Post-Editing Performance

Sep 10, 2021

Vilém Zouhar, Aleš Tamchyna, Martin Popel, Ondřej Bojar

Abstract: We test the natural expectation that using MT in professional translation saves human processing time. The last such study was carried out by Sanchez-Torron and Koehn (2016) with phrase-based MT, artificially reducing the translation quality. In contrast, we focus on neural MT (NMT) of high quality, which has become the state-of-the-art approach since then and also got adopted by most translation companies. Through an experimental study involving over 30 professional translators for English -> Czech translation, we examine the relationship between NMT performance and post-editing time and quality. Across all models, we found that better MT systems indeed lead to fewer changes in the sentences in this industry setting. The relation between system quality and post-editing time is however not straightforward and, contrary to the results on phrase-based MT, BLEU is definitely not a stable predictor of the time or final output quality.

* 9 pages, 1 page appendix. To be presented at EMNLP 2021

Modeling Target-Side Inflection in Neural Machine Translation

Sep 05, 2017

Aleš Tamchyna, Marion Weller-Di Marco, Alexander Fraser

Abstract: NMT systems have problems with large vocabulary sizes. Byte-pair encoding (BPE) is a popular approach to solving this problem, but while BPE allows the system to generate any target-side word, it does not enable effective generalization over the rich vocabulary in morphologically rich languages with strong inflectional phenomena. We introduce a simple approach to overcome this problem by training a system to produce the lemma of a word and its morphologically rich POS tag, which is then followed by a deterministic generation step. We apply this strategy for English-Czech and English-German translation scenarios, obtaining improvements in both settings. We furthermore show that the improvement is not due to only adding explicit morphological information.

* Accepted as a research paper at WMT17. (Updated version with corrected references.)
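
As a rough illustration of the two-step idea described in the abstract above (predict a lemma and a morphological tag, then generate the surface form deterministically), the generation step might look like the sketch below. This is not the authors' implementation: the lookup table `FORMS`, the tag names, the function `generate`, and the fallback to the bare lemma are all hypothetical choices made for this example.

```python
from typing import Dict, List, Tuple

# Hypothetical form table mapping (lemma, tag) to an inflected word.
# The tag strings are simplified placeholders, not a real tagset.
FORMS: Dict[Tuple[str, str], str] = {
    ("Haus", "NN.Nom.Sg"): "Haus",
    ("Haus", "NN.Dat.Pl"): "Häusern",
    ("klein", "ADJ.Dat.Pl"): "kleinen",
}


def generate(pairs: List[Tuple[str, str]]) -> List[str]:
    """Deterministically map (lemma, tag) pairs to surface forms.

    Unknown pairs fall back to the lemma itself; this fallback is an
    assumption of the sketch, not something stated in the abstract.
    """
    return [FORMS.get((lemma, tag), lemma) for lemma, tag in pairs]


# Pairs as a translation model might emit them in place of full word forms.
predicted = [("klein", "ADJ.Dat.Pl"), ("Haus", "NN.Dat.Pl")]
print(" ".join(generate(predicted)))
```

In a setup like this, the translation model only needs to learn lemmas and tags, while the table (or a morphological generator in its place) takes care of producing the inflected forms.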

Target-Side Context for Discriminative Models in Statistical Machine Translation

Jul 05, 2016

Aleš Tamchyna, Alexander Fraser, Ondřej Bojar, Marcin Junczys-Dowmunt

Abstract: Discriminative translation models utilizing source context have been shown to help statistical machine translation performance. We propose a novel extension of this work using target context information. Surprisingly, we show that this model can be efficiently integrated directly in the decoding process. Our approach scales to large training data sizes and results in consistent improvements in translation quality on four language pairs. We also provide an analysis comparing the strengths of the baseline source-context model with our extended source-context and target-context model and we show that our extension allows us to better capture morphological coherence. Our work is freely available as part of Moses.

* Accepted as a long paper for ACL 2016
