Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Dakun Zhang

Boosting Neural Machine Translation

Oct 03, 2017

Dakun Zhang, Jungi Kim, Josep Crego, Jean Senellart

Figure 1 for Boosting Neural Machine Translation

Figure 2 for Boosting Neural Machine Translation

Figure 3 for Boosting Neural Machine Translation

Figure 4 for Boosting Neural Machine Translation

Abstract:Training efficiency is one of the main problems for Neural Machine Translation (NMT). Deep networks need for very large data as well as many training iterations to achieve state-of-the-art performance. This results in very high computation cost, slowing down research and industrialisation. In this paper, we propose to alleviate this problem with several training methods based on data boosting and bootstrap with no modifications to the neural network. It imitates the learning process of humans, which typically spend more time when learning "difficult" concepts than easier ones. We experiment on an English-French translation task showing accuracy improvements of up to 1.63 BLEU while saving 20% of training time.

* published in IJCNLP 2017

Via

Access Paper or Ask Questions

SYSTRAN Purely Neural MT Engines for WMT2017

Sep 12, 2017

Yongchao Deng, Jungi Kim, Guillaume Klein, Catherine Kobus, Natalia Segal, Christophe Servan, Bo Wang, Dakun Zhang, Josep Crego, Jean Senellart

Figure 1 for SYSTRAN Purely Neural MT Engines for WMT2017

Figure 2 for SYSTRAN Purely Neural MT Engines for WMT2017

Figure 3 for SYSTRAN Purely Neural MT Engines for WMT2017

Figure 4 for SYSTRAN Purely Neural MT Engines for WMT2017

Abstract:This paper describes SYSTRAN's systems submitted to the WMT 2017 shared news translation task for English-German, in both translation directions. Our systems are built using OpenNMT, an open-source neural machine translation system, implementing sequence-to-sequence models with LSTM encoder/decoders and attention. We experimented using monolingual data automatically back-translated. Our resulting models are further hyper-specialised with an adaptation technique that finely tunes models according to the evaluation test sentences.

* Published in WMT 2017

Via

Access Paper or Ask Questions

SYSTRAN's Pure Neural Machine Translation Systems

Oct 18, 2016

Josep Crego, Jungi Kim, Guillaume Klein, Anabel Rebollo, Kathy Yang, Jean Senellart, Egor Akhanov, Patrice Brunelle, Aurelien Coquard, Yongchao Deng(+20 more)

Figure 1 for SYSTRAN's Pure Neural Machine Translation Systems

Figure 2 for SYSTRAN's Pure Neural Machine Translation Systems

Figure 3 for SYSTRAN's Pure Neural Machine Translation Systems

Figure 4 for SYSTRAN's Pure Neural Machine Translation Systems

Abstract:Since the first online demonstration of Neural Machine Translation (NMT) by LISA, NMT development has recently moved from laboratory to production systems as demonstrated by several entities announcing roll-out of NMT engines to replace their existing technologies. NMT systems have a large number of training configurations and the training process of such systems is usually very long, often a few weeks, so role of experimentation is critical and important to share. In this work, we present our approach to production-ready systems simultaneously with release of online demonstrators covering a large variety of languages (12 languages, for 32 language pairs). We explore different practical choices: an efficient and evolutive open-source framework; data preparation; network architecture; additional implemented features; tuning for production; etc. We discuss about evaluation methodology, present our first findings and we finally outline further work. Our ultimate goal is to share our expertise to build competitive production systems for "generic" translation. We aim at contributing to set up a collaborative framework to speed-up adoption of the technology, foster further research efforts and enable the delivery and adoption to/by industry of use-case specific engines integrated in real production workflows. Mastering of the technology would allow us to build translation engines suited for particular needs, outperforming current simplest/uniform systems.

Via

Access Paper or Ask Questions