Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jean Senellart

OpenNMT: Neural Machine Translation Toolkit

May 28, 2018

Guillaume Klein, Yoon Kim, Yuntian Deng, Vincent Nguyen, Jean Senellart, Alexander M. Rush

Figure 1 for OpenNMT: Neural Machine Translation Toolkit

Figure 2 for OpenNMT: Neural Machine Translation Toolkit

Figure 3 for OpenNMT: Neural Machine Translation Toolkit

Figure 4 for OpenNMT: Neural Machine Translation Toolkit

Abstract:OpenNMT is an open-source toolkit for neural machine translation (NMT). The system prioritizes efficiency, modularity, and extensibility with the goal of supporting NMT research into model architectures, feature representations, and source modalities, while maintaining competitive performance and reasonable training requirements. The toolkit consists of modeling and translation support, as well as detailed pedagogical documentation about the underlying techniques. OpenNMT has been used in several production MT systems, modified for numerous research papers, and is implemented across several deep learning frameworks.

* Presentation to AMTA 2018 - Boston. arXiv admin note: substantial text overlap with arXiv:1701.02810

Via

Access Paper or Ask Questions

Boosting Neural Machine Translation

Oct 03, 2017

Dakun Zhang, Jungi Kim, Josep Crego, Jean Senellart

Figure 1 for Boosting Neural Machine Translation

Figure 2 for Boosting Neural Machine Translation

Figure 3 for Boosting Neural Machine Translation

Figure 4 for Boosting Neural Machine Translation

Abstract:Training efficiency is one of the main problems for Neural Machine Translation (NMT). Deep networks need for very large data as well as many training iterations to achieve state-of-the-art performance. This results in very high computation cost, slowing down research and industrialisation. In this paper, we propose to alleviate this problem with several training methods based on data boosting and bootstrap with no modifications to the neural network. It imitates the learning process of humans, which typically spend more time when learning "difficult" concepts than easier ones. We experiment on an English-French translation task showing accuracy improvements of up to 1.63 BLEU while saving 20% of training time.

* published in IJCNLP 2017

Via

Access Paper or Ask Questions

OpenNMT: Open-source Toolkit for Neural Machine Translation

Sep 12, 2017

Guillaume Klein, Yoon Kim, Yuntian Deng, Josep Crego, Jean Senellart, Alexander M. Rush

Figure 1 for OpenNMT: Open-source Toolkit for Neural Machine Translation

Figure 2 for OpenNMT: Open-source Toolkit for Neural Machine Translation

Figure 3 for OpenNMT: Open-source Toolkit for Neural Machine Translation

Figure 4 for OpenNMT: Open-source Toolkit for Neural Machine Translation

Abstract:We introduce an open-source toolkit for neural machine translation (NMT) to support research into model architectures, feature representations, and source modalities, while maintaining competitive performance, modularity and reasonable training requirements.

* Published in EAMT 2017 User Studies and Project/Product Descriptions

Via

Access Paper or Ask Questions

SYSTRAN Purely Neural MT Engines for WMT2017

Sep 12, 2017

Yongchao Deng, Jungi Kim, Guillaume Klein, Catherine Kobus, Natalia Segal, Christophe Servan, Bo Wang, Dakun Zhang, Josep Crego, Jean Senellart

Figure 1 for SYSTRAN Purely Neural MT Engines for WMT2017

Figure 2 for SYSTRAN Purely Neural MT Engines for WMT2017

Figure 3 for SYSTRAN Purely Neural MT Engines for WMT2017

Figure 4 for SYSTRAN Purely Neural MT Engines for WMT2017

Abstract:This paper describes SYSTRAN's systems submitted to the WMT 2017 shared news translation task for English-German, in both translation directions. Our systems are built using OpenNMT, an open-source neural machine translation system, implementing sequence-to-sequence models with LSTM encoder/decoders and attention. We experimented using monolingual data automatically back-translated. Our resulting models are further hyper-specialised with an adaptation technique that finely tunes models according to the evaluation test sentences.

* Published in WMT 2017

Via

Access Paper or Ask Questions

Domain Control for Neural Machine Translation

Sep 12, 2017

Catherine Kobus, Josep Crego, Jean Senellart

Figure 1 for Domain Control for Neural Machine Translation

Figure 2 for Domain Control for Neural Machine Translation

Figure 3 for Domain Control for Neural Machine Translation

Figure 4 for Domain Control for Neural Machine Translation

Abstract:Machine translation systems are very sensitive to the domains they were trained on. Several domain adaptation techniques have been deeply studied. We propose a new technique for neural machine translation (NMT) that we call domain control which is performed at runtime using a unique neural network covering multiple domains. The presented approach shows quality improvements when compared to dedicated domains translating on any of the covered domains and even on out-of-domain data. In addition, model parameters do not need to be re-estimated for each domain, making this effective to real use cases. Evaluation is carried out on English-to-French translation for two different testing scenarios. We first consider the case where an end-user performs translations on a known domain. Secondly, we consider the scenario where the domain is not known and predicted at the sentence level before translating. Results show consistent accuracy improvements for both conditions.

* Published in RANLP 2017

Via

Access Paper or Ask Questions

OpenNMT: Open-Source Toolkit for Neural Machine Translation

Mar 06, 2017

Guillaume Klein, Yoon Kim, Yuntian Deng, Jean Senellart, Alexander M. Rush

Abstract:We describe an open-source toolkit for neural machine translation (NMT). The toolkit prioritizes efficiency, modularity, and extensibility with the goal of supporting NMT research into model architectures, feature representations, and source modalities, while maintaining competitive performance and reasonable training requirements. The toolkit consists of modeling and translation support, as well as detailed pedagogical documentation about the underlying techniques.

* Report for http://opennmt.net

Via

Access Paper or Ask Questions

Domain specialization: a post-training domain adaptation for Neural Machine Translation

Dec 19, 2016

Christophe Servan, Josep Crego, Jean Senellart

Figure 1 for Domain specialization: a post-training domain adaptation for Neural Machine Translation

Figure 2 for Domain specialization: a post-training domain adaptation for Neural Machine Translation

Figure 3 for Domain specialization: a post-training domain adaptation for Neural Machine Translation

Figure 4 for Domain specialization: a post-training domain adaptation for Neural Machine Translation

Abstract:Domain adaptation is a key feature in Machine Translation. It generally encompasses terminology, domain and style adaptation, especially for human post-editing workflows in Computer Assisted Translation (CAT). With Neural Machine Translation (NMT), we introduce a new notion of domain adaptation that we call "specialization" and which is showing promising results both in the learning speed and in adaptation accuracy. In this paper, we propose to explore this approach under several perspectives.

* Submitted to EACL 2017 short paper

Via

Access Paper or Ask Questions

Neural Machine Translation from Simplified Translations

Dec 19, 2016

Josep Crego, Jean Senellart

Figure 1 for Neural Machine Translation from Simplified Translations

Figure 2 for Neural Machine Translation from Simplified Translations

Figure 3 for Neural Machine Translation from Simplified Translations

Figure 4 for Neural Machine Translation from Simplified Translations

Abstract:Text simplification aims at reducing the lexical, grammatical and structural complexity of a text while keeping the same meaning. In the context of machine translation, we introduce the idea of simplified translations in order to boost the learning ability of deep neural translation models. We conduct preliminary experiments showing that translation complexity is actually reduced in a translation of a source bi-text compared to the target reference of the bi-text while using a neural machine translation (NMT) system learned on the exact same bi-text. Based on knowledge distillation idea, we then train an NMT system using the simplified bi-text, and show that it outperforms the initial system that was built over the reference data set. Performance is further boosted when both reference and automatic translations are used to learn the network. We perform an elementary analysis of the translated corpus and report accuracy results of the proposed approach on English-to-French and English-to-German translation tasks.

* Submitted to EACL 2017 Short paper

Via

Access Paper or Ask Questions

SYSTRAN's Pure Neural Machine Translation Systems

Oct 18, 2016

Josep Crego, Jungi Kim, Guillaume Klein, Anabel Rebollo, Kathy Yang, Jean Senellart, Egor Akhanov, Patrice Brunelle, Aurelien Coquard, Yongchao Deng(+20 more)

Figure 1 for SYSTRAN's Pure Neural Machine Translation Systems

Figure 2 for SYSTRAN's Pure Neural Machine Translation Systems

Figure 3 for SYSTRAN's Pure Neural Machine Translation Systems

Figure 4 for SYSTRAN's Pure Neural Machine Translation Systems

Abstract:Since the first online demonstration of Neural Machine Translation (NMT) by LISA, NMT development has recently moved from laboratory to production systems as demonstrated by several entities announcing roll-out of NMT engines to replace their existing technologies. NMT systems have a large number of training configurations and the training process of such systems is usually very long, often a few weeks, so role of experimentation is critical and important to share. In this work, we present our approach to production-ready systems simultaneously with release of online demonstrators covering a large variety of languages (12 languages, for 32 language pairs). We explore different practical choices: an efficient and evolutive open-source framework; data preparation; network architecture; additional implemented features; tuning for production; etc. We discuss about evaluation methodology, present our first findings and we finally outline further work. Our ultimate goal is to share our expertise to build competitive production systems for "generic" translation. We aim at contributing to set up a collaborative framework to speed-up adoption of the technology, foster further research efforts and enable the delivery and adoption to/by industry of use-case specific engines integrated in real production workflows. Mastering of the technology would allow us to build translation engines suited for particular needs, outperforming current simplest/uniform systems.

Via

Access Paper or Ask Questions