Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Bart van Merrienboer

Overcoming the Curse of Sentence Length for Neural Machine Translation using Automatic Segmentation

Oct 07, 2014

Jean Pouget-Abadie, Dzmitry Bahdanau, Bart van Merrienboer, Kyunghyun Cho, Yoshua Bengio

Figure 1 for Overcoming the Curse of Sentence Length for Neural Machine Translation using Automatic Segmentation

Figure 2 for Overcoming the Curse of Sentence Length for Neural Machine Translation using Automatic Segmentation

Figure 3 for Overcoming the Curse of Sentence Length for Neural Machine Translation using Automatic Segmentation

Figure 4 for Overcoming the Curse of Sentence Length for Neural Machine Translation using Automatic Segmentation

Abstract:The authors of (Cho et al., 2014a) have shown that the recently introduced neural network translation systems suffer from a significant drop in translation quality when translating long sentences, unlike existing phrase-based translation systems. In this paper, we propose a way to address this issue by automatically segmenting an input sentence into phrases that can be easily translated by the neural network translation model. Once each segment has been independently translated by the neural machine translation model, the translated clauses are concatenated to form a final translation. Empirical results show a significant improvement in translation quality for long sentences.

* Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST-8)

Via

Access Paper or Ask Questions

On the Properties of Neural Machine Translation: Encoder-Decoder Approaches

Oct 07, 2014

Kyunghyun Cho, Bart van Merrienboer, Dzmitry Bahdanau, Yoshua Bengio

Figure 1 for On the Properties of Neural Machine Translation: Encoder-Decoder Approaches

Figure 2 for On the Properties of Neural Machine Translation: Encoder-Decoder Approaches

Figure 3 for On the Properties of Neural Machine Translation: Encoder-Decoder Approaches

Figure 4 for On the Properties of Neural Machine Translation: Encoder-Decoder Approaches

Abstract:Neural machine translation is a relatively new approach to statistical machine translation based purely on neural networks. The neural machine translation models often consist of an encoder and a decoder. The encoder extracts a fixed-length representation from a variable-length input sentence, and the decoder generates a correct translation from this representation. In this paper, we focus on analyzing the properties of the neural machine translation using two models; RNN Encoder--Decoder and a newly proposed gated recursive convolutional neural network. We show that the neural machine translation performs relatively well on short sentences without unknown words, but its performance degrades rapidly as the length of the sentence and the number of unknown words increase. Furthermore, we find that the proposed gated recursive convolutional network learns a grammatical structure of a sentence automatically.

* Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST-8)

Via

Access Paper or Ask Questions

Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation

Sep 03, 2014

Kyunghyun Cho, Bart van Merrienboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, Yoshua Bengio

Figure 1 for Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation

Figure 2 for Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation

Figure 3 for Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation

Figure 4 for Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation

Abstract:In this paper, we propose a novel neural network model called RNN Encoder-Decoder that consists of two recurrent neural networks (RNN). One RNN encodes a sequence of symbols into a fixed-length vector representation, and the other decodes the representation into another sequence of symbols. The encoder and decoder of the proposed model are jointly trained to maximize the conditional probability of a target sequence given a source sequence. The performance of a statistical machine translation system is empirically found to improve by using the conditional probabilities of phrase pairs computed by the RNN Encoder-Decoder as an additional feature in the existing log-linear model. Qualitatively, we show that the proposed model learns a semantically and syntactically meaningful representation of linguistic phrases.

* EMNLP 2014

Via

Access Paper or Ask Questions