The sequence-to-sequence architecture is widely used in response generation and neural machine translation to model the relationship between two sentences. It typically consists of two parts: an encoder that reads the source sentence and a decoder that generates the target sentence word by word according to the encoder's output and the last generated word. However, it faces a cold-start problem when generating the first word, as there is no previous word to refer to. Existing work mainly uses a special start symbol </s> to generate the first word. An obvious drawback of this practice is that there is no learnable relationship between the start symbol and actual words. Furthermore, it may lead to error accumulation during decoding when the first word is incorrectly generated. In this paper, we propose a novel approach that learns to generate the first word in the sequence-to-sequence architecture rather than relying on the start symbol. Experimental results on the response generation task for short-text conversation show that the proposed approach outperforms the state-of-the-art approach in both automatic and manual evaluations.
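
To make the cold-start issue concrete, the following is a minimal sketch of greedy seq2seq decoding that contrasts the conventional start-symbol scheme with a decoder whose first word is predicted from the encoder output. All components here (encode, decode_step, first_word_dist) are hypothetical random stand-ins, not the paper's model; the sketch only illustrates where the two schemes differ in the decoding loop.

```python
# Hypothetical sketch: start-symbol decoding vs. a learned first word.
# encode, decode_step, and first_word_dist are random stand-ins for real
# model components; they only illustrate the control flow.
import numpy as np

VOCAB = ["</s>", "hello", "there", "how", "are", "you"]
START = END = 0  # the </s> symbol doubles as start and end marker
rng = np.random.default_rng(0)

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def encode(source_ids):
    """Stand-in encoder: map the source sentence to a fixed-size context vector."""
    return rng.standard_normal(8)

def decode_step(context, prev_word_id):
    """Stand-in decoder step: a distribution over the vocabulary given the
    context and the previously generated word (randomised here)."""
    return softmax(rng.standard_normal(len(VOCAB)) + context.mean())

def first_word_dist(context):
    """Hypothetical learned first-word predictor conditioned only on the encoder."""
    logits = rng.standard_normal(len(VOCAB))
    logits[END] = -1e9  # the first word should never be the end symbol
    return softmax(logits)

def greedy_decode(source_ids, learned_first_word=False, max_len=10):
    """Greedy decoding; the only difference is how the first word is obtained."""
    context = encode(source_ids)
    output = []
    if learned_first_word:
        # Sketched idea: predict the first word from the encoder output.
        output.append(int(np.argmax(first_word_dist(context))))
    prev = output[-1] if output else START  # conventional: seed with </s>
    while len(output) < max_len:
        prev = int(np.argmax(decode_step(context, prev)))
        if prev == END:
            break
        output.append(prev)
    return [VOCAB[i] for i in output]

if __name__ == "__main__":
    src = [1, 3, 4, 5]  # dummy source word ids
    print("start symbol:      ", greedy_decode(src, learned_first_word=False))
    print("learned first word:", greedy_decode(src, learned_first_word=True))
```

In the start-symbol branch, every response is conditioned on the same fixed token, so an early mistake propagates through the rest of the greedy decode; the learned-first-word branch instead conditions the opening word directly on the encoded source, which is the motivation stated above.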