KyungHyun Cho

Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling

Dec 11, 2014

Junyoung Chung, Caglar Gulcehre, KyungHyun Cho, Yoshua Bengio

Abstract: In this paper we compare different types of recurrent units in recurrent neural networks (RNNs). Especially, we focus on more sophisticated units that implement a gating mechanism, such as a long short-term memory (LSTM) unit and a recently proposed gated recurrent unit (GRU). We evaluate these recurrent units on the tasks of polyphonic music modeling and speech signal modeling. Our experiments revealed that these advanced recurrent units are indeed better than more traditional recurrent units such as tanh units. Also, we found GRU to be comparable to LSTM.

* Presented at the NIPS 2014 Deep Learning and Representation Learning Workshop

Not All Neural Embeddings are Born Equal

Nov 13, 2014

Felix Hill, KyungHyun Cho, Sebastien Jean, Coline Devin, Yoshua Bengio

Abstract: Neural language models learn word representations that capture rich linguistic and conceptual information. Here we investigate the embeddings learned by neural machine translation models. We show that translation-based embeddings outperform those learned by cutting-edge monolingual models at single-language tasks requiring knowledge of conceptual similarity and/or syntactic role. The findings suggest that, while monolingual models learn information about how concepts are related, neural-translation models better capture their true ontological status.

* 4 pages plus 1 page of references
