Abstract: One of the core components of modern spoken dialogue systems is the belief tracker, which estimates the user's goal at every step of the dialogue. However, most current approaches have difficulty scaling to larger, more complex dialogue domains. This is due to their dependency on either: a) Spoken Language Understanding models that require large amounts of annotated training data; or b) hand-crafted lexicons for capturing some of the linguistic variation in users' language. We propose a novel Neural Belief Tracking (NBT) framework which overcomes these problems by building on recent advances in representation learning. NBT models reason over pre-trained word vectors, learning to compose them into distributed representations of user utterances and dialogue context. Our evaluation on two datasets shows that this approach surpasses past limitations, matching the performance of state-of-the-art models which rely on hand-crafted semantic lexicons and outperforming them when such lexicons are not provided.
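As a rough illustration of the approach, the minimal Python sketch below composes pre-trained word vectors into utterance, context, and candidate representations and scores a single slot-value pair. The summation-based composition, the feature layout, and the placeholder weights `W` are assumptions for illustration only; the actual NBT models learn this composition rather than using fixed sums.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy pre-trained word vectors; a real system would load e.g. 300-d
# GloVe or Paragram vectors instead.
DIM = 8
word_vectors = {w: rng.normal(size=DIM)
                for w in "i want cheap food price range request".split()}

def compose(tokens):
    """Sum pre-trained word vectors into one distributed representation
    (a crude stand-in for the NBT's learned composition layers)."""
    vecs = [word_vectors[t] for t in tokens if t in word_vectors]
    return np.sum(vecs, axis=0) if vecs else np.zeros(DIM)

def score_candidate(utterance, system_act, slot, value, W):
    """Score one candidate slot-value pair against the user utterance
    and the preceding system act (the dialogue context). W stands in
    for the model's learned decision layers."""
    r = compose(utterance.lower().split())   # utterance representation
    c = compose([slot, value])               # candidate representation
    t = compose(system_act.lower().split())  # context representation
    features = np.concatenate([r * c, t])    # similarity and context cues
    return 1.0 / (1.0 + np.exp(-features @ W))

W = rng.normal(size=2 * DIM)  # hypothetical, untrained weights
print(score_candidate("i want cheap food", "request price range",
                      "price", "cheap", W))
```

In the full model, a score like this would be produced for every candidate slot-value pair and combined with the previous belief state to track the user's goal turn by turn.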
Abstract: In this work, we present a novel counter-fitting method which injects antonymy and synonymy constraints into vector space representations in order to improve the vectors' capability for judging semantic similarity. Applying this method to publicly available pre-trained word vectors leads to new state-of-the-art performance on the SimLex-999 dataset. We also show how the method can be used to tailor the word vector space for the downstream task of dialogue state tracking, resulting in robust improvements across different dialogue domains.
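The following toy Python sketch conveys the core idea under simplifying assumptions: synonym pairs are pulled together, antonym pairs are pushed apart, and every vector is regularised towards its original position so the space is not distorted wholesale. The Euclidean-distance updates, margin, and learning rate here are illustrative; the published objective instead combines cosine-distance hinge losses with a neighbourhood-based vector space preservation term.

```python
import numpy as np

def counter_fit(vectors, synonyms, antonyms,
                iters=20, lr=0.1, ant_margin=1.0):
    """Simplified counter-fitting: attract synonyms, repel antonyms,
    and stay close to the original vector space. Hyperparameter
    values are placeholders, not the paper's settings."""
    original = {w: v.copy() for w, v in vectors.items()}
    for _ in range(iters):
        for u, w in synonyms:                  # synonym attraction
            diff = vectors[u] - vectors[w]
            vectors[u] -= lr * diff
            vectors[w] += lr * diff
        for u, w in antonyms:                  # antonym repulsion
            diff = vectors[u] - vectors[w]
            if np.linalg.norm(diff) < ant_margin:
                vectors[u] += lr * diff
                vectors[w] -= lr * diff
        for w in vectors:                      # vector space preservation
            vectors[w] -= lr * (vectors[w] - original[w])
    return vectors

rng = np.random.default_rng(1)
vecs = {w: rng.normal(size=4) for w in ("cheap", "inexpensive", "pricey")}
counter_fit(vecs, synonyms=[("cheap", "inexpensive")],
            antonyms=[("cheap", "pricey")])
print(np.linalg.norm(vecs["cheap"] - vecs["inexpensive"]),  # pulled together
      np.linalg.norm(vecs["cheap"] - vecs["pricey"]))       # kept apart
```

The preservation term is what keeps the useful distributional information of the pre-trained vectors intact while the injected constraints correct only the offending similarities.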
Abstract: Dialog state tracking is a key component of many modern dialog systems, most of which are designed with a single, well-defined domain in mind. This paper shows that dialog data drawn from different dialog domains can be used to train a general belief tracking model which can operate across all of these domains, exhibiting superior performance to each of the domain-specific models. We propose a training procedure which uses out-of-domain data to initialise belief tracking models for entirely new domains. This procedure leads to improvements in belief tracking performance regardless of the amount of in-domain data available for training the model.
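A minimal Python sketch of this training procedure follows, under stated assumptions: a linear logistic model and a plain SGD loop stand in for the actual belief tracking model and its optimiser, and the function names and hyperparameters are hypothetical. The structure is the point: train a general model on pooled out-of-domain data first, then fine-tune on however much in-domain data exists.

```python
import numpy as np

def sgd_epochs(w, data, lr, epochs):
    """Plain logistic-regression SGD; a stand-in for training the
    real belief tracking model."""
    for _ in range(epochs):
        for x, y in data:
            p = 1.0 / (1.0 + np.exp(-(w @ x)))
            w = w - lr * (p - y) * x
    return w

def train_for_new_domain(out_of_domain, in_domain, dim,
                         lr=0.1, pre_epochs=5, tune_epochs=5):
    """Initialise a tracker for a new domain from out-of-domain data,
    then adapt it on whatever in-domain data is available (possibly
    none). Hyperparameter values are illustrative only."""
    pooled = [ex for domain_data in out_of_domain for ex in domain_data]
    w = sgd_epochs(np.zeros(dim), pooled, lr, pre_epochs)  # general model
    if in_domain:                                          # in-domain tuning
        w = sgd_epochs(w, in_domain, lr, tune_epochs)
    return w

# Toy usage: two source domains and a handful of target-domain examples.
rng = np.random.default_rng(2)

def make(n, d):
    return [(rng.normal(size=d), float(rng.integers(0, 2)))
            for _ in range(n)]

w = train_for_new_domain([make(50, 4), make(50, 4)], make(5, 4), dim=4)
```

Even with zero in-domain examples, the pooled pre-training yields a usable initialisation, which reflects the abstract's claim that the procedure helps regardless of how much in-domain data is available.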