Title: A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management

Apr 06, 2018

Iñigo Casanueva, Paweł Budzianowski, Pei-Hao Su, Nikola Mrkšić, Tsung-Hsien Wen, Stefan Ultes, Lina Rojas-Barahona, Steve Young, Milica Gašić

Abstract: Dialogue assistants are rapidly becoming an indispensable daily aid. To avoid the significant effort needed to hand-craft the required dialogue flow, the Dialogue Management (DM) module can be cast as a continuous Markov Decision Process (MDP) and trained through Reinforcement Learning (RL). Several RL models have been investigated over recent years. However, the lack of a common benchmarking framework makes it difficult to compare different models fairly or to assess how well they generalise to different environments. This paper therefore proposes a set of challenging simulated environments for dialogue model development and evaluation. To provide baselines, we investigate a number of representative parametric algorithms, namely the deep reinforcement learning algorithms DQN, A2C and Natural Actor-Critic, and compare them to a non-parametric model, GP-SARSA. Both the environments and the policy models are implemented using the publicly available PyDial toolkit and released online, in order to establish a testbed framework for further experiments and to facilitate experimental reproducibility.

* Accepted at the Deep Reinforcement Learning Symposium, 31st Conference on Neural Information Processing Systems (NIPS 2017). Paper updated with minor changes.
