Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:SOLOIST: Few-shot Task-Oriented Dialog with A Single Pre-trained Auto-regressive Model

May 14, 2020

Baolin Peng, Chunyuan Li, Jinchao Li, Shahin Shayandeh, Lars Liden, Jianfeng Gao

Figure 1 for SOLOIST: Few-shot Task-Oriented Dialog with A Single Pre-trained Auto-regressive Model

Figure 2 for SOLOIST: Few-shot Task-Oriented Dialog with A Single Pre-trained Auto-regressive Model

Figure 3 for SOLOIST: Few-shot Task-Oriented Dialog with A Single Pre-trained Auto-regressive Model

Figure 4 for SOLOIST: Few-shot Task-Oriented Dialog with A Single Pre-trained Auto-regressive Model

Share this with someone who'll enjoy it:

Abstract:This paper presents a new method SOLOIST, which uses transfer learning to efficiently build task-oriented dialog systems at scale. We parameterize a dialog system using a Transformer-based auto-regressive language model, which subsumes different dialog modules (e.g., state tracker, dialog policy, response generator) into a single neural model. We pre-train, on large heterogeneous dialog corpora, a large-scale Transformer model which can generate dialog responses grounded in user goals and real-world knowledge for task completion. The pre-trained model can be efficiently adapted to accomplish a new dialog task with a handful of task-specific dialogs via machine teaching. Our experiments demonstrate that (i) SOLOIST creates new state-of-the-art results on two well-known benchmarks, CamRest and MultiWOZ, (ii) in the few-shot learning setting, the dialog systems developed by SOLOIST significantly outperform those developed by existing methods, and (iii) the use of machine teaching substantially reduces the labeling cost. We will release our code and pre-trained models for reproducible research.

* 10 pages; Project Website: https://aka.ms/soloist

View paper on

Share this with someone who'll enjoy it:

Title:SOLOIST: Few-shot Task-Oriented Dialog with A Single Pre-trained Auto-regressive Model

Paper and Code