Chaotao Chen

Dialogue Language Model with Large-Scale Persona Data Engineering

Dec 12, 2024

Mengze Hong, Chen Zhang, Chaotao Chen, Rongzhong Lian, Di Jiang

Abstract: Maintaining persona consistency is paramount in the application of open-domain dialogue systems, as exemplified by models like ChatGPT. Despite significant advancements, the limited scale and diversity of current persona dialogue datasets remain challenges to achieving robust persona-consistent dialogue models. In this study, drawing inspiration from the success of large-scale pre-training, we introduce PPDS, an open-domain persona dialogue system that employs extensive generative pre-training on a persona dialogue dataset to enhance persona consistency. Specifically, we present a persona extraction model designed to autonomously and precisely generate vast persona dialogue datasets. Additionally, we unveil a pioneering persona augmentation technique to address the invalid persona bias inherent in the constructed dataset. Both quantitative and human evaluations consistently highlight the superior response quality and persona consistency of our proposed model, underscoring its effectiveness.

Generating Multiple Diverse Responses with Multi-Mapping and Posterior Mapping Selection

Jun 05, 2019

Chaotao Chen, Jinhua Peng, Fan Wang, Jun Xu, Hua Wu

Abstract: In human conversation an input post is open to multiple potential responses, which is typically regarded as a one-to-many problem. Promising approaches mainly incorporate multiple latent mechanisms to build the one-to-many relationship. However, without accurate selection of the latent mechanism corresponding to the target response during training, these methods suffer from a rough optimization of latent mechanisms. In this paper, we propose a multi-mapping mechanism to better capture the one-to-many relationship, where multiple mapping modules are employed as latent mechanisms to model the semantic mappings from an input post to its diverse responses. For accurate optimization of latent mechanisms, a posterior mapping selection module is designed to select the corresponding mapping module according to the target response for further optimization. We also introduce an auxiliary matching loss to facilitate the optimization of posterior mapping selection. Empirical results demonstrate the superiority of our model in generating multiple diverse and informative responses over the state-of-the-art methods.

* Accepted in IJCAI 2019
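
The selection idea described in the abstract above can be illustrated with a toy sketch. This is not the authors' implementation: the linear mapping modules, the dot-product matching score, the uniform sampling at inference time, and all function names (`candidate_mappings`, `posterior_select`, `prior_select`) are assumptions made for illustration only, and the auxiliary matching loss is not covered.

```python
import math
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def candidate_mappings(post_vec, mapping_modules):
    """Apply every mapping module to the post representation.

    Assumption: each mapping module is a plain linear map (a matrix).
    """
    return [matvec(module, post_vec) for module in mapping_modules]


def posterior_select(post_vec, response_vec, mapping_modules):
    """Training-time selection that looks at the target response.

    Assumption: the match between a mapped post and the response is
    scored with a dot product, and the best-scoring module is chosen.
    Returns the chosen module index and the selection probabilities.
    """
    candidates = candidate_mappings(post_vec, mapping_modules)
    probs = softmax([dot(c, response_vec) for c in candidates])
    best = max(range(len(probs)), key=probs.__getitem__)
    return best, probs


def prior_select(mapping_modules, rng):
    """Inference-time selection when no target response is available.

    Assumption: a module is sampled uniformly at random, so repeated
    calls can yield different responses for the same post.
    """
    return rng.randrange(len(mapping_modules))


if __name__ == "__main__":
    # Two hypothetical 2x2 mapping modules: identity and coordinate swap.
    modules = [
        [[1.0, 0.0], [0.0, 1.0]],
        [[0.0, 1.0], [1.0, 0.0]],
    ]
    post = [1.0, 0.0]
    response = [0.0, 1.0]
    best, probs = posterior_select(post, response, modules)
    print(best)  # 1: the swap module maps the post onto the response
    print(prior_select(modules, random.Random(0)))
```

In this sketch only the selected module would receive the gradient for a given training pair, which is the point of selecting by the target response rather than optimizing all modules at once.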
