Yin Bai

MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking

Apr 12, 2024

Tianwen Tang, Tong Zhu, Haodong Liu, Yin Bai, Jia Cheng, Wenliang Chen

Abstract: Zero-shot dialogue state tracking (DST) transfers knowledge to unseen domains, reducing the cost of annotating new datasets. Previous zero-shot DST models mainly suffer from domain transfer and partial prediction problems. To address these challenges, we propose Mixture of Prefix Experts (MoPE) to establish connections between similar slots in different domains, which strengthens the model's transfer performance in unseen domains. Empirical results demonstrate that MoPE-DST achieves a joint goal accuracy of 57.13% on MultiWOZ2.1 and 55.40% on SGD.

* Accepted to LREC-COLING 2024

DiffusionDialog: A Diffusion Model for Diverse Dialog Generation with Latent Space

Apr 10, 2024

Jianxiang Xiang, Zhenhua Liu, Haodong Liu, Yin Bai, Jia Cheng, Wenliang Chen

Abstract: In real-life conversations, the content is diverse, and there exists a one-to-many problem that requires diverse generation. Previous studies attempted to introduce discrete or Gaussian-based continuous latent variables to address the one-to-many problem, but the resulting diversity is limited. Recently, diffusion models have made breakthroughs in computer vision, and some attempts have been made in natural language processing. In this paper, we propose DiffusionDialog, a novel approach to enhance the diversity of dialogue generation with the help of a diffusion model. In our approach, we introduce continuous latent variables into the diffusion model. The problem of using latent variables in the dialog task is how to build both an effective prior of the latent space and an inference process to obtain the proper latent given the context. By combining the encoder and latent-based diffusion model, we encode the response's latent representation in a continuous space as the prior, instead of a fixed Gaussian distribution or simply discrete ones. We then infer the latent by denoising step by step with the diffusion model. The experimental results show that our model greatly enhances the diversity of dialog responses while maintaining coherence. In further analysis, we find that our diffusion model achieves high inference efficiency, which is the main challenge of applying diffusion models in natural language processing.

* LREC-COLING 2024 camera ready
