Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Investigating the Impact of Pre-trained Language Models on Dialog Evaluation

Oct 05, 2021

Chen Zhang, Luis Fernando D'Haro, Yiming Chen, Thomas Friedrichs, Haizhou Li

Figure 1 for Investigating the Impact of Pre-trained Language Models on Dialog Evaluation

Figure 2 for Investigating the Impact of Pre-trained Language Models on Dialog Evaluation

Figure 3 for Investigating the Impact of Pre-trained Language Models on Dialog Evaluation

Figure 4 for Investigating the Impact of Pre-trained Language Models on Dialog Evaluation

Share this with someone who'll enjoy it:

Abstract:Recently, there is a surge of interest in applying pre-trained language models (Pr-LM) in automatic open-domain dialog evaluation. Pr-LMs offer a promising direction for addressing the multi-domain evaluation challenge. Yet, the impact of different Pr-LMs on the performance of automatic metrics is not well-understood. This paper examines 8 different Pr-LMs and studies their impact on three typical automatic dialog evaluation metrics across three different dialog evaluation benchmarks. Specifically, we analyze how the choice of Pr-LMs affects the performance of automatic metrics. Extensive correlation analyses on each of the metrics are performed to assess the effects of different Pr-LMs along various axes, including pre-training objectives, dialog evaluation criteria, model size, and cross-dataset robustness. This study serves as the first comprehensive assessment of the effects of different Pr-LMs on automatic dialog evaluation.

* Accepted by IWSDS2021 (Long Paper)

View paper on

Share this with someone who'll enjoy it:

Title:Investigating the Impact of Pre-trained Language Models on Dialog Evaluation

Paper and Code