Title: OpinSummEval: Revisiting Automated Evaluation for Opinion Summarization

Oct 27, 2023

Yuchen Shen, Xiaojun Wan

Abstract: Opinion summarization sets itself apart from other types of summarization tasks due to its distinctive focus on aspects and sentiments. Although certain automated evaluation methods like ROUGE have gained popularity, we have found them to be unreliable measures for assessing the quality of opinion summaries. In this paper, we present OpinSummEval, a dataset comprising human judgments and outputs from 14 opinion summarization models. We further explore the correlation between 24 automatic metrics and human ratings across four dimensions. Our findings indicate that metrics based on neural networks generally outperform non-neural ones. However, even metrics built on powerful backbones, such as BART and GPT-3/3.5, do not consistently correlate well across all dimensions, highlighting the need for advancements in automated evaluation methods for opinion summarization. The code and data are publicly available at https://github.com/A-Chicharito-S/OpinSummEval/tree/main.

* preprint, 19 pages, 4 figures, 10 tables
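
The correlation analysis mentioned in the abstract (automatic metric scores compared against human ratings) could be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not say which correlation coefficient the authors use, so Kendall's tau-b is assumed here; the function name `kendall_tau_b`, the metric names, and all numbers are hypothetical toy values, not data from OpinSummEval.

```python
from itertools import combinations
from math import sqrt


def kendall_tau_b(xs, ys):
    """Kendall's tau-b rank correlation between two equal-length sequences.

    Assumption: tau-b is chosen for illustration only; the source abstract
    does not name the coefficient used in the paper.
    """
    if len(xs) != len(ys):
        raise ValueError("sequences must have the same length")

    concordant = discordant = ties_x_only = ties_y_only = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        dx, dy = x1 - x2, y1 - y2
        if dx == 0 and dy == 0:
            continue  # tied in both variables: contributes to neither factor
        if dx == 0:
            ties_x_only += 1
        elif dy == 0:
            ties_y_only += 1
        elif dx * dy > 0:
            concordant += 1
        else:
            discordant += 1

    # Pairs untied in x: concordant + discordant + ties_y_only (and vice versa).
    untied_x = concordant + discordant + ties_y_only
    untied_y = concordant + discordant + ties_x_only
    denom = sqrt(untied_x * untied_y)
    return (concordant - discordant) / denom if denom else 0.0


# Hypothetical toy data: human ratings for five summaries on one dimension,
# and the scores two made-up automatic metrics assign to the same summaries.
human_ratings = [1, 2, 3, 4, 5]
metric_scores = {
    "toy_metric_a": [0.10, 0.25, 0.30, 0.45, 0.60],
    "toy_metric_b": [0.60, 0.45, 0.30, 0.25, 0.10],
}

correlations = {
    name: kendall_tau_b(scores, human_ratings)
    for name, scores in metric_scores.items()
}
for name, tau in correlations.items():
    print(f"{name}: tau-b = {tau:.2f}")
```

Repeating this per evaluation dimension and per metric would give a table of correlations of the kind the abstract describes; the actual procedure and data are in the linked repository.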
