Peter J Liu

Assessing The Factual Accuracy of Generated Text

May 30, 2019

Ben Goodrich, Vinay Rao, Mohammad Saleh, Peter J Liu

Abstract: We propose a model-based metric to estimate the factual accuracy of generated text that is complementary to typical scoring schemes like ROUGE (Recall-Oriented Understudy for Gisting Evaluation) and BLEU (Bilingual Evaluation Understudy). We introduce and release a new large-scale dataset based on Wikipedia and Wikidata to train relation classifiers and end-to-end fact extraction models. The end-to-end models are shown to be able to extract complete sets of facts from datasets with full pages of text. We then analyse multiple models that estimate factual accuracy on a Wikipedia text summarization task, and show their efficacy compared to ROUGE and other model-free variants by conducting a human evaluation study.

* The 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '19), August 4-8, 2019, Anchorage, AK, USA
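To make the idea of a fact-based accuracy score concrete, here is a minimal sketch of how such a score could be computed once facts have been extracted. It is an illustration under stated assumptions, not the authors' implementation: the (subject, relation, object) tuple representation, the function name `factual_precision`, and the example facts are all hypothetical, and the extraction step itself (the learned models the abstract describes) is assumed to have already run.

```python
from typing import Iterable, Set, Tuple

# Assumption: a fact is a (subject, relation, object) triple of strings.
Fact = Tuple[str, str, str]


def factual_precision(generated: Iterable[Fact], reference: Iterable[Fact]) -> float:
    """Fraction of facts in the generated text that also appear in the reference.

    Hypothetical helper for illustration; returns 0.0 when no facts were
    extracted from the generated text.
    """
    gen: Set[Fact] = set(generated)
    ref: Set[Fact] = set(reference)
    if not gen:
        return 0.0
    return len(gen & ref) / len(gen)


# Made-up facts standing in for the output of a fact extraction model.
reference_facts = {
    ("Barack Obama", "born in", "Honolulu"),
    ("Barack Obama", "occupation", "politician"),
}
generated_facts = {
    ("Barack Obama", "born in", "Honolulu"),
    ("Barack Obama", "born in", "Chicago"),
}

score = factual_precision(generated_facts, reference_facts)
print(f"factual precision: {score:.2f}")
```

Unlike n-gram overlap, a score of this kind penalizes a fluent summary that states a wrong object for a relation, which is the sort of error ROUGE and BLEU can miss.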
