Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs

May 09, 2024

Yuxia Wang, Minghan Wang, Hasan Iqbal, Georgi Georgiev, Jiahui Geng, Preslav Nakov

Figure 1 for OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs

Figure 2 for OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs

Figure 3 for OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs

Figure 4 for OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs

Share this with someone who'll enjoy it:

Abstract:The increased use of large language models (LLMs) across a variety of real-world applications calls for mechanisms to verify the factual accuracy of their outputs. Difficulties lie in assessing the factuality of free-form responses in open domains. Also, different papers use disparate evaluation benchmarks and measurements, which renders them hard to compare and hampers future progress. To mitigate these issues, we propose OpenFactCheck, a unified factuality evaluation framework for LLMs. OpenFactCheck consists of three modules: (i) CUSTCHECKER allows users to easily customize an automatic fact-checker and verify the factual correctness of documents and claims, (ii) LLMEVAL, a unified evaluation framework assesses LLM's factuality ability from various perspectives fairly, and (iii) CHECKEREVAL is an extensible solution for gauging the reliability of automatic fact-checkers' verification results using human-annotated datasets. OpenFactCheck is publicly released at https://github.com/yuxiaw/OpenFactCheck.

* 19 pages, 8 tables, 8 figures

View paper on

Share this with someone who'll enjoy it:

Title:OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs

Paper and Code