Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:FactLens: Benchmarking Fine-Grained Fact Verification

Nov 08, 2024

Kushan Mitra, Dan Zhang, Sajjadur Rahman, Estevam Hruschka

Figure 1 for FactLens: Benchmarking Fine-Grained Fact Verification

Figure 2 for FactLens: Benchmarking Fine-Grained Fact Verification

Figure 3 for FactLens: Benchmarking Fine-Grained Fact Verification

Figure 4 for FactLens: Benchmarking Fine-Grained Fact Verification

Share this with someone who'll enjoy it:

Abstract:Large Language Models (LLMs) have shown impressive capability in language generation and understanding, but their tendency to hallucinate and produce factually incorrect information remains a key limitation. To verify LLM-generated contents and claims from other sources, traditional verification approaches often rely on holistic models that assign a single factuality label to complex claims, potentially obscuring nuanced errors. In this paper, we advocate for a shift toward fine-grained verification, where complex claims are broken down into smaller sub-claims for individual verification, allowing for more precise identification of inaccuracies, improved transparency, and reduced ambiguity in evidence retrieval. However, generating sub-claims poses challenges, such as maintaining context and ensuring semantic equivalence with respect to the original claim. We introduce FactLens, a benchmark for evaluating fine-grained fact verification, with metrics and automated evaluators of sub-claim quality. The benchmark data is manually curated to ensure high-quality ground truth. Our results show alignment between automated FactLens evaluators and human judgments, and we discuss the impact of sub-claim characteristics on the overall verification performance.

* 12 pages, under review

View paper on

Share this with someone who'll enjoy it:

Title:FactLens: Benchmarking Fine-Grained Fact Verification

Paper and Code