Title: FinDVer: Explainable Claim Verification over Long and Hybrid-Content Financial Documents

Nov 08, 2024

Yilun Zhao, Yitao Long, Yuru Jiang, Chengye Wang, Weiyuan Chen, Hongjun Liu, Yiming Zhang, Xiangru Tang, Chen Zhao, Arman Cohan

Abstract: We introduce FinDVer, a comprehensive benchmark specifically designed to evaluate the explainable claim verification capabilities of LLMs in the context of understanding and analyzing long, hybrid-content financial documents. FinDVer contains 2,400 expert-annotated examples, divided into three subsets: information extraction, numerical reasoning, and knowledge-intensive reasoning, each addressing common scenarios encountered in real-world financial contexts. We assess a broad spectrum of LLMs under long-context and RAG settings. Our results show that even the current best-performing system, GPT-4o, still lags behind human experts. We further provide in-depth analysis of the long-context and RAG settings, Chain-of-Thought reasoning, and model reasoning errors, offering insights to drive future advancements. We believe that FinDVer can serve as a valuable benchmark for evaluating LLMs in claim verification over complex, expert-domain documents.

* EMNLP 2024
