Abstract: Large Language Models (LLMs) excel in fluency but risk producing inaccurate content, known as "hallucinations." This paper outlines a standardized process for categorizing fine-grained hallucination types and proposes the Progressive Fine-grained Model Editor (PFME), a novel framework designed to detect and correct fine-grained hallucinations in LLMs. PFME consists of two collaborative modules: the Real-time Fact Retrieval Module and the Fine-grained Hallucination Detection and Editing Module. The former identifies key entities in the document and retrieves the latest factual evidence from credible sources. The latter segments the document into sentence-level text and, based on the relevant evidence and the previously edited context, identifies and locates the hallucination type of each sentence and edits it accordingly. Experimental results on FavaBench and FActScore demonstrate that PFME outperforms existing methods on fine-grained hallucination detection tasks. In particular, when using the Llama3-8B-Instruct model, PFME's performance in fine-grained hallucination detection with external knowledge assistance improves by 8.7 percentage points (pp) over ChatGPT. In editing tasks, PFME further improves the FActScore of the FActScore-Alpaca13B and FActScore-ChatGPT datasets, by 16.2 pp and 4.6 pp, respectively.
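The abstract describes a two-module pipeline in which each sentence is edited conditioned on retrieved evidence and the already-edited prefix. The following is a minimal, hypothetical sketch of that control flow; the helper names (retrieve_evidence, extract_key_entities, detect_and_edit) are illustrative placeholders, not the authors' actual API.

```python
# Hypothetical sketch of the PFME two-module pipeline; all helpers are stubs.
from typing import List


def extract_key_entities(document: str) -> List[str]:
    """Placeholder entity extractor; a real system might use NER or an LLM."""
    return [w.strip(".,") for w in document.split() if w.istitle()]


def retrieve_evidence(entity: str) -> List[str]:
    """Placeholder for the Real-time Fact Retrieval Module: look up the
    latest factual evidence about a key entity from credible sources."""
    return [f"evidence about {entity}"]


def detect_and_edit(sentence: str, evidence: List[str], context: str) -> str:
    """Placeholder for the Fine-grained Hallucination Detection and Editing
    Module: classify the sentence's hallucination type against the evidence
    and the previously edited context, then rewrite it if needed."""
    return sentence  # identity edit in this stub


def pfme_pipeline(document: str) -> str:
    """Edit a document sentence by sentence, conditioning each edit on the
    retrieved evidence and the already-edited prefix."""
    entities = extract_key_entities(document)
    evidence = [fact for e in entities for fact in retrieve_evidence(e)]
    edited: List[str] = []
    for sentence in document.split(". "):
        context = ". ".join(edited)
        edited.append(detect_and_edit(sentence, evidence, context))
    return ". ".join(edited)


if __name__ == "__main__":
    print(pfme_pipeline("Marie Curie won two Nobel Prizes. She was born in Warsaw."))
```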
Abstract: Document-level Event Causality Identification (DECI) aims to identify causal relations between event pairs in a document. It poses the great challenge of cross-sentence reasoning without clear causal indicators. In this paper, we propose a novel Event Relational Graph TransfOrmer (ERGO) framework for DECI, which improves on existing state-of-the-art (SOTA) methods in two respects. First, we formulate DECI as a node classification problem by constructing an event relational graph, without the need for prior knowledge or tools. Second, ERGO seamlessly integrates event-pair relation classification and global inference, leveraging a Relational Graph Transformer (RGT) to capture potential causal chains. In addition, we introduce edge-building strategies and an adaptive focal loss to deal with the massive false positives caused by common spurious correlations. Extensive experiments on two benchmark datasets show that ERGO significantly outperforms previous SOTA methods (13.1% F1 gains on average). We have conducted extensive quantitative analysis and case studies to provide insights for future research directions (Section 4.8).
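The abstract attributes part of ERGO's robustness to an adaptive focal loss for suppressing false-positive causal pairs. As a rough illustration of the underlying idea, the sketch below implements a standard binary focal loss in PyTorch; the exact adaptive weighting used in the paper may differ, and the function name and defaults are assumptions for this example.

```python
# Minimal focal-loss sketch for event-pair classification (illustrative only;
# not the paper's exact adaptive variant).
import torch
import torch.nn.functional as F


def focal_loss(logits: torch.Tensor,
               targets: torch.Tensor,
               alpha: float = 0.25,
               gamma: float = 2.0) -> torch.Tensor:
    """Binary focal loss: down-weights easy (mostly non-causal) pairs so
    training focuses on hard examples and false positives.

    logits:  raw causal scores for each event pair, shape (N,)
    targets: 0/1 causal labels, shape (N,)
    """
    probs = torch.sigmoid(logits)
    # p_t is the predicted probability of the true class for each pair.
    p_t = probs * targets + (1 - probs) * (1 - targets)
    alpha_t = alpha * targets + (1 - alpha) * (1 - targets)
    ce = F.binary_cross_entropy_with_logits(logits, targets, reduction="none")
    return (alpha_t * (1 - p_t) ** gamma * ce).mean()


if __name__ == "__main__":
    scores = torch.randn(8)                                   # pairwise scores
    labels = torch.tensor([0, 0, 1, 0, 0, 0, 1, 0], dtype=torch.float)
    print(focal_loss(scores, labels))
```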