Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jorge Yero

Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach

May 30, 2024

Ernesto Quevedo, Jorge Yero, Rachel Koerner, Pablo Rivas, Tomas Cerny

Figure 1 for Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach

Figure 2 for Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach

Figure 3 for Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach

Figure 4 for Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach

Abstract:Concerns regarding the propensity of Large Language Models (LLMs) to produce inaccurate outputs, also known as hallucinations, have escalated. Detecting them is vital for ensuring the reliability of applications relying on LLM-generated content. Current methods often demand substantial resources and rely on extensive LLMs or employ supervised learning with multidimensional features or intricate linguistic and semantic analyses difficult to reproduce and largely depend on using the same LLM that hallucinated. This paper introduces a supervised learning approach employing two simple classifiers utilizing only four numerical features derived from tokens and vocabulary probabilities obtained from other LLM evaluators, which are not necessarily the same. The method yields promising results, surpassing state-of-the-art outcomes in multiple tasks across three different benchmarks. Additionally, we provide a comprehensive examination of the strengths and weaknesses of our approach, highlighting the significance of the features utilized and the LLM employed as an evaluator. We have released our code publicly at https://github.com/Baylor-AI/HalluDetect.

* ICAI'24 - The 26th Int'l Conf on Artificial Intelligence

Via

Access Paper or Ask Questions