Title: CtrlA: Adaptive Retrieval-Augmented Generation via Probe-Guided Control

May 29, 2024

Huanshuo Liu, Hao Zhang, Zhijiang Guo, Kuicai Dong, Xiangyang Li, Yi Quan Lee, Cong Zhang, Yong Liu

Abstract: Retrieval-augmented generation (RAG) has emerged as a promising solution for mitigating hallucinations of large language models (LLMs) with retrieved external knowledge. Adaptive RAG enhances this approach by dynamically assessing whether retrieval is necessary, aiming to balance the use of external and internal knowledge. However, existing adaptive RAG methods primarily realize retrieval on demand by relying on superficial verbalization-based or probability-based feedback from LLMs, or by directly fine-tuning LLMs on carefully crafted datasets, resulting in unreliable retrieval necessity decisions, heavy extra costs, and sub-optimal response generation. We present a first attempt to delve into the internal states of LLMs to mitigate such issues by introducing an effective probe-guided adaptive RAG framework, termed CtrlA. Specifically, CtrlA employs an honesty probe to regulate the LLM's behavior by manipulating its representations for increased honesty, and a confidence probe to monitor the internal states of the LLM and assess confidence levels, determining whether retrieval is necessary during generation. Experiments show that CtrlA is superior to existing adaptive RAG methods on a diverse set of tasks, that honesty control can effectively make LLMs more honest, and that confidence monitoring is a promising indicator for triggering retrieval. Our code is available at https://github.com/HSLiu-Initial/CtrlA.git.
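
The two-probe idea described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes each probe is a single direction vector in the hidden-state space, that honesty control adds a scaled copy of that direction to each hidden state, and that retrieval is triggered when any token's projection onto the confidence direction falls below a threshold. The names `steer`, `confidence_score`, `needs_retrieval`, `alpha`, and `threshold` are all hypothetical, and the vectors are made-up two-dimensional examples.

```python
from typing import List, Sequence

Vector = List[float]


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    """Plain dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def steer(hidden: Sequence[float], direction: Sequence[float], alpha: float) -> Vector:
    """Shift a hidden state along a probe direction (assumed form of honesty control)."""
    return [h + alpha * d for h, d in zip(hidden, direction)]


def confidence_score(hidden: Sequence[float], probe: Sequence[float]) -> float:
    """Project a hidden state onto the confidence direction (assumed scoring rule)."""
    return dot(hidden, probe)


def needs_retrieval(
    token_states: Sequence[Sequence[float]],
    probe: Sequence[float],
    threshold: float = 0.0,
) -> bool:
    """Trigger retrieval if any generated token scores below the threshold."""
    return any(confidence_score(h, probe) < threshold for h in token_states)


# Toy data: two generated tokens with 2-dimensional hidden states.
honesty_dir: Vector = [0.0, 1.0]        # hypothetical honesty direction
confidence_probe: Vector = [1.0, 0.0]   # hypothetical confidence direction
states = [[0.8, 0.1], [-0.3, 0.2]]

# Apply honesty steering first, then monitor confidence on the steered states.
steered = [steer(h, honesty_dir, alpha=0.5) for h in states]
trigger = needs_retrieval(steered, confidence_probe)
print("retrieve" if trigger else "answer from internal knowledge")
```

In this toy run the second token projects negatively onto the confidence direction, so the sketch chooses to retrieve. In a real system the hidden states would come from a chosen layer of the LLM, and the directions and threshold would be derived from data.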

* 28 pages, 7 figures, 9 tables
