Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG

Oct 08, 2024

Bowen Jin, Jinsung Yoon, Jiawei Han, Sercan O. Arik

Figure 1 for Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG

Figure 2 for Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG

Figure 3 for Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG

Figure 4 for Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG

Share this with someone who'll enjoy it:

Abstract:Retrieval-augmented generation (RAG) empowers large language models (LLMs) to utilize external knowledge sources. The increasing capacity of LLMs to process longer input sequences opens up avenues for providing more retrieved information, to potentially enhance the quality of generated outputs. It is plausible to assume that a larger retrieval set would contain more relevant information (higher recall), that might result in improved performance. However, our empirical findings demonstrate that for many long-context LLMs, the quality of generated output initially improves first, but then subsequently declines as the number of retrieved passages increases. This paper investigates this phenomenon, identifying the detrimental impact of retrieved "hard negatives" as a key contributor. To mitigate this and enhance the robustness of long-context LLM-based RAG, we propose both training-free and training-based approaches. We first showcase the effectiveness of retrieval reordering as a simple yet powerful training-free optimization. Furthermore, we explore training-based methods, specifically RAG-specific implicit LLM fine-tuning and RAG-oriented fine-tuning with intermediate reasoning, demonstrating their capacity for substantial performance gains. Finally, we conduct a systematic analysis of design choices for these training-based methods, including data distribution, retriever selection, and training context length.

* 34 pages

View paper on

Share this with someone who'll enjoy it:

Title:Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG

Paper and Code