Shivay Nagpal

ARAGOG: Advanced RAG Output Grading

Apr 01, 2024

Matouš Eibich, Shivay Nagpal, Alexander Fred-Ojala

Figures 1-4 for ARAGOG: Advanced RAG Output Grading

Abstract: Retrieval-Augmented Generation (RAG) is essential for integrating external knowledge into Large Language Model (LLM) outputs. While the literature on RAG is growing, it primarily focuses on systematic reviews and comparisons of new state-of-the-art (SoTA) techniques against their predecessors, with a gap in extensive experimental comparisons. This study begins to address this gap by assessing various RAG methods' impacts on retrieval precision and answer similarity. We found that Hypothetical Document Embedding (HyDE) and LLM reranking significantly enhance retrieval precision. However, Maximal Marginal Relevance (MMR) and Cohere rerank did not exhibit notable advantages over a baseline Naive RAG system, and Multi-query approaches underperformed. Sentence Window Retrieval emerged as the most effective for retrieval precision, despite its variable performance on answer similarity. The study confirms the potential of the Document Summary Index as a competent retrieval approach. All resources related to this research are publicly accessible for further investigation through our GitHub repository ARAGOG (https://github.com/predlico/ARAGOG). We welcome the community to further this exploratory study in RAG systems.
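The abstract names Maximal Marginal Relevance (MMR) without describing it. As a rough illustration only, the sketch below uses the general textbook formulation of MMR, which trades relevance to the query against redundancy with already selected documents. It is not taken from the ARAGOG repository: the function names `cosine` and `mmr`, the weight `lam`, and the toy two-dimensional vectors are all assumptions made for this example.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def mmr(query, docs, k=2, lam=0.5):
    """Return indices of up to k docs picked greedily by MMR.

    lam (hypothetical name) weights relevance against redundancy:
    lam=1.0 reduces to plain similarity ranking.
    """
    selected = []
    candidates = list(range(len(docs)))
    while candidates and len(selected) < k:
        def score(i):
            relevance = cosine(query, docs[i])
            # Highest similarity to anything already chosen (0 if none yet).
            redundancy = max(
                (cosine(docs[i], docs[j]) for j in selected), default=0.0
            )
            return lam * relevance - (1 - lam) * redundancy
        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return selected

# Toy embeddings (made up): docs 0 and 1 are near duplicates, doc 2 differs.
query = [1.0, 1.0]
docs = [
    [1.0, 0.9],
    [1.0, 0.8],
    [0.2, 1.0],
]

# Naive top-k by similarity alone versus MMR selection.
naive = sorted(
    range(len(docs)), key=lambda i: cosine(query, docs[i]), reverse=True
)[:2]
diverse = mmr(query, docs, k=2, lam=0.5)
print("naive:", naive, "mmr:", diverse)
```

On this toy data, similarity-only ranking returns the two near duplicates, while MMR replaces the second with the dissimilar document. Whether that trade-off helps in practice depends on the corpus and the evaluation metric, which is the kind of question the study examines empirically.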

* 14 pages, 8 figures, associated GitHub repo: https://github.com/predlico/ARAGOG
