Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Feiteng Mu

Unsupervised Query Routing for Retrieval Augmented Generation

Jan 14, 2025

Feiteng Mu, Liwen Zhang, Yong Jiang, Wenjie Li, Zhen Zhang, Pengjun Xie, Fei Huang

Abstract:Query routing for retrieval-augmented generation aims to assign an input query to the most suitable search engine. Existing works rely heavily on supervised datasets that require extensive manual annotation, resulting in high costs and limited scalability, as well as poor generalization to out-of-distribution scenarios. To address these challenges, we introduce a novel unsupervised method that constructs the "upper-bound" response to evaluate the quality of retrieval-augmented responses. This evaluation enables the decision of the most suitable search engine for a given query. By eliminating manual annotations, our approach can automatically process large-scale real user queries and create training data. We conduct extensive experiments across five datasets, demonstrating that our method significantly enhances scalability and generalization capabilities.

Via

Access Paper or Ask Questions

Exploring Knowledge Boundaries in Large Language Models for Retrieval Judgment

Nov 09, 2024

Zhen Zhang, Xinyu Wang, Yong Jiang, Zhuo Chen, Feiteng Mu, Mengting Hu, Pengjun Xie, Fei Huang

Figure 1 for Exploring Knowledge Boundaries in Large Language Models for Retrieval Judgment

Figure 2 for Exploring Knowledge Boundaries in Large Language Models for Retrieval Judgment

Figure 3 for Exploring Knowledge Boundaries in Large Language Models for Retrieval Judgment

Figure 4 for Exploring Knowledge Boundaries in Large Language Models for Retrieval Judgment

Abstract:Large Language Models (LLMs) are increasingly recognized for their practical applications. However, these models often encounter challenges in dynamically changing knowledge, as well as in managing unknown static knowledge. Retrieval-Augmented Generation (RAG) tackles this challenge and has shown a significant impact on LLMs. Actually, we find that the impact of RAG on the question answering capabilities of LLMs can be categorized into three groups: beneficial, neutral, and harmful. By minimizing retrieval requests that yield neutral or harmful results, we can effectively reduce both time and computational costs, while also improving the overall performance of LLMs. This insight motivates us to differentiate between types of questions using certain metrics as indicators, to decrease the retrieval ratio without compromising performance. In our work, we propose a method that is able to identify different types of questions from this view by training a Knowledge Boundary Model (KBM). Experiments conducted on 11 English and Chinese datasets illustrate that the KBM effectively delineates the knowledge boundary, significantly decreasing the proportion of retrievals required for optimal end-to-end performance. Specifically, we evaluate the effectiveness of KBM in three complex scenarios: dynamic knowledge, long-tail static knowledge, and multi-hop problems, as well as its functionality as an external LLM plug-in.

Via

Access Paper or Ask Questions

Adaptive Selection for Homogeneous Tools: An Instantiation in the RAG Scenario

Jun 18, 2024

Feiteng Mu, Yong Jiang, Liwen Zhang, Chu Liu, Wenjie Li, Pengjun Xie, Fei Huang

Figure 1 for Adaptive Selection for Homogeneous Tools: An Instantiation in the RAG Scenario

Figure 2 for Adaptive Selection for Homogeneous Tools: An Instantiation in the RAG Scenario

Figure 3 for Adaptive Selection for Homogeneous Tools: An Instantiation in the RAG Scenario

Figure 4 for Adaptive Selection for Homogeneous Tools: An Instantiation in the RAG Scenario

Abstract:Current research on tool learning primarily focuses on selecting the most effective tool from a wide array of options, often overlooking cost-effectiveness, a crucial factor in human problem-solving. In this paper, we address the selection of homogeneous tools by predicting both their performance and the associated cost required to accomplish a given task. We then assign queries to the optimal tools in a cost-effective manner. Our experimental results demonstrate that our method achieves higher performance at a lower cost compared to strong baseline approaches.

Via

Access Paper or Ask Questions

Empathetic Response Generation through Graph-based Multi-hop Reasoning on Emotional Causality

Oct 09, 2021

Jiashuo Wang, Wenjie LI, Peiqin Lin, Feiteng Mu

Figure 1 for Empathetic Response Generation through Graph-based Multi-hop Reasoning on Emotional Causality

Figure 2 for Empathetic Response Generation through Graph-based Multi-hop Reasoning on Emotional Causality

Figure 3 for Empathetic Response Generation through Graph-based Multi-hop Reasoning on Emotional Causality

Figure 4 for Empathetic Response Generation through Graph-based Multi-hop Reasoning on Emotional Causality

Abstract:Empathetic response generation aims to comprehend the user emotion and then respond to it appropriately. Most existing works merely focus on what the emotion is and ignore how the emotion is evoked, thus weakening the capacity of the model to understand the emotional experience of the user for generating empathetic responses. To tackle this problem, we consider the emotional causality, namely, what feelings the user expresses (i.e., emotion) and why the user has such feelings (i.e., cause). Then, we propose a novel graph-based model with multi-hop reasoning to model the emotional causality of the empathetic conversation. Finally, we demonstrate the effectiveness of our model on EMPATHETICDIALOGUES in comparison with several competitive models.

* Knowledge-Based Systems 233 (2021) 107547
* The manuscript is accepted and published in Knowledge-Based Systems

Via

Access Paper or Ask Questions