HotpotQA


Reasoning Court: Combining Reasoning, Action, and Judgment for Multi-Hop Reasoning

Apr 14, 2025
Via arXiv

ControlNET: A Firewall for RAG-based LLM System

Apr 13, 2025
Via arXiv

Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use

Apr 07, 2025
Via arXiv

DebFlow: Automating Agent Creation via Agent Debate

Mar 31, 2025
Via arXiv

An Analysis of Decoding Methods for LLM-based Agents for Faithful Multi-Hop Question Answering

Mar 30, 2025
Via arXiv

A Training-free LLM Framework with Interaction between Contextually Related Subtasks in Solving Complex Tasks

Mar 29, 2025
Via arXiv

Do Retrieval-Augmented Language Models Adapt to Varying User Needs?

Feb 27, 2025
Via arXiv

Mitigating Lost-in-Retrieval Problems in Retrieval Augmented Multi-Hop Question Answering

Feb 20, 2025
Via arXiv

SPEX: Scaling Feature Interaction Explanations for LLMs

Feb 19, 2025
Via arXiv

EDGE: Efficient Data Selection for LLM Agents via Guideline Effectiveness

Feb 18, 2025
Via arXiv