Picture for Lingyong Yan

Lingyong Yan

Retrieval Models Aren't Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models

Add code
Mar 03, 2025
Viaarxiv icon

Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking

Add code
Feb 18, 2025
Viaarxiv icon

Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning

Add code
Jan 25, 2025
Figure 1 for Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning
Figure 2 for Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning
Figure 3 for Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning
Figure 4 for Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning
Viaarxiv icon

PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization

Add code
Dec 19, 2024
Figure 1 for PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization
Figure 2 for PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization
Figure 3 for PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization
Figure 4 for PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization
Viaarxiv icon

MAIR: A Massive Benchmark for Evaluating Instructed Retrieval

Add code
Oct 14, 2024
Figure 1 for MAIR: A Massive Benchmark for Evaluating Instructed Retrieval
Figure 2 for MAIR: A Massive Benchmark for Evaluating Instructed Retrieval
Figure 3 for MAIR: A Massive Benchmark for Evaluating Instructed Retrieval
Figure 4 for MAIR: A Massive Benchmark for Evaluating Instructed Retrieval
Viaarxiv icon

MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization

Add code
Oct 10, 2024
Figure 1 for MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization
Figure 2 for MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization
Figure 3 for MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization
Figure 4 for MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization
Viaarxiv icon

Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation

Add code
Oct 08, 2024
Figure 1 for Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation
Figure 2 for Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation
Figure 3 for Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation
Figure 4 for Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation
Viaarxiv icon

ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator

Add code
May 28, 2024
Viaarxiv icon

Chain of Tools: Large Language Model is an Automatic Multi-tool Learner

Add code
May 26, 2024
Viaarxiv icon

The Real, the Better: Aligning Large Language Models with Online Human Behaviors

Add code
May 01, 2024
Viaarxiv icon