Picture for Xunliang Cai

Xunliang Cai

Alphabetical order by last name

LANG: Reinforcement Learning for Multilingual Reasoning with Language-Adaptive Hint Guidance

Add code
May 21, 2026
Viaarxiv icon

MTR-Suite: A Framework for Evaluating and Synthesizing Conversational Retrieval Benchmarks

Add code
May 20, 2026
Viaarxiv icon

When to Stop Reusing: Dynamic Gradient Gating for Sample-Efficient RLVR

Add code
May 19, 2026
Viaarxiv icon

Self-Distilled Agentic Reinforcement Learning

Add code
May 14, 2026
Viaarxiv icon

MAP: A Map-then-Act Paradigm for Long-Horizon Interactive Agent Reasoning

Add code
May 13, 2026
Viaarxiv icon

Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization

Add code
May 13, 2026
Viaarxiv icon

Asymmetric On-Policy Distillation: Bridging Exploitation and Imitation at the Token Level

Add code
May 07, 2026
Viaarxiv icon

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Add code
May 07, 2026
Viaarxiv icon

HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness

Add code
May 04, 2026
Viaarxiv icon

FG$^2$-GDN: Enhancing Long-Context Gated Delta Networks with Doubly Fine-Grained Control

Add code
Apr 21, 2026
Viaarxiv icon