Picture for Xunliang Cai

Xunliang Cai

Alphabetical order by last name

Asymmetric On-Policy Distillation: Bridging Exploitation and Imitation at the Token Level

Add code
May 07, 2026
Viaarxiv icon

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Add code
May 07, 2026
Viaarxiv icon

HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness

Add code
May 04, 2026
Viaarxiv icon

FG$^2$-GDN: Enhancing Long-Context Gated Delta Networks with Doubly Fine-Grained Control

Add code
Apr 21, 2026
Viaarxiv icon

AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation

Add code
Apr 20, 2026
Viaarxiv icon

SparseBalance: Load-Balanced Long Context Training with Dynamic Sparse Attention

Add code
Apr 15, 2026
Viaarxiv icon

LARY: A Latent Action Representation Yielding Benchmark for Generalizable Vision-to-Action Alignment

Add code
Apr 13, 2026
Viaarxiv icon

General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks

Add code
Apr 13, 2026
Viaarxiv icon

SCOPE: Signal-Calibrated On-Policy Distillation Enhancement with Dual-Path Adaptive Weighting

Add code
Apr 12, 2026
Viaarxiv icon

AsyncTLS: Efficient Generative LLM Inference with Asynchronous Two-level Sparse Attention

Add code
Apr 09, 2026
Viaarxiv icon