Xunliang Cai

Alphabetical order by last name

Learning to Self-Verify Makes Language Models Better Reasoners

Feb 07, 2026
Via arXiv

ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training

Feb 06, 2026
Via arXiv

Infinite-World: Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory

Feb 03, 2026
Via arXiv

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Feb 03, 2026
Via arXiv

$V_0$: A Generalist Value Model for Any Policy at State Zero

Feb 03, 2026
Via arXiv

DIFFA-2: A Practical Diffusion Large Language Model for General Audio Understanding

Jan 30, 2026
Via arXiv

SpanNorm: Reconciling Training Stability and Performance in Deep Transformers

Jan 30, 2026
Via arXiv

Causal Autoregressive Diffusion Language Model

Jan 29, 2026
Via arXiv

Exploring Reasoning Reward Model for Agents

Jan 29, 2026
Via arXiv

Scaling Embeddings Outperforms Scaling Experts in Language Models

Jan 29, 2026
Via arXiv