Picture for Huaxiu Yao

Huaxiu Yao

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Add code
Mar 17, 2026
Viaarxiv icon

Kestrel: Grounding Self-Refinement for LVLM Hallucination Mitigation

Add code
Mar 17, 2026
Viaarxiv icon

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Add code
Mar 12, 2026
Viaarxiv icon

Provable and Practical In-Context Policy Optimization for Self-Improvement

Add code
Mar 02, 2026
Viaarxiv icon

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

Add code
Feb 25, 2026
Viaarxiv icon

SimpleOCR: Rendering Visualized Questions to Teach MLLMs to Read

Add code
Feb 25, 2026
Viaarxiv icon

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Add code
Feb 11, 2026
Viaarxiv icon

When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning

Add code
Feb 09, 2026
Viaarxiv icon

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Add code
Feb 09, 2026
Viaarxiv icon

MedVerse: Efficient and Reliable Medical Reasoning via DAG-Structured Parallel Execution

Add code
Feb 07, 2026
Viaarxiv icon