Picture for Bo An

Bo An

When More Experts Hurt: Underfitting in Multi-Expert Learning to Defer

Add code
Feb 19, 2026
Viaarxiv icon

Hierarchical Audio-Visual-Proprioceptive Fusion for Precise Robotic Manipulation

Add code
Feb 14, 2026
Viaarxiv icon

Online Causal Kalman Filtering for Stable and Effective Policy Optimization

Add code
Feb 11, 2026
Viaarxiv icon

Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems

Add code
Feb 09, 2026
Viaarxiv icon

Conditional Performance Guarantee for Large Reasoning Models

Add code
Jan 30, 2026
Viaarxiv icon

Harnessing Reasoning Trajectories for Hallucination Detection via Answer-agreement Representation Shaping

Add code
Jan 24, 2026
Viaarxiv icon

History Is Not Enough: An Adaptive Dataflow System for Financial Time-Series Synthesis

Add code
Jan 15, 2026
Viaarxiv icon

Bayesian Robust Financial Trading with Adversarial Synthetic Market Data

Add code
Jan 14, 2026
Viaarxiv icon

Failure-Aware RL: Reliable Offline-to-Online Reinforcement Learning with Self-Recovery for Real-World Manipulation

Add code
Jan 12, 2026
Viaarxiv icon

AgentOCR: Reimagining Agent History via Optical Self-Compression

Add code
Jan 08, 2026
Viaarxiv icon