Picture for Yu Huang

Yu Huang

Cooperative Long Rope Skipping via Multi-Agent Reinforcement Learning

Add code
Jun 06, 2026
Viaarxiv icon

Agentic Transformers Provably Learn to Search via Reinforcement Learning

Add code
May 29, 2026
Viaarxiv icon

How Coding Agents Fail Their Users: A Large-Scale Analysis of Developer-Agent Misalignment in 20,574 Real-World Sessions

Add code
May 28, 2026
Viaarxiv icon

Focal Reward: Balanced Reinforcement Learning under Rubric-Based Rewards

Add code
May 26, 2026
Viaarxiv icon

OphMAE: Bridging Volumetric and Planar Imaging with a Foundation Model for Adaptive Ophthalmological Diagnosis

Add code
May 04, 2026
Viaarxiv icon

SAIL: Structure-Aware Interpretable Learning for Anatomy-Aligned Post-hoc Explanations in OCT

Add code
May 04, 2026
Viaarxiv icon

Cerebra: A Multidisciplinary AI Board for Multimodal Dementia Characterization and Risk Assessment

Add code
Mar 24, 2026
Viaarxiv icon

A Multidisciplinary AI Board for Multimodal Dementia Characterization and Risk Assessment

Add code
Mar 23, 2026
Viaarxiv icon

AndroTMem: From Interaction Trajectories to Anchored Memory in Long-Horizon GUI Agents

Add code
Mar 19, 2026
Viaarxiv icon

On the Learnability of Offline Model-Based Optimization: A Ranking Perspective

Add code
Mar 04, 2026
Viaarxiv icon