Picture for Bowen Yang

Bowen Yang

Charles

ELAN4D: Embodiment-Centric 4D Supervision for Vision-Language-Action Models via Plug-and-Play Adaptation

Add code
May 28, 2026
Viaarxiv icon

ComHymba: Low-Complexity Domain-Informed Foundation Model for Wireless Communications

Add code
May 22, 2026
Viaarxiv icon

GuidedVLA: Specifying Task-Relevant Factors via Plug-and-Play Action Attention Specialization

Add code
May 12, 2026
Viaarxiv icon

RePO-VLA: Recovery-Driven Policy Optimization for Vision-Language-Action Models

Add code
May 10, 2026
Viaarxiv icon

Voxtral TTS

Add code
Mar 26, 2026
Viaarxiv icon

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Add code
Mar 26, 2026
Viaarxiv icon

OS-Themis: A Scalable Critic Framework for Generalist GUI Rewards

Add code
Mar 19, 2026
Viaarxiv icon

AoE: Always-on Egocentric Human Video Collection for Embodied AI

Add code
Mar 02, 2026
Viaarxiv icon

A Training-Free Guess What Vision Language Model from Snippets to Open-Vocabulary Object Detection

Add code
Jan 21, 2026
Viaarxiv icon

OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent

Add code
Jan 12, 2026
Viaarxiv icon