Picture for Tao Yu

Tao Yu

STELLAR: Spatio-Temporal Environmental Learning with Latent Alignment and Refinement for Long-Tailed Species Distribution Modeling

Add code
Jun 07, 2026
Viaarxiv icon

When Seeing Is Not Believing -- A Benchmark for Search-Grounded Video Misinformation Detection

Add code
Jun 02, 2026
Viaarxiv icon

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Add code
May 28, 2026
Viaarxiv icon

FineVLA: Fine-Grained Instruction Alignment for Steerable Vision-Language-Action Policies

Add code
May 26, 2026
Viaarxiv icon

CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents

Add code
May 25, 2026
Viaarxiv icon

Any2Any: Efficient Cross-Embodiment Transfer for Humanoid Whole-Body Tracking

Add code
May 22, 2026
Viaarxiv icon

BioHuman: Learning Biomechanical Human Representations from Video

Add code
May 14, 2026
Viaarxiv icon

Uno-Orchestra: Parsimonious Agent Routing via Selective Delegation

Add code
May 06, 2026
Viaarxiv icon

Realizing Immersive Volumetric Video: A Multimodal Framework for 6-DoF VR Engagement

Add code
Apr 10, 2026
Viaarxiv icon

DirectFisheye-GS: Enabling Native Fisheye Input in Gaussian Splatting with Cross-View Joint Optimization

Add code
Apr 01, 2026
Viaarxiv icon