Yuan Zhou

VTC-Bench: Evaluating Agentic Multimodal Models via Compositional Visual Tool Chaining

Mar 16, 2026
Via arXiv

Euphonium: Steering Video Flow Matching via Process Reward Gradient Guided Stochastic Dynamics

Feb 04, 2026
Via arXiv

USS-Nav: Unified Spatio-Semantic Scene Graph for Lightweight UAV Zero-Shot Object Navigation

Feb 03, 2026
Via arXiv

Making Avatars Interact: Towards Text-Driven Human-Object Interaction for Controllable Talking Avatars

Feb 02, 2026
Via arXiv

Reducing Class-Wise Performance Disparity via Margin Regularization

Jan 30, 2026
Via arXiv

Online Linear Programming with Replenishment

Jan 21, 2026
Via arXiv

StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars

Dec 26, 2025
Via arXiv

Detecting Non-Optimal Decisions of Embodied Agents via Diversity-Guided Metamorphic Testing

Dec 23, 2025
Via arXiv

ActAvatar: Temporally-Aware Precise Action Control for Talking Avatars

Dec 22, 2025
Via arXiv

SoliReward: Mitigating Susceptibility to Reward Hacking and Annotation Noise in Video Generation Reward Models

Dec 17, 2025
Via arXiv