Picture for Daquan Zhou

Daquan Zhou

Refer to the report for detailed contributions

HumanScale: Egocentric Human Video Can Outperform Real-Robot Data for Embodied Pretraining

Add code
Jun 18, 2026
Viaarxiv icon

Fast-dDrive: Efficient Block-Diffusion VLM for Autonomous Driving

Add code
May 25, 2026
Viaarxiv icon

SciCore-Mol: Augmenting Large Language Models with Pluggable Molecular Cognition Modules

Add code
May 21, 2026
Viaarxiv icon

StableVLA: Towards Robust Vision-Language-Action Models without Extra Data

Add code
May 18, 2026
Viaarxiv icon

Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generation

Add code
Apr 28, 2026
Viaarxiv icon

TS-Attn: Temporal-wise Separable Attention for Multi-Event Video Generation

Add code
Apr 21, 2026
Viaarxiv icon

Enhancing Spatial Understanding in Image Generation via Reward Modeling

Add code
Feb 27, 2026
Viaarxiv icon

Rethinking Video Generation Model for the Embodied World

Add code
Jan 21, 2026
Viaarxiv icon

MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

Add code
Jan 12, 2026
Viaarxiv icon

EgoReAct: Egocentric Video-Driven 3D Human Reaction Generation

Add code
Dec 28, 2025
Viaarxiv icon