Zuxuan Wu

DynamiCtrl: Rethinking the Basic Structure and the Role of Text for High-quality Human Image Animation

Mar 27, 2025
Via arXiv

CoMP: Continual Multimodal Pre-training for Vision Foundation Models

Mar 24, 2025
Via arXiv

MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance

Mar 20, 2025
Via arXiv

EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation

Mar 20, 2025
Via arXiv

BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers

Mar 20, 2025
Via arXiv

Hydra-MDP++: Advancing End-to-End Driving via Expert-Guided Hydra-Distillation

Mar 17, 2025
Via arXiv

Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop Training

Mar 15, 2025
Via arXiv

Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning

Mar 11, 2025
Via arXiv

Human2Robot: Learning Robot Actions from Paired Human-Robot Videos

Feb 23, 2025
Via arXiv

Pix2Cap-COCO: Advancing Visual Comprehension via Pixel-Level Captioning

Jan 23, 2025
Via arXiv