Picture for Pengfei Wan

Pengfei Wan

Owl-1: Omni World Model for Consistent Long Video Generation

Add code
Dec 12, 2024
Viaarxiv icon

StyleMaster: Stylize Your Video with Artistic Generation and Translation

Add code
Dec 10, 2024
Viaarxiv icon

SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Add code
Dec 10, 2024
Viaarxiv icon

3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation

Add code
Dec 10, 2024
Viaarxiv icon

Towards Precise Scaling Laws for Video Diffusion Transformers

Add code
Nov 25, 2024
Viaarxiv icon

VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing

Add code
Nov 22, 2024
Viaarxiv icon

Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation

Add code
Nov 21, 2024
Viaarxiv icon

MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding

Add code
Oct 29, 2024
Figure 1 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Figure 2 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Figure 3 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Figure 4 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Viaarxiv icon

Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content

Add code
Oct 10, 2024
Viaarxiv icon

Towards Unified 3D Hair Reconstruction from Single-View Portraits

Add code
Sep 25, 2024
Figure 1 for Towards Unified 3D Hair Reconstruction from Single-View Portraits
Figure 2 for Towards Unified 3D Hair Reconstruction from Single-View Portraits
Figure 3 for Towards Unified 3D Hair Reconstruction from Single-View Portraits
Figure 4 for Towards Unified 3D Hair Reconstruction from Single-View Portraits
Viaarxiv icon