Picture for Pengfei Wan

Pengfei Wan

GameFactory: Creating New Games with Generative Interactive Videos

Add code
Jan 14, 2025
Viaarxiv icon

ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning

Add code
Jan 08, 2025
Viaarxiv icon

Owl-1: Omni World Model for Consistent Long Video Generation

Add code
Dec 12, 2024
Viaarxiv icon

3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation

Add code
Dec 10, 2024
Viaarxiv icon

StyleMaster: Stylize Your Video with Artistic Generation and Translation

Add code
Dec 10, 2024
Viaarxiv icon

SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Add code
Dec 10, 2024
Viaarxiv icon

Towards Precise Scaling Laws for Video Diffusion Transformers

Add code
Nov 25, 2024
Viaarxiv icon

VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing

Add code
Nov 22, 2024
Viaarxiv icon

Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation

Add code
Nov 21, 2024
Figure 1 for Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation
Figure 2 for Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation
Figure 3 for Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation
Figure 4 for Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation
Viaarxiv icon

MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding

Add code
Oct 29, 2024
Figure 1 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Figure 2 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Figure 3 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Figure 4 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Viaarxiv icon