Pengfei Wan

CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation

Feb 12, 2025
Via arXiv

Improving Video Generation with Human Feedback

Jan 23, 2025
Via arXiv

GameFactory: Creating New Games with Generative Interactive Videos

Jan 14, 2025
Via arXiv

ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning

Jan 08, 2025
Via arXiv

Owl-1: Omni World Model for Consistent Long Video Generation

Dec 12, 2024
Via arXiv

StyleMaster: Stylize Your Video with Artistic Generation and Translation

Dec 10, 2024
Via arXiv

3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation

Dec 10, 2024
Via arXiv

SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Dec 10, 2024
Via arXiv

Towards Precise Scaling Laws for Video Diffusion Transformers

Nov 25, 2024
Via arXiv

VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing

Nov 22, 2024
Via arXiv