Yingqing He

Large Motion Video Autoencoding with Cross-modal Video VAE

Dec 23, 2024

VideoDPO: Omni-Preference Alignment for Video Diffusion Generation

Dec 18, 2024

HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts

Sep 04, 2024

MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions

Jul 30, 2024

FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models

Jun 24, 2024

Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation

Jun 04, 2024

LLMs Meet Multimodal Generation and Editing: A Survey

May 29, 2024

Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts

Mar 13, 2024

Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners

Feb 27, 2024

Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation

Feb 16, 2024