Yanhong Zeng

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Dec 10, 2024

HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation

Jul 28, 2024

Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models

Jul 11, 2024

StyleShot: A Snapshot on Any Style

Jul 01, 2024

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds

Jul 01, 2024

Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language

Jun 28, 2024

MotionBooth: Motion-Aware Customized Text-to-Video Generation

Jun 25, 2024

Sagiri: Low Dynamic Range Image Enhancement with Generative Diffusion Prior

Jun 13, 2024

Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text

Mar 25, 2024

PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models

Dec 21, 2023