Picture for Yutong Feng

Yutong Feng

Mimir: Improving Video Diffusion Models for Precise Text Understanding

Add code
Dec 04, 2024
Figure 1 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Figure 2 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Figure 3 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Figure 4 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Viaarxiv icon

In-Context LoRA for Diffusion Transformers

Add code
Oct 31, 2024
Figure 1 for In-Context LoRA for Diffusion Transformers
Figure 2 for In-Context LoRA for Diffusion Transformers
Figure 3 for In-Context LoRA for Diffusion Transformers
Figure 4 for In-Context LoRA for Diffusion Transformers
Viaarxiv icon

Group Diffusion Transformers are Unsupervised Multitask Learners

Add code
Oct 19, 2024
Figure 1 for Group Diffusion Transformers are Unsupervised Multitask Learners
Viaarxiv icon

DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control

Add code
Oct 17, 2024
Figure 1 for DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control
Figure 2 for DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control
Figure 3 for DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control
Figure 4 for DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control
Viaarxiv icon

Zero-shot Image Editing with Reference Imitation

Add code
Jun 11, 2024
Viaarxiv icon

FlashFace: Human Image Personalization with High-fidelity Identity Preservation

Add code
Mar 25, 2024
Viaarxiv icon

Spatio-Temporal Field Neural Networks for Air Quality Inference

Add code
Mar 02, 2024
Viaarxiv icon

LivePhoto: Real Image Animation with Text-guided Motion Control

Add code
Dec 05, 2023
Viaarxiv icon

Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation

Add code
Nov 30, 2023
Figure 1 for Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation
Figure 2 for Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation
Figure 3 for Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation
Figure 4 for Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation
Viaarxiv icon

Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following

Add code
Nov 30, 2023
Figure 1 for Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
Figure 2 for Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
Figure 3 for Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
Figure 4 for Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
Viaarxiv icon