Picture for Yuechen Zhang

Yuechen Zhang

Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers

Add code
Jan 07, 2025
Figure 1 for Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers
Figure 2 for Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers
Figure 3 for Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers
Figure 4 for Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers
Viaarxiv icon

DreamOmni: Unified Image Generation and Editing

Add code
Dec 22, 2024
Viaarxiv icon

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Add code
Dec 12, 2024
Viaarxiv icon

ControlNeXt: Powerful and Efficient Control for Image and Video Generation

Add code
Aug 15, 2024
Viaarxiv icon

ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance

Add code
Jun 24, 2024
Figure 1 for ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance
Figure 2 for ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance
Figure 3 for ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance
Figure 4 for ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance
Viaarxiv icon

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

Add code
Mar 27, 2024
Figure 1 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 2 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 3 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 4 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Viaarxiv icon

Prompt Highlighter: Interactive Control for Multi-Modal LLMs

Add code
Dec 07, 2023
Figure 1 for Prompt Highlighter: Interactive Control for Multi-Modal LLMs
Figure 2 for Prompt Highlighter: Interactive Control for Multi-Modal LLMs
Figure 3 for Prompt Highlighter: Interactive Control for Multi-Modal LLMs
Figure 4 for Prompt Highlighter: Interactive Control for Multi-Modal LLMs
Viaarxiv icon

Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance

Add code
Jun 01, 2023
Figure 1 for Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Figure 2 for Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Figure 3 for Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Figure 4 for Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Viaarxiv icon

Real-World Image Variation by Aligning Diffusion Inversion Chain

Add code
May 30, 2023
Viaarxiv icon

Video-P2P: Video Editing with Cross-attention Control

Add code
Mar 08, 2023
Viaarxiv icon