Picture for Yuechen Zhang

Yuechen Zhang

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Add code
Dec 12, 2024
Viaarxiv icon

ControlNeXt: Powerful and Efficient Control for Image and Video Generation

Add code
Aug 15, 2024
Viaarxiv icon

ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance

Add code
Jun 24, 2024
Viaarxiv icon

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

Add code
Mar 27, 2024
Figure 1 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 2 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 3 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 4 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Viaarxiv icon

Prompt Highlighter: Interactive Control for Multi-Modal LLMs

Add code
Dec 07, 2023
Figure 1 for Prompt Highlighter: Interactive Control for Multi-Modal LLMs
Figure 2 for Prompt Highlighter: Interactive Control for Multi-Modal LLMs
Figure 3 for Prompt Highlighter: Interactive Control for Multi-Modal LLMs
Figure 4 for Prompt Highlighter: Interactive Control for Multi-Modal LLMs
Viaarxiv icon

Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance

Add code
Jun 01, 2023
Figure 1 for Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Figure 2 for Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Figure 3 for Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Figure 4 for Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Viaarxiv icon

Real-World Image Variation by Aligning Diffusion Inversion Chain

Add code
May 30, 2023
Viaarxiv icon

Video-P2P: Video Editing with Cross-attention Control

Add code
Mar 08, 2023
Viaarxiv icon

CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior

Add code
Jan 06, 2023
Viaarxiv icon

Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields

Add code
Dec 06, 2022
Figure 1 for Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields
Figure 2 for Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields
Figure 3 for Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields
Figure 4 for Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields
Viaarxiv icon