Yingya Zhang

EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation

Nov 13, 2024
Via arXiv

DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control

Oct 17, 2024
Via arXiv

EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models

Oct 10, 2024
Via arXiv

FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing

Sep 30, 2024
Via arXiv

S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High Fidelity Talking Head Synthesis

Aug 18, 2024
Via arXiv

UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation

Jun 03, 2024
Via arXiv

A Recipe for Scaling up Text-to-Video Generation with Text-free Videos

Dec 25, 2023
Via arXiv

InstructVideo: Instructing Video Diffusion Models with Human Feedback

Dec 19, 2023
Via arXiv

AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis

Dec 18, 2023
Via arXiv

DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Dec 15, 2023
Via arXiv