Li Yuan

DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses

Nov 30, 2024

Open-Sora Plan: Open-Source Large Video Generation Model

Nov 28, 2024

Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Nov 26, 2024

WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model

Nov 26, 2024

LLaVA-CoT: Let Vision Language Models Reason Step-by-Step

Nov 25, 2024

Effort: Efficient Orthogonal Modeling for Generalizable AI-Generated Image Detection

Nov 23, 2024

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Nov 15, 2024

Sparse Orthogonal Parameters Tuning for Continual Learning

Nov 05, 2024

ETTFS: An Efficient Training Framework for Time-to-First-Spike Neuron

Oct 31, 2024

Spatial-Temporal Search for Spiking Neural Networks

Oct 24, 2024