Picture for Bin Lin

Bin Lin

DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses

Add code
Nov 30, 2024
Viaarxiv icon

Open-Sora Plan: Open-Source Large Video Generation Model

Add code
Nov 28, 2024
Figure 1 for Open-Sora Plan: Open-Source Large Video Generation Model
Figure 2 for Open-Sora Plan: Open-Source Large Video Generation Model
Figure 3 for Open-Sora Plan: Open-Source Large Video Generation Model
Figure 4 for Open-Sora Plan: Open-Source Large Video Generation Model
Viaarxiv icon

WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model

Add code
Nov 26, 2024
Viaarxiv icon

Takin-ADA: Emotion Controllable Audio-Driven Animation with Canonical and Landmark Loss Optimization

Add code
Oct 18, 2024
Viaarxiv icon

Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models

Add code
Sep 18, 2024
Viaarxiv icon

OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model

Add code
Sep 02, 2024
Viaarxiv icon

Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle

Add code
Jul 28, 2024
Viaarxiv icon

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Add code
Jun 06, 2024
Figure 1 for ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
Figure 2 for ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
Figure 3 for ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
Figure 4 for ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
Viaarxiv icon

UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark

Add code
Apr 15, 2024
Figure 1 for UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark
Figure 2 for UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark
Figure 3 for UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark
Figure 4 for UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark
Viaarxiv icon

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Add code
Apr 07, 2024
Viaarxiv icon