Bin Zhu

DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses

Nov 30, 2024

Open-Sora Plan: Open-Source Large Video Generation Model

Nov 28, 2024

Visual Cue Enhancement and Dual Low-Rank Adaptation for Efficient Visual Instruction Fine-Tuning

Nov 19, 2024

$\ell_0$ factor analysis

Nov 13, 2024

Retrieval Augmented Recipe Generation

Nov 13, 2024

IterSelectTune: An Iterative Training Framework for Efficient Instruction-Tuning Data Selection

Oct 17, 2024

Line Spectral Analysis Using the G-Filter: An Atomic Norm Minimization Approach

Oct 16, 2024

When atomic norm meets the G-filter: A general framework for line spectral estimation

Oct 16, 2024

Hand1000: Generating Realistic Hands from Text with Only 1,000 Images

Sep 04, 2024

OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model

Sep 02, 2024