Picture for Daquan Zhou

Daquan Zhou

Refer to the report for detailed contributions

Real-time One-Step Diffusion-based Expressive Portrait Videos Generation

Add code
Dec 18, 2024
Viaarxiv icon

MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation

Add code
Dec 16, 2024
Viaarxiv icon

HunyuanVideo: A Systematic Framework For Large Video Generative Models

Add code
Dec 03, 2024
Viaarxiv icon

LVD-2M: A Long-take Video Dataset with Temporally Dense Captions

Add code
Oct 14, 2024
Viaarxiv icon

Loong: Generating Minute-level Long Videos with Autoregressive Language Models

Add code
Oct 03, 2024
Viaarxiv icon

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Add code
May 02, 2024
Viaarxiv icon

PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning

Add code
Apr 29, 2024
Viaarxiv icon

Chain of Thought Explanation for Dialogue State Tracking

Add code
Mar 09, 2024
Viaarxiv icon

Sora Generates Videos with Stunning Geometrical Consistency

Add code
Feb 27, 2024
Viaarxiv icon

Magic-Me: Identity-Specific Video Customized Diffusion

Add code
Feb 14, 2024
Viaarxiv icon