Qinglin Lu

Searching Priors Makes Text-to-Video Synthesis Better

Jun 05, 2024
Via arXiv

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

May 14, 2024
Via arXiv

LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models

Mar 18, 2024
Via arXiv

DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation

Mar 13, 2024
Via arXiv

Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning

Nov 29, 2023
Via arXiv

Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation

Dec 09, 2022
Via arXiv

Multi-modal Segment Assemblage Network for Ad Video Editing with Importance-Coherence Reward

Sep 25, 2022
Via arXiv

Overview of Tencent Multi-modal Ads Video Understanding Challenge

Sep 16, 2021
Via arXiv