Picture for Rongwei Quan

Rongwei Quan

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Add code
May 14, 2024
Figure 1 for Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Figure 2 for Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Figure 3 for Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Figure 4 for Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Viaarxiv icon

Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation

Add code
Dec 09, 2022
Figure 1 for Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation
Figure 2 for Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation
Figure 3 for Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation
Figure 4 for Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation
Viaarxiv icon