Ludan Ruan

Renmin University of China

UniVG: Towards UNIfied-modal Video Generation

Jan 17, 2024
Via arXiv

Accommodating Audio Modality in CLIP for Multimodal Processing

Mar 12, 2023
Via arXiv

TikTalk: A Multi-Modal Dialogue Dataset for Real-World Chitchat

Jan 14, 2023
Via arXiv

MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation

Dec 19, 2022
Via arXiv

Survey: Transformer based Video-Language Pre-training

Sep 21, 2021
Via arXiv

Team RUC_AIM3 Technical Report at ActivityNet 2021: Entities Object Localization

Jun 11, 2021
Via arXiv

YouMakeup VQA Challenge: Towards Fine-grained Action Understanding in Domain-Specific Videos

Apr 12, 2020
Via arXiv