Jianjie Luo

Semantic-Conditional Diffusion Networks for Image Captioning

Dec 06, 2022
Via arXiv

CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising

Dec 14, 2021
Via arXiv

Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training

Jul 05, 2020
Via arXiv