Picture for Shicong Wang

Shicong Wang

Repeating Words for Video-Language Retrieval with Coarse-to-Fine Objectives

Add code
Aug 20, 2025
Viaarxiv icon

AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding

Add code
Jun 11, 2024
Figure 1 for AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding
Figure 2 for AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding
Figure 3 for AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding
Figure 4 for AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding
Viaarxiv icon

Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation

Add code
Sep 07, 2023
Figure 1 for Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Figure 2 for Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Figure 3 for Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Figure 4 for Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Viaarxiv icon