Chaolei Tan

SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses

Aug 07, 2024
Via arXiv

Ranking Distillation for Open-Ended Video Question Answering with Insufficient Labels

Mar 21, 2024
Via arXiv

Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph Grounding

Mar 18, 2024
Via arXiv

STVGFormer: Spatio-Temporal Video Grounding with Static-Dynamic Cross-Modal Understanding

Jul 06, 2022
Via arXiv

Augmented 2D-TAN: A Two-stage Approach for Human-centric Spatio-Temporal Video Grounding

Jun 20, 2021
Via arXiv