Picture for Fangtao Shao

Fangtao Shao

Fine-grained Text-Video Retrieval with Frozen Image Encoders

Add code
Jul 14, 2023
Figure 1 for Fine-grained Text-Video Retrieval with Frozen Image Encoders
Figure 2 for Fine-grained Text-Video Retrieval with Frozen Image Encoders
Figure 3 for Fine-grained Text-Video Retrieval with Frozen Image Encoders
Figure 4 for Fine-grained Text-Video Retrieval with Frozen Image Encoders
Viaarxiv icon

Towards Robust Video Instance Segmentation with Temporal-Aware Transformer

Add code
Jan 20, 2023
Viaarxiv icon

Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision

Add code
Jul 13, 2020
Figure 1 for Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision
Figure 2 for Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision
Figure 3 for Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision
Figure 4 for Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision
Viaarxiv icon