Picture for Zerun Feng

Zerun Feng

Disentangle and denoise: Tackling context misalignment for video moment retrieval

Add code
Aug 14, 2024
Viaarxiv icon

ProTA: Probabilistic Token Aggregation for Text-Video Retrieval

Add code
Apr 18, 2024
Viaarxiv icon

Integrating Listwise Ranking into Pairwise-based Image-Text Retrieval

Add code
May 26, 2023
Figure 1 for Integrating Listwise Ranking into Pairwise-based Image-Text Retrieval
Figure 2 for Integrating Listwise Ranking into Pairwise-based Image-Text Retrieval
Figure 3 for Integrating Listwise Ranking into Pairwise-based Image-Text Retrieval
Figure 4 for Integrating Listwise Ranking into Pairwise-based Image-Text Retrieval
Viaarxiv icon

Selectively Hard Negative Mining for Alleviating Gradient Vanishing in Image-Text Matching

Add code
Mar 01, 2023
Viaarxiv icon

Image-Text Retrieval with Binary and Continuous Label Supervision

Add code
Oct 20, 2022
Figure 1 for Image-Text Retrieval with Binary and Continuous Label Supervision
Figure 2 for Image-Text Retrieval with Binary and Continuous Label Supervision
Figure 3 for Image-Text Retrieval with Binary and Continuous Label Supervision
Figure 4 for Image-Text Retrieval with Binary and Continuous Label Supervision
Viaarxiv icon

Unified Loss of Pair Similarity Optimization for Vision-Language Retrieval

Add code
Sep 28, 2022
Figure 1 for Unified Loss of Pair Similarity Optimization for Vision-Language Retrieval
Figure 2 for Unified Loss of Pair Similarity Optimization for Vision-Language Retrieval
Figure 3 for Unified Loss of Pair Similarity Optimization for Vision-Language Retrieval
Figure 4 for Unified Loss of Pair Similarity Optimization for Vision-Language Retrieval
Viaarxiv icon

Exploiting Visual Semantic Reasoning for Video-Text Retrieval

Add code
Jun 16, 2020
Figure 1 for Exploiting Visual Semantic Reasoning for Video-Text Retrieval
Figure 2 for Exploiting Visual Semantic Reasoning for Video-Text Retrieval
Figure 3 for Exploiting Visual Semantic Reasoning for Video-Text Retrieval
Figure 4 for Exploiting Visual Semantic Reasoning for Video-Text Retrieval
Viaarxiv icon