Amanmeet Garg

Augment the Pairs: Semantics-Preserving Image-Caption Pair Augmentation for Grounding-Based Vision and Language Models

Nov 05, 2023
Via arXiv

Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment

Jul 24, 2023
Via arXiv

PodSumm -- Podcast Audio Summarization

Sep 22, 2020
Via arXiv