Tamara L. Berg

Revealing Single Frame Bias for Video-and-Language Learning

Jun 07, 2022

End-to-End Visual Editing with a Generatively Pre-Trained Artist

May 03, 2022

LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval

Mar 10, 2022

CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval

Feb 15, 2022

MTVR: Multilingual Moment Retrieval in Videos

Jul 30, 2021

QVHighlights: Detecting Moments and Highlights in Videos via Natural Language Queries

Jul 20, 2021

Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling

Feb 11, 2021

What is More Likely to Happen Next? Video-and-Language Future Event Prediction

Oct 15, 2020

MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning

May 11, 2020

TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval

Jan 24, 2020