Picture for Marcus Rohrbach

Marcus Rohrbach

Efficient Pre-training for Localized Instruction Generation of Videos

Add code
Nov 27, 2023
Viaarxiv icon

Improving Selective Visual Question Answering by Learning from Your Peers

Add code
Jun 14, 2023
Figure 1 for Improving Selective Visual Question Answering by Learning from Your Peers
Figure 2 for Improving Selective Visual Question Answering by Learning from Your Peers
Figure 3 for Improving Selective Visual Question Answering by Learning from Your Peers
Figure 4 for Improving Selective Visual Question Answering by Learning from Your Peers
Viaarxiv icon

Simple Token-Level Confidence Improves Caption Correctness

Add code
May 11, 2023
Viaarxiv icon

Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition

Add code
Jun 09, 2022
Figure 1 for Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition
Figure 2 for Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition
Figure 3 for Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition
Figure 4 for Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition
Viaarxiv icon

Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly

Add code
Apr 28, 2022
Figure 1 for Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly
Figure 2 for Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly
Figure 3 for Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly
Figure 4 for Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly
Viaarxiv icon

Learning To Recognize Procedural Activities with Distant Supervision

Add code
Jan 26, 2022
Figure 1 for Learning To Recognize Procedural Activities with Distant Supervision
Figure 2 for Learning To Recognize Procedural Activities with Distant Supervision
Figure 3 for Learning To Recognize Procedural Activities with Distant Supervision
Figure 4 for Learning To Recognize Procedural Activities with Distant Supervision
Viaarxiv icon

FLAVA: A Foundational Language And Vision Alignment Model

Add code
Dec 08, 2021
Figure 1 for FLAVA: A Foundational Language And Vision Alignment Model
Figure 2 for FLAVA: A Foundational Language And Vision Alignment Model
Figure 3 for FLAVA: A Foundational Language And Vision Alignment Model
Figure 4 for FLAVA: A Foundational Language And Vision Alignment Model
Viaarxiv icon

A New Split for Evaluating True Zero-Shot Action Recognition

Add code
Jul 27, 2021
Figure 1 for A New Split for Evaluating True Zero-Shot Action Recognition
Figure 2 for A New Split for Evaluating True Zero-Shot Action Recognition
Figure 3 for A New Split for Evaluating True Zero-Shot Action Recognition
Figure 4 for A New Split for Evaluating True Zero-Shot Action Recognition
Viaarxiv icon

CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition

Add code
Jan 18, 2021
Figure 1 for CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition
Figure 2 for CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition
Figure 3 for CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition
Figure 4 for CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition
Viaarxiv icon

KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA

Add code
Dec 20, 2020
Figure 1 for KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA
Figure 2 for KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA
Figure 3 for KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA
Figure 4 for KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA
Viaarxiv icon