Picture for Gunnar A. Sigurdsson

Gunnar A. Sigurdsson

Characterizing Video Question Answering with Sparsified Inputs

Add code
Nov 27, 2023
Viaarxiv icon

Decision Making for Human-in-the-loop Robotic Agents via Uncertainty-Aware Reinforcement Learning

Add code
Mar 14, 2023
Viaarxiv icon

RREx-BoT: Remote Referring Expressions with a Bag of Tricks

Add code
Jan 30, 2023
Viaarxiv icon

Video in 10 Bits: Few-Bit VideoQA for Efficiency and Privacy

Add code
Oct 18, 2022
Figure 1 for Video in 10 Bits: Few-Bit VideoQA for Efficiency and Privacy
Figure 2 for Video in 10 Bits: Few-Bit VideoQA for Efficiency and Privacy
Figure 3 for Video in 10 Bits: Few-Bit VideoQA for Efficiency and Privacy
Figure 4 for Video in 10 Bits: Few-Bit VideoQA for Efficiency and Privacy
Viaarxiv icon

Visual Grounding in Video for Unsupervised Word Translation

Add code
Mar 26, 2020
Figure 1 for Visual Grounding in Video for Unsupervised Word Translation
Figure 2 for Visual Grounding in Video for Unsupervised Word Translation
Figure 3 for Visual Grounding in Video for Unsupervised Word Translation
Figure 4 for Visual Grounding in Video for Unsupervised Word Translation
Viaarxiv icon

Beyond the Camera: Neural Networks in World Coordinates

Add code
Mar 12, 2020
Figure 1 for Beyond the Camera: Neural Networks in World Coordinates
Figure 2 for Beyond the Camera: Neural Networks in World Coordinates
Figure 3 for Beyond the Camera: Neural Networks in World Coordinates
Figure 4 for Beyond the Camera: Neural Networks in World Coordinates
Viaarxiv icon

Charades-Ego: A Large-Scale Dataset of Paired Third and First Person Videos

Add code
Apr 30, 2018
Figure 1 for Charades-Ego: A Large-Scale Dataset of Paired Third and First Person Videos
Figure 2 for Charades-Ego: A Large-Scale Dataset of Paired Third and First Person Videos
Figure 3 for Charades-Ego: A Large-Scale Dataset of Paired Third and First Person Videos
Figure 4 for Charades-Ego: A Large-Scale Dataset of Paired Third and First Person Videos
Viaarxiv icon

Actor and Observer: Joint Modeling of First and Third-Person Videos

Add code
Apr 25, 2018
Figure 1 for Actor and Observer: Joint Modeling of First and Third-Person Videos
Figure 2 for Actor and Observer: Joint Modeling of First and Third-Person Videos
Figure 3 for Actor and Observer: Joint Modeling of First and Third-Person Videos
Figure 4 for Actor and Observer: Joint Modeling of First and Third-Person Videos
Viaarxiv icon

What Actions are Needed for Understanding Human Actions in Videos?

Add code
Aug 09, 2017
Figure 1 for What Actions are Needed for Understanding Human Actions in Videos?
Figure 2 for What Actions are Needed for Understanding Human Actions in Videos?
Figure 3 for What Actions are Needed for Understanding Human Actions in Videos?
Figure 4 for What Actions are Needed for Understanding Human Actions in Videos?
Viaarxiv icon

Asynchronous Temporal Fields for Action Recognition

Add code
Jul 24, 2017
Figure 1 for Asynchronous Temporal Fields for Action Recognition
Figure 2 for Asynchronous Temporal Fields for Action Recognition
Figure 3 for Asynchronous Temporal Fields for Action Recognition
Figure 4 for Asynchronous Temporal Fields for Action Recognition
Viaarxiv icon