Medhini Narasimhan

Modular Visual Question Answering via Code Generation

Jun 08, 2023

Learning and Verification of Task Structure in Instructional Videos

Mar 23, 2023

TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency

Aug 14, 2022

Multi-Person 3D Motion Prediction with Multi-Range Transformers

Nov 23, 2021

CLIP-It! Language-Guided Video Summarization

Jul 01, 2021

Strumming to the Beat: Audio-Conditioned Contrastive Video Textures

Apr 06, 2021

Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation

Jul 20, 2020

Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering

Nov 01, 2018

Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering

Sep 04, 2018