Arun Balajee Vasudevan

Planning with Adaptive World Models for Autonomous Driving

Jun 15, 2024
Via arXiv

The Un-Kidnappable Robot: Acoustic Localization of Sneaking People

Oct 05, 2023
Via arXiv

Sound and Visual Representation Learning with Multiple Pretraining Tasks

Jan 04, 2022
Via arXiv

Binaural SoundNet: Predicting Semantics, Depth and Motion with Binaural Sounds

Sep 06, 2021
Via arXiv

Semantic Object Prediction and Spatial Sound Super-Resolution with Binaural Sounds

Mar 09, 2020
Via arXiv

Talk2Nav: Long-Range Vision-and-Language Navigation in Cities

Oct 04, 2019
Via arXiv

Object Referring in Videos with Language and Human Gaze

Apr 04, 2018
Via arXiv

Object Referring in Visual Scene with Spoken Language

Dec 05, 2017
Via arXiv

Query-adaptive Video Summarization via Quality-aware Relevance Estimation

Sep 28, 2017
Via arXiv