Picture for Kanishk Jain

Kanishk Jain

Benchmarking Vision Language Models for Cultural Understanding

Add code
Jul 15, 2024
Viaarxiv icon

Instance-Level Semantic Maps for Vision Language Navigation

Add code
May 23, 2023
Viaarxiv icon

Test-Time Amendment with a Coarse Classifier for Fine-Grained Classification

Add code
Feb 01, 2023
Viaarxiv icon

Ground then Navigate: Language-guided Navigation in Dynamic Scenes

Add code
Sep 24, 2022
Figure 1 for Ground then Navigate: Language-guided Navigation in Dynamic Scenes
Figure 2 for Ground then Navigate: Language-guided Navigation in Dynamic Scenes
Figure 3 for Ground then Navigate: Language-guided Navigation in Dynamic Scenes
Figure 4 for Ground then Navigate: Language-guided Navigation in Dynamic Scenes
Viaarxiv icon

Grounding Linguistic Commands to Navigable Regions

Add code
Dec 24, 2021
Figure 1 for Grounding Linguistic Commands to Navigable Regions
Figure 2 for Grounding Linguistic Commands to Navigable Regions
Figure 3 for Grounding Linguistic Commands to Navigable Regions
Figure 4 for Grounding Linguistic Commands to Navigable Regions
Viaarxiv icon

Comprehensive Multi-Modal Interactions for Referring Image Segmentation

Add code
Apr 21, 2021
Figure 1 for Comprehensive Multi-Modal Interactions for Referring Image Segmentation
Figure 2 for Comprehensive Multi-Modal Interactions for Referring Image Segmentation
Figure 3 for Comprehensive Multi-Modal Interactions for Referring Image Segmentation
Figure 4 for Comprehensive Multi-Modal Interactions for Referring Image Segmentation
Viaarxiv icon