Picture for Niluthpol Chowdhury Mithun

Niluthpol Chowdhury Mithun

Unsupervised Domain Adaptation for Semantic Segmentation with Pseudo Label Self-Refinement

Add code
Oct 25, 2023
Viaarxiv icon

C-SFDA: A Curriculum Learning Aided Self-Training Framework for Efficient Source Free Domain Adaptation

Add code
Mar 30, 2023
Viaarxiv icon

Cross-View Visual Geo-Localization for Outdoor Augmented Reality

Add code
Mar 28, 2023
Figure 1 for Cross-View Visual Geo-Localization for Outdoor Augmented Reality
Figure 2 for Cross-View Visual Geo-Localization for Outdoor Augmented Reality
Figure 3 for Cross-View Visual Geo-Localization for Outdoor Augmented Reality
Figure 4 for Cross-View Visual Geo-Localization for Outdoor Augmented Reality
Viaarxiv icon

GraphMapper: Efficient Visual Navigation by Scene Graph Generation

Add code
May 17, 2022
Figure 1 for GraphMapper: Efficient Visual Navigation by Scene Graph Generation
Figure 2 for GraphMapper: Efficient Visual Navigation by Scene Graph Generation
Figure 3 for GraphMapper: Efficient Visual Navigation by Scene Graph Generation
Figure 4 for GraphMapper: Efficient Visual Navigation by Scene Graph Generation
Viaarxiv icon

SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments

Add code
Aug 26, 2021
Figure 1 for SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments
Figure 2 for SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments
Figure 3 for SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments
Figure 4 for SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments
Viaarxiv icon

RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization

Add code
Sep 12, 2020
Figure 1 for RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization
Figure 2 for RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization
Figure 3 for RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization
Figure 4 for RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization
Viaarxiv icon

Text-based Localization of Moments in a Video Corpus

Add code
Aug 20, 2020
Figure 1 for Text-based Localization of Moments in a Video Corpus
Figure 2 for Text-based Localization of Moments in a Video Corpus
Figure 3 for Text-based Localization of Moments in a Video Corpus
Figure 4 for Text-based Localization of Moments in a Video Corpus
Viaarxiv icon

Weakly Supervised Video Moment Retrieval From Text Queries

Add code
Apr 05, 2019
Figure 1 for Weakly Supervised Video Moment Retrieval From Text Queries
Figure 2 for Weakly Supervised Video Moment Retrieval From Text Queries
Figure 3 for Weakly Supervised Video Moment Retrieval From Text Queries
Figure 4 for Weakly Supervised Video Moment Retrieval From Text Queries
Viaarxiv icon

Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval

Add code
Aug 23, 2018
Figure 1 for Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval
Figure 2 for Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval
Figure 3 for Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval
Figure 4 for Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval
Viaarxiv icon

Diversity-aware Multi-Video Summarization

Add code
Jun 09, 2017
Figure 1 for Diversity-aware Multi-Video Summarization
Figure 2 for Diversity-aware Multi-Video Summarization
Figure 3 for Diversity-aware Multi-Video Summarization
Figure 4 for Diversity-aware Multi-Video Summarization
Viaarxiv icon