Aisha Urooj Khan

Knowledge-grounded Adaptation Strategy for Vision-language Models: Building Unique Case-set for Screening Mammograms for Residents Training

May 30, 2024

Learning Situation Hyper-Graphs for Video Question Answering

Apr 18, 2023

Weakly Supervised Grounding for VQA in Vision-Language Transformers

Jul 05, 2022

Found a Reason for me? Weakly-supervised Grounded Visual Question Answering using Capsules

May 11, 2021

MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual Question Answering

Oct 27, 2020

Analysis of Hand Segmentation in the Wild

Mar 28, 2018

Segmenting Sky Pixels in Images

Jan 08, 2018

Egocentric Height Estimation

Oct 09, 2016