Picture for Fahad Khan

Fahad Khan

CNR-ILC

BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities

Add code
Dec 10, 2024
Viaarxiv icon

All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

Add code
Nov 25, 2024
Figure 1 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 2 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 3 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 4 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Viaarxiv icon

AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model Alignment

Add code
Oct 02, 2024
Viaarxiv icon

VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

Add code
Jun 13, 2024
Figure 1 for VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Figure 2 for VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Figure 3 for VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Figure 4 for VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Viaarxiv icon

On the Design of Human-Robot Collaboration Gestures

Add code
Feb 29, 2024
Figure 1 for On the Design of Human-Robot Collaboration Gestures
Viaarxiv icon

MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation

Add code
Feb 27, 2024
Figure 1 for MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation
Figure 2 for MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation
Figure 3 for MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation
Figure 4 for MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation
Viaarxiv icon

Distilling Local Texture Features for Colorectal Tissue Classification in Low Data Regimes

Add code
Jan 02, 2024
Viaarxiv icon

VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning

Add code
Nov 25, 2023
Viaarxiv icon

PG-Video-LLaVA: Pixel Grounding Large Video-Language Models

Add code
Nov 22, 2023
Viaarxiv icon

Sentence-level Prompts Benefit Composed Image Retrieval

Add code
Oct 09, 2023
Figure 1 for Sentence-level Prompts Benefit Composed Image Retrieval
Figure 2 for Sentence-level Prompts Benefit Composed Image Retrieval
Figure 3 for Sentence-level Prompts Benefit Composed Image Retrieval
Figure 4 for Sentence-level Prompts Benefit Composed Image Retrieval
Viaarxiv icon