Picture for Fahad Khan

Fahad Khan

CNR-ILC

BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities

Add code
Dec 10, 2024
Viaarxiv icon

All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

Add code
Nov 25, 2024
Figure 1 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 2 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 3 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 4 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Viaarxiv icon

AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model Alignment

Add code
Oct 02, 2024
Viaarxiv icon

VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

Add code
Jun 13, 2024
Viaarxiv icon

On the Design of Human-Robot Collaboration Gestures

Add code
Feb 29, 2024
Viaarxiv icon

MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation

Add code
Feb 27, 2024
Viaarxiv icon

Distilling Local Texture Features for Colorectal Tissue Classification in Low Data Regimes

Add code
Jan 02, 2024
Viaarxiv icon

VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning

Add code
Nov 25, 2023
Viaarxiv icon

PG-Video-LLaVA: Pixel Grounding Large Video-Language Models

Add code
Nov 22, 2023
Viaarxiv icon

Sentence-level Prompts Benefit Composed Image Retrieval

Add code
Oct 09, 2023
Viaarxiv icon