Fahad Khan

CNR-ILC

AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model Alignment

Oct 02, 2024
Via arXiv

VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

Jun 13, 2024
Via arXiv

On the Design of Human-Robot Collaboration Gestures

Feb 29, 2024
Via arXiv

MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation

Feb 27, 2024
Via arXiv

Distilling Local Texture Features for Colorectal Tissue Classification in Low Data Regimes

Jan 02, 2024
Via arXiv

VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning

Nov 25, 2023
Via arXiv

PG-Video-LLaVA: Pixel Grounding Large Video-Language Models

Nov 22, 2023
Via arXiv

Sentence-level Prompts Benefit Composed Image Retrieval

Oct 09, 2023
Via arXiv

3D Indoor Instance Segmentation in an Open-World

Sep 25, 2023
Via arXiv

Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment

Aug 24, 2023
Via arXiv