Muhammad Uzair Khattak

UniMed-CLIP: Towards a Unified Image-Text Pretraining Paradigm for Diverse Medical Imaging Modalities

Dec 13, 2024
Via arXiv

XDT-CXR: Investigating Cross-Disease Transferability in Zero-Shot Binary Classification of Chest X-Rays

Aug 21, 2024
Via arXiv

How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs

May 08, 2024
Via arXiv

Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs

May 06, 2024
Via arXiv

Learning to Prompt with Text Only Supervision for Vision-Language Models

Jan 04, 2024
Via arXiv

Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization

Nov 02, 2023
Via arXiv

Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition

Jul 16, 2023
Via arXiv

Self-regulating Prompts: Foundational Model Adaptation without Forgetting

Jul 13, 2023
Via arXiv

Fine-tuned CLIP Models are Efficient Video Learners

Dec 06, 2022
Via arXiv

MaPLe: Multi-modal Prompt Learning

Oct 06, 2022
Via arXiv