Vishal M. Patel

Senior Member, IEEE

SINR: Sparsity Driven Compressed Implicit Neural Representations

Mar 25, 2025

The Power of Context: How Multimodality Improves Image Super-Resolution

Mar 18, 2025

Lux Post Facto: Learning Portrait Performance Relighting with Conditional Video Diffusion and a Hybrid Dataset

Mar 18, 2025

Filter Images First, Generate Instructions Later: Pre-Instruction Data Selection for Visual Instruction Tuning

Mar 10, 2025

CSMAE: Cataract Surgical Masked Autoencoder (MAE) based Pre-training

Feb 12, 2025

Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models

Feb 11, 2025

FaceXBench: Evaluating Multimodal LLMs on Face Understanding

Jan 17, 2025

Distilling Multi-modal Large Language Models for Autonomous Driving

Jan 16, 2025

Zero-Shot Scene Understanding for Automatic Target Recognition Using Large Vision-Language Models

Jan 13, 2025

SegFace: Face Segmentation of Long-Tail Classes

Dec 11, 2024