Picture for Sajid Javed

Sajid Javed

MLLM-HWSI: A Multimodal Large Language Model for Hierarchical Whole Slide Image Understanding

Add code
Mar 25, 2026
Viaarxiv icon

AgriChat: A Multimodal Large Language Model for Agriculture Image Understanding

Add code
Mar 14, 2026
Viaarxiv icon

SPARROW: Learning Spatial Precision and Temporal Referential Consistency in Pixel-Grounded Video MLLMs

Add code
Mar 12, 2026
Viaarxiv icon

Rethinking Memory Design in SAM-Based Visual Object Tracking

Add code
Dec 27, 2025
Viaarxiv icon

Cytoplasmic Strings Analysis in Human Embryo Time-Lapse Videos using Deep Learning Framework

Add code
Dec 10, 2025
Viaarxiv icon

Spatio-Temporal State Space Model For Efficient Event-Based Optical Flow

Add code
Jun 09, 2025
Viaarxiv icon

CLDTracker: A Comprehensive Language Description for Visual Tracking

Add code
May 29, 2025
Viaarxiv icon

Multi-Resolution Pathology-Language Pre-training Model with Text-Guided Visual Representation

Add code
Apr 26, 2025
Figure 1 for Multi-Resolution Pathology-Language Pre-training Model with Text-Guided Visual Representation
Figure 2 for Multi-Resolution Pathology-Language Pre-training Model with Text-Guided Visual Representation
Figure 3 for Multi-Resolution Pathology-Language Pre-training Model with Text-Guided Visual Representation
Figure 4 for Multi-Resolution Pathology-Language Pre-training Model with Text-Guided Visual Representation
Viaarxiv icon

snnTrans-DHZ: A Lightweight Spiking Neural Network Architecture for Underwater Image Dehazing

Add code
Apr 13, 2025
Viaarxiv icon

Underwater Image Enhancement by Convolutional Spiking Neural Networks

Add code
Mar 26, 2025
Figure 1 for Underwater Image Enhancement by Convolutional Spiking Neural Networks
Figure 2 for Underwater Image Enhancement by Convolutional Spiking Neural Networks
Figure 3 for Underwater Image Enhancement by Convolutional Spiking Neural Networks
Figure 4 for Underwater Image Enhancement by Convolutional Spiking Neural Networks
Viaarxiv icon