Mainak Singha

Bi-modal Textual Prompt Learning for Vision-Language Models in Remote Sensing

Jan 28, 2026

MMLGNet: Cross-Modal Alignment of Remote Sensing Data using CLIP

Jan 13, 2026

SDHSI-Net: Learning Better Representations for Hyperspectral Images via Self-Distillation

Jan 12, 2026

Reconstruction Guided Few-shot Network For Remote Sensing Image Classification

Jan 12, 2026

Learning Under Laws: A Constraint-Projected Neural PDE Solver that Eliminates Hallucinations

Nov 05, 2025

FedMVP: Federated Multi-modal Visual Prompt Tuning for Vision-Language Models

Apr 29, 2025

OSLoPrompt: Bridging Low-Supervision Challenges and Open-Set Domain Generalization in CLIP

Mar 20, 2025

GraphVL: Graph-Enhanced Semantic Modeling via Vision-Language Models for Generalized Class Discovery

Nov 04, 2024

COSMo: CLIP Talks on Open-Set Multi-Target Domain Adaptation

Aug 31, 2024

Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning

Jul 05, 2024