Zilei Wang

R-TPT: Improving Adversarial Robustness of Vision-Language Models through Test-Time Prompt Tuning

Apr 15, 2025

The Mirage of Performance Gains: Why Contrastive Decoding Fails to Address Multimodal Hallucination

Apr 14, 2025

Lifting the Veil on Visual Information Flow in MLLMs: Unlocking Pathways to Faster Inference

Mar 17, 2025

ClearSight: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models

Mar 17, 2025

Towards Compatible Fine-tuning for Vision-Language Model Updates

Dec 30, 2024

Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment

Dec 26, 2024

DIVE: Subgraph Disagreement for Graph Out-of-Distribution Generalization

Aug 08, 2024

Connecting the Dots: Collaborative Fine-tuning for Black-Box Vision-Language Models

Feb 06, 2024

A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation

Feb 06, 2024

Not all Minorities are Equal: Empty-Class-Aware Distillation for Heterogeneous Federated Learning

Jan 04, 2024