Picture for Beier Zhu

Beier Zhu

StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements

Add code
Dec 11, 2024
Viaarxiv icon

Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens

Add code
Nov 23, 2024
Figure 1 for Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens
Figure 2 for Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens
Figure 3 for Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens
Figure 4 for Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens
Viaarxiv icon

Robust Fine-tuning of Zero-shot Models via Variance Reduction

Add code
Nov 11, 2024
Figure 1 for Robust Fine-tuning of Zero-shot Models via Variance Reduction
Figure 2 for Robust Fine-tuning of Zero-shot Models via Variance Reduction
Figure 3 for Robust Fine-tuning of Zero-shot Models via Variance Reduction
Figure 4 for Robust Fine-tuning of Zero-shot Models via Variance Reduction
Viaarxiv icon

Enhancing Zero-Shot Vision Models by Label-Free Prompt Distribution Learning and Bias Correcting

Add code
Oct 25, 2024
Figure 1 for Enhancing Zero-Shot Vision Models by Label-Free Prompt Distribution Learning and Bias Correcting
Figure 2 for Enhancing Zero-Shot Vision Models by Label-Free Prompt Distribution Learning and Bias Correcting
Figure 3 for Enhancing Zero-Shot Vision Models by Label-Free Prompt Distribution Learning and Bias Correcting
Figure 4 for Enhancing Zero-Shot Vision Models by Label-Free Prompt Distribution Learning and Bias Correcting
Viaarxiv icon

Selective Vision-Language Subspace Projection for Few-shot CLIP

Add code
Jul 26, 2024
Figure 1 for Selective Vision-Language Subspace Projection for Few-shot CLIP
Figure 2 for Selective Vision-Language Subspace Projection for Few-shot CLIP
Figure 3 for Selective Vision-Language Subspace Projection for Few-shot CLIP
Figure 4 for Selective Vision-Language Subspace Projection for Few-shot CLIP
Viaarxiv icon

Classes Are Not Equal: An Empirical Study on Image Recognition Fairness

Add code
Mar 13, 2024
Viaarxiv icon

Generalized Logit Adjustment: Calibrating Fine-tuned Models by Removing Label Bias in Foundation Models

Add code
Oct 12, 2023
Viaarxiv icon

Debiased Fine-Tuning for Vision-language Models by Prompt Regularization

Add code
Jan 29, 2023
Viaarxiv icon

Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning

Add code
Dec 10, 2022
Viaarxiv icon

Prompt-aligned Gradient for Prompt Tuning

Add code
May 30, 2022
Figure 1 for Prompt-aligned Gradient for Prompt Tuning
Figure 2 for Prompt-aligned Gradient for Prompt Tuning
Figure 3 for Prompt-aligned Gradient for Prompt Tuning
Figure 4 for Prompt-aligned Gradient for Prompt Tuning
Viaarxiv icon