Picture for Yake Wei

Yake Wei

Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception

Add code
Apr 09, 2025
Viaarxiv icon

Adaptive Unimodal Regulation for Balanced Multimodal Information Acquisition

Add code
Mar 24, 2025
Viaarxiv icon

Enhancing Modality Representation and Alignment for Multimodal Cold-start Active Learning

Add code
Dec 12, 2024
Viaarxiv icon

On-the-fly Modulation for Balanced Multimodal Learning

Add code
Oct 15, 2024
Viaarxiv icon

Diagnosing and Re-learning for Balanced Multimodal Learning

Add code
Jul 12, 2024
Viaarxiv icon

MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance

Add code
May 28, 2024
Figure 1 for MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance
Figure 2 for MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance
Figure 3 for MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance
Figure 4 for MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance
Viaarxiv icon

Multimodal Fusion on Low-quality Data: A Comprehensive Survey

Add code
Apr 27, 2024
Figure 1 for Multimodal Fusion on Low-quality Data: A Comprehensive Survey
Figure 2 for Multimodal Fusion on Low-quality Data: A Comprehensive Survey
Figure 3 for Multimodal Fusion on Low-quality Data: A Comprehensive Survey
Figure 4 for Multimodal Fusion on Low-quality Data: A Comprehensive Survey
Viaarxiv icon

Quantifying and Enhancing Multi-modal Robustness with Modality Preference

Add code
Feb 09, 2024
Viaarxiv icon

Enhancing Multi-modal Cooperation via Fine-grained Modality Valuation

Add code
Sep 12, 2023
Viaarxiv icon

Learning in Audio-visual Context: A Review, Analysis, and New Perspective

Add code
Aug 20, 2022
Figure 1 for Learning in Audio-visual Context: A Review, Analysis, and New Perspective
Figure 2 for Learning in Audio-visual Context: A Review, Analysis, and New Perspective
Figure 3 for Learning in Audio-visual Context: A Review, Analysis, and New Perspective
Figure 4 for Learning in Audio-visual Context: A Review, Analysis, and New Perspective
Viaarxiv icon