Picture for Massimiliano Mancini

Massimiliano Mancini

Linear Model Merging Unlocks Simple and Scalable Multimodal Data Mixture Optimization

Add code
Feb 04, 2026
Viaarxiv icon

Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study

Add code
Jul 28, 2025
Figure 1 for Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study
Figure 2 for Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study
Figure 3 for Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study
Figure 4 for Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study
Viaarxiv icon

Classifier-to-Bias: Toward Unsupervised Automatic Bias Detection for Visual Classifiers

Add code
Apr 29, 2025
Viaarxiv icon

On Large Multimodal Models as Open-World Image Classifiers

Add code
Mar 27, 2025
Figure 1 for On Large Multimodal Models as Open-World Image Classifiers
Figure 2 for On Large Multimodal Models as Open-World Image Classifiers
Figure 3 for On Large Multimodal Models as Open-World Image Classifiers
Figure 4 for On Large Multimodal Models as Open-World Image Classifiers
Viaarxiv icon

Training-Free Personalization via Retrieval and Reasoning on Fingerprints

Add code
Mar 24, 2025
Figure 1 for Training-Free Personalization via Retrieval and Reasoning on Fingerprints
Figure 2 for Training-Free Personalization via Retrieval and Reasoning on Fingerprints
Figure 3 for Training-Free Personalization via Retrieval and Reasoning on Fingerprints
Figure 4 for Training-Free Personalization via Retrieval and Reasoning on Fingerprints
Viaarxiv icon

Compositional Caching for Training-free Open-vocabulary Attribute Detection

Add code
Mar 24, 2025
Viaarxiv icon

Not Only Text: Exploring Compositionality of Visual Representations in Vision-Language Models

Add code
Mar 21, 2025
Viaarxiv icon

Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages

Add code
Mar 14, 2025
Figure 1 for Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages
Figure 2 for Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages
Figure 3 for Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages
Figure 4 for Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages
Viaarxiv icon

Safe Vision-Language Models via Unsafe Weights Manipulation

Add code
Mar 14, 2025
Figure 1 for Safe Vision-Language Models via Unsafe Weights Manipulation
Figure 2 for Safe Vision-Language Models via Unsafe Weights Manipulation
Figure 3 for Safe Vision-Language Models via Unsafe Weights Manipulation
Figure 4 for Safe Vision-Language Models via Unsafe Weights Manipulation
Viaarxiv icon

Group-robust Machine Unlearning

Add code
Mar 12, 2025
Figure 1 for Group-robust Machine Unlearning
Figure 2 for Group-robust Machine Unlearning
Figure 3 for Group-robust Machine Unlearning
Figure 4 for Group-robust Machine Unlearning
Viaarxiv icon