Title: Inferring Latent Class Statistics from Text for Robust Visual Few-Shot Learning

Nov 24, 2023

Yassir Bendou, Vincent Gripon, Bastien Pasdeloup, Giulia Lioi, Lukas Mauch, Fabien Cardinaux, Ghouthi Boukli Hacene

Abstract: Foundation models like CLIP have proven effective for few-shot learning but show limited cross-domain robustness, especially when only a few labeled examples are available. Recent works add text as an extra modality to enhance the performance of these models. Most of these approaches treat text as an auxiliary modality without fully exploring its potential to elucidate the underlying distribution of each class's visual features. In this paper, we present a novel approach that leverages text-derived statistics to predict the mean and covariance of the visual feature distribution for each class. This predictive framework enriches the latent space, yielding more robust and generalizable few-shot learning models. We demonstrate the efficacy of incorporating both mean and covariance statistics in improving few-shot classification performance across various datasets. Our results show that text can be used to predict the mean and covariance of the class distribution, offering promising improvements in few-shot learning scenarios.
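To make the role of the two statistics concrete, here is a minimal sketch of how a per-class mean and covariance could drive classification once they are available. It is not the authors' implementation: the abstract does not describe how text is mapped to these statistics, so the toy values below simply stand in for text-predicted ones, and the function names (`gaussian_log_likelihood`, `classify`) and the ridge term `eps` are assumptions made for illustration. Each query feature is assigned to the class whose Gaussian gives it the highest log-likelihood.

```python
import numpy as np

def gaussian_log_likelihood(x, mean, cov, eps=1e-3):
    """Log-density of x under N(mean, cov + eps * I).

    The ridge term eps is an assumption added for numerical stability.
    """
    d = mean.shape[0]
    cov_reg = cov + eps * np.eye(d)
    diff = x - mean
    # Squared Mahalanobis distance, computed with a linear solve
    # rather than an explicit matrix inverse.
    maha = diff @ np.linalg.solve(cov_reg, diff)
    _, logdet = np.linalg.slogdet(cov_reg)
    return -0.5 * (maha + logdet + d * np.log(2.0 * np.pi))

def classify(x, class_means, class_covs):
    """Return the index of the class with the highest Gaussian log-likelihood."""
    scores = [
        gaussian_log_likelihood(x, m, c)
        for m, c in zip(class_means, class_covs)
    ]
    return int(np.argmax(scores))

# Toy statistics standing in for text-predicted ones (hypothetical values).
class_means = [np.array([0.0, 0.0]), np.array([3.0, 3.0])]
class_covs = [np.eye(2), np.array([[2.0, 0.5], [0.5, 1.0]])]

query = np.array([2.5, 2.8])
predicted = classify(query, class_means, class_covs)
```

Using the covariance as well as the mean lets the decision rule account for the shape of each class's feature distribution rather than only its center, which is the distinction the abstract draws between mean-only and mean-plus-covariance statistics.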

* R0-FoMo: Workshop on Robustness of Few-shot and Zero-shot Learning in Foundation Models at NeurIPS 2023
