Self-supervised learning (SSL) learns representations by leveraging an auxiliary unsupervised task, such as classifying semantically related samples, e.g. different data augmentations or modalities of the same underlying datum. Among the many approaches to SSL, contrastive methods such as SimCLR, CLIP and VICReg have gained attention for learning representations whose downstream performance approaches that of supervised learning. However, a theoretical understanding of the mechanism behind these methods remains elusive. We propose a generative latent variable model for the data and show that several families of discriminative self-supervised algorithms, including contrastive methods, approximately induce its latent structure over representations, providing a unifying theoretical framework. We also justify links to mutual information and the use of a projection head. Fitting our model generatively, as SimVE, improves performance over previous VAE-based methods on common benchmarks (e.g. FashionMNIST, CIFAR10, CelebA), narrows the gap to discriminative methods on _content_ classification and, as our analysis predicts, outperforms them where _style_ information is required, taking a step toward task-agnostic representations.
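To fix intuition for the "latent structure" referred to above, the following is a minimal sketch of one plausible hierarchical factorization, written under our own assumptions rather than as the paper's exact model: semantically related samples (e.g. augmentations of one image) share a semantic latent $y$, while each sample keeps an instance-specific latent $z_i$ carrying style.

```latex
% Sketch only (assumed factorization, not necessarily the exact model):
% x_{1:J} are J semantically related observations (e.g. augmentations),
% y is a shared semantic ("content") latent, z_i an instance ("style") latent.
\begin{equation*}
  p_\theta(x_{1:J}, z_{1:J}, y)
    \;=\; p(y)\,\prod_{i=1}^{J} p_\theta(z_i \mid y)\, p_\theta(x_i \mid z_i).
\end{equation*}
```

Under a structure of this kind, representations of related samples cluster around a shared $y$, which is the behaviour a discriminative SSL objective encourages by pulling related representations together; a generative (VAE-style) fit of the same structure would additionally retain the per-sample style information in $z_i$, consistent with the content/style results stated above.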