Title: Supervised-Contrastive Loss Learns Orthogonal Frames and Batching Matters

Jun 13, 2023

Ganesh Ramachandra Kini, Vala Vakilian, Tina Behnia, Jaidev Gill, Christos Thrampoulidis

Abstract: Supervised contrastive loss (SCL) is a competitive and often superior alternative to the cross-entropy (CE) loss for classification. In this paper we ask: what differences in the learning process occur when the two loss functions are optimized? Our main finding is that the geometry of embeddings learned by SCL forms an orthogonal frame (OF) regardless of the number of training examples per class. This is in contrast to the CE loss, which previous work has shown to learn embedding geometries that depend strongly on the class sizes. We arrive at our finding theoretically, by proving that the global minimizers of an unconstrained features model with the SCL loss and entry-wise non-negativity constraints form an OF. We then validate the model's prediction by conducting experiments with standard deep-learning models on benchmark vision datasets. Finally, our analysis and experiments reveal that the batching scheme chosen during SCL training plays a critical role in determining the quality of convergence to the OF geometry. This finding motivates a simple algorithm in which adding a few binding examples to each batch significantly speeds up the emergence of the OF geometry.
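
To make the two objects in the abstract concrete, here is a minimal NumPy sketch, not the authors' code. It assumes the widely used form of the supervised contrastive loss (each anchor's positives are the other same-class samples in the batch, with temperature `tau`) and one plausible way to measure distance from an orthogonal frame: the Frobenius distance between the Gram matrix of normalized class-mean embeddings and the identity. The function names, the temperature value, and the deviation metric are illustrative assumptions, not taken from the paper.

```python
import numpy as np


def supcon_loss(z, y, tau=0.1):
    """Supervised contrastive loss over one batch (illustrative sketch).

    z: (n, d) array of embeddings, y: (n,) array of integer labels.
    Anchors with no same-class partner in the batch are skipped.
    """
    z = z / np.linalg.norm(z, axis=1, keepdims=True)  # unit-normalize rows
    sim = z @ z.T / tau                               # scaled cosine similarities
    n = len(y)
    not_self = ~np.eye(n, dtype=bool)

    # Stable log-sum-exp over all samples other than the anchor itself.
    masked = np.where(not_self, sim, -np.inf)
    m = masked.max(axis=1, keepdims=True)
    log_denom = m + np.log(np.exp(masked - m).sum(axis=1, keepdims=True))
    log_prob = sim - log_denom

    # Positives: same label, excluding the anchor.
    pos = (y[:, None] == y[None, :]) & not_self
    n_pos = pos.sum(axis=1)
    valid = n_pos > 0
    pos_sum = np.where(pos, log_prob, 0.0).sum(axis=1)
    return float(-(pos_sum[valid] / n_pos[valid]).mean())


def of_deviation(z, y):
    """Frobenius distance between the class-mean Gram matrix and the identity.

    Zero means the normalized class means are mutually orthogonal.
    """
    classes = np.unique(y)
    means = np.stack([z[y == c].mean(axis=0) for c in classes])
    means = means / np.linalg.norm(means, axis=1, keepdims=True)
    gram = means @ means.T
    return float(np.linalg.norm(gram - np.eye(len(classes))))


# Imbalanced toy batch: class sizes 4, 2, 2, each class mapped to its own
# (non-negative) standard basis vector, i.e. an exact orthogonal frame.
y = np.array([0, 0, 0, 0, 1, 1, 2, 2])
z_of = np.eye(3)[y]
# Degenerate alternative: every sample collapses to the same vector.
z_collapsed = np.ones((len(y), 3))

print("OF deviation (frame):    ", of_deviation(z_of, y))
print("OF deviation (collapsed):", of_deviation(z_collapsed, y))
print("SCL (frame):    ", supcon_loss(z_of, y))
print("SCL (collapsed):", supcon_loss(z_collapsed, y))
```

In this toy batch the class sizes are deliberately unequal, and the orthogonal-frame embeddings still give a lower loss than the collapsed ones, which is consistent with (though of course no proof of) the claim that the OF geometry does not depend on class sizes.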