Label Smoothing (LS) is widely adopted to curb overconfidence in neural network predictions and enhance generalization. However, previous research shows that LS can force feature representations into excessively tight clusters, eroding intra-class distinctions. More recent findings suggest that LS also induces overconfidence in misclassifications, yet the precise mechanism has remained unclear. In this work, we decompose the loss term introduced by LS, revealing two key components: (i) a regularization term that functions only when the prediction is correct, and (ii) an error-enhancement term that emerges under misclassifications. The latter term compels the model to reinforce incorrect predictions with exaggerated certainty, further collapsing the feature space. To address these issues, we propose Max Suppression (MaxSup), which uniformly applies the intended regularization to both correct and incorrect predictions by penalizing the top-1 logit instead of the ground-truth logit. Through feature analyses, we show that MaxSup restores intra-class variation and sharpens inter-class boundaries. Extensive experiments on image classification and downstream tasks confirm that MaxSup is a more robust alternative to LS. Code is available at: https://github.com/ZhouYuxuanYX/Maximum-Suppression-Regularization.
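To make the idea of penalizing the top-1 logit concrete, below is a minimal, hypothetical PyTorch sketch, assuming the regularizer takes the form α(z_max − z̄) with z̄ the mean logit and α an illustrative weight; the function name `maxsup_loss` and default `alpha` are assumptions, and the exact formulation and official implementation are in the repository above.

```python
# Hypothetical sketch (not the official implementation): cross-entropy plus a
# MaxSup-style penalty on the top-1 logit, assumed here to be alpha * (z_max - z_mean).
import torch
import torch.nn.functional as F

def maxsup_loss(logits: torch.Tensor, targets: torch.Tensor, alpha: float = 0.1) -> torch.Tensor:
    """Cross-entropy with a penalty on the largest logit rather than the ground-truth logit (as LS would)."""
    ce = F.cross_entropy(logits, targets)
    top1 = logits.max(dim=-1).values               # top-1 logit, penalized whether the prediction is right or wrong
    penalty = (top1 - logits.mean(dim=-1)).mean()  # shift-invariant gap between the top-1 and mean logit
    return ce + alpha * penalty

# Usage on a dummy batch: 8 samples, 100 classes
logits = torch.randn(8, 100, requires_grad=True)
targets = torch.randint(0, 100, (8,))
maxsup_loss(logits, targets).backward()
```

Because the penalty always targets whichever logit is largest, it coincides with the intended regularization when the prediction is correct, while avoiding the error-enhancement behavior that arises when the ground-truth logit is not the top-1.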