Title: Stimulative Training of Residual Networks: A Social Psychology Perspective of Loafing

Oct 09, 2022

Peng Ye, Shengji Tang, Baopu Li, Tao Chen, Wanli Ouyang

Abstract: Residual networks have shown great success and have become indispensable in today's deep models. In this work, we re-investigate the training process of residual networks from a novel social psychology perspective of loafing, and propose a new training strategy to strengthen their performance. Since prior work has shown that residual networks can be viewed as ensembles of relatively shallow networks (the "unraveled view"), we start from this view and consider the final performance of a residual network to be co-determined by a group of sub-networks. Inspired by the social loafing problem in social psychology, we find that residual networks invariably suffer from a similar problem: sub-networks in a residual network are prone to exert less effort when working as part of a group than when working alone. We define this previously overlooked problem as "network loafing". Just as social loafing ultimately causes low individual productivity and reduced overall performance, network loafing hinders the performance of a given residual network and its sub-networks. Drawing on solutions from social psychology, we propose "stimulative training", which randomly samples a residual sub-network and calculates the KL-divergence loss between the sampled sub-network and the given residual network; this loss acts as extra supervision for the sub-networks and makes the overall goal consistent. Comprehensive empirical results and theoretical analyses verify that stimulative training handles the loafing problem well and improves the performance of a residual network by improving the performance of its sub-networks. The code is available at https://github.com/Sunshine-Ye/NIPS22-ST.
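To make the training signal described in the abstract concrete, the following is a minimal, framework-free sketch and not the authors' implementation (see the linked repository for that). It assumes a toy residual network whose blocks are simple functions, a sub-network obtained by skipping randomly chosen blocks, and a total loss of cross-entropy on the full network plus a weighted KL term between the full network's and the sub-network's output distributions. The names `make_block`, `sample_keep_mask`, `stimulative_loss`, the weight `lam`, the keep probability, and the KL direction are all illustrative assumptions.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def make_block(weight):
    """Hypothetical toy residual branch: an elementwise tanh(weight * x)."""
    return lambda x: [math.tanh(weight * xi) for xi in x]


def forward(x, blocks, keep_mask):
    """Residual forward pass; a skipped block leaves only its identity path."""
    for block, keep in zip(blocks, keep_mask):
        if keep:
            x = [xi + bi for xi, bi in zip(x, block(x))]
    return x


def sample_keep_mask(num_blocks, rng, keep_prob=0.5):
    """Randomly choose which blocks the sampled sub-network keeps (assumed scheme)."""
    return [rng.random() < keep_prob for _ in range(num_blocks)]


def stimulative_loss(x, label, blocks, keep_mask, lam=1.0):
    """Cross-entropy of the full network plus lam * KL(full || sub-network).

    Returns (total_loss, kl_term). The final activations are treated
    directly as logits to keep the sketch short.
    """
    full_probs = softmax(forward(x, blocks, [True] * len(blocks)))
    sub_probs = softmax(forward(x, blocks, keep_mask))
    ce = -math.log(full_probs[label])
    kl = kl_divergence(full_probs, sub_probs)
    return ce + lam * kl, kl


if __name__ == "__main__":
    rng = random.Random(0)
    blocks = [make_block(w) for w in (0.5, -0.3, 0.8)]
    mask = sample_keep_mask(len(blocks), rng)
    total, kl = stimulative_loss([1.0, -0.5, 0.2], 0, blocks, mask)
    print("keep mask:", mask, "total loss:", total, "KL term:", kl)
```

In this sketch the KL term vanishes when the sampled sub-network equals the full network and grows as their predictions diverge, which is the sense in which it can act as extra supervision pulling each sub-network toward the full network's output.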

* Accepted at NeurIPS 2022
