Title: Lazy vs hasty: linearization in deep networks impacts learning schedule based on example difficulty

Sep 19, 2022

Thomas George, Guillaume Lajoie, Aristide Baratin

Abstract: Among attempts at giving a theoretical account of the success of deep neural networks, a recent line of work has identified a so-called "lazy" regime in which the network can be well approximated by its linearization around initialization. Here we investigate the comparative effect of the lazy (linear) and feature learning (non-linear) regimes on subgroups of examples based on their difficulty. Specifically, we show that easier examples are given more weight in the feature learning regime, so that they are trained faster than more difficult ones. In other words, the non-linear dynamics tend to sequentialize the learning of examples of increasing difficulty. We illustrate this phenomenon across different ways of quantifying example difficulty, including c-score, label noise, and the presence of spurious correlations. Our results reveal a new understanding of how deep networks prioritize resources across example difficulty.
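
To make "linearization around initialization" concrete, here is a minimal sketch of a first-order Taylor expansion of a model in its parameters. It is not taken from the paper: the one-hidden-unit network `f`, the helper names `grad_f` and `f_lin`, and all numeric values are assumptions chosen for illustration only.

```python
import math

def f(theta, x):
    """Toy one-hidden-unit network (illustrative): theta = (w, a), output a * tanh(w * x)."""
    w, a = theta
    return a * math.tanh(w * x)

def grad_f(theta, x):
    """Analytic gradient of f with respect to the parameters (w, a)."""
    w, a = theta
    t = math.tanh(w * x)
    return (a * x * (1.0 - t * t), t)

def f_lin(theta, theta0, x):
    """First-order Taylor expansion of f around theta0:
    f(theta0, x) + grad_f(theta0, x) . (theta - theta0)."""
    g = grad_f(theta0, x)
    return f(theta0, x) + sum(gi * (ti - t0i) for gi, ti, t0i in zip(g, theta, theta0))

theta0 = (0.5, 1.0)       # hypothetical parameters at initialization
x = 2.0                   # hypothetical input
small_step = (0.51, 1.01) # parameters that stayed close to initialization
large_step = (1.5, 2.0)   # parameters that moved far from initialization

# Gap between the full model and its linearization in each case.
err_small = abs(f(small_step, x) - f_lin(small_step, theta0, x))
err_large = abs(f(large_step, x) - f_lin(large_step, theta0, x))
print(err_small, err_large)
```

In this toy setting the linearized model tracks the full model closely while the parameters stay near their initial values, which corresponds to the lazy regime described in the abstract, and the gap grows once they move far away, which is where non-linear (feature learning) effects matter.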

View paper on OpenReview