Title: A self consistent theory of Gaussian Processes captures feature learning effects in finite CNNs

Jun 08, 2021

Gadi Naveh, Zohar Ringel

Abstract: Deep neural networks (DNNs) in the infinite width/channel limit have received much attention recently, as they provide a clear analytical window to deep learning via mappings to Gaussian Processes (GPs). Despite its theoretical appeal, this viewpoint lacks a crucial ingredient of deep learning in finite DNNs, one that lies at the heart of their success: feature learning. Here we consider DNNs trained with noisy gradient descent on a large training set and derive a self-consistent Gaussian Process theory accounting for strong finite-DNN and feature learning effects. Applying this to a toy model of a two-layer linear convolutional neural network (CNN) shows good agreement with experiments. We further identify, both analytically and numerically, a sharp transition between a feature learning regime and a lazy learning regime in this model. Strong finite-DNN effects are also derived for a non-linear two-layer fully connected network. Our self-consistent theory provides a rich and versatile analytical framework for studying feature learning and other non-lazy effects in finite DNNs.

* 9 pages of main text, 23 pages of appendices, 5 figures total

View paper on OpenReview
