Title: Random Feature Amplification: Feature Learning and Generalization in Neural Networks

Feb 15, 2022

Spencer Frei, Niladri S. Chatterji, Peter L. Bartlett

Abstract: In this work, we provide a characterization of the feature-learning process in two-layer ReLU networks trained by gradient descent on the logistic loss following random initialization. We consider data with binary labels that are generated by an XOR-like function of the input features. We permit a constant fraction of the training labels to be corrupted by an adversary. We show that, although linear classifiers are no better than random guessing for the distribution we consider, two-layer ReLU networks trained by gradient descent achieve generalization error close to the label noise rate, refuting the conjecture of Malach and Shalev-Shwartz that 'deeper is better only when shallow is good'. We develop a novel proof technique that shows that at initialization, the vast majority of neurons function as random features that are only weakly correlated with useful features, and the gradient descent dynamics 'amplify' these weak, random features to strong, useful features.
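The setting described in the abstract can be illustrated with a small numerical sketch. Everything specific below is an assumption made for illustration and is not taken from the paper: the XOR-like data are modeled as four Gaussian clusters at ±mu1 (label +1) and ±mu2 (label -1), the adversarial corruption is replaced by random label flips, the second layer is fixed at ±1/sqrt(width) while only the first layer is trained, the linear baseline is a bias-free least-squares fit, and the names `make_xor_data`, `train_two_layer_relu`, and `run_demo`, along with all hyperparameters, are hypothetical.

```python
import numpy as np

def make_xor_data(n, dim, noise_rate, rng, scale=4.0):
    # Assumed toy distribution: label +1 around +/-mu1, label -1 around +/-mu2.
    centers = np.zeros((4, dim))
    centers[0, 0], centers[1, 0] = scale, -scale   # +mu1, -mu1
    centers[2, 1], centers[3, 1] = scale, -scale   # +mu2, -mu2
    clean_labels = np.array([1.0, 1.0, -1.0, -1.0])
    idx = rng.integers(0, 4, size=n)
    x = centers[idx] + rng.normal(size=(n, dim))
    y = clean_labels[idx]                  # fancy indexing returns a copy
    flips = rng.random(n) < noise_rate     # random flips stand in for the adversary
    y[flips] *= -1.0
    return x, y

def network_output(x, w, a):
    # Two-layer ReLU network: f(x) = sum_j a_j * relu(w_j . x)
    return np.maximum(x @ w.T, 0.0) @ a

def train_two_layer_relu(x, y, rng, width=100, lr=0.5, steps=200, init_scale=0.01):
    n, dim = x.shape
    w = init_scale * rng.normal(size=(width, dim))   # small random initialization
    # Assumption: second layer fixed at +/- 1/sqrt(width); only w is trained.
    a = np.where(np.arange(width) % 2 == 0, 1.0, -1.0) / np.sqrt(width)
    for _ in range(steps):
        pre = x @ w.T
        out = np.maximum(pre, 0.0) @ a
        # d/df log(1 + exp(-y f)) = -y * sigmoid(-y f); the tanh form avoids overflow.
        dloss = -y * 0.5 * (1.0 - np.tanh(0.5 * y * out))
        grad_w = ((dloss[:, None] * (pre > 0.0)) * a).T @ x / n
        w -= lr * grad_w                              # full-batch gradient descent
    return w, a

def run_demo(seed=0, dim=10, noise_rate=0.1):
    rng = np.random.default_rng(seed)
    x_tr, y_tr = make_xor_data(1000, dim, noise_rate, rng)
    x_te, y_te = make_xor_data(2000, dim, 0.0, rng)   # clean test labels
    # Bias-free linear baseline fitted by least squares.
    v, *_ = np.linalg.lstsq(x_tr, y_tr, rcond=None)
    linear_acc = float(np.mean(np.sign(x_te @ v) == y_te))
    w, a = train_two_layer_relu(x_tr, y_tr, rng)
    net_acc = float(np.mean(np.sign(network_output(x_te, w, a)) == y_te))
    return linear_acc, net_acc
```

Calling `run_demo()` returns the clean-label test accuracy of the linear baseline and of the trained network. In this toy setup the clusters at +mu and -mu share a label, so any bias-free linear rule is at chance level by symmetry, while the trained first-layer weights are expected to grow along the ±mu1 and ±mu2 directions from their small random starting values.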

* 41 pages
