Asaf Maman

Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks

Jan 27, 2022

Noam Razin, Asaf Maman, Nadav Cohen

Abstract: In the pursuit of explaining implicit regularization in deep learning, prominent focus was given to matrix and tensor factorizations, which correspond to simplified neural networks. It was shown that these models exhibit implicit regularization towards low matrix and tensor ranks, respectively. Drawing closer to practical deep learning, the current paper theoretically analyzes the implicit regularization in hierarchical tensor factorization, a model equivalent to certain deep convolutional neural networks. Through a dynamical systems lens, we overcome challenges associated with hierarchy, and establish implicit regularization towards low hierarchical tensor rank. This translates to an implicit regularization towards locality for the associated convolutional networks. Inspired by our theory, we design explicit regularization discouraging locality, and demonstrate its ability to improve performance of modern convolutional networks on non-local tasks, in defiance of conventional wisdom by which architectural changes are needed. Our work highlights the potential of enhancing neural networks via theoretical analysis of their implicit regularization.

Implicit Regularization in Tensor Factorization

Feb 26, 2021

Noam Razin, Asaf Maman, Nadav Cohen

Abstract: Implicit regularization in deep learning is perceived as a tendency of gradient-based optimization to fit training data with predictors of minimal "complexity." The fact that only some types of data give rise to generalization is understood to result from them being especially amenable to fitting with low complexity predictors. A major challenge in formalizing this intuition is to define complexity measures that are quantitative yet capture the essence of data that admits generalization. With an eye towards this challenge, we provide the first analysis of implicit regularization in tensor factorization, equivalent to a certain non-linear neural network. We characterize the dynamics that gradient descent induces on the factorization, and establish a bias towards low tensor rank, in compliance with empirical evidence. Then, motivated by tensor rank capturing implicit regularization of a non-linear neural network, we empirically explore it as a measure of complexity, and find that it stays extremely low when fitting standard datasets. This leads us to believe that tensor rank may pave way to explaining both implicit regularization of neural networks, and the properties of real-world data translating this implicit regularization to generalization.
