Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Relative gradient optimization of the Jacobian term in unsupervised deep learning

Jun 26, 2020

Luigi Gresele, Giancarlo Fissore, Adrián Javaloy, Bernhard Schölkopf, Aapo Hyvärinen

Figure 1 for Relative gradient optimization of the Jacobian term in unsupervised deep learning

Figure 2 for Relative gradient optimization of the Jacobian term in unsupervised deep learning

Figure 3 for Relative gradient optimization of the Jacobian term in unsupervised deep learning

Figure 4 for Relative gradient optimization of the Jacobian term in unsupervised deep learning

Share this with someone who'll enjoy it:

Abstract:Learning expressive probabilistic models correctly describing the data is a ubiquitous problem in machine learning. A popular approach for solving it is mapping the observations into a representation space with a simple joint distribution, which can typically be written as a product of its marginals -- thus drawing a connection with the field of nonlinear independent component analysis. Deep density models have been widely used for this task, but their likelihood-based training requires estimating the log-determinant of the Jacobian and is computationally expensive, thus imposing a trade-off between computation and expressive power. In this work, we propose a new approach for exact likelihood-based training of such neural networks. Based on relative gradients, we exploit the matrix structure of neural network parameters to compute updates efficiently even in high-dimensional spaces; the computational cost of the training is quadratic in the input size, in contrast with the cubic scaling of the naive approaches. This allows fast training with objective functions involving the log-determinant of the Jacobian without imposing constraints on its structure, in stark contrast to normalizing flows. An implementation of our method can be found at https://github.com/fissoreg/relative-gradient-jacobian

View paper on

Share this with someone who'll enjoy it:

Title:Relative gradient optimization of the Jacobian term in unsupervised deep learning

Paper and Code