Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Alejandro Mahillo

A Discrete Variational Derivation of Accelerated Methods in Optimization

Jun 04, 2021

Cédric M. Campos, Alejandro Mahillo, David Martín de Diego

Figure 1 for A Discrete Variational Derivation of Accelerated Methods in Optimization

Figure 2 for A Discrete Variational Derivation of Accelerated Methods in Optimization

Abstract:Many of the new developments in machine learning are connected with gradient-based optimization methods. Recently, these methods have been studied using a variational perspective. This has opened up the possibility of introducing variational and symplectic integration methods using geometric integrators. In particular, in this paper, we introduce variational integrators which allow us to derive different methods for optimization. Using both, Hamilton's principle and Lagrange-d'Alembert's, we derive two families of optimization methods in one-to-one correspondence that generalize Polyak's heavy ball and the well known Nesterov accelerated gradient method, mimicking the behavior of the latter which reduces the oscillations of typical momentum methods. However, since the systems considered are explicitly time-dependent, the preservation of symplecticity of autonomous systems occurs here solely on the fibers. Several experiments exemplify the result.

* 29 pages, 11 figures

Via

Access Paper or Ask Questions