In most applications of gradient-based optimization to complex problems, the choice of step size is based on trial-and-error and other heuristics. One case in which step sizes are easy to choose is when the function has a Lipschitz continuous gradient. Many functions of interest do not appear at first sight to have this property, but it can often be established with the right choice of underlying metric. We give a simple recipe for choosing step sizes when a function has a Lipschitz gradient with respect to any Finsler structure that satisfies an exponential bound. These step sizes are guaranteed to give convergence, but they may be conservative since they rely on this exponential bound. However, when relevant problem structure can be encoded in the metric to yield a significantly tighter bound while keeping optimization tractable, this can lead to rigorous and efficient algorithms. In particular, our general result can be applied to yield an optimization algorithm with non-asymptotic performance guarantees for batch optimization of multilayer neural networks.
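For concreteness, the standard Euclidean special case (a simpler setting than the Finsler structures considered here) illustrates why a Lipschitz gradient makes the step size easy to choose: if $\nabla f$ is $L$-Lipschitz, the descent lemma gives a quadratic upper bound on $f$, and the constant step size $1/L$ guarantees monotone decrease,
\[
f(y) \;\le\; f(x) + \langle \nabla f(x),\, y - x \rangle + \tfrac{L}{2}\,\|y - x\|^2 ,
\qquad
f\!\left(x - \tfrac{1}{L}\nabla f(x)\right) \;\le\; f(x) - \tfrac{1}{2L}\,\|\nabla f(x)\|^2 .
\]
The recipe developed in this work plays the analogous role when Lipschitz continuity of the gradient holds only with respect to a non-Euclidean (Finsler) metric.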