Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Daniel Dodd

Learning-Rate-Free Stochastic Optimization over Riemannian Manifolds

Jun 04, 2024

Daniel Dodd, Louis Sharrock, Christopher Nemeth

Figure 1 for Learning-Rate-Free Stochastic Optimization over Riemannian Manifolds

Figure 2 for Learning-Rate-Free Stochastic Optimization over Riemannian Manifolds

Figure 3 for Learning-Rate-Free Stochastic Optimization over Riemannian Manifolds

Figure 4 for Learning-Rate-Free Stochastic Optimization over Riemannian Manifolds

Abstract:In recent years, interest in gradient-based optimization over Riemannian manifolds has surged. However, a significant challenge lies in the reliance on hyperparameters, especially the learning rate, which requires meticulous tuning by practitioners to ensure convergence at a suitable rate. In this work, we introduce innovative learning-rate-free algorithms for stochastic optimization over Riemannian manifolds, eliminating the need for hand-tuning and providing a more robust and user-friendly approach. We establish high probability convergence guarantees that are optimal, up to logarithmic factors, compared to the best-known optimally tuned rate in the deterministic setting. Our approach is validated through numerical experiments, demonstrating competitive performance against learning-rate-dependent algorithms.

* ICML 2024

Via

Access Paper or Ask Questions

CoinEM: Tuning-Free Particle-Based Variational Inference for Latent Variable Models

May 24, 2023

Louis Sharrock, Daniel Dodd, Christopher Nemeth

Figure 1 for CoinEM: Tuning-Free Particle-Based Variational Inference for Latent Variable Models

Figure 2 for CoinEM: Tuning-Free Particle-Based Variational Inference for Latent Variable Models

Figure 3 for CoinEM: Tuning-Free Particle-Based Variational Inference for Latent Variable Models

Figure 4 for CoinEM: Tuning-Free Particle-Based Variational Inference for Latent Variable Models

Abstract:We introduce two new particle-based algorithms for learning latent variable models via marginal maximum likelihood estimation, including one which is entirely tuning-free. Our methods are based on the perspective of marginal maximum likelihood estimation as an optimization problem: namely, as the minimization of a free energy functional. One way to solve this problem is to consider the discretization of a gradient flow associated with the free energy. We study one such approach, which resembles an extension of the popular Stein variational gradient descent algorithm. In particular, we establish a descent lemma for this algorithm, which guarantees that the free energy decreases at each iteration. This method, and any other obtained as the discretization of the gradient flow, will necessarily depend on a learning rate which must be carefully tuned by the practitioner in order to ensure convergence at a suitable rate. With this in mind, we also propose another algorithm for optimizing the free energy which is entirely learning rate free, based on coin betting techniques from convex optimization. We validate the performance of our algorithms across a broad range of numerical experiments, including several high-dimensional settings. Our results are competitive with existing particle-based methods, without the need for any hyperparameter tuning.

Via

Access Paper or Ask Questions