Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Patrik Gerber

Fisher information lower bounds for sampling

Oct 05, 2022

Sinho Chewi, Patrik Gerber, Holden Lee, Chen Lu

Abstract:We prove two lower bounds for the complexity of non-log-concave sampling within the framework of Balasubramanian et al. (2022), who introduced the use of Fisher information (FI) bounds as a notion of approximate first-order stationarity in sampling. Our first lower bound shows that averaged LMC is optimal for the regime of large FI by reducing the problem of finding stationary points in non-convex optimization to sampling. Our second lower bound shows that in the regime of small FI, obtaining a FI of at most $\varepsilon^2$ from the target distribution requires $\text{poly}(1/\varepsilon)$ queries, which is surprising as it rules out the existence of high-accuracy algorithms (e.g., algorithms using Metropolis-Hastings filters) in this context.

* 35 pages

Via

Access Paper or Ask Questions

Averaging on the Bures-Wasserstein manifold: dimension-free convergence of gradient descent

Jun 16, 2021

Jason M. Altschuler, Sinho Chewi, Patrik Gerber, Austin J. Stromme

Figure 1 for Averaging on the Bures-Wasserstein manifold: dimension-free convergence of gradient descent

Figure 2 for Averaging on the Bures-Wasserstein manifold: dimension-free convergence of gradient descent

Figure 3 for Averaging on the Bures-Wasserstein manifold: dimension-free convergence of gradient descent

Figure 4 for Averaging on the Bures-Wasserstein manifold: dimension-free convergence of gradient descent

Abstract:We study first-order optimization algorithms for computing the barycenter of Gaussian distributions with respect to the optimal transport metric. Although the objective is geodesically non-convex, Riemannian GD empirically converges rapidly, in fact faster than off-the-shelf methods such as Euclidean GD and SDP solvers. This stands in stark contrast to the best-known theoretical results for Riemannian GD, which depend exponentially on the dimension. In this work, we prove new geodesic convexity results which provide stronger control of the iterates, yielding a dimension-free convergence rate. Our techniques also enable the analysis of two related notions of averaging, the entropically-regularized barycenter and the geometric median, providing the first convergence guarantees for Riemannian GD for these problems.

* 48 pages, 8 figures

Via

Access Paper or Ask Questions

The query complexity of sampling from strongly log-concave distributions in one dimension

Jun 09, 2021

Sinho Chewi, Patrik Gerber, Chen Lu, Thibaut Le Gouic, Philippe Rigollet

Figure 1 for The query complexity of sampling from strongly log-concave distributions in one dimension

Figure 2 for The query complexity of sampling from strongly log-concave distributions in one dimension

Figure 3 for The query complexity of sampling from strongly log-concave distributions in one dimension

Figure 4 for The query complexity of sampling from strongly log-concave distributions in one dimension

Abstract:We establish the first tight lower bound of $\Omega(\log\log\kappa)$ on the query complexity of sampling from the class of strongly log-concave and log-smooth distributions with condition number $\kappa$ in one dimension. Whereas existing guarantees for MCMC-based algorithms scale polynomially in $\kappa$, we introduce a novel algorithm based on rejection sampling that closes this doubly exponential gap.

* 19 pages, 4 figures

Via

Access Paper or Ask Questions

Rejection sampling from shape-constrained distributions in sublinear time

May 29, 2021

Sinho Chewi, Patrik Gerber, Chen Lu, Thibaut Le Gouic, Philippe Rigollet

Figure 1 for Rejection sampling from shape-constrained distributions in sublinear time

Figure 2 for Rejection sampling from shape-constrained distributions in sublinear time

Figure 3 for Rejection sampling from shape-constrained distributions in sublinear time

Figure 4 for Rejection sampling from shape-constrained distributions in sublinear time

Abstract:We consider the task of generating exact samples from a target distribution, known up to normalization, over a finite alphabet. The classical algorithm for this task is rejection sampling, and although it has been used in practice for decades, there is surprisingly little study of its fundamental limitations. In this work, we study the query complexity of rejection sampling in a minimax framework for various classes of discrete distributions. Our results provide new algorithms for sampling whose complexity scales sublinearly with the alphabet size. When applied to adversarial bandits, we show that a slight modification of the Exp3 algorithm reduces the per-iteration complexity from $\mathcal O(K)$ to $\mathcal O(\log^2 K)$, where $K$ is the number of arms.

* 23 pages, 5 figures

Via

Access Paper or Ask Questions