Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Gonzalo Mena

Sinkhorn EM: An Expectation-Maximization algorithm based on entropic optimal transport

Jun 30, 2020

Gonzalo Mena, Amin Nejatbakhsh, Erdem Varol, Jonathan Niles-Weed

Figure 1 for Sinkhorn EM: An Expectation-Maximization algorithm based on entropic optimal transport

Figure 2 for Sinkhorn EM: An Expectation-Maximization algorithm based on entropic optimal transport

Figure 3 for Sinkhorn EM: An Expectation-Maximization algorithm based on entropic optimal transport

Figure 4 for Sinkhorn EM: An Expectation-Maximization algorithm based on entropic optimal transport

Abstract:We study Sinkhorn EM (sEM), a variant of the expectation maximization (EM) algorithm for mixtures based on entropic optimal transport. sEM differs from the classic EM algorithm in the way responsibilities are computed during the expectation step: rather than assign data points to clusters independently, sEM uses optimal transport to compute responsibilities by incorporating prior information about mixing weights. Like EM, sEM has a natural interpretation as a coordinate ascent procedure, which iteratively constructs and optimizes a lower bound on the log-likelihood. However, we show theoretically and empirically that sEM has better behavior than EM: it possesses better global convergence guarantees and is less prone to getting stuck in bad local optima. We complement these findings with experiments on simulated data as well as in an inference task involving C. elegans neurons and show that sEM learns cell labels significantly better than other approaches.

* Under review

Via

Access Paper or Ask Questions

Statistical bounds for entropic optimal transport: sample complexity and the central limit theorem

May 30, 2019

Gonzalo Mena, Jonathan Weed

Figure 1 for Statistical bounds for entropic optimal transport: sample complexity and the central limit theorem

Figure 2 for Statistical bounds for entropic optimal transport: sample complexity and the central limit theorem

Abstract:We prove several fundamental statistical bounds for entropic OT with the squared Euclidean cost between subgaussian probability measures in arbitrary dimension. First, through a new sample complexity result we establish the rate of convergence of entropic OT for empirical measures. Our analysis improves exponentially on the bound of Genevay et al. (2019) and extends their work to unbounded measures. Second, we establish a central limit theorem for entropic OT, based on techniques developed by Del Barrio and Loubes (2019). Previously, such a result was only known for finite metric spaces. As an application of our results, we develop and analyze a new technique for estimating the entropy of a random variable corrupted by gaussian noise.

* Under review. 23 pages, 2 figures. Version 2 fixes minor typos and errors

Via

Access Paper or Ask Questions

Learning Latent Permutations with Gumbel-Sinkhorn Networks

Feb 23, 2018

Gonzalo Mena, David Belanger, Scott Linderman, Jasper Snoek

Figure 1 for Learning Latent Permutations with Gumbel-Sinkhorn Networks

Figure 2 for Learning Latent Permutations with Gumbel-Sinkhorn Networks

Figure 3 for Learning Latent Permutations with Gumbel-Sinkhorn Networks

Figure 4 for Learning Latent Permutations with Gumbel-Sinkhorn Networks

Abstract:Permutations and matchings are core building blocks in a variety of latent variable models, as they allow us to align, canonicalize, and sort data. Learning in such models is difficult, however, because exact marginalization over these combinatorial objects is intractable. In response, this paper introduces a collection of new methods for end-to-end learning in such models that approximate discrete maximum-weight matching using the continuous Sinkhorn operator. Sinkhorn iteration is attractive because it functions as a simple, easy-to-implement analog of the softmax operator. With this, we can define the Gumbel-Sinkhorn method, an extension of the Gumbel-Softmax method (Jang et al. 2016, Maddison2016 et al. 2016) to distributions over latent matchings. We demonstrate the effectiveness of our method by outperforming competitive baselines on a range of qualitatively different tasks: sorting numbers, solving jigsaw puzzles, and identifying neural signals in worms.

* ICLR 2018

Via

Access Paper or Ask Questions