Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Alexander Kolesov

Field Matching: an Electrostatic Paradigm to Generate and Transfer Data

Feb 04, 2025

Alexander Kolesov, Manukhov Stepan, Vladimir V. Palyulin, Alexander Korotin

Abstract:We propose Electrostatic Field Matching (EFM), a novel method that is suitable for both generative modeling and distribution transfer tasks. Our approach is inspired by the physics of an electrical capacitor. We place source and target distributions on the capacitor plates and assign them positive and negative charges, respectively. We then learn the electrostatic field of the capacitor using a neural network approximator. To map the distributions to each other, we start at one plate of the capacitor and move the samples along the learned electrostatic field lines until they reach the other plate. We theoretically justify that this approach provably yields the distribution transfer. In practice, we demonstrate the performance of our EFM in toy and image data experiments.

Via

Access Paper or Ask Questions

Robust Barycenter Estimation using Semi-Unbalanced Neural Optimal Transport

Oct 04, 2024

Milena Gazdieva, Jaemoo Choi, Alexander Kolesov, Jaewoong Choi, Petr Mokrov, Alexander Korotin

Figure 1 for Robust Barycenter Estimation using Semi-Unbalanced Neural Optimal Transport

Figure 2 for Robust Barycenter Estimation using Semi-Unbalanced Neural Optimal Transport

Figure 3 for Robust Barycenter Estimation using Semi-Unbalanced Neural Optimal Transport

Figure 4 for Robust Barycenter Estimation using Semi-Unbalanced Neural Optimal Transport

Abstract:A common challenge in aggregating data from multiple sources can be formalized as an \textit{Optimal Transport} (OT) barycenter problem, which seeks to compute the average of probability distributions with respect to OT discrepancies. However, the presence of outliers and noise in the data measures can significantly hinder the performance of traditional statistical methods for estimating OT barycenters. To address this issue, we propose a novel, scalable approach for estimating the \textit{robust} continuous barycenter, leveraging the dual formulation of the \textit{(semi-)unbalanced} OT problem. To the best of our knowledge, this paper is the first attempt to develop an algorithm for robust barycenters under the continuous distribution setup. Our method is framed as a $\min$-$\max$ optimization problem and is adaptable to \textit{general} cost function. We rigorously establish the theoretical underpinnings of the proposed method and demonstrate its robustness to outliers and class imbalance through a number of illustrative experiments.

* 19 pages, 4 figures

Via

Access Paper or Ask Questions

Estimating Barycenters of Distributions with Neural Optimal Transport

Feb 06, 2024

Alexander Kolesov, Petr Mokrov, Igor Udovichenko, Milena Gazdieva, Gudmund Pammer, Evgeny Burnaev, Alexander Korotin

Figure 1 for Estimating Barycenters of Distributions with Neural Optimal Transport

Figure 2 for Estimating Barycenters of Distributions with Neural Optimal Transport

Figure 3 for Estimating Barycenters of Distributions with Neural Optimal Transport

Figure 4 for Estimating Barycenters of Distributions with Neural Optimal Transport

Abstract:Given a collection of probability measures, a practitioner sometimes needs to find an "average" distribution which adequately aggregates reference distributions. A theoretically appealing notion of such an average is the Wasserstein barycenter, which is the primal focus of our work. By building upon the dual formulation of Optimal Transport (OT), we propose a new scalable approach for solving the Wasserstein barycenter problem. Our methodology is based on the recent Neural OT solver: it has bi-level adversarial learning objective and works for general cost functions. These are key advantages of our method, since the typical adversarial algorithms leveraging barycenter tasks utilize tri-level optimization and focus mostly on quadratic cost. We also establish theoretical error bounds for our proposed approach and showcase its applicability and effectiveness on illustrative scenarios and image data setups.

Via

Access Paper or Ask Questions

Energy-Guided Continuous Entropic Barycenter Estimation for General Costs

Oct 02, 2023

Alexander Kolesov, Petr Mokrov, Igor Udovichenko, Milena Gazdieva, Gudmund Pammer, Evgeny Burnaev, Alexander Korotin

Figure 1 for Energy-Guided Continuous Entropic Barycenter Estimation for General Costs

Figure 2 for Energy-Guided Continuous Entropic Barycenter Estimation for General Costs

Figure 3 for Energy-Guided Continuous Entropic Barycenter Estimation for General Costs

Figure 4 for Energy-Guided Continuous Entropic Barycenter Estimation for General Costs

Abstract:Optimal transport (OT) barycenters are a mathematically grounded way of averaging probability distributions while capturing their geometric properties. In short, the barycenter task is to take the average of a collection of probability distributions w.r.t. given OT discrepancies. We propose a novel algorithm for approximating the continuous Entropic OT (EOT) barycenter for arbitrary OT cost functions. Our approach is built upon the dual reformulation of the EOT problem based on weak OT, which has recently gained the attention of the ML community. Beyond its novelty, our method enjoys several advantageous properties: (i) we establish quality bounds for the recovered solution; (ii) this approach seemlessly interconnects with the Energy-Based Models (EBMs) learning procedure enabling the use of well-tuned algorithms for the problem of interest; (iii) it provides an intuitive optimization scheme avoiding min-max, reinforce and other intricate technical tricks. For validation, we consider several low-dimensional scenarios and image-space setups, including non-Euclidean cost functions. Furthermore, we investigate the practical task of learning the barycenter on an image manifold generated by a pretrained generative model, opening up new directions for real-world applications.

Via

Access Paper or Ask Questions

Building the Bridge of Schrödinger: A Continuous Entropic Optimal Transport Benchmark

Jun 16, 2023

Nikita Gushchin, Alexander Kolesov, Petr Mokrov, Polina Karpikova, Andrey Spiridonov, Evgeny Burnaev, Alexander Korotin

Figure 1 for Building the Bridge of Schrödinger: A Continuous Entropic Optimal Transport Benchmark

Figure 2 for Building the Bridge of Schrödinger: A Continuous Entropic Optimal Transport Benchmark

Figure 3 for Building the Bridge of Schrödinger: A Continuous Entropic Optimal Transport Benchmark

Figure 4 for Building the Bridge of Schrödinger: A Continuous Entropic Optimal Transport Benchmark

Abstract:Over the last several years, there has been a significant progress in developing neural solvers for the Schr\"odinger Bridge (SB) problem and applying them to generative modeling. This new research field is justifiably fruitful as it is interconnected with the practically well-performing diffusion models and theoretically-grounded entropic optimal transport (EOT). Still the area lacks non-trivial tests allowing a researcher to understand how well do the methods solve SB or its equivalent continuous EOT problem. We fill this gap and propose a novel way to create pairs of probability distributions for which the ground truth OT solution in known by the construction. Our methodology is generic and works for a wide range of OT formulations, in particular, it covers the EOT which is equivalent to SB (the main interest of our study). This development allows us to create continuous benchmark distributions with the known EOT and SB solution on high-dimensional spaces such as spaces of images. As an illustration, we use these benchmark pairs to test how well do existing neural EOT/SB solvers actually compute the EOT solution. The benchmark is available via the link: https://github.com/ngushchin/EntropicOTBenchmark.

Via

Access Paper or Ask Questions

Entropic Neural Optimal Transport via Diffusion Processes

Nov 02, 2022

Nikita Gushchin, Alexander Kolesov, Alexander Korotin, Dmitry Vetrov, Evgeny Burnaev

Figure 1 for Entropic Neural Optimal Transport via Diffusion Processes

Figure 2 for Entropic Neural Optimal Transport via Diffusion Processes

Figure 3 for Entropic Neural Optimal Transport via Diffusion Processes

Figure 4 for Entropic Neural Optimal Transport via Diffusion Processes

Abstract:We propose a novel neural algorithm for the fundamental problem of computing the entropic optimal transport (EOT) plan between probability distributions which are accessible by samples. Our algorithm is based on the saddle point reformulation of the dynamic version of EOT which is known as the Schr\"odinger Bridge problem. In contrast to the prior methods for large-scale EOT, our algorithm is end-to-end and consists of a single learning step, has fast inference procedure, and allows handling small values of the entropy regularization coefficient which is of particular importance in some applied problems. Empirically, we show the performance of the method on several large-scale EOT tasks.

Via

Access Paper or Ask Questions

Kantorovich Strikes Back! Wasserstein GANs are not Optimal Transport?

Jun 15, 2022

Alexander Korotin, Alexander Kolesov, Evgeny Burnaev

Figure 1 for Kantorovich Strikes Back! Wasserstein GANs are not Optimal Transport?

Figure 2 for Kantorovich Strikes Back! Wasserstein GANs are not Optimal Transport?

Figure 3 for Kantorovich Strikes Back! Wasserstein GANs are not Optimal Transport?

Figure 4 for Kantorovich Strikes Back! Wasserstein GANs are not Optimal Transport?

Abstract:Wasserstein Generative Adversarial Networks (WGANs) are the popular generative models built on the theory of Optimal Transport (OT) and the Kantorovich duality. Despite the success of WGANs, it is still unclear how well the underlying OT dual solvers approximate the OT cost (Wasserstein-1 distance, $\mathbb{W}_{1}$) and the OT gradient needed to update the generator. In this paper, we address these questions. We construct 1-Lipschitz functions and use them to build ray monotone transport plans. This strategy yields pairs of continuous benchmark distributions with the analytically known OT plan, OT cost and OT gradient in high-dimensional spaces such as spaces of images. We thoroughly evaluate popular WGAN dual form solvers (gradient penalty, spectral normalization, entropic regularization, etc.) using these benchmark pairs. Even though these solvers perform well in WGANs, none of them faithfully compute $\mathbb{W}_{1}$ in high dimensions. Nevertheless, many provide a meaningful approximation of the OT gradient. These observations suggest that these solvers should not be treated as good estimators of $\mathbb{W}_{1}$, but to some extent they indeed can be used in variational problems requiring the minimization of $\mathbb{W}_{1}$.

Via

Access Paper or Ask Questions