Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Amirhossein Taghvaei

Error Analysis of Triangular Optimal Transport Maps for Filtering

Oct 22, 2025

Mohammad Al-Jarrah, Bamdad Hosseini, Niyizhen Jin, Michele Martino, Amirhossein Taghvaei

Abstract:We present a systematic analysis of estimation errors for a class of optimal transport based algorithms for filtering and data assimilation. Along the way, we extend previous error analyses of Brenier maps to the case of conditional Brenier maps that arise in the context of simulation based inference. We then apply these results in a filtering scenario to analyze the optimal transport filtering algorithm of Al-Jarrah et al. (2024, ICML). An extension of that algorithm along with numerical benchmarks on various non-Gaussian and high-dimensional examples are provided to demonstrate its effectiveness and practical potential.

* 58 pages, 4 figures

Via

Access Paper or Ask Questions

Fast filtering of non-Gaussian models using Amortized Optimal Transport Maps

Mar 16, 2025

Mohammad Al-Jarrah, Bamdad Hosseini, Amirhossein Taghvaei

Abstract:In this paper, we present the amortized optimal transport filter (A-OTF) designed to mitigate the computational burden associated with the real-time training of optimal transport filters (OTFs). OTFs can perform accurate non-Gaussian Bayesian updates in the filtering procedure, but they require training at every time step, which makes them expensive. The proposed A-OTF framework exploits the similarity between OTF maps during an initial/offline training stage in order to reduce the cost of inference during online calculations. More precisely, we use clustering algorithms to select relevant subsets of pre-trained maps whose weighted average is used to compute the A-OTF model akin to a mixture of experts. A series of numerical experiments validate that A-OTF achieves substantial computational savings during online inference while preserving the inherent flexibility and accuracy of OTF.

* 6 pages, 4 figures

Via

Access Paper or Ask Questions

Conditional Optimal Transport on Function Spaces

Nov 17, 2023

Bamdad Hosseini, Alexander W. Hsu, Amirhossein Taghvaei

Figure 1 for Conditional Optimal Transport on Function Spaces

Figure 2 for Conditional Optimal Transport on Function Spaces

Figure 3 for Conditional Optimal Transport on Function Spaces

Figure 4 for Conditional Optimal Transport on Function Spaces

Abstract:We present a systematic study of conditional triangular transport maps in function spaces from the perspective of optimal transportation and with a view towards amortized Bayesian inference. More specifically, we develop a theory of constrained optimal transport problems that describe block-triangular Monge maps that characterize conditional measures along with their Kantorovich relaxations. This generalizes the theory of optimal triangular transport to separable infinite-dimensional function spaces with general cost functions. We further tailor our results to the case of Bayesian inference problems and obtain regularity estimates on the conditioning maps from the prior to the posterior. Finally, we present numerical experiments that demonstrate the computational applicability of our theoretical results for amortized and likelihood-free inference of functional parameters.

Via

Access Paper or Ask Questions

Optimal Transport-based Nonlinear Filtering in High-dimensional Settings

Oct 21, 2023

Mohammad Al-Jarrah, Niyizhen Jin, Bamdad Hosseini, Amirhossein Taghvaei

Figure 1 for Optimal Transport-based Nonlinear Filtering in High-dimensional Settings

Figure 2 for Optimal Transport-based Nonlinear Filtering in High-dimensional Settings

Figure 3 for Optimal Transport-based Nonlinear Filtering in High-dimensional Settings

Figure 4 for Optimal Transport-based Nonlinear Filtering in High-dimensional Settings

Abstract:This paper addresses the problem of nonlinear filtering, i.e., computing the conditional distribution of the state of a stochastic dynamical system given a history of noisy partial observations. The primary focus is on scenarios involving degenerate likelihoods or high-dimensional states, where traditional sequential importance resampling (SIR) particle filters face the weight degeneracy issue. Our proposed method builds on an optimal transport interpretation of nonlinear filtering, leading to a simulation-based and likelihood-free algorithm that estimates the Brenier optimal transport map from the current distribution of the state to the distribution at the next time step. Our formulation allows us to harness the approximation power of neural networks to model complex and multi-modal distributions and employ stochastic optimization algorithms to enhance scalability. Extensive numerical experiments are presented that compare our method to the SIR particle filter and the ensemble Kalman filter, demonstrating the superior performance of our method in terms of sample efficiency, high-dimensional scalability, and the ability to capture complex and multi-modal distributions.

* 24 pages, 15 figures

Via

Access Paper or Ask Questions

Duality-Based Stochastic Policy Optimization for Estimation with Unknown Noise Covariances

Oct 26, 2022

Shahriar Talebi, Amirhossein Taghvaei, Mehran Mesbahi

Figure 1 for Duality-Based Stochastic Policy Optimization for Estimation with Unknown Noise Covariances

Abstract:Duality of control and estimation allows mapping recent advances in data-guided control to the estimation setup. This paper formalizes and utilizes such a mapping by considering learning the optimal (steady-state) Kalman gain when process and measurement noise statistics are unknown. Specifically, building on the duality between synthesizing optimal control and estimation gains, the filter design problem is formalized as direct policy learning; subsequently, a Stochastic Gradient Descent (SGD) approach is adopted to learn the optimal filter gain. In this direction, control and estimation duality is also used to extend existing theoretical results for direct policy updates for Linear Quadratic Regulator (LQR) to establish convergence of the proposed algorithm-while addressing subtle differences between the two synthesis problems. The results are illustrated via several numerical examples.

Via

Access Paper or Ask Questions

An Optimal Transport Formulation of Bayes' Law for Nonlinear Filtering Algorithms

Mar 22, 2022

Amirhossein Taghvaei, Bamdad Hosseini

Figure 1 for An Optimal Transport Formulation of Bayes' Law for Nonlinear Filtering Algorithms

Abstract:This paper presents a variational representation of the Bayes' law using optimal transportation theory. The variational representation is in terms of the optimal transportation between the joint distribution of the (state, observation) and their independent coupling. By imposing certain structure on the transport map, the solution to the variational problem is used to construct a Brenier-type map that transports the prior distribution to the posterior distribution for any value of the observation signal. The new formulation is used to derive the optimal transport form of the Ensemble Kalman filter (EnKF) for the discrete-time filtering problem and propose a novel extension of EnKF to the non-Gaussian setting utilizing input convex neural networks. Finally, the proposed methodology is used to derive the optimal transport form of the feedback particle filler (FPF) in the continuous-time limit, which constitutes its first variational construction without explicitly using the nonlinear filtering equation or Bayes' law.

Via

Access Paper or Ask Questions

Variational Wasserstein gradient flow

Dec 04, 2021

Jiaojiao Fan, Amirhossein Taghvaei, Yongxin Chen

Figure 1 for Variational Wasserstein gradient flow

Figure 2 for Variational Wasserstein gradient flow

Figure 3 for Variational Wasserstein gradient flow

Figure 4 for Variational Wasserstein gradient flow

Abstract:The gradient flow of a function over the space of probability densities with respect to the Wasserstein metric often exhibits nice properties and has been utilized in several machine learning applications. The standard approach to compute the Wasserstein gradient flow is the finite difference which discretizes the underlying space over a grid, and is not scalable. In this work, we propose a scalable proximal gradient type algorithm for Wasserstein gradient flow. The key of our method is a variational formulation of the objective function, which makes it possible to realize the JKO proximal map through a primal-dual optimization. This primal-dual problem can be efficiently solved by alternatively updating the parameters in the inner and outer loops. Our framework covers all the classical Wasserstein gradient flows including the heat equation and the porous medium equation. We demonstrate the performance and scalability of our algorithm with several numerical examples.

Via

Access Paper or Ask Questions

Deep FPF: Gain function approximation in high-dimensional setting

Oct 02, 2020

S. Yagiz Olmez, Amirhossein Taghvaei, Prashant G. Mehta

Figure 1 for Deep FPF: Gain function approximation in high-dimensional setting

Figure 2 for Deep FPF: Gain function approximation in high-dimensional setting

Figure 3 for Deep FPF: Gain function approximation in high-dimensional setting

Figure 4 for Deep FPF: Gain function approximation in high-dimensional setting

Abstract:In this paper, we present a novel approach to approximate the gain function of the feedback particle filter (FPF). The exact gain function is the solution of a Poisson equation involving a probability-weighted Laplacian. The numerical problem is to approximate the exact gain function using only finitely many particles sampled from the probability distribution. Inspired by the recent success of the deep learning methods, we represent the gain function as a gradient of the output of a neural network. Thereupon considering a certain variational formulation of the Poisson equation, an optimization problem is posed for learning the weights of the neural network. A stochastic gradient algorithm is described for this purpose. The proposed approach has two significant properties/advantages: (i) The stochastic optimization algorithm allows one to process, in parallel, only a batch of samples (particles) ensuring good scaling properties with the number of particles; (ii) The remarkable representation power of neural networks means that the algorithm is potentially applicable and useful to solve high-dimensional problems. We numerically establish these two properties and provide extensive comparison to the existing approaches.

* To be presented at 59th IEEE Conference on Decision and Control, 2020

Via

Access Paper or Ask Questions

Scalable Computations of Wasserstein Barycenter via Input Convex Neural Networks

Jul 08, 2020

Jiaojiao Fan, Amirhossein Taghvaei, Yongxin Chen

Figure 1 for Scalable Computations of Wasserstein Barycenter via Input Convex Neural Networks

Figure 2 for Scalable Computations of Wasserstein Barycenter via Input Convex Neural Networks

Figure 3 for Scalable Computations of Wasserstein Barycenter via Input Convex Neural Networks

Figure 4 for Scalable Computations of Wasserstein Barycenter via Input Convex Neural Networks

Abstract:Wasserstein Barycenter is a principled approach to represent the weighted mean of a given set of probability distributions, utilizing the geometry induced by optimal transport. In this work, we present a novel scalable algorithm to approximate the Wasserstein Barycenters aiming at high-dimensional applications in machine learning. Our proposed algorithm is based on the Kantorovich dual formulation of the 2-Wasserstein distance as well as a recent neural network architecture, input convex neural network, that is known to parametrize convex functions. The distinguishing features of our method are: i) it only requires samples from the marginal distributions; ii) unlike the existing semi-discrete approaches, it represents the Barycenter with a generative model; iii) it allows to compute the barycenter with arbitrary weights after one training session. We demonstrate the efficacy of our algorithm by comparing it with the state-of-art methods in multiple experiments.

* 16 pages,12 figures

Via

Access Paper or Ask Questions

An Optimal Transport Formulation of the Ensemble Kalman Filter

Oct 05, 2019

Amirhossein Taghvaei, Prashant G. Mehta

Figure 1 for An Optimal Transport Formulation of the Ensemble Kalman Filter

Figure 2 for An Optimal Transport Formulation of the Ensemble Kalman Filter

Abstract:Controlled interacting particle systems such as the ensemble Kalman filter (EnKF) and the feedback particle filter (FPF) are numerical algorithms to approximate the solution of the nonlinear filtering problem in continuous time. The distinguishing feature of these algorithms is that the Bayesian update step is implemented using a feedback control law. It has been noted in the literature that the control law is not unique. This is the main problem addressed in this paper. To obtain a unique control law, the filtering problem is formulated here as an optimal transportation problem. An explicit formula for the (mean-field type) optimal control law is derived in the linear Gaussian setting. Comparisons are made with the control laws for different types of EnKF algorithms described in the literature. Via empirical approximation of the mean-field control law, a finite-$N$ controlled interacting particle algorithm is obtained. For this algorithm, the equations for empirical mean and covariance are derived and shown to be identical to the Kalman filter. This allows strong conclusions on convergence and error properties based on the classical filter stability theory for the Kalman filter. It is shown that, under certain technical conditions, the mean squared error (m.s.e.) converges to zero even with a finite number of particles. A detailed propagation of chaos analysis is carried out for the finite-$N$ algorithm. The analysis is used to prove weak convergence of the empirical distribution as $N\rightarrow\infty$. For a certain simplified filtering problem, analytical comparison of the m.s.e. with the importance sampling-based algorithms is described. The analysis helps explain the favorable scaling properties of the control-based algorithms reported in several numerical studies in recent literature.

Via

Access Paper or Ask Questions