Abstract: In this paper we propose a new methodology for decision-making under uncertainty that draws on recent advances in nonlinear stochastic optimal control theory, applied mathematics, and machine learning. Our work is grounded in the nonlinear Feynman-Kac lemma and the fundamental connection between backward nonlinear partial differential equations and forward-backward stochastic differential equations. Using these connections, together with results from our prior work on importance sampling for forward-backward stochastic differential equations, we develop a control framework that is scalable and applicable to general classes of stochastic systems and decision-making problem formulations in robotics and autonomy. We propose two stochastic control architectures, consisting of feed-forward and recurrent neural networks. The performance and scalability of these algorithms are investigated on two stochastic optimal control problem formulations, the unconstrained L2 case and the control-constrained case, and on three systems in simulation. We conclude with a discussion of the implications of the proposed algorithms for robotics and autonomous systems.
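For context, the connection invoked above is the standard nonlinear Feynman-Kac result linking a semilinear backward PDE to an FBSDE system; the sketch below uses generic drift $b$, diffusion $\Sigma$, driver $h$, and terminal cost $g$ (our notation, not necessarily the paper's):
\[
  v_t + \tfrac{1}{2}\operatorname{tr}\!\big(\Sigma\Sigma^{\top} v_{xx}\big) + b^{\top} v_x + h\big(t, x, v, \Sigma^{\top} v_x\big) = 0, \qquad v(T, x) = g(x),
\]
which, under suitable regularity conditions, admits the probabilistic representation
\[
  \mathrm{d}X_t = b(t, X_t)\,\mathrm{d}t + \Sigma(t, X_t)\,\mathrm{d}W_t, \qquad
  \mathrm{d}Y_t = -h(t, X_t, Y_t, Z_t)\,\mathrm{d}t + Z_t^{\top}\,\mathrm{d}W_t, \quad Y_T = g(X_T),
\]
with $Y_t = v(t, X_t)$ and $Z_t = \Sigma^{\top} v_x(t, X_t)$, so the backward PDE solution can be obtained by simulating the forward process and learning the backward pair.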
Abstract: In this paper we investigate the use of MPC-inspired neural network policies for sequential decision-making. We introduce an extension to the DAgger algorithm for training such policies and show that it improves training performance and generalization. We exploit this extension to train complex planning policy architectures scalably and efficiently in continuous state and action spaces. We provide an extensive comparison of neural network policies, considering feed-forward policies, recurrent policies, and recurrent policies with planning structure inspired by the Path Integral control framework. Our results suggest that MPC-type recurrent policies are more robust to disturbances and modeling error.
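For reference, the Path Integral structure mentioned above typically enters through an importance-weighted control update over $N$ sampled rollouts; a generic form (with temperature $\lambda$, rollout cost $S(\tau_i)$, and injected control perturbations $\delta u^{(i)}$ as our illustrative notation) is
\[
  u_t \leftarrow u_t + \frac{\sum_{i=1}^{N} \exp\!\big(-S(\tau_i)/\lambda\big)\, \delta u_t^{(i)}}{\sum_{i=1}^{N} \exp\!\big(-S(\tau_i)/\lambda\big)},
\]
i.e., low-cost trajectories dominate the update, which is the kind of sampling-and-reweighting computation a recurrent planning policy can unroll internally.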