Abstract: Recent work in imitation learning has shown that having an expert controller that is both suitably smooth and stable enables stronger guarantees on the performance of the learned controller. However, constructing such smoothed expert controllers for arbitrary systems remains challenging, especially in the presence of input and state constraints. As our primary contribution, we show how such a smoothed expert can be designed for a general class of systems using a log-barrier-based relaxation of a standard Model Predictive Control (MPC) optimization problem. Improving upon our previous work, we show that barrier MPC achieves the theoretically optimal error-to-smoothness tradeoff along some direction. At the core of this theoretical guarantee on smoothness is an improved lower bound we prove on the optimality gap of the analytic center associated with a convex Lipschitz function, which we believe could be of independent interest. We validate our theoretical findings via experiments, demonstrating the merits of our smoothing approach over randomized smoothing.
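As a rough illustration of the log-barrier relaxation (a minimal sketch, not the paper's exact formulation), the code below replaces hard box constraints on the inputs of a linear MPC problem with log-barrier terms weighted by a parameter `eps`; the function name, the horizon, and the restriction to input-only constraints are all illustrative assumptions.

```python
# A minimal sketch of a log-barrier-relaxed linear MPC step, assuming known
# dynamics x_{t+1} = A x_t + B u_t and box input constraints |u| <= u_max.
# The barrier weight `eps` trades approximation error for smoothness of the
# resulting policy x0 -> u0 (names and setup are illustrative assumptions).
import cvxpy as cp

def barrier_mpc_action(A, B, Q, R, x0, horizon=10, u_max=1.0, eps=1e-2):
    n, m = B.shape
    x = cp.Variable((n, horizon + 1))
    u = cp.Variable((m, horizon))
    cost, constraints = 0, [x[:, 0] == x0]
    for t in range(horizon):
        cost += cp.quad_form(x[:, t], Q) + cp.quad_form(u[:, t], R)
        # Hard constraints -u_max <= u <= u_max become smooth barrier terms.
        cost -= eps * cp.sum(cp.log(u_max - u[:, t]) + cp.log(u_max + u[:, t]))
        constraints.append(x[:, t + 1] == A @ x[:, t] + B @ u[:, t])
    cp.Problem(cp.Minimize(cost), constraints).solve()
    return u[:, 0].value  # first action of the smoothed expert
```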
Abstract: Statistical learning theory and high-dimensional statistics have had a tremendous impact on machine learning theory and on a variety of domains, including systems and control theory. Over the past few years, we have witnessed a variety of applications of such theoretical tools to help answer questions such as: how many state-action pairs are needed to learn a static control policy to a given accuracy? Recent results have shown that continuously differentiable and stabilizing control policies can be well-approximated using neural networks with hard guarantees on performance, yet even the simplest constrained control problems yield policies that are not smooth. To address this gap, in this paper we study smooth approximations of linear Model Predictive Control (MPC) policies, in which hard constraints are replaced by barrier functions, a.k.a. barrier MPC. In particular, we show that barrier MPC inherits the exponential stability properties of the original non-smooth MPC policy. Through a careful analysis of the proposed barrier MPC, we show that its smoothness constant can be explicitly controlled, thereby paving the way for new sample complexity results for approximating MPC policies from sampled state-action pairs.
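To make the sample-complexity angle concrete, here is a minimal sketch, under illustrative assumptions, of approximating a smooth expert policy (e.g., barrier MPC) from sampled state-action pairs via ridge regression; the affine feature map and all names are hypothetical choices, not the paper's construction.

```python
# Fit an affine approximation to a smooth expert policy from sampled
# state-action pairs (a simple stand-in for richer function classes such
# as neural networks; names here are illustrative).
import numpy as np

def fit_policy(states, actions, reg=1e-6):
    X = np.hstack([states, np.ones((len(states), 1))])  # affine features
    W = np.linalg.solve(X.T @ X + reg * np.eye(X.shape[1]), X.T @ actions)
    return lambda x: np.append(x, 1.0) @ W

# Usage sketch: sample states, query the smooth expert at each one to get
# actions, call fit_policy(states, actions), and deploy the returned map.
```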
Abstract: We introduce the first direct policy search algorithm which provably converges to the globally optimal $\textit{dynamic}$ filter for the classical problem of predicting the outputs of a linear dynamical system, given noisy, partial observations. Despite the ubiquity of partial observability in practice, theoretical guarantees for direct policy search algorithms, one of the backbones of modern reinforcement learning, have proven difficult to achieve. This is primarily due to the degeneracies which arise when optimizing over filters that maintain internal state. In this paper, we provide a new perspective on this challenging problem based on the notion of $\textit{informativity}$, which intuitively requires that all components of a filter's internal state are representative of the true state of the underlying dynamical system. We show that informativity overcomes the aforementioned degeneracy. Specifically, we propose a $\textit{regularizer}$ which explicitly enforces informativity, and establish that gradient descent on this regularized objective, combined with a ``reconditioning step'', converges to the globally optimal cost at a rate of $\mathcal{O}(1/T)$. Our analysis relies on several new results which may be of independent interest, including a new framework for analyzing non-convex gradient descent via convex reformulation, and novel bounds on the solution to linear Lyapunov equations in terms of (our quantitative measure of) informativity.
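The sketch below is a conceptual, hypothetical rendering of such a regularized objective (not the paper's exact regularizer or reconditioning step): a prediction-error loss for a filter with internal state, plus a log-det penalty on a Gramian-like matrix that rewards internal states spanning all directions.

```python
# Conceptual sketch: prediction-error loss for a filter
#   s_{t+1} = A s_t + L (y_t - C s_t),  y_hat_t = C s_t,
# plus an illustrative "informativity" regularizer (log-det of an empirical
# Gramian of the internal state). Gradient descent can then be run on this
# loss via torch autograd; names and the exact penalty are assumptions.
import torch

def regularized_filter_loss(A, C, L, ys, lam=1e-2):
    n = A.shape[0]
    s = torch.zeros(n)
    pred_err = torch.tensor(0.0)
    gram = 1e-6 * torch.eye(n)
    for y in ys:
        e = y - C @ s
        pred_err = pred_err + e.pow(2).sum()
        s = A @ s + L @ e
        gram = gram + torch.outer(s, s)
    # Penalize uninformative (degenerate) internal-state directions.
    return pred_err - lam * torch.logdet(gram / len(ys))
```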
Abstract: Stabilizing an unknown control system is one of the most fundamental problems in control systems engineering. In this paper, we provide a simple, model-free algorithm for stabilizing fully observed dynamical systems. While model-free methods have become increasingly popular in practice due to their simplicity and flexibility, stabilization via direct policy search has received surprisingly little attention. Our algorithm proceeds by solving a series of discounted LQR problems, where the discount factor is gradually increased. We prove that this method efficiently recovers a stabilizing controller for linear systems, and for smooth, nonlinear systems within a neighborhood of their equilibria. Our approach overcomes a significant limitation of prior work, namely the need for an initial stabilizing control policy to be given in advance. We empirically evaluate the effectiveness of our approach on common control benchmarks.
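The discount-homotopy idea can be illustrated as below. The paper's algorithm is model-free; this sketch instead uses a known model and a Riccati solve purely to convey the schedule over the discount factor, exploiting the fact that discounted LQR on $(A, B)$ equals standard LQR on $(\sqrt{\gamma} A, \sqrt{\gamma} B)$. The gamma schedule and names are illustrative.

```python
# Illustrative (model-based) version of the discount homotopy: solve
# discounted LQR problems with increasing gamma until the resulting gain
# stabilizes the undiscounted system. The paper's method is model-free;
# this sketch only conveys the homotopy over gamma.
import numpy as np
from scipy.linalg import solve_discrete_are

def discount_homotopy(A, B, Q, R, gammas=(0.5, 0.7, 0.9, 0.99, 0.999)):
    for gamma in gammas:
        Ag, Bg = np.sqrt(gamma) * A, np.sqrt(gamma) * B
        P = solve_discrete_are(Ag, Bg, Q, R)
        K = np.linalg.solve(R + Bg.T @ P @ Bg, Bg.T @ P @ Ag)
        if np.max(np.abs(np.linalg.eigvals(A - B @ K))) < 1:
            return K  # stabilizing for the true (undiscounted) system
    return None  # schedule did not reach a stabilizing gain
```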
Abstract: This paper proposes methods for identification of large-scale networked systems with guarantees that the resulting model will be contracting -- a strong form of nonlinear stability -- and/or monotone, i.e., order relations between states are preserved. The main challenges that we address are: simultaneously searching for model parameters and a certificate of stability, and scalability to networks with hundreds or thousands of nodes. We propose a model set that admits convex constraints for stability and monotonicity, and has a separable structure that allows distributed identification via the alternating directions method of multipliers (ADMM). The performance and scalability of the approach are illustrated on a variety of linear and nonlinear case studies, including a nonlinear traffic network with a 200-dimensional state space.
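As a much-simplified illustration of identification over a convex model set with a built-in stability certificate (the paper's model set, monotonicity constraints, and ADMM decomposition are considerably richer), one can fit a linear model subject to a spectral-norm bound, which certifies contraction in the identity metric:

```python
# Simplified sketch: least-squares identification of x_{t+1} = A x_t with a
# convex contraction certificate ||A||_2 <= 1 - mu (conservative, identity
# metric). X and Xnext are n-by-T matrices of state snapshots (illustrative).
import cvxpy as cp

def identify_contracting(X, Xnext, mu=0.05):
    n = X.shape[0]
    A = cp.Variable((n, n))
    prob = cp.Problem(cp.Minimize(cp.sum_squares(Xnext - A @ X)),
                      [cp.sigma_max(A) <= 1 - mu])
    prob.solve()
    return A.value
```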
Abstract: Recent work by Mania et al. has proved that certainty equivalent control achieves nearly optimal regret for linear systems with quadratic costs. However, when parameter uncertainty is large, certainty equivalence cannot be relied upon to stabilize the true, unknown system. In this paper, we present a dual control strategy that attempts to combine the performance of certainty equivalence with the practical utility of robustness. The formulation preserves structure in the representation of parametric uncertainty, which allows the controller to target reduction of uncertainty in the parameters that `matter most' for the control task, while robustly stabilizing the uncertain system. Control synthesis proceeds via convex optimization, and the method is illustrated on a numerical example.
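A toy heuristic (emphatically not the paper's synthesis procedure) conveys what targeting the parameters that `matter most' means: given a parameter covariance and a direction along which the control cost is most sensitive, probe the direction that most shrinks posterior variance along that sensitive direction.

```python
# Toy heuristic only: for a rank-one information update along a unit probing
# direction d, the posterior variance along the cost-sensitive direction v
# drops fastest, to leading order, when d aligns with Sigma @ v. This is an
# illustrative caricature of targeted uncertainty reduction.
import numpy as np

def targeted_probe_direction(Sigma, v):
    d = Sigma @ v
    return d / np.linalg.norm(d)
```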
Abstract: This paper concerns the problem of learning control policies for an unknown linear dynamical system to minimize a quadratic cost function. We present a method, based on convex optimization, that accomplishes this task robustly: i.e., we minimize the worst-case cost, accounting for system uncertainty given the observed data. The method balances exploitation and exploration, exciting the system so as to reduce uncertainty in the model parameters to which the worst-case cost is most sensitive. Numerical simulations and application to a hardware-in-the-loop servo-mechanism demonstrate the approach, with appreciable performance and robustness gains over alternative methods observed in both.
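The worst-case objective can be made concrete with a scenario-based surrogate, a hypothetical stand-in for the paper's convex-optimization formulation: sample models consistent with the data and evaluate the maximum closed-loop LQR cost of a candidate gain over those samples.

```python
# Scenario surrogate for the worst-case cost of a state-feedback gain K over
# sampled models (A, B); the actual method optimizes over an uncertainty set
# via convex optimization, so this is only an illustrative evaluation.
import numpy as np
from scipy.linalg import solve_discrete_lyapunov

def worst_case_cost(K, models, Q, R):
    worst = 0.0
    for A, B in models:
        Acl = A - B @ K
        if np.max(np.abs(np.linalg.eigvals(Acl))) >= 1:
            return np.inf  # K fails to stabilize some sampled model
        # Acl' P Acl - P + (Q + K' R K) = 0; cost from x0 ~ N(0, I) is tr(P).
        P = solve_discrete_lyapunov(Acl.T, Q + K.T @ R @ K)
        worst = max(worst, np.trace(P))
    return worst
```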
Abstract: New light is shed onto optimization problems resulting from prediction error parameter estimation of linear and nonlinear systems. It is shown that the ``smoothness'' of the objective function depends both on the simulation length and on the decay rate of the prediction model. More precisely, for regions of the parameter space where the model is not contractive, the Lipschitz constant and $\beta$-smoothness of the objective function might blow up exponentially with the simulation length, making it hard to numerically find minima within those regions or even to escape from them. In addition to providing theoretical understanding of this problem, this paper also proposes the use of multiple shooting as a viable solution. The proposed method minimizes the error between a prediction model and observed values. Rather than running the prediction model over the entire dataset, as in the original prediction error formulation, multiple shooting splits the data into smaller subsets and runs the prediction model over each subdivision, making the simulation length a design parameter and making it possible to solve problems that would be infeasible using a standard approach. The equivalence with the original problem is obtained by including constraints in the optimization. The method is illustrated for the parameter estimation of nonlinear systems with chaotic or unstable behavior, as well as for neural network parameter estimation.
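A minimal sketch of a multiple-shooting loss follows, with soft continuity penalties standing in for the paper's equality constraints; the function and parameter names are illustrative assumptions.

```python
# Multiple shooting for prediction-error estimation: each segment gets its
# own initial state s0s[k], simulation runs only within a segment, and a
# penalty rho couples consecutive segments (a soft version of the equality
# constraints). f and g are the state-transition and output maps.
import numpy as np

def multiple_shooting_loss(theta, s0s, ys, f, g, seg_len, rho=10.0):
    loss = 0.0
    for k, s0 in enumerate(s0s):
        x = s0
        for t in range(seg_len):
            loss += (ys[k * seg_len + t] - g(x, theta)) ** 2
            x = f(x, theta)  # short simulation keeps the smoothness tame
        if k + 1 < len(s0s):
            loss += rho * np.sum((x - s0s[k + 1]) ** 2)  # seam continuity
    return loss
```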
Abstract: We propose an input design method for a general class of parametric probabilistic models, including nonlinear dynamical systems with process noise. The goal of the procedure is to select inputs such that the parameter posterior distribution concentrates about the true value of the parameters; however, exact computation of the posterior is intractable. By representing (samples from) the posterior as trajectories of a certain Hamiltonian system, we transform the input design task into an optimal control problem. The method is illustrated via numerical examples, including MRI pulse sequence design.
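The Hamiltonian representation reads like the construction underlying Hamiltonian Monte Carlo, where posterior samples arise as trajectories of a Hamiltonian system with potential $U(\theta) = -\log p(\theta \mid \text{data})$; treating this connection as an assumption, a standard leapfrog integrator for these dynamics looks as follows (the paper then optimizes the inputs through such dynamics, which is not shown here).

```python
# Standard leapfrog integration of Hamiltonian dynamics with potential U;
# grad_U returns the gradient of the negative log posterior at theta. In the
# input-design setting, U depends on the inputs, which become the decision
# variables of an optimal control problem (omitted in this sketch).
def leapfrog(theta, p, grad_U, step=0.01, n_steps=20):
    p = p - 0.5 * step * grad_U(theta)       # half step for momentum
    for _ in range(n_steps - 1):
        theta = theta + step * p             # full step for position
        p = p - step * grad_U(theta)         # full step for momentum
    theta = theta + step * p
    p = p - 0.5 * step * grad_U(theta)       # final half step for momentum
    return theta, p
```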
Abstract: Learning to make decisions from observed data in dynamic environments remains a problem of fundamental importance in a number of fields, from artificial intelligence and robotics to medicine and finance. This paper concerns the problem of learning control policies for unknown linear dynamical systems so as to maximize a quadratic reward function. We present a method to optimize the expected value of the reward over the posterior distribution of the unknown system parameters, given data. The algorithm involves sequential convex programming, and enjoys reliable local convergence and robust stability guarantees. Numerical simulations and stabilization of a real-world inverted pendulum are used to demonstrate the approach, with strong performance and robustness properties observed in both.
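A Monte Carlo rendering of the posterior-averaged objective is sketched below; the paper optimizes this kind of objective with sequential convex programming, whereas the sketch only evaluates a sampled version of it, with illustrative names and a quadratic cost in place of the reward.

```python
# Approximate the expected quadratic cost of a gain K over posterior samples
# of the unknown (A, B) by rolling out the closed loop for T steps from x0.
# Minimizing this sampled objective over K is the job of the optimization
# procedure (not shown); names here are illustrative.
import numpy as np

def expected_cost(K, posterior_samples, Q, R, x0, T=50):
    total = 0.0
    for A, B in posterior_samples:
        x = x0.copy()
        for _ in range(T):
            u = -K @ x
            total += x @ Q @ x + u @ R @ u
            x = A @ x + B @ u
    return total / len(posterior_samples)
```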