Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Valmik Prabhu

Technical Report: Adaptive Control for Linearizable Systems Using On-Policy Reinforcement Learning

Apr 06, 2020

Tyler Westenbroek, Eric Mazumdar, David Fridovich-Keil, Valmik Prabhu, Claire J. Tomlin, S. Shankar Sastry

Figure 1 for Technical Report: Adaptive Control for Linearizable Systems Using On-Policy Reinforcement Learning

Abstract:This paper proposes a framework for adaptively learning a feedback linearization-based tracking controller for an unknown system using discrete-time model-free policy-gradient parameter update rules. The primary advantage of the scheme over standard model-reference adaptive control techniques is that it does not require the learned inverse model to be invertible at all instances of time. This enables the use of general function approximators to approximate the linearizing controller for the system without having to worry about singularities. However, the discrete-time and stochastic nature of these algorithms precludes the direct application of standard machinery from the adaptive control literature to provide deterministic stability proofs for the system. Nevertheless, we leverage these techniques alongside tools from the stochastic approximation literature to demonstrate that with high probability the tracking and parameter errors concentrate near zero when a certain persistence of excitation condition is satisfied. A simulated example of a double pendulum demonstrates the utility of the proposed theory. 1

Via

Access Paper or Ask Questions

Exponentially Stable First Order Control on Matrix Lie Groups

Apr 01, 2020

Valmik Prabhu, Amay Saxena, S. Shankar Sastry

Figure 1 for Exponentially Stable First Order Control on Matrix Lie Groups

Figure 2 for Exponentially Stable First Order Control on Matrix Lie Groups

Figure 3 for Exponentially Stable First Order Control on Matrix Lie Groups

Figure 4 for Exponentially Stable First Order Control on Matrix Lie Groups

Abstract:We present a novel first order controller for systems evolving on matrix Lie groups, a major use case of which is Cartesian velocity control on robot manipulators. This controller achieves global exponential trajectory tracking on a number of commonly used Lie groups including the Special Orthogonal Group SO(n), the Special Euclidean Group SE(n), and the General Linear Group over complex numbers GL(n, C). Additionally, this controller achieves local exponential trajectory tracking on all matrix Lie groups. We demonstrate the effectiveness of this controller in simulation on a number of different Lie groups as well as on hardware with a 7-DOF Sawyer robot arm.

* 8 pages, four figures, submitted to CDC 2020

Via

Access Paper or Ask Questions

Feedback Linearization for Unknown Systems via Reinforcement Learning

Oct 29, 2019

Tyler Westenbroek, David Fridovich-Keil, Eric Mazumdar, Shreyas Arora, Valmik Prabhu, S. Shankar Sastry, Claire J. Tomlin

Figure 1 for Feedback Linearization for Unknown Systems via Reinforcement Learning

Figure 2 for Feedback Linearization for Unknown Systems via Reinforcement Learning

Figure 3 for Feedback Linearization for Unknown Systems via Reinforcement Learning

Figure 4 for Feedback Linearization for Unknown Systems via Reinforcement Learning

Abstract:We present a novel approach to control design for nonlinear systems, which leverages reinforcement learning techniques to learn a linearizing controller for a physical plant with unknown dynamics. Feedback linearization is a technique from nonlinear control which renders the input-output dynamics of a nonlinear plant \emph{linear} under application of an appropriate feedback controller. Once a linearizing controller has been constructed, desired output trajectories for the nonlinear plant can be tracked using a variety of linear control techniques. A single learned policy then serves to track arbitrary desired reference signals provided by a higher-level planner. We present theoretical results which provide conditions under which the learning problem has a unique solution which exactly linearizes the plant. We demonstrate the performance of our approach on two simulated problems and a physical robotic platform. For the simulated environments, we observe that the learned feedback linearizing policies can achieve arbitrary tracking of reference trajectories for a fully actuated double pendulum and a 14 dimensional quadrotor. In hardware, we demonstrate that our approach significantly improves tracking performance on a 7-DOF Baxter robot after less than two hours of training.

Via

Access Paper or Ask Questions