Mina Ferizbegovic

Robust exploration in linear quadratic reinforcement learning

Jun 04, 2019

Jack Umenberger, Mina Ferizbegovic, Thomas B. Schön, Håkan Hjalmarsson

Abstract: This paper concerns the problem of learning control policies for an unknown linear dynamical system to minimize a quadratic cost function. We present a method, based on convex optimization, that accomplishes this task robustly, i.e., we minimize the worst-case cost, accounting for system uncertainty given the observed data. The method balances exploitation and exploration, exciting the system so as to reduce uncertainty in the model parameters to which the worst-case cost is most sensitive. Numerical simulations and application to a hardware-in-the-loop servo-mechanism demonstrate the approach, with appreciable performance and robustness gains over alternative methods observed in both.
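To make the problem setting concrete, here is a minimal sketch of the plain certainty-equivalent baseline for a scalar system: excite the unknown system with random inputs, fit the model by least squares, then compute a linear quadratic gain from the estimate. This is not the robust method of the paper, which minimizes the worst-case cost over the model uncertainty and chooses the excitation deliberately. The scalar system, all parameter values, and the function names (`simulate`, `least_squares`, `lqr_gain`) are assumptions made for illustration.

```python
import random


def simulate(a, b, steps, noise_std, input_std, rng):
    """Roll out x[t+1] = a*x[t] + b*u[t] + w[t] with random exploratory inputs."""
    x = 0.0
    data = []
    for _ in range(steps):
        u = rng.gauss(0.0, input_std)  # undirected (random) excitation
        x_next = a * x + b * u + rng.gauss(0.0, noise_std)
        data.append((x, u, x_next))
        x = x_next
    return data


def least_squares(data):
    """Estimate (a, b) by solving the 2x2 normal equations of x_next ~ a*x + b*u."""
    sxx = sum(x * x for x, _, _ in data)
    sxu = sum(x * u for x, u, _ in data)
    suu = sum(u * u for _, u, _ in data)
    sxy = sum(x * y for x, _, y in data)
    suy = sum(u * y for _, u, y in data)
    det = sxx * suu - sxu * sxu
    a_hat = (sxy * suu - suy * sxu) / det
    b_hat = (suy * sxx - sxy * sxu) / det
    return a_hat, b_hat


def lqr_gain(a, b, q, r, iters=500):
    """Iterate the scalar discrete-time Riccati recursion; return k for u = -k*x."""
    p = q
    for _ in range(iters):
        p = q + a * a * p - (a * b * p) ** 2 / (r + b * b * p)
    return a * b * p / (r + b * b * p)


# Hypothetical "true" system, unknown to the learner.
a_true, b_true = 0.9, 0.5
rng = random.Random(0)

data = simulate(a_true, b_true, steps=500, noise_std=0.1, input_std=1.0, rng=rng)
a_hat, b_hat = least_squares(data)

# Certainty equivalence: treat the estimate as if it were the true system.
k = lqr_gain(a_hat, b_hat, q=1.0, r=1.0)
closed_loop = a_true - b_true * k  # pole of the true system under u = -k*x

print(f"a_hat={a_hat:.3f}, b_hat={b_hat:.3f}, k={k:.3f}, closed_loop={closed_loop:.3f}")
```

The certainty-equivalent gain ignores how uncertain the estimate is; the abstract's point is that a robust design accounts for that uncertainty and directs exploration toward the parameters that matter most for the worst-case cost.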