Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Rémy Hosseinkhan Boucher

Université Paris-Saclay, CNRS

Increasing Information for Model Predictive Control with Semi-Markov Decision Processes

Jan 28, 2025

Rémy Hosseinkhan Boucher, Onofrio Semeraro, Lionel Mathelin

Abstract:Recent works in Learning-Based Model Predictive Control of dynamical systems show impressive sample complexity performances using criteria from Information Theory to accelerate the learning procedure. However, the sequential exploration opportunities are limited by the system local state, restraining the amount of information of the observations from the current exploration trajectory. This article resolves this limitation by introducing temporal abstraction through the framework of Semi-Markov Decision Processes. The framework increases the total information of the gathered data for a fixed sampling budget, thus reducing the sample complexity.

* Proceedings of the 6th Annual Learning for Dynamics & Control Conference, p. 1400--1414, volume 242, publisher: Proceedings of Machine Learning Research, 2024

Via

Access Paper or Ask Questions

Evidence on the Regularisation Properties of Maximum-Entropy Reinforcement Learning

Jan 28, 2025

Rémy Hosseinkhan Boucher, Onofrio Semeraro, Lionel Mathelin

Abstract:The generalisation and robustness properties of policies learnt through Maximum-Entropy Reinforcement Learning are investigated on chaotic dynamical systems with Gaussian noise on the observable. First, the robustness under noise contamination of the agent's observation of entropy regularised policies is observed. Second, notions of statistical learning theory, such as complexity measures on the learnt model, are borrowed to explain and predict the phenomenon. Results show the existence of a relationship between entropy-regularised policy optimisation and robustness to noise, which can be described by the chosen complexity measures.

Via

Access Paper or Ask Questions