Safety assurance in Reinforcement Learning (RL) is critical for exploration in real-world scenarios. When solving a Constrained Markov Decision Process, current approaches face intrinsic difficulties in trading off optimality against feasibility. Direct optimization methods cannot strictly guarantee state-wise in-training safety, while projection-based methods are usually inefficient, correcting actions through lengthy iterations. To address these two challenges, this paper proposes an adaptive surrogate chance constraint on the safety cost and a hierarchical architecture that corrects actions produced by the upper policy layer via a fast Quasi-Newton method. Theoretical analysis shows that the relaxed probabilistic constraint is sufficient to guarantee forward invariance of the safe set. We validate the proposed method on four simulated and real-world safety-critical robotic tasks. Results indicate that the proposed method efficiently enforces safety (near-zero violations) while preserving optimality (+23.8%), robustness, and generalizability to stochastic real-world settings.
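To make the hierarchical correction idea concrete, the following is a minimal sketch, not the paper's implementation: an upper policy proposes an action, and a lower safety layer applies a quasi-Newton solver (here SciPy's L-BFGS-B, standing in for the paper's fast Quasi-Newton method) to nudge the action until a differentiable safety-cost surrogate is non-positive. The functions `safety_cost` and `correct_action`, the penalty weight, and the toy cost itself are hypothetical stand-ins for illustration only.

```python
# Illustrative sketch of a hierarchical safety layer (assumptions labeled above).
import numpy as np
from scipy.optimize import minimize


def safety_cost(state: np.ndarray, action: np.ndarray) -> float:
    """Hypothetical differentiable safety cost; c <= 0 is treated as 'safe'."""
    return float(np.dot(state, action) - 1.0)


def correct_action(state: np.ndarray, proposed: np.ndarray, bounds=None) -> np.ndarray:
    """Quasi-Newton correction: find an action close to the policy's proposal
    whose (hinged) safety cost is driven toward zero."""
    def objective(a):
        # Stay near the upper policy's action, penalize constraint violation.
        violation = max(safety_cost(state, a), 0.0)
        return 0.5 * np.sum((a - proposed) ** 2) + 10.0 * violation ** 2

    res = minimize(objective, proposed, method="L-BFGS-B", bounds=bounds)
    return res.x


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    s = rng.normal(size=3)
    a_policy = rng.normal(size=3)          # action from the upper policy layer
    a_safe = correct_action(s, a_policy)   # corrected by the lower safety layer
    print("cost before:", safety_cost(s, a_policy))
    print("cost after: ", safety_cost(s, a_safe))
```

In this sketch the correction is a single unconstrained quasi-Newton solve rather than an iterative projection, which is the efficiency contrast the abstract draws with projection-based methods.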