Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Revisiting Safe Exploration in Safe Reinforcement learning

Sep 02, 2024

David Eckel, Baohe Zhang, Joschka Bödecker

Figure 1 for Revisiting Safe Exploration in Safe Reinforcement learning

Figure 2 for Revisiting Safe Exploration in Safe Reinforcement learning

Figure 3 for Revisiting Safe Exploration in Safe Reinforcement learning

Figure 4 for Revisiting Safe Exploration in Safe Reinforcement learning

Share this with someone who'll enjoy it:

Abstract:Safe reinforcement learning (SafeRL) extends standard reinforcement learning with the idea of safety, where safety is typically defined through the constraint of the expected cost return of a trajectory being below a set limit. However, this metric fails to distinguish how costs accrue, treating infrequent severe cost events as equal to frequent mild ones, which can lead to riskier behaviors and result in unsafe exploration. We introduce a new metric, expected maximum consecutive cost steps (EMCC), which addresses safety during training by assessing the severity of unsafe steps based on their consecutive occurrence. This metric is particularly effective for distinguishing between prolonged and occasional safety violations. We apply EMMC in both on- and off-policy algorithm for benchmarking their safe exploration capability. Finally, we validate our metric through a set of benchmarks and propose a new lightweight benchmark task, which allows fast evaluation for algorithm design.

View paper on

Share this with someone who'll enjoy it:

Title:Revisiting Safe Exploration in Safe Reinforcement learning

Paper and Code