Naively trained deep reinforcement learning agents may fail to satisfy vital safety constraints. Rather than retraining such an agent from scratch, which is costly, we can repair the previously trained agent to eliminate its unsafe behaviour. We devise a counterexample-guided repair algorithm for reinforcement learning systems that leverages safety critics. The algorithm jointly repairs the reinforcement learning agent and the safety critic using gradient-based constrained optimisation.
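
To make the overall scheme concrete, the following is a minimal sketch of a counterexample-guided repair loop of this kind, not the paper's actual implementation. It assumes hypothetical components: `policy` and `critic` are trained networks, `find_counterexample` stands in for a verifier or falsifier that searches for an input on which the policy behaves unsafely, and `nominal_loss` preserves task performance; the penalty formulation of the constrained optimisation is likewise an illustrative choice.

```python
import torch

def repair(policy, critic, find_counterexample, nominal_loss,
           lam=10.0, steps=500, lr=1e-3):
    """Illustrative counterexample-guided repair loop (not the paper's code).

    Alternates between (1) searching for a counterexample, i.e. a state on
    which the current policy violates the safety constraint, and (2) jointly
    updating the policy and the safety critic by gradient descent on a
    penalised objective over all counterexamples found so far.
    """
    counterexamples = []
    opt = torch.optim.Adam(
        list(policy.parameters()) + list(critic.parameters()), lr=lr)
    while True:
        cx = find_counterexample(policy)   # hypothetical verifier/falsifier
        if cx is None:                     # no violation found: repair done
            return policy, critic
        counterexamples.append(cx)
        batch = torch.stack(counterexamples)
        for _ in range(steps):
            opt.zero_grad()
            # Penalty form of the constrained problem: retain the nominal
            # objective while driving the critic's predicted safety cost on
            # every collected counterexample below zero.
            violation = torch.relu(critic(batch, policy(batch))).mean()
            loss = nominal_loss(policy) + lam * violation
            loss.backward()
            opt.step()
```

Under these assumptions, the loop terminates once the verifier can no longer find a violating input; the penalty weight `lam` trades off how aggressively safety violations are suppressed against degradation of the nominal objective.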