Title: Accelerating Online Reinforcement Learning via Supervisory Safety Systems

Sep 22, 2022

Benjamin Evans, Johannes Betz, Hongrui Zheng, Herman A. Engelbrecht, Rahul Mangharam, Hendrik W. Jordaan

Abstract: Deep reinforcement learning (DRL) is a promising method to learn control policies for robots only from demonstration and experience. To cover the whole dynamic behaviour of the robot, DRL training is an active exploration process that is typically carried out in simulation environments. Although this simulation training is cheap and fast, applying DRL algorithms to real-world settings is difficult. Even if agents are trained until they perform safely in simulation, transferring them to physical systems remains hard because of the sim-to-real gap caused by the difference between the simulation dynamics and the physical robot. In this paper, we present a method of training a DRL agent online to drive autonomously on a physical vehicle by using a model-based safety supervisor. Our solution uses a supervisory system to check whether the action selected by the agent is safe or unsafe and to ensure that a safe action is always implemented on the vehicle. With this, we can bypass the sim-to-real problem while training the DRL algorithm safely, quickly, and efficiently. We provide a variety of real-world experiments in which we train a small-scale physical vehicle online to drive autonomously with no prior simulation training. The evaluation results show that our method trains agents with improved sample efficiency while never crashing, and the trained agents demonstrate better driving performance than those trained in simulation.

* 7 Pages, 10 Figures, 1 Table. Submitted to 2023 IEEE International Conference on Robotics and Automation (ICRA 2023)
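
The supervisory loop described in the abstract (the agent proposes an action, a model-based supervisor checks it, and only a safe action reaches the vehicle) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the 1-D lateral-offset vehicle, the look-ahead model, the `Supervisor` class, the candidate action set, and the random stand-in for the DRL policy are all assumptions made for illustration.

```python
import random
from dataclasses import dataclass


@dataclass
class Supervisor:
    """Toy model-based supervisor for a 1-D lateral-offset vehicle (hypothetical)."""
    half_width: float = 1.0   # track boundary at |y| = half_width
    dt: float = 0.1           # model time step
    horizon: int = 5          # number of look-ahead steps

    def predict(self, y: float, action: float) -> float:
        # Assumed model: the action is a lateral velocity held over the horizon.
        return y + action * self.dt * self.horizon

    def is_safe(self, y: float, action: float) -> bool:
        return abs(self.predict(y, action)) < self.half_width

    def filter(self, y, action, candidates):
        """Return (action_to_apply, intervened)."""
        if self.is_safe(y, action):
            return action, False
        safe = [a for a in candidates if self.is_safe(y, a)]
        # Prefer the candidate whose predicted offset is closest to the centre line.
        pool = safe if safe else candidates
        return min(pool, key=lambda a: abs(self.predict(y, a))), True


def run_episode(steps=200, seed=0):
    rng = random.Random(seed)
    sup = Supervisor()
    candidates = [-2.0, -1.0, 0.0, 1.0, 2.0]
    y, interventions, max_abs_y = 0.0, 0, 0.0
    for _ in range(steps):
        proposed = rng.choice(candidates)   # stand-in for the DRL agent's policy
        applied, intervened = sup.filter(y, proposed, candidates)
        interventions += intervened
        y += applied * sup.dt               # true dynamics match the model here
        max_abs_y = max(max_abs_y, abs(y))
    return interventions, max_abs_y


if __name__ == "__main__":
    n, worst = run_episode()
    print(f"interventions: {n}, largest |offset|: {worst:.3f}")
```

In this toy setting the vehicle stays inside the track even though the proposed actions are random, because every applied action has passed the supervisor's look-ahead check. On a real vehicle the model would not match the true dynamics exactly, so the safety margin would have to account for that mismatch.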
