Title: Falsification-Based Robust Adversarial Reinforcement Learning

Jul 17, 2020

Xiao Wang, Saasha Nair, Matthias Althoff

Abstract: Reinforcement learning (RL) has achieved tremendous progress in solving various sequential decision-making problems, e.g., control tasks in robotics. However, RL methods often fail to generalize to safety-critical scenarios because policies are overfitted to their training environments. Previously, robust adversarial reinforcement learning (RARL) was proposed to train an adversarial network that applies disturbances to a system, which improves robustness in test scenarios. A drawback of neural-network-based adversaries is that integrating system requirements without handcrafting sophisticated reward signals is difficult. Safety falsification methods allow one to find a set of initial conditions as well as an input sequence such that the system violates a given property formulated in temporal logic. In this paper, we propose falsification-based RARL (FRARL), the first generic framework for integrating temporal-logic falsification in adversarial learning to improve policy robustness. With a falsification method, we do not need to construct an extra reward function for the adversary. We evaluate our approach on a braking assistance system and an adaptive cruise control system of autonomous vehicles. Experiments show that policies trained with a falsification-based adversary generalize better and violate the safety specification less often in test scenarios than policies trained without an adversary or with an adversarial network.
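To make the falsification idea in the abstract concrete, here is a minimal sketch, not the authors' implementation. It assumes a toy one-dimensional braking scenario, the temporal-logic property "always gap > 0", and plain random search standing in for a real falsification tool. All names (`simulate`, `robustness`, `falsify`, `never_brake`) and all numeric ranges are hypothetical and chosen only for illustration. The sketch covers only the adversary's search for a violating initial condition and input sequence; the RL training loop that would consume such counterexamples is omitted.

```python
import random


def simulate(policy, d0, v0, lead_accels, dt=0.1):
    """Roll out a toy 1-D braking scenario and return the gap trace.

    Assumption: both vehicles start at speed v0, separated by gap d0.
    The lead vehicle's accelerations are the adversary's input sequence.
    """
    gap, v_ego, v_lead = d0, v0, v0
    trace = [gap]
    for a_lead in lead_accels:
        a_ego = policy(gap, v_ego, v_lead)      # policy under test
        v_lead = max(0.0, v_lead + a_lead * dt)  # speeds cannot go negative
        v_ego = max(0.0, v_ego + a_ego * dt)
        gap += (v_lead - v_ego) * dt
        trace.append(gap)
    return trace


def robustness(trace):
    """Robustness of 'always gap > 0': the minimum gap over the trace.

    A negative value means the property is violated somewhere.
    """
    return min(trace)


def falsify(policy, n_trials=200, horizon=50, seed=0):
    """Random-search falsifier (a stand-in for a real falsification tool).

    Samples initial conditions and input sequences and keeps the sample
    with the lowest robustness. Returns (rho, d0, v0, lead_accels).
    """
    rng = random.Random(seed)
    best = None
    for _ in range(n_trials):
        d0 = rng.uniform(5.0, 30.0)    # hypothetical initial gap range [m]
        v0 = rng.uniform(5.0, 20.0)    # hypothetical initial speed range [m/s]
        accels = [rng.uniform(-8.0, 0.0) for _ in range(horizon)]
        rho = robustness(simulate(policy, d0, v0, accels))
        if best is None or rho < best[0]:
            best = (rho, d0, v0, accels)
    return best


def never_brake(gap, v_ego, v_lead):
    """A deliberately unsafe policy: the ego vehicle never decelerates."""
    return 0.0


if __name__ == "__main__":
    rho, d0, v0, _ = falsify(never_brake)
    print("falsified" if rho < 0 else "not falsified", "from d0 =", d0, "v0 =", v0)
```

Note that the adversary here needs no reward function of its own: the specification's robustness value is the only signal it minimizes, which is the point the abstract makes about falsification-based adversaries.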

* 11 pages, 3 figures
