Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Adversarial Policies: Attacking Deep Reinforcement Learning

May 25, 2019

Adam Gleave, Michael Dennis, Neel Kant, Cody Wild, Sergey Levine, Stuart Russell

Figure 1 for Adversarial Policies: Attacking Deep Reinforcement Learning

Figure 2 for Adversarial Policies: Attacking Deep Reinforcement Learning

Figure 3 for Adversarial Policies: Attacking Deep Reinforcement Learning

Figure 4 for Adversarial Policies: Attacking Deep Reinforcement Learning

Share this with someone who'll enjoy it:

Abstract:Deep reinforcement learning (RL) policies are known to be vulnerable to adversarial perturbations to their observations, similar to adversarial examples for classifiers. However, an attacker is not usually able to directly modify another agent's observations. This might lead one to wonder: is it possible to attack an RL agent simply by choosing an adversarial policy acting in a multi-agent environment so as to create natural observations that are adversarial? We demonstrate the existence of adversarial policies in zero-sum games between simulated humanoid robots with proprioceptive observations, against state-of-the-art victims trained via self-play to be robust to opponents. The adversarial policies reliably win against the victims but generate seemingly random and uncoordinated behavior. We find that these policies are more successful in high-dimensional environments, and induce substantially different activations in the victim policy network than when the victim plays against a normal opponent. Videos are available at http://adversarialpolicies.github.io.

* Under review at NeurIPS 2019

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Adversarial Policies: Attacking Deep Reinforcement Learning

Paper and Code