Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Takashi Nagata

Policy Distillation with Selective Input Gradient Regularization for Efficient Interpretability

May 18, 2022

Jinwei Xing, Takashi Nagata, Xinyun Zou, Emre Neftci, Jeffrey L. Krichmar

Figure 1 for Policy Distillation with Selective Input Gradient Regularization for Efficient Interpretability

Figure 2 for Policy Distillation with Selective Input Gradient Regularization for Efficient Interpretability

Figure 3 for Policy Distillation with Selective Input Gradient Regularization for Efficient Interpretability

Figure 4 for Policy Distillation with Selective Input Gradient Regularization for Efficient Interpretability

Abstract:Although deep Reinforcement Learning (RL) has proven successful in a wide range of tasks, one challenge it faces is interpretability when applied to real-world problems. Saliency maps are frequently used to provide interpretability for deep neural networks. However, in the RL domain, existing saliency map approaches are either computationally expensive and thus cannot satisfy the real-time requirement of real-world scenarios or cannot produce interpretable saliency maps for RL policies. In this work, we propose an approach of Distillation with selective Input Gradient Regularization (DIGR) which uses policy distillation and input gradient regularization to produce new policies that achieve both high interpretability and computation efficiency in generating saliency maps. Our approach is also found to improve the robustness of RL policies to multiple adversarial attacks. We conduct experiments on three tasks, MiniGrid (Fetch Object), Atari (Breakout) and CARLA Autonomous Driving, to demonstrate the importance and effectiveness of our approach.

Via

Access Paper or Ask Questions

Domain Adaptation In Reinforcement Learning Via Latent Unified State Representation

Feb 10, 2021

Jinwei Xing, Takashi Nagata, Kexin Chen, Xinyun Zou, Emre Neftci, Jeffrey L. Krichmar

Figure 1 for Domain Adaptation In Reinforcement Learning Via Latent Unified State Representation

Figure 2 for Domain Adaptation In Reinforcement Learning Via Latent Unified State Representation

Figure 3 for Domain Adaptation In Reinforcement Learning Via Latent Unified State Representation

Figure 4 for Domain Adaptation In Reinforcement Learning Via Latent Unified State Representation

Abstract:Despite the recent success of deep reinforcement learning (RL), domain adaptation remains an open problem. Although the generalization ability of RL agents is critical for the real-world applicability of Deep RL, zero-shot policy transfer is still a challenging problem since even minor visual changes could make the trained agent completely fail in the new task. To address this issue, we propose a two-stage RL agent that first learns a latent unified state representation (LUSR) which is consistent across multiple domains in the first stage, and then do RL training in one source domain based on LUSR in the second stage. The cross-domain consistency of LUSR allows the policy acquired from the source domain to generalize to other target domains without extra training. We first demonstrate our approach in variants of CarRacing games with customized manipulations, and then verify it in CARLA, an autonomous driving simulator with more complex and realistic visual observations. Our results show that this approach can achieve state-of-the-art domain adaptation performance in related RL tasks and outperforms prior approaches based on latent-representation based RL and image-to-image translation.

* Accepted by AAAI 2021

Via

Access Paper or Ask Questions