Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tairan Huang

CGAR: Critic Guided Action Redistribution in Reinforcement Leaning

Jun 23, 2022

Tairan Huang, Xu Li, Hao Li, Mingming Sun, Ping Li

Figure 1 for CGAR: Critic Guided Action Redistribution in Reinforcement Leaning

Figure 2 for CGAR: Critic Guided Action Redistribution in Reinforcement Leaning

Figure 3 for CGAR: Critic Guided Action Redistribution in Reinforcement Leaning

Figure 4 for CGAR: Critic Guided Action Redistribution in Reinforcement Leaning

Abstract:Training a game-playing reinforcement learning agent requires multiple interactions with the environment. Ignorant random exploration may cause a waste of time and resources. It's essential to alleviate such waste. As discussed in this paper, under the settings of the off-policy actor critic algorithms, we demonstrate that the critic can bring more expected discounted rewards than or at least equal to the actor. Thus, the Q value predicted by the critic is a better signal to redistribute the action originally sampled from the policy distribution predicted by the actor. This paper introduces the novel Critic Guided Action Redistribution (CGAR) algorithm and tests it on the OpenAI MuJoCo tasks. The experimental results demonstrate that our method improves the sample efficiency and achieves state-of-the-art performance. Our code can be found at https://github.com/tairanhuang/CGAR.

* IEEE Conference on Games (CoG), 2022

Via

Access Paper or Ask Questions

Adversarial Attacks for Embodied Agents

May 19, 2020

Aishan Liu, Tairan Huang, Xianglong Liu, Yitao Xu, Yuqing Ma, Xinyun Chen, Stephen J. Maybank, Dacheng Tao

Figure 1 for Adversarial Attacks for Embodied Agents

Figure 2 for Adversarial Attacks for Embodied Agents

Figure 3 for Adversarial Attacks for Embodied Agents

Figure 4 for Adversarial Attacks for Embodied Agents

Abstract:Adversarial attacks are valuable for providing insights into the blind-spots of deep learning models and help improve their robustness. Existing work on adversarial attacks have mainly focused on static scenes; however, it remains unclear whether such attacks are effective against embodied agents, which could navigate and interact with a dynamic environment. In this work, we take the first step to study adversarial attacks for embodied agents. In particular, we generate spatiotemporal perturbations to form 3D adversarial examples, which exploit the interaction history in both the temporal and spatial dimensions. Regarding the temporal dimension, since agents make predictions based on historical observations, we develop a trajectory attention module to explore scene view contributions, which further help localize 3D objects appeared with the highest stimuli. By conciliating with clues from the temporal dimension, along the spatial dimension, we adversarially perturb the physical properties (e.g., texture and 3D shape) of the contextual objects that appeared in the most important scene views. Extensive experiments on the EQA-v1 dataset for several embodied tasks in both the white-box and black-box settings have been conducted, which demonstrate that our perturbations have strong attack and generalization abilities.

* 17 pages, 9 figures

Via

Access Paper or Ask Questions