Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jason Pazis

Crossmodal Attentive Skill Learner

May 22, 2018

Shayegan Omidshafiei, Dong-Ki Kim, Jason Pazis, Jonathan P. How

Figure 1 for Crossmodal Attentive Skill Learner

Figure 2 for Crossmodal Attentive Skill Learner

Figure 3 for Crossmodal Attentive Skill Learner

Figure 4 for Crossmodal Attentive Skill Learner

Abstract:This paper presents the Crossmodal Attentive Skill Learner (CASL), integrated with the recently-introduced Asynchronous Advantage Option-Critic (A2OC) architecture [Harb et al., 2017] to enable hierarchical reinforcement learning across multiple sensory inputs. We provide concrete examples where the approach not only improves performance in a single task, but accelerates transfer to new tasks. We demonstrate the attention mechanism anticipates and identifies useful latent features, while filtering irrelevant sensor modalities during execution. We modify the Arcade Learning Environment [Bellemare et al., 2013] to support audio queries, and conduct evaluations of crossmodal learning in the Atari 2600 game Amidar. Finally, building on the recent work of Babaeizadeh et al. [2017], we open-source a fast hybrid CPU-GPU implementation of CASL.

* International Conference on Autonomous Agents and Multiagent Systems (AAMAS) 2018, NIPS 2017 Deep Reinforcement Learning Symposium

Via

Access Paper or Ask Questions

Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability

Jul 13, 2017

Shayegan Omidshafiei, Jason Pazis, Christopher Amato, Jonathan P. How, John Vian

Figure 1 for Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability

Figure 2 for Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability

Figure 3 for Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability

Figure 4 for Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability

Abstract:Many real-world tasks involve multiple agents with partial observability and limited communication. Learning is challenging in these settings due to local viewpoints of agents, which perceive the world as non-stationary due to concurrently-exploring teammates. Approaches that learn specialized policies for individual tasks face problems when applied to the real world: not only do agents have to learn and store distinct policies for each task, but in practice identities of tasks are often non-observable, making these approaches inapplicable. This paper formalizes and addresses the problem of multi-task multi-agent reinforcement learning under partial observability. We introduce a decentralized single-task learning approach that is robust to concurrent interactions of teammates, and present an approach for distilling single-task policies into a unified policy that performs well across multiple related tasks, without explicit provision of task identity.

* Proceedings of the 34th International Conference on Machine Learning (ICML 2017), Sydney, Australia, PMLR 70:2681-2690, 2017
* Accepted to ICML 2017

Via

Access Paper or Ask Questions