Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Héctor Muñoz-Avila

Reinforcement Learning based Multi-Robot Classification via Scalable Communication Structure

Dec 18, 2020

Guangyi Liu, Arash Amini, Martin Takáč, Héctor Muñoz-Avila, Nader Motee

Figure 1 for Reinforcement Learning based Multi-Robot Classification via Scalable Communication Structure

Figure 2 for Reinforcement Learning based Multi-Robot Classification via Scalable Communication Structure

Figure 3 for Reinforcement Learning based Multi-Robot Classification via Scalable Communication Structure

Figure 4 for Reinforcement Learning based Multi-Robot Classification via Scalable Communication Structure

Abstract:In the multi-robot collaboration domain, training with Reinforcement Learning (RL) can become intractable, and performance starts to deteriorate drastically as the number of robots increases. In this work, we proposed a distributed multi-robot learning architecture with a scalable communication structure capable of learning a robust communication policy for time-varying communication topology. We construct the communication structure with Long-Short Term Memory (LSTM) cells and star graphs, in which the computational complexity of the proposed learning algorithm scales linearly with the number of robots and suitable for application with a large number of robots. The proposed methodology is validated with a map classification problem in the simulated environment. It is shown that the proposed architecture achieves a comparable classification accuracy with the centralized methods, maintains high performance with various numbers of robots without additional training cost, and robust to hacking and loss of the robots in the network.

Via

Access Paper or Ask Questions

Hierarchical Reinforcement Learning for Deep Goal Reasoning: An Expressiveness Analysis

Jun 21, 2020

Weihang Yuan, Héctor Muñoz-Avila

Figure 1 for Hierarchical Reinforcement Learning for Deep Goal Reasoning: An Expressiveness Analysis

Figure 2 for Hierarchical Reinforcement Learning for Deep Goal Reasoning: An Expressiveness Analysis

Figure 3 for Hierarchical Reinforcement Learning for Deep Goal Reasoning: An Expressiveness Analysis

Figure 4 for Hierarchical Reinforcement Learning for Deep Goal Reasoning: An Expressiveness Analysis

Abstract:Hierarchical DQN (h-DQN) is a two-level architecture of feedforward neural networks where the meta level selects goals and the lower level takes actions to achieve the goals. We show tasks that cannot be solved by h-DQN, exemplifying the limitation of this type of hierarchical framework (HF). We describe the recurrent hierarchical framework (RHF), generalizing architectures that use a recurrent neural network at the meta level. We analyze the expressiveness of HF and RHF using context-sensitive grammars. We show that RHF is more expressive than HF. We perform experiments comparing an implementation of RHF with two HF baselines; the results corroborate our theoretical findings.

Via

Access Paper or Ask Questions

A Layered Architecture for Active Perception: Image Classification using Deep Reinforcement Learning

Sep 20, 2019

Hossein K. Mousavi, Guangyi Liu, Weihang Yuan, Martin Takáč, Héctor Muñoz-Avila, Nader Motee

Figure 1 for A Layered Architecture for Active Perception: Image Classification using Deep Reinforcement Learning

Figure 2 for A Layered Architecture for Active Perception: Image Classification using Deep Reinforcement Learning

Figure 3 for A Layered Architecture for Active Perception: Image Classification using Deep Reinforcement Learning

Figure 4 for A Layered Architecture for Active Perception: Image Classification using Deep Reinforcement Learning

Abstract:We propose a planning and perception mechanism for a robot (agent), that can only observe the underlying environment partially, in order to solve an image classification problem. A three-layer architecture is suggested that consists of a meta-layer that decides the intermediate goals, an action-layer that selects local actions as the agent navigates towards a goal, and a classification-layer that evaluates the reward and makes a prediction. We design and implement these layers using deep reinforcement learning. A generalized policy gradient algorithm is utilized to learn the parameters of these layers to maximize the expected reward. Our proposed methodology is tested on the MNIST dataset of handwritten digits, which provides us with a level of explainability while interpreting the agent's intermediate goals and course of action.

* Submitted to ICRA-2020

Via

Access Paper or Ask Questions