Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mohamed K. Abdelaziz

Cooperative Multi-Agent Learning for Navigation via Structured State Abstraction

Jun 20, 2023

Mohamed K. Abdelaziz, Mohammed S. Elbamby, Sumudu Samarakoon, Mehdi Bennis

Figure 1 for Cooperative Multi-Agent Learning for Navigation via Structured State Abstraction

Figure 2 for Cooperative Multi-Agent Learning for Navigation via Structured State Abstraction

Figure 3 for Cooperative Multi-Agent Learning for Navigation via Structured State Abstraction

Figure 4 for Cooperative Multi-Agent Learning for Navigation via Structured State Abstraction

Abstract:Cooperative multi-agent reinforcement learning (MARL) for navigation enables agents to cooperate to achieve their navigation goals. Using emergent communication, agents learn a communication protocol to coordinate and share information that is needed to achieve their navigation tasks. In emergent communication, symbols with no pre-specified usage rules are exchanged, in which the meaning and syntax emerge through training. Learning a navigation policy along with a communication protocol in a MARL environment is highly complex due to the huge state space to be explored. To cope with this complexity, this work proposes a novel neural network architecture, for jointly learning an adaptive state space abstraction and a communication protocol among agents participating in navigation tasks. The goal is to come up with an adaptive abstractor that significantly reduces the size of the state space to be explored, without degradation in the policy performance. Simulation results show that the proposed method reaches a better policy, in terms of achievable rewards, resulting in fewer training iterations compared to the case where raw states or fixed state abstraction are used. Moreover, it is shown that a communication protocol emerges during training which enables the agents to learn better policies within fewer training iterations.

* 24 Pages, 13 Figures, Submitted to a journal for possible publication

Via

Access Paper or Ask Questions