Abstract: We consider the problem of team formation within multiagent adversarial games. We propose BERTeam, a novel algorithm that uses a transformer-based deep neural network with Masked Language Model training to select the best team of players from a trained population. We integrate this with coevolutionary deep reinforcement learning, which trains a diverse population of individual players from which teams are chosen. We test our algorithm in the multiagent adversarial game Marine Capture-The-Flag, and we find that BERTeam learns non-trivial team compositions that perform well against unseen opponents. For this game, we find that BERTeam outperforms MCAA, an algorithm that similarly optimizes team formation.
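To make the Masked Language Model view of team selection concrete, the following is a minimal sketch, not the authors' implementation: a team is treated as a short sequence of player indices drawn from the trained population, a transformer encoder is trained to recover masked slots, and a team is formed by greedily filling an all-masked sequence. All names and sizes (TeamBuilder, POP_SIZE, TEAM_SIZE, the mask token) are illustrative assumptions.

```python
import torch
import torch.nn as nn

POP_SIZE = 32        # number of trained players in the population
TEAM_SIZE = 3        # players per team
MASK_ID = POP_SIZE   # extra token id used for masked team slots

class TeamBuilder(nn.Module):
    """Transformer encoder over team-member tokens with an MLM-style head."""
    def __init__(self, d_model=64, nhead=4, nlayers=2):
        super().__init__()
        self.embed = nn.Embedding(POP_SIZE + 1, d_model)  # +1 for the mask token
        layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, nlayers)
        self.head = nn.Linear(d_model, POP_SIZE)          # logits over players

    def forward(self, team_tokens):
        h = self.encoder(self.embed(team_tokens))
        return self.head(h)                               # (batch, TEAM_SIZE, POP_SIZE)

def mlm_loss(model, teams):
    """Mask one random slot per observed team and train the model to recover it."""
    teams = teams.clone()
    batch = torch.arange(teams.size(0))
    slot = torch.randint(TEAM_SIZE, (teams.size(0),))
    target = teams[batch, slot].clone()
    teams[batch, slot] = MASK_ID
    logits = model(teams)[batch, slot]
    return nn.functional.cross_entropy(logits, target)

def sample_team(model):
    """Fill an all-masked team one slot at a time (greedy decoding)."""
    team = torch.full((1, TEAM_SIZE), MASK_ID)
    for slot in range(TEAM_SIZE):
        team[0, slot] = model(team)[0, slot].argmax()
    return team.squeeze(0)
```

In the paper, the teams used for training would come from evaluated match outcomes produced by the coevolutionary process; here the training data source is left abstract.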
Abstract: When researching robot swarms, many studies observe complex group behavior emerging from the individual agents' simple local actions. However, the task of learning an individual policy that produces a desired emergent behavior remains a challenging and largely unsolved problem. We present a method of training distributed robotic swarm algorithms to produce emergent behavior. Inspired by the biological evolution of emergent behavior in animals, we use an evolutionary algorithm to train a 'population' of individual behaviors to approximate a desired group behavior. We perform experiments using simulations of the Georgia Tech Miniature Autonomous Blimps (GT-MABs), an aerial robotics platform, conducted in the CoppeliaSim simulator. Additionally, we test on simulations of Anki Vector robots to demonstrate our algorithm's effectiveness across different modes of actuation. We evaluate our algorithm on various tasks where success requires a somewhat complex group behavior. These tasks include an Area Coverage task, a Surround Target task, and a Wall Climb task. We compare behaviors evolved using our algorithm against 'designed policies', which we hand-craft to exhibit the desired emergent behaviors.
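The evolutionary loop described above can be sketched as follows. This is an illustrative assumption, not the authors' code: each candidate is a parameter vector for the shared individual policy, it is copied onto every agent in a simulated swarm, and its fitness is the score of the resulting group behavior. The swarm_rollout function is a stand-in for a CoppeliaSim episode (e.g., Area Coverage), and all sizes and hyperparameters are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
N_AGENTS, PARAM_DIM = 10, 16
POP, ELITES, GENERATIONS, SIGMA = 20, 5, 50, 0.1

def individual_policy(params, observation):
    """Tiny linear policy; the same parameters run on every agent."""
    W = params.reshape(4, 4)                # assumes 4-d observations and 4-d actions
    return np.tanh(W @ observation)

def swarm_rollout(params):
    """Placeholder simulation: copy one policy onto all agents and score the
    emergent group behavior (here, a rough area-coverage-style spread metric)."""
    positions = rng.normal(size=(N_AGENTS, 2))
    for _ in range(100):
        for i in range(N_AGENTS):
            obs = np.concatenate([positions[i], positions.mean(axis=0)])
            positions[i] += 0.05 * individual_policy(params, obs)[:2]
    return float(np.mean(np.linalg.norm(positions - positions.mean(axis=0), axis=1)))

# Evolutionary loop: evaluate, keep elites, refill with mutated copies.
population = [rng.normal(size=PARAM_DIM) for _ in range(POP)]
for gen in range(GENERATIONS):
    ranked = sorted(population, key=swarm_rollout, reverse=True)
    elites = ranked[:ELITES]
    children = []
    while len(children) < POP - ELITES:
        parent = elites[rng.integers(ELITES)]
        children.append(parent + SIGMA * rng.normal(size=PARAM_DIM))
    population = elites + children
```

In the reported experiments the rollout would instead run the GT-MAB or Anki Vector simulation and return the task-specific fitness; only the select-and-mutate structure of the loop is intended to match the description above.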