Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization

Mar 12, 2021

Zhenggang Tang, Chao Yu, Boyuan Chen, Huazhe Xu, Xiaolong Wang, Fei Fang, Simon Du, Yu Wang, Yi Wu

Figure 1 for Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization

Figure 2 for Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization

Figure 3 for Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization

Figure 4 for Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization

Share this with someone who'll enjoy it:

Abstract:We propose a simple, general and effective technique, Reward Randomization for discovering diverse strategic policies in complex multi-agent games. Combining reward randomization and policy gradient, we derive a new algorithm, Reward-Randomized Policy Gradient (RPG). RPG is able to discover multiple distinctive human-interpretable strategies in challenging temporal trust dilemmas, including grid-world games and a real-world game Agar.io, where multiple equilibria exist but standard multi-agent policy gradient algorithms always converge to a fixed one with a sub-optimal payoff for every player even using state-of-the-art exploration techniques. Furthermore, with the set of diverse strategies from RPG, we can (1) achieve higher payoffs by fine-tuning the best policy from the set; and (2) obtain an adaptive agent by using this set of strategies as its training opponents. The source code and example videos can be found in our website: https://sites.google.com/view/staghuntrpg.

* Accepted paper on ICLR 2021. First two authors share equal contribution

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization

Paper and Code