Ryan Yu

Explore Reinforced: Equilibrium Approximation with Reinforcement Learning

Dec 02, 2024

Ryan Yu, Mateusz Nowak, Qintong Xie, Michelle Yilin Feng, Peter Chin

Abstract: Current approximate Coarse Correlated Equilibria (CCE) algorithms struggle with equilibrium approximation for games in large stochastic environments but are theoretically guaranteed to converge to a strong solution concept. In contrast, modern Reinforcement Learning (RL) algorithms provide faster training yet yield weaker solutions. We introduce Exp3-IXrl, a blend of RL and game-theoretic approaches that separates the RL agent's action selection from the equilibrium computation while preserving the integrity of the learning process. We demonstrate that our algorithm expands the application of equilibrium approximation algorithms to new environments. Specifically, we show improved performance in a complex and adversarial cybersecurity network environment, the Cyber Operations Research Gym, and in classical multi-armed bandit settings.
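
For readers unfamiliar with the bandit algorithm the name Exp3-IXrl alludes to, here is a minimal sketch of standard Exp3-IX (exponential weights with implicit exploration) on a multi-armed bandit. It is not the paper's Exp3-IXrl: the RL component and the separation of action selection from equilibrium computation are not shown. The function name `exp3_ix`, the `loss_fn` callback, and the parameter choices (`eta`, with `gamma = eta / 2`) are assumptions made for illustration, and losses are assumed to lie in [0, 1].

```python
import math
import random


def exp3_ix(loss_fn, n_arms, horizon, rng):
    """Standard Exp3-IX sketch for losses in [0, 1] (illustrative only)."""
    # Commonly used tuning; assumed here, not taken from the paper.
    eta = math.sqrt(2.0 * math.log(n_arms) / (n_arms * horizon))
    gamma = eta / 2.0  # implicit-exploration parameter

    cum_loss_est = [0.0] * n_arms  # cumulative loss estimates per arm
    pulls = [0] * n_arms

    for _ in range(horizon):
        # Exponential weights; subtract the minimum for numerical stability.
        base = min(cum_loss_est)
        weights = [math.exp(-eta * (c - base)) for c in cum_loss_est]
        total = sum(weights)
        probs = [w / total for w in weights]

        arm = rng.choices(range(n_arms), weights=probs)[0]
        loss = loss_fn(arm, rng)

        # IX estimator: dividing by (p + gamma) instead of p trades a small
        # bias for lower variance. Only the pulled arm is updated.
        cum_loss_est[arm] += loss / (probs[arm] + gamma)
        pulls[arm] += 1

    return pulls, probs


def bernoulli_loss(means):
    """Hypothetical helper: returns a loss function with Bernoulli losses."""
    return lambda arm, rng: 1.0 if rng.random() < means[arm] else 0.0
```

As a usage example, `exp3_ix(bernoulli_loss([0.1, 0.5, 0.9]), 3, 5000, random.Random(0))` returns the per-arm pull counts and the final sampling distribution; the arm with the lowest mean loss should receive most of the pulls.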

Tree Search for Simultaneous Move Games via Equilibrium Approximation

Jun 14, 2024

Ryan Yu, Alex Olshevsky, Peter Chin

Abstract: Neural network-supported tree search has shown strong results in a variety of perfect information multi-agent tasks. However, the performance of these methods on partial information games has generally been below that of competing approaches. Here we study the class of simultaneous-move games, a subclass of partial information games that is most similar to perfect information games: both agents know the game state with the exception of the opponent's move, which is revealed only after each agent makes its own move. Simultaneous-move games include popular benchmarks such as Google Research Football and StarCraft. In this study we answer the question: can we take tree search algorithms trained through self-play from perfect information settings and adapt them to simultaneous-move games without significant loss of performance? We answer this question by deriving a practical method that attempts to approximate a coarse correlated equilibrium as a subroutine within a tree search. Our algorithm works on cooperative, competitive, and mixed tasks. Our results surpass those of the current best MARL algorithms on a wide range of accepted baseline environments.

* 9 pages, 5 tables, 1 figure
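
To make the idea of approximating a coarse correlated equilibrium concrete, the sketch below runs the Hedge (multiplicative weights) algorithm in self-play on a single two-player matrix game, the kind of simultaneous-move decision that can arise at one node of a search tree. It is a textbook no-regret procedure, not the method derived in the paper: there is no tree search and no neural network. The function names, the step sizes, and the example payoff matrices are assumptions chosen for illustration. The time-averaged joint distribution of play is an approximate CCE, and the reported gap is the largest gain any player could obtain by committing to one fixed action.

```python
import math


def hedge_probs(cum_payoff, eta):
    """Softmax over cumulative payoffs (shifted for numerical stability)."""
    top = max(cum_payoff)
    weights = [math.exp(eta * (c - top)) for c in cum_payoff]
    total = sum(weights)
    return [w / total for w in weights]


def self_play_cce(A, B, rounds):
    """Hedge self-play on a bimatrix game; A and B are the row and column
    players' payoff matrices. Returns (joint distribution, CCE gap)."""
    n, m = len(A), len(A[0])

    def width(M):
        flat = [v for row in M for v in row]
        return (max(flat) - min(flat)) or 1.0

    # Step sizes scaled by the payoff range (a standard, assumed choice).
    eta_row = math.sqrt(8 * math.log(n) / rounds) / width(A)
    eta_col = math.sqrt(8 * math.log(m) / rounds) / width(B)

    cum_row, cum_col = [0.0] * n, [0.0] * m
    joint = [[0.0] * m for _ in range(n)]
    realized_row = realized_col = 0.0

    for _ in range(rounds):
        x = hedge_probs(cum_row, eta_row)
        y = hedge_probs(cum_col, eta_col)
        # Expected payoff of each pure action against the opponent's mix.
        u_row = [sum(A[i][j] * y[j] for j in range(m)) for i in range(n)]
        u_col = [sum(x[i] * B[i][j] for i in range(n)) for j in range(m)]
        realized_row += sum(x[i] * u_row[i] for i in range(n))
        realized_col += sum(y[j] * u_col[j] for j in range(m))
        for i in range(n):
            cum_row[i] += u_row[i]
            for j in range(m):
                joint[i][j] += x[i] * y[j] / rounds
        for j in range(m):
            cum_col[j] += u_col[j]

    # Average external regret of each player equals its CCE deviation gain.
    gap = max(max(cum_row) - realized_row,
              max(cum_col) - realized_col) / rounds
    return joint, gap
```

For instance, `self_play_cce([[2, 0], [0, 1]], [[1, 0], [0, 2]], 4000)` runs the procedure on a small coordination game; the returned gap shrinks toward zero as `rounds` grows, at a rate on the order of the inverse square root of the number of rounds.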
