Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

David Milec

Generation of Games for Opponent Model Differentiation

Nov 28, 2023

David Milec, Viliam Lisý, Christopher Kiekintveld

Figure 1 for Generation of Games for Opponent Model Differentiation

Figure 2 for Generation of Games for Opponent Model Differentiation

Abstract:Protecting against adversarial attacks is a common multiagent problem. Attackers in the real world are predominantly human actors, and the protection methods often incorporate opponent models to improve the performance when facing humans. Previous results show that modeling human behavior can significantly improve the performance of the algorithms. However, modeling humans correctly is a complex problem, and the models are often simplified and assume humans make mistakes according to some distribution or train parameters for the whole population from which they sample. In this work, we use data gathered by psychologists who identified personality types that increase the likelihood of performing malicious acts. However, in the previous work, the tests on a handmade game could not show strategic differences between the models. We created a novel model that links its parameters to psychological traits. We optimized over parametrized games and created games in which the differences are profound. Our work can help with automatic game generation when we need a game in which some models will behave differently and to identify situations in which the models do not align.

* 4 pages

Via

Access Paper or Ask Questions

Fast Algorithms for Poker Require Modelling it as a Sequential Bayesian Game

Dec 20, 2021

Vojtěch Kovařík, David Milec, Michal Šustr, Dominik Seitz, Viliam Lisý

Figure 1 for Fast Algorithms for Poker Require Modelling it as a Sequential Bayesian Game

Figure 2 for Fast Algorithms for Poker Require Modelling it as a Sequential Bayesian Game

Abstract:Many recent results in imperfect information games were only formulated for, or evaluated on, poker and poker-like games such as liar's dice. We argue that sequential Bayesian games constitute a natural class of games for generalizing these results. In particular, this model allows for an elegant formulation of the counterfactual regret minimization algorithm, called public-state CFR (PS-CFR), which naturally lends itself to an efficient implementation. Empirically, solving a poker subgame with 10^7 states by public-state CFR takes 3 minutes and 700 MB while a comparable version of vanilla CFR takes 5.5 hours and 20 GB. Additionally, the public-state formulation of CFR opens up the possibility for exploiting domain-specific assumptions, leading to a quadratic reduction in asymptotic complexity (and a further empirical speedup) over vanilla CFR in poker and other domains. Overall, this suggests that the ability to represent poker as a sequential Bayesian game played a key role in the success of CFR-based methods. Finally, we extend public-state CFR to general extensive-form games, arguing that this extension enjoys some - but not all - of the benefits of the version for sequential Bayesian games.

* To appear at Reinforcement Learning in Games workshop at AAAI 2022

Via

Access Paper or Ask Questions

Complexity and Algorithms for Exploiting Quantal Opponents in Large Two-Player Games

Sep 30, 2020

David Milec, Jakub Černý, Viliam Lisý, Bo An

Figure 1 for Complexity and Algorithms for Exploiting Quantal Opponents in Large Two-Player Games

Figure 2 for Complexity and Algorithms for Exploiting Quantal Opponents in Large Two-Player Games

Figure 3 for Complexity and Algorithms for Exploiting Quantal Opponents in Large Two-Player Games

Figure 4 for Complexity and Algorithms for Exploiting Quantal Opponents in Large Two-Player Games

Abstract:Solution concepts of traditional game theory assume entirely rational players; therefore, their ability to exploit subrational opponents is limited. One type of subrationality that describes human behavior well is the quantal response. While there exist algorithms for computing solutions against quantal opponents, they either do not scale or may provide strategies that are even worse than the entirely-rational Nash strategies. This paper aims to analyze and propose scalable algorithms for computing effective and robust strategies against a quantal opponent in normal-form and extensive-form games. Our contributions are: (1) we define two different solution concepts related to exploiting quantal opponents and analyze their properties; (2) we prove that computing these solutions is computationally hard; (3) therefore, we evaluate several heuristic approximations based on scalable counterfactual regret minimization (CFR); and (4) we identify a CFR variant that exploits the bounded opponents better than the previously used variants while being less exploitable by the worst-case perfectly-rational opponent.

* 14 pages, 11 figures, submitted to AAAI 2021

Via

Access Paper or Ask Questions