Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Francisco J. Marmolejo-Cossío

Optimally Deceiving a Learning Leader in Stackelberg Games

Jun 11, 2020

Georgios Birmpas, Jiarui Gan, Alexandros Hollender, Francisco J. Marmolejo-Cossío, Ninad Rajgopal, Alexandros A. Voudouris

Figure 1 for Optimally Deceiving a Learning Leader in Stackelberg Games

Figure 2 for Optimally Deceiving a Learning Leader in Stackelberg Games

Figure 3 for Optimally Deceiving a Learning Leader in Stackelberg Games

Abstract:Recent results in the ML community have revealed that learning algorithms used to compute the optimal strategy for the leader to commit to in a Stackelberg game, are susceptible to manipulation by the follower. Such a learning algorithm operates by querying the best responses or the payoffs of the follower, who consequently can deceive the algorithm by responding as if his payoffs were much different than what they actually are. For this strategic behavior to be successful, the main challenge faced by the follower is to pinpoint the payoffs that would make the learning algorithm compute a commitment so that best responding to it maximizes the follower's utility, according to his true payoffs. While this problem has been considered before, the related literature only focused on the simplified scenario in which the payoff space is finite, thus leaving the general version of the problem unanswered. In this paper, we fill in this gap, by showing that it is always possible for the follower to compute (near-)optimal payoffs for various scenarios about the learning interaction between leader and follower.

Via

Access Paper or Ask Questions

Learning Convex Partitions and Computing Game-theoretic Equilibria from Best Response Queries

Jul 17, 2018

Paul W. Goldberg, Francisco J. Marmolejo-Cossío

Figure 1 for Learning Convex Partitions and Computing Game-theoretic Equilibria from Best Response Queries

Figure 2 for Learning Convex Partitions and Computing Game-theoretic Equilibria from Best Response Queries

Figure 3 for Learning Convex Partitions and Computing Game-theoretic Equilibria from Best Response Queries

Figure 4 for Learning Convex Partitions and Computing Game-theoretic Equilibria from Best Response Queries

Abstract:Suppose that an $m$-simplex is partitioned into $n$ convex regions having disjoint interiors and distinct labels, and we may learn the label of any point by querying it. The learning objective is to know, for any point in the simplex, a label that occurs within some distance $\epsilon$ from that point. We present two algorithms for this task: Constant-Dimension Generalised Binary Search (CD-GBS), which for constant $m$ uses $poly(n, \log \left( \frac{1}{\epsilon} \right))$ queries, and Constant-Region Generalised Binary Search (CR-GBS), which uses CD-GBS as a subroutine and for constant $n$ uses $poly(m, \log \left( \frac{1}{\epsilon} \right))$ queries. We show via Kakutani's fixed-point theorem that these algorithms provide bounds on the best-response query complexity of computing approximate well-supported equilibria of bimatrix games in which one of the players has a constant number of pure strategies. We also partially extend our results to games with multiple players, establishing further query complexity bounds for computing approximate well-supported equilibria in this setting.

Via

Access Paper or Ask Questions