Picture for Martin Schmid

Martin Schmid

Learning to Beat ByteRL: Exploitability of Collectible Card Game Agents

Add code
Apr 25, 2024
Viaarxiv icon

Learning not to Regret

Add code
Mar 02, 2023
Viaarxiv icon

Player of Games

Add code
Dec 06, 2021
Figure 1 for Player of Games
Figure 2 for Player of Games
Figure 3 for Player of Games
Figure 4 for Player of Games
Viaarxiv icon

Search in Imperfect Information Games

Add code
Nov 10, 2021
Figure 1 for Search in Imperfect Information Games
Figure 2 for Search in Imperfect Information Games
Figure 3 for Search in Imperfect Information Games
Figure 4 for Search in Imperfect Information Games
Viaarxiv icon

Solving Common-Payoff Games with Approximate Policy Iteration

Add code
Jan 11, 2021
Figure 1 for Solving Common-Payoff Games with Approximate Policy Iteration
Figure 2 for Solving Common-Payoff Games with Approximate Policy Iteration
Figure 3 for Solving Common-Payoff Games with Approximate Policy Iteration
Figure 4 for Solving Common-Payoff Games with Approximate Policy Iteration
Viaarxiv icon

The Advantage Regret-Matching Actor-Critic

Add code
Aug 27, 2020
Figure 1 for The Advantage Regret-Matching Actor-Critic
Figure 2 for The Advantage Regret-Matching Actor-Critic
Figure 3 for The Advantage Regret-Matching Actor-Critic
Figure 4 for The Advantage Regret-Matching Actor-Critic
Viaarxiv icon

Approximate exploitability: Learning a best response in large games

Add code
Apr 20, 2020
Figure 1 for Approximate exploitability: Learning a best response in large games
Figure 2 for Approximate exploitability: Learning a best response in large games
Figure 3 for Approximate exploitability: Learning a best response in large games
Figure 4 for Approximate exploitability: Learning a best response in large games
Viaarxiv icon

Low-Variance and Zero-Variance Baselines for Extensive-Form Games

Add code
Jul 22, 2019
Figure 1 for Low-Variance and Zero-Variance Baselines for Extensive-Form Games
Figure 2 for Low-Variance and Zero-Variance Baselines for Extensive-Form Games
Figure 3 for Low-Variance and Zero-Variance Baselines for Extensive-Form Games
Viaarxiv icon

Rethinking Formal Models of Partially Observable Multiagent Decision Making

Add code
Jun 26, 2019
Figure 1 for Rethinking Formal Models of Partially Observable Multiagent Decision Making
Figure 2 for Rethinking Formal Models of Partially Observable Multiagent Decision Making
Figure 3 for Rethinking Formal Models of Partially Observable Multiagent Decision Making
Figure 4 for Rethinking Formal Models of Partially Observable Multiagent Decision Making
Viaarxiv icon

Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines

Add code
Sep 09, 2018
Figure 1 for Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Figure 2 for Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Figure 3 for Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Figure 4 for Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Viaarxiv icon