Picture for Dustin Morrill

Dustin Morrill

Composing Efficient, Robust Tests for Policy Selection

Add code
Jun 12, 2023
Viaarxiv icon

Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration

Add code
Jun 04, 2022
Figure 1 for Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration
Viaarxiv icon

Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections

Add code
May 24, 2022
Figure 1 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections
Figure 2 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections
Figure 3 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections
Figure 4 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections
Viaarxiv icon

The Partially Observable History Process

Add code
Nov 15, 2021
Figure 1 for The Partially Observable History Process
Viaarxiv icon

Learning to Be Cautious

Add code
Oct 29, 2021
Figure 1 for Learning to Be Cautious
Figure 2 for Learning to Be Cautious
Figure 3 for Learning to Be Cautious
Figure 4 for Learning to Be Cautious
Viaarxiv icon

Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games

Add code
Feb 13, 2021
Figure 1 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
Figure 2 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
Figure 3 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
Figure 4 for Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
Viaarxiv icon

Hindsight and Sequential Rationality of Correlated Play

Add code
Dec 17, 2020
Figure 1 for Hindsight and Sequential Rationality of Correlated Play
Figure 2 for Hindsight and Sequential Rationality of Correlated Play
Figure 3 for Hindsight and Sequential Rationality of Correlated Play
Figure 4 for Hindsight and Sequential Rationality of Correlated Play
Viaarxiv icon

The Advantage Regret-Matching Actor-Critic

Add code
Aug 27, 2020
Figure 1 for The Advantage Regret-Matching Actor-Critic
Figure 2 for The Advantage Regret-Matching Actor-Critic
Figure 3 for The Advantage Regret-Matching Actor-Critic
Figure 4 for The Advantage Regret-Matching Actor-Critic
Viaarxiv icon

Alternative Function Approximation Parameterizations for Solving Games: An Analysis of $f$-Regression Counterfactual Regret Minimization

Add code
Dec 06, 2019
Figure 1 for Alternative Function Approximation Parameterizations for Solving Games: An Analysis of $f$-Regression Counterfactual Regret Minimization
Figure 2 for Alternative Function Approximation Parameterizations for Solving Games: An Analysis of $f$-Regression Counterfactual Regret Minimization
Figure 3 for Alternative Function Approximation Parameterizations for Solving Games: An Analysis of $f$-Regression Counterfactual Regret Minimization
Viaarxiv icon

OpenSpiel: A Framework for Reinforcement Learning in Games

Add code
Oct 10, 2019
Figure 1 for OpenSpiel: A Framework for Reinforcement Learning in Games
Figure 2 for OpenSpiel: A Framework for Reinforcement Learning in Games
Figure 3 for OpenSpiel: A Framework for Reinforcement Learning in Games
Figure 4 for OpenSpiel: A Framework for Reinforcement Learning in Games
Viaarxiv icon