Amy Greenwald

A Unifying View of Linear Function Approximation in Off-Policy RL Through Matrix Splitting and Preconditioning

Jan 03, 2025
Via arXiv

Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration

Jun 04, 2022
Via arXiv

Robust No-Regret Learning in Min-Max Stackelberg Games

Apr 13, 2022
Via arXiv

Convex-Concave Min-Max Stackelberg Games

Oct 05, 2021
Via arXiv

Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games

Feb 13, 2021
Via arXiv

Hindsight and Sequential Rationality of Correlated Play

Dec 17, 2020
Via arXiv

RoxyBot-06: Stochastic Prediction and Optimization in TAC Travel

Jan 16, 2014
Via arXiv