Brendan Bennett

Back to Square One: Superhuman Performance in Chutes and Ladders Through Deep Neural Networks and Tree Search

Apr 01, 2021

Incrementally Learning Functions of the Return

Jul 05, 2019

Predicting Periodicity with Temporal Difference Learning

Sep 20, 2018

Directly Estimating the Variance of the λ-Return Using Temporal-Difference Methods

Feb 14, 2018