Picture for Brett Daley

Brett Daley

On Centralized Critics in Multi-Agent Reinforcement Learning

Add code
Aug 26, 2024
Viaarxiv icon

Demystifying the Recency Heuristic in Temporal-Difference Learning

Add code
Jun 18, 2024
Figure 1 for Demystifying the Recency Heuristic in Temporal-Difference Learning
Figure 2 for Demystifying the Recency Heuristic in Temporal-Difference Learning
Figure 3 for Demystifying the Recency Heuristic in Temporal-Difference Learning
Figure 4 for Demystifying the Recency Heuristic in Temporal-Difference Learning
Viaarxiv icon

Compound Returns Reduce Variance in Reinforcement Learning

Add code
Feb 06, 2024
Viaarxiv icon

Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning

Add code
Jan 26, 2023
Viaarxiv icon

Adaptive Tree Backup Algorithms for Temporal-Difference Reinforcement Learning

Add code
Jun 04, 2022
Figure 1 for Adaptive Tree Backup Algorithms for Temporal-Difference Reinforcement Learning
Viaarxiv icon

Improving the Efficiency of Off-Policy Reinforcement Learning by Accounting for Past Decisions

Add code
Dec 23, 2021
Viaarxiv icon

Virtual Replay Cache

Add code
Dec 06, 2021
Figure 1 for Virtual Replay Cache
Figure 2 for Virtual Replay Cache
Figure 3 for Virtual Replay Cache
Figure 4 for Virtual Replay Cache
Viaarxiv icon

Human-Level Control without Server-Grade Hardware

Add code
Nov 01, 2021
Figure 1 for Human-Level Control without Server-Grade Hardware
Figure 2 for Human-Level Control without Server-Grade Hardware
Figure 3 for Human-Level Control without Server-Grade Hardware
Figure 4 for Human-Level Control without Server-Grade Hardware
Viaarxiv icon

Investigating Alternatives to the Root Mean Square for Adaptive Gradient Methods

Add code
Jun 10, 2021
Figure 1 for Investigating Alternatives to the Root Mean Square for Adaptive Gradient Methods
Figure 2 for Investigating Alternatives to the Root Mean Square for Adaptive Gradient Methods
Figure 3 for Investigating Alternatives to the Root Mean Square for Adaptive Gradient Methods
Figure 4 for Investigating Alternatives to the Root Mean Square for Adaptive Gradient Methods
Viaarxiv icon

Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning

Add code
Feb 22, 2021
Figure 1 for Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning
Figure 2 for Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning
Figure 3 for Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning
Viaarxiv icon