Picture for Eric Graves

Eric Graves

Value-aware Importance Weighting for Off-policy Reinforcement Learning

Add code
Jun 27, 2023
Viaarxiv icon

Importance Sampling Placement in Off-Policy Temporal-Difference Methods

Add code
Mar 18, 2022
Figure 1 for Importance Sampling Placement in Off-Policy Temporal-Difference Methods
Figure 2 for Importance Sampling Placement in Off-Policy Temporal-Difference Methods
Viaarxiv icon

Off-Policy Actor-Critic with Emphatic Weightings

Add code
Nov 16, 2021
Figure 1 for Off-Policy Actor-Critic with Emphatic Weightings
Figure 2 for Off-Policy Actor-Critic with Emphatic Weightings
Figure 3 for Off-Policy Actor-Critic with Emphatic Weightings
Figure 4 for Off-Policy Actor-Critic with Emphatic Weightings
Viaarxiv icon

An Off-policy Policy Gradient Theorem Using Emphatic Weightings

Add code
Nov 22, 2018
Figure 1 for An Off-policy Policy Gradient Theorem Using Emphatic Weightings
Figure 2 for An Off-policy Policy Gradient Theorem Using Emphatic Weightings
Figure 3 for An Off-policy Policy Gradient Theorem Using Emphatic Weightings
Figure 4 for An Off-policy Policy Gradient Theorem Using Emphatic Weightings
Viaarxiv icon