Picture for Wesley Chung

Wesley Chung

The Role of Baselines in Policy Gradient Optimization

Add code
Jan 16, 2023
Viaarxiv icon

Beyond variance reduction: Understanding the true impact of baselines on policy optimization

Add code
Aug 31, 2020
Figure 1 for Beyond variance reduction: Understanding the true impact of baselines on policy optimization
Figure 2 for Beyond variance reduction: Understanding the true impact of baselines on policy optimization
Figure 3 for Beyond variance reduction: Understanding the true impact of baselines on policy optimization
Figure 4 for Beyond variance reduction: Understanding the true impact of baselines on policy optimization
Viaarxiv icon

Incrementally Learning Functions of the Return

Add code
Jul 05, 2019
Figure 1 for Incrementally Learning Functions of the Return
Figure 2 for Incrementally Learning Functions of the Return
Viaarxiv icon

Importance Resampling for Off-policy Prediction

Add code
Jun 11, 2019
Figure 1 for Importance Resampling for Off-policy Prediction
Figure 2 for Importance Resampling for Off-policy Prediction
Figure 3 for Importance Resampling for Off-policy Prediction
Figure 4 for Importance Resampling for Off-policy Prediction
Viaarxiv icon

High-confidence error estimates for learned value functions

Add code
Aug 28, 2018
Figure 1 for High-confidence error estimates for learned value functions
Figure 2 for High-confidence error estimates for learned value functions
Viaarxiv icon