Picture for Dailin Hu

Dailin Hu

Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks

Add code
Sep 16, 2022
Figure 1 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Figure 2 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Figure 3 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Figure 4 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Viaarxiv icon

Target Entropy Annealing for Discrete Soft Actor-Critic

Add code
Dec 06, 2021
Figure 1 for Target Entropy Annealing for Discrete Soft Actor-Critic
Figure 2 for Target Entropy Annealing for Discrete Soft Actor-Critic
Figure 3 for Target Entropy Annealing for Discrete Soft Actor-Critic
Figure 4 for Target Entropy Annealing for Discrete Soft Actor-Critic
Viaarxiv icon

Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning

Add code
Nov 28, 2021
Figure 1 for Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning
Figure 2 for Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning
Figure 3 for Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning
Figure 4 for Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning
Viaarxiv icon

Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates

Add code
Oct 28, 2021
Figure 1 for Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Figure 2 for Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Figure 3 for Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Figure 4 for Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Viaarxiv icon