Picture for Shihong Deng

Shihong Deng

Mastering Strategy Card Game via End-to-End Policy and Optimistic Smooth Fictitious Play

Add code
Mar 07, 2023
Viaarxiv icon

An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning

Add code
Jun 01, 2021
Figure 1 for An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning
Figure 2 for An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning
Figure 3 for An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning
Figure 4 for An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning
Viaarxiv icon

CASA: A Bridge Between Gradient of Policy Improvement and Policy Evaluation

Add code
May 27, 2021
Figure 1 for CASA: A Bridge Between Gradient of Policy Improvement and Policy Evaluation
Figure 2 for CASA: A Bridge Between Gradient of Policy Improvement and Policy Evaluation
Figure 3 for CASA: A Bridge Between Gradient of Policy Improvement and Policy Evaluation
Figure 4 for CASA: A Bridge Between Gradient of Policy Improvement and Policy Evaluation
Viaarxiv icon