Haoxuan Pan

Shanghai Jiao Tong University, Tencent Inc.

Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning

Jan 20, 2023
Via arXiv