Picture for Weizheng Qiao

Weizheng Qiao

Dropout Strategy in Reinforcement Learning: Limiting the Surrogate Objective Variance in Policy Optimization Methods

Add code
Nov 03, 2023
Viaarxiv icon