Nai-Chieh Huang

PPO-Clip Attains Global Optimality: Towards Deeper Understandings of Clipping

Dec 19, 2023
Via arXiv

Accelerated Policy Gradient: On the Nesterov Momentum for Reinforcement Learning

Oct 18, 2023
Via arXiv