Picture for Xiaohan Hu

Xiaohan Hu

In-Sample Policy Iteration for Offline Reinforcement Learning

Add code
Jun 09, 2023
Viaarxiv icon