Provably Efficient $Q$-learning with Function Approximation via Distribution Shift Error Checking Oracle

Jun 14, 2019
