Picture for Raghav Bongole

Raghav Bongole

Information-Theoretic Minimax Regret Bounds for Reinforcement Learning based on Duality

Add code
Oct 21, 2024
Viaarxiv icon