Title: Improved Analysis of UCRL2 with Empirical Bernstein Inequality

Jul 10, 2020

Ronan Fruit, Matteo Pirotta, Alessandro Lazaric


Abstract: We consider the problem of exploration-exploitation in communicating Markov Decision Processes. We provide an analysis of UCRL2 with Empirical Bernstein inequalities (UCRL2B). For any MDP with $S$ states, $A$ actions, $\Gamma \leq S$ next states and diameter $D$, the regret of UCRL2B is bounded as $\widetilde{O}(\sqrt{D\Gamma S A T})$.

* Document in support of the tutorial at ALT 2019
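
To get a feel for how the bound in the abstract scales, the following minimal Python sketch evaluates its leading term $\sqrt{D\Gamma S A T}$ with all constants and logarithmic factors dropped. The function names are hypothetical and chosen only for illustration. The comparison against $D S \sqrt{A T}$, the leading term usually quoted for the original UCRL2 analysis, is an assumption brought in from outside this abstract; neither function returns an actual regret guarantee.

```python
import math


def ucrl2b_regret_scale(S, A, Gamma, D, T):
    """Leading term sqrt(D * Gamma * S * A * T) of the bound in the abstract.

    Constants and log factors hidden by the O-tilde notation are ignored,
    so the value is only meaningful for comparing scalings.
    """
    if not 1 <= Gamma <= S:
        raise ValueError("the abstract assumes 1 <= Gamma <= S")
    return math.sqrt(D * Gamma * S * A * T)


def ucrl2_regret_scale(S, A, D, T):
    """Leading term D * S * sqrt(A * T).

    Assumption: this is the scaling commonly cited for the original UCRL2
    analysis; it is not stated in the abstract above.
    """
    return D * S * math.sqrt(A * T)


if __name__ == "__main__":
    S, A, Gamma, D, T = 10, 2, 10, 5, 1000
    new = ucrl2b_regret_scale(S, A, Gamma, D, T)
    old = ucrl2_regret_scale(S, A, D, T)
    # The ratio simplifies to sqrt(Gamma / (D * S)), independent of A and T.
    print(f"UCRL2B scale: {new:.1f}, UCRL2 scale: {old:.1f}, ratio: {new / old:.3f}")
```

Under these assumptions the ratio of the two leading terms is $\sqrt{\Gamma / (D S)}$, so the difference grows with the diameter $D$ and with how sparse the transitions are (small $\Gamma$ relative to $S$).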
