Title: A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms

Mar 27, 2020

Philip Amortila, Doina Precup, Prakash Panangaden, Marc G. Bellemare

Figure 1 for A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms

Abstract: We present a distributional approach to theoretical analyses of reinforcement learning algorithms for constant step-sizes. We demonstrate its effectiveness by presenting simple and unified proofs of convergence for a variety of commonly used methods. We show that value-based methods such as TD($\lambda$) and $Q$-Learning have update rules that are contractive in the space of distributions of functions, thus establishing their exponentially fast convergence to a stationary distribution. We demonstrate that the stationary distribution obtained by any algorithm whose target is an expected Bellman update has a mean equal to the true value function. Furthermore, we establish that the distributions concentrate around their mean as the step-size shrinks. We further analyse the optimistic policy iteration algorithm, for which the contraction property does not hold, and formulate a probabilistic policy improvement property which entails the convergence of the algorithm.
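The two claims about the stationary distribution (its mean equals the true value function, and it concentrates as the step-size shrinks) can be illustrated numerically. The following is a minimal sketch, not the authors' code: it assumes a synchronous TD(0) update with one sampled next state per state, and the two-state Markov reward process, the function names `true_value` and `td0_iterates`, and all parameter values are hypothetical choices made for illustration.

```python
import numpy as np


def true_value(P, r, gamma):
    """Solve the Bellman equation V = r + gamma * P V exactly."""
    n = len(r)
    return np.linalg.solve(np.eye(n) - gamma * P, r)


def td0_iterates(P, r, gamma, alpha, n_steps, burn_in, rng):
    """Run synchronous sampled TD(0) with a constant step-size alpha.

    Assumption for this sketch: every state is updated at every step
    using one independently sampled next state. Returns the iterates
    after burn_in, as an array of shape (n_steps - burn_in, n_states).
    """
    n = len(r)
    cdf = np.cumsum(P, axis=1)
    # Inverse-CDF sampling of a next state for each (step, state) pair.
    u = rng.random((n_steps, n))
    next_states = (u[:, :, None] > cdf[None, :, :]).sum(axis=2)
    next_states = np.minimum(next_states, n - 1)  # guard against round-off

    V = np.zeros(n)
    kept = []
    for k in range(n_steps):
        # The target r + gamma * V(s') is an unbiased sample of the
        # expected Bellman update applied to the current V.
        target = r + gamma * V[next_states[k]]
        V = (1.0 - alpha) * V + alpha * target
        if k >= burn_in:
            kept.append(V.copy())
    return np.array(kept)


if __name__ == "__main__":
    # Hypothetical two-state Markov reward process.
    P = np.array([[0.9, 0.1], [0.2, 0.8]])
    r = np.array([1.0, 0.0])
    gamma = 0.9

    print("true value:", true_value(P, r, gamma))
    for alpha in (0.1, 0.01):
        rng = np.random.default_rng(0)
        its = td0_iterates(P, r, gamma, alpha, 60_000, 10_000, rng)
        print(f"alpha={alpha}: mean={its.mean(axis=0)}, std={its.std(axis=0)}")
```

With a constant step-size the iterates do not settle on a single point; they keep fluctuating. In this sketch the empirical mean of the iterates should lie close to the exact solution for both step-sizes, while the spread should be visibly smaller for the smaller step-size.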

* AISTATS 2020
