Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes

Sep 06, 2024

Vishesh Mittal, Rahul Meshram, Surya Prakash

Figure 1 for Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes

Figure 2 for Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes

Figure 3 for Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes

Figure 4 for Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes

Share this with someone who'll enjoy it:

Abstract:We study the Whittle index learning algorithm for restless multi-armed bandits. We consider index learning algorithm with Q-learning. We first present Q-learning algorithm with exploration policies -- epsilon-greedy, softmax, epsilon-softmax with constant stepsizes. We extend the study of Q-learning to index learning for single-armed restless bandit. The algorithm of index learning is two-timescale variant of stochastic approximation, on slower timescale we update index learning scheme and on faster timescale we update Q-learning assuming fixed index value. In Q-learning updates are in asynchronous manner. We study constant stepsizes two timescale stochastic approximation algorithm. We provide analysis of two-timescale stochastic approximation for index learning with constant stepsizes. Further, we present study on index learning with deep Q-network (DQN) learning and linear function approximation with state-aggregation method. We describe the performance of our algorithms using numerical examples. We have shown that index learning with Q learning, DQN and function approximations learns the Whittle index.

* 14 pages

View paper on

Share this with someone who'll enjoy it:

Title:Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes

Paper and Code