Picture for Arghyadip Roy

Arghyadip Roy

Hellinger KL-UCB based Bandit Algorithms for Markovian and i.i.d. Settings

Add code
Sep 14, 2020
Figure 1 for Hellinger KL-UCB based Bandit Algorithms for Markovian and i.i.d. Settings
Figure 2 for Hellinger KL-UCB based Bandit Algorithms for Markovian and i.i.d. Settings
Viaarxiv icon

Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes

Add code
Dec 21, 2019
Figure 1 for Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes
Figure 2 for Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes
Figure 3 for Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes
Viaarxiv icon

A Structure-aware Online Learning Algorithm for Markov Decision Processes

Add code
Nov 28, 2018
Figure 1 for A Structure-aware Online Learning Algorithm for Markov Decision Processes
Figure 2 for A Structure-aware Online Learning Algorithm for Markov Decision Processes
Figure 3 for A Structure-aware Online Learning Algorithm for Markov Decision Processes
Viaarxiv icon