Rahul Meshram

Faster Q-Learning Algorithms for Restless Bandits

Sep 06, 2024
Via arXiv

Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes

Sep 06, 2024
Via arXiv

Indexability of Finite State Restless Multi-Armed Bandit and Rollout Policy

Apr 30, 2023
Via arXiv

Indexability and Rollout Policy for Multi-State Partially Observable Restless Bandits

Jul 30, 2021
Via arXiv

Monte Carlo Rollout Policy for Recommendation Systems with Dynamic User Behavior

Feb 08, 2021
Via arXiv

Simulation Based Algorithms for Markov Decision Processes and Multi-Action Restless Bandits

Jul 25, 2020
Via arXiv

Sequential Decision Making under Uncertainty with Dynamic Resource Constraints

Apr 18, 2019
Via arXiv

Learning Recommendations While Influencing Interests

Mar 23, 2018
Via arXiv

Optimal Recommendation to Users that React: Online Learning for a Class of POMDPs

Mar 30, 2016
Via arXiv