Picture for Alireza Masoumian

Alireza Masoumian

Model Selection for Average Reward RL with Application to Utility Maximization in Repeated Games

Add code
Nov 09, 2024
Viaarxiv icon

Sequential Estimation under Multiple Resources: a Bandit Point of View

Add code
Sep 29, 2021
Figure 1 for Sequential Estimation under Multiple Resources: a Bandit Point of View
Viaarxiv icon