Picture for Michael N. Katehakis

Michael N. Katehakis

Optimal Activation of Halting Multi-Armed Bandit Models

Add code
Apr 20, 2023
Viaarxiv icon

Accelerating the Computation of UCB and Related Indices for Reinforcement Learning

Add code
Sep 28, 2019
Figure 1 for Accelerating the Computation of UCB and Related Indices for Reinforcement Learning
Figure 2 for Accelerating the Computation of UCB and Related Indices for Reinforcement Learning
Figure 3 for Accelerating the Computation of UCB and Related Indices for Reinforcement Learning
Figure 4 for Accelerating the Computation of UCB and Related Indices for Reinforcement Learning
Viaarxiv icon

Reinforcement Learning: a Comparison of UCB Versus Alternative Adaptive Policies

Add code
Sep 13, 2019
Figure 1 for Reinforcement Learning: a Comparison of UCB Versus Alternative Adaptive Policies
Figure 2 for Reinforcement Learning: a Comparison of UCB Versus Alternative Adaptive Policies
Viaarxiv icon

Optimal Data Driven Resource Allocation under Multi-Armed Bandit Observations

Add code
Dec 13, 2018
Viaarxiv icon

Asymptotic Behavior of Minimal-Exploration Allocation Policies: Almost Sure, Arbitrarily Slow Growing Regret

Add code
Dec 17, 2015
Viaarxiv icon

Asymptotically Optimal Sequential Experimentation Under Generalized Ranking

Add code
Dec 17, 2015
Viaarxiv icon

Asymptotically Optimal Multi-Armed Bandit Policies under a Cost Constraint

Add code
Dec 17, 2015
Viaarxiv icon

Inventory Control Involving Unknown Demand of Discrete Nonperishable Items - Analysis of a Newsvendor-based Policy

Add code
Oct 22, 2015
Figure 1 for Inventory Control Involving Unknown Demand of Discrete Nonperishable Items - Analysis of a Newsvendor-based Policy
Figure 2 for Inventory Control Involving Unknown Demand of Discrete Nonperishable Items - Analysis of a Newsvendor-based Policy
Figure 3 for Inventory Control Involving Unknown Demand of Discrete Nonperishable Items - Analysis of a Newsvendor-based Policy
Figure 4 for Inventory Control Involving Unknown Demand of Discrete Nonperishable Items - Analysis of a Newsvendor-based Policy
Viaarxiv icon

An Asymptotically Optimal Policy for Uniform Bandits of Unknown Support

Add code
Sep 24, 2015
Figure 1 for An Asymptotically Optimal Policy for Uniform Bandits of Unknown Support
Figure 2 for An Asymptotically Optimal Policy for Uniform Bandits of Unknown Support
Viaarxiv icon

Normal Bandits of Unknown Means and Variances: Asymptotic Optimality, Finite Horizon Regret Bounds, and a Solution to an Open Problem

Add code
Jun 03, 2015
Figure 1 for Normal Bandits of Unknown Means and Variances: Asymptotic Optimality, Finite Horizon Regret Bounds, and a Solution to an Open Problem
Figure 2 for Normal Bandits of Unknown Means and Variances: Asymptotic Optimality, Finite Horizon Regret Bounds, and a Solution to an Open Problem
Figure 3 for Normal Bandits of Unknown Means and Variances: Asymptotic Optimality, Finite Horizon Regret Bounds, and a Solution to an Open Problem
Figure 4 for Normal Bandits of Unknown Means and Variances: Asymptotic Optimality, Finite Horizon Regret Bounds, and a Solution to an Open Problem
Viaarxiv icon