Picture for Wesley Cowan

Wesley Cowan

Optimal Activation of Halting Multi-Armed Bandit Models

Add code
Apr 20, 2023
Viaarxiv icon

Accelerating the Computation of UCB and Related Indices for Reinforcement Learning

Add code
Sep 28, 2019
Figure 1 for Accelerating the Computation of UCB and Related Indices for Reinforcement Learning
Figure 2 for Accelerating the Computation of UCB and Related Indices for Reinforcement Learning
Figure 3 for Accelerating the Computation of UCB and Related Indices for Reinforcement Learning
Figure 4 for Accelerating the Computation of UCB and Related Indices for Reinforcement Learning
Viaarxiv icon

Reinforcement Learning: a Comparison of UCB Versus Alternative Adaptive Policies

Add code
Sep 13, 2019
Figure 1 for Reinforcement Learning: a Comparison of UCB Versus Alternative Adaptive Policies
Figure 2 for Reinforcement Learning: a Comparison of UCB Versus Alternative Adaptive Policies
Viaarxiv icon

Asymptotic Behavior of Minimal-Exploration Allocation Policies: Almost Sure, Arbitrarily Slow Growing Regret

Add code
Dec 17, 2015
Viaarxiv icon

Asymptotically Optimal Sequential Experimentation Under Generalized Ranking

Add code
Dec 17, 2015
Viaarxiv icon

An Asymptotically Optimal Policy for Uniform Bandits of Unknown Support

Add code
Sep 24, 2015
Figure 1 for An Asymptotically Optimal Policy for Uniform Bandits of Unknown Support
Figure 2 for An Asymptotically Optimal Policy for Uniform Bandits of Unknown Support
Viaarxiv icon

Normal Bandits of Unknown Means and Variances: Asymptotic Optimality, Finite Horizon Regret Bounds, and a Solution to an Open Problem

Add code
Jun 03, 2015
Figure 1 for Normal Bandits of Unknown Means and Variances: Asymptotic Optimality, Finite Horizon Regret Bounds, and a Solution to an Open Problem
Figure 2 for Normal Bandits of Unknown Means and Variances: Asymptotic Optimality, Finite Horizon Regret Bounds, and a Solution to an Open Problem
Figure 3 for Normal Bandits of Unknown Means and Variances: Asymptotic Optimality, Finite Horizon Regret Bounds, and a Solution to an Open Problem
Figure 4 for Normal Bandits of Unknown Means and Variances: Asymptotic Optimality, Finite Horizon Regret Bounds, and a Solution to an Open Problem
Viaarxiv icon