Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jason Loeppky

A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit

Nov 03, 2015

Giuseppe Burtini, Jason Loeppky, Ramon Lawrence

Figure 1 for A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit

Abstract:Adaptive and sequential experiment design is a well-studied area in numerous domains. We survey and synthesize the work of the online statistical learning paradigm referred to as multi-armed bandits integrating the existing research as a resource for a certain class of online experiments. We first explore the traditional stochastic model of a multi-armed bandit, then explore a taxonomic scheme of complications to that model, for each complication relating it to a specific requirement or consideration of the experiment design context. Finally, at the end of the paper, we present a table of known upper-bounds of regret for all studied algorithms providing both perspectives for future theoretical work and a decision-making tool for practitioners looking for theoretical guarantees.

* 49 pages, 1 figure

Via

Access Paper or Ask Questions