Abstract: We study the problem of a decision maker who must provide the best possible treatment recommendation based on an experiment. The desirability of the outcome distribution resulting from the policy recommendation is measured through a functional capturing the distributional characteristic that the decision maker is interested in optimizing. This could be, e.g., its inherent inequality, welfare, level of poverty or its distance to a desired outcome distribution. If the functional of interest is not quasi-convex, or if there are constraints, the optimal recommendation may be a mixture of treatments. This vastly expands the set of recommendations that must be considered. We characterize the difficulty of the problem by obtaining lower bounds on maximal expected regret. Furthermore, we propose two regret-optimal policies. The first policy is static and thus applicable irrespective of whether subjects arrive sequentially during the experimental phase or not. The second policy exploits the sequential arrival of subjects by successively eliminating inferior treatments, and thus concentrates the sampling effort where it is most needed.
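For concreteness, a minimal formalization of the objective described above may help; the notation (K treatments with outcome distributions F_1, ..., F_K, a functional T, the unit simplex Delta_K) is illustrative and not taken from the abstract. A recommendation is a weight vector \(\delta\) over treatments, the outcome distribution it induces is the corresponding mixture, and performance is measured by maximal expected regret:
\[
F_\delta := \sum_{i=1}^{K} \delta_i F_i, \qquad
r(\delta) := \sup_{\delta' \in \Delta_K} T(F_{\delta'}) - T(F_\delta), \qquad
\sup_{F_1,\ldots,F_K \in \mathcal{D}} \mathbb{E}\big[r(\hat{\delta})\big],
\]
where \(\hat{\delta}\) is the (data-dependent) recommendation produced from the experiment and \(\mathcal{D}\) is the set of feasible outcome distributions. If T is quasi-convex, then \(T(F_\delta) \le \max_i T(F_i)\) and a single treatment is always optimal; when quasi-convexity fails (or constraints bind), a proper mixture can strictly dominate every single treatment, which is why the set of candidate recommendations expands from K points to the whole simplex.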
Abstract: We consider a multi-armed bandit problem with covariates. Given a realization of the covariate vector, instead of targeting the treatment with the highest conditional expectation, the decision maker targets the treatment that maximizes a general functional of the conditional potential outcome distribution, e.g., a conditional quantile, a trimmed mean, or a socio-economic functional such as an inequality, welfare or poverty measure. We develop expected regret lower bounds for this problem and construct a near minimax optimal assignment policy.
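Under analogous illustrative notation, now with a covariate vector X and conditional outcome distributions \(F_i(\cdot \mid x)\) (again, these symbols are assumptions for illustration rather than the paper's notation), the conditional target and the associated expected regret can be sketched as
\[
\pi^\star(x) \in \arg\max_{i \in \{1,\ldots,K\}} T\big(F_i(\cdot \mid x)\big),
\qquad
\mathbb{E}\Big[\max_{i} T\big(F_i(\cdot \mid X)\big) - T\big(F_{\hat{\pi}(X)}(\cdot \mid X)\big)\Big],
\]
where \(\hat{\pi}\) is the assignment policy learned by the decision maker; the lower bounds and the near minimax optimal policy mentioned above refer to this covariate-conditional regret.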