Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Evan Munro

Causal Estimation of User Learning in Personalized Systems

Jun 01, 2023

Evan Munro, David Jones, Jennifer Brennan, Roland Nelet, Vahab Mirrokni, Jean Pouget-Abadie

Figure 1 for Causal Estimation of User Learning in Personalized Systems

Figure 2 for Causal Estimation of User Learning in Personalized Systems

Figure 3 for Causal Estimation of User Learning in Personalized Systems

Figure 4 for Causal Estimation of User Learning in Personalized Systems

Abstract:In online platforms, the impact of a treatment on an observed outcome may change over time as 1) users learn about the intervention, and 2) the system personalization, such as individualized recommendations, change over time. We introduce a non-parametric causal model of user actions in a personalized system. We show that the Cookie-Cookie-Day (CCD) experiment, designed for the measurement of the user learning effect, is biased when there is personalization. We derive new experimental designs that intervene in the personalization system to generate the variation necessary to separately identify the causal effect mediated through user learning and personalization. Making parametric assumptions allows for the estimation of long-term causal effects based on medium-term experiments. In simulations, we show that our new designs successfully recover the dynamic causal effects of interest.

* EC 2023

Via

Access Paper or Ask Questions

Learning to Personalize Treatments When Agents Are Strategic

Nov 12, 2020

Evan Munro

Figure 1 for Learning to Personalize Treatments When Agents Are Strategic

Figure 2 for Learning to Personalize Treatments When Agents Are Strategic

Figure 3 for Learning to Personalize Treatments When Agents Are Strategic

Figure 4 for Learning to Personalize Treatments When Agents Are Strategic

Abstract:There is increasing interest in using observed individual-level data to formulate personalized policy. Examples of this include heterogeneous pricing, individualized credit offers, and targeted social programs. This paper provides a general model of how personalized policy creates incentives for individuals to modify their behavior to obtain a better treatment. For a given planner objective, we show that standard estimators based on repeated risk minimization produce a suboptimal policy. We propose a dynamic experiment that estimates the optimal treatment allocation function when agents are strategic and has regret that decays at a linear rate. A key insight is that random variation in how treatment assignment depends on observed characteristics is required, and that randomized treatment assignment alone is not sufficient to identify the optimal policy. We show this experimental method outperforms alternative methods that do not learn strategic effects in simulations and in a small MTurk experiment.

* 27 pages, 5 figures

Via

Access Paper or Ask Questions