Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Label Shift Estimators for Non-Ignorable Missing Data

Oct 27, 2023

Andrew C. Miller, Joseph Futoma

Figure 1 for Label Shift Estimators for Non-Ignorable Missing Data

Figure 2 for Label Shift Estimators for Non-Ignorable Missing Data

Figure 3 for Label Shift Estimators for Non-Ignorable Missing Data

Figure 4 for Label Shift Estimators for Non-Ignorable Missing Data

Share this with someone who'll enjoy it:

Abstract:We consider the problem of estimating the mean of a random variable Y subject to non-ignorable missingness, i.e., where the missingness mechanism depends on Y . We connect the auxiliary proxy variable framework for non-ignorable missingness (West and Little, 2013) to the label shift setting (Saerens et al., 2002). Exploiting this connection, we construct an estimator for non-ignorable missing data that uses high-dimensional covariates (or proxies) without the need for a generative model. In synthetic and semi-synthetic experiments, we study the behavior of the proposed estimator, comparing it to commonly used ignorable estimators in both well-specified and misspecified settings. Additionally, we develop a score to assess how consistent the data are with the label shift assumption. We use our approach to estimate disease prevalence using a large health survey, comparing ignorable and non-ignorable approaches. We show that failing to account for non-ignorable missingness can have profound consequences on conclusions drawn from non-representative samples.

* 8 pages, 5 figures

View paper on

Share this with someone who'll enjoy it:

Title:Label Shift Estimators for Non-Ignorable Missing Data

Paper and Code