Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Perturbation-based Analysis of Compositional Data

Nov 30, 2023

Anton Rask Lundborg, Niklas Pfister

Figure 1 for Perturbation-based Analysis of Compositional Data

Figure 2 for Perturbation-based Analysis of Compositional Data

Figure 3 for Perturbation-based Analysis of Compositional Data

Figure 4 for Perturbation-based Analysis of Compositional Data

Share this with someone who'll enjoy it:

Abstract:Existing statistical methods for compositional data analysis are inadequate for many modern applications for two reasons. First, modern compositional datasets, for example in microbiome research, display traits such as high-dimensionality and sparsity that are poorly modelled with traditional approaches. Second, assessing -- in an unbiased way -- how summary statistics of a composition (e.g., racial diversity) affect a response variable is not straightforward. In this work, we propose a framework based on hypothetical data perturbations that addresses both issues. Unlike existing methods for compositional data, we do not transform the data and instead use perturbations to define interpretable statistical functionals on the compositions themselves, which we call average perturbation effects. These average perturbation effects, which can be employed in many applications, naturally account for confounding that biases frequently used marginal dependence analyses. We show how average perturbation effects can be estimated efficiently by deriving a perturbation-dependent reparametrization and applying semiparametric estimation techniques. We analyze the proposed estimators empirically on simulated data and demonstrate advantages over existing techniques on US census and microbiome data. For all proposed estimators, we provide confidence intervals with uniform asymptotic coverage guarantees.

View paper on

Share this with someone who'll enjoy it:

Title:Perturbation-based Analysis of Compositional Data

Paper and Code