Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:A structured regression approach for evaluating model performance across intersectional subgroups

Jan 26, 2024

Christine Herlihy, Kimberly Truong, Alexandra Chouldechova, Miroslav Dudik

Figure 1 for A structured regression approach for evaluating model performance across intersectional subgroups

Figure 2 for A structured regression approach for evaluating model performance across intersectional subgroups

Figure 3 for A structured regression approach for evaluating model performance across intersectional subgroups

Figure 4 for A structured regression approach for evaluating model performance across intersectional subgroups

Share this with someone who'll enjoy it:

Abstract:Disaggregated evaluation is a central task in AI fairness assessment, with the goal to measure an AI system's performance across different subgroups defined by combinations of demographic or other sensitive attributes. The standard approach is to stratify the evaluation data across subgroups and compute performance metrics separately for each group. However, even for moderately-sized evaluation datasets, sample sizes quickly get small once considering intersectional subgroups, which greatly limits the extent to which intersectional groups are considered in many disaggregated evaluations. In this work, we introduce a structured regression approach to disaggregated evaluation that we demonstrate can yield reliable system performance estimates even for very small subgroups. We also provide corresponding inference strategies for constructing confidence intervals and explore how goodness-of-fit testing can yield insight into the structure of fairness-related harms experienced by intersectional groups. We evaluate our approach on two publicly available datasets, and several variants of semi-synthetic data. The results show that our method is considerably more accurate than the standard approach, especially for small subgroups, and goodness-of-fit testing helps identify the key factors that drive differences in performance.

View paper on

Share this with someone who'll enjoy it:

Title:A structured regression approach for evaluating model performance across intersectional subgroups

Paper and Code