Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Evan Dong

Addressing Discretization-Induced Bias in Demographic Prediction

May 27, 2024

Evan Dong, Aaron Schein, Yixin Wang, Nikhil Garg

Figure 1 for Addressing Discretization-Induced Bias in Demographic Prediction

Figure 2 for Addressing Discretization-Induced Bias in Demographic Prediction

Figure 3 for Addressing Discretization-Induced Bias in Demographic Prediction

Figure 4 for Addressing Discretization-Induced Bias in Demographic Prediction

Abstract:Racial and other demographic imputation is necessary for many applications, especially in auditing disparities and outreach targeting in political campaigns. The canonical approach is to construct continuous predictions -- e.g., based on name and geography -- and then to $\textit{discretize}$ the predictions by selecting the most likely class (argmax). We study how this practice produces $\textit{discretization bias}$. In particular, we show that argmax labeling, as used by a prominent commercial voter file vendor to impute race/ethnicity, results in a substantial under-count of African-American voters, e.g., by 28.2% points in North Carolina. This bias can have substantial implications in downstream tasks that use such labels. We then introduce a $\textit{joint optimization}$ approach -- and a tractable $\textit{data-driven thresholding}$ heuristic -- that can eliminate this bias, with negligible individual-level accuracy loss. Finally, we theoretically analyze discretization bias, show that calibrated continuous models are insufficient to eliminate it, and that an approach such as ours is necessary. Broadly, we warn researchers and practitioners against discretizing continuous demographic predictions without considering downstream consequences.

* A version of this paper was accepted to the 2024 ACM Conference on Fairness, Accountability, and Transparency

Via

Access Paper or Ask Questions

Accounting for AI and Users Shaping One Another: The Role of Mathematical Models

Apr 18, 2024

Sarah Dean, Evan Dong, Meena Jagadeesan, Liu Leqi

Figure 1 for Accounting for AI and Users Shaping One Another: The Role of Mathematical Models

Figure 2 for Accounting for AI and Users Shaping One Another: The Role of Mathematical Models

Abstract:As AI systems enter into a growing number of societal domains, these systems increasingly shape and are shaped by user preferences, opinions, and behaviors. However, the design of AI systems rarely accounts for how AI and users shape one another. In this position paper, we argue for the development of formal interaction models which mathematically specify how AI and users shape one another. Formal interaction models can be leveraged to (1) specify interactions for implementation, (2) monitor interactions through empirical analysis, (3) anticipate societal impacts via counterfactual analysis, and (4) control societal impacts via interventions. The design space of formal interaction models is vast, and model design requires careful consideration of factors such as style, granularity, mathematical complexity, and measurability. Using content recommender systems as a case study, we critically examine the nascent literature of formal interaction models with respect to these use-cases and design axes. More broadly, we call for the community to leverage formal interaction models when designing, evaluating, or auditing any AI system which interacts with users.

Via

Access Paper or Ask Questions