Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Anna Katrine Jørgensen

On the Independence of Association Bias and Empirical Fairness in Language Models

Apr 20, 2023

Laura Cabello, Anna Katrine Jørgensen, Anders Søgaard

Figure 1 for On the Independence of Association Bias and Empirical Fairness in Language Models

Figure 2 for On the Independence of Association Bias and Empirical Fairness in Language Models

Figure 3 for On the Independence of Association Bias and Empirical Fairness in Language Models

Figure 4 for On the Independence of Association Bias and Empirical Fairness in Language Models

Abstract:The societal impact of pre-trained language models has prompted researchers to probe them for strong associations between protected attributes and value-loaded terms, from slur to prestigious job titles. Such work is said to probe models for bias or fairness-or such probes 'into representational biases' are said to be 'motivated by fairness'-suggesting an intimate connection between bias and fairness. We provide conceptual clarity by distinguishing between association biases (Caliskan et al., 2022) and empirical fairness (Shen et al., 2022) and show the two can be independent. Our main contribution, however, is showing why this should not come as a surprise. To this end, we first provide a thought experiment, showing how association bias and empirical fairness can be completely orthogonal. Next, we provide empirical evidence that there is no correlation between bias metrics and fairness metrics across the most widely used language models. Finally, we survey the sociological and psychological literature and show how this literature provides ample support for expecting these metrics to be uncorrelated.

* To be published in ACM FAccT 23

Via

Access Paper or Ask Questions