Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Racial Bias in Hate Speech and Abusive Language Detection Datasets

May 29, 2019

Thomas Davidson, Debasmita Bhattacharya, Ingmar Weber

Figure 1 for Racial Bias in Hate Speech and Abusive Language Detection Datasets

Figure 2 for Racial Bias in Hate Speech and Abusive Language Detection Datasets

Figure 3 for Racial Bias in Hate Speech and Abusive Language Detection Datasets

Figure 4 for Racial Bias in Hate Speech and Abusive Language Detection Datasets

Share this with someone who'll enjoy it:

Abstract:Technologies for abusive language detection are being developed and applied with little consideration of their potential biases. We examine racial bias in five different sets of Twitter data annotated for hate speech and abusive language. We train classifiers on these datasets and compare the predictions of these classifiers on tweets written in African-American English with those written in Standard American English. The results show evidence of systematic racial bias in all datasets, as classifiers trained on them tend to predict that tweets written in African-American English are abusive at substantially higher rates. If these abusive language detection systems are used in the field they will therefore have a disproportionate negative impact on African-American social media users. Consequently, these systems may discriminate against the groups who are often the targets of the abuse we are trying to detect.

* To appear in the proceedings of the Third Abusive Language Workshop (https://sites.google.com/view/alw3/) at the Annual Meeting for the Association for Computational Linguistics 2019. Please cite the published version

View paper on

Share this with someone who'll enjoy it:

Title:Racial Bias in Hate Speech and Abusive Language Detection Datasets

Paper and Code