With the recent rise of toxicity in online conversations on social media platforms, using modern machine learning algorithms for toxic comment detection has become a central focus of many online applications. Researchers and companies have developed a variety of shallow and deep learning models to identify toxicity in online conversations, reviews, or comments, with mixed success. However, these existing approaches have learned to incorrectly associate non-toxic comments containing certain trigger words (e.g., gay, lesbian, black, muslim) with toxicity. In this paper, we evaluate dozens of state-of-the-art models with a specific focus on reducing model bias towards these commonly attacked identity groups. We propose a multi-task learning model with an attention layer that jointly learns to predict the toxicity of a comment as well as the identities mentioned in the comment in order to reduce this bias. We then compare our model to an array of shallow and deep learning models using metrics designed specifically to measure unintended model bias within these identity groups.
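To make the multi-task idea concrete, the following is a minimal PyTorch sketch of one possible instantiation: a shared text encoder with a simple additive attention layer feeding two heads, one predicting toxicity and one predicting identity mentions, trained with a joint loss. All names and hyperparameters (vocabulary size, embedding and hidden dimensions, number of identity labels) are illustrative assumptions, not values or architecture details taken from the paper.

```python
import torch
import torch.nn as nn

class MultiTaskToxicityModel(nn.Module):
    """Illustrative shared encoder + attention with two task heads."""
    def __init__(self, vocab_size=30000, embed_dim=128, hidden_dim=128,
                 num_identities=9):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        self.encoder = nn.LSTM(embed_dim, hidden_dim, batch_first=True,
                               bidirectional=True)
        # Additive attention: scores each timestep, yields a weighted summary.
        self.attn_score = nn.Linear(2 * hidden_dim, 1)
        # Both task heads share the attended representation.
        self.toxicity_head = nn.Linear(2 * hidden_dim, 1)
        self.identity_head = nn.Linear(2 * hidden_dim, num_identities)

    def forward(self, token_ids):
        h, _ = self.encoder(self.embedding(token_ids))       # (B, T, 2H)
        weights = torch.softmax(self.attn_score(h), dim=1)   # (B, T, 1)
        context = (weights * h).sum(dim=1)                   # (B, 2H)
        return self.toxicity_head(context), self.identity_head(context)

# Joint objective: binary cross-entropy on toxicity plus
# multi-label binary cross-entropy on the identity predictions.
model = MultiTaskToxicityModel()
tokens = torch.randint(1, 30000, (4, 50))        # dummy batch of 4 comments
tox_logits, id_logits = model(tokens)
tox_target = torch.rand(4, 1).round()            # dummy labels
id_target = torch.rand(4, 9).round()
loss = (nn.functional.binary_cross_entropy_with_logits(tox_logits, tox_target)
        + nn.functional.binary_cross_entropy_with_logits(id_logits, id_target))
loss.backward()
```

The intuition behind this kind of joint setup is that forcing the shared representation to also predict identity mentions can discourage the toxicity head from relying on identity terms alone as a proxy for toxicity.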