Title: Soft-Label Integration for Robust Toxicity Classification

Oct 18, 2024

Zelei Cheng, Xian Wu, Jiahao Yu, Shuo Han, Xin-Qiang Cai, Xinyu Xing

Abstract: Toxicity classification in textual content remains a significant problem. Data with labels from a single annotator fall short of capturing the diversity of human perspectives. Therefore, there is a growing need to incorporate crowdsourced annotations for training an effective toxicity classifier. Additionally, the standard approach to training a classifier using empirical risk minimization (ERM) may fail to address the potential shifts between the training set and testing set due to exploiting spurious correlations. This work introduces a novel bi-level optimization framework that integrates crowdsourced annotations with the soft-labeling technique and optimizes the soft-label weights by Group Distributionally Robust Optimization (GroupDRO) to enhance the robustness against out-of-distribution (OOD) risk. We theoretically prove the convergence of our bi-level optimization algorithm. Experimental results demonstrate that our approach outperforms existing baseline methods in terms of both average and worst-group accuracy, confirming its effectiveness in leveraging crowdsourced annotations to achieve more effective and robust toxicity classification.

* Accepted by NeurIPS 2024
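
The abstract names two ingredients: soft labels built from crowdsourced annotations, and a GroupDRO objective that targets the worst-performing group. The following is a minimal sketch of those two ideas only, not the authors' bi-level algorithm. The function names, the fixed annotator weights, and the group identifiers are all hypothetical and chosen for illustration; in the paper the soft-label weights are optimized rather than supplied by hand.

```python
import math


def soft_label(annotations, weights):
    """Weighted average of binary annotator votes (1 = toxic).

    Hypothetical helper: the weights are given here, whereas the paper
    learns them in an outer optimization loop.
    """
    return sum(w * a for w, a in zip(weights, annotations)) / sum(weights)


def cross_entropy(p_toxic, target):
    """Binary cross-entropy between a predicted probability and a soft target."""
    eps = 1e-12
    p = min(max(p_toxic, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(target * math.log(p) + (1.0 - target) * math.log(1.0 - p))


def group_losses(preds, targets, groups):
    """Mean cross-entropy per group id."""
    sums, counts = {}, {}
    for p, t, g in zip(preds, targets, groups):
        sums[g] = sums.get(g, 0.0) + cross_entropy(p, t)
        counts[g] = counts.get(g, 0) + 1
    return {g: sums[g] / counts[g] for g in sums}


def worst_group_loss(preds, targets, groups):
    """GroupDRO-style objective: the largest mean loss over all groups."""
    return max(group_losses(preds, targets, groups).values())


# Three annotators vote on one comment; the first is trusted three times as much.
target = soft_label([1, 0, 0], [3.0, 1.0, 1.0])

# Two examples in different (hypothetical) groups with the same hard target.
loss = worst_group_loss([0.9, 0.5], [1.0, 1.0], ["group_a", "group_b"])
```

ERM would minimize the average of the per-group losses, while the GroupDRO-style objective above minimizes their maximum, which is why it is less prone to sacrificing a minority group to spurious correlations.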
