Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Vanessa Hahn

Modeling Profanity and Hate Speech in Social Media with Semantic Subspaces

Jun 18, 2021

Vanessa Hahn, Dana Ruiter, Thomas Kleinbauer, Dietrich Klakow

Figure 1 for Modeling Profanity and Hate Speech in Social Media with Semantic Subspaces

Figure 2 for Modeling Profanity and Hate Speech in Social Media with Semantic Subspaces

Figure 3 for Modeling Profanity and Hate Speech in Social Media with Semantic Subspaces

Figure 4 for Modeling Profanity and Hate Speech in Social Media with Semantic Subspaces

Abstract:Hate speech and profanity detection suffer from data sparsity, especially for languages other than English, due to the subjective nature of the tasks and the resulting annotation incompatibility of existing corpora. In this study, we identify profane subspaces in word and sentence representations and explore their generalization capability on a variety of similar and distant target tasks in a zero-shot setting. This is done monolingually (German) and cross-lingually to closely-related (English), distantly-related (French) and non-related (Arabic) tasks. We observe that, on both similar and distant target tasks and across all languages, the subspace-based representations transfer more effectively than standard BERT representations in the zero-shot setting, with improvements between F1 +10.9 and F1 +42.9 over the baselines across all tested monolingual and cross-lingual scenarios.

* 9 pages, 4 figures, accepted as a long paper at Workshop on Online Abuse and Harms 2021

Via

Access Paper or Ask Questions