Debora Nozza

The Unseen Targets of Hate -- A Systematic Review of Hateful Communication Datasets

May 14, 2024
Via arXiv

FairBelief - Assessing Harmful Beliefs in Language Models

Feb 27, 2024
Via arXiv

A Tale of Pronouns: Interpretability Informs Gender Bias Mitigation for Fairer Instruction-Tuned Machine Translation

Oct 25, 2023
Via arXiv

Weigh Your Own Words: Improving Hate Speech Counter Narrative Generation via Attention Regularization

Sep 05, 2023
Via arXiv

Leveraging Label Variation in Large Language Models for Zero-Shot Text Classification

Jul 24, 2023
Via arXiv

What about "em"? How Commercial Machine Translation Fails to Handle (Neo-)Pronouns

May 25, 2023
Via arXiv

Measuring Harmful Representations in Scandinavian Language Models

Nov 21, 2022
Via arXiv

Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale

Nov 07, 2022
Via arXiv

Data-Efficient Strategies for Expanding Hate Speech Detection into Under-Resourced Languages

Oct 20, 2022
Via arXiv

The State of Profanity Obfuscation in Natural Language Processing

Oct 14, 2022
Via arXiv