Dimitri Staufer

Lost in Moderation: How Commercial Content Moderation APIs Over- and Under-Moderate Group-Targeted Hate Speech and Linguistic Variations

Mar 03, 2025
Via arXiv

Watching the Watchers: A Comparative Fairness Audit of Cloud-based Content Moderation Services

Jun 20, 2024
Via arXiv

Silencing the Risk, Not the Whistle: A Semi-automated Text Sanitization Tool for Mitigating the Risk of Whistleblower Re-Identification

May 02, 2024
Via arXiv