Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hendrik Heuer

Lost in Moderation: How Commercial Content Moderation APIs Over- and Under-Moderate Group-Targeted Hate Speech and Linguistic Variations

Mar 03, 2025

David Hartmann, Amin Oueslati, Dimitri Staufer, Lena Pohlmann, Simon Munzert, Hendrik Heuer

Abstract:Commercial content moderation APIs are marketed as scalable solutions to combat online hate speech. However, the reliance on these APIs risks both silencing legitimate speech, called over-moderation, and failing to protect online platforms from harmful speech, known as under-moderation. To assess such risks, this paper introduces a framework for auditing black-box NLP systems. Using the framework, we systematically evaluate five widely used commercial content moderation APIs. Analyzing five million queries based on four datasets, we find that APIs frequently rely on group identity terms, such as ``black'', to predict hate speech. While OpenAI's and Amazon's services perform slightly better, all providers under-moderate implicit hate speech, which uses codified messages, especially against LGBTQIA+ individuals. Simultaneously, they over-moderate counter-speech, reclaimed slurs and content related to Black, LGBTQIA+, Jewish, and Muslim people. We recommend that API providers offer better guidance on API implementation and threshold setting and more transparency on their APIs' limitations. Warning: This paper contains offensive and hateful terms and concepts. We have chosen to reproduce these terms for reasons of transparency.

* This is the author's version of the paper accepted at CHI Conference on Human Factors in Computing Systems (CHI '25), April 26-May 1, 2025, Yokohama, Japan

Via

Access Paper or Ask Questions

Writer-Defined AI Personas for On-Demand Feedback Generation

Sep 19, 2023

Karim Benharrak, Tim Zindulka, Florian Lehmann, Hendrik Heuer, Daniel Buschek

Abstract:Compelling writing is tailored to its audience. This is challenging, as writers may struggle to empathize with readers, get feedback in time, or gain access to the target group. We propose a concept that generates on-demand feedback, based on writer-defined AI personas of any target audience. We explore this concept with a prototype (using GPT-3.5) in two user studies (N=5 and N=11): Writers appreciated the concept and strategically used personas for getting different perspectives. The feedback was seen as helpful and inspired revisions of text and personas, although it was often verbose and unspecific. We discuss the impact of on-demand feedback, the limited representativity of contemporary AI systems, and further ideas for defining AI personas. This work contributes to the vision of supporting writers with AI by expanding the socio-technical perspective in AI tool design: To empower creators, we also need to keep in mind their relationship to an audience.

* 25 pages, 7 figures, 2 tables

Via

Access Paper or Ask Questions

Methods for the Design and Evaluation of HCI+NLP Systems

Feb 26, 2021

Hendrik Heuer, Daniel Buschek

Figure 1 for Methods for the Design and Evaluation of HCI+NLP Systems

Figure 2 for Methods for the Design and Evaluation of HCI+NLP Systems

Abstract:HCI and NLP traditionally focus on different evaluation methods. While HCI involves a small number of people directly and deeply, NLP traditionally relies on standardized benchmark evaluations that involve a larger number of people indirectly. We present five methodological proposals at the intersection of HCI and NLP and situate them in the context of ML-based NLP models. Our goal is to foster interdisciplinary collaboration and progress in both fields by emphasizing what the fields can learn from each other.

* Accepted at the EACL 2021 Workshop on Bridging Human-Computer Interaction and Natural Language Processing (HCI+NLP Workshop)

Via

Access Paper or Ask Questions

More Than Accuracy: Towards Trustworthy Machine Learning Interfaces for Object Recognition

Aug 05, 2020

Hendrik Heuer, Andreas Breiter

Figure 1 for More Than Accuracy: Towards Trustworthy Machine Learning Interfaces for Object Recognition

Figure 2 for More Than Accuracy: Towards Trustworthy Machine Learning Interfaces for Object Recognition

Abstract:This paper investigates the user experience of visualizations of a machine learning (ML) system that recognizes objects in images. This is important since even good systems can fail in unexpected ways as misclassifications on photo-sharing websites showed. In our study, we exposed users with a background in ML to three visualizations of three systems with different levels of accuracy. In interviews, we explored how the visualization helped users assess the accuracy of systems in use and how the visualization and the accuracy of the system affected trust and reliance. We found that participants do not only focus on accuracy when assessing ML systems. They also take the perceived plausibility and severity of misclassification into account and prefer seeing the probability of predictions. Semantically plausible errors are judged as less severe than errors that are implausible, which means that system accuracy could be communicated through the types of errors.

* UMAP 2020: Proceedings of the 28th ACM Conference on User Modeling, Adaptation and Personalization
* UMAP '20: Proceedings of the 28th ACM Conference on User Modeling, Adaptation and Personalization

Via

Access Paper or Ask Questions

Is It Worth the Attention? A Comparative Evaluation of Attention Layers for Argument Unit Segmentation

Jun 24, 2019

Maximilian Spliethöver, Jonas Klaff, Hendrik Heuer

Figure 1 for Is It Worth the Attention? A Comparative Evaluation of Attention Layers for Argument Unit Segmentation

Figure 2 for Is It Worth the Attention? A Comparative Evaluation of Attention Layers for Argument Unit Segmentation

Figure 3 for Is It Worth the Attention? A Comparative Evaluation of Attention Layers for Argument Unit Segmentation

Abstract:Attention mechanisms have seen some success for natural language processing downstream tasks in recent years and generated new State-of-the-Art results. A thorough evaluation of the attention mechanism for the task of Argumentation Mining is missing, though. With this paper, we report a comparative evaluation of attention layers in combination with a bidirectional long short-term memory network, which is the current state-of-the-art approach to the unit segmentation task. We also compare sentence-level contextualized word embeddings to pre-generated ones. Our findings suggest that for this task the additional attention layer does not improve upon a less complex approach. In most cases, the contextualized embeddings do also not show an improvement on the baseline score.

* Accepted to the 6th Workshop on Argument Mining 2019

Via

Access Paper or Ask Questions

Generating captions without looking beyond objects

Oct 18, 2016

Hendrik Heuer, Christof Monz, Arnold W. M. Smeulders

Figure 1 for Generating captions without looking beyond objects

Figure 2 for Generating captions without looking beyond objects

Figure 3 for Generating captions without looking beyond objects

Figure 4 for Generating captions without looking beyond objects

Abstract:This paper explores new evaluation perspectives for image captioning and introduces a noun translation task that achieves comparative image caption generation performance by translating from a set of nouns to captions. This implies that in image captioning, all word categories other than nouns can be evoked by a powerful language model without sacrificing performance on n-gram precision. The paper also investigates lower and upper bounds of how much individual word categories in the captions contribute to the final BLEU score. A large possible improvement exists for nouns, verbs, and prepositions.

* This paper was presented at the ECCV2016 2nd Workshop on Storytelling with Images and Videos (VisStory)

Via

Access Paper or Ask Questions

Text comparison using word vector representations and dimensionality reduction

Jul 02, 2016

Hendrik Heuer

Figure 1 for Text comparison using word vector representations and dimensionality reduction

Figure 2 for Text comparison using word vector representations and dimensionality reduction

Figure 3 for Text comparison using word vector representations and dimensionality reduction

Figure 4 for Text comparison using word vector representations and dimensionality reduction

Abstract:This paper describes a technique to compare large text sources using word vector representations (word2vec) and dimensionality reduction (t-SNE) and how it can be implemented using Python. The technique provides a bird's-eye view of text sources, e.g. text summaries and their source material, and enables users to explore text sources like a geographical map. Word vector representations capture many linguistic properties such as gender, tense, plurality and even semantic concepts like "capital city of". Using dimensionality reduction, a 2D map can be computed where semantically similar words are close to each other. The technique uses the word2vec model from the gensim Python library and t-SNE from scikit-learn.

Via

Access Paper or Ask Questions