Abstract: We present a comprehensive computational study of the under-investigated phenomenon of personal name compounds (PNCs) in German, such as Willkommens-Merkel ('Welcome-Merkel'). Prevalent in news, social media, and political discourse, PNCs are hypothesized to exhibit an evaluative function, being perceived more positively or negatively than the respective full name (such as Angela Merkel). We model 321 PNCs and their corresponding full names at the discourse level, and show that PNCs bear an evaluative nature that can be captured through a variety of computational methods. Specifically, we use valence information to assess whether a PNC is more positively or negatively evaluative than the person's name, applying and comparing two approaches that rely on (i) valence norms and (ii) pretrained language models (PLMs). We further enrich our data with personal, domain-specific, and extra-linguistic information and perform a range of regression analyses, revealing that factors including compound and modifier valence, domain, and political party membership influence how a PNC is evaluated.
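A minimal sketch of the valence-norms approach (i), assuming a hypothetical lexicon VALENCE_NORMS and pre-tokenized discourse contexts; the ratings, helper names, and aggregation below are illustrative, not the paper's exact implementation:

```python
from statistics import mean

# Hypothetical valence lexicon (lemma -> rating, e.g., on a 1-9 scale)
VALENCE_NORMS = {"willkommen": 6.8, "krise": 2.1, "skandal": 1.9}

def context_valence(tokens):
    """Mean valence over lexicon-covered tokens in one discourse context."""
    scores = [VALENCE_NORMS[t.lower()] for t in tokens if t.lower() in VALENCE_NORMS]
    return mean(scores) if scores else None

def relative_evaluation(pnc_contexts, name_contexts):
    """> 0 means PNC contexts read as more positive than full-name contexts.

    Assumes at least one lexicon-covered context on each side.
    """
    pnc = [v for v in map(context_valence, pnc_contexts) if v is not None]
    name = [v for v in map(context_valence, name_contexts) if v is not None]
    return mean(pnc) - mean(name)
```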
Abstract: We present a novel dataset for the physical and abstract plausibility of events in English. Based on naturally occurring sentences extracted from Wikipedia, we infuse degrees of abstractness into the data and automatically generate perturbed pseudo-implausible events. We annotate a filtered and balanced subset for plausibility via crowd-sourcing, and perform extensive cleansing to ensure annotation quality. In-depth quantitative analyses indicate that annotators favor plausibility over implausibility and disagree more on implausible events. Furthermore, our plausibility dataset is the first to capture abstractness in events to the same extent as concreteness, and we find that event abstractness affects plausibility ratings: more concrete event participants trigger a perception of implausibility.
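As an illustration of how pseudo-implausible events might be generated by perturbation, here is a toy sketch that swaps an event's object with one drawn from another event; the triples and the swap strategy are assumptions for illustration, not the dataset's actual construction procedure:

```python
import random

# Toy (subject, verb, object) events; illustrative, not from the dataset
events = [("chef", "slices", "onion"),
          ("lawyer", "signs", "contract"),
          ("child", "throws", "ball")]

def perturb(event, pool, rng=random.Random(42)):
    """Swap the object with one from another event -> pseudo-implausible."""
    subj, verb, obj = event
    candidates = [other[2] for other in pool if other[2] != obj]
    return (subj, verb, rng.choice(candidates))

for e in events:
    print(e, "->", perturb(e, events))
# e.g., ('chef', 'slices', 'onion') -> ('chef', 'slices', 'contract')
```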
Abstract: We propose a novel approach to learning domain-specific plausible materials for components in the vehicle repair domain by probing Pretrained Language Models (PLMs) in a cloze-style task setting, thereby overcoming the lack of annotated datasets. We devise a new method to aggregate salient predictions from a set of cloze query templates and show that domain adaptation using either a small, high-quality corpus or a customized Wikipedia corpus boosts performance. When exploring resource-lean alternatives, we find that a distilled PLM clearly outperforms a classic pattern-based algorithm. Further, given that 98% of our domain-specific components are multiword expressions, we successfully exploit the compositionality assumption to address data sparsity.
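The cloze-style probing setup can be illustrated with the Hugging Face transformers fill-mask pipeline; the templates, the top-k cutoff, and the score-sum aggregation below are simplified assumptions rather than the paper's exact method:

```python
from collections import defaultdict
from transformers import pipeline

fill = pipeline("fill-mask", model="bert-base-uncased")

# Hypothetical cloze query templates for one component
templates = [
    "The brake disc is made of [MASK].",
    "A brake disc is usually made out of [MASK].",
]

# Aggregate predictions across templates by summing their scores
scores = defaultdict(float)
for template in templates:
    for pred in fill(template, top_k=10):
        scores[pred["token_str"]] += pred["score"]

for material, score in sorted(scores.items(), key=lambda kv: -kv[1])[:5]:
    print(material, round(score, 3))
```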
Abstract: Detecting persuasion techniques in news in a multilingual setup is non-trivial and comes with challenges, including scarce training data. Our system successfully leverages translation and back-translation as data augmentation strategies together with multilingual transformer models for the task of detecting persuasion techniques. Automatic and human evaluation of our augmented data allows us to explore whether (back-)translation aids or hinders performance. Our in-depth analyses indicate that both data augmentation strategies boost performance; however, balancing human-produced and machine-generated data appears to be crucial.
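A minimal back-translation sketch using real Helsinki-NLP MarianMT checkpoints via the transformers pipeline; the English-German round trip is a simplified illustration of the augmentation strategy, not the system's exact configuration:

```python
from transformers import pipeline

# MarianMT checkpoints; the pivot language is chosen for illustration
en_de = pipeline("translation", model="Helsinki-NLP/opus-mt-en-de")
de_en = pipeline("translation", model="Helsinki-NLP/opus-mt-de-en")

def back_translate(text):
    """EN -> DE -> EN round trip yielding a paraphrased training example."""
    german = en_de(text)[0]["translation_text"]
    return de_en(german)[0]["translation_text"]

print(back_translate("The speech appeals to fear rather than presenting evidence."))
```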