Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Khalid Al-Khatib

Toward Reasonable Parrots: Why Large Language Models Should Argue with Us by Design

May 08, 2025

Elena Musi, Nadin Kokciyan, Khalid Al-Khatib, Davide Ceolin, Emmanuelle Dietz, Klara Gutekunst, Annette Hautli-Janisz, Cristian Manuel Santibañez Yañez, Jodi Schneider, Jonas Scholz(+3 more)

Abstract:In this position paper, we advocate for the development of conversational technology that is inherently designed to support and facilitate argumentative processes. We argue that, at present, large language models (LLMs) are inadequate for this purpose, and we propose an ideal technology design aimed at enhancing argumentative skills. This involves re-framing LLMs as tools to exercise our critical thinking rather than replacing them. We introduce the concept of 'reasonable parrots' that embody the fundamental principles of relevance, responsibility, and freedom, and that interact through argumentative dialogical moves. These principles and moves arise out of millennia of work in argumentation theory and should serve as the starting point for LLM-based technology that incorporates basic principles of argumentation.

Via

Access Paper or Ask Questions

Language is Scary when Over-Analyzed: Unpacking Implied Misogynistic Reasoning with Argumentation Theory-Driven Prompts

Sep 04, 2024

Arianna Muti, Federico Ruggeri, Khalid Al-Khatib, Alberto Barrón-Cedeño, Tommaso Caselli

Figure 1 for Language is Scary when Over-Analyzed: Unpacking Implied Misogynistic Reasoning with Argumentation Theory-Driven Prompts

Figure 2 for Language is Scary when Over-Analyzed: Unpacking Implied Misogynistic Reasoning with Argumentation Theory-Driven Prompts

Figure 3 for Language is Scary when Over-Analyzed: Unpacking Implied Misogynistic Reasoning with Argumentation Theory-Driven Prompts

Figure 4 for Language is Scary when Over-Analyzed: Unpacking Implied Misogynistic Reasoning with Argumentation Theory-Driven Prompts

Abstract:We propose misogyny detection as an Argumentative Reasoning task and we investigate the capacity of large language models (LLMs) to understand the implicit reasoning used to convey misogyny in both Italian and English. The central aim is to generate the missing reasoning link between a message and the implied meanings encoding the misogyny. Our study uses argumentation theory as a foundation to form a collection of prompts in both zero-shot and few-shot settings. These prompts integrate different techniques, including chain-of-thought reasoning and augmented knowledge. Our findings show that LLMs fall short on reasoning capabilities about misogynistic comments and that they mostly rely on their implicit knowledge derived from internalized common stereotypes about women to generate implied assumptions, rather than on inductive reasoning.

Via

Access Paper or Ask Questions

TL;DR Progress: Multi-faceted Literature Exploration in Text Summarization

Feb 10, 2024

Shahbaz Syed, Khalid Al-Khatib, Martin Potthast

Abstract:This paper presents TL;DR Progress, a new tool for exploring the literature on neural text summarization. It organizes 514~papers based on a comprehensive annotation scheme for text summarization approaches and enables fine-grained, faceted search. Each paper was manually annotated to capture aspects such as evaluation metrics, quality dimensions, learning paradigms, challenges addressed, datasets, and document domains. In addition, a succinct indicative summary is provided for each paper, consisting of automatically extracted contextual factors, issues, and proposed solutions. The tool is available online at https://www.tldr-progress.de, a demo video at https://youtu.be/uCVRGFvXUj8

* EACL 2024 System Demonstration

Via

Access Paper or Ask Questions

Citance-Contextualized Summarization of Scientific Papers

Nov 13, 2023

Shahbaz Syed, Ahmad Dawar Hakimi, Khalid Al-Khatib, Martin Potthast

Abstract:Current approaches to automatic summarization of scientific papers generate informative summaries in the form of abstracts. However, abstracts are not intended to show the relationship between a paper and the references cited in it. We propose a new contextualized summarization approach that can generate an informative summary conditioned on a given sentence containing the citation of a reference (a so-called "citance"). This summary outlines the content of the cited paper relevant to the citation location. Thus, our approach extracts and models the citances of a paper, retrieves relevant passages from cited papers, and generates abstractive summaries tailored to each citance. We evaluate our approach using $\textbf{Webis-Context-SciSumm-2023}$, a new dataset containing 540K~computer science papers and 4.6M~citances therein.

* Accepted at EMNLP 2023 Findings

Via

Access Paper or Ask Questions

Indicative Summarization of Long Discussions

Nov 03, 2023

Shahbaz Syed, Dominik Schwabe, Khalid Al-Khatib, Martin Potthast

Abstract:Online forums encourage the exchange and discussion of different stances on many topics. Not only do they provide an opportunity to present one's own arguments, but may also gather a broad cross-section of others' arguments. However, the resulting long discussions are difficult to overview. This paper presents a novel unsupervised approach using large language models (LLMs) to generating indicative summaries for long discussions that basically serve as tables of contents. Our approach first clusters argument sentences, generates cluster labels as abstractive summaries, and classifies the generated cluster labels into argumentation frames resulting in a two-level summary. Based on an extensively optimized prompt engineering approach, we evaluate 19~LLMs for generative cluster labeling and frame classification. To evaluate the usefulness of our indicative summaries, we conduct a purpose-driven user study via a new visual interface called Discussion Explorer: It shows that our proposed indicative summaries serve as a convenient navigation tool to explore long discussions.

* Accepted at EMNLP 2023 Main Conference

Via

Access Paper or Ask Questions

Differential Bias: On the Perceptibility of Stance Imbalance in Argumentation

Oct 13, 2022

Alonso Palomino, Martin Potthast, Khalid Al-Khatib, Benno Stein

Figure 1 for Differential Bias: On the Perceptibility of Stance Imbalance in Argumentation

Figure 2 for Differential Bias: On the Perceptibility of Stance Imbalance in Argumentation

Figure 3 for Differential Bias: On the Perceptibility of Stance Imbalance in Argumentation

Figure 4 for Differential Bias: On the Perceptibility of Stance Imbalance in Argumentation

Abstract:Most research on natural language processing treats bias as an absolute concept: Based on a (probably complex) algorithmic analysis, a sentence, an article, or a text is classified as biased or not. Given the fact that for humans the question of whether a text is biased can be difficult to answer or is answered contradictory, we ask whether an "absolute bias classification" is a promising goal at all. We see the problem not in the complexity of interpreting language phenomena but in the diversity of sociocultural backgrounds of the readers, which cannot be handled uniformly: To decide whether a text has crossed the proverbial line between non-biased and biased is subjective. By asking "Is text X more [less, equally] biased than text Y?" we propose to analyze a simpler problem, which, by its construction, is rather independent of standpoints, views, or sociocultural aspects. In such a model, bias becomes a preference relation that induces a partial ordering from least biased to most biased texts without requiring a decision on where to draw the line. A prerequisite for this kind of bias model is the ability of humans to perceive relative bias differences in the first place. In our research, we selected a specific type of bias in argumentation, the stance bias, and designed a crowdsourcing study showing that differences in stance bias are perceptible when (light) support is provided through training or visual aid.

* Accepted at AACL-IJCNLP 2022, Findings Volume

Via

Access Paper or Ask Questions

Controlled Neural Sentence-Level Reframing of News Articles

Sep 10, 2021

Wei-Fan Chen, Khalid Al-Khatib, Benno Stein, Henning Wachsmuth

Figure 1 for Controlled Neural Sentence-Level Reframing of News Articles

Figure 2 for Controlled Neural Sentence-Level Reframing of News Articles

Figure 3 for Controlled Neural Sentence-Level Reframing of News Articles

Figure 4 for Controlled Neural Sentence-Level Reframing of News Articles

Abstract:Framing a news article means to portray the reported event from a specific perspective, e.g., from an economic or a health perspective. Reframing means to change this perspective. Depending on the audience or the submessage, reframing can become necessary to achieve the desired effect on the readers. Reframing is related to adapting style and sentiment, which can be tackled with neural text generation techniques. However, it is more challenging since changing a frame requires rewriting entire sentences rather than single phrases. In this paper, we study how to computationally reframe sentences in news articles while maintaining their coherence to the context. We treat reframing as a sentence-level fill-in-the-blank task for which we train neural models on an existing media frame corpus. To guide the training, we propose three strategies: framed-language pretraining, named-entity preservation, and adversarial learning. We evaluate respective models automatically and manually for topic consistency, coherence, and successful reframing. Our results indicate that generating properly-framed text works well but with tradeoffs.

* EMNLP 2021 Findings

Via

Access Paper or Ask Questions

Summary Explorer: Visualizing the State of the Art in Text Summarization

Aug 04, 2021

Shahbaz Syed, Tariq Yousef, Khalid Al-Khatib, Stefan Jänicke, Martin Potthast

Figure 1 for Summary Explorer: Visualizing the State of the Art in Text Summarization

Figure 2 for Summary Explorer: Visualizing the State of the Art in Text Summarization

Figure 3 for Summary Explorer: Visualizing the State of the Art in Text Summarization

Abstract:This paper introduces Summary Explorer, a new tool to support the manual inspection of text summarization systems by compiling the outputs of 55~state-of-the-art single document summarization approaches on three benchmark datasets, and visually exploring them during a qualitative assessment. The underlying design of the tool considers three well-known summary quality criteria (coverage, faithfulness, and position bias), encapsulated in a guided assessment based on tailored visualizations. The tool complements existing approaches for locally debugging summarization models and improves upon them. The tool is available at https://tldr.webis.de/

Via

Access Paper or Ask Questions

Generating Informative Conclusions for Argumentative Texts

Jun 02, 2021

Shahbaz Syed, Khalid Al-Khatib, Milad Alshomary, Henning Wachsmuth, Martin Potthast

Figure 1 for Generating Informative Conclusions for Argumentative Texts

Figure 2 for Generating Informative Conclusions for Argumentative Texts

Figure 3 for Generating Informative Conclusions for Argumentative Texts

Figure 4 for Generating Informative Conclusions for Argumentative Texts

Abstract:The purpose of an argumentative text is to support a certain conclusion. Yet, they are often omitted, expecting readers to infer them rather. While appropriate when reading an individual text, this rhetorical device limits accessibility when browsing many texts (e.g., on a search engine or on social media). In these scenarios, an explicit conclusion makes for a good candidate summary of an argumentative text. This is especially true if the conclusion is informative, emphasizing specific concepts from the text. With this paper we introduce the task of generating informative conclusions: First, Webis-ConcluGen-21 is compiled, a large-scale corpus of 136,996 samples of argumentative texts and their conclusions. Second, two paradigms for conclusion generation are investigated; one extractive, the other abstractive in nature. The latter exploits argumentative knowledge that augment the data via control codes and finetuning the BART model on several subsets of the corpus. Third, insights are provided into the suitability of our corpus for the task, the differences between the two generation paradigms, the trade-off between informativeness and conciseness, and the impact of encoding argumentative knowledge. The corpus, code, and the trained models are publicly available.

Via

Access Paper or Ask Questions

Analyzing Political Bias and Unfairness in News Articles at Different Levels of Granularity

Oct 20, 2020

Wei-Fan Chen, Khalid Al-Khatib, Henning Wachsmuth, Benno Stein

Figure 1 for Analyzing Political Bias and Unfairness in News Articles at Different Levels of Granularity

Figure 2 for Analyzing Political Bias and Unfairness in News Articles at Different Levels of Granularity

Figure 3 for Analyzing Political Bias and Unfairness in News Articles at Different Levels of Granularity

Figure 4 for Analyzing Political Bias and Unfairness in News Articles at Different Levels of Granularity

Abstract:Media organizations bear great reponsibility because of their considerable influence on shaping beliefs and positions of our society. Any form of media can contain overly biased content, e.g., by reporting on political events in a selective or incomplete manner. A relevant question hence is whether and how such form of imbalanced news coverage can be exposed. The research presented in this paper addresses not only the automatic detection of bias but goes one step further in that it explores how political bias and unfairness are manifested linguistically. In this regard we utilize a new corpus of 6964 news articles with labels derived from adfontesmedia.com and develop a neural model for bias assessment. By analyzing this model on article excerpts, we find insightful bias patterns at different levels of text granularity, from single words to the whole article discourse.

* NLP+CSS 2020

Via

Access Paper or Ask Questions