Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Verónica Pérez-Rosas

Persuasion at Play: Understanding Misinformation Dynamics in Demographic-Aware Human-LLM Interactions

Mar 03, 2025

Angana Borah, Rada Mihalcea, Verónica Pérez-Rosas

Abstract:Existing challenges in misinformation exposure and susceptibility vary across demographic groups, as some populations are more vulnerable to misinformation than others. Large language models (LLMs) introduce new dimensions to these challenges through their ability to generate persuasive content at scale and reinforcing existing biases. This study investigates the bidirectional persuasion dynamics between LLMs and humans when exposed to misinformative content. We analyze human-to-LLM influence using human-stance datasets and assess LLM-to-human influence by generating LLM-based persuasive arguments. Additionally, we use a multi-agent LLM framework to analyze the spread of misinformation under persuasion among demographic-oriented LLM agents. Our findings show that demographic factors influence susceptibility to misinformation in LLMs, closely reflecting the demographic-based patterns seen in human susceptibility. We also find that, similar to human demographic groups, multi-agent LLMs exhibit echo chamber behavior. This research explores the interplay between humans and LLMs, highlighting demographic differences in the context of misinformation and offering insights for future interventions.

Via

Access Paper or Ask Questions

Examining Spanish Counseling with MIDAS: a Motivational Interviewing Dataset in Spanish

Feb 12, 2025

Aylin Gunal, Bowen Yi, John Piette, Rada Mihalcea, Verónica Pérez-Rosas

Abstract:Cultural and language factors significantly influence counseling, but Natural Language Processing research has not yet examined whether the findings of conversational analysis for counseling conducted in English apply to other languages. This paper presents a first step towards this direction. We introduce MIDAS (Motivational Interviewing Dataset in Spanish), a counseling dataset created from public video sources that contains expert annotations for counseling reflections and questions. Using this dataset, we explore language-based differences in counselor behavior in English and Spanish and develop classifiers in monolingual and multilingual settings, demonstrating its applications in counselor behavioral coding tasks.

* To appear in NAACL 2025 Main Conference

Via

Access Paper or Ask Questions

VERVE: Template-based ReflectiVE Rewriting for MotiVational IntErviewing

Nov 14, 2023

Do June Min, Verónica Pérez-Rosas, Kenneth Resnicow, Rada Mihalcea

Figure 1 for VERVE: Template-based ReflectiVE Rewriting for MotiVational IntErviewing

Figure 2 for VERVE: Template-based ReflectiVE Rewriting for MotiVational IntErviewing

Figure 3 for VERVE: Template-based ReflectiVE Rewriting for MotiVational IntErviewing

Figure 4 for VERVE: Template-based ReflectiVE Rewriting for MotiVational IntErviewing

Abstract:Reflective listening is a fundamental skill that counselors must acquire to achieve proficiency in motivational interviewing (MI). It involves responding in a manner that acknowledges and explores the meaning of what the client has expressed in the conversation. In this work, we introduce the task of counseling response rewriting, which transforms non-reflective statements into reflective responses. We introduce VERVE, a template-based rewriting system with paraphrase-augmented training and adaptive template updating. VERVE first creates a template by identifying and filtering out tokens that are not relevant to reflections and constructs a reflective response using the template. Paraphrase-augmented training allows the model to learn less-strict fillings of masked spans, and adaptive template updating helps discover effective templates for rewriting without significantly removing the original content. Using both automatic and human evaluations, we compare our method against text rewriting baselines and show that our framework is effective in turning non-reflective statements into more reflective responses while achieving a good content preservation-reflection style trade-off.

Via

Access Paper or Ask Questions

Adaptable Claim Rewriting with Offline Reinforcement Learning for Effective Misinformation Discovery

Oct 14, 2022

Ashkan Kazemi, Artem Abzaliev, Naihao Deng, Rui Hou, Davis Liang, Scott A. Hale, Verónica Pérez-Rosas, Rada Mihalcea

Figure 1 for Adaptable Claim Rewriting with Offline Reinforcement Learning for Effective Misinformation Discovery

Figure 2 for Adaptable Claim Rewriting with Offline Reinforcement Learning for Effective Misinformation Discovery

Figure 3 for Adaptable Claim Rewriting with Offline Reinforcement Learning for Effective Misinformation Discovery

Figure 4 for Adaptable Claim Rewriting with Offline Reinforcement Learning for Effective Misinformation Discovery

Abstract:We propose a novel system to help fact-checkers formulate search queries for known misinformation claims and effectively search across multiple social media platforms. We introduce an adaptable rewriting strategy, where editing actions (e.g., swap a word with its synonym; change verb tense into present simple) for queries containing claims are automatically learned through offline reinforcement learning. Specifically, we use a decision transformer to learn a sequence of editing actions that maximize query retrieval metrics such as mean average precision. Through several experiments, we show that our approach can increase the effectiveness of the queries by up to 42\% relatively, while producing editing action sequences that are human readable, thus making the system easy to use and explain.

Via

Access Paper or Ask Questions

Matching Tweets With Applicable Fact-Checks Across Languages

Feb 14, 2022

Ashkan Kazemi, Zehua Li, Verónica Pérez-Rosas, Scott A. Hale, Rada Mihalcea

Figure 1 for Matching Tweets With Applicable Fact-Checks Across Languages

Figure 2 for Matching Tweets With Applicable Fact-Checks Across Languages

Figure 3 for Matching Tweets With Applicable Fact-Checks Across Languages

Figure 4 for Matching Tweets With Applicable Fact-Checks Across Languages

Abstract:An important challenge for news fact-checking is the effective dissemination of existing fact-checks. This in turn brings the need for reliable methods to detect previously fact-checked claims. In this paper, we focus on automatically finding existing fact-checks for claims made in social media posts (tweets). We conduct both classification and retrieval experiments, in monolingual (English only), multilingual (Spanish, Portuguese), and cross-lingual (Hindi-English) settings using multilingual transformer models such as XLM-RoBERTa and multilingual embeddings such as LaBSE and SBERT. We present promising results for "match" classification (93% average accuracy) in four language pairs. We also find that a BM25 baseline outperforms state-of-the-art multilingual embedding models for the retrieval task during our monolingual experiments. We highlight and discuss NLP challenges while addressing this problem in different languages, and we introduce a novel curated dataset of fact-checks and corresponding tweets for future research.

* Accepted to De-Factify Workshop at AAAI 2022

Via

Access Paper or Ask Questions

Exploring Self-Identified Counseling Expertise in Online Support Forums

Jun 24, 2021

Allison Lahnala, Yuntian Zhao, Charles Welch, Jonathan K. Kummerfeld, Lawrence An, Kenneth Resnicow, Rada Mihalcea, Verónica Pérez-Rosas

Figure 1 for Exploring Self-Identified Counseling Expertise in Online Support Forums

Figure 2 for Exploring Self-Identified Counseling Expertise in Online Support Forums

Figure 3 for Exploring Self-Identified Counseling Expertise in Online Support Forums

Figure 4 for Exploring Self-Identified Counseling Expertise in Online Support Forums

Abstract:A growing number of people engage in online health forums, making it important to understand the quality of the advice they receive. In this paper, we explore the role of expertise in responses provided to help-seeking posts regarding mental health. We study the differences between (1) interactions with peers; and (2) interactions with self-identified mental health professionals. First, we show that a classifier can distinguish between these two groups, indicating that their language use does in fact differ. To understand this difference, we perform several analyses addressing engagement aspects, including whether their comments engage the support-seeker further as well as linguistic aspects, such as dominant language and linguistic style matching. Our work contributes toward the developing efforts of understanding how health experts engage with health information- and support-seekers in social networks. More broadly, it is a step toward a deeper understanding of the styles of interactions that cultivate supportive engagement in online communities.

* Accepted to Findings of ACL 2021

Via

Access Paper or Ask Questions

Extractive and Abstractive Explanations for Fact-Checking and Evaluation of News

Apr 27, 2021

Ashkan Kazemi, Zehua Li, Verónica Pérez-Rosas, Rada Mihalcea

Figure 1 for Extractive and Abstractive Explanations for Fact-Checking and Evaluation of News

Figure 2 for Extractive and Abstractive Explanations for Fact-Checking and Evaluation of News

Figure 3 for Extractive and Abstractive Explanations for Fact-Checking and Evaluation of News

Figure 4 for Extractive and Abstractive Explanations for Fact-Checking and Evaluation of News

Abstract:In this paper, we explore the construction of natural language explanations for news claims, with the goal of assisting fact-checking and news evaluation applications. We experiment with two methods: (1) an extractive method based on Biased TextRank -- a resource-effective unsupervised graph-based algorithm for content extraction; and (2) an abstractive method based on the GPT-2 language model. We perform comparative evaluations on two misinformation datasets in the political and health news domains, and find that the extractive method shows the most promise.

* Accepted to NLP for Internet Freedom Workshop at NAACL 2021

Via

Access Paper or Ask Questions

Exploring the Value of Personalized Word Embeddings

Nov 11, 2020

Charles Welch, Jonathan K. Kummerfeld, Verónica Pérez-Rosas, Rada Mihalcea

Figure 1 for Exploring the Value of Personalized Word Embeddings

Figure 2 for Exploring the Value of Personalized Word Embeddings

Figure 3 for Exploring the Value of Personalized Word Embeddings

Figure 4 for Exploring the Value of Personalized Word Embeddings

Abstract:In this paper, we introduce personalized word embeddings, and examine their value for language modeling. We compare the performance of our proposed prediction model when using personalized versus generic word representations, and study how these representations can be leveraged for improved performance. We provide insight into what types of words can be more accurately predicted when building personalized models. Our results show that a subset of words belonging to specific psycholinguistic categories tend to vary more in their representations across users and that combining generic and personalized word embeddings yields the best performance, with a 4.7% relative reduction in perplexity. Additionally, we show that a language model using personalized word embeddings can be effectively used for authorship attribution.

* COLING 2020

Via

Access Paper or Ask Questions

Biased TextRank: Unsupervised Graph-Based Content Extraction

Nov 02, 2020

Ashkan Kazemi, Verónica Pérez-Rosas, Rada Mihalcea

Figure 1 for Biased TextRank: Unsupervised Graph-Based Content Extraction

Figure 2 for Biased TextRank: Unsupervised Graph-Based Content Extraction

Figure 3 for Biased TextRank: Unsupervised Graph-Based Content Extraction

Figure 4 for Biased TextRank: Unsupervised Graph-Based Content Extraction

Abstract:We introduce Biased TextRank, a graph-based content extraction method inspired by the popular TextRank algorithm that ranks text spans according to their importance for language processing tasks and according to their relevance to an input "focus." Biased TextRank enables focused content extraction for text by modifying the random restarts in the execution of TextRank. The random restart probabilities are assigned based on the relevance of the graph nodes to the focus of the task. We present two applications of Biased TextRank: focused summarization and explanation extraction, and show that our algorithm leads to improved performance on two different datasets by significant ROUGE-N score margins. Much like its predecessor, Biased TextRank is unsupervised, easy to implement and orders of magnitude faster and lighter than current state-of-the-art Natural Language Processing methods for similar tasks.

* Accepted to COLING 2020

Via

Access Paper or Ask Questions

Compositional Demographic Word Embeddings

Oct 29, 2020

Charles Welch, Jonathan K. Kummerfeld, Verónica Pérez-Rosas, Rada Mihalcea

Figure 1 for Compositional Demographic Word Embeddings

Figure 2 for Compositional Demographic Word Embeddings

Figure 3 for Compositional Demographic Word Embeddings

Figure 4 for Compositional Demographic Word Embeddings

Abstract:Word embeddings are usually derived from corpora containing text from many individuals, thus leading to general purpose representations rather than individually personalized representations. While personalized embeddings can be useful to improve language model performance and other language processing tasks, they can only be computed for people with a large amount of longitudinal data, which is not the case for new users. We propose a new form of personalized word embeddings that use demographic-specific word representations derived compositionally from full or partial demographic information for a user (i.e., gender, age, location, religion). We show that the resulting demographic-aware word representations outperform generic word representations on two tasks for English: language modeling and word associations. We further explore the trade-off between the number of available attributes and their relative effectiveness and discuss the ethical implications of using them.

* To appear at EMNLP 2020

Via

Access Paper or Ask Questions