Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ellen Riloff

University of Utah

Say Less, Mean More: Leveraging Pragmatics in Retrieval-Augmented Generation

Feb 25, 2025

Haris Riaz, Ellen Riloff, Mihai Surdeanu

Abstract:We propose a simple, unsupervised method that injects pragmatic principles in retrieval-augmented generation (RAG) frameworks such as Dense Passage Retrieval~\cite{karpukhin2020densepassageretrievalopendomain} to enhance the utility of retrieved contexts. Our approach first identifies which sentences in a pool of documents retrieved by RAG are most relevant to the question at hand, cover all the topics addressed in the input question and no more, and then highlights these sentences within their context, before they are provided to the LLM, without truncating or altering the context in any other way. We show that this simple idea brings consistent improvements in experiments on three question answering tasks (ARC-Challenge, PubHealth and PopQA) using five different LLMs. It notably enhances relative accuracy by up to 19.7\% on PubHealth and 10\% on ARC-Challenge compared to a conventional RAG system.

* 16 pages, 2 figures

Via

Access Paper or Ask Questions

Memorization In In-Context Learning

Aug 21, 2024

Shahriar Golchin, Mihai Surdeanu, Steven Bethard, Eduardo Blanco, Ellen Riloff

Figure 1 for Memorization In In-Context Learning

Figure 2 for Memorization In In-Context Learning

Figure 3 for Memorization In In-Context Learning

Figure 4 for Memorization In In-Context Learning

Abstract:In-context learning (ICL) has proven to be an effective strategy for improving the performance of large language models (LLMs) with no additional training. However, the exact mechanism behind these performance improvements remains unclear. This study is the first to show how ICL surfaces memorized training data and to explore the correlation between this memorization and performance across various ICL regimes: zero-shot, few-shot, and many-shot. Our most notable findings include: (1) ICL significantly surfaces memorization compared to zero-shot learning in most cases; (2) demonstrations, without their labels, are the most effective element in surfacing memorization; (3) ICL improves performance when the surfaced memorization in few-shot regimes reaches a high level (about 40%); and (4) there is a very strong correlation between performance and memorization in ICL when it outperforms zero-shot learning. Overall, our study uncovers a hidden phenomenon -- memorization -- at the core of ICL, raising an important question: to what extent do LLMs truly generalize from demonstrations in ICL, and how much of their success is due to memorization?

* v1

Via

Access Paper or Ask Questions

Classifying Organizations for Food System Ontologies using Natural Language Processing

Sep 19, 2023

Tianyu Jiang, Sonia Vinogradova, Nathan Stringham, E. Louise Earl, Allan D. Hollander, Patrick R. Huber, Ellen Riloff, R. Sandra Schillo, Giorgio A. Ubbiali, Matthew Lange

Abstract:Our research explores the use of natural language processing (NLP) methods to automatically classify entities for the purpose of knowledge graph population and integration with food system ontologies. We have created NLP models that can automatically classify organizations with respect to categories associated with environmental issues as well as Standard Industrial Classification (SIC) codes, which are used by the U.S. government to characterize business activities. As input, the NLP models are provided with text snippets retrieved by the Google search engine for each organization, which serves as a textual description of the organization that is used for learning. Our experimental results show that NLP models can achieve reasonably good performance for these two classification tasks, and they rely on a general framework that could be applied to many other classification problems as well. We believe that NLP models represent a promising approach for automatically harvesting information to populate knowledge graphs and aligning the information with existing ontologies through shared categories and concepts.

* Presented at IFOW 2023 Integrated Food Ontology Workshop at the Formal Ontology in Information Systems Conference (FOIS) 2023 in Sherbrooke, Quebec, Canada July 17-20th, 2023

Via

Access Paper or Ask Questions

Creating and Characterizing a Diverse Corpus of Sarcasm in Dialogue

Sep 15, 2017

Shereen Oraby, Vrindavan Harrison, Lena Reed, Ernesto Hernandez, Ellen Riloff, Marilyn Walker

Figure 1 for Creating and Characterizing a Diverse Corpus of Sarcasm in Dialogue

Figure 2 for Creating and Characterizing a Diverse Corpus of Sarcasm in Dialogue

Figure 3 for Creating and Characterizing a Diverse Corpus of Sarcasm in Dialogue

Figure 4 for Creating and Characterizing a Diverse Corpus of Sarcasm in Dialogue

Abstract:The use of irony and sarcasm in social media allows us to study them at scale for the first time. However, their diversity has made it difficult to construct a high-quality corpus of sarcasm in dialogue. Here, we describe the process of creating a large- scale, highly-diverse corpus of online debate forums dialogue, and our novel methods for operationalizing classes of sarcasm in the form of rhetorical questions and hyperbole. We show that we can use lexico-syntactic cues to reliably retrieve sarcastic utterances with high accuracy. To demonstrate the properties and quality of our corpus, we conduct supervised learning experiments with simple features, and show that we achieve both higher precision and F than previous work on sarcasm in debate forums dialogue. We apply a weakly-supervised linguistic pattern learner and qualitatively analyze the linguistic differences in each class.

* 11 pages, 4 figures, SIGDIAL 2016

Via

Access Paper or Ask Questions

Are you serious?: Rhetorical Questions and Sarcasm in Social Media Dialog

Sep 15, 2017

Shereen Oraby, Vrindavan Harrison, Amita Misra, Ellen Riloff, Marilyn Walker

Figure 1 for Are you serious?: Rhetorical Questions and Sarcasm in Social Media Dialog

Figure 2 for Are you serious?: Rhetorical Questions and Sarcasm in Social Media Dialog

Figure 3 for Are you serious?: Rhetorical Questions and Sarcasm in Social Media Dialog

Figure 4 for Are you serious?: Rhetorical Questions and Sarcasm in Social Media Dialog

Abstract:Effective models of social dialog must understand a broad range of rhetorical and figurative devices. Rhetorical questions (RQs) are a type of figurative language whose aim is to achieve a pragmatic goal, such as structuring an argument, being persuasive, emphasizing a point, or being ironic. While there are computational models for other forms of figurative language, rhetorical questions have received little attention to date. We expand a small dataset from previous work, presenting a corpus of 10,270 RQs from debate forums and Twitter that represent different discourse functions. We show that we can clearly distinguish between RQs and sincere questions (0.76 F1). We then show that RQs can be used both sarcastically and non-sarcastically, observing that non-sarcastic (other) uses of RQs are frequently argumentative in forums, and persuasive in tweets. We present experiments to distinguish between these uses of RQs using SVM and LSTM models that represent linguistic features and post-level context, achieving results as high as 0.76 F1 for "sarcastic" and 0.77 F1 for "other" in forums, and 0.83 F1 for both "sarcastic" and "other" in tweets. We supplement our quantitative experiments with an in-depth characterization of the linguistic variation in RQs.

* 10 pages, 1 figure, SIGDIAL 2016

Via

Access Paper or Ask Questions

And That's A Fact: Distinguishing Factual and Emotional Argumentation in Online Dialogue

Sep 15, 2017

Shereen Oraby, Lena Reed, Ryan Compton, Ellen Riloff, Marilyn Walker, Steve Whittaker

Figure 1 for And That's A Fact: Distinguishing Factual and Emotional Argumentation in Online Dialogue

Figure 2 for And That's A Fact: Distinguishing Factual and Emotional Argumentation in Online Dialogue

Figure 3 for And That's A Fact: Distinguishing Factual and Emotional Argumentation in Online Dialogue

Figure 4 for And That's A Fact: Distinguishing Factual and Emotional Argumentation in Online Dialogue

Abstract:We investigate the characteristics of factual and emotional argumentation styles observed in online debates. Using an annotated set of "factual" and "feeling" debate forum posts, we extract patterns that are highly correlated with factual and emotional arguments, and then apply a bootstrapping methodology to find new patterns in a larger pool of unannotated forum posts. This process automatically produces a large set of patterns representing linguistic expressions that are highly correlated with factual and emotional language. Finally, we analyze the most discriminating patterns to better understand the defining characteristics of factual and emotional arguments.

* 11 pages, 6 figures, Proceedings of the 2nd Workshop on Argumentation Mining at NAACL 2015

Via

Access Paper or Ask Questions

Looking Under the Hood : Tools for Diagnosing your Question Answering Engine

Jul 03, 2001

Eric Breck, Marc Light, Gideon S. Mann, Ellen Riloff, Brianne Brown Pranav Anand, Mats Rooth, Michael Thelen

Figure 1 for Looking Under the Hood : Tools for Diagnosing your Question Answering Engine

Figure 2 for Looking Under the Hood : Tools for Diagnosing your Question Answering Engine

Figure 3 for Looking Under the Hood : Tools for Diagnosing your Question Answering Engine

Figure 4 for Looking Under the Hood : Tools for Diagnosing your Question Answering Engine

Abstract:In this paper we analyze two question answering tasks : the TREC-8 question answering task and a set of reading comprehension exams. First, we show that Q/A systems perform better when there are multiple answer opportunities per question. Next, we analyze common approaches to two subproblems: term overlap for answer sentence identification, and answer typing for short answer extraction. We present general tools for analyzing the strengths and limitations of techniques for these subproblems. Our results quantify the limitations of both term overlap and answer typing to distinguish between competing answer candidates.

* Revision of paper appearing in the Proceedings of the Workshop on Open-Domain Question Answering

Via

Access Paper or Ask Questions

A Corpus-Based Approach for Building Semantic Lexicons

Jun 10, 1997

Ellen Riloff, Jessica Shepherd

Figure 1 for A Corpus-Based Approach for Building Semantic Lexicons

Figure 2 for A Corpus-Based Approach for Building Semantic Lexicons

Figure 3 for A Corpus-Based Approach for Building Semantic Lexicons

Figure 4 for A Corpus-Based Approach for Building Semantic Lexicons

Abstract:Semantic knowledge can be a great asset to natural language processing systems, but it is usually hand-coded for each application. Although some semantic information is available in general-purpose knowledge bases such as WordNet and Cyc, many applications require domain-specific lexicons that represent words and categories for a particular topic. In this paper, we present a corpus-based method that can be used to build semantic lexicons for specific categories. The input to the system is a small set of seed words for a category and a representative text corpus. The output is a ranked list of words that are associated with the category. A user then reviews the top-ranked words and decides which ones should be entered in the semantic lexicon. In experiments with five categories, users typically found about 60 words per category in 10-15 minutes to build a core semantic lexicon.

* 8 pages - to appear in Proceedings of EMNLP-2

Via

Access Paper or Ask Questions