Julia White

Leveraging Explicit Procedural Instructions for Data-Efficient Action Prediction

Jun 06, 2023

Julia White, Arushi Raghuvanshi, Yada Pruksachatkun

Abstract: Task-oriented dialogues often require agents to enact complex, multi-step procedures in order to meet user requests. While large language models have found success automating these dialogues in constrained environments, their widespread deployment is limited by the substantial quantities of task-specific data required for training. This paper presents a data-efficient solution to constructing dialogue systems, leveraging explicit instructions derived from agent guidelines, such as company policies or customer service manuals. Our proposed Knowledge-Augmented Dialogue System (KADS) combines a large language model with a knowledge retrieval module that, given a user-agent interaction, pulls documents outlining relevant procedures from a predefined set of policies. To train this system, we introduce a semi-supervised pre-training scheme that employs dialogue-document matching and action-oriented masked language modeling with partial parameter freezing. We evaluate the effectiveness of our approach on prominent task-oriented dialogue datasets, the Action-Based Conversations Dataset and Schema-Guided Dialogue, for two dialogue tasks: action state tracking and workflow discovery. Our results demonstrate that procedural knowledge augmentation improves accuracy in predicting in- and out-of-distribution actions while preserving high performance in settings with low or sparse data.

Mixed-effects transformers for hierarchical adaptation

May 03, 2022

Julia White, Noah Goodman, Robert Hawkins

Abstract: Language use differs dramatically from context to context. To some degree, modern language models like GPT-3 are able to account for such variance by conditioning on a string of previous input text, or prompt. Yet prompting is ineffective when contexts are sparse, out-of-sample, or extra-textual; for instance, accounting for when and where the text was produced or who produced it. In this paper, we introduce the mixed-effects transformer (MET), a novel approach for learning hierarchically-structured prefixes -- lightweight modules prepended to the input -- to account for structured variation. Specifically, we show how the popular class of mixed-effects models may be extended to transformer-based architectures using a regularized prefix-tuning procedure with dropout. We evaluate this approach on several domain-adaptation benchmarks, finding that it efficiently adapts to novel contexts with minimal data while still effectively generalizing to unseen contexts.

Open-domain clarification question generation without question examples

Oct 19, 2021

Julia White, Gabriel Poesia, Robert Hawkins, Dorsa Sadigh, Noah Goodman

Abstract: An overarching goal of natural language processing is to enable machines to communicate seamlessly with humans. However, natural language can be ambiguous or unclear. In cases of uncertainty, humans engage in an interactive process known as repair: asking questions and seeking clarification until their uncertainty is resolved. We propose a framework for building a visually grounded question-asking model capable of producing polar (yes-no) clarification questions to resolve misunderstandings in dialogue. Our model uses an expected information gain objective to derive informative questions from an off-the-shelf image captioner without requiring any supervised question-answer data. We demonstrate our model's ability to pose questions that improve communicative success in a goal-oriented 20 questions game with synthetic and human answerers.

* EMNLP 2021
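
The expected information gain objective mentioned in the abstract above can be illustrated with a small, generic sketch. This is not the authors' implementation: the belief vector, the per-candidate "yes" probabilities, and the function names are all hypothetical, and in the paper those probabilities would come from an image captioner rather than being written by hand. The sketch only shows how a polar question can be scored by how much it is expected to reduce uncertainty over candidate targets.

```python
import math


def entropy(dist):
    """Shannon entropy in bits of a discrete distribution."""
    return -sum(p * math.log2(p) for p in dist if p > 0)


def expected_information_gain(prior, p_yes):
    """Expected reduction in entropy over candidates after a yes/no answer.

    prior: belief over candidate targets (sums to 1).
    p_yes: assumed probability of a "yes" answer for each candidate.
    """
    eig = entropy(prior)
    p_no = [1.0 - a for a in p_yes]
    for likelihood in (p_yes, p_no):
        # Marginal probability of this answer under the current belief.
        p_answer = sum(pr * l for pr, l in zip(prior, likelihood))
        if p_answer > 0:
            # Bayesian update of the belief given this answer.
            posterior = [pr * l / p_answer for pr, l in zip(prior, likelihood)]
            eig -= p_answer * entropy(posterior)
    return eig


# Hypothetical example: four equally likely images and three candidate questions.
prior = [0.25, 0.25, 0.25, 0.25]
questions = {
    "is there a dog?": [1.0, 1.0, 0.0, 0.0],   # splits the candidates evenly
    "is it outdoors?": [1.0, 0.0, 0.0, 0.0],   # singles out one candidate
    "is it a photo?": [1.0, 1.0, 1.0, 1.0],    # true of every candidate
}
best = max(questions, key=lambda q: expected_information_gain(prior, questions[q]))
print(best)
```

Under these assumed numbers, the question that splits the candidates evenly scores highest, and a question that is true of every candidate scores zero.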

Calibrate your listeners! Robust communication-based training for pragmatic speakers

Oct 11, 2021

Rose E. Wang, Julia White, Jesse Mu, Noah D. Goodman

Abstract: To be good conversational partners, natural language processing (NLP) systems should be trained to produce contextually useful utterances. Prior work has investigated training NLP systems with communication-based objectives, where a neural listener stands in as a communication partner. However, these systems commonly suffer from semantic drift where the learned language diverges radically from natural language. We propose a method that uses a population of neural listeners to regularize speaker training. We first show that language drift originates from the poor uncertainty calibration of a neural listener, which makes high-certainty predictions on novel sentences. We explore ensemble- and dropout-based populations of listeners and find that the former results in better uncertainty quantification. We evaluate both population-based objectives on reference games, and show that the ensemble method with better calibration enables the speaker to generate pragmatic utterances while scaling to a large vocabulary and generalizing to new games and listeners.

* Findings of EMNLP 2021
* Code: https://github.com/rosewang2008/calibrate_your_listeners

Learning to refer informatively by amortizing pragmatic reasoning

May 31, 2020

Julia White, Jesse Mu, Noah D. Goodman

Abstract: A hallmark of human language is the ability to effectively and efficiently convey contextually relevant information. One theory for how humans reason about language is presented in the Rational Speech Acts (RSA) framework, which captures pragmatic phenomena via a process of recursive social reasoning (Goodman & Frank, 2016). However, RSA represents ideal reasoning in an unconstrained setting. We explore the idea that speakers might learn to amortize the cost of RSA computation over time by directly optimizing for successful communication with an internal listener model. In simulations with grounded neural speakers and listeners across two communication game datasets representing synthetic and human-generated data, we find that our amortized model is able to quickly generate language that is effective and concise across a range of contexts, without the need for explicit pragmatic reasoning.

* Accepted to CogSci 2020
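
For readers unfamiliar with the RSA framework named in the abstract above, the following is a minimal sketch of the explicit recursive computation that the paper proposes to amortize. It is a textbook-style toy, not the authors' neural model: the three-object reference game, the truth-value lexicon, the uniform object prior, and the `alpha` rationality parameter are all illustrative assumptions.

```python
# Hypothetical reference game: objects indexed 0..2.
OBJECTS = ["blue square", "blue circle", "green square"]

# Truth-value lexicon (assumed): LEXICON[utterance][i] is 1 if the
# utterance is literally true of object i, else 0.
LEXICON = {
    "blue":   [1, 1, 0],
    "green":  [0, 0, 1],
    "square": [1, 0, 1],
    "circle": [0, 1, 0],
}


def literal_listener(lexicon, utterance):
    """L0(object | utterance): uniform over objects the utterance is true of."""
    row = lexicon[utterance]
    total = sum(row)
    return [v / total for v in row]


def pragmatic_speaker(lexicon, target, alpha=1.0):
    """S1(utterance | target), proportional to L0(target | utterance) ** alpha."""
    scores = {
        u: literal_listener(lexicon, u)[target] ** alpha
        for u in lexicon
    }
    z = sum(scores.values())
    return {u: s / z for u, s in scores.items()}


# Distribution over utterances when the speaker wants to refer to "blue circle".
speaker = pragmatic_speaker(LEXICON, target=1)
for utterance, prob in speaker.items():
    print(f"{utterance}: {prob:.3f}")
```

In this toy game the speaker prefers "circle" over "blue" for the blue circle, because "circle" identifies the target uniquely for a literal listener. The explicit version requires scoring every utterance against a listener model, which is the cost the abstract describes amortizing.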
