Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Calibrate your listeners! Robust communication-based training for pragmatic speakers

Oct 11, 2021

Rose E. Wang, Julia White, Jesse Mu, Noah D. Goodman

Figure 1 for Calibrate your listeners! Robust communication-based training for pragmatic speakers

Figure 2 for Calibrate your listeners! Robust communication-based training for pragmatic speakers

Figure 3 for Calibrate your listeners! Robust communication-based training for pragmatic speakers

Figure 4 for Calibrate your listeners! Robust communication-based training for pragmatic speakers

Share this with someone who'll enjoy it:

Abstract:To be good conversational partners, natural language processing (NLP) systems should be trained to produce contextually useful utterances. Prior work has investigated training NLP systems with communication-based objectives, where a neural listener stands in as a communication partner. However, these systems commonly suffer from semantic drift where the learned language diverges radically from natural language. We propose a method that uses a population of neural listeners to regularize speaker training. We first show that language drift originates from the poor uncertainty calibration of a neural listener, which makes high-certainty predictions on novel sentences. We explore ensemble- and dropout-based populations of listeners and find that the former results in better uncertainty quantification. We evaluate both population-based objectives on reference games, and show that the ensemble method with better calibration enables the speaker to generate pragmatic utterances while scaling to a large vocabulary and generalizing to new games and listeners.

* Findings of EMNLP 2021 Code: https://github.com/rosewang2008/calibrate_your_listeners

View paper on

Share this with someone who'll enjoy it:

Title:Calibrate your listeners! Robust communication-based training for pragmatic speakers

Paper and Code