The NLP community has broadly focused on text-only approaches to cognitive state tasks, but audio can provide vital missing cues through prosody. We posit that text-to-speech models learn to track aspects of cognitive state in order to produce naturalistic audio, and that the signal that audio models implicitly identify is orthogonal to the information that language models exploit. We present Synthetic Audio Data fine-tuning (SAD), a framework in which we show that 7 tasks related to cognitive state modeling benefit from multimodal training on both text and zero-shot synthetic audio data from an off-the-shelf TTS system. When synthetic audio data is added to text-only corpora, we show an improvement over the text-only modality. Furthermore, on tasks and corpora that do contain gold audio, we show that the SAD framework with text and synthetic audio achieves performance competitive with using text and gold audio.
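
To make the pipeline concrete, below is a minimal sketch of the SAD setup, assuming a generic off-the-shelf TTS callable and simple concatenation-based fusion of text and audio features. All names here (`Example`, `add_synthetic_audio`, `fuse_features`, `tts`) are illustrative placeholders, not the authors' actual implementation.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional

import numpy as np


@dataclass
class Example:
    """One training instance for a cognitive state task."""
    text: str                           # original text-only input
    label: int                          # cognitive-state label for the task
    audio: Optional[np.ndarray] = None  # filled with zero-shot synthetic speech


def add_synthetic_audio(
    examples: List[Example],
    tts: Callable[[str], np.ndarray],   # placeholder: off-the-shelf TTS, text -> waveform
) -> List[Example]:
    """Augment a text-only corpus with zero-shot synthetic audio.

    The TTS system is used as-is (zero-shot); it is never fine-tuned
    on the task data itself.
    """
    for ex in examples:
        ex.audio = tts(ex.text)
    return examples


def fuse_features(text_vec: np.ndarray, audio_vec: np.ndarray) -> np.ndarray:
    """Late fusion by concatenating text- and audio-encoder outputs.

    The paper's actual fusion strategy may differ; concatenation is
    just the simplest way to combine the two signals before a task
    classifier.
    """
    return np.concatenate([text_vec, audio_vec])
```

Concatenation is chosen for this sketch only because it cleanly reflects the orthogonality hypothesis: if the prosodic signal were redundant with the textual signal, the fused representation would add nothing over the text features alone.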