Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:E2E Spoken Entity Extraction for Virtual Agents

Mar 01, 2023

Karan Singla, Yeon-Jun Kim, Ryan Price, Shahab Jalalvand, Srinivas Bangalore

Figure 1 for E2E Spoken Entity Extraction for Virtual Agents

Figure 2 for E2E Spoken Entity Extraction for Virtual Agents

Figure 3 for E2E Spoken Entity Extraction for Virtual Agents

Figure 4 for E2E Spoken Entity Extraction for Virtual Agents

Share this with someone who'll enjoy it:

Abstract:This paper reimagines some aspects of speech processing using speech encoders, specifically about extracting entities directly from speech, with no intermediate textual representation. In human-computer conversations, extracting entities such as names, postal addresses and email addresses from speech is a challenging task. In this paper, we study the impact of fine-tuning pre-trained speech encoders on extracting spoken entities in human-readable form directly from speech without the need for text transcription. We illustrate that such a direct approach optimizes the encoder to transcribe only the entity relevant portions of speech, ignoring the superfluous portions such as carrier phrases and spellings of entities. In the context of dialogs from an enterprise virtual agent, we demonstrate that the 1-step approach outperforms the typical 2-step cascade of first generating lexical transcriptions followed by text-based entity extraction for identifying spoken entities.

View paper on

Share this with someone who'll enjoy it:

Title:E2E Spoken Entity Extraction for Virtual Agents

Paper and Code