Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information

Apr 15, 2018

Trang Tran, Shubham Toshniwal, Mohit Bansal, Kevin Gimpel, Karen Livescu, Mari Ostendorf

Figure 1 for Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information

Figure 2 for Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information

Figure 3 for Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information

Figure 4 for Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information

Share this with someone who'll enjoy it:

Abstract:In conversational speech, the acoustic signal provides cues that help listeners disambiguate difficult parses. For automatically parsing spoken utterances, we introduce a model that integrates transcribed text and acoustic-prosodic features using a convolutional neural network over energy and pitch trajectories coupled with an attention-based recurrent neural network that accepts text and prosodic features. We find that different types of acoustic-prosodic features are individually helpful, and together give statistically significant improvements in parse and disfluency detection F1 scores over a strong text-only baseline. For this study with known sentence boundaries, error analyses show that the main benefit of acoustic-prosodic features is in sentences with disfluencies, attachment decisions are most improved, and transcription errors obscure gains from prosody.

* Accepted in NAACL HLT 2018

View paper on

Share this with someone who'll enjoy it:

Title:Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information

Paper and Code