Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sabrina Stehwien

Effects of Word Embeddings on Neural Network-based Pitch Accent Detection

Jun 07, 2018

Sabrina Stehwien, Ngoc Thang Vu, Antje Schweitzer

Figure 1 for Effects of Word Embeddings on Neural Network-based Pitch Accent Detection

Figure 2 for Effects of Word Embeddings on Neural Network-based Pitch Accent Detection

Figure 3 for Effects of Word Embeddings on Neural Network-based Pitch Accent Detection

Figure 4 for Effects of Word Embeddings on Neural Network-based Pitch Accent Detection

Abstract:Pitch accent detection often makes use of both acoustic and lexical features based on the fact that pitch accents tend to correlate with certain words. In this paper, we extend a pitch accent detector that involves a convolutional neural network to include word embeddings, which are state-of-the-art vector representations of words. We examine the effect these features have on within-corpus and cross-corpus experiments on three English datasets. The results show that while word embeddings can improve the performance in corpus-dependent experiments, they also have the potential to make generalization to unseen data more challenging.

* This is an updated version of the paper that has been accepted at Speech Prosody 2018 and published on the ISCA archive. The updates consist of minor corrections that do not change the main conclusions in this work

Via

Access Paper or Ask Questions

Improving coreference resolution with automatically predicted prosodic information

Jul 28, 2017

Ina Rösiger, Sabrina Stehwien, Arndt Riester, Ngoc Thang Vu

Figure 1 for Improving coreference resolution with automatically predicted prosodic information

Figure 2 for Improving coreference resolution with automatically predicted prosodic information

Figure 3 for Improving coreference resolution with automatically predicted prosodic information

Abstract:Adding manually annotated prosodic information, specifically pitch accents and phrasing, to the typical text-based feature set for coreference resolution has previously been shown to have a positive effect on German data. Practical applications on spoken language, however, would rely on automatically predicted prosodic information. In this paper we predict pitch accents (and phrase boundaries) using a convolutional neural network (CNN) model from acoustic features extracted from the speech signal. After an assessment of the quality of these automatic prosodic annotations, we show that they also significantly improve coreference resolution.

* 1st Workshop on Speech-Centric Natural Language Processing (SCNLP) at EMNLP 2017; 6 pages, 1 figure

Via

Access Paper or Ask Questions

Prosodic Event Recognition using Convolutional Neural Networks with Context Information

Jun 02, 2017

Sabrina Stehwien, Ngoc Thang Vu

Figure 1 for Prosodic Event Recognition using Convolutional Neural Networks with Context Information

Figure 2 for Prosodic Event Recognition using Convolutional Neural Networks with Context Information

Figure 3 for Prosodic Event Recognition using Convolutional Neural Networks with Context Information

Figure 4 for Prosodic Event Recognition using Convolutional Neural Networks with Context Information

Abstract:This paper demonstrates the potential of convolutional neural networks (CNN) for detecting and classifying prosodic events on words, specifically pitch accents and phrase boundary tones, from frame-based acoustic features. Typical approaches use not only feature representations of the word in question but also its surrounding context. We show that adding position features indicating the current word benefits the CNN. In addition, this paper discusses the generalization from a speaker-dependent modelling approach to a speaker-independent setup. The proposed method is simple and efficient and yields strong results not only in speaker-dependent but also speaker-independent cases.

* Interspeech 2017 4 pages, 1 figure

Via

Access Paper or Ask Questions