Title: Articulatory Features for ASR of Pathological Speech

Jul 28, 2018

Emre Yılmaz, Vikramjit Mitra, Chris Bartels, Horacio Franco

Abstract: In this work, we investigate the joint use of articulatory and acoustic features for automatic speech recognition (ASR) of pathological speech. Despite long-standing efforts to build speaker- and text-independent ASR systems for people with dysarthria, the performance of state-of-the-art systems is still considerably lower on this type of speech than on normal speech. The most prominent reason for the inferior performance is the high variability in pathological speech, which is characterized by spectrotemporal deviations caused by articulatory impairments due to various etiologies. To cope with this high variation, we propose to use speech representations that combine articulatory information with acoustic properties. A designated acoustic model, namely a fused-feature-map convolutional neural network (fCNN), which performs frequency convolution on acoustic features and time convolution on articulatory features, is trained and tested on a Dutch and a Flemish pathological speech corpus. The performance of the fCNN-based ASR system using joint features is compared to that of other neural network architectures, such as conventional CNNs and time-frequency convolutional networks (TFCNNs), in several training scenarios.

* Accepted for publication at Interspeech 2018
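
As a rough illustration of the idea described in the abstract, the sketch below applies a 1-D convolution along the frequency axis of a toy acoustic feature matrix and along the time axis of a toy articulatory feature matrix, then concatenates the two feature maps. It is not the authors' implementation: the function names, kernel values, feature dimensions, and the simple concatenation used for fusion are all assumptions made for illustration, and a real fCNN would learn many kernels and feed the fused maps into further network layers.

```python
# Minimal pure-Python sketch (hypothetical names and toy sizes throughout).

def conv1d_valid(signal, kernel):
    """Valid-mode 1-D cross-correlation of a sequence with a kernel."""
    k = len(kernel)
    return [
        sum(signal[i + j] * kernel[j] for j in range(k))
        for i in range(len(signal) - k + 1)
    ]


def frequency_conv(acoustic, kernel):
    """Slide the kernel across frequency bins, one frame (row) at a time."""
    return [conv1d_valid(frame, kernel) for frame in acoustic]


def time_conv(articulatory, kernel):
    """Slide the kernel across time, one articulatory channel at a time."""
    n_channels = len(articulatory[0])
    channels = [[frame[c] for frame in articulatory] for c in range(n_channels)]
    return [conv1d_valid(channel, kernel) for channel in channels]


def fuse(freq_maps, time_maps):
    """Flatten both feature maps and concatenate them into one vector."""
    fused = [value for row in freq_maps for value in row]
    fused += [value for row in time_maps for value in row]
    return fused


# Toy inputs: 3 frames x 4 frequency bins, and 3 frames x 2 articulatory channels.
acoustic = [[1, 2, 4, 7], [0, 1, 1, 3], [2, 2, 5, 5]]
articulatory = [[0, 2], [2, 4], [4, 6]]

# Kernel values are arbitrary; a trained network would learn them.
fused = fuse(
    frequency_conv(acoustic, [1, -1]),
    time_conv(articulatory, [0.5, 0.5]),
)
print(len(fused))
```

The two streams are convolved along different axes because, under this reading of the abstract, the acoustic stream is treated as a spectral pattern within each frame, while each articulatory channel is treated as a trajectory over time.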
