Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Juliette Kahn

ASR Performance Prediction on Unseen Broadcast Programs using Convolutional Neural Networks

Apr 23, 2018

Zied Elloumi, Laurent Besacier, Olivier Galibert, Juliette Kahn, Benjamin Lecouteux

Figure 1 for ASR Performance Prediction on Unseen Broadcast Programs using Convolutional Neural Networks

Figure 2 for ASR Performance Prediction on Unseen Broadcast Programs using Convolutional Neural Networks

Figure 3 for ASR Performance Prediction on Unseen Broadcast Programs using Convolutional Neural Networks

Figure 4 for ASR Performance Prediction on Unseen Broadcast Programs using Convolutional Neural Networks

Abstract:In this paper, we address a relatively new task: prediction of ASR performance on unseen broadcast programs. We first propose an heterogenous French corpus dedicated to this task. Two prediction approaches are compared: a state-of-the-art performance prediction based on regression (engineered features) and a new strategy based on convolutional neural networks (learnt features). We particularly focus on the combination of both textual (ASR transcription) and signal inputs. While the joint use of textual and signal features did not work for the regression baseline, the combination of inputs for CNNs leads to the best WER prediction performance. We also show that our CNN prediction remarkably predicts the WER distribution on a collection of speech recordings.

* IEEE ICASSP 2018

Via

Access Paper or Ask Questions