Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Fast ASR-free and almost zero-resource keyword spotting using DTW and CNNs for humanitarian monitoring

Jun 25, 2018

Raghav Menon, Herman Kamper, John Quinn, Thomas Niesler

Figure 1 for Fast ASR-free and almost zero-resource keyword spotting using DTW and CNNs for humanitarian monitoring

Figure 2 for Fast ASR-free and almost zero-resource keyword spotting using DTW and CNNs for humanitarian monitoring

Figure 3 for Fast ASR-free and almost zero-resource keyword spotting using DTW and CNNs for humanitarian monitoring

Figure 4 for Fast ASR-free and almost zero-resource keyword spotting using DTW and CNNs for humanitarian monitoring

Share this with someone who'll enjoy it:

Abstract:We use dynamic time warping (DTW) as supervision for training a convolutional neural network (CNN) based keyword spotting system using a small set of spoken isolated keywords. The aim is to allow rapid deployment of a keyword spotting system in a new language to support urgent United Nations (UN) relief programmes in parts of Africa where languages are extremely under-resourced and the development of annotated speech resources is infeasible. First, we use 1920 recorded keywords (40 keyword types, 34 minutes of speech) as exemplars in a DTW-based template matching system and apply it to untranscribed broadcast speech. Then, we use the resulting DTW scores as targets to train a CNN on the same unlabelled speech. In this way we use just 34 minutes of labelled speech, but leverage a large amount of unlabelled data for training. While the resulting CNN keyword spotter cannot match the performance of the DTW-based system, it substantially outperforms a CNN classifier trained only on the keywords, improving the area under the ROC curve from 0.54 to 0.64. Because our CNN system is several orders of magnitude faster at runtime than the DTW system, it represents the most viable keyword spotter on this extremely limited dataset.

* 5 pages, 4 figures, 3 tables, accepted at Interspeech 2018

View paper on

Share this with someone who'll enjoy it:

Title:Fast ASR-free and almost zero-resource keyword spotting using DTW and CNNs for humanitarian monitoring

Paper and Code