Vincent Roger

Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data

Mar 09, 2020

Vincent Roger, Jérôme Farinas, Julien Pinquier

Abstract: Most state-of-the-art speech systems use Deep Neural Networks (DNNs). These systems require large amounts of training data. Hence, training state-of-the-art frameworks for under-resourced languages or speech problems is difficult. One example is the limited amount of data available for impaired speech. Furthermore, acquiring more data and/or expertise is time-consuming and expensive. In this paper we focus on the following speech processing tasks: Automatic Speech Recognition, speaker identification and emotion recognition. To assess the problem of limited data, we first investigate state-of-the-art Automatic Speech Recognition systems, as this is the hardest of these tasks (due to the large variability in each language). Next, we provide an overview of techniques and tasks requiring less data. In the last section we investigate few-shot techniques, as we interpret under-resourced speech as a few-shot problem. In that sense we propose an overview of few-shot techniques and perspectives on using such techniques for the speech problems this survey focuses on. It turns out that the reviewed techniques are not well adapted to large datasets. Nevertheless, some promising results from the literature encourage the use of such techniques for speech processing.

Semi-Supervised Learning via New Deep Network Inversion

Nov 12, 2017

Randall Balestriero, Vincent Roger, Herve G. Glotin, Richard G. Baraniuk

Abstract: We exploit a recently derived inversion scheme for arbitrary deep neural networks to develop a new semi-supervised learning framework that applies to a wide range of systems and problems. The approach outperforms current state-of-the-art methods on MNIST, reaching $99.14\%$ test set accuracy while using $5$ labeled examples per class. Experiments with one-dimensional signals highlight the generality of the method. Importantly, our approach is simple, efficient, and requires no change in the deep network architecture.

* arXiv admin note: substantial text overlap with arXiv:1710.09302
