Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Learning a Dual-Mode Speech Recognition Model via Self-Pruning

Jul 25, 2022

Chunxi Liu, Yuan Shangguan, Haichuan Yang, Yangyang Shi, Raghuraman Krishnamoorthi, Ozlem Kalinli

Figure 1 for Learning a Dual-Mode Speech Recognition Model via Self-Pruning

Figure 2 for Learning a Dual-Mode Speech Recognition Model via Self-Pruning

Figure 3 for Learning a Dual-Mode Speech Recognition Model via Self-Pruning

Figure 4 for Learning a Dual-Mode Speech Recognition Model via Self-Pruning

Share this with someone who'll enjoy it:

Abstract:There is growing interest in unifying the streaming and full-context automatic speech recognition (ASR) networks into a single end-to-end ASR model to simplify the model training and deployment for both use cases. While in real-world ASR applications, the streaming ASR models typically operate under more storage and computational constraints - e.g., on embedded devices - than any server-side full-context models. Motivated by the recent progress in Omni-sparsity supernet training, where multiple subnetworks are jointly optimized in one single model, this work aims to jointly learn a compact sparse on-device streaming ASR model, and a large dense server non-streaming model, in a single supernet. Next, we present that, performing supernet training on both wav2vec 2.0 self-supervised learning and supervised ASR fine-tuning can not only substantially improve the large non-streaming model as shown in prior works, and also be able to improve the compact sparse streaming model.

* 7 pages, 1 figure

View paper on

Share this with someone who'll enjoy it:

Title:Learning a Dual-Mode Speech Recognition Model via Self-Pruning

Paper and Code