Title: Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Nov 01, 2023

Sanchit Gandhi, Patrick von Platen, Alexander M. Rush

Abstract: As the size of pre-trained speech recognition models increases, running these large models in low-latency or resource-constrained environments becomes challenging. In this work, we leverage pseudo-labelling to assemble a large-scale open-source dataset which we use to distill the Whisper model into a smaller variant, called Distil-Whisper. Using a simple word error rate (WER) heuristic, we select only the highest quality pseudo-labels for training. The distilled model is 5.8 times faster with 51% fewer parameters, while performing to within 1% WER on out-of-distribution test data in a zero-shot transfer setting. Distil-Whisper maintains the robustness of the Whisper model to difficult acoustic conditions, while being less prone to hallucination errors on long-form audio. Distil-Whisper is designed to be paired with Whisper for speculative decoding, yielding a 2 times speed-up while mathematically ensuring the same outputs as the original model. To facilitate further research in this domain, we make our training code, inference code and models publicly accessible.
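
The abstract does not spell out the WER heuristic, so the following is only a minimal sketch of how such a filter could look. It assumes each training sample carries both the teacher's pseudo-label and an existing reference transcript, and that a sample is kept when the WER between the two falls at or below a threshold. The function names, the dictionary keys, the simple lowercase normalisation and the 0.10 threshold are illustrative assumptions, not details taken from the abstract.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    # Assumption: lowercasing and whitespace splitting stand in for a real
    # text normaliser.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        # Convention chosen for this sketch: an empty reference scores 0.0
        # only if the hypothesis is empty too.
        return 0.0 if not hyp else 1.0

    # Standard dynamic-programming Levenshtein distance over words,
    # keeping only the previous row of the table.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def filter_pseudo_labels(samples, max_wer=0.10):
    """Keep samples whose pseudo-label is close to the reference transcript.

    `samples` is a list of dicts with hypothetical keys "reference" and
    "pseudo_label"; `max_wer` is an illustrative threshold.
    """
    return [
        s for s in samples
        if word_error_rate(s["reference"], s["pseudo_label"]) <= max_wer
    ]


samples = [
    {"reference": "the cat sat on the mat", "pseudo_label": "the cat sat on the mat"},
    {"reference": "the cat sat on the mat", "pseudo_label": "the cat sat on a mat"},
]
kept = filter_pseudo_labels(samples)
```

In this toy input the second sample has one substituted word out of six, so its WER exceeds the illustrative threshold and it is dropped. A stricter or looser threshold trades dataset size against label quality.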

* 30 pages, 2 figures, 25 tables
