Self-supervised speech recognition models require considerable labeled training data for learning high-fidelity representations for Automatic Speech Recognition (ASR), which hinders their application to low-resource languages. We consider the task of identifying an optimal subset of training data for fine-tuning self-supervised speech models for ASR. We make the surprising observation that active learning strategies that sample harder-to-learn examples do not perform better than random subset selection when fine-tuning self-supervised ASR models. We then present the COWERAGE algorithm for better subset selection in self-supervised ASR, which is based on our finding that ensuring coverage of examples according to their training word error rate (WER) in the early training epochs leads to better generalization performance. Extensive experiments with the wav2vec 2.0 model on the TIMIT dataset show the effectiveness of COWERAGE, with up to 27% absolute WER improvement over active learning methods. We also report the connection between training WER and phonemic cover, and demonstrate that our algorithm ensures the inclusion of phonemically diverse examples.
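
For intuition, a minimal sketch of WER-coverage-based subset selection is given below. The equal-width bucketing scheme, the function name coverage_subset, and parameters such as num_buckets are illustrative assumptions for exposition, not the paper's exact procedure; the key idea it mirrors is selecting examples that span the full range of early-epoch training WER rather than only the hardest ones.

```python
import random
from collections import defaultdict

def coverage_subset(example_ids, early_epoch_wer, budget, num_buckets=10, seed=0):
    """Pick a subset whose early-epoch training WER values cover the full range.

    example_ids     : list of example identifiers
    early_epoch_wer : dict mapping example id -> training WER from an early epoch
    budget          : number of examples to keep
    num_buckets     : how finely to partition the WER range (illustrative choice)
    """
    rng = random.Random(seed)
    lo = min(early_epoch_wer.values())
    hi = max(early_epoch_wer.values())
    width = (hi - lo) / num_buckets or 1.0

    # Group examples into equal-width WER buckets.
    buckets = defaultdict(list)
    for ex in example_ids:
        b = min(int((early_epoch_wer[ex] - lo) / width), num_buckets - 1)
        buckets[b].append(ex)

    # Draw roughly equally from every non-empty bucket so that easy,
    # medium, and hard examples are all represented in the subset.
    non_empty = [buckets[b] for b in sorted(buckets)]
    per_bucket = max(1, budget // len(non_empty))
    selected = []
    for group in non_empty:
        rng.shuffle(group)
        selected.extend(group[:per_bucket])

    # Top up with random leftovers or trim to hit the exact budget.
    if len(selected) < budget:
        chosen = set(selected)
        remaining = [ex for ex in example_ids if ex not in chosen]
        rng.shuffle(remaining)
        selected.extend(remaining[:budget - len(selected)])
    return selected[:budget]
```

In this sketch, the per-bucket quota makes the selected subset cover low-, medium-, and high-WER training examples, which is the coverage property the abstract credits for better generalization and phonemic diversity.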