Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup

Nov 02, 2022

Vasista Sai Lodagala, Sreyan Ghosh, S. Umesh

Figure 1 for data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup

Figure 2 for data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup

Figure 3 for data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup

Share this with someone who'll enjoy it:

Abstract:In this paper, we propose a new Self-Supervised Learning (SSL) algorithm called data2vec-aqc, for speech representation learning from unlabeled speech data. Our goal is to improve SSL for speech in domains where both unlabeled and labeled data are limited. Building on the recently introduced data2vec, we introduce additional modules to the data2vec framework that leverage the benefit of data augmentations, quantized representations, and clustering. The interaction between these modules helps solve the cross-contrastive loss as an additional self-supervised objective. data2vec-aqc achieves up to 14.1% and 20.9% relative WER improvement over the existing state-of-the-art data2vec system on the test-clean and test-other sets, respectively, of LibriSpeech, without the use of any language model. Our proposed model also achieves up to 17.8% relative WER improvement over the baseline data2vec when fine-tuned on Switchboard data.

* Submitted to ICASSP 2023. arXiv admin note: text overlap with arXiv:2210.02592

View paper on

Share this with someone who'll enjoy it:

Title:data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup

Paper and Code