Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT

Oct 16, 2023

Cheol Jun Cho, Abdelrahman Mohamed, Shang-Wen Li, Alan W Black, Gopala K. Anumanchipalli

Figure 1 for SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT

Figure 2 for SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT

Figure 3 for SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT

Figure 4 for SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT

Share this with someone who'll enjoy it:

Abstract:Data-driven unit discovery in self-supervised learning (SSL) of speech has embarked on a new era of spoken language processing. Yet, the discovered units often remain in phonetic space, limiting the utility of SSL representations. Here, we demonstrate that a syllabic organization emerges in learning sentence-level representation of speech. In particular, we adopt "self-distillation" objective to fine-tune the pretrained HuBERT with an aggregator token that summarizes the entire sentence. Without any supervision, the resulting model draws definite boundaries in speech, and the representations across frames show salient syllabic structures. We demonstrate that this emergent structure largely corresponds to the ground truth syllables. Furthermore, we propose a new benchmark task, Spoken Speech ABX, for evaluating sentence-level representation of speech. When compared to previous models, our model outperforms in both unsupervised syllable discovery and learning sentence-level representation. Together, we demonstrate that the self-distillation of HuBERT gives rise to syllabic organization without relying on external labels or modalities, and potentially provides novel data-driven units for spoken language modeling.

View paper on

Share this with someone who'll enjoy it:

Title:SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT

Paper and Code