Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:FGCL: Fine-grained Contrastive Learning For Mandarin Stuttering Event Detection

Oct 08, 2024

Han Jiang, Wenyu Wang, Yiquan Zhou, Hongwu Ding, Jiacheng Xu, Jihua Zhu

Figure 1 for FGCL: Fine-grained Contrastive Learning For Mandarin Stuttering Event Detection

Figure 2 for FGCL: Fine-grained Contrastive Learning For Mandarin Stuttering Event Detection

Figure 3 for FGCL: Fine-grained Contrastive Learning For Mandarin Stuttering Event Detection

Figure 4 for FGCL: Fine-grained Contrastive Learning For Mandarin Stuttering Event Detection

Share this with someone who'll enjoy it:

Abstract:This paper presents the T031 team's approach to the StutteringSpeech Challenge in SLT2024. Mandarin Stuttering Event Detection (MSED) aims to detect instances of stuttering events in Mandarin speech. We propose a detailed acoustic analysis method to improve the accuracy of stutter detection by capturing subtle nuances that previous Stuttering Event Detection (SED) techniques have overlooked. To this end, we introduce the Fine-Grained Contrastive Learning (FGCL) framework for MSED. Specifically, we model the frame-level probabilities of stuttering events and introduce a mining algorithm to identify both easy and confusing frames. Then, we propose a stutter contrast loss to enhance the distinction between stuttered and fluent speech frames, thereby improving the discriminative capability of stuttered feature embeddings. Extensive evaluations on English and Mandarin datasets demonstrate the effectiveness of FGCL, achieving a significant increase of over 5.0% in F1 score on Mandarin data.

* Accepted to SLT 2024

View paper on

Share this with someone who'll enjoy it:

Title:FGCL: Fine-grained Contrastive Learning For Mandarin Stuttering Event Detection

Paper and Code