Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Evaluation of state-of-the-art ASR Models in Child-Adult Interactions

Sep 24, 2024

Aditya Ashvin, Rimita Lahiri, Aditya Kommineni, Somer Bishop, Catherine Lord, Sudarsana Reddy Kadiri, Shrikanth Narayanan

Figure 1 for Evaluation of state-of-the-art ASR Models in Child-Adult Interactions

Figure 2 for Evaluation of state-of-the-art ASR Models in Child-Adult Interactions

Figure 3 for Evaluation of state-of-the-art ASR Models in Child-Adult Interactions

Figure 4 for Evaluation of state-of-the-art ASR Models in Child-Adult Interactions

Share this with someone who'll enjoy it:

Abstract:The ability to reliably transcribe child-adult conversations in a clinical setting is valuable for diagnosis and understanding of numerous developmental disorders such as Autism Spectrum Disorder. Recent advances in deep learning architectures and availability of large scale transcribed data has led to development of speech foundation models that have shown dramatic improvements in ASR performance. However, the ability of these models to translate well to conversational child-adult interactions is under studied. In this work, we provide a comprehensive evaluation of ASR performance on a dataset containing child-adult interactions from autism diagnostic sessions, using Whisper, Wav2Vec2, HuBERT, and WavLM. We find that speech foundation models show a noticeable performance drop (15-20% absolute WER) for child speech compared to adult speech in the conversational setting. Then, we employ LoRA on the best performing zero shot model (whisper-large) to probe the effectiveness of fine-tuning in a low resource setting, resulting in ~8% absolute WER improvement for child speech and ~13% absolute WER improvement for adult speech.

* 5 pages, 3 figures, 4 tables

View paper on

Share this with someone who'll enjoy it:

Title:Evaluation of state-of-the-art ASR Models in Child-Adult Interactions

Paper and Code