Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Earnings-21: A Practical Benchmark for ASR in the Wild

Apr 28, 2021

Miguel Del Rio, Natalie Delworth, Ryan Westerman, Michelle Huang, Nishchal Bhandari, Joseph Palakapilly, Quinten McNamara, Joshua Dong, Piotr Zelasko, Miguel Jette

Figure 1 for Earnings-21: A Practical Benchmark for ASR in the Wild

Figure 2 for Earnings-21: A Practical Benchmark for ASR in the Wild

Figure 3 for Earnings-21: A Practical Benchmark for ASR in the Wild

Figure 4 for Earnings-21: A Practical Benchmark for ASR in the Wild

Share this with someone who'll enjoy it:

Abstract:Commonly used speech corpora inadequately challenge academic and commercial ASR systems. In particular, speech corpora lack metadata needed for detailed analysis and WER measurement. In response, we present Earnings-21, a 39-hour corpus of earnings calls containing entity-dense speech from nine different financial sectors. This corpus is intended to benchmark ASR systems in the wild with special attention towards named entity recognition. We benchmark four commercial ASR models, two internal models built with open-source tools, and an open-source LibriSpeech model and discuss their differences in performance on Earnings-21. Using our recently released fstalign tool, we provide a candid analysis of each model's recognition capabilities under different partitions. Our analysis finds that ASR accuracy for certain NER categories is poor, presenting a significant impediment to transcript comprehension and usage. Earnings-21 bridges academic and commercial ASR system evaluation and enables further research on entity modeling and WER on real world audio.

* submitted to INTERSPEECH 2021 Update April 28th, 2021: We found and resolved an issue in our experimental evaluation that scored the LibriSpeech model at ~20% worse relative WER than the actual WER. The updated results do not affect our conclusions

View paper on

Share this with someone who'll enjoy it:

Title:Earnings-21: A Practical Benchmark for ASR in the Wild

Paper and Code