Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zidu Feng

A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect

May 07, 2021

Binbin Xu, Chongyang Tao, Zidu Feng, Youssef Raqui, Sylvie Ranwez

Figure 1 for A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect

Figure 2 for A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect

Figure 3 for A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect

Figure 4 for A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect

Abstract:This study presents a large scale benchmarking on cloud based Speech-To-Text systems: {Google Cloud Speech-To-Text}, {Microsoft Azure Cognitive Services}, {Amazon Transcribe}, {IBM Watson Speech to Text}. For each systems, 40158 clean and noisy speech files about 101 hours are tested. Effect of background noise on STT quality is also evaluated with 5 different Signal-to-noise ratios from 40dB to 0dB. Results showed that {Microsoft Azure} provided lowest transcription error rate $9.09\%$ on clean speech, with high robustness to noisy environment. {Google Cloud} and {Amazon Transcribe} gave similar performance, but the latter is very limited for time-constraint usage. Though {IBM Watson} could work correctly in quiet conditions, it is highly sensible to noisy speech which could strongly limit its application in real life situations.

* 6th National Conference on Practical Applications of Artificial Intelligence, 2021, Bordeaux, France

Via

Access Paper or Ask Questions