Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shreyansh Jain

Error Correction in ASR using Sequence-to-Sequence Models

Feb 02, 2022

Samrat Dutta, Shreyansh Jain, Ayush Maheshwari, Ganesh Ramakrishnan, Preethi Jyothi

Figure 1 for Error Correction in ASR using Sequence-to-Sequence Models

Figure 2 for Error Correction in ASR using Sequence-to-Sequence Models

Figure 3 for Error Correction in ASR using Sequence-to-Sequence Models

Figure 4 for Error Correction in ASR using Sequence-to-Sequence Models

Abstract:Post-editing in Automatic Speech Recognition (ASR) entails automatically correcting common and systematic errors produced by the ASR system. The outputs of an ASR system are largely prone to phonetic and spelling errors. In this paper, we propose to use a powerful pre-trained sequence-to-sequence model, BART, further adaptively trained to serve as a denoising model, to correct errors of such types. The adaptive training is performed on an augmented dataset obtained by synthetically inducing errors as well as by incorporating actual errors from an existing ASR system. We also propose a simple approach to rescore the outputs using word level alignments. Experimental results on accented speech data demonstrate that our strategy effectively rectifies a significant number of ASR errors and produces improved WER results when compared against a competitive baseline.

Via

Access Paper or Ask Questions