Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mark J F Gales

N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space

Mar 01, 2023

Rao Ma, Mark J F Gales, Kate Knill, Mengjie Qian

Figure 1 for N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space

Figure 2 for N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space

Figure 3 for N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space

Figure 4 for N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space

Abstract:Error correction models form an important part of Automatic Speech Recognition (ASR) post-processing to improve the readability and quality of transcriptions. Most prior works use the 1-best ASR hypothesis as input and therefore can only perform correction by leveraging the context within one sentence. In this work, we propose a novel N-best T5 model for this task, which is fine-tuned from a T5 model and utilizes ASR N-best lists as model input. By transferring knowledge from the pre-trained language model and obtaining richer information from the ASR decoding space, the proposed approach outperforms a strong Conformer-Transducer baseline. Another issue with standard error correction is that the generation process is not well-guided. To address this a constrained decoding process, either based on the N-best list or an ASR lattice, is used which allows additional information to be propagated.

Via

Access Paper or Ask Questions