Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network

May 21, 2023

Kaixun Huang, Ao Zhang, Zhanheng Yang, Pengcheng Guo, Bingshen Mu, Tianyi Xu, Lei Xie

Figure 1 for Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network

Figure 2 for Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network

Figure 3 for Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network

Figure 4 for Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network

Share this with someone who'll enjoy it:

Abstract:Contextual information plays a crucial role in speech recognition technologies and incorporating it into the end-to-end speech recognition models has drawn immense interest recently. However, previous deep bias methods lacked explicit supervision for bias tasks. In this study, we introduce a contextual phrase prediction network for an attention-based deep bias method. This network predicts context phrases in utterances using contextual embeddings and calculates bias loss to assist in the training of the contextualized model. Our method achieved a significant word error rate (WER) reduction across various end-to-end speech recognition models. Experiments on the LibriSpeech corpus show that our proposed model obtains a 12.1% relative WER improvement over the baseline model, and the WER of the context phrases decreases relatively by 40.5%. Moreover, by applying a context phrase filtering strategy, we also effectively eliminate the WER degradation when using a larger biasing list.

* Accepted by interspeech2023

View paper on

Share this with someone who'll enjoy it:

Title:Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network

Paper and Code