Title: On the Relation between Internal Language Model and Sequence Discriminative Training for Neural Transducers

Sep 25, 2023

Zijian Yang, Wei Zhou, Ralf Schlüter, Hermann Ney

Abstract: Internal language model (ILM) subtraction has been widely applied to improve the performance of the RNN-Transducer with external language model (LM) fusion for speech recognition. In this work, we show that sequence discriminative training has a strong correlation with ILM subtraction from both theoretical and empirical points of view. Theoretically, we derive that the global optimum of maximum mutual information (MMI) training has a form similar to that of ILM subtraction. Empirically, we show that ILM subtraction and sequence discriminative training achieve similar performance across a wide range of experiments on Librispeech, including both MMI and minimum Bayes risk (MBR) criteria, as well as neural transducers and LMs of both full and limited context. The benefit of ILM subtraction also becomes much smaller after sequence discriminative training. We also provide an in-depth study showing that sequence discriminative training has a minimal effect on the commonly used zero-encoder ILM estimation, but a joint effect on both the encoder and the prediction + joint network for posterior probability reshaping, including both ILM and blank suppression.
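To make the ILM subtraction idea in the abstract concrete, the following is a minimal sketch of the log-linear score combination commonly used for external LM fusion with ILM subtraction. It is not taken from the paper: the function names, the dictionary keys, the scale values, and the toy log-probabilities are all assumptions chosen for illustration.

```python
def fused_score(log_p_transducer, log_p_ext_lm, log_p_ilm,
                lm_scale=0.5, ilm_scale=0.3):
    """Combine sequence-level log-probabilities log-linearly.

    The external LM score is added with weight lm_scale and the
    estimated internal LM score is subtracted with weight ilm_scale.
    Both default scales are illustrative assumptions, not tuned values.
    """
    return log_p_transducer + lm_scale * log_p_ext_lm - ilm_scale * log_p_ilm


def pick_best(hyps, lm_scale, ilm_scale):
    """Return the hypothesis with the highest fused score."""
    return max(
        hyps,
        key=lambda h: fused_score(h["am"], h["lm"], h["ilm"],
                                  lm_scale, ilm_scale),
    )


# Toy n-best list with made-up log-probabilities (hypothetical data).
hyps = [
    {"text": "A", "am": -4.0, "lm": -9.0, "ilm": -2.0},
    {"text": "B", "am": -4.5, "lm": -5.0, "ilm": -4.0},
]

# Transducer score alone prefers "A".
best_plain = pick_best(hyps, lm_scale=0.0, ilm_scale=0.0)

# With external LM fusion and ILM subtraction the ranking changes to "B".
best_fused = pick_best(hyps, lm_scale=0.5, ilm_scale=0.3)
```

In this toy setup, hypothesis "A" wins on the transducer score alone, while "B" wins once the external LM is added and the internal LM estimate is discounted. In practice the two scales would be tuned on held-out data.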

* submitted to ICASSP 2024
