Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ali Orkan Bayer

Lattention: Lattice-attention in ASR rescoring

Nov 19, 2021

Prabhat Pandey, Sergio Duarte Torres, Ali Orkan Bayer, Ankur Gandhe, Volker Leutnant

Figure 1 for Lattention: Lattice-attention in ASR rescoring

Figure 2 for Lattention: Lattice-attention in ASR rescoring

Figure 3 for Lattention: Lattice-attention in ASR rescoring

Figure 4 for Lattention: Lattice-attention in ASR rescoring

Abstract:Lattices form a compact representation of multiple hypotheses generated from an automatic speech recognition system and have been shown to improve performance of downstream tasks like spoken language understanding and speech translation, compared to using one-best hypothesis. In this work, we look into the effectiveness of lattice cues for rescoring n-best lists in second-pass. We encode lattices with a recurrent network and train an attention encoder-decoder model for n-best rescoring. The rescoring model with attention to lattices achieves 4-5% relative word error rate reduction over first-pass and 6-8% with attention to both lattices and acoustic features. We show that rescoring models with attention to lattices outperform models with attention to n-best hypotheses. We also study different ways to incorporate lattice weights in the lattice encoder and demonstrate their importance for n-best rescoring.

* Submitted to ICASSP 2022

Via

Access Paper or Ask Questions