Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zhaobiao Lv

Exploring Energy-based Language Models with Different Architectures and Training Methods for Speech Recognition

May 26, 2023

Hong Liu, Zhaobiao Lv, Zhijian Ou, Wenbo Zhao, Qing Xiao

Figure 1 for Exploring Energy-based Language Models with Different Architectures and Training Methods for Speech Recognition

Figure 2 for Exploring Energy-based Language Models with Different Architectures and Training Methods for Speech Recognition

Figure 3 for Exploring Energy-based Language Models with Different Architectures and Training Methods for Speech Recognition

Figure 4 for Exploring Energy-based Language Models with Different Architectures and Training Methods for Speech Recognition

Abstract:Energy-based language models (ELMs) parameterize an unnormalized distribution for natural sentences and are radically different from popular autoregressive language models (ALMs). As an important application, ELMs have been successfully used as a means for calculating sentence scores in speech recognition, but they all use less-modern CNN or LSTM networks. The recent progress in Transformer networks and large pretrained models such as BERT and GPT2 opens new possibility to further advancing ELMs. In this paper, we explore different architectures of energy functions and different training methods to investigate the capabilities of ELMs in rescoring for speech recognition, all using large pretrained models as backbones.

* Accepted into INTERSPEECH 2023

Via

Access Paper or Ask Questions