Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Can Contextual Biasing Remain Effective with Whisper and GPT-2?

Jun 02, 2023

Guangzhi Sun, Xianrui Zheng, Chao Zhang, Philip C. Woodland

Share this with someone who'll enjoy it:

Abstract:End-to-end automatic speech recognition (ASR) and large language models, such as Whisper and GPT-2, have recently been scaled to use vast amounts of training data. Despite the large amount of training data, infrequent content words that occur in a particular task may still exhibit poor ASR performance, with contextual biasing a possible remedy. This paper investigates the effectiveness of neural contextual biasing for Whisper combined with GPT-2. Specifically, this paper proposes integrating an adapted tree-constrained pointer generator (TCPGen) component for Whisper and a dedicated training scheme to dynamically adjust the final output without modifying any Whisper model parameters. Experiments across three datasets show a considerable reduction in errors on biasing words with a biasing list of 1000 words. Contextual biasing was more effective when applied to domain-specific data and can boost the performance of Whisper and GPT-2 without losing their generality.

* To appear in Interspeech 2023

View paper on

Share this with someone who'll enjoy it:

Title:Can Contextual Biasing Remain Effective with Whisper and GPT-2?

Paper and Code