Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Improving Deliberation by Text-Only and Semi-Supervised Training

Jun 29, 2022

Ke Hu, Tara N. Sainath, Yanzhang He, Rohit Prabhavalkar, Trevor Strohman, Sepand Mavandadi, Weiran Wang

Figure 1 for Improving Deliberation by Text-Only and Semi-Supervised Training

Figure 2 for Improving Deliberation by Text-Only and Semi-Supervised Training

Figure 3 for Improving Deliberation by Text-Only and Semi-Supervised Training

Figure 4 for Improving Deliberation by Text-Only and Semi-Supervised Training

Share this with someone who'll enjoy it:

Abstract:Text-only and semi-supervised training based on audio-only data has gained popularity recently due to the wide availability of unlabeled text and speech data. In this work, we propose incorporating text-only and semi-supervised training into an attention-based deliberation model. By incorporating text-only data in training a bidirectional encoder representation from transformer (BERT) for the deliberation text encoder, and large-scale text-to-speech and audio-only utterances using joint acoustic and text decoder (JATD) and semi-supervised training, we achieved 4%-12% WER reduction for various tasks compared to the baseline deliberation. Compared to a state-of-the-art language model (LM) rescoring method, the deliberation model reduces the Google Voice Search WER by 11% relative. We show that the deliberation model also achieves a positive human side-by-side evaluation compared to the state-of-the-art LM rescorer with reasonable endpointer latencies.

* Accepted by Interspeech 2022

View paper on

Share this with someone who'll enjoy it:

Title:Improving Deliberation by Text-Only and Semi-Supervised Training

Paper and Code