Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Knowledge distillation from language model to acoustic model: a hierarchical multi-task learning approach

Oct 20, 2021

Mun-Hak Lee, Joon-Hyuk Chang

Figure 1 for Knowledge distillation from language model to acoustic model: a hierarchical multi-task learning approach

Figure 2 for Knowledge distillation from language model to acoustic model: a hierarchical multi-task learning approach

Figure 3 for Knowledge distillation from language model to acoustic model: a hierarchical multi-task learning approach

Figure 4 for Knowledge distillation from language model to acoustic model: a hierarchical multi-task learning approach

Share this with someone who'll enjoy it:

Abstract:The remarkable performance of the pre-trained language model (LM) using self-supervised learning has led to a major paradigm shift in the study of natural language processing. In line with these changes, leveraging the performance of speech recognition systems with massive deep learning-based LMs is a major topic of speech recognition research. Among the various methods of applying LMs to speech recognition systems, in this paper, we focus on a cross-modal knowledge distillation method that transfers knowledge between two types of deep neural networks with different modalities. We propose an acoustic model structure with multiple auxiliary output layers for cross-modal distillation and demonstrate that the proposed method effectively compensates for the shortcomings of the existing label-interpolation-based distillation method. In addition, we extend the proposed method to a hierarchical distillation method using LMs trained in different units (senones, monophones, and subwords) and reveal the effectiveness of the hierarchical distillation method through an ablation study.

* 4page + 1page for citation + 2 pages for appendix

View paper on

Share this with someone who'll enjoy it:

Title:Knowledge distillation from language model to acoustic model: a hierarchical multi-task learning approach

Paper and Code