Title: Exploring Heterogeneous Characteristics of Layers in ASR Models for More Efficient Training

Oct 08, 2021

Lillian Zhou, Dhruv Guliani, Andreas Kabel, Giovanni Motta, Françoise Beaufays

Abstract: Transformer-based architectures have been the subject of research aimed at understanding their overparameterization and the non-uniform importance of their layers. Applying these approaches to Automatic Speech Recognition, we demonstrate that the state-of-the-art Conformer models generally have multiple ambient layers. We study the stability of these layers across runs and model sizes, propose that group normalization may be used without disrupting their formation, and examine their correlation with model weight updates in each layer. Finally, we apply these findings to Federated Learning in order to improve the training procedure, by targeting Federated Dropout to layers by importance. This allows us to reduce the model size optimized by clients without quality degradation, and shows potential for future exploration.
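To make the idea of targeting Federated Dropout to layers by importance concrete, here is a minimal sketch. It is not the authors' implementation. It assumes each layer is a simple feed-forward block (`w_in`, `w_out`), that a per-layer dropout rate is already known, and that less important layers get a higher rate. The names `select_kept_units`, `extract_submodel`, `merge_update`, and the example rates are all hypothetical.

```python
import random


def select_kept_units(d_hidden, drop_rate, rng):
    """Pick which hidden units a client keeps for one block."""
    n_keep = max(1, round(d_hidden * (1.0 - drop_rate)))
    return sorted(rng.sample(range(d_hidden), n_keep))


def extract_submodel(blocks, drop_rates, rng):
    """Build the smaller model sent to a client.

    blocks: list of (w_in, w_out) pairs; w_in has shape
    (d_model, d_hidden) and w_out has shape (d_hidden, d_model),
    both stored as lists of rows.
    drop_rates: one rate per block (assumed to be higher for
    less important layers).
    """
    sub_blocks, kept = [], []
    for (w_in, w_out), rate in zip(blocks, drop_rates):
        idx = select_kept_units(len(w_out), rate, rng)
        sub_in = [[row[j] for j in idx] for row in w_in]  # keep columns
        sub_out = [list(w_out[j]) for j in idx]           # keep rows
        sub_blocks.append((sub_in, sub_out))
        kept.append(idx)
    return sub_blocks, kept


def merge_update(blocks, sub_blocks, kept):
    """Write a client's trained sub-model back into full-size weights."""
    merged = []
    for (w_in, w_out), (s_in, s_out), idx in zip(blocks, sub_blocks, kept):
        new_in = [list(row) for row in w_in]
        new_out = [list(row) for row in w_out]
        for k, j in enumerate(idx):
            for r in range(len(new_in)):
                new_in[r][j] = s_in[r][k]
            new_out[j] = list(s_out[k])
        merged.append((new_in, new_out))
    return merged


rng = random.Random(0)
d_model, d_hidden = 4, 8
blocks = [
    (
        [[rng.gauss(0, 1) for _ in range(d_hidden)] for _ in range(d_model)],
        [[rng.gauss(0, 1) for _ in range(d_model)] for _ in range(d_hidden)],
    )
    for _ in range(3)
]
# Hypothetical choice: the middle block is treated as less important
# and loses half of its hidden units; the others are left intact.
drop_rates = [0.0, 0.5, 0.0]
sub_blocks, kept = extract_submodel(blocks, drop_rates, rng)
sub_sizes = [len(s_out) for _, s_out in sub_blocks]
merged = merge_update(blocks, sub_blocks, kept)
```

In a real federated setup the server would aggregate updates from many clients, each with its own kept indices. The sketch shows only a single client round trip, which is enough to see where the reduction in client-side model size comes from.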

* © 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
