We propose gated language experts to improve multilingual transformer transducer models without requiring any language identification (LID) input from users during inference. We define a gating mechanism and an LID loss so that the transformer encoders learn language-dependent information, construct the multilingual transformer block from gated transformer experts and shared transformer layers to keep the model compact, and apply linear experts on the joint network output to better regularize the joint speech-acoustic and token-label information. Furthermore, we propose a curriculum training scheme in which LID guides the gated language experts toward better serving their corresponding languages. Evaluated on an English-Spanish bilingual task, our methods achieve average relative word error rate reductions of 12.5% and 7.3% over the baseline bilingual model and the monolingual models, respectively, performing comparably to the upper-bound model trained and evaluated with oracle LID. We further apply our method to trilingual, quadrilingual, and pentalingual models and observe advantages similar to those of the bilingual model, demonstrating that the approach extends readily to more languages.
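To make the described architecture concrete, the following is a minimal sketch (not the authors' implementation) of the gated language-expert idea: per-language transformer expert layers whose outputs are mixed by gating weights derived from an internal LID prediction, alongside a shared layer for compactness. All names here (GatedExpertBlock, lid_head, the pooling choice, layer sizes) are illustrative assumptions.

```python
import torch
import torch.nn as nn


class GatedExpertBlock(nn.Module):
    """One multilingual block: a shared layer plus per-language experts with gating."""

    def __init__(self, d_model: int, n_heads: int, num_languages: int):
        super().__init__()
        # Shared transformer layer keeps the model compact across languages.
        self.shared = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        # One expert transformer layer per language.
        self.experts = nn.ModuleList(
            nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
            for _ in range(num_languages)
        )
        # Internal LID head drives the gate; its logits can also receive an
        # auxiliary LID loss during training (no user-provided LID at inference).
        self.lid_head = nn.Linear(d_model, num_languages)

    def forward(self, x: torch.Tensor):
        # x: (batch, time, d_model)
        h = self.shared(x)
        # Utterance-level LID logits from mean-pooled encoder features.
        lid_logits = self.lid_head(h.mean(dim=1))                      # (B, L)
        gate = torch.softmax(lid_logits, dim=-1)                       # gating weights
        # Mix the expert outputs with the gating weights.
        expert_out = torch.stack([e(h) for e in self.experts], dim=1)  # (B, L, T, D)
        mixed = (gate[:, :, None, None] * expert_out).sum(dim=1)       # (B, T, D)
        return mixed, lid_logits


# Toy usage: the auxiliary LID loss would be cross-entropy on the LID logits.
if __name__ == "__main__":
    block = GatedExpertBlock(d_model=256, n_heads=4, num_languages=2)
    feats = torch.randn(3, 50, 256)  # (batch, frames, feature dim)
    out, lid_logits = block(feats)
    lid_loss = nn.functional.cross_entropy(lid_logits, torch.tensor([0, 1, 0]))
    print(out.shape, lid_loss.item())
```

In this sketch the gate is computed per utterance; a frame-level gate or a curriculum that replaces the soft gate with oracle LID early in training, as the abstract describes, would be straightforward variations.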