Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Dual-Attention Neural Transducers for Efficient Wake Word Spotting in Speech Recognition

Apr 05, 2023

Saumya Y. Sahai, Jing Liu, Thejaswi Muniyappa, Kanthashree M. Sathyendra, Anastasios Alexandridis, Grant P. Strimel, Ross McGowan, Ariya Rastrow, Feng-Ju Chang, Athanasios Mouchtaris(+1 more)

Figure 1 for Dual-Attention Neural Transducers for Efficient Wake Word Spotting in Speech Recognition

Figure 2 for Dual-Attention Neural Transducers for Efficient Wake Word Spotting in Speech Recognition

Figure 3 for Dual-Attention Neural Transducers for Efficient Wake Word Spotting in Speech Recognition

Figure 4 for Dual-Attention Neural Transducers for Efficient Wake Word Spotting in Speech Recognition

Share this with someone who'll enjoy it:

Abstract:We present dual-attention neural biasing, an architecture designed to boost Wake Words (WW) recognition and improve inference time latency on speech recognition tasks. This architecture enables a dynamic switch for its runtime compute paths by exploiting WW spotting to select which branch of its attention networks to execute for an input audio frame. With this approach, we effectively improve WW spotting accuracy while saving runtime compute cost as defined by floating point operations (FLOPs). Using an in-house de-identified dataset, we demonstrate that the proposed dual-attention network can reduce the compute cost by $90\%$ for WW audio frames, with only $1\%$ increase in the number of parameters. This architecture improves WW F1 score by $16\%$ relative and improves generic rare word error rate by $3\%$ relative compared to the baselines.

* Accepted to Proc. IEEE ICASSP 2023

View paper on

Share this with someone who'll enjoy it:

Title:Dual-Attention Neural Transducers for Efficient Wake Word Spotting in Speech Recognition

Paper and Code