Abstract: In this work, we present Slimmable Neural Networks applied to the problem of small-footprint keyword spotting. We show that slimmable neural networks allow us to create super-nets from Convolutional Neural Networks and Transformers, from which sub-networks of different sizes can be extracted. We demonstrate the usefulness of these models on in-house Alexa data and Google Speech Commands, and focus our efforts on the on-device use case, limiting ourselves to models with fewer than 250k parameters. We show that slimmable models can match, and in some cases outperform, models trained from scratch. Slimmable neural networks are therefore a class of models that is particularly useful when the same functionality must be replicated at different memory and compute budgets, with different accuracy requirements.
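To make the super-net idea concrete, here is a minimal sketch of a slimmable layer and its training loop in PyTorch. The width multipliers, initialization, and loss are illustrative assumptions, not the paper's exact implementation (slimmable networks as originally proposed also keep separate batch-norm statistics per width, which is omitted here for brevity):

```python
# Minimal sketch of a slimmable layer (illustrative, not the paper's code).
# A single shared weight tensor is trained so that its leading slices form
# valid sub-networks at several width multipliers.
import torch
import torch.nn as nn
import torch.nn.functional as F

WIDTH_MULTS = [0.25, 0.5, 0.75, 1.0]  # assumed width choices

class SlimmableLinear(nn.Module):
    def __init__(self, max_in, max_out):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(max_out, max_in) * 0.02)
        self.bias = nn.Parameter(torch.zeros(max_out))

    def forward(self, x, width_mult):
        # Slice the leading rows of the shared weight matrix so the same
        # parameters serve every sub-network size.
        out_dim = int(self.weight.shape[0] * width_mult)
        w = self.weight[:out_dim, : x.shape[-1]]
        return F.linear(x, w, self.bias[:out_dim])

# One training step: accumulate gradients from all widths, then update once.
layer = SlimmableLinear(max_in=64, max_out=128)
opt = torch.optim.SGD(layer.parameters(), lr=0.1)
x, target = torch.randn(8, 64), torch.randn(8, 128)
opt.zero_grad()
for m in WIDTH_MULTS:
    out = layer(x, m)
    loss = F.mse_loss(out, target[:, : out.shape[-1]])
    loss.backward()  # gradients from each sub-network accumulate
opt.step()
```

At deployment time, a sub-network of the desired size is extracted simply by slicing the shared weights at the chosen width multiplier; no retraining is needed.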
Abstract: Conversational agents commonly utilize keyword spotting (KWS) to initiate voice interaction with the user. For user experience and privacy considerations, existing approaches to KWS largely focus on accuracy, which can often come at the expense of introduced latency. To address this tradeoff, we propose a novel approach that controls KWS model latency and generalizes to any loss function, without explicit knowledge of the keyword endpoint. Through a single, tunable hyperparameter, our approach enables one to balance detection latency and accuracy for the targeted application. Empirically, we show that our approach gives superior performance under latency constraints compared to existing methods. Namely, we achieve a substantial 25% relative improvement in false accepts at a fixed latency target compared to the state-of-the-art baseline. We also show that when our approach is used in conjunction with a max-pooling loss, we improve relative false accepts by 25% at a fixed latency compared to a cross-entropy loss.
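The abstract does not spell out the loss formulation, so the following is only a hedged illustration of how a single hyperparameter could trade latency against accuracy: a max-pooling-style KWS loss whose per-frame scores decay with frame index, rewarding earlier detections. The function name, `alpha`, and the linear penalty are assumptions for the sketch, not necessarily the paper's method:

```python
# Hedged sketch of a latency-aware max-pooling loss (illustrative only).
# For positive utterances, later frames are down-weighted by a tunable
# factor `alpha`, so the model is rewarded for firing early; alpha trades
# detection latency against accuracy.
import torch
import torch.nn.functional as F

def latency_maxpool_loss(logits, is_positive, alpha=0.01):
    """logits: (batch, time) per-frame keyword logits;
    is_positive: (batch,) bool mask of keyword utterances."""
    B, T = logits.shape
    t = torch.arange(T, dtype=logits.dtype)
    penalized = logits - alpha * t           # later frames score lower
    pos_score, _ = penalized.max(dim=1)      # max-pool over time (positives)
    neg_score, _ = logits.max(dim=1)         # negatives: suppress the peak frame
    loss_pos = F.binary_cross_entropy_with_logits(
        pos_score, torch.ones(B), reduction="none")
    loss_neg = F.binary_cross_entropy_with_logits(
        neg_score, torch.zeros(B), reduction="none")
    return torch.where(is_positive, loss_pos, loss_neg).mean()

# Example usage with random frame logits.
logits = torch.randn(4, 100)
is_positive = torch.tensor([True, True, False, False])
print(latency_maxpool_loss(logits, is_positive, alpha=0.02))
```

With alpha set to zero this reduces to a plain max-pooling loss; increasing alpha shifts the chosen peak earlier in the utterance at some cost in accuracy, which mirrors the single-hyperparameter tradeoff the abstract describes.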
Abstract: In this work, we propose Tiny-CRNN (Tiny Convolutional Recurrent Neural Network) models applied to the problem of wakeword detection, and augment them with scaled dot-product attention. We find that, compared to Convolutional Neural Network models, False Accepts at a 250k parameter budget can be reduced by 25% with a 10% reduction in parameter size by using models based on the Tiny-CRNN architecture, and that we can achieve up to a 32% reduction in False Accepts at a 50k parameter budget with a 75% reduction in parameter size compared to word-level Dense Neural Network models. We discuss solutions to the challenging problem of performing inference on streaming audio with this architecture, as well as differences in start-end index errors and latency in comparison to CNN, DNN, and DNN-HMM models.
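A hedged sketch of what a Tiny-CRNN with scaled dot-product attention might look like follows. All layer sizes, the learned-query attention, and the GRU choice are illustrative assumptions; the paper's exact architecture may differ:

```python
# Illustrative Tiny-CRNN-style model: conv front end -> GRU -> scaled
# dot-product attention over time -> keyword logit. Sizes are assumptions.
import math
import torch
import torch.nn as nn

class TinyCRNN(nn.Module):
    def __init__(self, n_mels=64, conv_ch=16, rnn_dim=64):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(1, conv_ch, kernel_size=(5, 5), stride=(2, 2)),
            nn.ReLU(),
        )
        freq_out = (n_mels - 5) // 2 + 1
        self.rnn = nn.GRU(conv_ch * freq_out, rnn_dim,
                          batch_first=True, bidirectional=True)
        self.query = nn.Parameter(torch.randn(2 * rnn_dim))  # learned query
        self.fc = nn.Linear(2 * rnn_dim, 1)

    def forward(self, x):                      # x: (batch, 1, n_mels, time)
        h = self.conv(x)                       # (B, C, F', T')
        B, C, Fd, T = h.shape
        h = h.permute(0, 3, 1, 2).reshape(B, T, C * Fd)
        h, _ = self.rnn(h)                     # (B, T, 2*rnn_dim)
        # Scaled dot-product attention with a single learned query vector.
        scores = h @ self.query / math.sqrt(h.shape[-1])   # (B, T)
        w = torch.softmax(scores, dim=1).unsqueeze(-1)
        ctx = (w * h).sum(dim=1)               # attention-pooled context
        return self.fc(ctx).squeeze(-1)        # keyword logit

model = TinyCRNN()
print(sum(p.numel() for p in model.parameters()))  # check parameter budget
print(model(torch.randn(2, 1, 64, 100)).shape)     # -> torch.Size([2])
```

The bidirectional recurrence and full-utterance attention pooling here also illustrate why streaming inference is nontrivial for this family of models, as the abstract notes: the attention context depends on frames that have not yet arrived.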
Abstract: Popular media reflects and reinforces societal biases through the use of tropes, which are narrative elements, such as archetypal characters and plot arcs, that occur frequently across media. In this paper, we specifically investigate gender bias within a large collection of tropes. To enable our study, we crawl tvtropes.org, an online user-created repository that contains 30K tropes associated with 1.9M examples of their occurrences across film, television, and literature. We automatically score the "genderedness" of each trope in our TVTROPES dataset, which enables an analysis of (1) highly-gendered topics within tropes, (2) the relationship between gender bias and popular reception, and (3) how the gender of a work's creator correlates with the types of tropes that they use.
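The paper's genderedness scoring is not reproduced here; as a simple illustrative proxy (an assumption, not the authors' method), one could compute a smoothed log-odds of gendered-pronoun counts over a trope's example texts:

```python
# Illustrative proxy for a trope "genderedness" score: smoothed log-odds of
# gendered pronoun counts over the trope's example texts. The paper's actual
# scoring method may differ.
import math
import re

MALE = {"he", "him", "his", "himself"}
FEMALE = {"she", "her", "hers", "herself"}

def genderedness(texts, smoothing=1.0):
    """Positive -> skews male, negative -> skews female (by this convention)."""
    m = f = 0
    for text in texts:
        for tok in re.findall(r"[a-z']+", text.lower()):
            if tok in MALE:
                m += 1
            elif tok in FEMALE:
                f += 1
    return math.log((m + smoothing) / (f + smoothing))

print(genderedness(["He rescued her.", "She defeated him and his army."]))
```

A continuous score of this kind makes the three analyses in the abstract straightforward: tropes can be ranked by score, correlated with reception metrics, and grouped by creator gender.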