Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Joseph Dureau

Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces

Nov 05, 2018

Alice Coucke, Alaa Saade, Adrien Ball, Théodore Bluche, Alexandre Caulier, David Leroy, Clément Doumouro, Thibault Gisselbrecht, Francesco Caltagirone, Thibaut Lavril(+2 more)

Figure 1 for Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces

Figure 2 for Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces

Figure 3 for Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces

Figure 4 for Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces

Abstract:This paper presents the machine learning architecture of the Snips Voice Platform, a software solution to perform Spoken Language Understanding on microprocessors typical of IoT devices. The embedded inference is fast and accurate while enforcing privacy by design, as no personal user data is ever collected. Focusing on Automatic Speech Recognition and Natural Language Understanding, we detail our approach to training high-performance Machine Learning models that are small enough to run in real-time on small devices. Additionally, we describe a data generation procedure that provides sufficient, high-quality training data without compromising user privacy.

Via

Access Paper or Ask Questions

Federated Learning for Keyword Spotting

Oct 31, 2018

David Leroy, Alice Coucke, Thibaut Lavril, Thibault Gisselbrecht, Joseph Dureau

Figure 1 for Federated Learning for Keyword Spotting

Figure 2 for Federated Learning for Keyword Spotting

Figure 3 for Federated Learning for Keyword Spotting

Abstract:We propose a practical approach based on federated learning to solve out-of-domain issues with continuously running embedded speech-based models such as wake word detectors. We conduct an extensive empirical study of the federated averaging algorithm for the "Hey Snips" wake word based on a crowdsourced dataset that mimics a federation of wake word users. We empirically demonstrate that using an adaptive averaging strategy inspired from Adam in place of standard weighted model averaging highly reduces the number of communication rounds required to reach our target performance. The associated upstream communication costs per user are estimated at 8 MB, which is a reasonable in the context of smart home voice assistants. Additionally, the dataset used for these experiments is being open sourced with the aim of fostering further transparent research in the application of federated learning to speech data.

Via

Access Paper or Ask Questions

Spoken Language Understanding on the Edge

Oct 30, 2018

Alaa Saade, Alice Coucke, Alexandre Caulier, Joseph Dureau, Adrien Ball, Théodore Bluche, David Leroy, Clément Doumouro, Thibault Gisselbrecht, Francesco Caltagirone(+2 more)

Figure 1 for Spoken Language Understanding on the Edge

Figure 2 for Spoken Language Understanding on the Edge

Figure 3 for Spoken Language Understanding on the Edge

Figure 4 for Spoken Language Understanding on the Edge

Abstract:We consider the problem of performing Spoken Language Understanding (SLU) on small devices typical of IoT applications. Our contributions are twofold. First, we outline the design of an embedded, private-by-design SLU system and show that it has performance on par with cloud-based commercial solutions. Second, we release the datasets used in our experiments in the interest of reproducibility and in the hope that they can prove useful to the SLU community.

Via

Access Paper or Ask Questions

Learning Operations on a Stack with Neural Turing Machines

Dec 02, 2016

Tristan Deleu, Joseph Dureau

Figure 1 for Learning Operations on a Stack with Neural Turing Machines

Figure 2 for Learning Operations on a Stack with Neural Turing Machines

Figure 3 for Learning Operations on a Stack with Neural Turing Machines

Figure 4 for Learning Operations on a Stack with Neural Turing Machines

Abstract:Multiple extensions of Recurrent Neural Networks (RNNs) have been proposed recently to address the difficulty of storing information over long time periods. In this paper, we experiment with the capacity of Neural Turing Machines (NTMs) to deal with these long-term dependencies on well-balanced strings of parentheses. We show that not only does the NTM emulate a stack with its heads and learn an algorithm to recognize such words, but it is also capable of strongly generalizing to much longer sequences.

* 1st Workshop on Neural Abstract Machines & Program Induction (NAMPI), NIPS 2016, Barcelona, Spain

Via

Access Paper or Ask Questions