Showing 1–8 of 8 results for author: Gisselbrecht, T

  1. arXiv:2006.02774  [pdf, other]

    cs.SD eess.AS

    A study on more realistic room simulation for far-field keyword spotting

    Authors: Eric Bezzam, Robin Scheibler, Cyril Cadoux, Thibault Gisselbrecht

    Abstract: We investigate the impact of more realistic room simulation for training far-field keyword spotting systems without fine-tuning on in-domain data. To this end, we study the impact of incorporating the following factors in the room impulse response (RIR) generation: air absorption, surface- and frequency-dependent coefficients of real materials, and stochastic ray tracing. Through an ablation study…

    Submitted 18 November, 2020; v1 submitted 4 June, 2020; originally announced June 2020.

    Comments: 7 pages, 4 figures, accepted at APSIPA 2020, room impulse response generation code can be found at https://github.com/ebezzam/room-simulation

  2. arXiv:2002.10851  [pdf, other]

    cs.CL

    Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM Networks

    Authors: Théodore Bluche, Maël Primet, Thibault Gisselbrecht

    Abstract: We explore a keyword-based spoken language understanding system, in which the intent of the user can directly be derived from the detection of a sequence of keywords in the query. In this paper, we focus on an open-vocabulary keyword spotting method, allowing the user to define their own keywords without having to retrain the whole model. We describe the different design choices leading to a fast…

    Submitted 25 February, 2020; originally announced February 2020.

  3. arXiv:1912.07575  [pdf, other]

    cs.CL cs.LG

    Predicting detection filters for small footprint open-vocabulary keyword spotting

    Authors: Theodore Bluche, Thibault Gisselbrecht

    Abstract: In this paper, we propose a fully-neural approach to open-vocabulary keyword spotting, that allows the users to include a customizable voice interface to their device and that does not require task-specific data. We present a keyword detection neural network weighing less than 250KB, in which the topmost layer performing keyword detection is predicted by an auxiliary network, that may be run offli…

    Submitted 29 September, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

    Comments: Submitted to Interspeech 2020

  4. arXiv:1811.07684  [pdf, other]

    cs.LG cs.CL cs.SD eess.AS stat.ML

    Efficient keyword spotting using dilated convolutions and gating

    Authors: Alice Coucke, Mohammed Chlieh, Thibault Gisselbrecht, David Leroy, Mathieu Poumeyrol, Thibaut Lavril

    Abstract: We explore the application of end-to-end stateless temporal modeling to small-footprint keyword spotting as opposed to recurrent networks that model long-term temporal dependencies using internal states. We propose a model inspired by the recent success of dilated convolutions in sequence modeling applications, allowing to train deeper architectures in resource-constrained configurations. Gated ac…

    Submitted 18 February, 2019; v1 submitted 19 November, 2018; originally announced November 2018.

    Comments: Accepted for publication to ICASSP 2019

  5. arXiv:1810.12735  [pdf, ps, other]

    cs.CL cs.LG cs.SD eess.AS

    Spoken Language Understanding on the Edge

    Authors: Alaa Saade, Alice Coucke, Alexandre Caulier, Joseph Dureau, Adrien Ball, Théodore Bluche, David Leroy, Clément Doumouro, Thibault Gisselbrecht, Francesco Caltagirone, Thibaut Lavril, Maël Primet

    Abstract: We consider the problem of performing Spoken Language Understanding (SLU) on small devices typical of IoT applications. Our contributions are twofold. First, we outline the design of an embedded, private-by-design SLU system and show that it has performance on par with cloud-based commercial solutions. Second, we release the datasets used in our experiments in the interest of reproducibility and i…

    Submitted 2 October, 2019; v1 submitted 30 October, 2018; originally announced October 2018.

    Comments: arXiv admin note: text overlap with arXiv:1805.10190

  6. arXiv:1810.05512  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD stat.ML

    Federated Learning for Keyword Spotting

    Authors: David Leroy, Alice Coucke, Thibaut Lavril, Thibault Gisselbrecht, Joseph Dureau

    Abstract: We propose a practical approach based on federated learning to solve out-of-domain issues with continuously running embedded speech-based models such as wake word detectors. We conduct an extensive empirical study of the federated averaging algorithm for the "Hey Snips" wake word based on a crowdsourced dataset that mimics a federation of wake word users. We empirically demonstrate that using an a…

    Submitted 18 February, 2019; v1 submitted 9 October, 2018; originally announced October 2018.

    Comments: Accepted for publication to ICASSP 2019

  7. arXiv:1808.10725  [pdf, other]

    cs.SI

    Bandit algorithms for real-time data capture on large social medias

    Authors: Thibault Gisselbrecht

    Abstract: We study the problem of real time data capture on social media. Due to the different limitations imposed by those media, but also to the very large amount of information, it is impossible to collect all the data produced by social networks such as Twitter. Therefore, to be able to gather enough relevant information related to a predefined need, it is necessary to focus on a subset of the informati…

    Submitted 28 August, 2018; originally announced August 2018.

    Comments: in French

  8. arXiv:1805.10190  [pdf, other]

    cs.CL cs.NE

    Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces

    Authors: Alice Coucke, Alaa Saade, Adrien Ball, Théodore Bluche, Alexandre Caulier, David Leroy, Clément Doumouro, Thibault Gisselbrecht, Francesco Caltagirone, Thibaut Lavril, Maël Primet, Joseph Dureau

    Abstract: This paper presents the machine learning architecture of the Snips Voice Platform, a software solution to perform Spoken Language Understanding on microprocessors typical of IoT devices. The embedded inference is fast and accurate while enforcing privacy by design, as no personal user data is ever collected. Focusing on Automatic Speech Recognition and Natural Language Understanding, we detail our…

    Submitted 6 December, 2018; v1 submitted 25 May, 2018; originally announced May 2018.

    Comments: 29 pages, 9 figures, 17 tables