Showing 1–11 of 11 results for author: Coucke, A

  1. arXiv:2405.19342  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants

    Authors: Chloé Sekkat, Fanny Leroy, Salima Mdhaffar, Blake Perry Smith, Yannick Estève, Joseph Dureau, Alice Coucke

    Abstract: Recent works demonstrate that voice assistants do not perform equally well for everyone, but research on demographic robustness of speech technologies is still scarce. This is mainly due to the rarity of large datasets with controlled demographic tags. This paper introduces the Sonos Voice Control Bias Assessment Dataset, an open dataset composed of voice assistant requests for North American Engl…

    Submitted 14 May, 2024; originally announced May 2024.

  2. arXiv:2011.02143  [pdf, other]

    cs.CL cs.AI cs.LG

    Conditioned Text Generation with Transfer for Closed-Domain Dialogue Systems

    Authors: Stéphane d'Ascoli, Alice Coucke, Francesco Caltagirone, Alexandre Caulier, Marc Lelarge

    Abstract: Scarcity of training data for task-oriented dialogue systems is a well known problem that is usually tackled with costly and time-consuming manual data annotation. An alternative solution is to rely on automatic text generation which, although less accurate than human supervision, has the advantage of being cheap and fast. Our contribution is twofold. First we show how to optimally train and contr…

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1911.03698

  3. arXiv:2011.01709  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    Small footprint Text-Independent Speaker Verification for Embedded Systems

    Authors: Julien Balian, Raffaele Tavarone, Mathieu Poumeyrol, Alice Coucke

    Abstract: Deep neural network approaches to speaker verification have proven successful, but typical computational requirements of State-Of-The-Art (SOTA) systems make them unsuited for embedded applications. In this work, we present a two-stage model architecture orders of magnitude smaller than common solutions (237.5K learning parameters, 11.5MFLOPS) reaching a competitive result of 3.31% Equal Error Rat…

    Submitted 21 April, 2021; v1 submitted 3 November, 2020; originally announced November 2020.

    Journal ref: Acoustics, Speech and Signal Processing (ICASSP), 2021 IEEE International Conference

  4. arXiv:1911.03698  [pdf, other]

    cs.CL cs.LG stat.ML

    Conditioned Query Generation for Task-Oriented Dialogue Systems

    Authors: Stéphane d'Ascoli, Alice Coucke, Francesco Caltagirone, Alexandre Caulier, Marc Lelarge

    Abstract: Scarcity of training data for task-oriented dialogue systems is a well known problem that is usually tackled with costly and time-consuming manual data annotation. An alternative solution is to rely on automatic text generation which, although less accurate than human supervision, has the advantage of being cheap and fast. In this paper we propose a novel controlled data generation method that cou…

    Submitted 9 November, 2019; originally announced November 2019.

  5. Inference of compressed Potts graphical models

    Authors: Francesca Rizzato, Alice Coucke, Eleonora de Leonardis, J. P. Barton, Jérôme Tubiana, Remi Monasson, Simona Cocco

    Abstract: We consider the problem of inferring a graphical Potts model on a population of variables, with a non-uniform number of Potts colors (symbols) across variables. This inverse Potts problem generally involves the inference of a large number of parameters, often larger than the number of available data, and, hence, requires the introduction of regularization. We study here a double regularization sch…

    Submitted 3 January, 2020; v1 submitted 30 July, 2019; originally announced July 2019.

    Journal ref: Phys. Rev. E 101, 012309 (2020)

  6. arXiv:1811.07684  [pdf, other]

    cs.LG cs.CL cs.SD eess.AS stat.ML

    Efficient keyword spotting using dilated convolutions and gating

    Authors: Alice Coucke, Mohammed Chlieh, Thibault Gisselbrecht, David Leroy, Mathieu Poumeyrol, Thibaut Lavril

    Abstract: We explore the application of end-to-end stateless temporal modeling to small-footprint keyword spotting as opposed to recurrent networks that model long-term temporal dependencies using internal states. We propose a model inspired by the recent success of dilated convolutions in sequence modeling applications, allowing to train deeper architectures in resource-constrained configurations. Gated ac…

    Submitted 18 February, 2019; v1 submitted 19 November, 2018; originally announced November 2018.

    Comments: Accepted for publication to ICASSP 2019

  7. arXiv:1810.12735  [pdf, ps, other]

    cs.CL cs.LG cs.SD eess.AS

    Spoken Language Understanding on the Edge

    Authors: Alaa Saade, Alice Coucke, Alexandre Caulier, Joseph Dureau, Adrien Ball, Théodore Bluche, David Leroy, Clément Doumouro, Thibault Gisselbrecht, Francesco Caltagirone, Thibaut Lavril, Maël Primet

    Abstract: We consider the problem of performing Spoken Language Understanding (SLU) on small devices typical of IoT applications. Our contributions are twofold. First, we outline the design of an embedded, private-by-design SLU system and show that it has performance on par with cloud-based commercial solutions. Second, we release the datasets used in our experiments in the interest of reproducibility and i…

    Submitted 2 October, 2019; v1 submitted 30 October, 2018; originally announced October 2018.

    Comments: arXiv admin note: text overlap with arXiv:1805.10190

  8. arXiv:1810.05512  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD stat.ML

    Federated Learning for Keyword Spotting

    Authors: David Leroy, Alice Coucke, Thibaut Lavril, Thibault Gisselbrecht, Joseph Dureau

    Abstract: We propose a practical approach based on federated learning to solve out-of-domain issues with continuously running embedded speech-based models such as wake word detectors. We conduct an extensive empirical study of the federated averaging algorithm for the "Hey Snips" wake word based on a crowdsourced dataset that mimics a federation of wake word users. We empirically demonstrate that using an a…

    Submitted 18 February, 2019; v1 submitted 9 October, 2018; originally announced October 2018.

    Comments: Accepted for publication to ICASSP 2019

  9. arXiv:1805.10190  [pdf, other]

    cs.CL cs.NE

    Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces

    Authors: Alice Coucke, Alaa Saade, Adrien Ball, Théodore Bluche, Alexandre Caulier, David Leroy, Clément Doumouro, Thibault Gisselbrecht, Francesco Caltagirone, Thibaut Lavril, Maël Primet, Joseph Dureau

    Abstract: This paper presents the machine learning architecture of the Snips Voice Platform, a software solution to perform Spoken Language Understanding on microprocessors typical of IoT devices. The embedded inference is fast and accurate while enforcing privacy by design, as no personal user data is ever collected. Focusing on Automatic Speech Recognition and Natural Language Understanding, we detail our…

    Submitted 6 December, 2018; v1 submitted 25 May, 2018; originally announced May 2018.

    Comments: 29 pages, 9 figures, 17 tables

  10. arXiv:1803.09533  [pdf, other]

    cs.CY cs.LG stat.ML

    Deep Representation for Patient Visits from Electronic Health Records

    Authors: Jean-Baptiste Escudié, Alaa Saade, Alice Coucke, Marc Lelarge

    Abstract: We show how to learn low-dimensional representations (embeddings) of patient visits from the corresponding electronic health record (EHR) where International Classification of Diseases (ICD) diagnosis codes are removed. We expect that these embeddings will be useful for the construction of predictive statistical models anticipated to drive personalized medicine and improve healthcare quality. Thes…

    Submitted 26 March, 2018; originally announced March 2018.

  11. arXiv:1512.05896  [pdf, other]

    physics.bio-ph q-bio.CB

    An interplay of migratory and division forces as a generic mechanism for stem cell patterns

    Authors: Edouard Hannezo, Alice Coucke, Jean-François Joanny

    Abstract: In many adult tissues, stem cells and differentiated cells are not homogeneously distributed: stem cells are arranged in periodic "niches", and differentiated cells are constantly produced and migrate out of these niches. In this article, we provide a general theoretical framework to study mixtures of dividing and actively migrating particles, which we apply to biological tissues. We show in part…

    Submitted 18 December, 2015; originally announced December 2015.

    Journal ref: Physical Review E 93, 022405 (2016)