Skip to main content

Showing 1–7 of 7 results for author: Bluche, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2002.10851  [pdf, other

    cs.CL

    Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM Networks

    Authors: Théodore Bluche, Maël Primet, Thibault Gisselbrecht

    Abstract: We explore a keyword-based spoken language understanding system, in which the intent of the user can directly be derived from the detection of a sequence of keywords in the query. In this paper, we focus on an open-vocabulary keyword spotting method, allowing the user to define their own keywords without having to retrain the whole model. We describe the different design choices leading to a fast… ▽ More

    Submitted 25 February, 2020; originally announced February 2020.

  2. arXiv:1912.07575  [pdf, other

    cs.CL cs.LG

    Predicting detection filters for small footprint open-vocabulary keyword spotting

    Authors: Theodore Bluche, Thibault Gisselbrecht

    Abstract: In this paper, we propose a fully-neural approach to open-vocabulary keyword spotting, that allows the users to include a customizable voice interface to their device and that does not require task-specific data. We present a keyword detection neural network weighing less than 250KB, in which the topmost layer performing keyword detection is predicted by an auxiliary network, that may be run offli… ▽ More

    Submitted 29 September, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

    Comments: Submtted to Interspeech 2020

  3. arXiv:1810.12735  [pdf, ps, other

    cs.CL cs.LG cs.SD eess.AS

    Spoken Language Understanding on the Edge

    Authors: Alaa Saade, Alice Coucke, Alexandre Caulier, Joseph Dureau, Adrien Ball, Théodore Bluche, David Leroy, Clément Doumouro, Thibault Gisselbrecht, Francesco Caltagirone, Thibaut Lavril, Maël Primet

    Abstract: We consider the problem of performing Spoken Language Understanding (SLU) on small devices typical of IoT applications. Our contributions are twofold. First, we outline the design of an embedded, private-by-design SLU system and show that it has performance on par with cloud-based commercial solutions. Second, we release the datasets used in our experiments in the interest of reproducibility and i… ▽ More

    Submitted 2 October, 2019; v1 submitted 30 October, 2018; originally announced October 2018.

    Comments: arXiv admin note: text overlap with arXiv:1805.10190

  4. arXiv:1805.10190  [pdf, other

    cs.CL cs.NE

    Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces

    Authors: Alice Coucke, Alaa Saade, Adrien Ball, Théodore Bluche, Alexandre Caulier, David Leroy, Clément Doumouro, Thibault Gisselbrecht, Francesco Caltagirone, Thibaut Lavril, Maël Primet, Joseph Dureau

    Abstract: This paper presents the machine learning architecture of the Snips Voice Platform, a software solution to perform Spoken Language Understanding on microprocessors typical of IoT devices. The embedded inference is fast and accurate while enforcing privacy by design, as no personal user data is ever collected. Focusing on Automatic Speech Recognition and Natural Language Understanding, we detail our… ▽ More

    Submitted 6 December, 2018; v1 submitted 25 May, 2018; originally announced May 2018.

    Comments: 29 pages, 9 figures, 17 tables

  5. arXiv:1604.08352  [pdf, other

    cs.CV cs.LG cs.NE

    Joint Line Segmentation and Transcription for End-to-End Handwritten Paragraph Recognition

    Authors: Théodore Bluche

    Abstract: Offline handwriting recognition systems require cropped text line images for both training and recognition. On the one hand, the annotation of position and transcript at line level is costly to obtain. On the other hand, automatic line segmentation algorithms are prone to errors, compromising the subsequent recognition. In this paper, we propose a modification of the popular and efficient multi-di… ▽ More

    Submitted 28 April, 2016; originally announced April 2016.

  6. arXiv:1604.03286  [pdf, other

    cs.CV

    Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention

    Authors: Théodore Bluche, Jérôme Louradour, Ronaldo Messina

    Abstract: We present an attention-based model for end-to-end handwriting recognition. Our system does not require any segmentation of the input paragraph. The model is inspired by the differentiable attention models presented recently for speech recognition, image captioning or translation. The main difference is the covert and overt attention, implemented as a multi-dimensional LSTM network. Our principal… ▽ More

    Submitted 23 August, 2016; v1 submitted 12 April, 2016; originally announced April 2016.

  7. arXiv:1312.4569  [pdf, other

    cs.CV cs.LG cs.NE

    Dropout improves Recurrent Neural Networks for Handwriting Recognition

    Authors: Vu Pham, Théodore Bluche, Christopher Kermorvant, Jérôme Louradour

    Abstract: Recurrent neural networks (RNNs) with Long Short-Term memory cells currently hold the best known results in unconstrained handwriting recognition. We show that their performance can be greatly improved using dropout - a recently proposed regularization method for deep architectures. While previous works showed that dropout gave superior performance in the context of convolutional networks, it had… ▽ More

    Submitted 10 March, 2014; v1 submitted 5 November, 2013; originally announced December 2013.