Showing 1–4 of 4 results for author: Lopez, F

  1. arXiv:2310.11379

    cs.SD cs.CL eess.AS

    Robust Wake-Up Word Detection by Two-stage Multi-resolution Ensembles

    Authors: Fernando López, Jordi Luque, Carlos Segura, Pablo Gómez

    Abstract: Voice-based interfaces rely on a wake-up word mechanism to initiate communication with devices. However, achieving a robust, energy-efficient, and fast detection remains a challenge. This paper addresses these real production needs by enhancing data with temporal alignments and using detection based on two phases with multi-resolution. It employs two models: a lightweight on-device model for real-…

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: 5 pages, 3 figures

  2. arXiv:2210.15226

    cs.CL cs.SD eess.AS

    Iterative pseudo-forced alignment by acoustic CTC loss for self-supervised ASR domain adaptation

    Authors: Fernando López, Jordi Luque

    Abstract: High-quality data labeling from specific domains is costly and human time-consuming. In this work, we propose a self-supervised domain adaptation method, based upon an iterative pseudo-forced alignment algorithm. The produced alignments are employed to customize an end-to-end Automatic Speech Recognition (ASR) and iteratively refined. The algorithm is fed with frame-wise character posteriors produ…

    Submitted 15 January, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: 5 pages, 4 figures, IberSPEECH2022

  3. Assessing deep learning methods for the identification of kidney stones in endoscopic images

    Authors: Francisco Lopez, Andres Varela, Oscar Hinojosa, Mauricio Mendez, Dinh-Hoan Trinh, Jonathan ElBeze, Jacques Hubert, Vincent Estrade, Miguel Gonzalez, Gilberto Ochoa, Christian Daul

    Abstract: Knowing the type (i.e., the biochemical composition) of kidney stones is crucial to prevent relapses with an appropriate treatment. During ureteroscopies, kidney stones are fragmented, extracted from the urinary tract, and their composition is determined using a morpho-constitutional analysis. This procedure is time consuming (the morpho-constitutional analysis results are only available after som…

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: This paper is currently under review for the IEEE Engineering in Medicine and Biology Conference (EMBC 2021)

  4. arXiv:2101.12732

    eess.AS cs.CL

    Speech Enhancement for Wake-Up-Word detection in Voice Assistants

    Authors: David Bonet, Guillermo Cámbara, Fernando López, Pablo Gómez, Carlos Segura, Jordi Luque

    Abstract: Keyword spotting and in particular Wake-Up-Word (WUW) detection is a very important task for voice assistants. A very common issue of voice assistants is that they get easily activated by background noise like music, TV or background speech that accidentally triggers the device. In this paper, we propose a Speech Enhancement (SE) model adapted to the task of WUW detection that aims at increasing t…

    Submitted 29 January, 2021; originally announced January 2021.

    Comments: keyword spotting, speech enhancement, wake-up-word, deep learning, convolutional neural network