Showing 1–9 of 9 results for author: Wuth, J

  1. arXiv:2110.14594  [pdf]

    eess.SP cs.LG physics.geo-ph

    End-to-end LSTM based estimation of volcano event epicenter localization

    Authors: Nestor Becerra Yoma, Jorge Wuth, Andres Pinto, Nicolas de Celis, Jorge Celis, Fernando Huenupan

    Abstract: In this paper, an end-to-end based LSTM scheme is proposed to address the problem of volcano event localization without any a priori model relating phase picking with localization estimation. It is worth emphasizing that automatic phase picking in volcano signals is highly inaccurate because of the short distances between the event epicenters and the seismograph stations. LSTM was chosen due to it…

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: 16 pages, 7 figures

  2. arXiv:2009.02832  [pdf]

    eess.AS cs.SD

    Non causal deep learning based dereverberation

    Authors: Jorge Wuth, Richard M. Stern, Nestor Becerra Yoma

    Abstract: In this paper we demonstrate the effectiveness of non-causal context for mitigating the effects of reverberation in deep-learning-based automatic speech recognition (ASR) systems. First, the value of non-causal context using a non-causal FIR filter is shown by comparing the contributions of previous vs. future information. Second, MLP- and LSTM-based dereverberation networks were trained to confir…

    Submitted 6 September, 2020; originally announced September 2020.

    Comments: 33 pages

  3. arXiv:1906.07299  [pdf]

    eess.AS cs.SD

    On combining features for single-channel robust speech recognition in reverberant environments

    Authors: José Novoa, Josué Fredes, Jorge Wuth, Fernando Huenupán, Richard M. Stern, Nestor Becerra Yoma

    Abstract: This paper addresses the combination of complementary parallel speech recognition systems to reduce the error rate of speech recognition systems operating in real highly-reverberant environments. First, the testing environment consists of recordings of speech in a calibrated real room with reverberation times from 0.47 to 1.77 seconds and speaker-to-microphone distances of 0.16 to 2.56 meters. We…

    Submitted 17 June, 2019; originally announced June 2019.

  4. arXiv:1906.07298  [pdf]

    eess.AS cs.SD eess.IV

    Weighted delay-and-sum beamforming guided by visual tracking for human-robot interaction

    Authors: José Novoa, Rodrigo Mahu, Alejandro Díaz, Jorge Wuth, Richard Stern, Nestor Becerra Yoma

    Abstract: This paper describes the integration of weighted delay-and-sum beamforming with speech source localization using image processing and robot head visual servoing for source tracking. We take into consideration the fact that the directivity gain provided by the beamforming depends on the angular distance between its main lobe and the main response axis of the microphone array. A visual servoing sche…

    Submitted 17 June, 2019; originally announced June 2019.

  5. arXiv:1803.09016  [pdf]

    eess.AS cs.SD

    An improved DNN-based spectral feature mapping that removes noise and reverberation for robust automatic speech recognition

    Authors: Juan Pablo Escudero, José Novoa, Rodrigo Mahu, Jorge Wuth, Fernando Huenupán, Richard Stern, Néstor Becerra Yoma

    Abstract: Reverberation and additive noise have detrimental effects on the performance of automatic speech recognition systems. In this paper we explore the ability of a DNN-based spectral feature mapping to remove the effects of reverberation and additive noise. Experiments with the CHiME-2 database show that this DNN can achieve an average reduction in WER of 4.5%, when compared to the baseline system, at…

    Submitted 3 April, 2018; v1 submitted 23 March, 2018; originally announced March 2018.

    Comments: 5 pages

  6. arXiv:1803.09013  [pdf]

    eess.AS cs.SD

    Exploring the robustness of features and enhancement on speech recognition systems in highly-reverberant real environments

    Authors: José Novoa, Juan Pablo Escudero, Jorge Wuth, Victor Poblete, Simon King, Richard Stern, Néstor Becerra Yoma

    Abstract: This paper evaluates the robustness of a DNN-HMM-based speech recognition system in highly-reverberant real environments using the HRRE database. The performance of locally-normalized filter bank (LNFB) and Mel filter bank (MelFB) features in combination with Non-negative Matrix Factorization (NMF), Suppression of Slowly-varying components and the Falling edge (SSF) and Weighted Prediction Error (…

    Submitted 23 March, 2018; originally announced March 2018.

    Comments: 5 pages

  7. arXiv:1801.09651  [pdf]

    eess.AS cs.SD

    Highly-Reverberant Real Environment database: HRRE

    Authors: Juan Pablo Escudero, Victor Poblete, José Novoa, Jorge Wuth, Josué Fredes, Rodrigo Mahu, Richard Stern, Néstor Becerra Yoma

    Abstract: Speech recognition in highly-reverberant real environments remains a major challenge. An evaluation dataset for this task is needed. This report describes the generation of the Highly-Reverberant Real Environment database (HRRE). This database contains 13.4 hours of data recorded in real reverberant environments and consists of 20 different testing conditions which consider a wide range of reverbe…

    Submitted 23 March, 2018; v1 submitted 29 January, 2018; originally announced January 2018.

    Comments: 5 pages

  8. arXiv:1801.00061  [pdf]

    cs.HC cs.RO

    Multichannel Robot Speech Recognition Database: MChRSR

    Authors: José Novoa, Juan Pablo Escudero, Josué Fredes, Jorge Wuth, Rodrigo Mahu, Néstor Becerra Yoma

    Abstract: In real human robot interaction (HRI) scenarios, speech recognition represents a major challenge due to robot noise, background noise and time-varying acoustic channel. This document describes the procedure used to obtain the Multichannel Robot Speech Recognition Database (MChRSR). It is composed of 12 hours of multichannel evaluation data recorded in a real mobile HRI scenario. This database was…

    Submitted 29 December, 2017; originally announced January 2018.

  9. arXiv:1403.7646  [pdf, ps, other]

    astro-ph.EP astro-ph.IM

    Improved signal detection algorithms for unevenly sampled data. Six signals in the radial velocity data for GJ876

    Authors: James S. Jenkins, Nestor Becerra Yoma, Patricio Rojo, Rodrigo Mahu, Jorge Wuth

    Abstract: The hunt for Earth analogue planets orbiting Sun-like stars has forced the introduction of novel methods to detect signals at, or below, the level of the intrinsic noise of the observations. We present a new global periodogram method that returns more information than the classic Lomb-Scargle periodogram method for radial velocity signal detection. Our method uses the Minimum Mean Squared Error as…

    Submitted 3 April, 2014; v1 submitted 29 March, 2014; originally announced March 2014.

    Comments: 30 pages, 9 Figures, 2 Tables, Accepted for publication in MNRAS