Skip to main content

Showing 1–2 of 2 results for author: Masztalski, P

.
  1. arXiv:2008.07244  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Efficient Low-Latency Speech Enhancement with Mobile Audio Streaming Networks

    Authors: Michał Romaniuk, Piotr Masztalski, Karol Piaskowski, Mateusz Matuszewski

    Abstract: We propose Mobile Audio Streaming Networks (MASnet) for efficient low-latency speech enhancement, which is particularly suitable for mobile devices and other applications where computational capacity is a limitation. MASnet processes linear-scale spectrograms, transforming successive noisy frames into complex-valued ratio masks which are then applied to the respective noisy frames. MASnet can oper… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

    Comments: Accepted for INTERSPEECH 2020

  2. arXiv:2008.07231  [pdf, other

    eess.AS cs.LG cs.SD

    StoRIR: Stochastic Room Impulse Response Generation for Audio Data Augmentation

    Authors: Piotr Masztalski, Mateusz Matuszewski, Karol Piaskowski, Michał Romaniuk

    Abstract: In this paper we introduce StoRIR - a stochastic room impulse response generation method dedicated to audio data augmentation in machine learning applications. This technique, in contrary to geometrical methods like image-source or ray tracing, does not require prior definition of room geometry, absorption coefficients or microphone and source placement and is dependent solely on the acoustic para… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

    Comments: Accepted for INTERSPEECH 2020