Skip to main content

Showing 1–8 of 8 results for author: Pariente, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2402.01413  [pdf, other

    cs.SD cs.LG eess.AS

    Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge

    Authors: Simon Leglaive, Matthieu Fraticelli, Hend ElGhazaly, Léonie Borne, Mostafa Sadeghi, Scott Wisdom, Manuel Pariente, John R. Hershey, Daniel Pressnitzer, Jon P. Barker

    Abstract: Supervised models for speech enhancement are trained using artificially generated mixtures of clean speech and noise signals. However, the synthetic training conditions may not accurately reflect real-world conditions encountered during testing. This discrepancy can result in poor performance when the test domain significantly differs from the synthetic training domain. To tackle this issue, the U… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  2. The CHiME-7 UDASE task: Unsupervised domain adaptation for conversational speech enhancement

    Authors: Simon Leglaive, Léonie Borne, Efthymios Tzinis, Mostafa Sadeghi, Matthieu Fraticelli, Scott Wisdom, Manuel Pariente, Daniel Pressnitzer, John R. Hershey

    Abstract: Supervised speech enhancement models are trained using artificially generated mixtures of clean speech and noise signals, which may not match real-world recording conditions at test time. This mismatch can lead to poor performance if the test domain significantly differs from the synthetic training domain. This paper introduces the unsupervised domain adaptation for conversational speech enhanceme… ▽ More

    Submitted 2 October, 2023; v1 submitted 7 July, 2023; originally announced July 2023.

    Journal ref: The 7th International Workshop on Speech Processing in Everyday Environments (CHiME), Dublin, Ireland, 2023

  3. arXiv:2302.07928  [pdf, other

    eess.AS cs.SD eess.SP

    Multi-Channel Target Speaker Extraction with Refinement: The WavLab Submission to the Second Clarity Enhancement Challenge

    Authors: Samuele Cornell, Zhong-Qiu Wang, Yoshiki Masuyama, Shinji Watanabe, Manuel Pariente, Nobutaka Ono

    Abstract: This paper describes our submission to the Second Clarity Enhancement Challenge (CEC2), which consists of target speech enhancement for hearing-aid (HA) devices in noisy-reverberant environments with multiple interferers such as music and competing speakers. Our approach builds upon the powerful iterative neural/beamforming enhancement (iNeuBe) framework introduced in our recent work, and this p… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

  4. arXiv:2111.04614  [pdf, other

    eess.AS cs.LG cs.SD eess.SP

    Learning Filterbanks for End-to-End Acoustic Beamforming

    Authors: Samuele Cornell, Manuel Pariente, François Grondin, Stefano Squartini

    Abstract: Recent work on monaural source separation has shown that performance can be increased by using fully learned filterbanks with short windows. On the other hand it is widely known that, for conventional beamforming techniques, performance increases with long analysis windows. This applies also to most hybrid neural beamforming methods which rely on a deep neural network (DNN) to estimate the spatial… ▽ More

    Submitted 19 February, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

    Comments: accepted at ICASSP 2022

  5. arXiv:2005.11262  [pdf, other

    eess.AS

    LibriMix: An Open-Source Dataset for Generalizable Speech Separation

    Authors: Joris Cosentino, Manuel Pariente, Samuele Cornell, Antoine Deleforge, Emmanuel Vincent

    Abstract: In recent years, wsj0-2mix has become the reference dataset for single-channel speech separation. Most deep learning-based speech separation models today are benchmarked on it. However, recent studies have shown important performance drops when models trained on wsj0-2mix are evaluated on other, similar datasets. To address this generalization issue, we created LibriMix, an open-source alternative… ▽ More

    Submitted 22 May, 2020; originally announced May 2020.

    Comments: submitted to INTERSPEECH 2020

  6. arXiv:2005.04132  [pdf, other

    eess.AS cs.SD

    Asteroid: the PyTorch-based audio source separation toolkit for researchers

    Authors: Manuel Pariente, Samuele Cornell, Joris Cosentino, Sunit Sivasankaran, Efthymios Tzinis, Jens Heitkaemper, Michel Olvera, Fabian-Robert Stöter, Mathieu Hu, Juan M. Martín-Doñas, David Ditter, Ariel Frank, Antoine Deleforge, Emmanuel Vincent

    Abstract: This paper describes Asteroid, the PyTorch-based audio source separation toolkit for researchers. Inspired by the most successful neural source separation systems, it provides all neural building blocks required to build such a system. To improve reproducibility, Kaldi-style recipes on common audio source separation datasets are also provided. This paper describes the software architecture of Aste… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: Submitted to Interspeech 2020

  7. arXiv:1910.10400  [pdf, other

    cs.SD cs.LG eess.AS eess.SP

    Filterbank design for end-to-end speech separation

    Authors: Manuel Pariente, Samuele Cornell, Antoine Deleforge, Emmanuel Vincent

    Abstract: Single-channel speech separation has recently made great progress thanks to learned filterbanks as used in ConvTasNet. In parallel, parameterized filterbanks have been proposed for speaker recognition where only center frequencies and bandwidths are learned. In this work, we extend real-valued learned and parameterized filterbanks into complex-valued analytic filterbanks and define a set of corres… ▽ More

    Submitted 28 February, 2020; v1 submitted 23 October, 2019; originally announced October 2019.

    Comments: ICASSP 2020

  8. arXiv:1905.01209  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    A Statistically Principled and Computationally Efficient Approach to Speech Enhancement using Variational Autoencoders

    Authors: Manuel Pariente, Antoine Deleforge, Emmanuel Vincent

    Abstract: Recent studies have explored the use of deep generative models of speech spectra based of variational autoencoders (VAEs), combined with unsupervised noise models, to perform speech enhancement. These studies developed iterative algorithms involving either Gibbs sampling or gradient descent at each step, making them computationally expensive. This paper proposes a variational inference method to i… ▽ More

    Submitted 14 May, 2019; v1 submitted 3 May, 2019; originally announced May 2019.

    Comments: Submitted to INTERSPEECH 2019