Showing 1–3 of 3 results for author: Olvera, M

  1. arXiv:2312.14005  [pdf, ps, other]

    cs.SD cs.AI eess.AS

    On the choice of the optimal temporal support for audio classification with Pre-trained embeddings

    Authors: Aurian Quelennec, Michel Olvera, Geoffroy Peeters, Slim Essid

    Abstract: Current state-of-the-art audio analysis systems rely on pre-trained embedding models, often used off-the-shelf as (frozen) feature extractors. Choosing the best one for a set of tasks is the subject of many recent publications. However, one aspect often overlooked in these works is the influence of the duration of audio input considered to extract an embedding, which we refer to as Temporal Suppor…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: Copyright 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  2. arXiv:2005.07006  [pdf, other]

    eess.AS cs.LG cs.SD eess.SP

    Foreground-Background Ambient Sound Scene Separation

    Authors: Michel Olvera, Emmanuel Vincent, Romain Serizel, Gilles Gasso

    Abstract: Ambient sound scenes typically comprise multiple short events occurring on top of a somewhat stationary background. We consider the task of separating these events from the background, which we call foreground-background ambient sound scene separation. We propose a deep learning-based separation framework with a suitable feature normalization scheme and an optional auxiliary network capturing the…

    Submitted 27 July, 2020; v1 submitted 11 May, 2020; originally announced May 2020.

    Report number: EUSIPCO 2020

    Journal ref: 28th European Signal Processing Conference (EUSIPCO), Jan 2021, Amsterdam, Netherlands

  3. arXiv:2005.04132  [pdf, other]

    eess.AS cs.SD

    Asteroid: the PyTorch-based audio source separation toolkit for researchers

    Authors: Manuel Pariente, Samuele Cornell, Joris Cosentino, Sunit Sivasankaran, Efthymios Tzinis, Jens Heitkaemper, Michel Olvera, Fabian-Robert Stöter, Mathieu Hu, Juan M. Martín-Doñas, David Ditter, Ariel Frank, Antoine Deleforge, Emmanuel Vincent

    Abstract: This paper describes Asteroid, the PyTorch-based audio source separation toolkit for researchers. Inspired by the most successful neural source separation systems, it provides all neural building blocks required to build such a system. To improve reproducibility, Kaldi-style recipes on common audio source separation datasets are also provided. This paper describes the software architecture of Aste…

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: Submitted to Interspeech 2020