Skip to main content

Showing 1–11 of 11 results for author: Badeau, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2201.09592  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    Unsupervised Music Source Separation Using Differentiable Parametric Source Models

    Authors: Kilian Schulze-Forster, Gaël Richard, Liam Kelley, Clement S. J. Doire, Roland Badeau

    Abstract: Supervised deep learning approaches to underdetermined audio source separation achieve state-of-the-art performance but require a dataset of mixtures along with their corresponding isolated source signals. Such datasets can be extremely costly to obtain for musical mixtures. This raises a need for unsupervised methods. We propose a novel unsupervised model-based deep learning approach to musical s… ▽ More

    Submitted 31 January, 2023; v1 submitted 24 January, 2022; originally announced January 2022.

    Comments: Revised version of the submission

  2. arXiv:2106.15427  [pdf, other

    stat.ML cs.LG

    Fast Approximation of the Sliced-Wasserstein Distance Using Concentration of Random Projections

    Authors: Kimia Nadjahi, Alain Durmus, Pierre E. Jacob, Roland Badeau, Umut Şimşekli

    Abstract: The Sliced-Wasserstein distance (SW) is being increasingly used in machine learning applications as an alternative to the Wasserstein distance and offers significant computational and statistical benefits. Since it is defined as an expectation over random projections, SW is commonly approximated by Monte Carlo. We adopt a new perspective to approximate SW by making use of the concentration of meas… ▽ More

    Submitted 4 January, 2022; v1 submitted 29 June, 2021; originally announced June 2021.

    Comments: Published at NeurIPS 2021

  3. arXiv:1906.04516  [pdf, other

    stat.ML cs.LG

    Asymptotic Guarantees for Learning Generative Models with the Sliced-Wasserstein Distance

    Authors: Kimia Nadjahi, Alain Durmus, Umut Şimşekli, Roland Badeau

    Abstract: Minimum expected distance estimation (MEDE) algorithms have been widely used for probabilistic models with intractable likelihood functions and they have become increasingly popular due to their use in implicit generative modeling (e.g. Wasserstein generative adversarial networks, Wasserstein autoencoders). Emerging from computational optimal transport, the Sliced-Wasserstein (SW) distance has bec… ▽ More

    Submitted 24 March, 2020; v1 submitted 11 June, 2019; originally announced June 2019.

    Comments: Accepted at NeurIPS 2019 (publication and spotlight presentation)

  4. arXiv:1902.00434  [pdf, other

    cs.LG stat.ML

    Generalized Sliced Wasserstein Distances

    Authors: Soheil Kolouri, Kimia Nadjahi, Umut Simsekli, Roland Badeau, Gustavo K. Rohde

    Abstract: The Wasserstein distance and its variations, e.g., the sliced-Wasserstein (SW) distance, have recently drawn attention from the machine learning community. The SW distance, specifically, was shown to have similar properties to the Wasserstein distance, while being much simpler to compute, and is therefore used in various applications including generative modeling and general supervised/unsupervise… ▽ More

    Submitted 1 February, 2019; originally announced February 2019.

  5. arXiv:1608.01953  [pdf, ps, other

    cs.SD

    Model-based STFT phase recovery for audio source separation

    Authors: Paul Magron, Roland Badeau, Bertrand David

    Abstract: For audio source separation applications, it is common to estimate the magnitude of the short-time Fourier transform (STFT) of each source. In order to further synthesizing time-domain signals, it is necessary to recover the phase of the corresponding complex-valued STFT. Most authors in this field choose a Wiener-like filtering approach which boils down to using the phase of the original mixture.… ▽ More

    Submitted 27 February, 2018; v1 submitted 5 August, 2016; originally announced August 2016.

  6. arXiv:1608.01844  [pdf, ps, other

    cs.SD

    Lévy NMF for robust nonnegative source separation

    Authors: Paul Magron, Roland Badeau, Antoine Liutkus

    Abstract: Source separation, which consists in decomposing data into meaningful structured components, is an active research topic in many areas, such as music and image signal processing, applied physics and text mining. In this paper, we introduce the Positive $α$-stable (P$α$S) distributions to model the latent sources, which are a subclass of the stable distributions family. They notably permit us to mo… ▽ More

    Submitted 8 November, 2016; v1 submitted 5 August, 2016; originally announced August 2016.

  7. arXiv:1606.00037  [pdf, other

    cs.SD

    Nonnegative tensor factorization with frequency modulation cues for blind audio source separation

    Authors: Elliot Creager, Noah D. Stein, Roland Badeau, Philippe Depalle

    Abstract: We present Vibrato Nonnegative Tensor Factorization, an algorithm for single-channel unsupervised audio source separation with an application to separating instrumental or vocal sources with nonstationary pitch from music recordings. Our approach extends Nonnegative Matrix Factorization for audio modeling by including local estimates of frequency modulation as cues in the separation. This permits… ▽ More

    Submitted 31 May, 2016; originally announced June 2016.

    Comments: Accepted at the 17th International Society for Music Information Retrieval (ISMIR) Conference, New York, NY, August 2016

  8. Phase recovery in NMF for audio source separation: an insightful benchmark

    Authors: Paul Magron, Roland Badeau, Bertrand David

    Abstract: Nonnegative Matrix Factorization (NMF) is a powerful tool for decomposing mixtures of audio signals in the Time-Frequency (TF) domain. In applications such as source separation, the phase recovery for each extracted component is a major issue since it often leads to audible artifacts. In this paper, we present a methodology for evaluating various NMF-based source separation techniques involving ph… ▽ More

    Submitted 24 May, 2016; originally announced May 2016.

    Comments: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2015

  9. Phase reconstruction of spectrograms based on a model of repeated audio events

    Authors: Paul Magron, Roland Badeau, Bertrand David

    Abstract: Phase recovery of modified spectrograms is a major issue in audio signal processing applications, such as source separation. This paper introduces a novel technique for estimating the phases of components in complex mixtures within onset frames in the Time-Frequency (TF) domain. We propose to exploit the phase repetitions from one onset frame to another. We introduce a reference phase which charac… ▽ More

    Submitted 24 May, 2016; originally announced May 2016.

    Comments: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2015

  10. arXiv:1605.07467  [pdf, ps, other

    cs.SD

    Phase reconstruction of spectrograms with linear unwrap**: application to audio signal restoration

    Authors: Paul Magron, Roland Badeau, Bertrand David

    Abstract: This paper introduces a novel technique for reconstructing the phase of modified spectrograms of audio signals. From the analysis of mixtures of sinusoids we obtain relationships between phases of successive time frames in the Time-Frequency (TF) domain. To obtain similar relationships over frequencies, in particular within onset frames, we study an impulse model. Instantaneous frequencies and att… ▽ More

    Submitted 24 May, 2016; originally announced May 2016.

    Comments: European Signal Processing Conference (EUSIPCO) 2015

  11. Complex NMF under phase constraints based on signal modeling: application to audio source separation

    Authors: Paul Magron, Roland Badeau, Bertrand David

    Abstract: Nonnegative Matrix Factorization (NMF) is a powerful tool for decomposing mixtures of audio signals in the Time-Frequency (TF) domain. In the source separation framework, the phase recovery for each extracted component is necessary for synthesizing time-domain signals. The Complex NMF (CNMF) model aims to jointly estimate the spectrogram and the phase of the sources, but requires to constrain the… ▽ More

    Submitted 24 May, 2016; originally announced May 2016.

    Comments: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2016