Showing 1–7 of 7 results for author: Bertin, N

Searching in archive eess.
  1. arXiv:2107.11250  [pdf]

    cs.SD cs.IR cs.LG eess.AS

    Multi-Channel Automatic Music Transcription Using Tensor Algebra

    Authors: Axel Marmoret, Nancy Bertin, Jeremy Cohen

    Abstract: Music is an art, perceived in unique ways by every listener, coming from acoustic signals. In the meantime, standards such as musical scores exist to describe it. Even if humans can make this transcription, it is costly in terms of time and effort, even more so with the explosion of information following the rise of the Internet. In that sense, research is driven in the direction of Automatic M…

    Submitted 23 July, 2021; originally announced July 2021.

    Comments: 40 pages, 14 figures, 5 tables, code can be found at: https://gitlab.inria.fr/amarmore/nonnegative-factorization

    ACM Class: H.5.5

  2. arXiv:2104.13168  [pdf, other]

    eess.AS cs.SD

    dEchorate: a Calibrated Room Impulse Response Database for Echo-aware Signal Processing

    Authors: Diego Di Carlo, Pinchas Tandeitnik, Cédric Foy, Antoine Deleforge, Nancy Bertin, Sharon Gannot

    Abstract: This paper presents dEchorate: a new database of measured multichannel Room Impulse Responses (RIRs) including annotations of early echo timings and 3D positions of microphones, real sources and image sources under different wall configurations in a cuboid room. These data provide a tool for benchmarking recent methods in echo-aware speech enhancement, room geometry estimation, RIR estimation, aco…

    Submitted 27 April, 2021; originally announced April 2021.

  3. arXiv:2104.08580  [pdf, other]

    cs.SD cs.LG eess.AS

    Uncovering audio patterns in music with Nonnegative Tucker Decomposition for structural segmentation

    Authors: Axel Marmoret, Jérémy E. Cohen, Nancy Bertin, Frédéric Bimbot

    Abstract: Recent work has proposed the use of tensor decomposition to model repetitions and to separate tracks in loop-based electronic music. The present work further investigates the ability of Nonnegative Tucker Decomposition (NTD) to uncover musical patterns and structure in pop songs in their audio form. Exploiting the fact that NTD tends to express the content of bars as linear combinations of a few…

    Submitted 17 April, 2021; originally announced April 2021.

    Comments: 7 pages, 6 figures; Code and experiments details available at https://gitlab.inria.fr/amarmore/musicntd/-/tree/0.1.0; Experiments details available at https://ax-le.github.io/resources/ISMIR2020/Notebooks_mainpage.html

    Report number: ISBN: 978-0-9813537-0-8

    ACM Class: H.5.5

    Journal ref: 21st International Society for Music Information Retrieval Conference (ISMIR), Montréal, Canada, 2020, 788-794

  4. arXiv:2005.10228  [pdf, other]

    cs.SD eess.AS eess.SP

    Sparsity-based audio declipping methods: selected overview, new algorithms, and large-scale evaluation

    Authors: Clément Gaultier, Srđan Kitić, Rémi Gribonval, Nancy Bertin

    Abstract: Recent advances in audio declipping have substantially improved the state of the art. Yet, practitioners need guidelines to choose a method, and while existing benchmarks have been instrumental in advancing the field, larger-scale experiments are needed to guide such choices. First, we show that the clipping levels in existing small-scale benchmarks are moderate and…

    Submitted 30 November, 2020; v1 submitted 19 May, 2020; originally announced May 2020.

  5. arXiv:1906.08968  [pdf, other]

    eess.AS eess.SP physics.class-ph

    Mirage: 2D Source Localization Using Microphone Pair Augmentation with Echoes

    Authors: Diego Di Carlo, Antoine Deleforge, Nancy Bertin

    Abstract: It is commonly observed that acoustic echoes hurt performance of sound source localization (SSL) methods. We introduce the concept of microphone array augmentation with echoes (MIRAGE) and show how estimation of early-echo characteristics can in fact benefit SSL. We propose a learning-based scheme for echo estimation combined with a physics-based scheme for echo aggregation. In a simple scenario i…

    Submitted 21 June, 2019; originally announced June 2019.

    Journal ref: International Conference on Acoustics, Speech and Signal Processing - ICASSP 2019, May 2019, Brighton, United Kingdom

  6. arXiv:1812.05901  [pdf, ps, other]

    cs.SD eess.AS

    Evaluation of an open-source implementation of the SRP-PHAT algorithm within the 2018 LOCATA challenge

    Authors: Romain Lebarbenchon, Ewen Camberlein, Diego Di Carlo, Clément Gaultier, Antoine Deleforge, Nancy Bertin

    Abstract: This short paper presents an efficient, flexible implementation of the SRP-PHAT multichannel sound source localization method. The method is evaluated on the single-source tasks of the LOCATA 2018 development dataset, and an associated Matlab toolbox is made available online.

    Submitted 14 December, 2018; originally announced December 2018.

    Comments: In Proceedings of the LOCATA Challenge Workshop - a satellite event of IWAENC 2018 (arXiv:1811.08482)

    Report number: LOCATAchallenge/2018/01

  7. arXiv:1711.11259  [pdf, other]

    cs.SD eess.AS

    A modeling and algorithmic framework for (non)social (co)sparse audio restoration

    Authors: Clément Gaultier, Nancy Bertin, Srđan Kitić, Rémi Gribonval

    Abstract: We propose a unified modeling and algorithmic framework for audio restoration problems. It encompasses analysis sparse priors as well as more classical synthesis sparse priors, and regular sparsity as well as various forms of structured sparsity embodied by shrinkage operators (such as social shrinkage). The versatility of the framework is illustrated on two restoration scenarios: denoising, and de…

    Submitted 30 November, 2017; originally announced November 2017.