Showing 1–9 of 9 results for author: Michaud, F

  1. arXiv:2302.07560  [pdf]

    cs.LG cs.SD eess.AS

    Unsupervised classification to improve the quality of a bird song recording dataset

    Authors: Félix Michaud, Jérôme Sueur, Maxime Le Cesne, Sylvain Haupert

    Abstract: Open audio databases such as Xeno-Canto are widely used to build datasets to explore bird song repertoire or to train models for automatic bird sound classification by deep learning algorithms. However, such databases suffer from the fact that bird sounds are weakly labelled: a species name is attributed to each audio recording without timestamps that provide the temporal localization of the bird…

    Submitted 15 February, 2023; originally announced February 2023.

    Journal ref: Ecological Informatics, 2023, pp.101952

  2. arXiv:2204.13622  [pdf, other]

    eess.SP cs.SD eess.AS

    Fast Cross-Correlation for TDoA Estimation on Small Aperture Microphone Arrays

    Authors: François Grondin, Marc-Antoine Maheux, Jean-Samuel Lauzon, Jonathan Vincent, François Michaud

    Abstract: This paper introduces the Fast Cross-Correlation (FCC) method for Time Difference of Arrival (TDoA) Estimation for pairs of microphones on a small aperture microphone array. FCC relies on low-rank decomposition and exploits symmetry in even and odd bases to speed up computation while preserving TDoA accuracy. FCC reduces the number of flops by a factor of 4.5 and the execution speed by factors bet…

    Submitted 10 March, 2023; v1 submitted 28 April, 2022; originally announced April 2022.

    Comments: Submitted to IEEE ICASSP 2023

  3. arXiv:2203.14409  [pdf, other]

    cs.SD eess.AS

    SMP-PHAT: Lightweight DoA Estimation by Merging Microphone Pairs

    Authors: François Grondin, Marc-Antoine Maheux, Jean-Samuel Lauzon, Jonathan Vincent, François Michaud

    Abstract: This paper introduces SMP-PHAT, which performs direction of arrival (DoA) of sound estimation with a microphone array by merging pairs of microphones that are parallel in space. This approach reduces the number of pairwise cross-correlation computations, and brings down the number of flops and memory lookups when searching for DoA. Experiments on low-cost hardware with commonly used microphone arr…

    Submitted 27 March, 2022; originally announced March 2022.

    Comments: Submitted to Interspeech 2022

  4. arXiv:2103.03954  [pdf, other]

    eess.AS cs.RO cs.SD

    ODAS: Open embeddeD Audition System

    Authors: François Grondin, Dominic Létourneau, Cédric Godin, Jean-Samuel Lauzon, Jonathan Vincent, Simon Michaud, Samuel Faucher, François Michaud

    Abstract: Artificial audition aims at providing hearing capabilities to machines, computers and robots. Existing frameworks in robot audition offer interesting sound source localization, tracking and separation performance, although involve a significant amount of computations that limit their use on robots with embedded computing capabilities. This paper presents ODAS, the Open embeddeD Audition System fra…

    Submitted 11 May, 2022; v1 submitted 5 March, 2021; originally announced March 2021.

    Comments: This paper was published in Frontiers Robotics and AI

  5. arXiv:2010.09930  [pdf, other]

    cs.SD eess.AS

    BIRD: Big Impulse Response Dataset

    Authors: François Grondin, Jean-Samuel Lauzon, Simon Michaud, Mirco Ravanelli, François Michaud

    Abstract: This paper introduces BIRD, the Big Impulse Response Dataset. This open dataset consists of 100,000 multichannel room impulse responses (RIRs) generated from simulations using the Image Method, making it the largest multichannel open dataset currently available. These RIRs can be used to perform efficient online data augmentation for scenarios that involve two microphones and multiple sound sources…

    Submitted 19 October, 2020; originally announced October 2020.

  6. arXiv:2008.00072  [pdf, other]

    cs.CV eess.IV

    Dynamic Object Tracking and Masking for Visual SLAM

    Authors: Jonathan Vincent, Mathieu Labbé, Jean-Samuel Lauzon, François Grondin, Pier-Marc Comtois-Rivet, François Michaud

    Abstract: In dynamic environments, performance of visual SLAM techniques can be impaired by visual features taken from moving objects. One solution is to identify those objects so that their visual features can be removed for localization and mapping. This paper presents a simple and fast pipeline that uses deep neural networks, extended Kalman filters and visual SLAM to improve both localization and mappin…

    Submitted 31 July, 2020; originally announced August 2020.

  7. arXiv:2007.11079  [pdf, other]

    eess.AS cs.SD

    3D Localization of a Sound Source Using Mobile Microphone Arrays Referenced by SLAM

    Authors: Simon Michaud, Samuel Faucher, François Grondin, Jean-Samuel Lauzon, Mathieu Labbé, Dominic Létourneau, François Ferland, François Michaud

    Abstract: A microphone array can provide a mobile robot with the capability of localizing, tracking and separating distant sound sources in 2D, i.e., estimating their relative elevation and azimuth. To combine acoustic data with visual information in real world settings, spatial correlation must be established. The approach explored in this paper consists of having two robots, each equipped with a microphon…

    Submitted 21 July, 2020; originally announced July 2020.

  8. arXiv:2005.09587  [pdf, other]

    eess.AS

    GEV Beamforming Supported by DOA-based Masks Generated on Pairs of Microphones

    Authors: Francois Grondin, Jean-Samuel Lauzon, Jonathan Vincent, Francois Michaud

    Abstract: Distant speech processing is a challenging task, especially when dealing with the cocktail party effect. Sound source separation is thus often required as a preprocessing step prior to speech recognition to improve the signal to distortion ratio (SDR). Recently, a combination of beamforming and speech separation networks have been proposed to improve the target source quality in the direction of a…

    Submitted 5 August, 2020; v1 submitted 19 May, 2020; originally announced May 2020.

  9. arXiv:1812.00115  [pdf, other]

    eess.AS cs.SD

    Lightweight and Optimized Sound Source Localization and Tracking Methods for Open and Closed Microphone Array Configurations

    Authors: Francois Grondin, Francois Michaud

    Abstract: Human-robot interaction in natural settings requires filtering out the different sources of sounds from the environment. Such ability usually involves the use of microphone arrays to localize, track and separate sound sources online. Multi-microphone signal processing techniques can improve robustness to noise but the processing cost increases with the number of microphones used, limiting response…

    Submitted 30 November, 2018; originally announced December 2018.