Skip to main content

Showing 1–6 of 6 results for author: Guimarães, H R

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.03657  [pdf, other

    eess.AS cs.SD

    UrBAN: Urban Beehive Acoustics and PheNoty** Dataset

    Authors: Mahsa Abdollahi, Yi Zhu, Heitor R. Guimarães, Nico Coallier, Ségolène Maucourt, Pierre Giovenazzo, Tiago H. Falk

    Abstract: In this paper, we present a multimodal dataset obtained from a honey bee colony in Montréal, Quebec, Canada, spanning the years of 2021 to 2022. This apiary comprised 10 beehives, with microphones recording more than 2000 hours of high quality raw audio, and also sensors capturing temperature, and humidity. Periodic hive inspections involved monitoring colony honey bee population changes, assessin… ▽ More

    Submitted 20 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  2. arXiv:2403.08654  [pdf, other

    eess.AS cs.SD

    An Efficient End-to-End Approach to Noise Invariant Speech Features via Multi-Task Learning

    Authors: Heitor R. Guimarães, Arthur Pimentel, Anderson R. Avila, Mehdi Rezagholizadeh, Boxing Chen, Tiago H. Falk

    Abstract: Self-supervised speech representation learning enables the extraction of meaningful features from raw waveforms. These features can then be efficiently used across multiple downstream tasks. However, two significant issues arise when considering the deployment of such methods ``in-the-wild": (i) Their large size, which can be prohibitive for edge applications; and (ii) their robustness to detrimen… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: Under review on IEEE Transactions on Audio, Speech, and Language Processing (2024)

  3. arXiv:2311.10876  [pdf, other

    eess.AS cs.SD q-bio.QM

    MSPB: a longitudinal multi-sensor dataset with phenotypic trait measurements from honey bees

    Authors: Yi Zhu, Mahsa Abdollahi, Ségolène Maucourt, Nico Coallier, Heitor R. Guimarães, Pierre Giovenazzo, Tiago H. Falk

    Abstract: We present a longitudinal multi-sensor dataset collected from honey bee colonies (Apis mellifera) with rich phenotypic measurements. Data were continuously collected between May-2020 and April-2021 from 53 hives located at two apiaries in Québec, Canada. The sensor data included audio features, temperature, and relative humidity. The phenotypic measurements contained beehive population, number of… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: Under review; project webpage: https://zhu00121.github.io/MSPB-webpage/

  4. arXiv:2309.12914  [pdf, other

    eess.AS cs.SD

    VIC-KD: Variance-Invariance-Covariance Knowledge Distillation to Make Keyword Spotting More Robust Against Adversarial Attacks

    Authors: Heitor R. Guimarães, Arthur Pimentel, Anderson Avila, Tiago H. Falk

    Abstract: Keyword spotting (KWS) refers to the task of identifying a set of predefined words in audio streams. With the advances seen recently with deep neural networks, it has become a popular technology to activate and control small devices, such as voice assistants. Relying on such models for edge devices, however, can be challenging due to hardware constraints. Moreover, as adversarial attacks have incr… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

    Comments: Submitted to ICASSP 2024

  5. arXiv:2302.09437  [pdf, other

    eess.AS cs.SD

    RobustDistiller: Compressing Universal Speech Representations for Enhanced Environment Robustness

    Authors: Heitor R. Guimarães, Arthur Pimentel, Anderson R. Avila, Mehdi Rezagholizadeh, Boxing Chen, Tiago H. Falk

    Abstract: Self-supervised speech pre-training enables deep neural network models to capture meaningful and disentangled factors from raw waveform signals. The learned universal speech representations can then be used across numerous downstream tasks. These representations, however, are sensitive to distribution shifts caused by environmental factors, such as noise and/or room reverberation. Their large size… ▽ More

    Submitted 22 February, 2023; v1 submitted 18 February, 2023; originally announced February 2023.

    Comments: Accepted by ICASSP 2023

  6. arXiv:2211.06562  [pdf, ps, other

    cs.SD cs.CL eess.AS

    Improving the Robustness of DistilHuBERT to Unseen Noisy Conditions via Data Augmentation, Curriculum Learning, and Multi-Task Enhancement

    Authors: Heitor R. Guimarães, Arthur Pimentel, Anderson R. Avila, Mehdi Rezagholizadeh, Tiago H. Falk

    Abstract: Self-supervised speech representation learning aims to extract meaningful factors from the speech signal that can later be used across different downstream tasks, such as speech and/or emotion recognition. Existing models, such as HuBERT, however, can be fairly large thus may not be suitable for edge speech applications. Moreover, realistic applications typically involve speech corrupted by noise… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: ENLSP-II NeurIPS Workshop 2022, 6 pages