Showing 1–4 of 4 results for author: Nespoli, F

  1. arXiv:2309.14129

    eess.AS cs.SD

    Speaker anonymization using neural audio codec language models

    Authors: Michele Panariello, Francesco Nespoli, Massimiliano Todisco, Nicholas Evans

    Abstract: The vast majority of approaches to speaker anonymization involve the extraction of fundamental frequency estimates, linguistic features and a speaker embedding which is perturbed to obfuscate the speaker identity before an anonymized speech waveform is resynthesized using a vocoder. Recent work has shown that x-vector transformations are difficult to control consistently: other sources of speaker…

    Submitted 12 January, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted at ICASSP 2024

  2. arXiv:2306.16071

    eess.AS cs.CL cs.SD

    Long-term Conversation Analysis: Exploring Utility and Privacy

    Authors: Francesco Nespoli, Jule Pohlhausen, Patrick A. Naylor, Joerg Bitzer

    Abstract: The analysis of conversations recorded in everyday life requires privacy protection. In this contribution, we explore a privacy-preserving feature extraction method based on input feature dimension reduction, spectral smoothing and the low-cost speaker anonymization technique based on McAdams coefficient. We assess the utility of the feature extraction methods with a voice activity detection and a…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: Submitted to ITG Conference on Speech Communication, 2023

  3. arXiv:2306.16069

    eess.AS cs.SD eess.SP

    Two-Stage Voice Anonymization for Enhanced Privacy

    Authors: Francesco Nespoli, Daniel Barreda, Joerg Bitzer, Patrick A. Naylor

    Abstract: In recent years, the need for privacy preservation when manipulating or storing personal data, including speech, has become a major issue. In this paper, we present a system addressing the speaker-level anonymization problem. We propose and evaluate a two-stage anonymization pipeline exploiting a state-of-the-art anonymization model described in the Voice Privacy Challenge 2022 in combination wit…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: Submitted to INTERSPEECH

  4. arXiv:2212.01306

    eess.AS cs.SD

    Relative Acoustic Features for Distance Estimation in Smart-Homes

    Authors: Francesco Nespoli, Daniel Barreda, Patrick A. Naylor

    Abstract: Any audio recording encapsulates the unique fingerprint of the associated acoustic environment, namely the background noise and reverberation. Considering the scenario of a room equipped with a fixed smart speaker device with one or more microphones and a wearable smart device (watch, glasses or smartphone), we employed the improved proportionate normalized least mean square adaptive filter to est…

    Submitted 2 December, 2022; originally announced December 2022.

    Journal ref: Interspeech 2022