Skip to main content

Showing 1–2 of 2 results for author: Tailleur, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2403.17529  [pdf, other

    cs.SD eess.AS

    Detection of Deepfake Environmental Audio

    Authors: Hafsa Ouajdi, Oussama Hadder, Modan Tailleur, Mathieu Lagrange, Laurie M. Heller

    Abstract: With the ever-rising quality of deep generative models, it is increasingly important to be able to discern whether the audio data at hand have been recorded or synthesized. Although the detection of fake speech signals has been studied extensively, this is not the case for the detection of fake environmental audio. We propose a simple and efficient pipeline for detecting fake environmental sound… ▽ More

    Submitted 13 June, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  2. arXiv:2403.17508  [pdf, other

    cs.SD eess.AS

    Correlation of Fréchet Audio Distance With Human Perception of Environmental Audio Is Embedding Dependant

    Authors: Modan Tailleur, Junwon Lee, Mathieu Lagrange, Keunwoo Choi, Laurie M. Heller, Keisuke Imoto, Yuki Okamoto

    Abstract: This paper explores whether considering alternative domain-specific embeddings to calculate the Fréchet Audio Distance (FAD) metric can help the FAD to correlate better with perceptual ratings of environmental sounds. We used embeddings from VGGish, PANNs, MS-CLAP, L-CLAP, and MERT, which are tailored for either music or environmental sound evaluation. The FAD scores were calculated for sounds fro… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.