Skip to main content

Showing 1–1 of 1 results for author: Germain, P

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.10422  [pdf, other

    eess.AS cs.SD eess.SP

    Phoneme Discretized Saliency Maps for Explainable Detection of AI-Generated Voice

    Authors: Shubham Gupta, Mirco Ravanelli, Pascal Germain, Cem Subakan

    Abstract: In this paper, we propose Phoneme Discretized Saliency Maps (PDSM), a discretization algorithm for saliency maps that takes advantage of phoneme boundaries for explainable detection of AI-generated voice. We experimentally show with two different Text-to-Speech systems (i.e., Tacotron2 and Fastspeech2) that the proposed algorithm produces saliency maps that result in more faithful explanations com… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted to Interspeech 2024