Skip to main content

Showing 1–9 of 9 results for author: Halpern, B M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.06208  [pdf, other

    cs.SD eess.AS

    Quantifying the effect of speech pathology on automatic and human speaker verification

    Authors: Bence Mark Halpern, Thomas Tienkamp, Wen-Chin Huang, Lester Phillip Violeta, Teja Rebernik, Sebastiaan de Visscher, Max Witjes, Martijn Wieling, Defne Abur, Tomoki Toda

    Abstract: This study investigates how surgical intervention for speech pathology (specifically, as a result of oral cancer surgery) impacts the performance of an automatic speaker verification (ASV) system. Using two recently collected Dutch datasets with parallel pre and post-surgery audio from the same speaker, NKI-OC-VC and SPOKE, we assess the extent to which speech pathology influences ASV performance,… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 5 pages, 2 figures, 2 tables. Accepted to Interspeech 2024

    ACM Class: I.2.7

  2. arXiv:2310.02570  [pdf, other

    cs.SD eess.AS

    Improving severity preservation of healthy-to-pathological voice conversion with global style tokens

    Authors: Bence Mark Halpern, Wen-Chin Huang, Lester Phillip Violeta, R. J. J. H. van Son, Tomoki Toda

    Abstract: In healthy-to-pathological voice conversion (H2P-VC), healthy speech is converted into pathological while preserving the identity. The paper improves on previous two-stage approach to H2P-VC where (1) speech is created first with the appropriate severity, (2) then the speaker identity of the voice is converted while preserving the severity of the voice. Specifically, we propose improvements to (2)… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 7 pages, 3 figures, 5 tables. Accepted to IEEE Automatic Speech Recognition and Understanding Workshop 2023

    ACM Class: I.2.7

  3. arXiv:2203.17072  [pdf, other

    cs.SD cs.CL eess.AS

    Manipulation of oral cancer speech using neural articulatory synthesis

    Authors: Bence Mark Halpern, Teja Rebernik, Thomas Tienkamp, Rob van Son, Michiel van den Brekel, Martijn Wieling, Max Witjes, Odette Scharenborg

    Abstract: We present an articulatory synthesis framework for the synthesis and manipulation of oral cancer speech for clinical decision making and alleviation of patient stress. Objective and subjective evaluations demonstrate that the framework has acceptable naturalness and is worth further investigation. A subsequent subjective vowel and consonant identification experiment showed that the articulatory sy… ▽ More

    Submitted 31 March, 2022; originally announced March 2022.

    Comments: 5 pages, 4 tables, 1 figure. Submitted to Interspeech 2022

  4. arXiv:2201.04908  [pdf, ps, other

    cs.SD cs.AI eess.AS

    The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for Improved Dysarthric Speech Recognition

    Authors: Luke Prananta, Bence Mark Halpern, Siyuan Feng, Odette Scharenborg

    Abstract: In this paper, we investigate several existing and a new state-of-the-art generative adversarial network-based (GAN) voice conversion method for enhancing dysarthric speech for improved dysarthric speech recognition. We compare key components of existing methods as part of a rigorous ablation study to find the most effective solution to improve dysarthric speech recognition. We find that straightf… ▽ More

    Submitted 13 January, 2022; originally announced January 2022.

    Comments: Extended version of paper to be submitted to Interspeech 2022. 6 pages, 2 tables

  5. arXiv:2110.08213  [pdf, other

    cs.SD cs.CL eess.AS q-bio.QM

    Towards Identity Preserving Normal to Dysarthric Voice Conversion

    Authors: Wen-Chin Huang, Bence Mark Halpern, Lester Phillip Violeta, Odette Scharenborg, Tomoki Toda

    Abstract: We present a voice conversion framework that converts normal speech into dysarthric speech while preserving the speaker identity. Such a framework is essential for (1) clinical decision making processes and alleviation of patient stress, (2) data augmentation for dysarthric speech recognition. This is an especially challenging task since the converted samples should capture the severity of dysarth… ▽ More

    Submitted 15 October, 2021; originally announced October 2021.

    Comments: Submitted to ICASSP 2022

  6. arXiv:2107.00308  [pdf, other

    cs.SD cs.CL eess.AS

    An Objective Evaluation Framework for Pathological Speech Synthesis

    Authors: Bence Mark Halpern, Julian Fritsch, Enno Hermann, Rob van Son, Odette Scharenborg, Mathew Magimai. -Doss

    Abstract: The development of pathological speech systems is currently hindered by the lack of a standardised objective evaluation framework. In this work, (1) we utilise existing detection and analysis techniques to propose a general framework for the consistent evaluation of synthetic pathological speech. This framework evaluates the voice quality and the intelligibility aspects of speech and is shown to b… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: 4 pages, 4 figures. Accepted to the ITG Conference on Speech Communication | 29.09.2021 - 01.10.2021 | Kiel

  7. arXiv:2106.08427  [pdf, other

    cs.SD cs.CL eess.AS

    Pathological voice adaptation with autoencoder-based voice conversion

    Authors: Marc Illa, Bence Mark Halpern, Rob van Son, Laureano Moro-Velazquez, Odette Scharenborg

    Abstract: In this paper, we propose a new approach to pathological speech synthesis. Instead of using healthy speech as a source, we customise an existing pathological speech sample to a new speaker's voice characteristics. This approach alleviates the evaluation problem one normally has when converting typical speech to pathological speech, as in our approach, the voice conversion (VC) model does not need… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: 6 pages, 3 figures. Accepted to the 11th ISCA Speech Synthesis Workshop (2021)

  8. arXiv:2103.15122  [pdf, other

    eess.AS cs.CL cs.SD

    Quantifying Bias in Automatic Speech Recognition

    Authors: Siyuan Feng, Olya Kudina, Bence Mark Halpern, Odette Scharenborg

    Abstract: Automatic speech recognition (ASR) systems promise to deliver objective interpretation of human speech. Practice and recent evidence suggests that the state-of-the-art (SotA) ASRs struggle with the large variation in speech due to e.g., gender, age, speech impairment, race, and accents. Many factors can cause the bias of an ASR system. Our overarching goal is to uncover bias in ASR systems to work… ▽ More

    Submitted 1 April, 2021; v1 submitted 28 March, 2021; originally announced March 2021.

    Comments: Submitted to INTERSPEECH (IS) 2021. This preprint version differs slightly from the version submitted to IS 2021: Figure 1 is not included in IS 2021

  9. arXiv:2007.14205  [pdf, other

    eess.AS cs.LG cs.SD

    Detecting and analysing spontaneous oral cancer speech in the wild

    Authors: Bence Mark Halpern, Rob van Son, Michiel van den Brekel, Odette Scharenborg

    Abstract: Oral cancer speech is a disease which impacts more than half a million people worldwide every year. Analysis of oral cancer speech has so far focused on read speech. In this paper, we 1) present and 2) analyse a three-hour long spontaneous oral cancer speech dataset collected from YouTube. 3) We set baselines for an oral cancer speech detection task on this dataset. The analysis of these explainab… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: Accepted to Interspeech 2020