Showing 1–5 of 5 results for author: Wieling, M

Searching in archive eess.
  1. arXiv:2406.06208  [pdf, other]

    cs.SD eess.AS

    Quantifying the effect of speech pathology on automatic and human speaker verification

    Authors: Bence Mark Halpern, Thomas Tienkamp, Wen-Chin Huang, Lester Phillip Violeta, Teja Rebernik, Sebastiaan de Visscher, Max Witjes, Martijn Wieling, Defne Abur, Tomoki Toda

    Abstract: This study investigates how surgical intervention for speech pathology (specifically, as a result of oral cancer surgery) impacts the performance of an automatic speaker verification (ASV) system. Using two recently collected Dutch datasets with parallel pre and post-surgery audio from the same speaker, NKI-OC-VC and SPOKE, we assess the extent to which speech pathology influences ASV performance,…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 5 pages, 2 figures, 2 tables. Accepted to Interspeech 2024

    ACM Class: I.2.7

  2. arXiv:2305.10951  [pdf, other]

    cs.CL eess.AS

    Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation

    Authors: Martijn Bartelds, Nay San, Bradley McDonnell, Dan Jurafsky, Martijn Wieling

    Abstract: The performance of automatic speech recognition (ASR) systems has advanced substantially in recent years, particularly for languages for which a large amount of transcribed speech is available. Unfortunately, for low-resource languages, such as minority languages, regional languages or dialects, ASR performance generally remains much lower. In this study, we investigate whether data augmentation t…

    Submitted 18 May, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023

  3. arXiv:2205.02694  [pdf, other]

    cs.CL cs.SD eess.AS

    Quantifying Language Variation Acoustically with Few Resources

    Authors: Martijn Bartelds, Martijn Wieling

    Abstract: Deep acoustic models represent linguistic information based on massive amounts of data. Unfortunately, for regional languages and dialects such resources are mostly not available. However, deep acoustic models might have learned linguistic information that transfers to low-resource languages. In this study, we evaluate whether this is the case through the task of distinguishing low-resource (Dutch…

    Submitted 25 May, 2022; v1 submitted 5 May, 2022; originally announced May 2022.

    Comments: Accepted at NAACL 2022

  4. arXiv:2203.17072  [pdf, other]

    cs.SD cs.CL eess.AS

    Manipulation of oral cancer speech using neural articulatory synthesis

    Authors: Bence Mark Halpern, Teja Rebernik, Thomas Tienkamp, Rob van Son, Michiel van den Brekel, Martijn Wieling, Max Witjes, Odette Scharenborg

    Abstract: We present an articulatory synthesis framework for the synthesis and manipulation of oral cancer speech for clinical decision making and alleviation of patient stress. Objective and subjective evaluations demonstrate that the framework has acceptable naturalness and is worth further investigation. A subsequent subjective vowel and consonant identification experiment showed that the articulatory sy…

    Submitted 31 March, 2022; originally announced March 2022.

    Comments: 5 pages, 4 tables, 1 figure. Submitted to Interspeech 2022

  5. arXiv:2011.12649  [pdf, other]

    cs.CL eess.AS

    Neural Representations for Modeling Variation in Speech

    Authors: Martijn Bartelds, Wietse de Vries, Faraz Sanal, Caitlin Richter, Mark Liberman, Martijn Wieling

    Abstract: Variation in speech is often quantified by comparing phonetic transcriptions of the same utterance. However, manually transcribing speech is time-consuming and error prone. As an alternative, therefore, we investigate the extraction of acoustic embeddings from several self-supervised neural models. We use these representations to compute word-based pronunciation differences between non-native and…

    Submitted 26 January, 2022; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: Submitted to Journal of Phonetics