
Showing 1–2 of 2 results for author: Sydnor, M

  1. arXiv:2311.10149  [pdf, other]

    eess.AS

    Improving fairness for spoken language understanding in atypical speech with Text-to-Speech

    Authors: Helin Wang, Venkatesh Ravichandran, Milind Rao, Becky Lammers, Myra Sydnor, Nicholas Maragakis, Ankur A. Butala, Jayne Zhang, Lora Clawson, Victoria Chovaz, Laureano Moro-Velazquez

    Abstract: Spoken language understanding (SLU) systems often exhibit suboptimal performance in processing atypical speech, typically caused by neurological conditions and motor impairments. Recent advancements in Text-to-Speech (TTS) synthesis-based augmentation for more fair SLU have struggled to accurately capture the unique vocal characteristics of atypical speakers, largely due to insufficient data. To a…

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted at SyntheticData4ML 2023 Oral

  2. arXiv:2306.10588  [pdf, other]

    eess.AS eess.SP

    DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model

    Authors: Helin Wang, Thomas Thebaud, Jesus Villalba, Myra Sydnor, Becky Lammers, Najim Dehak, Laureano Moro-Velazquez

    Abstract: We present a novel typical-to-atypical voice conversion approach (DuTa-VC), which (i) can be trained with nonparallel data (ii) first introduces diffusion probabilistic model (iii) preserves the target speaker identity (iv) is aware of the phoneme duration of the target speaker. DuTa-VC consists of three parts: an encoder transforms the source mel-spectrogram into a duration-modified speaker-indep…

    Submitted 18 June, 2023; originally announced June 2023.