Skip to main content

Showing 1–3 of 3 results for author: Wotherspoon, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2404.11619  [pdf, ps, other

    eess.AS cs.CL cs.SD

    Advancing Speech Translation: A Corpus of Mandarin-English Conversational Telephone Speech

    Authors: Shannon Wotherspoon, William Hartmann, Matthew Snover

    Abstract: This paper introduces a set of English translations for a 123-hour subset of the CallHome Mandarin Chinese data and the HKUST Mandarin Telephone Speech data for the task of speech translation. Paired source-language speech and target-language text is essential for training end-to-end speech translation systems and can provide substantial performance improvements for cascaded systems as well, relat… ▽ More

    Submitted 25 March, 2024; originally announced April 2024.

    Comments: 2 pages

  2. arXiv:2106.07699  [pdf, ps, other

    cs.CL cs.SD eess.AS

    Using heterogeneity in semi-supervised transcription hypotheses to improve code-switched speech recognition

    Authors: Andrew Slottje, Shannon Wotherspoon, William Hartmann, Matthew Snover, Owen Kimball

    Abstract: Modeling code-switched speech is an important problem in automatic speech recognition (ASR). Labeled code-switched data are rare, so monolingual data are often used to model code-switched speech. These monolingual data may be more closely matched to one of the languages in the code-switch pair. We show that such asymmetry can bias prediction toward the better-matched language and degrade overall m… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: 5 pages

  3. arXiv:2012.13004  [pdf, ps, other

    cs.CL cs.SD eess.AS

    Speech Synthesis as Augmentation for Low-Resource ASR

    Authors: Deblin Bagchi, Shannon Wotherspoon, Zhuolin Jiang, Prasanna Muthukumar

    Abstract: Speech synthesis might hold the key to low-resource speech recognition. Data augmentation techniques have become an essential part of modern speech recognition training. Yet, they are simple, naive, and rarely reflect real-world conditions. Meanwhile, speech synthesis techniques have been rapidly getting closer to the goal of achieving human-like speech. In this paper, we investigate the possibili… ▽ More

    Submitted 23 December, 2020; originally announced December 2020.