Skip to main content

Showing 1–2 of 2 results for author: Foley, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2302.04975  [pdf, other

    cs.CL

    Leveraging supplementary text data to kick-start automatic speech recognition system development with limited transcriptions

    Authors: Nay San, Martijn Bartelds, Blaine Billings, Ella de Falco, Hendi Feriza, Johan Safri, Wawan Sahrozi, Ben Foley, Bradley McDonnell, Dan Jurafsky

    Abstract: Recent research using pre-trained transformer models suggests that just 10 minutes of transcribed speech may be enough to fine-tune such a model for automatic speech recognition (ASR) -- at least if we can also leverage vast amounts of text data (803 million tokens). But is that much text data necessary? We study the use of different amounts of text data, both for creating a lexicon that constrain… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: Accepted for ComputEL-6

  2. arXiv:2101.03027  [pdf, other

    cs.CL cs.AI eess.SP

    User-friendly automatic transcription of low-resource languages: Plugging ESPnet into Elpis

    Authors: Oliver Adams, Benjamin Galliot, Guillaume Wisniewski, Nicholas Lambourne, Ben Foley, Rahasya Sanders-Dwyer, Janet Wiles, Alexis Michaud, Séverine Guillaume, Laurent Besacier, Christopher Cox, Katya Aplonova, Guillaume Jacques, Nathan Hill

    Abstract: This paper reports on progress integrating the speech recognition toolkit ESPnet into Elpis, a web front-end originally designed to provide access to the Kaldi automatic speech recognition toolkit. The goal of this work is to make end-to-end speech recognition models available to language workers via a user-friendly graphical interface. Encouraging results are reported on (i) development of an ESP… ▽ More

    Submitted 22 February, 2021; v1 submitted 15 December, 2020; originally announced January 2021.