Skip to main content

Showing 1–2 of 2 results for author: Antonova, A

Searching in archive eess. Search in all archives.
.
  1. arXiv:2309.17267  [pdf, other

    eess.AS cs.CL cs.SD

    Wiki-En-ASR-Adapt: Large-scale synthetic dataset for English ASR Customization

    Authors: Alexandra Antonova

    Abstract: We present a first large-scale public synthetic dataset for contextual spellchecking customization of automatic speech recognition (ASR) with focus on diverse rare and out-of-vocabulary (OOV) phrases, such as proper names or terms. The proposed approach allows creating millions of realistic examples of corrupted ASR hypotheses and simulate non-trivial biasing lists for the customization task. Furt… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE ASRU 2023

  2. arXiv:2306.02317  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    SpellMapper: A non-autoregressive neural spellchecker for ASR customization with candidate retrieval based on n-gram map**s

    Authors: Alexandra Antonova, Evelina Bakhturina, Boris Ginsburg

    Abstract: Contextual spelling correction models are an alternative to shallow fusion to improve automatic speech recognition (ASR) quality given user vocabulary. To deal with large user vocabularies, most of these models include candidate retrieval mechanisms, usually based on minimum edit distance between fragments of ASR hypothesis and user phrases. However, the edit-distance approach is slow, non-trainab… ▽ More

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: Accepted by INTERSPEECH 2023