Showing 1–1 of 1 results for author: Dufraux, A

  1. arXiv:1910.07323  [pdf, ps, other]

    cs.CL cs.AI cs.LG eess.AS

    Lead2Gold: Towards exploiting the full potential of noisy transcriptions for speech recognition

    Authors: Adrien Dufraux, Emmanuel Vincent, Awni Hannun, Armelle Brun, Matthijs Douze

    Abstract: The transcriptions used to train an Automatic Speech Recognition (ASR) system may contain errors. Usually, either a quality control stage discards transcriptions with too many errors, or the noisy transcriptions are used as is. We introduce Lead2Gold, a method to train an ASR system that exploits the full potential of noisy transcriptions. Based on a noise model of transcription errors, Lead2Gold…

    Submitted 16 October, 2019; originally announced October 2019.

    Comments: 8 pages, 4 tables, accepted for publication in ASRU 2019

    ACM Class: I.2.6; I.2.7