-
arXiv:2104.03416 [pdf, ps, other]
Pushing the Limits of Non-Autoregressive Speech Recognition
Abstract: We combine recent advancements in end-to-end speech recognition to non-autoregressive automatic speech recognition. We push the limits of non-autoregressive state-of-the-art results for multiple datasets: LibriSpeech, Fisher+Switchboard and Wall Street Journal. Key to our recipe, we leverage CTC on giant Conformer neural network architectures with SpecAugment and wav2vec2 pre-training. We achieve… ▽ More
Submitted 11 September, 2021; v1 submitted 7 April, 2021; originally announced April 2021.
Comments: Proceedings of INTERSPEECH