Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI

Tian, **chuan; Yu, Jianwei; Weng, Chao; Zhang, Shi-Xiong; Su, Dan; Yu, Dong; Zou, Yuexian

Computer Science > Artificial Intelligence

arXiv:2112.02498 (cs)

[Submitted on 5 Dec 2021 (v1), last revised 30 Dec 2021 (this version, v2)]

Title:Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI

Authors:**chuan Tian, Jianwei Yu, Chao Weng, Shi-Xiong Zhang, Dan Su, Dong Yu, Yuexian Zou

View PDF

Abstract:Recently, End-to-End (E2E) frameworks have achieved remarkable results on various Automatic Speech Recognition (ASR) tasks. However, Lattice-Free Maximum Mutual Information (LF-MMI), as one of the discriminative training criteria that show superior performance in hybrid ASR systems, is rarely adopted in E2E ASR frameworks. In this work, we propose a novel approach to integrate LF-MMI criterion into E2E ASR frameworks in both training and decoding stages. The proposed approach shows its effectiveness on two of the most widely used E2E frameworks including Attention-Based Encoder-Decoders (AEDs) and Neural Transducers (NTs). Experiments suggest that the introduction of the LF-MMI criterion consistently leads to significant performance improvements on various datasets and different E2E ASR frameworks. The best of our models achieves competitive CER of 4.1\% / 4.4\% on Aishell-1 dev/test set; we also achieve significant error reduction on Aishell-2 and Librispeech datasets over strong baselines.

Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2112.02498 [cs.AI]
	(or arXiv:2112.02498v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2112.02498

Submission history

From: **chuan Tian [view email]
[v1] Sun, 5 Dec 2021 07:30:17 UTC (108 KB)
[v2] Thu, 30 Dec 2021 03:24:56 UTC (108 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2021-12

Change to browse by:

cs
cs.CL

References & Citations

DBLP - CS Bibliography

listing | bibtex

Chao Weng
Shi-Xiong Zhang
Dan Su
Dong Yu
Yuexian Zou

export BibTeX citation

Computer Science > Artificial Intelligence

Title:Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators