Wake Word Detection with Alignment-Free Lattice-Free MMI

Wang, Yiming; Lv, Hang; Povey, Daniel; Xie, Lei; Khudanpur, Sanjeev

Electrical Engineering and Systems Science > Audio and Speech Processing

arXiv:2005.08347 (eess)

[Submitted on 17 May 2020 (v1), last revised 28 Jul 2020 (this version, v3)]

Title:Wake Word Detection with Alignment-Free Lattice-Free MMI

Authors:Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur

View PDF

Abstract:Always-on spoken language interfaces, e.g. personal digital assistants, rely on a wake word to start processing spoken input. We present novel methods to train a hybrid DNN/HMM wake word detection system from partially labeled training data, and to use it in on-line applications: (i) we remove the prerequisite of frame-level alignments in the LF-MMI training algorithm, permitting the use of un-transcribed training examples that are annotated only for the presence/absence of the wake word; (ii) we show that the classical keyword/filler model must be supplemented with an explicit non-speech (silence) model for good performance; (iii) we present an FST-based decoder to perform online detection. We evaluate our methods on two real data sets, showing 50%--90% reduction in false rejection rates at pre-specified false alarm rates over the best previously published figures, and re-validate them on a third (large) data set.

Comments:	Accepted at Interspeech 2020. 5 pages, 3 figures
Subjects:	Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
Cite as:	arXiv:2005.08347 [eess.AS]
	(or arXiv:2005.08347v3 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2005.08347

Submission history

From: Yiming Wang [view email]
[v1] Sun, 17 May 2020 19:22:25 UTC (91 KB)
[v2] Mon, 25 May 2020 05:52:20 UTC (93 KB)
[v3] Tue, 28 Jul 2020 22:06:14 UTC (92 KB)

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Wake Word Detection with Alignment-Free Lattice-Free MMI

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Wake Word Detection with Alignment-Free Lattice-Free MMI

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators