Keyword Spotting Simplified: A Segmentation-Free Approach using Character Counting and CTC re-scoring

Retsinas, George; Sfikas, Giorgos; Nikou, Christophoros

arXiv:2308.03515 (cs)

[Submitted on 7 Aug 2023]

Abstract: Recent advances in segmentation-free keyword spotting treat the problem within an object detection paradigm, borrowing from state-of-the-art detection systems to jointly propose word bounding boxes and compute a corresponding representation. Contrary to the norm of such methods, which rely on complex and large DNN models, we propose a novel segmentation-free system that efficiently scans a document image to find rectangular areas that include the query information. The underlying model is simple and compact, predicting character occurrences over rectangular areas through an implicitly learned scale map, trained on word-level annotated images. The proposed document scanning is then performed using this character counting in a cost-effective manner via integral images and binary search. Finally, the retrieval similarity by character counting is refined by a pyramidal representation and a CTC-based re-scoring algorithm, fully utilizing the trained CNN model. Experimental validation on two widely used datasets shows that our method achieves state-of-the-art results, outperforming the more complex alternatives despite the simplicity of the underlying model.
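
The abstract mentions scanning with integral images and binary search. As a rough illustration of why those two tools combine well, the sketch below builds a summed-area table over a hypothetical per-character density map, so that the count inside any rectangle costs four lookups, and then binary-searches for the smallest right edge of a box that reaches a target count. This is not the authors' algorithm: the function names, the single-character density map, and the search over the right edge only are assumptions made for illustration. The binary search is valid here only because the density is assumed non-negative, which makes the box sum monotonic in the right edge.

```python
def integral_image(density):
    """Return the (H+1) x (W+1) summed-area table of a 2D list of non-negative values."""
    h, w = len(density), len(density[0])
    table = [[0.0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0.0
        for x in range(w):
            row_sum += density[y][x]
            # Sum above this row plus the running sum of the current row.
            table[y + 1][x + 1] = table[y][x + 1] + row_sum
    return table


def box_sum(table, y0, x0, y1, x1):
    """Sum over rows y0..y1-1 and columns x0..x1-1 (half-open box), in O(1)."""
    return table[y1][x1] - table[y0][x1] - table[y1][x0] + table[y0][x0]


def smallest_right_edge(table, y0, y1, x0, target):
    """Binary search for the smallest x1 > x0 whose box sum reaches target.

    Returns None if even the widest box falls short. Assumes non-negative
    density, so the box sum never decreases as x1 grows.
    """
    width = len(table[0]) - 1
    if x0 >= width or box_sum(table, y0, x0, y1, width) < target:
        return None
    lo, hi = x0 + 1, width
    while lo < hi:
        mid = (lo + hi) // 2
        if box_sum(table, y0, x0, y1, mid) >= target:
            hi = mid          # mid is wide enough; try narrower boxes
        else:
            lo = mid + 1      # mid is too narrow; widen
    return lo


# Hypothetical density for one character class: one occurrence per marked cell.
density = [[0, 1, 0, 1],
           [0, 1, 0, 1]]
table = integral_image(density)
total = box_sum(table, 0, 0, 2, 4)
edge = smallest_right_edge(table, 0, 2, 0, 2)
```

In a setting like the one the abstract describes, one such table per character class would let a candidate box be compared against a query's character counts without re-summing pixels; how the paper actually organises the search is not specified on this page.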

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2308.03515 [cs.CV]
	(or arXiv:2308.03515v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2308.03515

Submission history

From: George Retsinas
[v1] Mon, 7 Aug 2023 12:11:04 UTC (4,963 KB)
