Line and Word Matching in Old Documents
Abstract: This paper is concerned with the problem of establishing an index based on word matching. It is assumed that the book was digitised as well as possible and that some pre-processing techniques, such as line orientation correction and some noise removal, were already applied. However, two main factors make it impossible to apply ordinary optical character recognition (OCR) techniques: the…
Submitted 17 December, 2004; originally announced December 2004.
Comments: 12 pages, 7 figures, Author at http://alfa.ist.utl.pt/~cvrm/staff/vramos/ref_32.html
ACM Class: I.2; I.5
Journal ref: SIARP 2000 - 5th IberoAmerican Symp. on Pattern Rec., F. Muge, Moises P. and R. Caldas Pinto (Eds.), ISBN 972-97711-1-1, pp. 123-135, Lisbon, Portugal, 11-13 Sep. 2000