Skip to main content

Showing 1–1 of 1 results for author: Maurer, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2110.01661  [pdf, other

    cs.CL cs.AI cs.LG

    Rerunning OCR: A Machine Learning Approach to Quality Assessment and Enhancement Prediction

    Authors: Pit Schneider, Yves Maurer

    Abstract: Iterating with new and improved OCR solutions enforces decision making when it comes to targeting the right candidates for reprocessing. This especially applies when the underlying data collection is of considerable size and rather diverse in terms of fonts, languages, periods of publication and consequently OCR quality. This article captures the efforts of the National Library of Luxembourg to su… ▽ More

    Submitted 31 October, 2022; v1 submitted 4 October, 2021; originally announced October 2021.

    Comments: Journal of Data Mining and Digital Humanities; Minor revision

    ACM Class: I.2.7

    Journal ref: Journal of Data Mining & Digital Humanities, 2022, Digital humanities in languages (November 30, 2022) jdmdh:8561