Skip to main content

Showing 1–3 of 3 results for author: Göderle, W

.
  1. arXiv:2401.07787  [pdf, ps, other

    cs.CV cs.LG

    Improving OCR Quality in 19th Century Historical Documents Using a Combined Machine Learning Based Approach

    Authors: David Fleischhacker, Wolfgang Goederle, Roman Kern

    Abstract: This paper addresses a major challenge to historical research on the 19th century. Large quantities of sources have become digitally available for the first time, while extraction techniques are lagging behind. Therefore, we researched machine learning (ML) models to recognise and extract complex data structures in a high-value historical primary source, the Schematismus. It records every single p… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 29 pages, 23 figures, 7 tables

  2. arXiv:2312.07560  [pdf

    cs.CV cs.LG

    AI-driven Structure Detection and Information Extraction from Historical Cadastral Maps (Early 19th Century Franciscean Cadastre in the Province of Styria) and Current High-resolution Satellite and Aerial Imagery for Remote Sensing

    Authors: Wolfgang Göderle, Christian Macher, Katrin Mauthner, Oliver Pimas, Fabian Rampetsreiter

    Abstract: Cadastres from the 19th century are a complex as well as rich source for historians and archaeologists, whose use presents them with great challenges. For archaeological and historical remote sensing, we have trained several Deep Learning models, CNNs as well as Vision Transformers, to extract large-scale data from this knowledge representation. We present the principle results of our work here an… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 18 pages, 7 figures

  3. arXiv:2303.06026  [pdf

    eess.AS cs.CL cs.LG cs.SD

    wav2vec and its current potential to Automatic Speech Recognition in German for the usage in Digital History: A comparative assessment of available ASR-technologies for the use in cultural heritage contexts

    Authors: Michael Fleck, Wolfgang Göderle

    Abstract: In this case study we trained and published a state-of-the-art open-source model for Automatic Speech Recognition (ASR) for German to evaluate the current potential of this technology for the use in the larger context of Digital Humanities and cultural heritage indexation. Along with this paper we publish our wav2vec2 based speech to text model while we evaluate its performance on a corpus of hist… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: 11 pages, 2 tables