Skip to main content

Showing 1–20 of 20 results for author: Ingold, R

Searching in archive cs. Search in all archives.
.
  1. Impact of Ground Truth Quality on Handwriting Recognition

    Authors: Michael Jungo, Lars Vögtlin, Atefeh Fakhari, Nathan Wegmann, Rolf Ingold, Andreas Fischer, Anna Scius-Bertrand

    Abstract: Handwriting recognition is a key technology for accessing the content of old manuscripts, hel** to preserve cultural heritage. Deep learning shows an impressive performance in solving this task. However, to achieve its full potential, it requires a large amount of labeled data, which is difficult to obtain for ancient languages and scripts. Often, a trade-off has to be made between ground truth… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: SOICT 2023

    Journal ref: SOICT 2023: The 12th International Symposium on Information and Communication Technology

  2. arXiv:2201.08295  [pdf, other

    cs.CV

    DIVA-DAF: A Deep Learning Framework for Historical Document Image Analysis

    Authors: Lars Vögtlin, Anna Scius-Bertrand, Paul Maergner, Andreas Fischer, Rolf Ingold

    Abstract: Deep learning methods have shown strong performance in solving tasks for historical document image analysis. However, despite current libraries and frameworks, programming an experiment or a set of experiments and executing them can be time-consuming. This is why we propose an open-source deep learning framework, DIVA-DAF, which is based on PyTorch Lightning and specifically designed for historica… ▽ More

    Submitted 15 February, 2024; v1 submitted 20 January, 2022; originally announced January 2022.

  3. arXiv:2103.08236  [pdf, other

    cs.CV cs.AI

    Generating Synthetic Handwritten Historical Documents With OCR Constrained GANs

    Authors: Lars Vögtlin, Manuel Drazyk, Vinaychandran Pondenkandath, Michele Alberti, Rolf Ingold

    Abstract: We present a framework to generate synthetic historical documents with precise ground truth using nothing more than a collection of unlabeled historical images. Obtaining large labeled datasets is often the limiting factor to effectively use supervised deep learning methods for Document Image Analysis (DIA). Prior approaches towards synthetic data generation either require expertise or result in p… ▽ More

    Submitted 16 May, 2021; v1 submitted 15 March, 2021; originally announced March 2021.

  4. Document Towers: A MATLAB software implementing a three-dimensional architectural paradigm for the visual exploration of digital documents and libraries

    Authors: Vlad Atanasiu, Rolf Ingold

    Abstract: This article introduces the generic Document Towers paradigm, visualization, and software for visualizing the structure of paginated documents, based on the metaphor of documents-as-architecture. The Document Towers visualizations resemble three-dimensional building models and represent the physical boundaries of logical (e.g., titles, images), semantic (e.g., topics, named entities), graphical (e… ▽ More

    Submitted 11 March, 2021; originally announced March 2021.

    Comments: Afferent Open Source software is found at https://github.com/ElsevierSoftwareX/SOFTX-D-21-00009

    Journal ref: SoftwareX, 2021 (14): 100684

  5. arXiv:1911.05045  [pdf, other

    cs.CV cs.LG

    Trainable Spectrally Initializable Matrix Transformations in Convolutional Neural Networks

    Authors: Michele Alberti, Angela Botros, Narayan Schuez, Rolf Ingold, Marcus Liwicki, Mathias Seuret

    Abstract: In this work, we investigate the application of trainable and spectrally initializable matrix transformations on the feature maps produced by convolution operations. While previous literature has already demonstrated the possibility of adding static spectral transformations as feature processors, our focus is on more general trainable transforms. We study the transforms in various architectural co… ▽ More

    Submitted 13 November, 2019; v1 submitted 12 November, 2019; originally announced November 2019.

    Comments: 8 pages

  6. arXiv:1906.11894  [pdf, other

    cs.CV cs.CL

    Labeling, Cutting, Grou**: an Efficient Text Line Segmentation Method for Medieval Manuscripts

    Authors: Michele Alberti, Lars Vögtlin, Vinaychandran Pondenkandath, Mathias Seuret, Rolf Ingold, Marcus Liwicki

    Abstract: This paper introduces a new way for text-line extraction by integrating deep-learning based pre-classification and state-of-the-art segmentation methods. Text-line extraction in complex handwritten documents poses a significant challenge, even to the most modern computer vision algorithms. Historical manuscripts are a particularly hard class of documents as they present several forms of noise, suc… ▽ More

    Submitted 1 July, 2019; v1 submitted 11 June, 2019; originally announced June 2019.

    Journal ref: 2019 15th IAPR International Conference on Document Analysis and Recognition (ICDAR), Sydney, Australia

  7. arXiv:1906.10401  [pdf, other

    cs.CV

    Graph-Based Offline Signature Verification

    Authors: Paul Maergner, Nicholas R. Howe, Kaspar Riesen, Rolf Ingold, Andreas Fischer

    Abstract: Graphs provide a powerful representation formalism that offers great promise to benefit tasks like handwritten signature verification. While most state-of-the-art approaches to signature verification rely on fixed-size representations, graphs are flexible in size and allow modeling local features as well as the global structure of the handwriting. In this article, we present two recent graph-based… ▽ More

    Submitted 25 June, 2019; originally announced June 2019.

  8. arXiv:1906.04736  [pdf, other

    cs.LG stat.ML

    Improving Reproducible Deep Learning Workflows with DeepDIVA

    Authors: Michele Alberti, Vinaychandran Pondenkandath, Lars Vögtlin, Marcel Würsch, Rolf Ingold, Marcus Liwicki

    Abstract: The field of deep learning is experiencing a trend towards producing reproducible research. Nevertheless, it is still often a frustrating experience to reproduce scientific results. This is especially true in the machine learning community, where it is considered acceptable to have black boxes in your experiments. We present DeepDIVA, a framework designed to facilitate easy experimentation and the… ▽ More

    Submitted 11 June, 2019; originally announced June 2019.

    Journal ref: 6th Swiss Conference on Data Science (SDS), Bern, Switzerland, 2019

  9. arXiv:1906.04439  [pdf, ps, other

    cs.AI

    Survey of Artificial Intelligence for Card Games and Its Application to the Swiss Game Jass

    Authors: Joel Niklaus, Michele Alberti, Vinaychandran Pondenkandath, Rolf Ingold, Marcus Liwicki

    Abstract: In the last decades we have witnessed the success of applications of Artificial Intelligence to playing games. In this work we address the challenging field of games with hidden information and card games in particular. Jass is a very popular card game in Switzerland and is closely connected with Swiss culture. To the best of our knowledge, performances of Artificial Intelligence agents in the gam… ▽ More

    Submitted 11 June, 2019; originally announced June 2019.

    Journal ref: 6th Swiss Conference on Data Science (SDS), Bern, Switzerland, 2019

  10. arXiv:1905.09113  [pdf, other

    cs.CV

    A Comprehensive Study of ImageNet Pre-Training for Historical Document Image Analysis

    Authors: Linda Studer, Michele Alberti, Vinaychandran Pondenkandath, Pinar Goktepe, Thomas Kolonko, Andreas Fischer, Marcus Liwicki, Rolf Ingold

    Abstract: Automatic analysis of scanned historical documents comprises a wide range of image analysis tasks, which are often challenging for machine learning due to a lack of human-annotated learning samples. With the advent of deep neural networks, a promising way to cope with the lack of training data is to pre-train models on images from a different domain and then fine-tune them on historical documents.… ▽ More

    Submitted 22 May, 2019; originally announced May 2019.

  11. arXiv:1811.01640  [pdf, other

    cs.LG stat.ML

    Leveraging Random Label Memorization for Unsupervised Pre-Training

    Authors: Vinaychandran Pondenkandath, Michele Alberti, Sammer Puran, Rolf Ingold, Marcus Liwicki

    Abstract: We present a novel approach to leverage large unlabeled datasets by pre-training state-of-the-art deep neural networks on randomly-labeled datasets. Specifically, we train the neural networks to memorize arbitrary labels for all the samples in a dataset and use these pre-trained networks as a starting point for regular supervised learning. Our assumption is that the "memorization infrastructure" l… ▽ More

    Submitted 5 November, 2018; originally announced November 2018.

    Comments: 6 pages

  12. Offline Signature Verification by Combining Graph Edit Distance and Triplet Networks

    Authors: Paul Maergner, Vinaychandran Pondenkandath, Michele Alberti, Marcus Liwicki, Kaspar Riesen, Rolf Ingold, Andreas Fischer

    Abstract: Biometric authentication by means of handwritten signatures is a challenging pattern recognition task, which aims to infer a writer model from only a handful of genuine signatures. In order to make it more difficult for a forger to attack the verification system, a promising strategy is to combine different writer models. In this work, we propose to complement a recent structural approach to offli… ▽ More

    Submitted 17 October, 2018; originally announced October 2018.

    Journal ref: Structural, Syntactic, and Statistical Pattern Recognition. S+SSPR 2018. Lecture Notes in Computer Science, vol 11004. Springer, Cham

  13. arXiv:1808.06809  [pdf, other

    cs.LG stat.ML

    Are You Tampering With My Data?

    Authors: Michele Alberti, Vinaychandran Pondenkandath, Marcel Würsch, Manuel Bouillon, Mathias Seuret, Rolf Ingold, Marcus Liwicki

    Abstract: We propose a novel approach towards adversarial attacks on neural networks (NN), focusing on tampering the data used for training instead of generating attacks on trained models. Our network-agnostic method creates a backdoor during training which can be exploited at test time to force a neural network to exhibit abnormal behaviour. We demonstrate on two widely used datasets (CIFAR-10 and SVHN) th… ▽ More

    Submitted 21 August, 2018; originally announced August 2018.

    Comments: 18 pages

    Journal ref: European Conference on Computer Vision (ECCV 2018), Workshop on Objectionable Content and Misinformation

  14. arXiv:1805.00329  [pdf, other

    cs.CV

    DeepDIVA: A Highly-Functional Python Framework for Reproducible Experiments

    Authors: Michele Alberti, Vinaychandran Pondenkandath, Marcel Würsch, Rolf Ingold, Marcus Liwicki

    Abstract: We introduce DeepDIVA: an infrastructure designed to enable quick and intuitive setup of reproducible experiments with a large range of useful analysis functionality. Reproducing scientific results can be a frustrating experience, not only in document image analysis but in machine learning in general. Using DeepDIVA a researcher can either reproduce a given experiment with a very limited amount of… ▽ More

    Submitted 23 April, 2018; originally announced May 2018.

    Comments: Submitted at the 16th International Conference on Frontiers in Handwriting Recognition (ICFHR), 6 pages, 6 Figures

  15. arXiv:1804.01728  [pdf, other

    cs.CV

    Identifying Cross-Depicted Historical Motifs

    Authors: Vinaychandran Pondenkandath, Michele Alberti, Nicole Eichenberger, Rolf Ingold, Marcus Liwicki

    Abstract: Cross-depiction is the problem of identifying the same object even when it is depicted in a variety of manners. This is a common problem in handwritten historical documents image analysis, for instance when the same letter or motif is depicted in several different ways. It is a simple task for humans yet conventional heuristic computer vision methods struggle to cope with it. In this paper we addr… ▽ More

    Submitted 6 December, 2018; v1 submitted 5 April, 2018; originally announced April 2018.

    Comments: 6 pages, 6 figures

    Journal ref: 16th International Conference on Frontiers in Handwriting Recognition (Vol. 16, pp. 333-338), IEEE, 2018

  16. Open Evaluation Tool for Layout Analysis of Document Images

    Authors: Michele Alberti, Manuel Bouillon, Rolf Ingold, Marcus Liwicki

    Abstract: This paper presents an open tool for standardizing the evaluation process of the layout analysis task of document images at pixel level. We introduce a new evaluation tool that is both available as a standalone Java application and as a RESTful web service. This evaluation tool is free and open-source in order to be a common tool that anyone can use and contribute to. It aims at providing as many… ▽ More

    Submitted 23 November, 2017; originally announced December 2017.

    Comments: The 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), HIP: 4th International Workshop on Historical Document Imaging and Processing, Kyoto, Japan, 2017

    Journal ref: ICDAR-OST 2017

  17. arXiv:1712.01655   

    cs.CV cs.LG

    A Pitfall of Unsupervised Pre-Training

    Authors: Michele Alberti, Mathias Seuret, Rolf Ingold, Marcus Liwicki

    Abstract: The point of this paper is to question typical assumptions in deep learning and suggest alternatives. A particular contribution is to prove that even if a Stacked Convolutional Auto-Encoder is good at reconstructing pictures, it is not necessarily good at discriminating their classes. When using Auto-Encoders, intuitively one assumes that features which are good for reconstruction will also lead t… ▽ More

    Submitted 17 December, 2017; v1 submitted 23 November, 2017; originally announced December 2017.

    Comments: This submission has been withdrawn by the author, it is a duplicate of arXiv:1703.04332

    Journal ref: Conference on Neural Information Processing Systems, Deep Learning: Bridging Theory and Practice, December 2017

  18. Historical Document Image Segmentation with LDA-Initialized Deep Neural Networks

    Authors: Michele Alberti, Mathias Seuret, Vinaychandran Pondenkandath, Rolf Ingold, Marcus Liwicki

    Abstract: In this paper, we present a novel approach to perform deep neural networks layer-wise weight initialization using Linear Discriminant Analysis (LDA). Typically, the weights of a deep neural network are initialized with: random values, greedy layer-wise pre-training (usually as Deep Belief Network or as auto-encoder) or by re-using the layers from another network (transfer learning). Hence, many tr… ▽ More

    Submitted 19 October, 2017; originally announced October 2017.

    Comments: 5 pages

    Journal ref: ICDAR-HIP 2017

  19. arXiv:1703.04332  [pdf, other

    cs.CV

    A Pitfall of Unsupervised Pre-Training

    Authors: Michele Alberti, Mathias Seuret, Rolf Ingold, Marcus Liwicki

    Abstract: The point of this paper is to question typical assumptions in deep learning and suggest alternatives. A particular contribution is to prove that even if a Stacked Convolutional Auto-Encoder is good at reconstructing pictures, it is not necessarily good at discriminating their classes. When using Auto-Encoders, intuitively one assumes that features which are good for reconstruction will also lead t… ▽ More

    Submitted 17 December, 2017; v1 submitted 13 March, 2017; originally announced March 2017.

    Comments: Conference on Neural Information Processing Systems, Deep Learning: Bridging Theory and Practice, December 2017

    ACM Class: I.2.6, I.5.2, I.7.5

  20. PCA-Initialized Deep Neural Networks Applied To Document Image Analysis

    Authors: Mathias Seuret, Michele Alberti, Rolf Ingold, Marcus Liwicki

    Abstract: In this paper, we present a novel approach for initializing deep neural networks, i.e., by turning PCA into neural layers. Usually, the initialization of the weights of a deep neural network is done in one of the three following ways: 1) with random values, 2) layer-wise, usually as Deep Belief Network or as auto-encoder, and 3) re-use of layers from another network (transfer learning). Therefore,… ▽ More

    Submitted 1 February, 2017; originally announced February 2017.

    Journal ref: ICDAR 2017