Skip to main content

Showing 1–50 of 55 results for author: Christlein, V

.
  1. arXiv:2407.04592  [pdf, other

    cs.CV

    Smell and Emotion: Recognising emotions in smell-related artworks

    Authors: Vishal Patoliya, Mathias Zinnen, Andreas Maier, Vincent Christlein

    Abstract: Emotions and smell are underrepresented in digital art history. In this exploratory work, we show that recognising emotions from smell-related artworks is technically feasible but has room for improvement. Using style transfer and hyperparameter optimization we achieve a minor performance boost and open up the field for future extensions.

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 5 pages, 3 figures

  2. Offline Writer Identification Using Convolutional Neural Network Activation Features

    Authors: Vincent Christlein, David Bernecker, Andreas Maier, Elli Angelopoulou

    Abstract: Convolutional neural networks (CNNs) have recently become the state-of-the-art tool for large-scale image classification. In this work we propose the use of activation features from CNNs as local descriptors for writer identification. A global descriptor is then formed by means of GMM supervector encoding, which is further improved by normalization with the KL-Kernel. We evaluate our method on two… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: fixed tab 1b

    Journal ref: Pattern Recognition. DAGM 2015. Lecture Notes in Computer Science(), vol 9358. Springer, Cham

  3. ARIN: Adaptive Resampling and Instance Normalization for Robust Blind Inpainting of Dunhuang Cave Paintings

    Authors: Alexander Schmidt, Prathmesh Madhu, Andreas Maier, Vincent Christlein, Ronak Kosti

    Abstract: Image enhancement algorithms are very useful for real world computer vision tasks where image resolution is often physically limited by the sensor size. While state-of-the-art deep neural networks show impressive results for image enhancement, they often struggle to enhance real-world images. In this work, we tackle a real-world setting: inpainting of images from Dunhuang caves. The Dunhuang datas… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

    Journal ref: 2022 Eleventh International Conference on Image Processing Theory, Tools and Applications (IPTA), Salzburg, Austria, 2022, pp. 1-6

  4. A Fair Evaluation of Various Deep Learning-Based Document Image Binarization Approaches

    Authors: Richin Sukesh, Mathias Seuret, Anguelos Nicolaou, Martin Mayr, Vincent Christlein

    Abstract: Binarization of document images is an important pre-processing step in the field of document analysis. Traditional image binarization techniques usually rely on histograms or local statistics to identify a valid threshold to differentiate between different aspects of the image. Deep learning techniques are able to generate binarized versions of the images by learning context-dependent features tha… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: DAS 2022

    Journal ref: Document Analysis Systems. DAS 2022. Lecture Notes in Computer Science, vol 13237. Springer, Cham

  5. SniffyArt: The Dataset of Smelling Persons

    Authors: Mathias Zinnen, Azhar Hussian, Hang Tran, Prathmesh Madhu, Andreas Maier, Vincent Christlein

    Abstract: Smell gestures play a crucial role in the investigation of past smells in the visual arts yet their automated recognition poses significant challenges. This paper introduces the SniffyArt dataset, consisting of 1941 individuals represented in 441 historical artworks. Each person is annotated with a tightly fitting bounding box, 17 pose keypoints, and a gesture label. By integrating these annotatio… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: 10 pages, 8 figures

    Journal ref: Proceedings of the 5th Workshop on analySis, Understanding and proMotion of heritAge Contents. 2023. S. 49-58

  6. arXiv:2306.14071  [pdf, other

    cs.CV

    Efficient Annotation of Medieval Charters

    Authors: Anguelos Nicolaou, Daniel Luger, Franziska Decker, Nicolas Renet, Vincent Christlein, Georg Vogeler

    Abstract: Diplomatics, the analysis of medieval charters, is a major field of research in which paleography is applied. Annotating data, if performed by laymen, needs validation and correction by experts. In this paper, we propose an effective and efficient annotation approach for charter segmentation, essentially reducing it to object detection. This approach allows for a much more efficient use of the pal… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

  7. A Vessel-Segmentation-Based CycleGAN for Unpaired Multi-modal Retinal Image Synthesis

    Authors: Aline Sindel, Andreas Maier, Vincent Christlein

    Abstract: Unpaired image-to-image translation of retinal images can efficiently increase the training dataset for deep-learning-based multi-modal retinal registration methods. Our method integrates a vessel segmentation network into the image-to-image translation task by extending the CycleGAN framework. The segmentation network is inserted prior to a UNet vision transformer generator network and serves as… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted to BVM 2023

    Journal ref: BVM 2023

  8. Combining OCR Models for Reading Early Modern Printed Books

    Authors: Mathias Seuret, Janne van der Loop, Nikolaus Weichselbaumer, Martin Mayr, Janina Molnar, Tatjana Hass, Florian Kordon, Anguelos Nicolau, Vincent Christlein

    Abstract: In this paper, we investigate the usage of fine-grained font recognition on OCR for books printed from the 15th to the 18th century. We used a newly created dataset for OCR of early printed books for which fonts are labeled with bounding boxes. We know not only the font group used for each character, but the locations of font changes as well. In books of this period, we frequently find font group… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: Accepted to ICDAR23

    Journal ref: Document Analysis and Recognition - ICDAR 2023. ICDAR 2023. Lecture Notes in Computer Science, vol 14191. Springer, Cham

  9. arXiv:2303.16576  [pdf, other

    cs.CV

    WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models

    Authors: Konstantina Nikolaidou, George Retsinas, Vincent Christlein, Mathias Seuret, Giorgos Sfikas, Elisa Barney Smith, Hamam Mokayed, Marcus Liwicki

    Abstract: Text-to-Image synthesis is the task of generating an image according to a specific text description. Generative Adversarial Networks have been considered the standard method for image synthesis virtually since their introduction. Denoising Diffusion Probabilistic Models are recently setting a new baseline, with remarkable results in Text-to-Image synthesis, among other fields. Aside its usefulness… ▽ More

    Submitted 17 May, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

  10. AMD-HookNet for Glacier Front Segmentation

    Authors: Fei Wu, Nora Gourmelon, Thorsten Seehaus, Jianlin Zhang, Matthias Braun, Andreas Maier, Vincent Christlein

    Abstract: Knowledge on changes in glacier calving front positions is important for assessing the status of glaciers. Remote sensing imagery provides the ideal database for monitoring calving front positions, however, it is not feasible to perform this task manually for all calving glaciers globally due to time-constraints. Deep learning-based methods have shown great potential for glacier calving front deli… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

  11. arXiv:2301.09906  [pdf, other

    cs.CV

    Transfer Learning for Olfactory Object Detection

    Authors: Mathias Zinnen, Prathmesh Madhu, Peter Bell, Andreas Maier, Vincent Christlein

    Abstract: We investigate the effect of style and category similarity in multiple datasets used for object detection pretraining. We find that including an additional stage of object-detection pretraining can increase the detection performance considerably. While our experiments suggest that style similarities between pre-training and target datasets are less important than matching categories, further exper… ▽ More

    Submitted 24 January, 2023; originally announced January 2023.

    Comments: 6 pages, 4 figures

    Journal ref: 2022 Digital Humanities Conference, Tokyo, Japan, 2022, pp.409-413

  12. ODOR: The ICPR2022 ODeuropa Challenge on Olfactory Object Recognition

    Authors: Mathias Zinnen, Prathmesh Madhu, Ronak Kosti, Peter Bell, Andreas Maier, Vincent Christlein

    Abstract: The Odeuropa Challenge on Olfactory Object Recognition aims to foster the development of object detection in the visual arts and to promote an olfactory perspective on digital heritage. Object detection in historical artworks is particularly challenging due to varying styles and artistic periods. Moreover, the task is complicated due to the particularity and historical variance of predefined targe… ▽ More

    Submitted 24 January, 2023; originally announced January 2023.

    Comments: 6 pages, 6 figures

    Journal ref: 2022 26th International Conference on Pattern Recognition (ICPR), Montreal, QC, Canada, 2022, pp. 4989-4994

  13. Writer Retrieval and Writer Identification in Greek Papyri

    Authors: Vincent Christlein, Isabelle Marthot-Santaniello, Martin Mayr, Anguelos Nicolaou, Mathias Seuret

    Abstract: The analysis of digitized historical manuscripts is typically addressed by paleographic experts. Writer identification refers to the classification of known writers while writer retrieval seeks to find the writer by means of image similarity in a dataset of images. While automatic writer identification/retrieval methods already provide promising results for many historical document types, papyri d… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

    Journal ref: IGS 2022. Lecture Notes in Computer Science, vol 13424. Springer, Cham

  14. arXiv:2210.09204  [pdf, other

    cs.CV

    ArtFacePoints: High-resolution Facial Landmark Detection in Paintings and Prints

    Authors: Aline Sindel, Andreas Maier, Vincent Christlein

    Abstract: Facial landmark detection plays an important role for the similarity analysis in artworks to compare portraits of the same or similar artists. With facial landmarks, portraits of different genres, such as paintings and prints, can be automatically aligned using control-point-based image registration. We propose a deep-learning-based method for facial landmark detection in high-resolution images of… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: 16 pages, 8 figures, 3 tables, accepted to VISART workshop at ECCV 2022

  15. arXiv:2208.08836  [pdf, other

    cs.CV

    A Multi-modal Registration and Visualization Software Tool for Artworks using CraquelureNet

    Authors: Aline Sindel, Andreas Maier, Vincent Christlein

    Abstract: For art investigations of paintings, multiple imaging technologies, such as visual light photography, infrared reflectography, ultraviolet fluorescence photography, and x-radiography are often used. For a pixel-wise comparison, the multi-modal images have to be registered. We present a registration and visualization software tool, that embeds a convolutional neural network to extract cross-modal f… ▽ More

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: 14 pages, 9 figures, 1 table, accepted to PatReCH 2022 Workshop at ICPR 2022

  16. arXiv:2207.10506  [pdf, other

    cs.CV

    Multi-modal Retinal Image Registration Using a Keypoint-Based Vessel Structure Aligning Network

    Authors: Aline Sindel, Bettina Hohberger, Andreas Maier, Vincent Christlein

    Abstract: In ophthalmological imaging, multiple imaging systems, such as color fundus, infrared, fluorescein angiography, optical coherence tomography (OCT) or OCT angiography, are often involved to make a diagnosis of retinal disease. Multi-modal retinal registration techniques can assist ophthalmologists by providing a pixel-based comparison of aligned vessel structures in images from different modalities… ▽ More

    Submitted 21 July, 2022; originally announced July 2022.

    Comments: 11 pages, 3 figures, 3 tables, accepted to MICCAI 2022

  17. arXiv:2206.11115  [pdf, other

    cs.CV

    ICC++: Explainable Image Retrieval for Art Historical Corpora using Image Composition Canvas

    Authors: Prathmesh Madhu, Tilman Marquart, Ronak Kosti, Dirk Suckow, Peter Bell, Andreas Maier, Vincent Christlein

    Abstract: Image compositions are helpful in the study of image structures and assist in discovering the semantics of the underlying scene portrayed across art forms and styles. With the digitization of artworks in recent years, thousands of images of a particular scene or narrative could potentially be linked together. However, manually linking this data with consistent objectiveness can be a highly challen… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

  18. arXiv:2205.14892  [pdf, other

    cs.LG cs.CV

    Exploring the Open World Using Incremental Extreme Value Machines

    Authors: Tobias Koch, Felix Liebezeit, Christian Riess, Vincent Christlein, Thomas Köhler

    Abstract: Dynamic environments require adaptive applications. One particular machine learning problem in dynamic environments is open world recognition. It characterizes a continuously changing domain where only some classes are seen in one batch of the training data and such batches can only be learned incrementally. Open world recognition is a demanding task that is, to the best of our knowledge, addresse… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

    Comments: Accepted at ICPR 2022

  19. arXiv:2204.03776  [pdf, other

    cs.CV cs.AI

    TorMentor: Deterministic dynamic-path, data augmentations with fractals

    Authors: Anguelos Nicolaou, Vincent Christlein, Edgar Riba, Jian Shi, Georg Vogeler, Mathias Seuret

    Abstract: We propose the use of fractals as a means of efficient data augmentation. Specifically, we employ plasma fractals for adapting global image augmentation transformations into continuous local transforms. We formulate the diamond square algorithm as a cascade of simple convolution operations allowing efficient computation of plasma fractals on the GPU. We present the TorMentor image augmentation fra… ▽ More

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: Accepted at ECV 2022 CVPR workshop

  20. arXiv:2202.03540  [pdf, other

    cs.CV

    SliTraNet: Automatic Detection of Slide Transitions in Lecture Videos using Convolutional Neural Networks

    Authors: Aline Sindel, Abner Hernandez, Seung Hee Yang, Vincent Christlein, Andreas Maier

    Abstract: With the increasing number of online learning material in the web, search for specific content in lecture videos can be time consuming. Therefore, automatic slide extraction from the lecture videos can be helpful to give a brief overview of the main content and to support the students in their studies. For this task, we propose a deep learning method to detect slide transitions in lectures videos.… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

    Comments: 6 pages, 5 figures, 1 table, accepted to OAGM Workshop 2021

  21. arXiv:2201.02242  [pdf, other

    eess.IV cs.CV

    A Keypoint Detection and Description Network Based on the Vessel Structure for Multi-Modal Retinal Image Registration

    Authors: Aline Sindel, Bettina Hohberger, Sebastian Fassihi Dehcordi, Christian Mardin, Robert Lämmer, Andreas Maier, Vincent Christlein

    Abstract: Ophthalmological imaging utilizes different imaging systems, such as color fundus, infrared, fluorescein angiography, optical coherence tomography (OCT) or OCT angiography. Multiple images with different modalities or acquisition times are often analyzed for the diagnosis of retinal diseases. Automatically aligning the vessel structures in the images by means of multi-modal registration can suppor… ▽ More

    Submitted 6 January, 2022; originally announced January 2022.

    Comments: 6 pages, 4 figures, 1 table, accepted to BVM 2022

  22. arXiv:2111.03663  [pdf

    eess.IV cs.CV cs.LG

    First steps on Gamification of Lung Fluid Cells Annotations in the Flower Domain

    Authors: Sonja Kunzmann, Christian Marzahl, Felix Denzinger, Christof A. Bertram, Robert Klopfleisch, Katharina Breininger, Vincent Christlein, Andreas Maier

    Abstract: Annotating data, especially in the medical domain, requires expert knowledge and a lot of effort. This limits the amount and/or usefulness of available medical data sets for experimentation. Therefore, develo** strategies to increase the number of annotations while lowering the needed domain knowledge is of interest. A possible strategy is the use of gamification, i.e. transforming the annotatio… ▽ More

    Submitted 17 January, 2022; v1 submitted 5 November, 2021; originally announced November 2021.

    Comments: 6 pages, 4 figures

  23. arXiv:2108.13640  [pdf, other

    cs.CV

    Module-Power Prediction from PL Measurements using Deep Learning

    Authors: Mathis Hoffmann, Johannes Hepp, Bernd Doll, Claudia Buerhop-Lutz, Ian Marius Peters, Christoph Brabec, Andreas Maier, Vincent Christlein

    Abstract: The individual causes for power loss of photovoltaic modules are investigated for quite some time. Recently, it has been shown that the power loss of a module is, for example, related to the fraction of inactive areas. While these areas can be easily identified from electroluminescense (EL) images, this is much harder for photoluminescence (PL) images. With this work, we close the gap between powe… ▽ More

    Submitted 31 August, 2021; originally announced August 2021.

  24. SmartPatch: Improving Handwritten Word Imitation with Patch Discriminators

    Authors: Alexander Mattick, Martin Mayr, Mathias Seuret, Andreas Maier, Vincent Christlein

    Abstract: As of recent generative adversarial networks have allowed for big leaps in the realism of generated images in diverse domains, not the least of which being handwritten text generation. The generation of realistic-looking hand-written text is important because it can be used for data augmentation in handwritten text recognition (HTR) systems or human-computer interaction. We propose SmartPatch, a n… ▽ More

    Submitted 21 May, 2021; originally announced May 2021.

    Comments: to be published the in 16th International Conference on Document Analysis and Recognition 2021 ICDAR

  25. How Will Your Tweet Be Received? Predicting the Sentiment Polarity of Tweet Replies

    Authors: Soroosh Tayebi Arasteh, Mehrpad Monajem, Vincent Christlein, Philipp Heinrich, Anguelos Nicolaou, Hamidreza Naderi Boldaji, Mahshad Lotfinia, Stefan Evert

    Abstract: Twitter sentiment analysis, which often focuses on predicting the polarity of tweets, has attracted increasing attention over the last years, in particular with the rise of deep learning (DL). In this paper, we propose a new task: predicting the predominant sentiment among (first-order) replies to a given tweet. Therefore, we created RETWEET, a large dataset of tweets and replies manually annotate… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.

    Comments: Published in 2021 IEEE 15th International Conference on Semantic Computing (ICSC)

    Journal ref: 2021 IEEE 15th International Conference on Semantic Computing (ICSC), Laguna Hills, CA, USA, 2021, pp. 356-359

  26. arXiv:2103.08562  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Deep Learning-based Patient Re-identification Is able to Exploit the Biometric Nature of Medical Chest X-ray Data

    Authors: Kai Packhäuser, Sebastian Gündel, Nicolas Münster, Christopher Syben, Vincent Christlein, Andreas Maier

    Abstract: With the rise and ever-increasing potential of deep learning techniques in recent years, publicly available medical datasets became a key factor to enable reproducible development of diagnostic algorithms in the medical domain. Medical data contains sensitive patient-related information and is therefore usually anonymized by removing patient identifiers, e.g., patient names before publication. To… ▽ More

    Submitted 2 September, 2022; v1 submitted 15 March, 2021; originally announced March 2021.

    Comments: Published in Scientific Reports

    Journal ref: Scientific Reports, 12, Article number: 14851 (2022)

  27. Pixel-wise Distance Regression for Glacier Calving Front Detection and Segmentation

    Authors: Amirabbas Davari, Christoph Baller, Thorsten Seehaus, Matthias Braun, Andreas Maier, Vincent Christlein

    Abstract: Glacier calving front position (CFP) is an important glaciological variable. Traditionally, delineating the CFPs has been carried out manually, which was subjective, tedious and expensive. Automating this process is crucial for continuously monitoring the evolution and status of glaciers. Recently, deep learning approaches have been investigated for this application. However, the current methods g… ▽ More

    Submitted 9 March, 2021; originally announced March 2021.

  28. On Mathews Correlation Coefficient and Improved Distance Map Loss for Automatic Glacier Calving Front Segmentation in SAR Imagery

    Authors: Amirabbas Davari, Saahil Islam, Thorsten Seehaus, Matthias Braun, Andreas Maier, Vincent Christlein

    Abstract: The vast majority of the outlet glaciers and ice streams of the polar ice sheets end in the ocean. Ice mass loss via calving of the glaciers into the ocean has increased over the last few decades. Information on the temporal variability of the calving front position provides fundamental information on the state of the glacier and ice stream, which can be exploited as calibration and validation dat… ▽ More

    Submitted 9 March, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

  29. arXiv:2101.03252  [pdf, other

    cs.LG cs.CV

    Synthetic Glacier SAR Image Generation from Arbitrary Masks Using Pix2Pix Algorithm

    Authors: Rosanna Dietrich-Sussner, Amirabbas Davari, Thorsten Seehaus, Matthias Braun, Vincent Christlein, Andreas Maier, Christian Riess

    Abstract: Supervised machine learning requires a large amount of labeled data to achieve proper test results. However, generating accurately labeled segmentation maps on remote sensing imagery, including images from synthetic aperture radar (SAR), is tedious and highly subjective. In this work, we propose to alleviate the issue of limited training data by generating synthetic SAR images with the pix2pix alg… ▽ More

    Submitted 14 January, 2021; v1 submitted 8 January, 2021; originally announced January 2021.

  30. arXiv:2101.03249  [pdf, other

    cs.LG cs.CV

    Bayesian U-Net for Segmenting Glaciers in SAR Imagery

    Authors: Andreas Hartmann, Amirabbas Davari, Thorsten Seehaus, Matthias Braun, Andreas Maier, Vincent Christlein

    Abstract: Fluctuations of the glacier calving front have an important influence over the ice flow of whole glacier systems. It is therefore important to precisely monitor the position of the calving front. However, the manual delineation of SAR images is a difficult, laborious and subjective task. Convolutional neural networks have previously shown promising results in automating the glacier segmentation in… ▽ More

    Submitted 4 May, 2021; v1 submitted 8 January, 2021; originally announced January 2021.

  31. arXiv:2101.03247  [pdf, other

    cs.LG cs.CV

    Glacier Calving Front Segmentation Using Attention U-Net

    Authors: Michael Holzmann, Amirabbas Davari, Thorsten Seehaus, Matthias Braun, Andreas Maier, Vincent Christlein

    Abstract: An essential climate variable to determine the tidewater glacier status is the location of the calving front position and the separation of seasonal variability from long-term trends. Previous studies have proposed deep learning-based methods to semi-automatically delineate the calving fronts of tidewater glaciers. They used U-Net to segment the ice and non-ice regions and extracted the calving fr… ▽ More

    Submitted 8 January, 2021; originally announced January 2021.

  32. Enhancing Human Pose Estimation in Ancient Vase Paintings via Perceptually-grounded Style Transfer Learning

    Authors: Prathmesh Madhu, Angel Villar-Corrales, Ronak Kosti, Torsten Bendschus, Corinna Reinhardt, Peter Bell, Andreas Maier, Vincent Christlein

    Abstract: Human pose estimation (HPE) is a central part of understanding the visual narration and body movements of characters depicted in artwork collections, such as Greek vase paintings. Unfortunately, existing HPE methods do not generalise well across domains resulting in poorly recognized poses. Therefore, we propose a two step approach: (1) adapting a dataset of natural images of known person and pose… ▽ More

    Submitted 25 February, 2024; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: Link to the repository containing the code to reproduce the experiments. For further details, please read the README. Link: https://anonymous.4open.science/r/3b1bd8ac-bd3a-4df6-8671-56d4f9bdbd8d/

    Journal ref: J. Comput. Cult. Herit. 16, 1, Article 16 (March 2023), 17 pages

  33. Joint Super-Resolution and Rectification for Solar Cell Inspection

    Authors: Mathis Hoffmann, Thomas Köhler, Bernd Doll, Frank Schebesch, Florian Talkenberg, Ian Marius Peters, Christoph J. Brabec, Andreas Maier, Vincent Christlein

    Abstract: Visual inspection of solar modules is an important monitoring facility in photovoltaic power plants. Since a single measurement of fast CMOS sensors is limited in spatial resolution and often not sufficient to reliably detect small defects, we apply multi-frame super-resolution (MFSR) to a sequence of low resolution measurements. In addition, the rectification and removal of lens distortion simpli… ▽ More

    Submitted 7 April, 2021; v1 submitted 10 November, 2020; originally announced November 2020.

  34. arXiv:2010.10197  [pdf, other

    cs.CV

    ICFHR 2020 Competition on Image Retrieval for Historical Handwritten Fragments

    Authors: Mathias Seuret, Anguelos Nicolaou, Dominique Stutzmann, Andreas Maier, Vincent Christlein

    Abstract: This competition succeeds upon a line of competitions for writer and style analysis of historical document images. In particular, we investigate the performance of large-scale retrieval of historical document fragments in terms of style and writer identification. The analysis of historic fragments is a difficult challenge commonly solved by trained humanists. In comparison to previous competitions… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

    Comments: ICFHR 2020

  35. arXiv:2009.14712  [pdf, other

    cs.CV cs.LG eess.IV

    Deep Learning-based Pipeline for Module Power Prediction from EL Measurements

    Authors: Mathis Hoffmann, Claudia Buerhop-Lutz, Luca Reeb, Tobias Pickel, Thilo Winkler, Bernd Doll, Tobias Würfl, Ian Marius Peters, Christoph Brabec, Andreas Maier, Vincent Christlein

    Abstract: Automated inspection plays an important role in monitoring large-scale photovoltaic power plants. Commonly, electroluminescense measurements are used to identify various types of defects on solar modules but have not been used to determine the power of a module. However, knowledge of the power at maximum power point is important as well, since drops in the power of a single module can affect the p… ▽ More

    Submitted 26 November, 2020; v1 submitted 30 September, 2020; originally announced September 2020.

  36. arXiv:2009.03807  [pdf, other

    cs.CV eess.IV

    Understanding Compositional Structures in Art Historical Images using Pose and Gaze Priors

    Authors: Prathmesh Madhu, Tilman Marquart, Ronak Kosti, Peter Bell, Andreas Maier, Vincent Christlein

    Abstract: Image compositions as a tool for analysis of artworks is of extreme significance for art historians. These compositions are useful in analyzing the interactions in an image to study artists and their artworks. Max Imdahl in his work called Ikonik, along with other prominent art historians of the 20th century, underlined the aesthetic and semantic importance of the structural composition of an imag… ▽ More

    Submitted 8 September, 2020; originally announced September 2020.

    Comments: To be Published in ECCV 2020 Workshops (VISART V)

  37. arXiv:2007.07943  [pdf, other

    cs.CV eess.IV

    The Notary in the Haystack -- Countering Class Imbalance in Document Processing with CNNs

    Authors: Martin Leipert, Georg Vogeler, Mathias Seuret, Andreas Maier, Vincent Christlein

    Abstract: Notarial instruments are a category of documents. A notarial instrument can be distinguished from other documents by its notary sign, a prominent symbol in the certificate, which also allows to identify the document's issuer. Naturally, notarial instruments are underrepresented in regard to other documents. This makes a classification difficult because class imbalance in training data worsens the… ▽ More

    Submitted 15 July, 2020; originally announced July 2020.

    Comments: Accepted at DAS Workshop 2020

  38. arXiv:2007.07690  [pdf, other

    cs.CV

    Proof of Concept: Automatic Type Recognition

    Authors: Vincent Christlein, Nikolaus Weichselbaumer, Saskia Limbach, Mathias Seuret

    Abstract: The type used to print an early modern book can give scholars valuable information about the time and place of its production as well as its producer. Recognizing such type is currently done manually using both the character shapes of `M' or `Qu' and the size of the total type to look it up in a large reference work. This is a reliable method, but it is also slow and requires specific skills. We i… ▽ More

    Submitted 20 October, 2020; v1 submitted 15 July, 2020; originally announced July 2020.

    Comments: InfDH 2020

  39. arXiv:2007.07101  [pdf, other

    cs.CV

    Re-ranking for Writer Identification and Writer Retrieval

    Authors: Simon Jordan, Mathias Seuret, Pavel Král, Ladislav Lenc, Jiří Martínek, Barbara Wiermann, Tobias Schwinger, Andreas Maier, Vincent Christlein

    Abstract: Automatic writer identification is a common problem in document analysis. State-of-the-art methods typically focus on the feature extraction step with traditional or deep-learning-based techniques. In retrieval problems, re-ranking is a commonly used technique to improve the results. Re-ranking refines an initial ranking result by using the knowledge contained in the ranked result, e. g., by explo… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

  40. Recognizing Characters in Art History Using Deep Learning

    Authors: Prathmesh Madhu, Ronak Kosti, Lara Mührenberg, Peter Bell, Andreas Maier, Vincent Christlein

    Abstract: In the field of Art History, images of artworks and their contexts are core to understanding the underlying semantic information. However, the highly complex and sophisticated representation of these artworks makes it difficult, even for the experts, to analyze the scene. From the computer vision perspective, the task of analyzing such artworks can be divided into sub-problems by taking a bottom-u… ▽ More

    Submitted 1 April, 2020; v1 submitted 31 March, 2020; originally announced March 2020.

    ACM Class: I.5.1; I.4.8; J.5

    Journal ref: In Proceedings of the 1st Workshop on Structuring and Understanding of Multimedia heritAge Contents, pp. 15-22 (2019, October)

  41. Spatio-Temporal Handwriting Imitation

    Authors: Martin Mayr, Martin Stumpf, Anguelos Nicolaou, Mathias Seuret, Andreas Maier, Vincent Christlein

    Abstract: Most people think that their handwriting is unique and cannot be imitated by machines, especially not using completely new content. Current cursive handwriting synthesis is visually limited or needs user interaction. We show that subdividing the process into smaller subtasks makes it possible to imitate someone's handwriting with a high chance to be visually indistinguishable for humans. Therefore… ▽ More

    Submitted 16 April, 2021; v1 submitted 23 March, 2020; originally announced March 2020.

    Comments: Main paper: 16 pages, supplemental material: 7 pages

  42. arXiv:2001.11248  [pdf, other

    cs.CV

    Weakly Supervised Segmentation of Cracks on Solar Cells using Normalized Lp Norm

    Authors: Martin Mayr, Mathis Hoffmann, Andreas Maier, Vincent Christlein

    Abstract: Photovoltaic is one of the most important renewable energy sources for dealing with world-wide steadily increasing energy consumption. This raises the demand for fast and scalable automatic quality management during production and operation. However, the detection and segmentation of cracks on electroluminescence (EL) images of mono- or polycrystalline solar modules is a challenging task. In this… ▽ More

    Submitted 30 January, 2020; originally announced January 2020.

    Comments: ICIP'2019

  43. arXiv:1912.03713  [pdf, other

    cs.CV

    ICDAR 2019 Competition on Image Retrieval for Historical Handwritten Documents

    Authors: Vincent Christlein, Anguelos Nicolaou, Mathias Seuret, Dominique Stutzmann, Andreas Maier

    Abstract: This competition investigates the performance of large-scale retrieval of historical document images based on writing style. Based on large image data sets provided by cultural heritage institutions and digital libraries, providing a total of 20 000 document images representing about 10 000 writers, divided in three types: writers of (i) manuscript books, (ii) letters, (iii) charters and legal doc… ▽ More

    Submitted 8 December, 2019; originally announced December 2019.

  44. Deep Generalized Max Pooling

    Authors: Vincent Christlein, Lukas Spranger, Mathias Seuret, Anguelos Nicolaou, Pavel Král, Andreas Maier

    Abstract: Global pooling layers are an essential part of Convolutional Neural Networks (CNN). They are used to aggregate activations of spatial locations to produce a fixed-size vector in several state-of-the-art CNNs. Global average pooling or global max pooling are commonly used for converting convolutional features of variable size images to a fix-sized embedding. However, both pooling layer types are co… ▽ More

    Submitted 26 February, 2024; v1 submitted 14 August, 2019; originally announced August 2019.

    Comments: ICDAR'19 (v2: fixed Fig. 1)

    Journal ref: 2019 International Conference on Document Analysis and Recognition (ICDAR), Sydney, NSW, Australia, 2019, pp. 1090-1096

  45. Fast and robust detection of solar modules in electroluminescence images

    Authors: Mathis Hoffmann, Bernd Doll, Florian Talkenberg, Christoph J. Brabec, Andreas K. Maier, Vincent Christlein

    Abstract: Fast, non-destructive and on-site quality control tools, mainly high sensitive imaging techniques, are important to assess the reliability of photovoltaic plants. To minimize the risk of further damages and electrical yield losses, electroluminescence (EL) imaging is used to detect local defects in an early stage, which might cause future electric losses. For an automated defect recognition on EL… ▽ More

    Submitted 19 July, 2019; originally announced July 2019.

  46. Automatic Classification of Defective Photovoltaic Module Cells in Electroluminescence Images

    Authors: Sergiu Deitsch, Vincent Christlein, Stephan Berger, Claudia Buerhop-Lutz, Andreas Maier, Florian Gallwitz, Christian Riess

    Abstract: Electroluminescence (EL) imaging is a useful modality for the inspection of photovoltaic (PV) modules. EL images provide high spatial resolution, which makes it possible to detect even finest defects on the surface of PV modules. However, the analysis of EL images is typically a manual process that is expensive, time-consuming, and requires expert knowledge of many different types of defects. In t… ▽ More

    Submitted 16 March, 2019; v1 submitted 8 July, 2018; originally announced July 2018.

  47. arXiv:1806.11216  [pdf, other

    cs.CV

    Adversarial and Perceptual Refinement for Compressed Sensing MRI Reconstruction

    Authors: Maximilian Seitzer, Guang Yang, Jo Schlemper, Ozan Oktay, Tobias Würfl, Vincent Christlein, Tom Wong, Raad Mohiaddin, David Firmin, Jennifer Keegan, Daniel Rueckert, Andreas Maier

    Abstract: Deep learning approaches have shown promising performance for compressed sensing-based Magnetic Resonance Imaging. While deep neural networks trained with mean squared error (MSE) loss functions can achieve high peak signal to noise ratio, the reconstructed images are often blurry and lack sharp details, especially for higher undersampling rates. Recently, adversarial and perceptual loss functions… ▽ More

    Submitted 28 June, 2018; originally announced June 2018.

    Comments: To be published at MICCAI 2018

  48. arXiv:1806.07171  [pdf, other

    cs.CV

    Non-deterministic Behavior of Ranking-based Metrics when Evaluating Embeddings

    Authors: Anguelos Nicolaou, Sounak Dey, Vincent Christlein, Andreas Maier, Dimosthenis Karatzas

    Abstract: Embedding data into vector spaces is a very popular strategy of pattern recognition methods. When distances between embeddings are quantized, performance metrics become ambiguous. In this paper, we present an analysis of the ambiguity quantized distances introduce and provide bounds on the effect. We demonstrate that it can have a measurable effect in empirical data in state-of-the-art systems. We… ▽ More

    Submitted 20 February, 2019; v1 submitted 19 June, 2018; originally announced June 2018.

  49. arXiv:1801.09472  [pdf, other

    cs.CV cs.DL

    Hyper-Hue and EMAP on Hyperspectral Images for Supervised Layer Decomposition of Old Master Drawings

    Authors: AmirAbbas Davari, Nikolaos Sakaltras, Armin Haeberle, Sulaiman Vesal, Vincent Christlein, Andreas Maier, Christian Riess

    Abstract: Old master drawings were mostly created step by step in several layers using different materials. To art historians and restorers, examination of these layers brings various insights into the artistic work process and helps to answer questions about the object, its attribution and its authenticity. However, these layers typically overlap and are oftentimes difficult to differentiate with the unaid… ▽ More

    Submitted 28 May, 2018; v1 submitted 29 January, 2018; originally announced January 2018.

  50. arXiv:1801.04211  [pdf, ps, other

    cs.LG stat.ML

    Towards Arbitrary Noise Augmentation - Deep Learning for Sampling from Arbitrary Probability Distributions

    Authors: Felix Horger, Tobias Würfl, Vincent Christlein, Andreas Maier

    Abstract: Accurate noise modelling is important for training of deep learning reconstruction algorithms. While noise models are well known for traditional imaging techniques, the noise distribution of a novel sensor may be difficult to determine a priori. Therefore, we propose learning arbitrary noise distributions. To do so, this paper proposes a fully connected neural network model to map samples from a u… ▽ More

    Submitted 10 July, 2018; v1 submitted 12 January, 2018; originally announced January 2018.