Skip to main content

Showing 1–10 of 10 results for author: Kišš, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.00420  [pdf, other

    cs.CV cs.AI cs.LG

    Self-supervised Pre-training of Text Recognizers

    Authors: Martin Kišš, Michal Hradiš

    Abstract: In this paper, we investigate self-supervised pre-training methods for document text recognition. Nowadays, large unlabeled datasets can be collected for many research tasks, including text recognition, but it is costly to annotate them. Therefore, methods utilizing unlabeled data are researched. We study self-supervised pre-training methods based on masked label prediction using three different a… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 18 pages, 6 figures, 4 tables, accepted to ICDAR24

  2. arXiv:2308.11511  [pdf, other

    cs.LG

    Mode Combinability: Exploring Convex Combinations of Permutation Aligned Models

    Authors: Adrián Csiszárik, Melinda F. Kiss, Péter Kőrösi-Szabó, Márton Muntag, Gergely Papp, Dániel Varga

    Abstract: We explore element-wise convex combinations of two permutation-aligned neural network parameter vectors $Θ_A$ and $Θ_B$ of size $d$. We conduct extensive experiments by examining various distributions of such model combinations parametrized by elements of the hypercube $[0,1]^{d}$ and its vicinity. Our findings reveal that broad regions of the hypercube form surfaces of low loss values, indicating… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

  3. 2DeteCT -- A large 2D expandable, trainable, experimental Computed Tomography dataset for machine learning

    Authors: Maximilian B. Kiss, Sophia B. Coban, K. Joost Batenburg, Tristan van Leeuwen, Felix Lucka

    Abstract: Recent research in computational imaging largely focuses on develo** machine learning (ML) techniques for image reconstruction, which requires large-scale training datasets consisting of measurement data and ground-truth images. However, suitable experimental datasets for X-ray Computed Tomography (CT) are scarce, and methods are often developed and evaluated only on simulated data. We fill this… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Journal ref: Scientific Data 10, 576 (2023)

  4. arXiv:2302.06318  [pdf, other

    cs.CV

    Towards Writing Style Adaptation in Handwriting Recognition

    Authors: Jan Kohút, Michal Hradiš, Martin Kišš

    Abstract: One of the challenges of handwriting recognition is to transcribe a large number of vastly different writing styles. State-of-the-art approaches do not explicitly use information about the writer's style, which may be limiting overall accuracy due to various ambiguities. We explore models with writer-dependent parameters which take the writer's identity as an additional input. The proposed models… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: Submitted to ICDAR2023 conference

  5. arXiv:2212.02135  [pdf, other

    cs.LG cs.CV

    SoftCTC -- Semi-Supervised Learning for Text Recognition using Soft Pseudo-Labels

    Authors: Martin Kišš, Michal Hradiš, Karel Beneš, Petr Buchal, Michal Kula

    Abstract: This paper explores semi-supervised training for sequence tasks, such as Optical Character Recognition or Automatic Speech Recognition. We propose a novel loss function $\unicode{x2013}$ SoftCTC $\unicode{x2013}$ which is an extension of CTC allowing to consider multiple transcription variants at the same time. This allows to omit the confidence based filtering step which is otherwise a crucial co… ▽ More

    Submitted 19 September, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: 21 pages, 8 figures, 6 tables, accepted to International Journal on Document Analysis and Recognition (IJDAR)

    MSC Class: 68T07; 68T10

  6. arXiv:2201.09575  [pdf, other

    cs.CV

    Importance of Textlines in Historical Document Classification

    Authors: Martin Kišš, Jan Kohút, Karel Beneš, Michal Hradiš

    Abstract: This paper describes a system prepared at Brno University of Technology for ICDAR 2021 Competition on Historical Document Classification, experiments leading to its design, and the main findings. The solved tasks include script and font classification, document origin localization, and dating. We combined patch-level and line-level approaches, where the line-level system utilizes an existing, publ… ▽ More

    Submitted 30 March, 2022; v1 submitted 24 January, 2022; originally announced January 2022.

    Comments: 13 pages, 7 figures, 5 tables

    MSC Class: 68T07; 68T10

  7. AT-ST: Self-Training Adaptation Strategy for OCR in Domains with Limited Transcriptions

    Authors: Martin Kišš, Karel Beneš, Michal Hradiš

    Abstract: This paper addresses text recognition for domains with limited manual annotations by a simple self-training strategy. Our approach should reduce human annotation effort when target domain data is plentiful, such as when transcribing a collection of single person's correspondence or a large manuscript. We propose to train a seed system on large scale data from related domains mixed with available a… ▽ More

    Submitted 27 April, 2021; originally announced April 2021.

    Comments: 15 pages, 6 figures, 5 tables

  8. arXiv:2009.04399  [pdf, other

    cs.CE cs.MS

    Performance Analysis of FEM Solvers on Practical Electromagnetic Problems

    Authors: Gergely Máté Kiss, Jan Kaska, Roberto André Henrique de Oliveira, Olena Rubanenko, Balázs Tóth

    Abstract: The paper presents a comparative analysis of different commercial and academic software. The comparison aims to examine how the integrated adaptive grid refinement methodologies can deal with challenging, electromagnetic-field related problems. For this comparison, two benchmark problems were examined in the paper. The first example is a solution of an L-shape domain like test problem, which has a… ▽ More

    Submitted 4 September, 2020; originally announced September 2020.

    MSC Class: G.1.10 ACM Class: G.1.10

  9. arXiv:1907.01307  [pdf, other

    cs.CV

    Brno Mobile OCR Dataset

    Authors: Martin Kišš, Michal Hradiš, Oldřich Kodym

    Abstract: We introduce the Brno Mobile OCR Dataset (B-MOD) for document Optical Character Recognition from low-quality images captured by handheld mobile devices. While OCR of high-quality scanned documents is a mature field where many commercial tools are available, and large datasets of text in the wild exist, no existing datasets can be used to develop and test document OCR methods robust to non-uniform… ▽ More

    Submitted 2 July, 2019; originally announced July 2019.

  10. arXiv:1210.0330  [pdf

    q-bio.MN cond-mat.dis-nn cs.SI nlin.AO physics.bio-ph

    Structure and dynamics of molecular networks: A novel paradigm of drug discovery. A comprehensive review

    Authors: Peter Csermely, Tamas Korcsmaros, Huba J. M. Kiss, Gabor London, Ruth Nussinov

    Abstract: Despite considerable progress in genome- and proteome-based high-throughput screening methods and in rational drug design, the increase in approved drugs in the past decade did not match the increase of drug development costs. Network description and analysis not only give a systems-level understanding of drug action and disease complexity, but can also help to improve the efficiency of drug desig… ▽ More

    Submitted 11 May, 2013; v1 submitted 1 October, 2012; originally announced October 2012.

    Comments: 76 pages, 23 Figures, 12 Tables and 1270 references

    Journal ref: Pharmacology and Therapeutics 138:333-408 (2013)