Showing 1–6 of 6 results for author: Lysak, M

  1. arXiv:2405.00505

    cs.IR cs.LG

    KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents

    Authors: Oshri Naparstek, Roi Pony, Inbar Shapira, Foad Abo Dahood, Ophir Azulai, Yevgeny Yaroker, Nadav Rubinstein, Maksym Lysak, Peter Staar, Ahmed Nassar, Nikolaos Livathinos, Christoph Auer, Elad Amrani, Idan Friedman, Orit Prince, Yevgeny Burshtein, Adi Raz Goldfarb, Udi Barzelay

    Abstract: In recent years, the challenge of extracting information from business documents has emerged as a critical task, finding applications across numerous domains. This effort has attracted substantial interest from both industry and academia, highlighting its significance in the current technological landscape. Most datasets in this area are primarily focused on Key Information Extraction (KIE), where…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: accepted ICDAR2024

  2. ESG Accountability Made Easy: DocQA at Your Service

    Authors: Lokesh Mishra, Cesar Berrospi, Kasper Dinkla, Diego Antognini, Francesco Fusco, Benedikt Bothur, Maksym Lysak, Nikolaos Livathinos, Ahmed Nassar, Panagiotis Vagenas, Lucas Morin, Christoph Auer, Michele Dolfi, Peter Staar

    Abstract: We present Deep Search DocQA. This application enables information extraction from documents via a question-answering conversational assistant. The system integrates several technologies from different AI disciplines consisting of document conversion to machine-readable format (via computer vision), finding relevant data (via natural language processing), and formulating an eloquent response (via…

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: Accepted at the Demonstration Track of the 38th Annual AAAI Conference on Artificial Intelligence (AAAI 24)

    Journal ref: AAAI 2024, 38, 23814-23816

  3. ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents

    Authors: Christoph Auer, Ahmed Nassar, Maksym Lysak, Michele Dolfi, Nikolaos Livathinos, Peter Staar

    Abstract: Transforming documents into machine-processable representations is a challenging task due to their complex structures and variability in formats. Recovering the layout structure and content from PDF files or scanned material has remained a key problem for decades. ICDAR has a long tradition in hosting competitions to benchmark the state-of-the-art and encourage the development of novel solutions t…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: ICDAR 2023, 10 pages, 4 figures

  4. arXiv:2305.03393

    cs.CV

    Optimized Table Tokenization for Table Structure Recognition

    Authors: Maksym Lysak, Ahmed Nassar, Nikolaos Livathinos, Christoph Auer, Peter Staar

    Abstract: Extracting tables from documents is a crucial task in any document conversion pipeline. Recently, transformer-based models have demonstrated that table-structure can be recognized with impressive accuracy using Image-to-Markup-Sequence (Im2Seq) approaches. Taking only the image of a table, such models predict a sequence of tokens (e.g. in HTML, LaTeX) which represent the structure of the table. Si…

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: Accepted to ICDAR 2023, 12 pages, 6 figures

  5. arXiv:2203.01017

    cs.CV cs.LG

    TableFormer: Table Structure Understanding with Transformers

    Authors: Ahmed Nassar, Nikolaos Livathinos, Maksym Lysak, Peter Staar

    Abstract: Tables organize valuable content in a concise and compact representation. This content is extremely valuable for systems such as search engines, Knowledge Graphs, etc., since they enhance their predictive capabilities. Unfortunately, tables come in a large variety of shapes and sizes. Furthermore, they can have complex column/row-header configurations, multiline rows, different variety of separati…

    Submitted 11 March, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

  6. arXiv:2102.09395

    cs.LG cs.CV cs.IR

    Robust PDF Document Conversion Using Recurrent Neural Networks

    Authors: Nikolaos Livathinos, Cesar Berrospi, Maksym Lysak, Viktor Kuropiatnyk, Ahmed Nassar, Andre Carvalho, Michele Dolfi, Christoph Auer, Kasper Dinkla, Peter Staar

    Abstract: The number of published PDF documents has increased exponentially in recent decades. There is a growing need to make their rich content discoverable to information retrieval tools. In this paper, we present a novel approach to document structure recovery in PDF using recurrent neural networks to process the low-level PDF data representation directly, instead of relying on a visual re-interpretatio…

    Submitted 18 February, 2021; originally announced February 2021.

    Comments: 9 pages, 2 tables, 4 figures, uses aaai21.sty. Accepted at the "Thirty-Third Annual Conference on Innovative Applications of Artificial Intelligence (IAAI-21)". Received the "IAAI-21 Innovative Application Award"

    ACM Class: I.7.5; I.5.1; I.5.2; I.5.4; I.5.5; I.2.1