Skip to main content

Showing 1–5 of 5 results for author: Azulai, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.00505  [pdf, other

    cs.IR cs.LG

    KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents

    Authors: Oshri Naparstek, Roi Pony, Inbar Shapira, Foad Abo Dahood, Ophir Azulai, Yevgeny Yaroker, Nadav Rubinstein, Maksym Lysak, Peter Staar, Ahmed Nassar, Nikolaos Livathinos, Christoph Auer, Elad Amrani, Idan Friedman, Orit Prince, Yevgeny Burshtein, Adi Raz Goldfarb, Udi Barzelay

    Abstract: In recent years, the challenge of extracting information from business documents has emerged as a critical task, finding applications across numerous domains. This effort has attracted substantial interest from both industry and academy, highlighting its significance in the current technological landscape. Most datasets in this area are primarily focused on Key Information Extraction (KIE), where… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: accepted ICDAR2024

  2. arXiv:2207.01220  [pdf, other

    cs.CV cs.AI

    BusiNet -- a Light and Fast Text Detection Network for Business Documents

    Authors: Oshri Naparstek, Ophir Azulai, Daniel Rotman, Yevgeny Burshtein, Peter Staar, Udi Barzelay

    Abstract: For digitizing or indexing physical documents, Optical Character Recognition (OCR), the process of extracting textual information from scanned documents, is a vital technology. When a document is visually damaged or contains non-textual elements, existing technologies can yield poor results, as erroneous detection results can greatly affect the quality of OCR. In this paper we present a detection… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

  3. arXiv:2205.08257  [pdf, other

    cs.CV

    Detection Masking for Improved OCR on Noisy Documents

    Authors: Daniel Rotman, Ophir Azulai, Inbar Shapira, Yevgeny Burshtein, Udi Barzelay

    Abstract: Optical Character Recognition (OCR), the task of extracting textual information from scanned documents is a vital and broadly used technology for digitizing and indexing physical documents. Existing technologies perform well for clean documents, but when the document is visually degraded, or when there are non-textual elements, OCR quality can be greatly impacted, specifically due to erroneous det… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

  4. arXiv:2111.14103  [pdf, other

    cs.CV

    CHARTER: heatmap-based multi-type chart data extraction

    Authors: Joseph Shtok, Sivan Harary, Ophir Azulai, Adi Raz Goldfarb, Assaf Arbelle, Leonid Karlinsky

    Abstract: The digital conversion of information stored in documents is a great source of knowledge. In contrast to the documents text, the conversion of the embedded documents graphics, such as charts and plots, has been much less explored. We present a method and a system for end-to-end conversion of document charts into machine readable tabular data format, which can be easily stored and analyzed in the d… ▽ More

    Submitted 28 November, 2021; originally announced November 2021.

    Comments: Joseph Shtok, Sivan Harary and Leonid Karlinsky had equal contribution

    Journal ref: Document Intelligence workshop at KDD 2021 conference

  5. arXiv:2004.10141  [pdf, other

    cs.CV

    TAEN: Temporal Aware Embedding Network for Few-Shot Action Recognition

    Authors: Rami Ben-Ari, Mor Shpigel, Ophir Azulai, Udi Barzelay, Daniel Rotman

    Abstract: Classification of new class entities requires collecting and annotating hundreds or thousands of samples that is often prohibitively costly. Few-shot learning suggests learning to classify new classes using just a few examples. Only a small number of studies address the challenge of few-shot learning on spatio-temporal patterns such as videos. In this paper, we present the Temporal Aware Embedding… ▽ More

    Submitted 17 July, 2021; v1 submitted 21 April, 2020; originally announced April 2020.

    Journal ref: Published in Learning from Limited and Imperfect Data (L2ID) Workshop - CVPR 2021