-
KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents
Authors:
Oshri Naparstek,
Roi Pony,
Inbar Shapira,
Foad Abo Dahood,
Ophir Azulai,
Yevgeny Yaroker,
Nadav Rubinstein,
Maksym Lysak,
Peter Staar,
Ahmed Nassar,
Nikolaos Livathinos,
Christoph Auer,
Elad Amrani,
Idan Friedman,
Orit Prince,
Yevgeny Burshtein,
Adi Raz Goldfarb,
Udi Barzelay
Abstract:
In recent years, the challenge of extracting information from business documents has emerged as a critical task, finding applications across numerous domains. This effort has attracted substantial interest from both industry and academy, highlighting its significance in the current technological landscape. Most datasets in this area are primarily focused on Key Information Extraction (KIE), where…
▽ More
In recent years, the challenge of extracting information from business documents has emerged as a critical task, finding applications across numerous domains. This effort has attracted substantial interest from both industry and academy, highlighting its significance in the current technological landscape. Most datasets in this area are primarily focused on Key Information Extraction (KIE), where the extraction process revolves around extracting information using a specific, predefined set of keys. Unlike most existing datasets and benchmarks, our focus is on discovering key-value pairs (KVPs) without relying on predefined keys, navigating through an array of diverse templates and complex layouts. This task presents unique challenges, primarily due to the absence of comprehensive datasets and benchmarks tailored for non-predetermined KVP extraction. To address this gap, we introduce KVP10k , a new dataset and benchmark specifically designed for KVP extraction. The dataset contains 10707 richly annotated images. In our benchmark, we also introduce a new challenging task that combines elements of KIE as well as KVP in a single task. KVP10k sets itself apart with its extensive diversity in data and richly detailed annotations, paving the way for advancements in the field of information extraction from complex business documents.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
BusiNet -- a Light and Fast Text Detection Network for Business Documents
Authors:
Oshri Naparstek,
Ophir Azulai,
Daniel Rotman,
Yevgeny Burshtein,
Peter Staar,
Udi Barzelay
Abstract:
For digitizing or indexing physical documents, Optical Character Recognition (OCR), the process of extracting textual information from scanned documents, is a vital technology. When a document is visually damaged or contains non-textual elements, existing technologies can yield poor results, as erroneous detection results can greatly affect the quality of OCR. In this paper we present a detection…
▽ More
For digitizing or indexing physical documents, Optical Character Recognition (OCR), the process of extracting textual information from scanned documents, is a vital technology. When a document is visually damaged or contains non-textual elements, existing technologies can yield poor results, as erroneous detection results can greatly affect the quality of OCR. In this paper we present a detection network dubbed BusiNet aimed at OCR of business documents. Business documents often include sensitive information and as such they cannot be uploaded to a cloud service for OCR. BusiNet was designed to be fast and light so it could run locally preventing privacy issues. Furthermore, BusiNet is built to handle scanned document corruption and noise using a specialized synthetic dataset. The model is made robust to unseen noise by employing adversarial training strategies. We perform an evaluation on publicly available datasets demonstrating the usefulness and broad applicability of our model.
△ Less
Submitted 4 July, 2022;
originally announced July 2022.
-
Deep Multi-User Reinforcement Learning for Distributed Dynamic Spectrum Access
Authors:
Oshri Naparstek,
Kobi Cohen
Abstract:
We consider the problem of dynamic spectrum access for network utility maximization in multichannel wireless networks. The shared bandwidth is divided into K orthogonal channels. In the beginning of each time slot, each user selects a channel and transmits a packet with a certain transmission probability. After each time slot, each user that has transmitted a packet receives a local observation in…
▽ More
We consider the problem of dynamic spectrum access for network utility maximization in multichannel wireless networks. The shared bandwidth is divided into K orthogonal channels. In the beginning of each time slot, each user selects a channel and transmits a packet with a certain transmission probability. After each time slot, each user that has transmitted a packet receives a local observation indicating whether its packet was successfully delivered or not (i.e., ACK signal). The objective is a multi-user strategy for accessing the spectrum that maximizes a certain network utility in a distributed manner without online coordination or message exchanges between users. Obtaining an optimal solution for the spectrum access problem is computationally expensive in general due to the large state space and partial observability of the states. To tackle this problem, we develop a novel distributed dynamic spectrum access algorithm based on deep multi-user reinforcement leaning. Specifically, at each time slot, each user maps its current state to spectrum access actions based on a trained deep-Q network used to maximize the objective function. Game theoretic analysis of the system dynamics is developed for establishing design principles for the implementation of the algorithm. Experimental results demonstrate strong performance of the algorithm.
△ Less
Submitted 5 November, 2018; v1 submitted 9 April, 2017;
originally announced April 2017.
-
Distributed Energy Efficient Channel Allocation
Authors:
Oshri Naparstek,
S. M. Zafaruddin,
Amir Leshem,
Eduard Jorswieck
Abstract:
Design of energy efficient protocols for modern wireless systems has become an important area of research. In this paper, we propose a distributed optimization algorithm for the channel assignment problem for multiple interfering transceiver pairs that cannot communicate with each other. We first modify the auction algorithm for maximal energy efficiency and show that the problem can be solved wit…
▽ More
Design of energy efficient protocols for modern wireless systems has become an important area of research. In this paper, we propose a distributed optimization algorithm for the channel assignment problem for multiple interfering transceiver pairs that cannot communicate with each other. We first modify the auction algorithm for maximal energy efficiency and show that the problem can be solved without explicit message passing using the carrier sense multiple access (CSMA) protocols. We then develop a novel scheme by converting the channel assignment problem into perfect matchings on bipartite graphs. The proposed scheme improves the energy efficiency and does not require any explicit message passing or a shared memory between the users. We derive bounds on the convergence rate and show that the proposed algorithm converges faster than the distributed auction algorithm and achieves near-optimal performance under Rayleigh fading channels. We also present an asymptotic performance analysis of the fast matching algorithm for energy efficient resource allocation and prove the optimality for large enough number of users and number of channels. Finally, we provide numerical assessments that confirm the energy efficiency gains compared to the state of the art.
△ Less
Submitted 15 February, 2018; v1 submitted 8 January, 2014;
originally announced January 2014.
-
Expected time complexity of the auction algorithm and the push relabel algorithm for maximal bipartite matching on random graphs
Authors:
Oshri Naparstek,
Amir Leshem
Abstract:
In this paper we analyze the expected time complexity of the auction algorithm for the matching problem on random bipartite graphs. We prove that the expected time complexity of the auction algorithm for bipartite matching is $O\left(\frac{N\log^2(N)}{\log\left(Np\right)}\right)$ on sequential machines. This is equivalent to other augmenting path algorithms such as the HK algorithm. Furthermore, w…
▽ More
In this paper we analyze the expected time complexity of the auction algorithm for the matching problem on random bipartite graphs. We prove that the expected time complexity of the auction algorithm for bipartite matching is $O\left(\frac{N\log^2(N)}{\log\left(Np\right)}\right)$ on sequential machines. This is equivalent to other augmenting path algorithms such as the HK algorithm. Furthermore, we show that the algorithm can be implemented on parallel machines with $O(\log(N))$ processors and shared memory with an expected time complexity of $O(N\log(N))$.
△ Less
Submitted 31 December, 2013;
originally announced January 2014.
-
Fully distributed optimal channel assignment for open spectrum access
Authors:
Oshri Naparstek,
Amir Leshem
Abstract:
In this paper we address the problem of fully distributed assignment of users to sub-bands such that the sum-rate of the system is maximized. We introduce a modified auction algorithm that can be applied in a fully distributed way using an opportunistic CSMA assignment scheme and is $ε$ optimal. We analyze the expected time complexity of the algorithm and suggest a variant to the algorithm that…
▽ More
In this paper we address the problem of fully distributed assignment of users to sub-bands such that the sum-rate of the system is maximized. We introduce a modified auction algorithm that can be applied in a fully distributed way using an opportunistic CSMA assignment scheme and is $ε$ optimal. We analyze the expected time complexity of the algorithm and suggest a variant to the algorithm that has lower expected complexity. We then show that in the case of i.i.d Rayleigh channels a simple greedy scheme is asymptotically optimal as $\SNR$ increases or as the number of users is increased to infinity. We conclude by providing simulated results of the suggested algorithms.
△ Less
Submitted 30 December, 2013;
originally announced December 2013.