Search | arXiv e-print repository

doi 10.1007/978-3-030-86549-8_36

Kleister: Key Information Extraction Datasets Involving Long Documents with Complex Layouts

Authors: Tomasz Stanisławek, Filip Graliński, Anna Wróblewska, Dawid Lipiński, Agnieszka Kaliska, Paulina Rosalska, Bartosz Topolski, Przemysław Biecek

Abstract: The relevance of the Key Information Extraction (KIE) task is increasingly important in natural language processing problems. But there are still only a few well-defined problems that serve as benchmarks for solutions in this area. To bridge this gap, we introduce two new datasets (Kleister NDA and Kleister Charity). They involve a mix of scanned and born-digital long formal English-language documents. In these datasets, an NLP system is expected to find or infer various types of entities by employing both textual and structural layout features. The Kleister Charity dataset consists of 2,788 annual financial reports of charity organizations, with 61,643 unique pages and 21,612 entities to extract. The Kleister NDA dataset has 540 Non-disclosure Agreements, with 3,229 unique pages and 2,160 entities to extract. We provide several state-of-the-art baseline systems from the KIE domain (Flair, BERT, RoBERTa, LayoutLM, LAMBERT), which show that our datasets pose a strong challenge to existing models. The best model achieved an 81.77% and an 83.57% F1-score on respectively the Kleister NDA and the Kleister Charity datasets. We share the datasets to encourage progress on more in-depth and complex information extraction tasks.

Submitted 12 May, 2021; originally announced May 2021.

Comments: accepted to ICDAR 2021

Journal ref: International Conference on Document Analysis and Recognition ICDAR 2021

arXiv:2102.05897 [pdf, other]

Corner Cases for Visual Perception in Automated Driving: Some Guidance on Detection Approaches

Authors: Jasmin Breitenstein, Jan-Aike Termöhlen, Daniel Lipinski, Tim Fingscheidt

Abstract: Automated driving has become a major topic of interest not only in the active research community but also in mainstream media reports. Visual perception of such intelligent vehicles has experienced large progress in the last decade thanks to advances in deep learning techniques but some challenges still remain. One such challenge is the detection of corner cases. They are unexpected and unknown situations that occur while driving. Conventional visual perception methods are often not able to detect them because corner cases have not been witnessed during training. Hence, their detection is highly safety-critical, and detection methods can be applied to vast amounts of collected data to select suitable training data. A reliable detection of corner cases will not only further automate the data selection procedure and increase safety in autonomous driving but can thereby also affect the public acceptance of the new technology in a positive manner. In this work, we continue a previous systematization of corner cases on different levels by an extended set of examples for each level. Moreover, we group detection approaches into different categories and link them with the corner case levels. Hence, we give directions to showcase specific corner cases and basic guidelines on how to technically detect them.

Submitted 11 February, 2021; originally announced February 2021.

arXiv:2003.02356 [pdf, other]

Kleister: A novel task for Information Extraction involving Long Documents with Complex Layout

Authors: Filip Graliński, Tomasz Stanisławek, Anna Wróblewska, Dawid Lipiński, Agnieszka Kaliska, Paulina Rosalska, Bartosz Topolski, Przemysław Biecek

Abstract: State-of-the-art solutions for Natural Language Processing (NLP) are able to capture a broad range of contexts, like the sentence-level context or document-level context for short documents. But these solutions are still struggling when it comes to longer, real-world documents with the information encoded in the spatial structure of the document, such as page elements like tables, forms, headers, openings or footers; complex page layout or presence of multiple pages. To encourage progress on deeper and more complex Information Extraction (IE) we introduce a new task (named Kleister) with two new datasets. Utilizing both textual and structural layout features, an NLP system must find the most important information, about various types of entities, in long formal documents. We propose Pipeline method as a text-only baseline with different Named Entity Recognition architectures (Flair, BERT, RoBERTa). Moreover, we checked the most popular PDF processing tools for text extraction (pdf2djvu, Tesseract and Textract) in order to analyze behavior of IE system in presence of errors introduced by these tools.

Submitted 6 March, 2020; v1 submitted 4 March, 2020; originally announced March 2020.

arXiv:1902.09184 [pdf, other]

Towards Corner Case Detection for Autonomous Driving

Authors: Jan-Aike Bolte, Andreas Bär, Daniel Lipinski, Tim Fingscheidt

Abstract: The progress in autonomous driving is also due to the increased availability of vast amounts of training data for the underlying machine learning approaches. Machine learning systems are generally known to lack robustness, e.g., if the training data did rarely or not at all cover critical situations. The challenging task of corner case detection in video, which is also somehow related to unusual event or anomaly detection, aims at detecting these unusual situations, which could become critical, and to communicate this to the autonomous driving system (online use case). Such a system, however, could be also used in offline mode to screen vast amounts of data and select only the relevant situations for storing and (re)training machine learning algorithms. So far, the approaches for corner case detection have been limited to videos recorded from a fixed camera, mostly for security surveillance. In this paper, we provide a formal definition of a corner case and propose a system framework for both the online and the offline use case that can handle video signals from front cameras of a naturally moving vehicle and can output a corner case score.

Submitted 26 February, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

arXiv:1309.4332 [pdf, other]

doi 10.1007/s10236-013-0674-5

Observations on the flow structures and transport in a warm-core ring in the Gulf of Mexico

Authors: Doug Lipinski, Kamran Mohseni

Abstract: This study presents several new observations from the study of a warm-core ring (WCR) in the Gulf of Mexico based on the ECCO2 global ocean simulation. Using Lagrangian coherent structures (LCS) techniques to investigate this flow reveals a pattern of transversely intersecting LCS in the mixed layer of the WCR which experiences consistent stretching behavior over a large region of space and time. A detailed analysis of this flow region leads to an analytical model velocity field which captures the essential elements that generate the transversely intersecting LCS. The model parameters are determined from the WCR and the resulting LCS show excellent agreement with those observed in the WCR. The three-dimensional transport behavior which creates these structures relies on the small radial outflow which is present in the mixed layer and is not seen below the pycnocline, leading to a sharp change in the character of the LCS at the bottom of the mixed layer. The flow behavior revealed by the LCS limits fluid exchange between the WCR and the surrounding ocean, contributing to the long life of WCRs. Further study of these structures and their associated transport behavior may lead to further insights into the development and persistence of such geophysical vortices as well as their transport behavior.

Submitted 17 September, 2013; originally announced September 2013.

Comments: 12 pages

arXiv:1202.5236 [pdf, other]

A 3D fast algorithm for computing Lagrangian coherent structures via ridge tracking

Authors: Doug Lipinski, Kamran Mohseni

Abstract: Lagrangian coherent structures (LCS) in fluid flows appear as co-dimension one ridges of the finite time Lyapunov exponent (FTLE) field. In three dimensions this means two-dimensional ridges. A fast algorithm is presented here to locate and extract such ridge surfaces while avoiding unnecessary computations away from the LCS. This algorithm reduces the order of the computational complexity from O(1/dx^3) to about O(1/dx^2) by eliminating computations over most of the three dimensional domain and computing the FTLE only near the two-dimensional ridge surfaces. The algorithm is grid based and proofs of error bounds for ridge locations are included. The algorithm performance and error bounds are verified in several examples. The algorithm offers significant advantages in computational cost as well as later data analysis.

Submitted 23 February, 2012; originally announced February 2012.

Comments: 28 pages, 10 figures

Showing 1–6 of 6 results for author: Lipiński, D