Skip to main content

Showing 1–7 of 7 results for author: Pietruszka, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.08455  [pdf, other

    cs.CV cs.CL cs.LG

    Document Understanding Dataset and Evaluation (DUDE)

    Authors: Jordy Van Landeghem, Rubén Tito, Łukasz Borchmann, Michał Pietruszka, Paweł Józiak, Rafał Powalski, Dawid Jurkiewicz, Mickaël Coustaty, Bertrand Ackaert, Ernest Valveny, Matthew Blaschko, Sien Moens, Tomasz Stanisławek

    Abstract: We call on the Document AI (DocAI) community to reevaluate current methodologies and embrace the challenge of creating more practically-oriented benchmarks. Document Understanding Dataset and Evaluation (DUDE) seeks to remediate the halted research progress in understanding visually-rich documents (VRDs). We present a new dataset with novelties related to types of questions, answers, and document… ▽ More

    Submitted 11 September, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: Accepted at ICCV 2023

  2. arXiv:2206.04045  [pdf, other

    cs.CL cs.LG

    STable: Table Generation Framework for Encoder-Decoder Models

    Authors: Michał Pietruszka, Michał Turski, Łukasz Borchmann, Tomasz Dwojak, Gabriela Pałka, Karolina Szyndler, Dawid Jurkiewicz, Łukasz Garncarek

    Abstract: The output structure of database-like tables, consisting of values structured in horizontal rows and vertical columns identifiable by name, can cover a wide range of NLP tasks. Following this constatation, we propose a framework for text-to-table neural models applicable to problems such as extraction of line items, joint entity and relation extraction, or knowledge base population. The permutatio… ▽ More

    Submitted 12 October, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

  3. arXiv:2102.09550  [pdf, other

    cs.CL cs.LG

    Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer

    Authors: Rafał Powalski, Łukasz Borchmann, Dawid Jurkiewicz, Tomasz Dwojak, Michał Pietruszka, Gabriela Pałka

    Abstract: We address the challenging problem of Natural Language Comprehension beyond plain-text documents by introducing the TILT neural network architecture which simultaneously learns layout information, visual features, and textual semantics. Contrary to previous approaches, we rely on a decoder capable of unifying a variety of problems involving natural language. The layout is represented as an attenti… ▽ More

    Submitted 12 July, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: Accepted at ICDAR 2021

  4. arXiv:2011.03228  [pdf, other

    cs.CL cs.IR

    From Dataset Recycling to Multi-Property Extraction and Beyond

    Authors: Tomasz Dwojak, Michał Pietruszka, Łukasz Borchmann, Jakub Chłędowski, Filip Graliński

    Abstract: This paper investigates various Transformer architectures on the WikiReading Information Extraction and Machine Reading Comprehension dataset. The proposed dual-source model outperforms the current state-of-the-art by a large margin. Next, we introduce WikiReading Recycled-a newly developed public dataset and the task of multiple property extraction. It uses the same data as WikiReading but does n… ▽ More

    Submitted 6 November, 2020; originally announced November 2020.

    Comments: Accepted at CoNLL 2020; this article supersedes arXiv: 2006.08281

  5. arXiv:2010.15552  [pdf, other

    cs.LG

    Successive Halving Top-k Operator

    Authors: Michał Pietruszka, Łukasz Borchmann, Filip Graliński

    Abstract: We propose a differentiable successive halving method of relaxing the top-k operator, rendering gradient-based optimization possible. The need to perform softmax iteratively on the entire vector of scores is avoided by using a tournament-style selection. As a result, a much better approximation of top-k with lower computational cost is achieved compared to the previous approach.

    Submitted 8 October, 2020; originally announced October 2020.

    Comments: Work in progress

  6. arXiv:2009.05169  [pdf, other

    cs.CL cs.LG

    Sparsifying Transformer Models with Trainable Representation Pooling

    Authors: Michał Pietruszka, Łukasz Borchmann, Łukasz Garncarek

    Abstract: We propose a novel method to sparsify attention in the Transformer model by learning to select the most-informative token representations during the training process, thus focusing on the task-specific parts of an input. A reduction of quadratic time and memory complexity to sublinear was achieved due to a robust trainable top-$k$ operator. Our experiments on a challenging long document summarizat… ▽ More

    Submitted 7 March, 2022; v1 submitted 10 September, 2020; originally announced September 2020.

    Comments: Accepted at ACL 2022

  7. arXiv:2006.08281  [pdf, other

    cs.CL cs.IR

    On the Multi-Property Extraction and Beyond

    Authors: Tomasz Dwojak, Michał Pietruszka, Łukasz Borchmann, Filip Graliński, Jakub Chłędowski

    Abstract: In this paper, we investigate the Dual-source Transformer architecture on the WikiReading information extraction and machine reading comprehension dataset. The proposed model outperforms the current state-of-the-art by a large margin. Next, we introduce WikiReading Recycled - a newly developed public dataset, supporting the task of multiple property extraction. It keeps the spirit of the original… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

    Comments: 5 pages