Showing 1–12 of 12 results for author: Pietruszka, M

  1. arXiv:2305.08455  [pdf, other]

    cs.CV cs.CL cs.LG

    Document Understanding Dataset and Evaluation (DUDE)

    Authors: Jordy Van Landeghem, Rubén Tito, Łukasz Borchmann, Michał Pietruszka, Paweł Józiak, Rafał Powalski, Dawid Jurkiewicz, Mickaël Coustaty, Bertrand Ackaert, Ernest Valveny, Matthew Blaschko, Sien Moens, Tomasz Stanisławek

    Abstract: We call on the Document AI (DocAI) community to reevaluate current methodologies and embrace the challenge of creating more practically-oriented benchmarks. Document Understanding Dataset and Evaluation (DUDE) seeks to remediate the halted research progress in understanding visually-rich documents (VRDs). We present a new dataset with novelties related to types of questions, answers, and document…

    Submitted 11 September, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: Accepted at ICCV 2023

  2. arXiv:2206.04045  [pdf, other]

    cs.CL cs.LG

    STable: Table Generation Framework for Encoder-Decoder Models

    Authors: Michał Pietruszka, Michał Turski, Łukasz Borchmann, Tomasz Dwojak, Gabriela Pałka, Karolina Szyndler, Dawid Jurkiewicz, Łukasz Garncarek

    Abstract: The output structure of database-like tables, consisting of values structured in horizontal rows and vertical columns identifiable by name, can cover a wide range of NLP tasks. Following this constatation, we propose a framework for text-to-table neural models applicable to problems such as extraction of line items, joint entity and relation extraction, or knowledge base population. The permutatio…

    Submitted 12 October, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

  3. arXiv:2102.09550  [pdf, other]

    cs.CL cs.LG

    Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer

    Authors: Rafał Powalski, Łukasz Borchmann, Dawid Jurkiewicz, Tomasz Dwojak, Michał Pietruszka, Gabriela Pałka

    Abstract: We address the challenging problem of Natural Language Comprehension beyond plain-text documents by introducing the TILT neural network architecture which simultaneously learns layout information, visual features, and textual semantics. Contrary to previous approaches, we rely on a decoder capable of unifying a variety of problems involving natural language. The layout is represented as an attenti…

    Submitted 12 July, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: Accepted at ICDAR 2021

  4. arXiv:2011.03228  [pdf, other]

    cs.CL cs.IR

    From Dataset Recycling to Multi-Property Extraction and Beyond

    Authors: Tomasz Dwojak, Michał Pietruszka, Łukasz Borchmann, Jakub Chłędowski, Filip Graliński

    Abstract: This paper investigates various Transformer architectures on the WikiReading Information Extraction and Machine Reading Comprehension dataset. The proposed dual-source model outperforms the current state-of-the-art by a large margin. Next, we introduce WikiReading Recycled, a newly developed public dataset and the task of multiple property extraction. It uses the same data as WikiReading but does n…

    Submitted 6 November, 2020; originally announced November 2020.

    Comments: Accepted at CoNLL 2020; this article supersedes arXiv: 2006.08281

  5. arXiv:2010.15552  [pdf, other]

    cs.LG

    Successive Halving Top-k Operator

    Authors: Michał Pietruszka, Łukasz Borchmann, Filip Graliński

    Abstract: We propose a differentiable successive halving method of relaxing the top-k operator, rendering gradient-based optimization possible. The need to perform softmax iteratively on the entire vector of scores is avoided by using a tournament-style selection. As a result, a much better approximation of top-k with lower computational cost is achieved compared to the previous approach.

    Submitted 8 October, 2020; originally announced October 2020.

    Comments: Work in progress

  6. arXiv:2009.05169  [pdf, other]

    cs.CL cs.LG

    Sparsifying Transformer Models with Trainable Representation Pooling

    Authors: Michał Pietruszka, Łukasz Borchmann, Łukasz Garncarek

    Abstract: We propose a novel method to sparsify attention in the Transformer model by learning to select the most-informative token representations during the training process, thus focusing on the task-specific parts of an input. A reduction of quadratic time and memory complexity to sublinear was achieved due to a robust trainable top-$k$ operator. Our experiments on a challenging long document summarizat…

    Submitted 7 March, 2022; v1 submitted 10 September, 2020; originally announced September 2020.

    Comments: Accepted at ACL 2022

  7. arXiv:2006.08281  [pdf, other]

    cs.CL cs.IR

    On the Multi-Property Extraction and Beyond

    Authors: Tomasz Dwojak, Michał Pietruszka, Łukasz Borchmann, Filip Graliński, Jakub Chłędowski

    Abstract: In this paper, we investigate the Dual-source Transformer architecture on the WikiReading information extraction and machine reading comprehension dataset. The proposed model outperforms the current state-of-the-art by a large margin. Next, we introduce WikiReading Recycled - a newly developed public dataset, supporting the task of multiple property extraction. It keeps the spirit of the original…

    Submitted 15 June, 2020; originally announced June 2020.

    Comments: 5 pages

  8. arXiv:1610.05100  [pdf]

    cond-mat.mtrl-sci

    Modified Bohm's theory for abstruse measurements: application to layer depth profiling by Auger spectroscopy

    Authors: Edward Rówiński, Mariusz Pietruszka

    Abstract: Modified Bohm formalism is applied to solve a problem of abstruse layer depth profiles measured by the Auger electron spectroscopy technique in real physical systems, i.e., the desorbed carbon/passive layer on NiTi substrate and the adsorbed oxygen/surface of NiTi alloy. It is shown that abstruse layer profiles may be converted to real layer structures using the modified Bohm theory, where the qua…

    Submitted 18 February, 2019; v1 submitted 17 October, 2016; originally announced October 2016.

    Comments: 6 pages, 4 figures

    Journal ref: Arch. Metall. Mater. 64(1) (2019), pp. 175-180

  9. arXiv:1508.00328  [pdf]

    q-bio.CB

    Effective diffusion rates and cross-correlation analysis of "acid growth" data

    Authors: Mariusz Pietruszka, Aleksandra Haduch-Sendecka

    Abstract: We investigated the growth-temperature relationship in plants using a quantitative perspective of a recently derived growth functional. We showed that auxin-induced growth is achieved by the diffusion rate, which is almost constant or slowly ascending in temperature while the diffusion rate of fusicoccin-induced growth increases monotonically with temperature for the entire temperature range, thou…

    Submitted 3 August, 2015; originally announced August 2015.

    Comments: 31 pages, 1 table, 10 figures, extensive SI

  10. arXiv:1506.00373  [pdf]

    q-bio.CB

    Frequency landscape of tip-growing plants

    Authors: Mariusz Pietruszka, Aleksandra Haduch-Sendecka

    Abstract: It has been interesting that nearly all of the ion activities that have been analysed thus far have exhibited oscillations that are tightly coupled to growth. Here, we present discrete Fourier transform (DFT) spectra with a finite sampling of tip-growing cells and organs that were obtained from voltage measurements of the elongating coleoptiles of maize in situ. The electromotive force (EMF) oscil…

    Submitted 28 July, 2015; v1 submitted 1 June, 2015; originally announced June 2015.

    Comments: 9 pages, 3 figures

  11. arXiv:1505.00327  [pdf]

    q-bio.CB

    pH/$T$ duality - wall properties and time evolution of plant cells

    Authors: Mariusz A. Pietruszka

    Abstract: We examined the pH/$T$ (or $μ$/$T$) duality of acidic pH and temperature ($T$) for the growth of grass shoots in order to determine the equation of state (EoS) for living plants. By considering non-meristematic growth as a dynamic series of 'state transitions' (STs) in the extending primary wall, we identified the critical (read: optimum) exponents for this phenomenon, which exhibit a singular beh…

    Submitted 27 July, 2017; v1 submitted 2 May, 2015; originally announced May 2015.

    Comments: 50 pages, 10 figures

  12. arXiv:1211.1143  [pdf, other]

    q-bio.CB cond-mat.soft physics.bio-ph

    Frustration-induced inherent instability and growth oscillations in pollen tubes

    Authors: Mariusz Pietruszka

    Abstract: In a seed plant, a pollen tube is a vessel that transports male gamete cells to an ovule to achieve fertilization. It consists of one elongated cell, which exhibits growth oscillations until it bursts, completing its function. Up till now, the mechanism behind the periodic character of the growth has not been fully understood. An attempt to understand these oscillations led us to an attractive sce…

    Submitted 31 December, 2012; v1 submitted 6 November, 2012; originally announced November 2012.

    Comments: 35 pages, 7 figures