Skip to main content

Showing 1–5 of 5 results for author: Pałka, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2304.05336  [pdf, other

    cs.CL

    Exploring the Use of Foundation Models for Named Entity Recognition and Lemmatization Tasks in Slavic Languages

    Authors: Gabriela Pałka, Artur Nowakowski

    Abstract: This paper describes Adam Mickiewicz University's (AMU) solution for the 4th Shared Task on SlavNER. The task involves the identification, categorization, and lemmatization of named entities in Slavic languages. Our approach involved exploring the use of foundation models for these tasks. In particular, we used models based on the popular BERT and T5 model architectures. Additionally, we used exte… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: Slavic NLP 2023 @ EACL 2023

  2. arXiv:2209.02962  [pdf, other

    cs.CL

    Adam Mickiewicz University at WMT 2022: NER-Assisted and Quality-Aware Neural Machine Translation

    Authors: Artur Nowakowski, Gabriela Pałka, Kamil Guttmann, Mikołaj Pokrywka

    Abstract: This paper presents Adam Mickiewicz University's (AMU) submissions to the constrained track of the WMT 2022 General MT Task. We participated in the Ukrainian $\leftrightarrow$ Czech translation directions. The systems are a weighted ensemble of four models based on the Transformer (big) architecture. The models use source factors to utilize the information about named entities present in the input… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: WMT 2022

  3. arXiv:2206.04045  [pdf, other

    cs.CL cs.LG

    STable: Table Generation Framework for Encoder-Decoder Models

    Authors: Michał Pietruszka, Michał Turski, Łukasz Borchmann, Tomasz Dwojak, Gabriela Pałka, Karolina Szyndler, Dawid Jurkiewicz, Łukasz Garncarek

    Abstract: The output structure of database-like tables, consisting of values structured in horizontal rows and vertical columns identifiable by name, can cover a wide range of NLP tasks. Following this constatation, we propose a framework for text-to-table neural models applicable to problems such as extraction of line items, joint entity and relation extraction, or knowledge base population. The permutatio… ▽ More

    Submitted 12 October, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

  4. arXiv:2102.09550  [pdf, other

    cs.CL cs.LG

    Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer

    Authors: Rafał Powalski, Łukasz Borchmann, Dawid Jurkiewicz, Tomasz Dwojak, Michał Pietruszka, Gabriela Pałka

    Abstract: We address the challenging problem of Natural Language Comprehension beyond plain-text documents by introducing the TILT neural network architecture which simultaneously learns layout information, visual features, and textual semantics. Contrary to previous approaches, we rely on a decoder capable of unifying a variety of problems involving natural language. The layout is represented as an attenti… ▽ More

    Submitted 12 July, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: Accepted at ICDAR 2021

  5. arXiv:1911.03911  [pdf, other

    cs.CL

    Contract Discovery: Dataset and a Few-Shot Semantic Retrieval Challenge with Competitive Baselines

    Authors: Łukasz Borchmann, Dawid Wiśniewski, Andrzej Gretkowski, Izabela Kosmala, Dawid Jurkiewicz, Łukasz Szałkiewicz, Gabriela Pałka, Karol Kaczmarek, Agnieszka Kaliska, Filip Graliński

    Abstract: We propose a new shared task of semantic retrieval from legal texts, in which a so-called contract discovery is to be performed, where legal clauses are extracted from documents, given a few examples of similar clauses from other legal acts. The task differs substantially from conventional NLI and shared tasks on legal information extraction (e.g., one has to identify text span instead of a single… ▽ More

    Submitted 8 October, 2020; v1 submitted 10 November, 2019; originally announced November 2019.

    Comments: Submitted to Findings of EMNLP