Skip to main content

Showing 1–2 of 2 results for author: Gawlik, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2105.01735  [pdf, other

    cs.CL cs.LG

    HerBERT: Efficiently Pretrained Transformer-based Language Model for Polish

    Authors: Robert Mroczkowski, Piotr Rybak, Alina Wróblewska, Ireneusz Gawlik

    Abstract: BERT-based models are currently used for solving nearly all Natural Language Processing (NLP) tasks and most often achieve state-of-the-art results. Therefore, the NLP community conducts extensive research on understanding these models, but above all on designing effective and efficient training procedures. Several ablation studies investigating how to train BERT-like models have been carried out,… ▽ More

    Submitted 4 May, 2021; originally announced May 2021.

    Comments: Published in Proceedings of the 8th Workshop on Balto-Slavic Natural Language Processing

  2. arXiv:2005.00630  [pdf, other

    cs.CL

    KLEJ: Comprehensive Benchmark for Polish Language Understanding

    Authors: Piotr Rybak, Robert Mroczkowski, Janusz Tracz, Ireneusz Gawlik

    Abstract: In recent years, a series of Transformer-based models unlocked major improvements in general natural language understanding (NLU) tasks. Such a fast pace of research would not be possible without general NLU benchmarks, which allow for a fair comparison of the proposed methods. However, such benchmarks are available only for a handful of languages. To alleviate this issue, we introduce a comprehen… ▽ More

    Submitted 1 May, 2020; originally announced May 2020.