
Showing 1–1 of 1 results for author: Lauc, D

  1. arXiv:2104.09243

    cs.CL

    BERTić -- The Transformer Language Model for Bosnian, Croatian, Montenegrin and Serbian

    Authors: Nikola Ljubešić, Davor Lauc

    Abstract: In this paper we describe a transformer model pre-trained on 8 billion tokens of crawled text from the Croatian, Bosnian, Serbian and Montenegrin web domains. We evaluate the model on part-of-speech tagging, named-entity recognition, geo-location prediction and commonsense causal reasoning, showing improvements on all tasks over state-of-the-art models. For commonsense rea…

    Submitted 19 April, 2021; originally announced April 2021.