Skip to main content

Showing 1–3 of 3 results for author: Krizhanovskaya, N

Searching in archive cs. Search in all archives.
.
  1. The Open corpus of the Veps and Karelian languages: overview and applications

    Authors: Tatyana Boyko, Nina Zaitseva, Natalia Krizhanovskaya, Andrew Krizhanovsky, Irina Novak, Nataliya Pellinen, Aleksandra Rodionova

    Abstract: A growing priority in the study of Baltic-Finnic languages of the Republic of Karelia has been the methods and tools of corpus linguistics. Since 2016, linguists, mathematicians, and programmers at the Karelian Research Centre have been working with the Open Corpus of the Veps and Karelian Languages (VepKar), which is an extension of the Veps Corpus created in 2009. The VepKar corpus comprises tex… ▽ More

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: 9 pages, 9 figures, published in the journal

    MSC Class: 68T50 ACM Class: H.3.1; H.3.6

    Journal ref: KnE Social Sciences. 7 (3). 2022. P. 29-40

  2. arXiv:2205.03608  [pdf, other

    cs.CL

    UniMorph 4.0: Universal Morphology

    Authors: Khuyagbaatar Batsuren, Omer Goldman, Salam Khalifa, Nizar Habash, Witold Kieraś, Gábor Bella, Brian Leonard, Garrett Nicolai, Kyle Gorman, Yustinus Ghanggo Ate, Maria Ryskina, Sabrina J. Mielke, Elena Budianskaya, Charbel El-Khaissi, Tiago Pimentel, Michael Gasser, William Lane, Mohit Raj, Matt Coler, Jaime Rafael Montoya Samame, Delio Siticonatzi Camaiteri, Benoît Sagot, Esaú Zumaeta Rojas, Didier López Francis, Arturo Oncevay , et al. (71 additional authors not shown)

    Abstract: The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema. This pa… ▽ More

    Submitted 19 June, 2022; v1 submitted 7 May, 2022; originally announced May 2022.

    Comments: LREC 2022; The first two authors made equal contributions

  3. arXiv:2001.11285  [pdf, other

    cs.CL

    LowResourceEval-2019: a shared task on morphological analysis for low-resource languages

    Authors: Elena Klyachko, Alexey Sorokin, Natalia Krizhanovskaya, Andrew Krizhanovsky, Galina Ryazanskaya

    Abstract: The paper describes the results of the first shared task on morphological analysis for the languages of Russia, namely, Evenki, Karelian, Selkup, and Veps. For the languages in question, only small-sized corpora are available. The tasks include morphological analysis, word form generation and morpheme segmentation. Four teams participated in the shared task. Most of them use machine-learning appro… ▽ More

    Submitted 30 January, 2020; originally announced January 2020.

    Comments: 16 pages, 4 tables, 2 figures, published in the conference proceeding

    MSC Class: 68T50

    Journal ref: Dialog 2019, Issue 18, Supplementary volume, Pp. 45-62