Skip to main content

Showing 1–5 of 5 results for author: Krizhanovsky, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2103.11859  [pdf, other

    cs.CL cs.IR

    Part of speech and gramset tagging algorithms for unknown words based on morphological dictionaries of the Veps and Karelian languages

    Authors: Andrew Krizhanovsky, Natalia Krizhanovsky, Irina Novak

    Abstract: This research devoted to the low-resource Veps and Karelian languages. Algorithms for assigning part of speech tags to words and grammatical properties to words are presented in the article. These algorithms use our morphological dictionaries, where the lemma, part of speech and a set of grammatical features (gramset) are known for each word form. The algorithms are based on the analogy hypothesis… ▽ More

    Submitted 22 March, 2021; originally announced March 2021.

    Comments: 17 pages, 4 tables, 7 figures, published in the conference proceeding

    MSC Class: 68T50 ACM Class: H.3.1; H.3.6

  2. arXiv:2006.11572  [pdf, other

    cs.CL

    SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection

    Authors: Ekaterina Vylomova, Jennifer White, Elizabeth Salesky, Sabrina J. Mielke, Shijie Wu, Edoardo Ponti, Rowan Hall Maudslay, Ran Zmigrod, Josef Valvoda, Svetlana Toldova, Francis Tyers, Elena Klyachko, Ilya Yegorov, Natalia Krizhanovsky, Paula Czarnowska, Irene Nikkarinen, Andrew Krizhanovsky, Tiago Pimentel, Lucas Torroba Hennigen, Christo Kirov, Garrett Nicolai, Adina Williams, Antonios Anastasopoulos, Hilaria Cruz, Eleanor Chodroff , et al. (3 additional authors not shown)

    Abstract: A broad goal in natural language processing (NLP) is to develop a system that has the capacity to process any natural language. Most systems, however, are developed using data from just one language such as English. The SIGMORPHON 2020 shared task on morphological reinflection aims to investigate systems' ability to generalize across typologically distinct languages, many of which are low resource… ▽ More

    Submitted 14 July, 2020; v1 submitted 20 June, 2020; originally announced June 2020.

    Comments: 39 pages, SIGMORPHON

  3. arXiv:2002.00734  [pdf

    cs.CL cs.IR

    Analysis of the quotation corpus of the Russian Wiktionary

    Authors: A. Smirnov, T. Levashova, A. Karpov, I. Kipyatkova, A. Ronzhin, A. Krizhanovsky, N. Krizhanovsky

    Abstract: The quantitative evaluation of quotations in the Russian Wiktionary was performed using the developed Wiktionary parser. It was found that the number of quotations in the dictionary is growing fast (51.5 thousands in 2011, 62 thousands in 2012). These quotations were extracted and saved in the relational database of a machine-readable dictionary. For this database, tables related to the quotations… ▽ More

    Submitted 20 January, 2020; originally announced February 2020.

    Comments: 12 pages, 3 tables, 5 figures, published in the journal (preprint)

    MSC Class: 68T50 ACM Class: H.3.3

    Journal ref: Research in Computing Science, Vol. 56, pp. 101-112, 2012

  4. arXiv:2001.04719  [pdf

    cs.IR

    Semi-automatic methods for adding words to the dictionary of VepKar corpus based on inflectional rules extracted from Wiktionary

    Authors: Natalia Krizhanovsky, Andrew Krizhanovsky

    Abstract: The article describes a technique for using English Wiktionary inflection tables for generating word forms for Veps verbs and nominals in the Open corpus of Veps and Karelian languages. The information concerning Karelian and Veps Wiktionary entries with inflection tables is given. The operating principle of the Wiktionary static and dynamic templates is explained with the use of the jogi (river)… ▽ More

    Submitted 14 January, 2020; originally announced January 2020.

    Comments: 10 pages, 1 table, 2 figures, published in the conference proceeding https://events.spbu.ru/eventsContent/events/2019/corpora/corp_sborn.pdf#page=211

    Journal ref: Corpora 2019, 24-28 June, 2019. Saint-Petersburg. P. 211-217

  5. arXiv:1805.09559  [pdf, other

    cs.IR cs.CL

    WSD algorithm based on a new method of vector-word contexts proximity calculation via epsilon-filtration

    Authors: Alexander Kirillov, Natalia Krizhanovsky, Andrew Krizhanovsky

    Abstract: The problem of word sense disambiguation (WSD) is considered in the article. Given a set of synonyms (synsets) and sentences with these synonyms. It is necessary to select the meaning of the word in the sentence automatically. 1285 sentences were tagged by experts, namely, one of the dictionary meanings was selected by experts for target words. To solve the WSD-problem, an algorithm based on a new… ▽ More

    Submitted 18 June, 2018; v1 submitted 24 May, 2018; originally announced May 2018.

    Comments: 15 pages, 1 table, 15 figures, accepted in the journal Transactions of Karelian Research Centre of the Russian Academy of Sciences

    MSC Class: 68T50 ACM Class: I.5.3; H.3.1; H.3.3

    Journal ref: Transactions of Karelian Research Centre RAS. No. 7. 2018. P. 149-163