Showing 1–2 of 2 results for author: Krahn, K

  1. arXiv:2405.20145

    cs.CL

    Heidelberg-Boston @ SIGTYP 2024 Shared Task: Enhancing Low-Resource Language Analysis With Character-Aware Hierarchical Transformers

    Authors: Frederick Riemenschneider, Kevin Krahn

    Abstract: Historical languages present unique challenges to the NLP community, with one prominent hurdle being the limited resources available in their closed corpora. This work describes our submission to the constrained subtask of the SIGTYP 2024 shared task, focusing on PoS tagging, morphological tagging, and lemmatization for 13 historical languages. For PoS and morphological tagging we adapt a hierarch…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted for publication at the 6th Workshop on Research in Computational Linguistic Typology and Multilingual NLP (SIGTYP-WS) 2024; 11 pages, 1 figure, 9 tables

    ACM Class: I.2.7

  2. arXiv:2308.13116

    cs.CL

    Sentence Embedding Models for Ancient Greek Using Multilingual Knowledge Distillation

    Authors: Kevin Krahn, Derrick Tate, Andrew C. Lamicela

    Abstract: Contextual language models have been trained on Classical languages, including Ancient Greek and Latin, for tasks such as lemmatization, morphological tagging, part of speech tagging, authorship attribution, and detection of scribal errors. However, high-quality sentence embedding models for these historical languages are significantly more difficult to achieve due to the lack of training data. In…

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: Paper accepted for publication at the First Workshop on Ancient Language Processing (ALP) 2023; 10 pages, 3 figures, 9 tables

    ACM Class: I.2.7