Showing 1–2 of 2 results for author: Štěpánková, B

  1. arXiv:2306.09307

    cs.CL

    Quality and Efficiency of Manual Annotation: Pre-annotation Bias

    Authors: Marie Mikulová, Milan Straka, Jan Štěpánek, Barbora Štěpánková, Jan Hajič

    Abstract: This paper presents an analysis of annotation using automatic pre-annotation for a task of mid-level annotation complexity: dependency syntax annotation. It compares the annotation effort of annotators working from a pre-annotated version (produced by a high-accuracy parser) with that of fully manual annotation. The aim of the experiment is to judge the final annotation quality when pre-annotation…

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: Published in LREC 2022

  2. arXiv:2006.03679

    cs.CL

    Prague Dependency Treebank -- Consolidated 1.0

    Authors: Jan Hajič, Eduard Bejček, Jaroslava Hlaváčová, Marie Mikulová, Milan Straka, Jan Štěpánek, Barbora Štěpánková

    Abstract: We present a richly annotated and genre-diversified language resource, the Prague Dependency Treebank-Consolidated 1.0 (PDT-C 1.0), the purpose of which is, as has always been the case for the family of the Prague Dependency Treebanks, to serve both as training data for various types of NLP tasks and as a resource for linguistically oriented research. PDT-C 1.0 contains four different datasets of Czec…

    Submitted 5 June, 2020; originally announced June 2020.

    Comments: Accepted at LREC 2020 (Proceedings of Language Resources and Evaluation, Marseille, France)