Showing 1–5 of 5 results for author: Zeman, D

Searching in archive cs.

  1. arXiv:2303.12220  [pdf]

    cs.CL

    A Unified Taxonomy of Deep Syntactic Relations

    Authors: Kira Droganova, Daniel Zeman

    Abstract: This paper analyzes multiple deep-syntactic frameworks with the goal of creating a proposal for a set of universal semantic role labels. The proposal examines various theoretic linguistic perspectives and focuses on Meaning-Text Theory and Functional Generative Description frameworks. For the purpose of this research, data from four languages is used -- Spanish and Catalan (Taule et al., 2011),…

    Submitted 21 March, 2023; originally announced March 2023.

  2. arXiv:2209.07841  [pdf, other]

    cs.CL

    Findings of the Shared Task on Multilingual Coreference Resolution

    Authors: Zdeněk Žabokrtský, Miloslav Konopík, Anna Nedoluzhko, Michal Novák, Maciej Ogrodniczuk, Martin Popel, Ondřej Pražák, Jakub Sido, Daniel Zeman, Yilun Zhu

    Abstract: This paper presents an overview of the shared task on multilingual coreference resolution associated with the CRAC 2022 workshop. Shared task participants were supposed to develop trainable systems capable of identifying mentions and clustering them according to identity coreference. The public edition of CorefUD 1.0, which contains 13 datasets for 10 languages, was used as the source of training…

    Submitted 16 September, 2022; originally announced September 2022.

  3. Predicting Typological Features in WALS using Language Embeddings and Conditional Probabilities: ÚFAL Submission to the SIGTYP 2020 Shared Task

    Authors: Martin Vastl, Daniel Zeman, Rudolf Rosa

    Abstract: We present our submission to the SIGTYP 2020 Shared Task on the prediction of typological features. We submit a constrained system, predicting typological features only based on the WALS database. We investigate two approaches. The simpler of the two is a system based on estimating correlation of feature values within languages by computing conditional probabilities and mutual information. The sec…

    Submitted 8 October, 2020; originally announced October 2020.

    Journal ref: Proc. SIGTYP Workshop on Computational Research in Linguistic Typology (2020) 29-35

  4. arXiv:2004.10643  [pdf, other]

    cs.CL

    Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection

    Authors: Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Jan Hajič, Christopher D. Manning, Sampo Pyysalo, Sebastian Schuster, Francis Tyers, Daniel Zeman

    Abstract: Universal Dependencies is an open community effort to create cross-linguistically consistent treebank annotation for many languages within a dependency-based lexicalist framework. The annotation consists in a linguistically motivated word segmentation; a morphological layer comprising lemmas, universal part-of-speech tags, and standardized morphological features; and a syntactic layer focusing on…

    Submitted 22 April, 2020; originally announced April 2020.

    Comments: LREC 2020

  5. arXiv:cs/0009003  [pdf, ps, other]

    cs.CL

    Automatic Extraction of Subcategorization Frames for Czech

    Authors: Anoop Sarkar, Daniel Zeman

    Abstract: We present some novel machine learning techniques for the identification of subcategorization information for verbs in Czech. We compare three different statistical techniques applied to this problem. We show how the learning algorithm can be used to discover previously unknown subcategorization frames from the Czech Prague Dependency Treebank. The algorithm can then be used to label dependents…

    Submitted 8 September, 2000; originally announced September 2000.

    Comments: 7 pages. Another version under the name "Learning Verb Subcategorization from Corpora: Counting Frame Subsets", authors: Zeman, Sarkar, in proceedings of LREC 2000, Athens, Greece

    ACM Class: I.2.7, G.3

    Journal ref: Proceedings of the 18th International Conference on Computational Linguistics (Coling 2000), Universit