UCxn: Typologically Informed Annotation of Constructions Atop Universal Dependencies
Authors:
Leonie Weissweiler,
Nina Böbel,
Kirian Guiller,
Santiago Herrera,
Wesley Scivetti,
Arthur Lorenzi,
Nurit Melnik,
Archna Bhatia,
Hinrich Schütze,
Lori Levin,
Amir Zeldes,
Joakim Nivre,
William Croft,
Nathan Schneider
Abstract:
The Universal Dependencies (UD) project has created an invaluable collection of treebanks with contributions in over 140 languages. However, the UD annotations do not tell the full story. Grammatical constructions that convey meaning through a particular combination of several morphosyntactic elements -- for example, interrogative sentences with special markers and/or word orders -- are not labele…
▽ More
The Universal Dependencies (UD) project has created an invaluable collection of treebanks with contributions in over 140 languages. However, the UD annotations do not tell the full story. Grammatical constructions that convey meaning through a particular combination of several morphosyntactic elements -- for example, interrogative sentences with special markers and/or word orders -- are not labeled holistically. We argue for (i) augmenting UD annotations with a 'UCxn' annotation layer for such meaning-bearing grammatical constructions, and (ii) approaching this in a typologically informed way so that morphosyntactic strategies can be compared across languages. As a case study, we consider five construction families in ten languages, identifying instances of each construction in UD treebanks through the use of morphosyntactic patterns. In addition to findings regarding these particular constructions, our study yields important insights on methodology for describing and identifying constructions in language-general and language-particular ways, and lays the foundation for future constructional enrichment of UD treebanks.
△ Less
Submitted 26 March, 2024;
originally announced March 2024.
Lutma: a Frame-Making Tool for Collaborative FrameNet Development
Authors:
Tiago Timponi Torrent,
Arthur Lorenzi,
Ely Edison da Silva Matos,
Frederico Belcavello,
Marcelo Viridiano,
Maucha Andrade Gamonal
Abstract:
This paper presents Lutma, a collaborative, semi-constrained, tutorial-based tool for contributing frames and lexical units to the Global FrameNet initiative. The tool parameterizes the process of frame creation, avoiding consistency violations and promoting the integration of frames contributed by the community with existing frames. Lutma is structured in a wizard-like fashion so as to provide us…
▽ More
This paper presents Lutma, a collaborative, semi-constrained, tutorial-based tool for contributing frames and lexical units to the Global FrameNet initiative. The tool parameterizes the process of frame creation, avoiding consistency violations and promoting the integration of frames contributed by the community with existing frames. Lutma is structured in a wizard-like fashion so as to provide users with text and video tutorials relevant for each step in the frame creation process. We argue that this tool will allow for a sensible expansion of FrameNet coverage in terms of both languages and cultural perspectives encoded by them, positioning frames as a viable alternative for representing perspective in language models.
△ Less
Submitted 24 May, 2022;
originally announced May 2022.