Showing 1–8 of 8 results for author: Grünewald, S

Searching in archive cs.
  1. arXiv:2310.15569  [pdf, other]

    cs.CL

    MuLMS: A Multi-Layer Annotated Text Corpus for Information Extraction in the Materials Science Domain

    Authors: Timo Pierre Schrader, Matteo Finco, Stefan Grünewald, Felix Hildebrand, Annemarie Friedrich

    Abstract: Keeping track of all relevant recent publications and experimental results for a research area is a challenging task. Prior work has demonstrated the efficacy of information extraction models in various scientific areas. Recently, several datasets have been released for the yet understudied materials science domain. However, these datasets focus on sub-problems such as parsing synthesis procedures…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 17 pages, 2 figures, 28 tables, to be published in "Proceedings of the second Workshop on Information Extraction from Scientific Publications"

  2. arXiv:2307.02340  [pdf, other]

    cs.CL

    MuLMS-AZ: An Argumentative Zoning Dataset for the Materials Science Domain

    Authors: Timo Pierre Schrader, Teresa Bürkle, Sophie Henning, Sherry Tan, Matteo Finco, Stefan Grünewald, Maira Indrikova, Felix Hildebrand, Annemarie Friedrich

    Abstract: Scientific publications follow conventionalized rhetorical structures. Classifying the Argumentative Zone (AZ), e.g., identifying whether a sentence states a Motivation, a Result or Background information, has been proposed to improve processing of scholarly documents. In this work, we adapt and extend this idea to the domain of materials science research. We present and release a new dataset of 5…

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 15 pages, 2 figures, 14 tables, to be published in "Proceedings of the 4th Workshop on Computational Approaches to Discourse"

  3. arXiv:2212.07156  [pdf, other]

    cs.CL cs.AI

    MIST: a Large-Scale Annotated Resource and Neural Models for Functions of Modal Verbs in English Scientific Text

    Authors: Sophie Henning, Nicole Macher, Stefan Grünewald, Annemarie Friedrich

    Abstract: Modal verbs (e.g., "can", "should", or "must") occur highly frequently in scientific articles. Decoding their function is not straightforward: they are often used for hedging, but they may also denote abilities and restrictions. Understanding their meaning is important for various NLP tasks such as writing assistance or accurate information extraction from scientific text. To foster research on…

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: 20 pages, 7 figures. Accepted to EMNLP Findings 2022; typesetting of this version slightly differs from conference version

  4. arXiv:2109.10013  [pdf, other]

    cs.CL

    Negation-Instance Based Evaluation of End-to-End Negation Resolution

    Authors: Elizaveta Sineva, Stefan Grünewald, Annemarie Friedrich, Jonas Kuhn

    Abstract: In this paper, we revisit the task of negation resolution, which includes the subtasks of cue detection (e.g. "not", "never") and scope resolution. In the context of previous shared tasks, a variety of evaluation metrics have been proposed. Subsequent works usually use different subsets of these, including variations and custom implementations, rendering meaningful comparisons between systems diff…

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: 16 pages, 5 figures; to be published at CoNLL 2021

  5. arXiv:2106.08159  [pdf, other]

    cs.CL

    Maximum Spanning Trees Are Invariant to Temperature Scaling in Graph-based Dependency Parsing

    Authors: Stefan Grünewald

    Abstract: Modern graph-based syntactic dependency parsers operate by predicting, for each token within a sentence, a probability distribution over its possible syntactic heads (i.e., all other tokens) and then extracting a maximum spanning tree from the resulting log-probabilities. Nowadays, virtually all such parsers utilize deep neural networks and may thus be susceptible to miscalibration (in particular,…

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: 4 pages, 2 figures

  6. arXiv:2103.08955  [pdf, other]

    cs.CL

    Coordinate Constructions in English Enhanced Universal Dependencies: Analysis and Computational Modeling

    Authors: Stefan Grünewald, Prisca Piccirilli, Annemarie Friedrich

    Abstract: In this paper, we address the representation of coordinate constructions in Enhanced Universal Dependencies (UD), where relevant dependency links are propagated from conjunction heads to other conjuncts. English treebanks for enhanced UD have been created from gold basic dependencies using a heuristic rule-based converter, which propagates only core arguments. With the aim of determining which set…

    Submitted 16 March, 2021; originally announced March 2021.

    Comments: 15 pages, 2 figures; to be published at EACL 2021

  7. arXiv:2010.12699  [pdf, other]

    cs.CL

    Applying Occam's Razor to Transformer-Based Dependency Parsing: What Works, What Doesn't, and What is Really Necessary

    Authors: Stefan Grünewald, Annemarie Friedrich, Jonas Kuhn

    Abstract: The introduction of pre-trained transformer-based contextualized word embeddings has led to considerable improvements in the accuracy of graph-based parsers for frameworks such as Universal Dependencies (UD). However, previous works differ in various dimensions, including their choice of pre-trained language models and whether they use LSTM layers. With the aims of disentangling the effects of the…

    Submitted 29 July, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: 14 pages, 1 figure; camera-ready version for IWPT 2021

  8. arXiv:1806.10654  [pdf, other]

    cs.CL

    Generalized chart constraints for efficient PCFG and TAG parsing

    Authors: Stefan Grünewald, Sophie Henning, Alexander Koller

    Abstract: Chart constraints, which specify at which string positions a constituent may begin or end, have been shown to speed up chart parsers for PCFGs. We generalize chart constraints to more expressive grammar formalisms and describe a neural tagger which predicts chart constraints at very high precision. Our constraints accelerate both PCFG and TAG parsing, and combine effectively with other pruning tec…

    Submitted 27 June, 2018; originally announced June 2018.

    Journal ref: Proceedings of ACL 2018 (Short Papers)