Showing 1–17 of 17 results for author: Friedrich, A

  1. arXiv:2404.18585 [pdf, other]

    cs.CL

    FREB-TQA: A Fine-Grained Robustness Evaluation Benchmark for Table Question Answering

    Authors: Wei Zhou, Mohsen Mesgar, Heike Adel, Annemarie Friedrich

    Abstract: Table Question Answering (TQA) aims at composing an answer to a question based on tabular data. While prior research has shown that TQA models lack robustness, understanding the underlying cause and nature of this issue remains predominantly unclear, posing a significant obstacle to the development of robust TQA systems. In this paper, we formalize three major desiderata for a fine-grained evaluat…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted at NAACL 2024

  2. arXiv:2404.07765 [pdf, other]

    cs.CL cs.AI cs.CR cs.LG

    AnnoCTR: A Dataset for Detecting and Linking Entities, Tactics, and Techniques in Cyber Threat Reports

    Authors: Lukas Lange, Marc Müller, Ghazaleh Haratinezhad Torbati, Dragan Milchevski, Patrick Grau, Subhash Pujari, Annemarie Friedrich

    Abstract: Monitoring the threat landscape to be aware of actual or potential attacks is of utmost importance to cybersecurity professionals. Information about cyber threats is typically distributed using natural language reports. Natural language processing can help with managing this large amount of unstructured information, yet to date, the topic has received little attention. With this paper, we present…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: Accepted at LREC-COLING 2024. Corpus available at https://github.com/boschresearch/anno-ctr-lrec-coling-2024

  3. arXiv:2312.06338 [pdf, other]

    cs.CL cs.AI

    BoschAI @ Causal News Corpus 2023: Robust Cause-Effect Span Extraction using Multi-Layer Sequence Tagging and Data Augmentation

    Authors: Timo Pierre Schrader, Simon Razniewski, Lukas Lange, Annemarie Friedrich

    Abstract: Understanding causality is a core aspect of intelligence. The Event Causality Identification with Causal News Corpus Shared Task addresses two aspects of this challenge: Subtask 1 aims at detecting causal relationships in texts, and Subtask 2 requires identifying signal words and the spans that refer to the cause or effect, respectively. Our system, which is based on pre-trained transformers, stac…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 6 pages, 6 tables, 1 figure, published in "Proceedings of the 6th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text"

  4. arXiv:2311.01907 [pdf, other]

    cs.CL

    BoschAI @ PLABA 2023: Leveraging Edit Operations in End-to-End Neural Sentence Simplification

    Authors: Valentin Knappich, Simon Razniewski, Annemarie Friedrich

    Abstract: Automatic simplification can help laypeople to comprehend complex scientific text. Language models are frequently applied to this task by translating from complex to simple language. In this paper, we describe our system based on Llama 2, which ranked first in the PLABA shared task addressing the simplification of biomedical text. We find that the large portion of shared tokens between input and o…

    Submitted 6 December, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

  5. arXiv:2310.15569 [pdf, other]

    cs.CL

    MuLMS: A Multi-Layer Annotated Text Corpus for Information Extraction in the Materials Science Domain

    Authors: Timo Pierre Schrader, Matteo Finco, Stefan Grünewald, Felix Hildebrand, Annemarie Friedrich

    Abstract: Keeping track of all relevant recent publications and experimental results for a research area is a challenging task. Prior work has demonstrated the efficacy of information extraction models in various scientific areas. Recently, several datasets have been released for the yet understudied materials science domain. However, these datasets focus on sub-problems such as parsing synthesis procedures…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 17 pages, 2 figures, 28 tables, to be published in "Proceedings of the second Workshop on Information Extraction from Scientific Publications"

  6. arXiv:2307.02340 [pdf, other]

    cs.CL

    MuLMS-AZ: An Argumentative Zoning Dataset for the Materials Science Domain

    Authors: Timo Pierre Schrader, Teresa Bürkle, Sophie Henning, Sherry Tan, Matteo Finco, Stefan Grünewald, Maira Indrikova, Felix Hildebrand, Annemarie Friedrich

    Abstract: Scientific publications follow conventionalized rhetorical structures. Classifying the Argumentative Zone (AZ), e.g., identifying whether a sentence states a Motivation, a Result or Background information, has been proposed to improve processing of scholarly documents. In this work, we adapt and extend this idea to the domain of materials science research. We present and release a new dataset of 5…

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 15 pages, 2 figures, 14 tables, to be published in "Proceedings of the 4th Workshop on Computational Approaches to Discourse"

  7. arXiv:2212.07156 [pdf, other]

    cs.CL cs.AI

    MIST: a Large-Scale Annotated Resource and Neural Models for Functions of Modal Verbs in English Scientific Text

    Authors: Sophie Henning, Nicole Macher, Stefan Grünewald, Annemarie Friedrich

    Abstract: Modal verbs (e.g., "can", "should", or "must") occur highly frequently in scientific articles. Decoding their function is not straightforward: they are often used for hedging, but they may also denote abilities and restrictions. Understanding their meaning is important for various NLP tasks such as writing assistance or accurate information extraction from scientific text. To foster research on…

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: 20 pages, 7 figures. Accepted to EMNLP Findings 2022; typesetting of this version slightly differs from conference version

  8. arXiv:2210.04675 [pdf, other]

    cs.CL cs.AI

    A Survey of Methods for Addressing Class Imbalance in Deep-Learning Based Natural Language Processing

    Authors: Sophie Henning, William Beluch, Alexander Fraser, Annemarie Friedrich

    Abstract: Many natural language processing (NLP) tasks are naturally imbalanced, as some target categories occur much more frequently than others in the real world. In such scenarios, current NLP models still tend to perform poorly on less frequent classes. Addressing class imbalance in NLP is an active research topic, yet, finding a good approach for a particular task and imbalance scenario is difficult.…

    Submitted 22 February, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: Camera-ready version for EACL 2023

  9. arXiv:2208.09012 [pdf, other]

    cs.CL

    A Kind Introduction to Lexical and Grammatical Aspect, with a Survey of Computational Approaches

    Authors: Annemarie Friedrich, Nianwen Xue, Alexis Palmer

    Abstract: Aspectual meaning refers to how the internal temporal structure of situations is presented. This includes whether a situation is described as a state or as an event, whether the situation is finished or ongoing, and whether it is viewed as a whole or with a focus on a particular phase. This survey gives an overview of computational approaches to modeling lexical and grammatical aspect along with i…

    Submitted 10 March, 2023; v1 submitted 18 August, 2022; originally announced August 2022.

    Comments: Accepted at EACL 2023, camera ready version

  10. arXiv:2204.09781 [pdf]

    cs.DL cs.CL cs.IR cs.LG

    Multi-label classification for biomedical literature: an overview of the BioCreative VII LitCovid Track for COVID-19 literature topic annotations

    Authors: Qingyu Chen, Alexis Allot, Robert Leaman, Rezarta Islamaj Doğan, Jingcheng Du, Li Fang, Kai Wang, Shuo Xu, Yuefu Zhang, Parsa Bagherzadeh, Sabine Bergler, Aakash Bhatnagar, Nidhir Bhavsar, Yung-Chun Chang, Sheng-Jie Lin, Wentai Tang, Hongtong Zhang, Ilija Tavchioski, Senja Pollak, Shubo Tian, Jinfeng Zhang, Yulia Otmakhova, Antonio Jimeno Yepes, Hang Dong, Honghan Wu, et al. (14 additional authors not shown)

    Abstract: The COVID-19 pandemic has been severely impacting global society since December 2019. Massive research has been undertaken to understand the characteristics of the virus and design vaccines and drugs. The related findings have been reported in biomedical literature at a rate of about 10,000 articles on COVID-19 per month. Such rapid growth significantly challenges manual curation and interpretatio…

    Submitted 3 June, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

  11. SCoT: Sense Clustering over Time: a tool for the analysis of lexical change

    Authors: Christian Haase, Saba Anwar, Seid Muhie Yimam, Alexander Friedrich, Chris Biemann

    Abstract: We present Sense Clustering over Time (SCoT), a novel network-based tool for analysing lexical change. SCoT represents the meanings of a word as clusters of similar words. It visualises their formation, change, and demise. There are two main approaches to the exploration of dynamic networks: the discrete one compares a series of clustered graphs from separate points in time. The continuous one ana…

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: Update of https://aclanthology.org/2021.eacl-demos.23/

    Journal ref: https://aclanthology.org/2021.eacl-demos.23/

  12. arXiv:2109.10013 [pdf, other]

    cs.CL

    Negation-Instance Based Evaluation of End-to-End Negation Resolution

    Authors: Elizaveta Sineva, Stefan Grünewald, Annemarie Friedrich, Jonas Kuhn

    Abstract: In this paper, we revisit the task of negation resolution, which includes the subtasks of cue detection (e.g. "not", "never") and scope resolution. In the context of previous shared tasks, a variety of evaluation metrics have been proposed. Subsequent works usually use different subsets of these, including variations and custom implementations, rendering meaningful comparisons between systems diff…

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: 16 pages, 5 figures; to be published at CoNLL 2021

  13. arXiv:2103.08955 [pdf, other]

    cs.CL

    Coordinate Constructions in English Enhanced Universal Dependencies: Analysis and Computational Modeling

    Authors: Stefan Grünewald, Prisca Piccirilli, Annemarie Friedrich

    Abstract: In this paper, we address the representation of coordinate constructions in Enhanced Universal Dependencies (UD), where relevant dependency links are propagated from conjunction heads to other conjuncts. English treebanks for enhanced UD have been created from gold basic dependencies using a heuristic rule-based converter, which propagates only core arguments. With the aim of determining which set…

    Submitted 16 March, 2021; originally announced March 2021.

    Comments: 15 pages, 2 figures; to be published at EACL 2021

  14. arXiv:2010.12699 [pdf, other]

    cs.CL

    Applying Occam's Razor to Transformer-Based Dependency Parsing: What Works, What Doesn't, and What is Really Necessary

    Authors: Stefan Grünewald, Annemarie Friedrich, Jonas Kuhn

    Abstract: The introduction of pre-trained transformer-based contextualized word embeddings has led to considerable improvements in the accuracy of graph-based parsers for frameworks such as Universal Dependencies (UD). However, previous works differ in various dimensions, including their choice of pre-trained language models and whether they use LSTM layers. With the aims of disentangling the effects of the…

    Submitted 29 July, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: 14 pages, 1 figure; camera-ready version for IWPT 2021

  15. arXiv:2006.03039 [pdf, other]

    cs.CL cs.LG

    The SOFC-Exp Corpus and Neural Approaches to Information Extraction in the Materials Science Domain

    Authors: Annemarie Friedrich, Heike Adel, Federico Tomazic, Johannes Hingerl, Renou Benteau, Anika Maruscyk, Lukas Lange

    Abstract: This paper presents a new challenging information extraction task in the domain of materials science. We develop an annotation scheme for marking information on experiments related to solid oxide fuel cells in scientific publications, such as involved materials and measurement conditions. With this paper, we publish our annotation guidelines, as well as our SOFC-Exp corpus consisting of 45 open-ac…

    Submitted 4 June, 2020; originally announced June 2020.

    Comments: Accepted for publication at ACL 2020

  16. arXiv:1904.04805 [pdf, other]

    cs.NE cs.AI cs.LG

    Embodied Neuromorphic Vision with Event-Driven Random Backpropagation

    Authors: Jacques Kaiser, Alexander Friedrich, J. Camilo Vasquez Tieck, Daniel Reichard, Arne Roennau, Emre Neftci, Rüdiger Dillmann

    Abstract: Spike-based communication between biological neurons is sparse and unreliable. This enables the brain to process visual information from the eyes efficiently. Taking inspiration from biology, artificial spiking neural networks coupled with silicon retinas attempt to model these computations. Recent findings in machine learning allowed the derivation of a family of powerful synaptic plasticity rule…

    Submitted 6 May, 2019; v1 submitted 9 April, 2019; originally announced April 2019.

    Comments: v2: title update, better plots and wordings. 8 pages, 9 figures, 1 table, video: https://neurorobotics-files.net/index.php/s/sBQzWFrBPoH9Dx7

  17. Coherent Multi-Sentence Video Description with Variable Level of Detail

    Authors: Anna Senina, Marcus Rohrbach, Wei Qiu, Annemarie Friedrich, Sikandar Amin, Mykhaylo Andriluka, Manfred Pinkal, Bernt Schiele

    Abstract: Humans can easily describe what they see in a coherent way and at varying level of detail. However, existing approaches for automatic video description are mainly focused on single sentence generation and produce descriptions at a fixed level of detail. In this paper, we address both of these limitations: for a variable level of detail we produce coherent multi-sentence descriptions of complex vid…

    Submitted 24 March, 2014; originally announced March 2014.

    Comments: 10 pages