Skip to main content

Showing 1–15 of 15 results for author: Akbik, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.07609  [pdf, other

    cs.CL cs.AI cs.LG

    NoiseBench: Benchmarking the Impact of Real Label Noise on Named Entity Recognition

    Authors: Elena Merdjanovska, Ansar Aynetdinov, Alan Akbik

    Abstract: Available training data for named entity recognition (NER) often contains a significant percentage of incorrect labels for entity types and entity boundaries. Such label noise poses challenges for supervised learning and may significantly deteriorate model quality. To address this, prior work proposed various noise-robust learning approaches capable of learning from data with partially incorrect l… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: data available at https://github.com/elenamer/NoiseBench

  2. arXiv:2404.18766  [pdf, other

    cs.AI

    PECC: Problem Extraction and Coding Challenges

    Authors: Patrick Haller, Jonas Golde, Alan Akbik

    Abstract: Recent advancements in large language models (LLMs) have showcased their exceptional abilities across various tasks, such as code generation, problem-solving and reasoning. Existing benchmarks evaluate tasks in isolation, yet the extent to which LLMs can understand prose-style tasks, identify the underlying problems, and then generate appropriate code solutions is still unexplored. Addressing this… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: This paper got accepted at LREC-COLING 2024 (long)

  3. arXiv:2404.04113  [pdf, other

    cs.CL

    BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models

    Authors: Jacek Wiland, Max Ploner, Alan Akbik

    Abstract: Knowledge probing assesses to which degree a language model (LM) has successfully learned relational knowledge during pre-training. Probing is an inexpensive way to compare LMs of different sizes and training configurations. However, previous approaches rely on the objective function used in pre-training LMs and are thus applicable only to masked or causal LMs. As a result, comparing different typ… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: NAACL 2024

  4. arXiv:2403.15279  [pdf, other

    cs.CL cs.IR

    Fundus: A Simple-to-Use News Scraper Optimized for High Quality Extractions

    Authors: Max Dallabetta, Conrad Dobberstein, Adrian Breiding, Alan Akbik

    Abstract: This paper introduces Fundus, a user-friendly news scraper that enables users to obtain millions of high-quality news articles with just a few lines of code. Unlike existing news scrapers, we use manually crafted, bespoke content extractors that are specifically tailored to the formatting guidelines of each supported online newspaper. This allows us to optimize our scra** for quality such that r… ▽ More

    Submitted 24 June, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

    Comments: 10 pages, 4 figures, ACL 2024, for a screencast see https://www.youtube.com/watch?v=9GJExMelhdI

  5. arXiv:2403.14222  [pdf, other

    cs.CL

    Large-Scale Label Interpretation Learning for Few-Shot Named Entity Recognition

    Authors: Jonas Golde, Felix Hamborg, Alan Akbik

    Abstract: Few-shot named entity recognition (NER) detects named entities within text using only a few annotated examples. One promising line of research is to leverage natural language descriptions of each entity type: the common label PER might, for example, be verbalized as ''person entity.'' In an initial label interpretation learning phase, the model learns to interpret such verbalized descriptions of e… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 8 pages

  6. arXiv:2402.12372  [pdf, other

    cs.CL

    HunFlair2 in a cross-corpus evaluation of biomedical named entity recognition and normalization tools

    Authors: Mario Sänger, Samuele Garda, Xing David Wang, Leon Weber-Genzel, Pia Droop, Benedikt Fuchs, Alan Akbik, Ulf Leser

    Abstract: With the exponential growth of the life science literature, biomedical text mining (BTM) has become an essential technology for accelerating the extraction of insights from publications. Identifying named entities (e.g., diseases, drugs, or genes) in texts and their linkage to reference knowledge bases are crucial steps in BTM pipelines to enable information aggregation from different documents. H… ▽ More

    Submitted 20 February, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  7. arXiv:2401.17072  [pdf, other

    cs.CL

    SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity

    Authors: Ansar Aynetdinov, Alan Akbik

    Abstract: Instruction-tuned Large Language Models (LLMs) have recently showcased remarkable advancements in their ability to generate fitting responses to natural language instructions. However, many current works rely on manual evaluation to judge the quality of generated responses. Since such manual evaluation is time-consuming, it does not easily scale to the evaluation of multiple models and model varia… ▽ More

    Submitted 5 February, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

  8. arXiv:2310.16225  [pdf, other

    cs.CL cs.AI cs.LG

    CleanCoNLL: A Nearly Noise-Free Named Entity Recognition Dataset

    Authors: Susanna Rücker, Alan Akbik

    Abstract: The CoNLL-03 corpus is arguably the most well-known and utilized benchmark dataset for named entity recognition (NER). However, prior works found significant numbers of annotation errors, incompleteness, and inconsistencies in the data. This poses challenges to objectively comparing NER approaches and analyzing their errors, as current state-of-the-art models achieve F1-scores that are comparable… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 camera-ready version

  9. Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs

    Authors: Jonas Golde, Patrick Haller, Felix Hamborg, Julian Risch, Alan Akbik

    Abstract: Most NLP tasks are modeled as supervised learning and thus require labeled training data to train effective models. However, manually producing such data at sufficient quality and quantity is known to be costly and time-intensive. Current research addresses this bottleneck by exploring a novel paradigm called zero-shot learning via dataset generation. Here, a powerful LLM is prompted with a task d… ▽ More

    Submitted 2 February, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 3 Figures and 2 Tables

  10. arXiv:2309.03876  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    OpinionGPT: Modelling Explicit Biases in Instruction-Tuned LLMs

    Authors: Patrick Haller, Ansar Aynetdinov, Alan Akbik

    Abstract: Instruction-tuned Large Language Models (LLMs) have recently showcased remarkable ability to generate fitting responses to natural language instructions. However, an open research question concerns the inherent biases of trained models and their responses. For instance, if the data used to tune an LLM is dominantly written by persons with a specific political bias, we might expect generated answer… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: 6 pages, 1 figure, 3 tables

  11. arXiv:2212.00086  [pdf, other

    cs.CL

    Task-Specific Embeddings for Ante-Hoc Explainable Text Classification

    Authors: Kishaloy Halder, Josip Krapac, Alan Akbik, Anthony Brew, Matti Lyra

    Abstract: Current state-of-the-art approaches to text classification typically leverage BERT-style Transformer models with a softmax classifier, jointly fine-tuned to predict class labels of a target task. In this paper, we instead propose an alternative training objective in which we learn task-specific embeddings of text: our proposed objective learns embeddings such that all texts that share the same tar… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

  12. Medical Coding with Biomedical Transformer Ensembles and Zero/Few-shot Learning

    Authors: Angelo Ziletti, Alan Akbik, Christoph Berns, Thomas Herold, Marion Legler, Martina Viell

    Abstract: Medical coding (MC) is an essential pre-requisite for reliable data retrieval and reporting. Given a free-text reported term (RT) such as "pain of right thigh to the knee", the task is to identify the matching lowest-level term (LLT) - in this case "unilateral leg pain" - from a very large and continuously growing repository of standardized medical terms. However, automating this task is challengi… ▽ More

    Submitted 1 May, 2022; originally announced June 2022.

    Comments: NAACL-HLT 2022 Industry Track

    Journal ref: Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Track

  13. arXiv:2011.06993  [pdf, other

    cs.CL

    FLERT: Document-Level Features for Named Entity Recognition

    Authors: Stefan Schweter, Alan Akbik

    Abstract: Current state-of-the-art approaches for named entity recognition (NER) typically consider text at the sentence-level and thus do not model information that crosses sentence boundaries. However, the use of transformer-based models for NER offers natural options for capturing document-level features. In this paper, we perform a comparative evaluation of document-level features in the two standard NE… ▽ More

    Submitted 14 May, 2021; v1 submitted 13 November, 2020; originally announced November 2020.

  14. arXiv:2008.07347  [pdf, other

    cs.CL

    HunFlair: An Easy-to-Use Tool for State-of-the-Art Biomedical Named Entity Recognition

    Authors: Leon Weber, Mario Sänger, Jannes Münchmeyer, Maryam Habibi, Ulf Leser, Alan Akbik

    Abstract: Summary: Named Entity Recognition (NER) is an important step in biomedical information extraction pipelines. Tools for NER should be easy to use, cover multiple entity types, highly accurate, and robust towards variations in text genre and style. To this end, we propose HunFlair, an NER tagger covering multiple entity types integrated into the widely used NLP framework Flair. HunFlair outperforms… ▽ More

    Submitted 18 August, 2020; v1 submitted 17 August, 2020; originally announced August 2020.

    Comments: - Corrected author list - Updated project link

  15. arXiv:1803.03665  [pdf, other

    cs.CL cs.LG

    Syntax-Aware Language Modeling with Recurrent Neural Networks

    Authors: Duncan Blythe, Alan Akbik, Roland Vollgraf

    Abstract: Neural language models (LMs) are typically trained using only lexical features, such as surface forms of words. In this paper, we argue this deprives the LM of crucial syntactic signals that can be detected at high confidence using existing parsers. We present a simple but highly effective approach for training neural LMs using both lexical and syntactic information, and a novel approach for apply… ▽ More

    Submitted 2 March, 2018; originally announced March 2018.