Skip to main content

Showing 1–19 of 19 results for author: Hedderich, M A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.11096  [pdf, other

    cs.CL

    The Potential and Challenges of Evaluating Attitudes, Opinions, and Values in Large Language Models

    Authors: Bolei Ma, Xinpeng Wang, Tiancheng Hu, Anna-Carolina Haensch, Michael A. Hedderich, Barbara Plank, Frauke Kreuter

    Abstract: Recent advances in Large Language Models (LLMs) have sparked wide interest in validating and comprehending the human-like cognitive-behavioral traits LLMs may have. These cognitive-behavioral traits include typically Attitudes, Opinions, Values (AOV). However, measuring AOV embedded within LLMs remains opaque, and different evaluation methods may yield different results. This has led to a lack of… ▽ More

    Submitted 1 July, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2402.17456  [pdf, other

    cs.HC cs.AI cs.CL

    A Piece of Theatre: Investigating How Teachers Design LLM Chatbots to Assist Adolescent Cyberbullying Education

    Authors: Michael A. Hedderich, Natalie N. Bazarova, Wenting Zou, Ryun Shim, Xinda Ma, Qian Yang

    Abstract: Cyberbullying harms teenagers' mental health, and teaching them upstanding intervention is crucial. Wizard-of-Oz studies show chatbots can scale up personalized and interactive cyberbullying education, but implementing such chatbots is a challenging and delicate task. We created a no-code chatbot design tool for K-12 teachers. Using large language models and prompt chaining, our tool allows teache… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  3. arXiv:2311.10920  [pdf, other

    cs.CL cs.AI

    Understanding and Mitigating Classification Errors Through Interpretable Token Patterns

    Authors: Michael A. Hedderich, Jonas Fischer, Dietrich Klakow, Jilles Vreeken

    Abstract: State-of-the-art NLP methods achieve human-like performance on many tasks, but make errors nevertheless. Characterizing these errors in easily interpretable terms gives insight into whether a classifier is prone to making systematic errors, but also gives a way to act and improve the classifier. We propose to discover those patterns of tokens that distinguish correct and erroneous predictions as t… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: Extended abstract at BlackboxNLP'23

  4. arXiv:2206.01476  [pdf, ps, other

    cs.CL

    Task-Adaptive Pre-Training for Boosting Learning With Noisy Labels: A Study on Text Classification for African Languages

    Authors: Dawei Zhu, Michael A. Hedderich, Fangzhou Zhai, David Ifeoluwa Adelani, Dietrich Klakow

    Abstract: For high-resource languages like English, text classification is a well-studied task. The performance of modern NLP models easily achieves an accuracy of more than 90% in many standard datasets for text classification in English (Xie et al., 2019; Yang et al., 2019; Zaheer et al., 2020). However, text classification in low-resource languages is still challenging due to the lack of annotated data.… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

    Comments: AfricaNLP Workshop @ ICLR2022

  5. arXiv:2205.07290  [pdf, other

    cs.CL

    Meta Self-Refinement for Robust Learning with Weak Supervision

    Authors: Dawei Zhu, Xiaoyu Shen, Michael A. Hedderich, Dietrich Klakow

    Abstract: Training deep neural networks (DNNs) under weak supervision has attracted increasing research attention as it can significantly reduce the annotation cost. However, labels from weak supervision can be noisy, and the high capacity of DNNs enables them to easily overfit the label noise, resulting in poor generalization. Recent methods leverage self-training to build noise-resistant models, in which… ▽ More

    Submitted 30 April, 2023; v1 submitted 15 May, 2022; originally announced May 2022.

    Comments: EACL 2023 (long paper)

  6. arXiv:2204.10931  [pdf, other

    cs.CL

    MCSE: Multimodal Contrastive Learning of Sentence Embeddings

    Authors: Miaoran Zhang, Marius Mosbach, David Ifeoluwa Adelani, Michael A. Hedderich, Dietrich Klakow

    Abstract: Learning semantically meaningful sentence embeddings is an open problem in natural language processing. In this work, we propose a sentence embedding learning approach that exploits both visual and textual information via a multimodal contrastive objective. Through experiments on a variety of semantic textual similarity tasks, we demonstrate that our approach consistently improves the performance… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: Accepted by NAACL 2022 main conference (short paper), 11 pages

  7. arXiv:2204.09371  [pdf, other

    cs.CL

    Is BERT Robust to Label Noise? A Study on Learning with Noisy Labels in Text Classification

    Authors: Dawei Zhu, Michael A. Hedderich, Fangzhou Zhai, David Ifeoluwa Adelani, Dietrich Klakow

    Abstract: Incorrect labels in training data occur when human annotators make mistakes or when the data is generated via weak or distant supervision. It has been shown that complex noise-handling techniques - by modeling, cleaning or filtering the noisy instances - are required to prevent models from fitting this label noise. However, we show in this work that, for text classification tasks with modern NLP m… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: Accepted at Workshop on Insights from Negative Results in NLP 2022 @ACL 2022

  8. arXiv:2107.03690  [pdf, other

    cs.LG

    Proceedings of the First Workshop on Weakly Supervised Learning (WeaSuL)

    Authors: Michael A. Hedderich, Benjamin Roth, Katharina Kann, Barbara Plank, Alex Ratner, Dietrich Klakow

    Abstract: Welcome to WeaSuL 2021, the First Workshop on Weakly Supervised Learning, co-located with ICLR 2021. In this workshop, we want to advance theory, methods and tools for allowing experts to express prior coded knowledge for automatic data annotations that can be used to train arbitrary deep neural networks for prediction. The ICLR 2021 Workshop on Weak Supervision aims at advancing methods that help… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

  9. arXiv:2102.13129  [pdf, other

    cs.CL

    ANEA: Distant Supervision for Low-Resource Named Entity Recognition

    Authors: Michael A. Hedderich, Lukas Lange, Dietrich Klakow

    Abstract: Distant supervision allows obtaining labeled training corpora for low-resource settings where only limited hand-annotated data exists. However, to be used effectively, the distant supervision must be easy to gather. In this work, we present ANEA, a tool to automatically annotate named entities in texts based on entity lists. It spans the whole pipeline from obtaining the lists to analyzing the err… ▽ More

    Submitted 15 April, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: Accepted at Practical Machine Learning For Develo** Countries @ ICLR 2021

  10. arXiv:2101.09763  [pdf, other

    cs.LG cs.CL stat.ML

    Analysing the Noise Model Error for Realistic Noisy Label Data

    Authors: Michael A. Hedderich, Dawei Zhu, Dietrich Klakow

    Abstract: Distant and weak supervision allow to obtain large amounts of labeled training data quickly and cheaply, but these automatic annotations tend to contain a high amount of errors. A popular technique to overcome the negative effects of these noisy labels is noise modelling where the underlying noise process is modelled. In this work, we study the quality of these estimated noise models from the theo… ▽ More

    Submitted 1 March, 2021; v1 submitted 24 January, 2021; originally announced January 2021.

    Comments: Accepted at AAAI 2021, additional material at https://github.com/uds-lsv/noise-estimation

  11. arXiv:2010.12309  [pdf, other

    cs.CL cs.LG

    A Survey on Recent Approaches for Natural Language Processing in Low-Resource Scenarios

    Authors: Michael A. Hedderich, Lukas Lange, Heike Adel, Jannik Strötgen, Dietrich Klakow

    Abstract: Deep neural networks and huge language models are becoming omnipresent in natural language applications. As they are known for requiring large amounts of training data, there is a growing body of work to improve the performance in low-resource settings. Motivated by the recent fundamental changes towards neural models and the popular pre-train and fine-tune paradigm, we survey promising approaches… ▽ More

    Submitted 9 April, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: Accepted at NAACL 2021

  12. arXiv:2010.03179  [pdf, other

    cs.CL cs.LG

    Transfer Learning and Distant Supervision for Multilingual Transformer Models: A Study on African Languages

    Authors: Michael A. Hedderich, David Adelani, Dawei Zhu, Jesujoba Alabi, Udia Markus, Dietrich Klakow

    Abstract: Multilingual transformer models like mBERT and XLM-RoBERTa have obtained great improvements for many NLP tasks on a variety of languages. However, recent works also showed that results from high-resource languages could not be easily transferred to realistic, low-resource scenarios. In this work, we study trends in performance for different amounts of available resources for the three African lang… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: Accepted at EMNLP'20

  13. arXiv:2010.02616  [pdf, other

    cs.CL cs.LG

    On the Interplay Between Fine-tuning and Sentence-level Probing for Linguistic Knowledge in Pre-trained Transformers

    Authors: Marius Mosbach, Anna Khokhlova, Michael A. Hedderich, Dietrich Klakow

    Abstract: Fine-tuning pre-trained contextualized embedding models has become an integral part of the NLP pipeline. At the same time, probing has emerged as a way to investigate the linguistic knowledge captured by pre-trained models. Very little is, however, understood about how fine-tuning affects the representations of pre-trained models and thereby the linguistic knowledge they encode. This paper contrib… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: Accepted at Findings of EMNLP 2020 and BlackboxNLP 2020

  14. arXiv:2006.05561  [pdf, other

    cs.LG cs.CL stat.ML

    Learning Functions to Study the Benefit of Multitask Learning

    Authors: Gabriele Bettgenhäuser, Michael A. Hedderich, Dietrich Klakow

    Abstract: We study and quantify the generalization patterns of multitask learning (MTL) models for sequence labeling tasks. MTL models are trained to optimize a set of related tasks jointly. Although multitask learning has achieved improved performance in some problems, there are also tasks that lose performance when trained together. These mixed results motivate us to study the factors that impact the perf… ▽ More

    Submitted 28 September, 2020; v1 submitted 9 June, 2020; originally announced June 2020.

  15. arXiv:2003.08370  [pdf, other

    cs.CL cs.LG

    Distant Supervision and Noisy Label Learning for Low Resource Named Entity Recognition: A Study on Hausa and Yorùbá

    Authors: David Ifeoluwa Adelani, Michael A. Hedderich, Dawei Zhu, Esther van den Berg, Dietrich Klakow

    Abstract: The lack of labeled training data has limited the development of natural language processing tools, such as named entity recognition, for many languages spoken in develo** countries. Techniques such as distant and weak supervision can be used to create labeled data in a (semi-) automatic way. Additionally, to alleviate some of the negative effects of the errors in automatic annotation, noise-han… ▽ More

    Submitted 31 March, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: Accepted to ICLR 2020 Workshop

  16. arXiv:1910.06061  [pdf, other

    cs.CL cs.LG

    Feature-Dependent Confusion Matrices for Low-Resource NER Labeling with Noisy Labels

    Authors: Lukas Lange, Michael A. Hedderich, Dietrich Klakow

    Abstract: In low-resource settings, the performance of supervised labeling models can be improved with automatically annotated or distantly supervised data, which is cheap to create but often noisy. Previous works have shown that significant improvements can be reached by injecting information about the confusion between clean and noisy labels in this additional training data into the classifier training. H… ▽ More

    Submitted 4 November, 2019; v1 submitted 14 October, 2019; originally announced October 2019.

    Comments: Published at EMNLP-IJCNLP 2019

  17. arXiv:1904.01451  [pdf, other

    cs.CL cs.LG

    Using Multi-Sense Vector Embeddings for Reverse Dictionaries

    Authors: Michael A. Hedderich, Andrew Yates, Dietrich Klakow, Gerard de Melo

    Abstract: Popular word embedding methods such as word2vec and GloVe assign a single vector representation to each word, even if a word has multiple distinct meanings. Multi-sense embeddings instead provide different vectors for each sense of a word. However, they typically cannot serve as a drop-in replacement for conventional single-sense embeddings, because the correct sense vector needs to be selected fo… ▽ More

    Submitted 2 April, 2019; originally announced April 2019.

    Comments: Accepted as long paper at the 13th International Conference on Computational Semantics (IWCS 2019)

  18. arXiv:1903.12008  [pdf, other

    cs.CL cs.LG

    Handling Noisy Labels for Robustly Learning from Self-Training Data for Low-Resource Sequence Labeling

    Authors: Debjit Paul, Mittul Singh, Michael A. Hedderich, Dietrich Klakow

    Abstract: In this paper, we address the problem of effectively self-training neural networks in a low-resource setting. Self-training is frequently used to automatically increase the amount of training data. However, in a low-resource scenario, it is less effective due to unreliable annotations created using self-labeling of unlabeled data. We propose to combine self-training with noise handling on the self… ▽ More

    Submitted 28 March, 2019; originally announced March 2019.

  19. arXiv:1807.00745  [pdf, other

    cs.LG cs.CL stat.ML

    Training a Neural Network in a Low-Resource Setting on Automatically Annotated Noisy Data

    Authors: Michael A. Hedderich, Dietrich Klakow

    Abstract: Manually labeled corpora are expensive to create and often not available for low-resource languages or domains. Automatic labeling approaches are an alternative way to obtain labeled data in a quicker and cheaper way. However, these labels often contain more errors which can deteriorate a classifier's performance when trained on this data. We propose a noise layer that is added to a neural network… ▽ More

    Submitted 22 July, 2018; v1 submitted 2 July, 2018; originally announced July 2018.

    Comments: In Proceedings of the Workshop on Deep Learning Approaches for Low-Resource NLP 2018