Showing 1–14 of 14 results for author: Brandl, S

  1. arXiv:2403.13592  [pdf, other]

    cs.CL

    Llama meets EU: Investigating the European Political Spectrum through the Lens of LLMs

    Authors: Ilias Chalkidis, Stephanie Brandl

    Abstract: Instruction-finetuned Large Language Models inherit clear political leanings that have been shown to influence downstream task performance. We expand this line of research beyond the two-party system in the US and audit Llama Chat in the context of EU politics in various settings to analyze the model's political knowledge and its ability to reason in context. We adapt, i.e., further fine-tune, Lla…

    Submitted 22 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: accepted to NAACL 2024 as a short paper

  2. arXiv:2402.19133  [pdf, other]

    cs.CL

    Evaluating Webcam-based Gaze Data as an Alternative for Human Rationale Annotations

    Authors: Stephanie Brandl, Oliver Eberle, Tiago Ribeiro, Anders Søgaard, Nora Hollenstein

    Abstract: Rationales in the form of manually annotated input spans usually serve as ground truth when evaluating explainability methods in NLP. They are, however, time-consuming and often biased by the annotation process. In this paper, we debate whether human gaze, in the form of webcam-based eye-tracking recordings, poses a valid alternative when evaluating importance scores. We evaluate the additional in…

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted to LREC-COLING 2024

  3. arXiv:2310.17530  [pdf, other]

    cs.CV cs.CL cs.LG

    Evaluating Bias and Fairness in Gender-Neutral Pretrained Vision-and-Language Models

    Authors: Laura Cabello, Emanuele Bugliarello, Stephanie Brandl, Desmond Elliott

    Abstract: Pretrained machine learning models are known to perpetuate and even amplify existing biases in data, which can result in unfair outcomes that ultimately impact user experience. Therefore, it is crucial to understand the mechanisms behind those prejudicial biases to ensure that model performance does not result in discriminatory behaviour toward certain groups or populations. In this work, we defin…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: To appear in EMNLP 2023

  4. arXiv:2310.16607  [pdf, other]

    cs.CL

    On the Interplay between Fairness and Explainability

    Authors: Stephanie Brandl, Emanuele Bugliarello, Ilias Chalkidis

    Abstract: In order to build reliable and trustworthy NLP applications, models need to be both fair across different demographics and explainable. Usually these two objectives, fairness and explainability, are optimized and/or examined independently of each other. Instead, we argue that forthcoming, trustworthy NLP systems should consider both. In this work, we perform a first study to understand how they in…

    Submitted 13 November, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: 15 pages (incl Appendix), 4 figures, 8 tables

  5. arXiv:2310.11906  [pdf, other]

    cs.CL

    Rather a Nurse than a Physician -- Contrastive Explanations under Investigation

    Authors: Oliver Eberle, Ilias Chalkidis, Laura Cabello, Stephanie Brandl

    Abstract: Contrastive explanations, where one decision is explained in contrast to another, are supposed to be closer to how humans explain a decision than non-contrastive explanations, where the decision is not necessarily referenced to an alternative. This claim has never been empirically validated. We analyze four English text-classification datasets (SST2, DynaSent, BIOS and DBpedia-Animals). We fine-tu…

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 9 pages, long paper at EMNLP 2023 proceedings

  6. arXiv:2303.17876  [pdf, other]

    cs.CL

    WebQAmGaze: A Multilingual Webcam Eye-Tracking-While-Reading Dataset

    Authors: Tiago Ribeiro, Stephanie Brandl, Anders Søgaard, Nora Hollenstein

    Abstract: We present WebQAmGaze, a multilingual low-cost eye-tracking-while-reading dataset, designed as the first webcam-based eye-tracking corpus of reading to support the development of explainable computational language processing models. WebQAmGaze includes webcam eye-tracking data from 600 participants of a wide age range naturally reading English, German, Spanish, and Turkish texts. Each participant…

    Submitted 15 March, 2024; v1 submitted 31 March, 2023; originally announced March 2023.

  7. arXiv:2210.04963  [pdf, other]

    cs.CL

    Every word counts: A multilingual analysis of individual human alignment with model attention

    Authors: Stephanie Brandl, Nora Hollenstein

    Abstract: Human fixation patterns have been shown to correlate strongly with Transformer-based attention. Those correlation analyses are usually carried out without taking into account individual differences between participants and are mostly done on monolingual datasets making it difficult to generalise findings. In this paper, we analyse eye-tracking data from speakers of 13 different languages reading b…

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: short paper, accepted at AACL 2022

  8. arXiv:2210.04962  [pdf, other]

    cs.CL

    Domain-Specific Word Embeddings with Structure Prediction

    Authors: Stephanie Brandl, David Lassner, Anne Baillot, Shinichi Nakajima

    Abstract: Complementary to finding good general word embeddings, an important question for representation learning is to find dynamic word embeddings, e.g., across time or domain. Current methods do not offer a way to use or predict information on structure between sub-corpora, time or domain and dynamic embeddings can only be compared after post-alignment. We propose novel word embedding methods that provi…

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: accepted at TACL; 13 pages, 4 figures

  9. arXiv:2206.02661  [pdf, other]

    cs.CL

    Evaluating Deep Taylor Decomposition for Reliability Assessment in the Wild

    Authors: Stephanie Brandl, Daniel Hershcovich, Anders Søgaard

    Abstract: We argue that we need to evaluate model interpretability methods 'in the wild', i.e., in situations where professionals make critical decisions, and models can potentially assist them. We present an in-the-wild evaluation of token attribution based on Deep Taylor Decomposition, with professional journalists performing reliability assessments. We find that using this method in conjunction with RoBE…

    Submitted 3 May, 2022; originally announced June 2022.

    Comments: ICWSM 2022

  10. arXiv:2205.10226  [pdf, other]

    cs.CL cs.LG

    Do Transformer Models Show Similar Attention Patterns to Task-Specific Human Gaze?

    Authors: Stephanie Brandl, Oliver Eberle, Jonas Pilot, Anders Søgaard

    Abstract: Learned self-attention functions in state-of-the-art NLP models often correlate with human attention. We investigate whether self-attention in large-scale pre-trained language models is as predictive of human eye fixation patterns during task-reading as classical cognitive models of human attention. We compare attention functions across two task-specific reading datasets for sentiment analysis and…

    Submitted 25 April, 2022; originally announced May 2022.

    Comments: Accepted to ACL 2022

  11. arXiv:2204.10281  [pdf, other]

    cs.CL

    How Conservative are Language Models? Adapting to the Introduction of Gender-Neutral Pronouns

    Authors: Stephanie Brandl, Ruixiang Cui, Anders Søgaard

    Abstract: Gender-neutral pronouns have recently been introduced in many languages to a) include non-binary people and b) as a generic singular. Recent results from psycholinguistics suggest that gender-neutral pronouns (in Swedish) are not associated with human processing difficulties. This, we show, is in sharp contrast with automated processing. We show that gender-neutral pronouns in Danish, English, and…

    Submitted 3 May, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: To appear at NAACL 2022

  12. arXiv:2203.10020  [pdf, other]

    cs.CL

    Challenges and Strategies in Cross-Cultural NLP

    Authors: Daniel Hershcovich, Stella Frank, Heather Lent, Miryam de Lhoneux, Mostafa Abdou, Stephanie Brandl, Emanuele Bugliarello, Laura Cabello Piqueras, Ilias Chalkidis, Ruixiang Cui, Constanza Fierro, Katerina Margatina, Phillip Rust, Anders Søgaard

    Abstract: Various efforts in the Natural Language Processing (NLP) community have been made to accommodate linguistic diversity and serve speakers of many different languages. However, it is important to acknowledge that speakers and the content they produce and require, vary not just by language, but also by culture. Although language and culture are tightly linked, there are important differences. Analogo…

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: ACL 2022 - Theme track

  13. Analyzing Item Popularity Bias of Music Recommender Systems: Are Different Genders Equally Affected?

    Authors: Oleg Lesota, Alessandro B. Melchiorre, Navid Rekabsaz, Stefan Brandl, Dominik Kowald, Elisabeth Lex, Markus Schedl

    Abstract: Several studies have identified discrepancies between the popularity of items in user profiles and the corresponding recommendation lists. Such behavior, which concerns a variety of recommendation algorithms, is referred to as popularity bias. Existing work predominantly adopts simple statistical measures, such as the difference of mean or median popularity, to quantify popularity bias. Moreover,…

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: RecSys 2021 - LBR

  14. arXiv:2001.04693  [pdf, other]

    cs.CL cs.LG

    Balancing the composition of word embeddings across heterogenous data sets

    Authors: Stephanie Brandl, David Lassner, Maximilian Alber

    Abstract: Word embeddings capture semantic relationships based on contextual information and are the basis for a wide variety of natural language processing applications. Notably these relationships are solely learned from the data and subsequently the data composition impacts the semantic of embeddings which arguably can lead to biased word vectors. Given qualitatively different data subsets, we aim to ali…

    Submitted 14 January, 2020; originally announced January 2020.