Showing 1–18 of 18 results for author: Fraser, K C

  1. arXiv:2406.15583

    cs.CL cs.CY

    Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods

    Authors: Kathleen C. Fraser, Hillary Dawkins, Svetlana Kiritchenko

    Abstract: Large language models (LLMs) have advanced to a point that even humans have difficulty discerning whether a text was generated by another human, or by a computer. However, knowing whether a text was produced by human or artificial intelligence (AI) is important to determining its trustworthiness, and has applications in many domains including detecting fraud and academic dishonesty, as well as com…

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2405.20152

    cs.CV

    Uncovering Bias in Large Vision-Language Models at Scale with Counterfactuals

    Authors: Phillip Howard, Kathleen C. Fraser, Anahita Bhiwandiwalla, Svetlana Kiritchenko

    Abstract: With the advent of Large Language Models (LLMs) possessing increasingly impressive capabilities, a number of Large Vision-Language Models (LVLMs) have been proposed to augment LLMs with visual inputs. Such models condition generated text on both an input image and a text prompt, enabling a variety of use cases such as visual question answering and multimodal chat. While prior studies have examined…

    Submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2404.11845

    cs.CL cs.CY

    Challenging Negative Gender Stereotypes: A Study on the Effectiveness of Automated Counter-Stereotypes

    Authors: Isar Nejadgholi, Kathleen C. Fraser, Anna Kerkhof, Svetlana Kiritchenko

    Abstract: Gender stereotypes are pervasive beliefs about individuals based on their gender that play a significant role in shaping societal attitudes, behaviours, and even opportunities. Recognizing the negative implications of gender stereotypes, particularly in online communications, this study investigates eleven strategies to automatically counter-act and challenge these views. We present AI-generated g…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: LREC-COLING2024

  4. arXiv:2404.00166

    cs.CV cs.AI

    Uncovering Bias in Large Vision-Language Models with Counterfactuals

    Authors: Phillip Howard, Anahita Bhiwandiwalla, Kathleen C. Fraser, Svetlana Kiritchenko

    Abstract: With the advent of Large Language Models (LLMs) possessing increasingly impressive capabilities, a number of Large Vision-Language Models (LVLMs) have been proposed to augment LLMs with visual inputs. Such models condition generated text on both an input image and a text prompt, enabling a variety of use cases such as visual question answering and multimodal chat. While prior studies have examined…

    Submitted 7 June, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

    Comments: Accepted to the CVPR 2024 Responsible Generative AI (ReGenAI) Workshop

  5. arXiv:2402.05779

    cs.CY cs.CL cs.CV

    Examining Gender and Racial Bias in Large Vision-Language Models Using a Novel Dataset of Parallel Images

    Authors: Kathleen C. Fraser, Svetlana Kiritchenko

    Abstract: Following on recent advances in large language models (LLMs) and subsequent chat models, a new wave of large vision-language models (LVLMs) has emerged. Such models can incorporate images as input in addition to text, and perform tasks such as visual question answering, image captioning, story generation, etc. Here, we examine potential gender and racial biases in such systems, based on the percei…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: To appear at EACL 2024

  6. arXiv:2307.01900

    cs.CL cs.AI

    Concept-Based Explanations to Test for False Causal Relationships Learned by Abusive Language Classifiers

    Authors: Isar Nejadgholi, Svetlana Kiritchenko, Kathleen C. Fraser, Esma Balkır

    Abstract: Classifiers tend to learn a false causal relationship between an over-represented concept and a label, which can result in over-reliance on the concept and compromised classification accuracy. It is imperative to have methods in place that can compare different models and identify over-reliances on specific concepts. We consider three well-known abusive language classifiers trained on large Englis…

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: Published at WOAH2023 co-located with ACL2023

  7. arXiv:2303.14128

    cs.CL

    The crime of being poor

    Authors: Georgina Curto, Svetlana Kiritchenko, Isar Nejadgholi, Kathleen C. Fraser

    Abstract: The criminalization of poverty has been widely denounced as a collective bias against the most vulnerable. NGOs and international organizations claim that the poor are blamed for their situation, are more often associated with criminal offenses than the wealthy strata of society and even incur criminal offenses simply as a result of being poor. While no evidence has been found in the literature th…

    Submitted 24 March, 2023; originally announced March 2023.

  8. arXiv:2302.07159

    cs.CY cs.CL

    A Friendly Face: Do Text-to-Image Systems Rely on Stereotypes when the Input is Under-Specified?

    Authors: Kathleen C. Fraser, Svetlana Kiritchenko, Isar Nejadgholi

    Abstract: As text-to-image systems continue to grow in popularity with the general public, questions have arisen about bias and diversity in the generated images. Here, we investigate properties of images generated in response to prompts which are visually under-specified, but contain salient social attributes (e.g., 'a portrait of a threatening person' versus 'a portrait of a friendly person'). Grounding o…

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: Appearing in the AAAI 2023 Workshop on Creative AI Across Modalities

  9. arXiv:2210.10689

    cs.CL

    Towards Procedural Fairness: Uncovering Biases in How a Toxic Language Classifier Uses Sentiment Information

    Authors: Isar Nejadgholi, Esma Balkır, Kathleen C. Fraser, Svetlana Kiritchenko

    Abstract: Previous works on the fairness of toxic language classifiers compare the output of models with different identity terms as input features but do not consider the impact of other important concepts present in the context. Here, besides identity terms, we take into account high-level latent features learned by the classifier and investigate the interaction between these features and identity terms.…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 13 pages, 2 figures, accepted at the fifth edition of BlackBoxNLP collocated with EMNLP2022

  10. arXiv:2206.03945

    cs.CL

    Challenges in Applying Explainability Methods to Improve the Fairness of NLP Models

    Authors: Esma Balkir, Svetlana Kiritchenko, Isar Nejadgholi, Kathleen C. Fraser

    Abstract: Motivations for methods in explainable artificial intelligence (XAI) often include detecting, quantifying and mitigating bias, and contributing to making machine learning models fairer. However, exactly how an XAI method can help in combating biases is often left unspecified. In this paper, we briefly review trends in explainability and fairness in NLP research, identify the current practices in w…

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: TrustNLP Workshop at NAACL 2022

  11. arXiv:2205.12771

    cs.CY cs.CL

    Does Moral Code Have a Moral Code? Probing Delphi's Moral Philosophy

    Authors: Kathleen C. Fraser, Svetlana Kiritchenko, Esma Balkir

    Abstract: In an effort to guarantee that machine learning model outputs conform with human moral values, recent work has begun exploring the possibility of explicitly training models to learn the difference between right and wrong. This is typically done in a bottom-up fashion, by exposing the model to different scenarios, annotated with human moral judgements. One question, however, is whether the trained…

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: To appear at TrustNLP Workshop @ NAACL 2022

  12. arXiv:2205.03302

    cs.CL

    Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection

    Authors: Esma Balkir, Isar Nejadgholi, Kathleen C. Fraser, Svetlana Kiritchenko

    Abstract: We present a novel feature attribution method for explaining text classifiers, and analyze it in the context of hate speech detection. Although feature attribution models usually provide a single importance score for each token, we instead provide two complementary and theoretically-grounded scores -- necessity and sufficiency -- resulting in more informative explanations. We propose a transparent…

    Submitted 6 May, 2022; originally announced May 2022.

    Comments: NAACL 2022

  13. arXiv:2204.02261

    cs.CL cs.LG

    Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors

    Authors: Isar Nejadgholi, Kathleen C. Fraser, Svetlana Kiritchenko

    Abstract: Robustness of machine learning models on ever-changing real-world data is critical, especially for applications affecting human well-being such as content moderation. New kinds of abusive language continually emerge in online discussions in response to current events (e.g., COVID-19), and the deployed abuse detection systems should be updated regularly to remain accurate. In this paper, we show th…

    Submitted 5 April, 2022; originally announced April 2022.

    Comments: accepted to be published at ACL2022

  14. arXiv:2110.09421

    cs.CL cs.AI cs.CY

    Measuring Cognitive Status from Speech in a Smart Home Environment

    Authors: Kathleen C. Fraser, Majid Komeili

    Abstract: The population is aging, and becoming more tech-savvy. The United Nations predicts that by 2050, one in six people in the world will be over age 65 (up from one in 11 in 2019), and this increases to one in four in Europe and Northern America. Meanwhile, the proportion of American adults over 65 who own a smartphone has risen 24 percentage points from 2013-2017, and the majority have Internet in th…

    Submitted 18 October, 2021; originally announced October 2021.

    Journal ref: IEEE Instrumentation & Measurement Magazine (Volume: 24, Issue: 6, September 2021)

  15. arXiv:2106.02596

    cs.CY cs.AI cs.CL

    Understanding and Countering Stereotypes: A Computational Approach to the Stereotype Content Model

    Authors: Kathleen C. Fraser, Isar Nejadgholi, Svetlana Kiritchenko

    Abstract: Stereotypical language expresses widely-held beliefs about different social categories. Many stereotypes are overtly negative, while others may appear positive on the surface, but still lead to negative consequences. In this work, we present a computational approach to interpreting stereotypes in text through the Stereotype Content Model (SCM), a comprehensive causal theory from social psychology.…

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: In Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021)

  16. arXiv:2012.12305

    cs.CL cs.AI cs.CY

    Confronting Abusive Language Online: A Survey from the Ethical and Human Rights Perspective

    Authors: Svetlana Kiritchenko, Isar Nejadgholi, Kathleen C. Fraser

    Abstract: The pervasiveness of abusive content on the internet can lead to severe psychological and physical harm. Significant effort in Natural Language Processing (NLP) research has been devoted to addressing this problem through abusive content detection and related sub-areas, such as the detection of hate speech, toxicity, cyberbullying, etc. Although current technologies achieve high classification per…

    Submitted 22 July, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

    Comments: published in Journal of Artificial Intelligence Research, 71: 431-478, July 2021

  17. arXiv:2006.05281

    cs.CL cs.LG

    Extensive Error Analysis and a Learning-Based Evaluation of Medical Entity Recognition Systems to Approximate User Experience

    Authors: Isar Nejadgholi, Kathleen C. Fraser, Berry De Bruijn

    Abstract: When comparing entities extracted by a medical entity recognition system with gold standard annotations over a test set, two types of mismatches might occur, label mismatch or span mismatch. Here we focus on span mismatch and show that its severity can vary from a serious error to a fully acceptable entity extraction due to the subjectivity of span annotations. For a domain-specific BERT-based NER…

    Submitted 9 June, 2020; originally announced June 2020.

    Comments: to appear at BioNLP2020

  18. arXiv:1910.01274

    cs.CL cs.NE

    Extracting UMLS Concepts from Medical Text Using General and Domain-Specific Deep Learning Models

    Authors: Kathleen C. Fraser, Isar Nejadgholi, Berry De Bruijn, Muqun Li, Astha LaPlante, Khaldoun Zine El Abidine

    Abstract: Entity recognition is a critical first step to a number of clinical NLP applications, such as entity linking and relation extraction. We present the first attempt to apply state-of-the-art entity recognition approaches on a newly released dataset, MedMentions. This dataset contains over 4000 biomedical abstracts, annotated for UMLS semantic types. In comparison to existing datasets, MedMentions co…

    Submitted 2 October, 2019; originally announced October 2019.

    Comments: 11 pages, accepted at LOUHI2019 workshop