Skip to main content

Showing 1–3 of 3 results for author: van Boven, G

.
  1. arXiv:2406.19097  [pdf, other

    cs.CL

    Fairness and Bias in Multimodal AI: A Survey

    Authors: Tosin Adewumi, Lama Alkhaled, Namrata Gurung, Goya van Boven, Irene Pagliai

    Abstract: The importance of addressing fairness and bias in artificial intelligence (AI) systems cannot be over-emphasized. Mainstream media has been awashed with news of incidents around stereotypes and bias in many of these systems in recent years. In this survey, we fill a gap with regards to the minimal study of fairness and bias in Large Multimodal Models (LMMs) compared to Large Language Models (LLMs)… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 8 pages

  2. Transforming Dutch: Debiasing Dutch Coreference Resolution Systems for Non-binary Pronouns

    Authors: Goya van Boven, Yupei Du, Dong Nguyen

    Abstract: Gender-neutral pronouns are increasingly being introduced across Western languages. Recent evaluations have however demonstrated that English NLP systems are unable to correctly process gender-neutral pronouns, with the risk of erasing and misgendering non-binary individuals. This paper examines a Dutch coreference resolution system's performance on gender-neutral pronouns, specifically hen and di… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: 22 pages, 2 figures. Accepted at the 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT '24)

    ACM Class: I.2.7

  3. arXiv:2404.04838  [pdf, other

    cs.CL

    Data Bias According to Bipol: Men are Naturally Right and It is the Role of Women to Follow Their Lead

    Authors: Irene Pagliai, Goya van Boven, Tosin Adewumi, Lama Alkhaled, Namrata Gurung, Isabella Södergren, Elisa Barney

    Abstract: We introduce new large labeled datasets on bias in 3 languages and show in experiments that bias exists in all 10 datasets of 5 languages evaluated, including benchmark datasets on the English GLUE/SuperGLUE leaderboards. The 3 new languages give a total of almost 6 million labeled samples and we benchmark on these datasets using SotA multilingual pretrained models: mT5 and mBERT. The challenge of… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 11 pages, 6 figures