Skip to main content

Showing 1–11 of 11 results for author: Alkhaled, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19097  [pdf, other

    cs.CL

    Fairness and Bias in Multimodal AI: A Survey

    Authors: Tosin Adewumi, Lama Alkhaled, Namrata Gurung, Goya van Boven, Irene Pagliai

    Abstract: The importance of addressing fairness and bias in artificial intelligence (AI) systems cannot be over-emphasized. Mainstream media has been awashed with news of incidents around stereotypes and bias in many of these systems in recent years. In this survey, we fill a gap with regards to the minimal study of fairness and bias in Large Multimodal Models (LMMs) compared to Large Language Models (LLMs)… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 8 pages

  2. arXiv:2404.04838  [pdf, other

    cs.CL

    Data Bias According to Bipol: Men are Naturally Right and It is the Role of Women to Follow Their Lead

    Authors: Irene Pagliai, Goya van Boven, Tosin Adewumi, Lama Alkhaled, Namrata Gurung, Isabella Södergren, Elisa Barney

    Abstract: We introduce new large labeled datasets on bias in 3 languages and show in experiments that bias exists in all 10 datasets of 5 languages evaluated, including benchmark datasets on the English GLUE/SuperGLUE leaderboards. The 3 new languages give a total of almost 6 million labeled samples and we benchmark on these datasets using SotA multilingual pretrained models: mT5 and mBERT. The challenge of… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 11 pages, 6 figures

  3. arXiv:2404.04631  [pdf, other

    cs.CL

    On the Limitations of Large Language Models (LLMs): False Attribution

    Authors: Tosin Adewumi, Nudrat Habib, Lama Alkhaled, Elisa Barney

    Abstract: In this work, we provide insight into one important limitation of large language models (LLMs), i.e. false attribution, and introduce a new hallucination metric - Simple Hallucination Index (SHI). The task of automatic author attribution for relatively small chunks of text is an important NLP task but can be challenging. We empirically evaluate the power of 3 open SotA LLMs in zero-shot setting (L… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: 8 pages, 5 figures

  4. arXiv:2403.15017  [pdf, other

    cs.CV cs.LG

    Vehicle Detection Performance in Nordic Region

    Authors: Hamam Mokayed, Rajkumar Saini, Oluwatosin Adewumi, Lama Alkhaled, Bjorn Backe, Palaiahnakote Shivakumara, Olle Hagner, Yan Chai Hum

    Abstract: This paper addresses the critical challenge of vehicle detection in the harsh winter conditions in the Nordic regions, characterized by heavy snowfall, reduced visibility, and low lighting. Due to their susceptibility to environmental distortions and occlusions, traditional vehicle detection methods have struggled in these adverse conditions. The advanced proposed deep learning architectures broug… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: submitted to ICPR2024

  5. arXiv:2402.00453  [pdf, other

    cs.CV cs.CL

    Instruction Makes a Difference

    Authors: Tosin Adewumi, Nudrat Habib, Lama Alkhaled, Elisa Barney

    Abstract: We introduce Instruction Document Visual Question Answering (iDocVQA) dataset and Large Language Document (LLaDoc) model, for training Language-Vision (LV) models for document analysis and predictions on document images, respectively. Usually, deep neural networks for the DocVQA task are trained on datasets lacking instructions. We show that using instruction-following datasets improves performanc… ▽ More

    Submitted 13 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted at the 16th IAPR International Workshop On Document Analysis Systems (DAS)

  6. arXiv:2312.09801  [pdf, other

    cs.CL

    ProCoT: Stimulating Critical Thinking and Writing of Students through Engagement with Large Language Models (LLMs)

    Authors: Tosin Adewumi, Lama Alkhaled, Claudia Buck, Sergio Hernandez, Saga Brilioth, Mkpe Kekung, Yelvin Ragimov, Elisa Barney

    Abstract: We introduce a novel writing method called Probing Chain-of-Thought (ProCoT), which potentially prevents students from cheating using a Large Language Model (LLM), such as ChatGPT, while enhancing their active learning. LLMs have disrupted education and many other fields. For fear of students cheating, many have resorted to banning their use. These LLMs are also known for hallucinations. We conduc… ▽ More

    Submitted 1 May, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: 8 pages, 4 figures

  7. arXiv:2311.09828  [pdf, other

    cs.CL

    AfriMTE and AfriCOMET: Enhancing COMET to Embrace Under-resourced African Languages

    Authors: Jiayi Wang, David Ifeoluwa Adelani, Sweta Agrawal, Marek Masiak, Ricardo Rei, Eleftheria Briakou, Marine Carpuat, Xuanli He, Sofia Bourhim, Andiswa Bukula, Muhidin Mohamed, Temitayo Olatoye, Tosin Adewumi, Hamam Mokayed, Christine Mwase, Wangui Kimotho, Foutse Yuehgoh, Anuoluwapo Aremu, Jessica Ojo, Shamsuddeen Hassan Muhammad, Salomey Osei, Abdul-Hakeem Omotayo, Chiamaka Chukwuneke, Perez Ogayo, Oumaima Hourrane , et al. (33 additional authors not shown)

    Abstract: Despite the recent progress on scaling multilingual machine translation (MT) to several under-resourced African languages, accurately measuring this progress remains challenging, since evaluation is often performed on n-gram matching metrics such as BLEU, which typically show a weaker correlation with human judgments. Learned metrics such as COMET have higher correlation; however, the lack of eval… ▽ More

    Submitted 23 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted by NAACL 2024

  8. arXiv:2304.14462  [pdf, other

    cs.CV cs.LG

    Robust and Fast Vehicle Detection using Augmented Confidence Map

    Authors: Hamam Mokayed, Palaiahnakote Shivakumara, Lama Alkhaled, Rajkumar Saini, Muhammad Zeshan Afzal, Yan Chai Hum, Marcus Liwicki

    Abstract: Vehicle detection in real-time scenarios is challenging because of the time constraints and the presence of multiple types of vehicles with different speeds, shapes, structures, etc. This paper presents a new method relied on generating a confidence map-for robust and faster vehicle detection. To reduce the adverse effect of different speeds, shapes, structures, and the presence of several vehicle… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

  9. arXiv:2304.04029  [pdf, other

    cs.CL

    Bipol: A Novel Multi-Axes Bias Evaluation Metric with Explainability for NLP

    Authors: Lama Alkhaled, Tosin Adewumi, Sana Sabah Sabry

    Abstract: We introduce bipol, a new metric with explainability, for estimating social bias in text data. Harmful bias is prevalent in many online sources of data that are used for training machine learning (ML) models. In a step to address this challenge we create a novel metric that involves a two-step process: corpus-level evaluation based on model classification and sentence-level evaluation based on (se… ▽ More

    Submitted 16 September, 2023; v1 submitted 8 April, 2023; originally announced April 2023.

    Comments: Published in Elsevier's Natural Language Processing Journal

  10. arXiv:2301.12139  [pdf, other

    cs.CL

    Bipol: Multi-axes Evaluation of Bias with Explainability in Benchmark Datasets

    Authors: Tosin Adewumi, Isabella Södergren, Lama Alkhaled, Sana Sabah Sabry, Foteini Liwicki, Marcus Liwicki

    Abstract: We investigate five English NLP benchmark datasets (on the superGLUE leaderboard) and two Swedish datasets for bias, along multiple axes. The datasets are the following: Boolean Question (Boolq), CommitmentBank (CB), Winograd Schema Challenge (WSC), Wino-gender diagnostic (AXg), Recognising Textual Entailment (RTE), Swedish CB, and SWEDN. Bias can be harmful and it is known to be common in data, w… ▽ More

    Submitted 16 September, 2023; v1 submitted 28 January, 2023; originally announced January 2023.

    Comments: Accepted at RANLP 2023

  11. arXiv:2204.07432  [pdf, other

    cs.CL

    ML_LTU at SemEval-2022 Task 4: T5 Towards Identifying Patronizing and Condescending Language

    Authors: Tosin Adewumi, Lama Alkhaled, Hamam Mokayed, Foteini Liwicki, Marcus Liwicki

    Abstract: This paper describes the system used by the Machine Learning Group of LTU in subtask 1 of the SemEval-2022 Task 4: Patronizing and Condescending Language (PCL) Detection. Our system consists of finetuning a pretrained Text-to-Text-Transfer Transformer (T5) and innovatively reducing its out-of-class predictions. The main contributions of this paper are 1) the description of the implementation detai… ▽ More

    Submitted 5 May, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

    Comments: Accepted at the International Workshop on Semantic Evaluation (2022) co-located with NAACL