Skip to main content

Showing 1–1 of 1 results for author: Ghawanmeh, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.04756  [pdf, other

    cs.CL cs.LG

    BiasKG: Adversarial Knowledge Graphs to Induce Bias in Large Language Models

    Authors: Chu Fei Luo, Ahmad Ghawanmeh, Xiaodan Zhu, Faiza Khan Khattak

    Abstract: Modern large language models (LLMs) have a significant amount of world knowledge, which enables strong performance in commonsense reasoning and knowledge-intensive tasks when harnessed properly. The language model can also learn social biases, which has a significant potential for societal harm. There have been many mitigation strategies proposed for LLM safety, but it is unclear how effective the… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.