Skip to main content

Showing 1–1 of 1 results for author: Jenny, D F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.08605  [pdf, other

    cs.CL cs.AI cs.CY cs.SI

    Exploring the Jungle of Bias: Political Bias Attribution in Language Models via Dependency Analysis

    Authors: David F. Jenny, Yann Billeter, Mrinmaya Sachan, Bernhard Schölkopf, Zhi**g **

    Abstract: The rapid advancement of Large Language Models (LLMs) has sparked intense debate regarding the prevalence of bias in these models and its mitigation. Yet, as exemplified by both results on debiasing methods in the literature and reports of alignment-related defects from the wider community, bias remains a poorly understood topic despite its practical relevance. To enhance the understanding of the… ▽ More

    Submitted 12 May, 2024; v1 submitted 14 November, 2023; originally announced November 2023.