Skip to main content

Showing 1–2 of 2 results for author: Reif, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.02743  [pdf, other

    cs.CL

    Beyond Performance: Quantifying and Mitigating Label Bias in LLMs

    Authors: Yuval Reif, Roy Schwartz

    Abstract: Large language models (LLMs) have shown remarkable adaptability to diverse tasks, by leveraging context prompts containing instructions, or minimal input-output examples. However, recent work revealed they also exhibit label bias -- an undesirable preference toward predicting certain answers over others. Still, detecting and measuring this bias reliably and at scale has remained relatively unexplo… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: NAACL 2024

  2. arXiv:2305.18917  [pdf, other

    cs.CL

    Fighting Bias with Bias: Promoting Model Robustness by Amplifying Dataset Biases

    Authors: Yuval Reif, Roy Schwartz

    Abstract: NLP models often rely on superficial cues known as dataset biases to achieve impressive performance, and can fail on examples where these biases do not hold. Recent work sought to develop robust, unbiased models by filtering biased examples from training sets. In this work, we argue that such filtering can obscure the true capabilities of models to overcome biases, which might never be removed in… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023