How Robust are LLMs to In-Context Majority Label Bias?

Gupta, Karan; Roychowdhury, Sumegh; Kasa, Siva Rajesh; Kasa, Santhosh Kumar; Bhanushali, Anish; Pattisapu, Nikhil; Murthy, Prasanna Srinivasa


arXiv:2312.16549 (cs)

[Submitted on 27 Dec 2023]



Abstract: In the In-Context Learning (ICL) setup, various forms of label bias can manifest. One such form is majority label bias, which arises when the distribution of labeled examples in the in-context samples is skewed towards one or more specific classes, making Large Language Models (LLMs) more prone to predicting those labels. Such discrepancies can arise from various factors, including logistical constraints, inherent biases in data collection methods, and limited access to diverse data sources, which are unavoidable in a real-world industry setup. In this work, we study the robustness of in-context learning in LLMs to shifts caused by majority label bias within the purview of text classification tasks. Prior works have shown that in-context learning with LLMs is susceptible to such biases. In our study, we go one level deeper and show that the robustness boundary varies widely across models and tasks, with certain LLMs being highly robust (~90%) to majority label bias. Our findings also highlight how model size and the richness of instructional prompts contribute to model robustness. We restrict our study to publicly available open-source models to ensure transparency and reproducibility.
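
To make the notion of a skewed in-context label distribution concrete, here is a minimal Python sketch of how a few-shot prompt with a controlled majority-label fraction could be assembled. It is not the authors' experimental setup: the helper names (`sample_skewed_demos`, `build_prompt`), the prompt template, the toy example pool, and the 75% skew are all assumptions made for illustration.

```python
import random
from collections import Counter


def sample_skewed_demos(pool, majority_label, majority_frac, k, seed=0):
    """Pick k demonstrations, about majority_frac of them with majority_label.

    Hypothetical helper for illustration; `pool` is a list of (text, label) pairs.
    """
    rng = random.Random(seed)
    majority = [ex for ex in pool if ex[1] == majority_label]
    minority = [ex for ex in pool if ex[1] != majority_label]
    n_major = round(k * majority_frac)
    demos = rng.sample(majority, n_major) + rng.sample(minority, k - n_major)
    rng.shuffle(demos)  # avoid grouping all majority examples together
    return demos


def build_prompt(demos, query, instruction):
    """Format the demonstrations and the query into one prompt string (assumed template)."""
    lines = [instruction, ""]
    for text, label in demos:
        lines.append(f"Text: {text}\nLabel: {label}\n")
    lines.append(f"Text: {query}\nLabel:")
    return "\n".join(lines)


# Toy, made-up pool: 10 examples per class.
pool = [(f"positive review {i}", "positive") for i in range(10)]
pool += [(f"negative review {i}", "negative") for i in range(10)]

# 8 in-context examples, 75% of them carrying the majority label.
demos = sample_skewed_demos(pool, "positive", majority_frac=0.75, k=8)
label_counts = Counter(label for _, label in demos)

prompt = build_prompt(
    demos,
    query="The battery died after a day.",
    instruction="Classify the sentiment of each text as positive or negative.",
)
```

Sweeping `majority_frac` while holding the query set fixed is one way a robustness comparison of this kind could be organized; the instruction string is where a richer or sparser task description would be substituted.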

Comments:	6 pages, 3 figures, 2 tables. Accepted at the Workshop on Responsible Language Modeling, AAAI 2024
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2312.16549 [cs.LG]
	(or arXiv:2312.16549v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2312.16549

Submission history

From: Karan Gupta
[v1] Wed, 27 Dec 2023 12:20:12 UTC (498 KB)
