Large Language Models Portray Socially Subordinate Groups as More Homogeneous, Consistent with a Bias Observed in Humans

Lee, Messi H. J.; Montgomery, Jacob M.; Lai, Calvin K.

doi:10.1145/3630106.3658975

Computer Science > Computation and Language

arXiv:2401.08495 (cs)

[Submitted on 16 Jan 2024 (v1), last revised 26 Apr 2024 (this version, v2)]

Title:Large Language Models Portray Socially Subordinate Groups as More Homogeneous, Consistent with a Bias Observed in Humans

Authors:Messi H.J. Lee, Jacob M. Montgomery, Calvin K. Lai

View PDF HTML (experimental)

Abstract:Large language models (LLMs) are becoming pervasive in everyday life, yet their propensity to reproduce biases inherited from training data remains a pressing concern. Prior investigations into bias in LLMs have focused on the association of social groups with stereotypical attributes. However, this is only one form of human bias such systems may reproduce. We investigate a new form of bias in LLMs that resembles a social psychological phenomenon where socially subordinate groups are perceived as more homogeneous than socially dominant groups. We had ChatGPT, a state-of-the-art LLM, generate texts about intersectional group identities and compared those texts on measures of homogeneity. We consistently found that ChatGPT portrayed African, Asian, and Hispanic Americans as more homogeneous than White Americans, indicating that the model described racial minority groups with a narrower range of human experience. ChatGPT also portrayed women as more homogeneous than men, but these differences were small. Finally, we found that the effect of gender differed across racial/ethnic groups such that the effect of gender was consistent within African and Hispanic Americans but not within Asian and White Americans. We argue that the tendency of LLMs to describe groups as less diverse risks perpetuating stereotypes and discriminatory behavior.

Comments:	Forthcoming at ACM Conference on Fairness, Accountability, and Transparency (FAccT) 2024
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2401.08495 [cs.CL]
	(or arXiv:2401.08495v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2401.08495
Related DOI:	https://doi.org/10.1145/3630106.3658975

Submission history

From: Messi H.J. Lee [view email]
[v1] Tue, 16 Jan 2024 16:52:00 UTC (64 KB)
[v2] Fri, 26 Apr 2024 01:40:12 UTC (202 KB)

Computer Science > Computation and Language

Title:Large Language Models Portray Socially Subordinate Groups as More Homogeneous, Consistent with a Bias Observed in Humans

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Large Language Models Portray Socially Subordinate Groups as More Homogeneous, Consistent with a Bias Observed in Humans

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators