Inducing Group Fairness in LLM-Based Decisions

Atwood, James; Lahoti, Preethi; Balashankar, Ananth; Prost, Flavien; Beirami, Ahmad

Computer Science > Machine Learning

arXiv:2406.16738 (cs)

[Submitted on 24 Jun 2024]

Title:Inducing Group Fairness in LLM-Based Decisions

Authors:James Atwood, Preethi Lahoti, Ananth Balashankar, Flavien Prost, Ahmad Beirami

View PDF HTML (experimental)

Abstract:Prompting Large Language Models (LLMs) has created new and interesting means for classifying textual data. While evaluating and remediating group fairness is a well-studied problem in classifier fairness literature, some classical approaches (e.g., regularization) do not carry over, and some new opportunities arise (e.g., prompt-based remediation). We measure fairness of LLM-based classifiers on a toxicity classification task, and empirically show that prompt-based classifiers may lead to unfair decisions. We introduce several remediation techniques and benchmark their fairness and performance trade-offs. We hope our work encourages more research on group fairness in LLM-based classifiers.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
Cite as:	arXiv:2406.16738 [cs.LG]
	(or arXiv:2406.16738v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.16738

Submission history

From: James Atwood [view email]
[v1] Mon, 24 Jun 2024 15:45:20 UTC (752 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2024-06

Change to browse by:

cs
cs.AI
cs.CY

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:Inducing Group Fairness in LLM-Based Decisions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Inducing Group Fairness in LLM-Based Decisions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators