The Tail Wagging the Dog: Dataset Construction Biases of Social Bias Benchmarks

Selvam, Nikil Roashan; Dev, Sunipa; Khashabi, Daniel; Khot, Tushar; Chang, Kai-Wei

Computer Science > Computation and Language

arXiv:2210.10040 (cs)

[Submitted on 18 Oct 2022 (v1), last revised 16 Jun 2023 (this version, v2)]

Title:The Tail Wagging the Dog: Dataset Construction Biases of Social Bias Benchmarks

Authors:Nikil Roashan Selvam, Sunipa Dev, Daniel Khashabi, Tushar Khot, Kai-Wei Chang

View PDF

Abstract:How reliably can we trust the scores obtained from social bias benchmarks as faithful indicators of problematic social biases in a given language model? In this work, we study this question by contrasting social biases with non-social biases stemming from choices made during dataset construction that might not even be discernible to the human eye. To do so, we empirically simulate various alternative constructions for a given benchmark based on innocuous modifications (such as paraphrasing or random-sampling) that maintain the essence of their social bias. On two well-known social bias benchmarks (Winogender and BiasNLI) we observe that these shallow modifications have a surprising effect on the resulting degree of bias across various models. We hope these troubling observations motivate more robust measures of social biases.

Comments:	ACL 2023
Subjects:	Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
Cite as:	arXiv:2210.10040 [cs.CL]
	(or arXiv:2210.10040v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.10040

Submission history

From: Nikil Selvam [view email]
[v1] Tue, 18 Oct 2022 17:58:39 UTC (12,248 KB)
[v2] Fri, 16 Jun 2023 18:35:13 UTC (6,117 KB)

Computer Science > Computation and Language

Title:The Tail Wagging the Dog: Dataset Construction Biases of Social Bias Benchmarks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:The Tail Wagging the Dog: Dataset Construction Biases of Social Bias Benchmarks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators