Goldilocks: Consistent Crowdsourced Scalar Annotations with Relative Uncertainty

Chen, Quanze; Weld, Daniel S.; Zhang, Amy X.

doi:10.1145/3476076

Computer Science > Human-Computer Interaction

arXiv:2108.01799 (cs)

[Submitted on 4 Aug 2021]

Title:Goldilocks: Consistent Crowdsourced Scalar Annotations with Relative Uncertainty

Authors:Quanze Chen, Daniel S. Weld, Amy X. Zhang

View PDF

Abstract:Human ratings have become a crucial resource for training and evaluating machine learning systems. However, traditional elicitation methods for absolute and comparative rating suffer from issues with consistency and often do not distinguish between uncertainty due to disagreement between annotators and ambiguity inherent to the item being rated. In this work, we present Goldilocks, a novel crowd rating elicitation technique for collecting calibrated scalar annotations that also distinguishes inherent ambiguity from inter-annotator disagreement. We introduce two main ideas: grounding absolute rating scales with examples and using a two-step bounding process to establish a range for an item's placement. We test our designs in three domains: judging toxicity of online comments, estimating satiety of food depicted in images, and estimating age based on portraits. We show that (1) Goldilocks can improve consistency in domains where interpretation of the scale is not universal, and that (2) representing items with ranges lets us simultaneously capture different sources of uncertainty leading to better estimates of pairwise relationship distributions.

Comments:	CSCW '21
Subjects:	Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2108.01799 [cs.HC]
	(or arXiv:2108.01799v1 [cs.HC] for this version)
	https://doi.org/10.48550/arXiv.2108.01799
Related DOI:	https://doi.org/10.1145/3476076

Submission history

From: Quan Ze Chen [view email]
[v1] Wed, 4 Aug 2021 00:58:18 UTC (4,731 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.HC

< prev | next >

new | recent | 2021-08

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Quanze Chen
Daniel S. Weld
Amy X. Zhang

export BibTeX citation

Computer Science > Human-Computer Interaction

Title:Goldilocks: Consistent Crowdsourced Scalar Annotations with Relative Uncertainty

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Human-Computer Interaction

Title:Goldilocks: Consistent Crowdsourced Scalar Annotations with Relative Uncertainty

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators