CLAMS: A Cluster Ambiguity Measure for Estimating Perceptual Variability in Visual Clustering

Jeon, Hyeon; Quadri, Ghulam Jilani; Lee, Hyunwook; Rosen, Paul; Szafir, Danielle Albers; Seo, Jinwook

arXiv:2308.00284 (cs)

[Submitted on 1 Aug 2023 (v1), last revised 11 Aug 2023 (this version, v2)]

Abstract: Visual clustering is a common perceptual task in scatterplots that supports diverse analytics tasks (e.g., cluster identification). However, even for the same scatterplot, the way clusters are perceived (i.e., how visual clustering is conducted) can differ because of differences among individuals and ambiguous cluster boundaries. Although such perceptual variability casts doubt on the reliability of data analysis based on visual clustering, we lack a systematic way to efficiently assess this variability. In this research, we study perceptual variability in conducting visual clustering, which we call Cluster Ambiguity. To this end, we introduce CLAMS, a data-driven visual quality measure for automatically predicting cluster ambiguity in monochrome scatterplots. We first conduct a qualitative study to identify key factors that affect the visual separation of clusters (e.g., proximity or size difference between clusters). Based on the study findings, we deploy a regression module that estimates the human-judged separability of two clusters. CLAMS then predicts cluster ambiguity by analyzing the aggregated separability estimates that the module produces for all pairs of clusters. CLAMS outperforms widely used clustering techniques in predicting ground truth cluster ambiguity and performs on par with human annotators. We conclude by presenting two applications for optimizing and benchmarking data mining techniques using CLAMS. An interactive demo of CLAMS is available at clusterambiguity.dev.
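
The abstract describes a two-stage pipeline: score the separability of each pair of clusters, then aggregate those scores into one ambiguity value. It does not state the aggregation rule, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes that each separability score is a probability in [0, 1], that a pair is most ambiguous when its score is near 0.5 (modelled here with binary entropy), and that the per-pair values are averaged. The names `binary_entropy`, `cluster_ambiguity`, and the `separability` callback are hypothetical stand-ins; in particular, `separability` stands in for the regression module.

```python
import math
from itertools import combinations
from typing import Callable, Sequence, TypeVar

C = TypeVar("C")  # any representation of a cluster (label, point set, ...)


def binary_entropy(p: float) -> float:
    """Entropy in bits of a Bernoulli(p) outcome: 0 at p in {0, 1}, 1 at p = 0.5."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))


def cluster_ambiguity(
    clusters: Sequence[C],
    separability: Callable[[C, C], float],
) -> float:
    """Average per-pair ambiguity over all unordered cluster pairs.

    ASSUMPTION: `separability(a, b)` returns a score in [0, 1] and stands in
    for the regression module; the entropy-then-mean aggregation is an
    illustrative choice, not taken from the abstract.
    """
    pairs = list(combinations(clusters, 2))
    if not pairs:
        return 0.0  # fewer than two clusters: nothing to confuse
    return sum(binary_entropy(separability(a, b)) for a, b in pairs) / len(pairs)


if __name__ == "__main__":
    # Hypothetical separability scores for three clusters A, B, C.
    scores = {("A", "B"): 0.95, ("A", "C"): 0.5, ("B", "C"): 0.9}
    print(cluster_ambiguity(["A", "B", "C"], lambda a, b: scores[(a, b)]))
```

Under these assumptions, a scatterplot whose cluster pairs are all clearly separated or clearly merged scores near 0, while one whose pairs split viewers evenly scores near 1.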

Comments:	IEEE Transactions on Visualization and Computer Graphics (TVCG) (Proc. IEEE VIS 2023); equally contributed by Hyeon Jeon and Ghulam Jilani Quadri
Subjects:	Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
Cite as:	arXiv:2308.00284 [cs.HC]
	(or arXiv:2308.00284v2 [cs.HC] for this version)
	https://doi.org/10.48550/arXiv.2308.00284

Submission history

From: Hyeon Jeon
[v1] Tue, 1 Aug 2023 04:46:35 UTC (4,725 KB)
[v2] Fri, 11 Aug 2023 04:43:16 UTC (4,725 KB)
