Who wants accurate models? Arguing for a different metrics to take classification models seriously

Cabitza, Federico; Campagner, Andrea

arXiv:1910.09246 (cs)

[Submitted on 21 Oct 2019 (v1), last revised 22 Oct 2019 (this version, v2)]

Abstract: As AI-based decision support becomes more widely available, there is a growing need for the certification of these systems by both AI manufacturers and notified bodies, as well as for their pragmatic (real-world) validation. This calls for meaningful and informative ways to assess the performance of AI systems in clinical practice. Common metrics (such as accuracy scores and areas under the ROC curve) have known problems, and they do not take into account important information about the preferences of clinicians and the needs of their specialist practice, such as the likelihood and impact of errors and the complexity of cases. In this paper, we present a new accuracy measure, the H-accuracy (Ha), which we claim is more informative in the medical domain (and in others with similar needs) because of the elements it encompasses. We also prove that the H-accuracy is a generalization of the balanced accuracy and establish a relation between the H-accuracy and the Net Benefit. Finally, we report on two user studies that illustrate the descriptive power of the Ha score and show how complementary and differently informative measures can be derived from its formulation (a Python script to compute Ha is also made available).
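For orientation, the two established quantities that the abstract relates Ha to can be sketched in a few lines of Python. This is not the H-accuracy itself, whose formula is not given in the abstract, and it is not the authors' script; the function names and the toy labels below are illustrative assumptions. The sketch uses the standard definitions: balanced accuracy as the mean of sensitivity and specificity, and Net Benefit as TP/N - (FP/N) * p_t / (1 - p_t) for a chosen threshold probability p_t.

```python
# Illustrative sketch only: plain accuracy, balanced accuracy, and Net Benefit
# for a binary classifier. It does NOT compute the H-accuracy (Ha).

def confusion_counts(y_true, y_pred):
    """Return (tp, fp, tn, fn) for binary labels coded as 0/1."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, fp, tn, fn


def accuracy(y_true, y_pred):
    """Fraction of cases classified correctly."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)
    return (tp + tn) / (tp + fp + tn + fn)


def balanced_accuracy(y_true, y_pred):
    """Mean of sensitivity (TPR) and specificity (TNR).

    Assumes both classes occur at least once in y_true.
    """
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return (sensitivity + specificity) / 2


def net_benefit(y_true, y_pred, threshold):
    """Net Benefit at threshold probability p_t: TP/N - FP/N * p_t / (1 - p_t)."""
    if not 0 < threshold < 1:
        raise ValueError("threshold must lie strictly between 0 and 1")
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)
    n = tp + fp + tn + fn
    return tp / n - (fp / n) * (threshold / (1 - threshold))


# Hypothetical toy data: 3 positive and 7 negative cases.
y_true = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]

print("accuracy:         ", accuracy(y_true, y_pred))
print("balanced accuracy:", balanced_accuracy(y_true, y_pred))
print("net benefit @0.2: ", net_benefit(y_true, y_pred, 0.2))
```

On imbalanced data such as this toy set, plain accuracy and balanced accuracy diverge because the former is dominated by the majority class. Neither quantity encodes case complexity or clinician preferences, which is the gap the abstract says Ha is meant to address.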

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1910.09246 [cs.LG]
	(or arXiv:1910.09246v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1910.09246

Submission history

From: Federico Cabitza
[v1] Mon, 21 Oct 2019 10:04:50 UTC (4,767 KB)
[v2] Tue, 22 Oct 2019 12:32:56 UTC (4,767 KB)
