Measuring AI Systems Beyond Accuracy

Turri, Violet; Dzombak, Rachel; Heim, Eric; VanHoudnos, Nathan; Palat, Jay; Sinha, Anusha

Computer Science > Software Engineering

arXiv:2204.04211 (cs)

[Submitted on 7 Apr 2022]

Title:Measuring AI Systems Beyond Accuracy

Authors:Violet Turri, Rachel Dzombak, Eric Heim, Nathan VanHoudnos, Jay Palat, Anusha Sinha

View PDF

Abstract:Current test and evaluation (T&E) methods for assessing machine learning (ML) system performance often rely on incomplete metrics. Testing is additionally often siloed from the other phases of the ML system lifecycle. Research investigating cross-domain approaches to ML T&E is needed to drive the state of the art forward and to build an Artificial Intelligence (AI) engineering discipline. This paper advocates for a robust, integrated approach to testing by outlining six key questions for guiding a holistic T&E strategy.

Comments:	8 pages, Presented at 2022 AAAI Spring Symposium Series Workshop on AI Engineering: Creating Scalable, Human-Centered and Robust AI Systems
Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2204.04211 [cs.SE]
	(or arXiv:2204.04211v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2204.04211

Submission history

From: Violet Turri [view email]
[v1] Thu, 7 Apr 2022 17:09:07 UTC (387 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2022-04

Change to browse by:

cs
cs.AI
cs.SE

References & Citations

export BibTeX citation

Computer Science > Software Engineering

Title:Measuring AI Systems Beyond Accuracy

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:Measuring AI Systems Beyond Accuracy

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators