MaNtLE: Model-agnostic Natural Language Explainer

Menon, Rakesh R.; Zaman, Kerem; Srivastava, Shashank

Computer Science > Computation and Language

arXiv:2305.12995 (cs)

[Submitted on 22 May 2023]

Title:MaNtLE: Model-agnostic Natural Language Explainer

Authors:Rakesh R. Menon, Kerem Zaman, Shashank Srivastava

View PDF

Abstract:Understanding the internal reasoning behind the predictions of machine learning systems is increasingly vital, given their rising adoption and acceptance. While previous approaches, such as LIME, generate algorithmic explanations by attributing importance to input features for individual examples, recent research indicates that practitioners prefer examining language explanations that explain sub-groups of examples. In this paper, we introduce MaNtLE, a model-agnostic natural language explainer that analyzes multiple classifier predictions and generates faithful natural language explanations of classifier rationale for structured classification tasks. MaNtLE uses multi-task training on thousands of synthetic classification tasks to generate faithful explanations. Simulated user studies indicate that, on average, MaNtLE-generated explanations are at least 11% more faithful compared to LIME and Anchors explanations across three tasks. Human evaluations demonstrate that users can better predict model behavior using explanations from MaNtLE compared to other techniques

Comments:	17 pages, 13 figures, 6 tables
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2305.12995 [cs.CL]
	(or arXiv:2305.12995v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2305.12995

Submission history

From: Rakesh R Menon [view email]
[v1] Mon, 22 May 2023 12:58:06 UTC (8,966 KB)

Computer Science > Computation and Language

Title:MaNtLE: Model-agnostic Natural Language Explainer

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:MaNtLE: Model-agnostic Natural Language Explainer

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators