GlanceNets: Interpretabile, Leak-proof Concept-based Models

Marconato, Emanuele; Passerini, Andrea; Teso, Stefano

Computer Science > Machine Learning

arXiv:2205.15612 (cs)

[Submitted on 31 May 2022 (v1), last revised 18 Oct 2022 (this version, v2)]

Title:GlanceNets: Interpretabile, Leak-proof Concept-based Models

Authors:Emanuele Marconato, Andrea Passerini, Stefano Teso

View PDF

Abstract:There is growing interest in concept-based models (CBMs) that combine high-performance and interpretability by acquiring and reasoning with a vocabulary of high-level concepts. A key requirement is that the concepts be interpretable. Existing CBMs tackle this desideratum using a variety of heuristics based on unclear notions of interpretability, and fail to acquire concepts with the intended semantics. We address this by providing a clear definition of interpretability in terms of alignment between the model's representation and an underlying data generation process, and introduce GlanceNets, a new CBM that exploits techniques from disentangled representation learning and open-set recognition to achieve alignment, thus improving the interpretability of the learned concepts. We show that GlanceNets, paired with concept-level supervision, achieve better alignment than state-of-the-art approaches while preventing spurious information from unintendedly leaking into the learned concepts.

Comments:	36th Conference on Neural Information Processing Systems (NeurIPS 2022)
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2205.15612 [cs.LG]
	(or arXiv:2205.15612v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2205.15612

Submission history

From: Emanuele Marconato [view email]
[v1] Tue, 31 May 2022 08:53:53 UTC (10,544 KB)
[v2] Tue, 18 Oct 2022 07:02:50 UTC (11,045 KB)

Computer Science > Machine Learning

Title:GlanceNets: Interpretabile, Leak-proof Concept-based Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:GlanceNets: Interpretabile, Leak-proof Concept-based Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators