Knowledge-driven Active Learning

Ciravegna, Gabriele; Precioso, Frédéric; Betti, Alessandro; Mottin, Kevin; Gori, Marco

doi:10.1007/978-3-031-43412-9_3

arXiv:2110.08265 (cs)

[Submitted on 15 Oct 2021 (v1), last revised 16 Jun 2023 (this version, v4)]

Abstract: The deployment of Deep Learning (DL) models is still precluded in contexts where the amount of supervised data is limited. To address this issue, active learning strategies aim to minimize the amount of labelled data required to train a DL model. Most active learning strategies are based on uncertain sample selection, and are often restricted to samples lying close to the decision boundary. These techniques are theoretically sound, but understanding the selected samples in terms of their content is not straightforward, which further drives non-experts to regard DL as a black box. For the first time, we propose to take common domain knowledge into consideration and to enable non-expert users to train a model with fewer samples. In our Knowledge-driven Active Learning (KAL) framework, rule-based knowledge is converted into logic constraints, and their violation is checked as a natural guide for sample selection. We show that even simple relationships among data and output classes offer a way to spot predictions for which the model needs supervision. We empirically show that KAL (i) outperforms many active learning strategies, particularly in contexts where domain knowledge is rich, (ii) discovers data distributions lying far from the initial training data, (iii) assures domain experts that the provided knowledge is acquired by the model, (iv) is suitable for regression and object recognition tasks, unlike uncertainty-based strategies, and (v) has a low computational demand.
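The selection idea described in the abstract (convert rules into logic constraints, then pick the samples whose predictions violate them most) can be illustrated with a small sketch. This is not the authors' implementation: the product-style fuzzy relaxation of the rules, the class names, the example probabilities, and the helper names `implies`, `mutually_exclusive`, and `select_for_labelling` are all assumptions made for illustration.

```python
from typing import Callable, Dict, List

# A prediction maps each class name to a score in [0, 1] (multi-label setting).
Prediction = Dict[str, float]
# A rule maps a prediction to a violation degree in [0, 1].
Rule = Callable[[Prediction], float]


def implies(premise: str, conclusion: str) -> Rule:
    """Violation of `premise -> conclusion`, assumed here to be p * (1 - q)."""
    def violation(pred: Prediction) -> float:
        return pred[premise] * (1.0 - pred[conclusion])
    return violation


def mutually_exclusive(a: str, b: str) -> Rule:
    """Violation of `not (a and b)`, assumed here to be the product a * b."""
    def violation(pred: Prediction) -> float:
        return pred[a] * pred[b]
    return violation


def select_for_labelling(preds: List[Prediction], rules: List[Rule], budget: int) -> List[int]:
    """Return the indices of the `budget` predictions with the highest total violation."""
    scores = [sum(rule(p) for rule in rules) for p in preds]
    ranked = sorted(range(len(preds)), key=lambda i: scores[i], reverse=True)
    return ranked[:budget]


# Hypothetical domain knowledge: every dog is an animal; nothing is both dog and cat.
rules = [implies("dog", "animal"), mutually_exclusive("dog", "cat")]

# Hypothetical model outputs on an unlabelled pool.
pool = [
    {"dog": 0.9, "cat": 0.1, "animal": 0.95},  # roughly consistent with both rules
    {"dog": 0.9, "cat": 0.8, "animal": 0.2},   # violates both rules
    {"dog": 0.1, "cat": 0.1, "animal": 0.1},   # roughly consistent with both rules
    {"dog": 0.8, "cat": 0.1, "animal": 0.3},   # violates the implication
]

selected = select_for_labelling(pool, rules, budget=2)
print(selected)
```

In this sketch the two pool items whose predictions contradict the rules are ranked first, so they would be the ones sent to an annotator. Because the score depends only on the model outputs and the rules, the same pattern could in principle be applied wherever rules over the outputs can be written down.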

Comments:	Accepted at ECML2023 for presentation! Check also the github repo: this https URL
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2110.08265 [cs.LG]
	(or arXiv:2110.08265v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2110.08265
Journal reference:	In Machine Learning and Knowledge Discovery in Databases: Research Track. pp. 38-54. vol 14169. Springer (2023)
Related DOI:	https://doi.org/10.1007/978-3-031-43412-9_3

Submission history

From: Gabriele Ciravegna
[v1] Fri, 15 Oct 2021 06:11:53 UTC (6,727 KB)
[v2] Tue, 1 Mar 2022 15:19:49 UTC (14,869 KB)
[v3] Tue, 6 Dec 2022 17:56:09 UTC (33,647 KB)
[v4] Fri, 16 Jun 2023 17:31:33 UTC (10,438 KB)
