AdaCL:Adaptive Continual Learning

Yildirim, Elif Ceren Gok; Yildirim, Murat Onur; Kilickaya, Mert; Vanschoren, Joaquin

Computer Science > Machine Learning

arXiv:2303.13113 (cs)

[Submitted on 23 Mar 2023 (v1), last revised 1 Jul 2024 (this version, v3)]

Title:AdaCL:Adaptive Continual Learning

Authors:Elif Ceren Gok Yildirim, Murat Onur Yildirim, Mert Kilickaya, Joaquin Vanschoren

View PDF HTML (experimental)

Abstract:Class-Incremental Learning aims to update a deep classifier to learn new categories while maintaining or improving its accuracy on previously observed classes. Common methods to prevent forgetting previously learned classes include regularizing the neural network updates and storing exemplars in memory, which come with hyperparameters such as the learning rate, regularization strength, or the number of exemplars. However, these hyperparameters are usually only tuned at the start and then kept fixed throughout the learning sessions, ignoring the fact that newly encountered tasks may have varying levels of novelty or difficulty. This study investigates the necessity of hyperparameter `adaptivity' in Class-Incremental Learning: the ability to dynamically adjust hyperparameters such as the learning rate, regularization strength, and memory size according to the properties of the new task at hand. We propose AdaCL, a Bayesian Optimization-based approach to automatically and efficiently determine the optimal values for those parameters with each learning task. We show that adapting hyperpararmeters on each new task leads to improvement in accuracy, forgetting and memory. Code is available at this https URL.

Comments:	Published in 1st ContinualAI Unconference
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2303.13113 [cs.LG]
	(or arXiv:2303.13113v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2303.13113

Submission history

From: Elif Ceren Gok Yildirim [view email]
[v1] Thu, 23 Mar 2023 09:00:38 UTC (565 KB)
[v2] Fri, 24 Mar 2023 09:23:37 UTC (565 KB)
[v3] Mon, 1 Jul 2024 11:57:06 UTC (1,270 KB)

Computer Science > Machine Learning

Title:AdaCL:Adaptive Continual Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:AdaCL:Adaptive Continual Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators