Towards Redundancy-Free Sub-networks in Continual Learning

Chen, Cheng; Song, Jingkuan; Gao, LianLi; Shen, Heng Tao

arXiv:2312.00840 (cs)

[Submitted on 1 Dec 2023 (v1), last revised 11 Jan 2024 (this version, v2)]

Abstract: Catastrophic Forgetting (CF) is a prominent issue in continual learning. Parameter isolation addresses this challenge by masking a sub-network for each task to mitigate interference with old tasks. However, these sub-networks are constructed based on weight magnitude, which does not necessarily correspond to the importance of weights, so unimportant weights are retained and the resulting sub-networks are redundant. To overcome this limitation, inspired by the information bottleneck, which removes redundancy between adjacent network layers, we propose the Information Bottleneck Masked sub-network (IBM) to eliminate redundancy within sub-networks. Specifically, IBM accumulates valuable information into essential weights to construct redundancy-free sub-networks, which not only mitigates CF effectively by freezing the sub-networks but also facilitates training on new tasks through the transfer of valuable knowledge. Additionally, IBM decomposes hidden representations to automate the construction process and make it flexible. Extensive experiments demonstrate that IBM consistently outperforms state-of-the-art methods. Notably, IBM surpasses the state-of-the-art parameter isolation method with a 70% reduction in the number of parameters within sub-networks and an 80% decrease in training time.
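
To make the baseline concrete, the following is a minimal sketch of magnitude-based parameter isolation as the abstract describes it: the largest-magnitude weights form a task's sub-network, and that sub-network is frozen when a later task trains. It is not the IBM method, whose details are not given here. The function names (`magnitude_mask`, `masked_update`), the flat weight list, and the `keep_ratio` and `lr` parameters are assumptions made for illustration only.

```python
def magnitude_mask(weights, keep_ratio):
    """Return a 0/1 mask keeping the largest-magnitude fraction of weights.

    Hypothetical helper: selection is by absolute value only, which is the
    criterion the abstract argues does not necessarily reflect importance.
    """
    k = max(1, round(keep_ratio * len(weights)))
    # Indices ordered by descending absolute value (stable for ties).
    top = sorted(range(len(weights)), key=lambda i: -abs(weights[i]))[:k]
    mask = [0] * len(weights)
    for i in top:
        mask[i] = 1
    return mask


def masked_update(weights, grads, frozen, lr=0.1):
    """One SGD step that leaves weights frozen by earlier tasks untouched."""
    return [w if f else w - lr * g for w, g, f in zip(weights, grads, frozen)]


weights = [0.9, -0.05, 0.4, -0.7, 0.01]

# Task 1 claims the top 40% of weights by magnitude as its sub-network.
task1_mask = magnitude_mask(weights, keep_ratio=0.4)
print(task1_mask)  # [1, 0, 0, 1, 0]

# Task 2 trains with task 1's sub-network frozen, so those weights cannot drift.
updated = masked_update(weights, grads=[1.0] * 5, frozen=task1_mask)
print(updated)
```

In this toy setting, freezing guarantees the two selected weights keep their task 1 values, which is how isolation avoids forgetting; the cost is that every weight kept on magnitude alone is unavailable for later updates, whether or not it mattered.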

Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2312.00840 [cs.LG]
	(or arXiv:2312.00840v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2312.00840

Submission history

From: Cheng Chen
[v1] Fri, 1 Dec 2023 02:29:52 UTC (1,524 KB)
[v2] Thu, 11 Jan 2024 14:44:13 UTC (1,318 KB)
