Progressive Skeletonization: Trimming more fat from a network at initialization

de Jorge, Pau; Sanyal, Amartya; Behl, Harkirat S.; Torr, Philip H. S.; Rogez, Gregory; Dokania, Puneet K.

Computer Science > Computer Vision and Pattern Recognition

arXiv:2006.09081v2 (cs)

[Submitted on 16 Jun 2020 (v1), revised 23 Jun 2020 (this version, v2), latest version 19 Mar 2021 (v5)]

Title:Progressive Skeletonization: Trimming more fat from a network at initialization

Authors:Pau de Jorge, Amartya Sanyal, Harkirat S. Behl, Philip H.S. Torr, Gregory Rogez, Puneet K. Dokania

View PDF

Abstract:Recent studies have shown that skeletonization (pruning parameters) of networks at initialization provides all the practical benefits of sparsity both at inference and training time, while only marginally degrading their performance. However, we observe that beyond a certain level of sparsity (approx 95%), these approaches fail to preserve the network performance, and to our surprise, in many cases perform even worse than trivial random pruning. To this end, we propose to find a skeletonized network with maximum foresight connection sensitivity (FORCE). Intuitively, out of all possible sub-networks, we propose to find the one whose connections would have a maximum impact on the loss when perturbed. Our approximate solution to maximize the FORCE, progressively prunes connections of a given network at initialization. This allows parameters that were unimportant at earlier stages of skeletonization to become important at later stages. In many cases, our approach enables us to remove up to 99.9% parameters, while kee** networks trainable and providing significantly better performance than recent approaches. We demonstrate the effectiveness of our approach at various levels of sparsity (from medium to extreme) through extensive experiments and analysis.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2006.09081 [cs.CV]
	(or arXiv:2006.09081v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2006.09081

Submission history

From: Pau de Jorge Aranda [view email]
[v1] Tue, 16 Jun 2020 11:32:47 UTC (455 KB)
[v2] Tue, 23 Jun 2020 14:41:08 UTC (455 KB)
[v3] Tue, 14 Jul 2020 12:02:15 UTC (455 KB)
[v4] Wed, 21 Oct 2020 13:54:26 UTC (394 KB)
[v5] Fri, 19 Mar 2021 13:06:16 UTC (421 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Progressive Skeletonization: Trimming more fat from a network at initialization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Progressive Skeletonization: Trimming more fat from a network at initialization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators