CRISP: Hybrid Structured Sparsity for Class-aware Model Pruning

Aggarwal, Shivam; Binici, Kuluhan; Mitra, Tulika

Computer Science > Computer Vision and Pattern Recognition

arXiv:2311.14272 (cs)

[Submitted on 24 Nov 2023 (v1), last revised 18 Mar 2024 (this version, v2)]

Title:CRISP: Hybrid Structured Sparsity for Class-aware Model Pruning

Authors:Shivam Aggarwal, Kuluhan Binici, Tulika Mitra

View PDF HTML (experimental)

Abstract:Machine learning pipelines for classification tasks often train a universal model to achieve accuracy across a broad range of classes. However, a typical user encounters only a limited selection of classes regularly. This disparity provides an opportunity to enhance computational efficiency by tailoring models to focus on user-specific classes. Existing works rely on unstructured pruning, which introduces randomly distributed non-zero values in the model, making it unsuitable for hardware acceleration. Alternatively, some approaches employ structured pruning, such as channel pruning, but these tend to provide only minimal compression and may lead to reduced model accuracy. In this work, we propose CRISP, a novel pruning framework leveraging a hybrid structured sparsity pattern that combines both fine-grained N:M structured sparsity and coarse-grained block sparsity. Our pruning strategy is guided by a gradient-based class-aware saliency score, allowing us to retain weights crucial for user-specific classes. CRISP achieves high accuracy with minimal memory consumption for popular models like ResNet-50, VGG-16, and MobileNetV2 on ImageNet and CIFAR-100 datasets. Moreover, CRISP delivers up to 14$\times$ reduction in latency and energy consumption compared to existing pruning methods while maintaining comparable accuracy. Our code is available at this https URL.

Comments:	6 pages, accepted in Design, Automation & Test in Europe Conference & Exhibition (DATE) 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Hardware Architecture (cs.AR); Machine Learning (cs.LG)
Cite as:	arXiv:2311.14272 [cs.CV]
	(or arXiv:2311.14272v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2311.14272

Submission history

From: Shivam Aggarwal [view email]
[v1] Fri, 24 Nov 2023 04:16:32 UTC (11,734 KB)
[v2] Mon, 18 Mar 2024 08:15:48 UTC (11,734 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:CRISP: Hybrid Structured Sparsity for Class-aware Model Pruning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:CRISP: Hybrid Structured Sparsity for Class-aware Model Pruning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators