Exploiting Sparsity in Pruned Neural Networks to Optimize Large Model Training

Singh, Siddharth; Bhatele, Abhinav

Computer Science > Machine Learning

arXiv:2302.05045 (cs)

[Submitted on 10 Feb 2023 (v1), last revised 14 May 2023 (this version, v3)]

Title:Exploiting Sparsity in Pruned Neural Networks to Optimize Large Model Training

Authors:Siddharth Singh, Abhinav Bhatele

View PDF

Abstract:Parallel training of neural networks at scale is challenging due to significant overheads arising from communication. Recently, deep learning researchers have developed a variety of pruning algorithms that are capable of pruning (i.e. setting to zero) 80-90% of the parameters in a neural network to yield sparse subnetworks that equal the accuracy of the unpruned parent network. In this work, we propose a novel approach that exploits these sparse subnetworks to optimize the memory utilization and communication in two popular algorithms for parallel deep learning namely -- data and inter-layer parallelism. We integrate our approach into AxoNN, a highly scalable framework for parallel deep learning that relies on data and inter-layer parallelism, and demonstrate the reduction in communication time and memory utilization. On 512 NVIDIA V100 GPUs, our optimizations reduce the memory consumption of a 2.7 billion parameter model by 74%, and the total communication time by 40%, thus providing an overall speedup of 34% over AxoNN, 32% over DeepSpeed-3D and 46% over Sputnik, a sparse matrix computation baseline.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
Cite as:	arXiv:2302.05045 [cs.LG]
	(or arXiv:2302.05045v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2302.05045

Submission history

From: Abhinav Bhatele [view email]
[v1] Fri, 10 Feb 2023 04:22:25 UTC (437 KB)
[v2] Sat, 11 Mar 2023 04:58:04 UTC (409 KB)
[v3] Sun, 14 May 2023 04:14:41 UTC (289 KB)

Computer Science > Machine Learning

Title:Exploiting Sparsity in Pruned Neural Networks to Optimize Large Model Training

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Exploiting Sparsity in Pruned Neural Networks to Optimize Large Model Training

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators