Rapid-INR: Storage Efficient CPU-free DNN Training Using Implicit Neural Representation

Chen, Hanqiu; Yang, Hang; Fitzmeyer, Stephen; Hao, Cong

Computer Science > Computer Vision and Pattern Recognition

arXiv:2306.16699 (cs)

[Submitted on 29 Jun 2023 (v1), last revised 23 Apr 2024 (this version, v3)]

Abstract: Implicit Neural Representation (INR) is an innovative approach for representing complex shapes or objects without explicitly defining their geometry or surface structure. Instead, INR represents objects as continuous functions. Previous research has demonstrated the effectiveness of using neural networks as INR for image compression, showcasing comparable performance to traditional methods such as JPEG. However, INR holds potential for various applications beyond image compression. This paper introduces Rapid-INR, a novel approach that utilizes INR for encoding and compressing images, thereby accelerating neural network training in computer vision tasks. Our methodology involves storing the whole dataset directly in INR format on a GPU, mitigating the significant data communication overhead between the CPU and GPU during training. Additionally, the decoding process from INR to RGB format is highly parallelized and executed on-the-fly. To further enhance compression, we propose iterative and dynamic pruning, as well as layer-wise quantization, building upon previous work. We evaluate our framework on the image classification task, utilizing the ResNet-18 backbone network and three commonly used datasets with varying image sizes. Rapid-INR reduces memory consumption to only about 5% of the original dataset size in RGB format and achieves a maximum 6$\times$ speedup over the PyTorch training pipeline, as well as a maximum 1.2$\times$ speedup over the DALI training pipeline, with only a marginal decrease in accuracy. Importantly, Rapid-INR can be readily applied to other computer vision tasks and backbone networks with reasonable engineering effort. Our implementation code is publicly available.
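To make the idea of decoding an image from INR format concrete, here is a minimal NumPy sketch. It is not the authors' implementation. It assumes the INR is a small MLP that maps normalized pixel coordinates to RGB values. The sine activations, layer widths, 16-bit weight storage, and all function names (`make_coords`, `init_inr`, `decode`, `param_count`) are illustrative assumptions. The weights are random rather than fitted, so the sketch omits the per-image fitting step as well as the pruning and quantization the abstract mentions.

```python
import numpy as np


def make_coords(height, width):
    """Return an (H*W, 2) array of pixel coordinates normalized to [-1, 1]."""
    ys = np.linspace(-1.0, 1.0, height)
    xs = np.linspace(-1.0, 1.0, width)
    grid_y, grid_x = np.meshgrid(ys, xs, indexing="ij")
    return np.stack([grid_y.ravel(), grid_x.ravel()], axis=1)


def init_inr(hidden=16, depth=3, seed=0):
    """Randomly initialize a small MLP: 2 inputs -> hidden layers -> 3 outputs.

    Hypothetical sizes; a real INR would be fitted to one specific image.
    """
    rng = np.random.default_rng(seed)
    sizes = [2] + [hidden] * depth + [3]
    params = []
    for n_in, n_out in zip(sizes[:-1], sizes[1:]):
        weight = rng.normal(0.0, 1.0 / np.sqrt(n_in), size=(n_in, n_out))
        bias = np.zeros(n_out)
        params.append((weight, bias))
    return params


def decode(params, height, width):
    """Evaluate the MLP at every pixel coordinate in one batched pass."""
    x = make_coords(height, width)
    for weight, bias in params[:-1]:
        x = np.sin(x @ weight + bias)  # sine activation is an assumption
    weight, bias = params[-1]
    rgb = 1.0 / (1.0 + np.exp(-(x @ weight + bias)))  # squash to [0, 1]
    return rgb.reshape(height, width, 3)


def param_count(params):
    """Total number of stored scalars (weights plus biases)."""
    return sum(weight.size + bias.size for weight, bias in params)


params = init_inr()
image = decode(params, 32, 32)

raw_bytes = 32 * 32 * 3              # 8-bit RGB, one byte per channel
inr_bytes = param_count(params) * 2  # assumes 16-bit weights
print(image.shape, inr_bytes / raw_bytes)
```

Every pixel is computed independently from its coordinates, so the whole image decodes as a few matrix multiplications, which is the kind of workload that parallelizes well on a GPU. The storage ratio printed here depends entirely on the assumed layer sizes and image resolution and should not be read as the roughly 5% figure reported in the abstract.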

Comments:	Accepted by ICCAD 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Machine Learning (cs.LG)
Cite as:	arXiv:2306.16699 [cs.CV]
	(or arXiv:2306.16699v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.16699
Journal reference:	ICCAD 2023

Submission history

From: Hanqiu Chen
[v1] Thu, 29 Jun 2023 05:49:07 UTC (2,123 KB)
[v2] Sun, 20 Aug 2023 20:20:15 UTC (2,123 KB)
[v3] Tue, 23 Apr 2024 23:20:41 UTC (2,124 KB)
