RepQ: Generalizing Quantization-Aware Training for Re-Parametrized Architectures

Prutianova, Anastasiia; Zaytsev, Alexey; Lee, Chung-Kuei; Sun, Fengyu; Koryakovskiy, Ivan

arXiv:2311.05317 (cs)

[Submitted on 9 Nov 2023]

Abstract: Existing neural networks are memory-consuming and computationally intensive, which makes deploying them in resource-constrained environments challenging. However, various methods can improve their efficiency. Two such methods are quantization, a well-known approach to network compression, and re-parametrization, an emerging technique designed to improve model performance. Although both techniques have been studied individually, there has been limited research on their simultaneous application. To address this gap, we propose a novel approach called RepQ, which applies quantization to re-parametrized networks. Our method is based on the insight that the test-stage weights of an arbitrary re-parametrized layer can be represented as a differentiable function of trainable parameters. We enable quantization-aware training by applying quantization on top of this function. RepQ generalizes well to various re-parametrized models and outperforms the baseline LSQ quantization scheme in all experiments.
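
The central idea in the abstract, quantizing a merged weight that is itself a differentiable function of the trainable branch weights, can be illustrated with a toy example. The sketch below is not the authors' implementation. It assumes a 1-D block with a size-3 convolution and a parallel size-1 convolution, a plain uniform quantizer with a fixed step size, and a straight-through gradient estimate. All function names (merge_branches, quantize_to_codes, fake_quantize, conv1d_same, training_forward, ste_backward) are hypothetical, and the blocks and quantizer used in the paper may differ.

```python
# Toy 1-D sketch of "quantize the merged weight". Every name here is
# hypothetical and chosen for illustration; this is not the paper's code.

def merge_branches(w3, w1):
    """Merged test-stage kernel of a size-3 conv plus a parallel size-1 conv.

    The size-1 kernel is zero-padded to size 3 and added, so the merged
    kernel is a differentiable (here linear) function of both branches.
    """
    return [w3[0], w3[1] + w1[0], w3[2]]


def quantize_to_codes(weights, step, bits=4):
    """Uniform symmetric quantization: scale, round, clamp to signed codes."""
    qmin, qmax = -(2 ** (bits - 1)), 2 ** (bits - 1) - 1
    return [max(qmin, min(qmax, round(w / step))) for w in weights]


def fake_quantize(weights, step, bits=4):
    """Quantize and map back to floats, as done during simulated QAT."""
    return [step * c for c in quantize_to_codes(weights, step, bits)]


def conv1d_same(x, kernel):
    """Size-3 cross-correlation with one zero of padding on each side."""
    padded = [0.0] + list(x) + [0.0]
    return [
        sum(k * padded[i + j] for j, k in enumerate(kernel))
        for i in range(len(x))
    ]


def training_forward(x, w3, w1, step, bits=4):
    """Training-time forward pass: merge first, then quantize the result."""
    merged = merge_branches(w3, w1)
    return conv1d_same(x, fake_quantize(merged, step, bits))


def ste_backward(grad_q, w3, w1, step, bits=4):
    """Assumed straight-through estimator for the quantizer.

    The gradient w.r.t. the quantized merged kernel is passed through
    unchanged where the scaled weight lies inside the clamp range and is
    zeroed elsewhere, then routed to each branch through the merge.
    """
    qmin, qmax = -(2 ** (bits - 1)), 2 ** (bits - 1) - 1
    merged = merge_branches(w3, w1)
    grad_merged = [
        g if qmin <= m / step <= qmax else 0.0
        for g, m in zip(grad_q, merged)
    ]
    grad_w3 = list(grad_merged)    # d merged[i] / d w3[i] = 1
    grad_w1 = [grad_merged[1]]     # w1 only contributes to the centre tap
    return grad_w3, grad_w1


if __name__ == "__main__":
    w3, w1, step = [0.3, 0.26, -0.2], [0.26], 0.1
    print(quantize_to_codes(merge_branches(w3, w1), step))
    print(training_forward([1.0, 0.0, 0.0], w3, w1, step))
    print(ste_backward([1.0, 2.0, 3.0], w3, w1, step))
```

In this sketch the single kernel deployed at test time is exactly the quantized merged kernel used in the training forward pass, while gradients still reach both branch weights through the merge function.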

Comments:	BMVC 2023 (Oral)
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2311.05317 [cs.LG]
	(or arXiv:2311.05317v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.05317

Submission history

From: Anastasiia Prutianova
[v1] Thu, 9 Nov 2023 12:25:39 UTC (677 KB)
