Quadapter: Adapter for GPT-2 Quantization

Park, Minseop; You, Jaeseong; Nagel, Markus; Chang, Simyung

arXiv:2211.16912 (cs)

[Submitted on 30 Nov 2022]

Abstract: Transformer language models such as GPT-2 are difficult to quantize because of outliers in activations leading to a large quantization error. To adapt to the error, one must use quantization-aware training, which entails a fine-tuning process based on the dataset and the training pipeline identical to those for the original model. Pretrained language models, however, often do not grant access to their datasets and training pipelines, forcing us to rely on arbitrary ones for fine-tuning. In that case, it is observed that quantization-aware training overfits the model to the fine-tuning data. For quantization without overfitting, we introduce a quantization adapter (Quadapter), a small set of parameters that are learned to make activations quantization-friendly by scaling them channel-wise. It keeps the model parameters unchanged. By applying our method to the challenging task of quantizing GPT-2, we demonstrate that it effectively prevents the overfitting and improves the quantization performance.
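The idea of channel-wise scaling can be illustrated with a small toy sketch. It is not the authors' implementation: the function names, the sample activations, the symmetric 8-bit per-tensor quantizer, and the choice of scales (per-channel maximum magnitude rather than learned values) are all assumptions made for illustration. The sketch also assumes the scaling is undone after quantization so that the full-precision computation stays the same, which the abstract does not spell out.

```python
def fake_quantize(values, num_bits=8):
    """Symmetric per-tensor uniform quantization followed by dequantization."""
    max_abs = max(abs(v) for row in values for v in row)
    if max_abs == 0:
        return [list(row) for row in values]
    qmax = 2 ** (num_bits - 1) - 1
    step = max_abs / qmax
    return [[round(v / step) * step for v in row] for row in values]


def scaled_fake_quantize(values, scales, num_bits=8):
    """Divide each channel by its (positive) scale, quantize, then multiply back."""
    scaled = [[v / s for v, s in zip(row, scales)] for row in values]
    quantized = fake_quantize(scaled, num_bits)
    return [[q * s for q, s in zip(row, scales)] for row in quantized]


def mean_squared_error(a, b):
    """Mean squared difference between two equally shaped nested lists."""
    diffs = [(x - y) ** 2 for ra, rb in zip(a, b) for x, y in zip(ra, rb)]
    return sum(diffs) / len(diffs)


# Hypothetical activations: rows are tokens, columns are channels.
# The third channel plays the role of an outlier channel.
activations = [
    [0.5, -0.3, 60.0],
    [0.2, 0.4, -55.0],
    [-0.6, 0.1, 58.0],
]

# Assumption: scales are set to each channel's maximum magnitude.
# In the paper they are learned parameters; this is only a stand-in.
num_channels = len(activations[0])
scales = [max(abs(row[c]) for row in activations) for c in range(num_channels)]

plain_error = mean_squared_error(activations, fake_quantize(activations))
scaled_error = mean_squared_error(
    activations, scaled_fake_quantize(activations, scales)
)

print("error without scaling:", plain_error)
print("error with channel-wise scaling:", scaled_error)
```

In this toy input the outlier channel forces a coarse quantization step on every channel, so the small-magnitude channels lose most of their precision. Scaling each channel into a common range before quantizing lets them share the grid more evenly, and the reconstruction error comes out lower, while no model weights are involved at all.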

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2211.16912 [cs.LG]
	(or arXiv:2211.16912v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2211.16912

Submission history

From: Minseop Park
[v1] Wed, 30 Nov 2022 11:20:33 UTC (926 KB)
