ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections

Bini, Massimo; Roth, Karsten; Akata, Zeynep; Khoreva, Anna

Computer Science > Machine Learning

arXiv:2405.20271 (cs)

[Submitted on 30 May 2024]

Title:ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections

Authors:Massimo Bini, Karsten Roth, Zeynep Akata, Anna Khoreva

View PDF HTML (experimental)

Abstract:Parameter-efficient finetuning (PEFT) has become ubiquitous to adapt foundation models to downstream task requirements while retaining their generalization ability. However, the amount of additionally introduced parameters and compute for successful adaptation and hyperparameter searches can explode quickly, especially when deployed at scale to serve numerous individual requests. To ensure effective, parameter-efficient, and hyperparameter-robust adaptation, we propose the ETHER transformation family, which performs Efficient fineTuning via HypErplane Reflections. By design, ETHER transformations require a minimal number of parameters, are less likely to deteriorate model performance, and exhibit robustness to hyperparameter and learning rate choices. In particular, we introduce ETHER and its relaxation ETHER+, which match or outperform existing PEFT methods with significantly fewer parameters ($\sim$$10$-$100$ times lower than LoRA or OFT) across multiple image synthesis and natural language tasks without exhaustive hyperparameter tuning. Finally, we investigate the recent emphasis on Hyperspherical Energy retention for adaptation and raise questions on its practical utility. The code is available at this https URL.

Comments:	Accepted to ICML 2024. Code available at this https URL
Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2405.20271 [cs.LG]
	(or arXiv:2405.20271v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.20271

Submission history

From: Massimo Bini [view email]
[v1] Thu, 30 May 2024 17:26:02 UTC (6,556 KB)

Computer Science > Machine Learning

Title:ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators