Empirical Analysis of the Strengths and Weaknesses of PEFT Techniques for LLMs

Pu, George; Jain, Anirudh; Yin, Jihan; Kaplan, Russell

arXiv:2304.14999 (cs)

[Submitted on 28 Apr 2023]

Abstract: As foundation models continue to exponentially scale in size, efficient methods of adaptation become increasingly critical. Parameter-efficient fine-tuning (PEFT), a recent class of techniques that require only modifying a small percentage of the model parameters, is currently the most popular method for adapting large language models (LLMs). Several PEFT techniques have recently been proposed with varying tradeoffs. We provide a comprehensive and uniform benchmark of various PEFT techniques across a representative LLM, the FLAN-T5 model, and evaluate model performance across different data scales of classification and generation datasets. Based on this, we provide a framework for choosing the optimal fine-tuning techniques given the task type and data availability. Contrary to popular belief, we also empirically prove that PEFT techniques converge slower than full tuning in low data scenarios, and posit the amount of data required for PEFT methods to both perform well and converge efficiently. Lastly, we further optimize these PEFT techniques by selectively choosing which parts of the model to train, and find that these techniques can be applied with significantly fewer parameters while maintaining and even improving performance.

Comments:	Short paper, ICLR '23 Workshop on Understanding Foundation Models
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2304.14999 [cs.CL]
	(or arXiv:2304.14999v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2304.14999

Submission history

From: George Pu
[v1] Fri, 28 Apr 2023 17:39:49 UTC (75 KB)
