LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks

Wang, Hanqing; **, Bowen; Wang, Shuo; Han, Xu; Chen, Yun; Liu, Zhiyuan; Sun, Maosong

Computer Science > Computation and Language

arXiv:2402.11455 (cs)

[Submitted on 18 Feb 2024]

Title:LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks

Authors:Hanqing Wang, Bowen **, Shuo Wang, Xu Han, Yun Chen, Zhiyuan Liu, Maosong Sun

View PDF HTML (experimental)

Abstract:LoRA employs lightweight modules to customize large language models (LLMs) for each downstream task or domain, where different learned additional modules represent diverse skills. Combining existing LoRAs to address new tasks can enhance the reusability of learned LoRAs, particularly beneficial for tasks with limited annotated data. Most prior works on LoRA combination primarily rely on task-level weights for each involved LoRA, making different examples and tokens share the same LoRA weights. However, in generative tasks, different tokens may necessitate diverse skills to manage. Taking the Chinese math task as an example, understanding the problem description may depend more on the Chinese LoRA, while the calculation part may rely more on the math LoRA. To this end, we propose LoRA-Flow, which utilizes dynamic weights to adjust the impact of different LoRAs. The weights at each step are determined by a fusion gate with extremely few parameters, which can be learned with only 200 training examples. Experiments across six generative tasks demonstrate that our method consistently outperforms baselines with task-level fusion weights. This underscores the necessity of introducing dynamic fusion weights for LoRA combination.

Comments:	Work in Progress
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2402.11455 [cs.CL]
	(or arXiv:2402.11455v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2402.11455

Submission history

From: Shuo Wang [view email]
[v1] Sun, 18 Feb 2024 04:41:25 UTC (680 KB)

Computer Science > Computation and Language

Title:LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators