Matching Pairs: Attributing Fine-Tuned Models to their Pre-Trained Large Language Models

Foley, Myles; Rawat, Ambrish; Lee, Taesung; Hou, Yufang; Picco, Gabriele; Zizzo, Giulio


arXiv:2306.09308 (cs)

[Submitted on 15 Jun 2023]


Abstract: The wide applicability and adaptability of generative large language models (LLMs) have enabled their rapid adoption. While the pre-trained models can perform many tasks, such models are often fine-tuned to improve their performance on various downstream applications. However, this leads to issues of model license violation, model theft, and copyright infringement. Moreover, recent advances show that generative technology is capable of producing harmful content, which exacerbates the problems of accountability within model supply chains. Thus, we need a method to investigate how a model was trained or how a piece of text was generated, and what the underlying pre-trained base model was. In this paper we take the first step to address this open problem by tracing back the origin of a given fine-tuned LLM to its corresponding pre-trained base model. We consider different knowledge levels and attribution strategies, and find that we can correctly trace back 8 out of the 10 fine-tuned models with our best method.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
Cite as:	arXiv:2306.09308 [cs.CL]
	(or arXiv:2306.09308v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2306.09308

Submission history

From: Myles Foley
[v1] Thu, 15 Jun 2023 17:42:48 UTC (8,222 KB)
