Retrieval of Soft Prompt Enhances Zero-Shot Task Generalization

Ye, Seonghyeon; Jang, Joel; Kim, Doyoung; Jo, Yongrae; Seo, Minjoon

Computer Science > Computation and Language

arXiv:2210.03029v2 (cs)

[Submitted on 6 Oct 2022 (v1), revised 11 Oct 2022 (this version, v2), latest version 16 Oct 2023 (v4)]

Title:Retrieval of Soft Prompt Enhances Zero-Shot Task Generalization

Authors:Seonghyeon Ye, Joel Jang, Doyoung Kim, Yongrae Jo, Minjoon Seo

View PDF

Abstract:During zero-shot inference with language models (LMs), using hard prompts alone may not be able to fully describe the target task. In this paper, we explore how the retrieval of soft prompts obtained through prompt tuning can assist hard prompts in zero-shot task generalization. Specifically, we train soft prompt embeddings for each prompt through prompt tuning, store the samples of the training instances (hard prompt + input instances) mapped with the prompt embeddings, and retrieve the corresponding prompt embedding of the training instance closest to the query instance during inference. Results show this simple approach enhances the performance of T0 on unseen tasks by outperforming it on 10 out of 11 datasets as well as improving the mean accuracy of T0 on BIG-bench benchmark by 2.39% points while adding only 0.007% additional parameters. Also, using interpolation of multiple embeddings and variance-based ranking further improve accuracy and robustness to different evaluation prompts, widening the performance gap. Finally, we find that retrieving source embeddings trained on similar answer choice formats is more important than those on similar task types. Model checkpoints and code implementation are available at this https URL.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2210.03029 [cs.CL]
	(or arXiv:2210.03029v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.03029

Submission history

From: Seonghyeon Ye [view email]
[v1] Thu, 6 Oct 2022 16:26:03 UTC (6,287 KB)
[v2] Tue, 11 Oct 2022 13:33:25 UTC (6,287 KB)
[v3] Fri, 13 Oct 2023 11:58:30 UTC (12,793 KB)
[v4] Mon, 16 Oct 2023 04:57:33 UTC (12,794 KB)

Computer Science > Computation and Language

Title:Retrieval of Soft Prompt Enhances Zero-Shot Task Generalization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Retrieval of Soft Prompt Enhances Zero-Shot Task Generalization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators