In-Context Learning with Many Demonstration Examples

Li, Mukai; Gong, Shansan; Feng, Jiangtao; Xu, Yiheng; Zhang, Jun; Wu, Zhiyong; Kong, Lingpeng

Computer Science > Computation and Language

arXiv:2302.04931 (cs)

[Submitted on 9 Feb 2023]

Title:In-Context Learning with Many Demonstration Examples

Authors:Mukai Li, Shansan Gong, Jiangtao Feng, Yiheng Xu, Jun Zhang, Zhiyong Wu, Lingpeng Kong

View PDF

Abstract:Large pre-training language models (PLMs) have shown promising in-context learning abilities. However, due to the backbone transformer architecture, existing PLMs are bottlenecked by the memory and computational cost when scaling up to a large context size, leaving instruction tuning and in-context learning of many demonstration examples, as well as long-range language modeling under-explored. In this study, we propose a long-range language model EVALM based on an efficient transformer mechanism. EVALM is trained with 8k tokens per batch line and can test up to 256k-lengthed contexts with extrapolation, 128 times to the limit of existing PLMs (e.g. GPT3). Based on EVALM, we scale up the size of examples efficiently in both instruction tuning and in-context learning to explore the boundary of the benefits from more annotated data. Experimental results on a diverse set of tasks show that EVALM achieves 4.1% higher accuracy on average, and the average length of achieving the best accuracy score over tasks is around 12k. We find that in-context learning can achieve higher performance with more demonstrations under many-shot instruction tuning (8k), and further extending the length of instructions (16k) can further improve the upper bound of scaling in-context learning.

Comments:	Preprint, under review
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2302.04931 [cs.CL]
	(or arXiv:2302.04931v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2302.04931

Submission history

From: Mukai Li [view email]
[v1] Thu, 9 Feb 2023 20:53:12 UTC (1,104 KB)

Computer Science > Computation and Language

Title:In-Context Learning with Many Demonstration Examples

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:In-Context Learning with Many Demonstration Examples

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators