DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning

Zhou, Zijian; Lin, Xiaoqiang; Xu, Xinyi; Prakash, Alok; Rus, Daniela; Low, Bryan Kian Hsiang

Computer Science > Computation and Language

arXiv:2405.14899 (cs)

[Submitted on 22 May 2024]

Title:DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning

Authors:Zijian Zhou, Xiaoqiang Lin, Xinyi Xu, Alok Prakash, Daniela Rus, Bryan Kian Hsiang Low

View PDF

Abstract:In-context learning (ICL) allows transformer-based language models that are pre-trained on general text to quickly learn a specific task with a few "task demonstrations" without updating their parameters, significantly boosting their flexibility and generality. ICL possesses many distinct characteristics from conventional machine learning, thereby requiring new approaches to interpret this learning paradigm. Taking the viewpoint of recent works showing that transformers learn in context by formulating an internal optimizer, we propose an influence function-based attribution technique, DETAIL, that addresses the specific characteristics of ICL. We empirically verify the effectiveness of our approach for demonstration attribution while being computationally efficient. Leveraging the results, we then show how DETAIL can help improve model performance in real-world scenarios through demonstration reordering and curation. Finally, we experimentally prove the wide applicability of DETAIL by showing our attribution scores obtained on white-box models are transferable to black-box models in improving model performance.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2405.14899 [cs.CL]
	(or arXiv:2405.14899v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2405.14899

Submission history

From: Zijian Zhou [view email]
[v1] Wed, 22 May 2024 15:52:52 UTC (613 KB)

Computer Science > Computation and Language

Title:DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators