PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation

Sahu, Gaurav; Vechtomova, Olga; Bahdanau, Dzmitry; Laradji, Issam H.

arXiv:2310.14192 (cs)

[Submitted on 22 Oct 2023]

Abstract: Data augmentation is a widely used technique to address the problem of text classification when there is a limited amount of training data. Recent work often tackles this problem using large language models (LLMs) like GPT3 that can generate new examples given already available ones. In this work, we propose a method to generate more helpful augmented data by utilizing the LLM's abilities to follow instructions and perform few-shot classification. Our specific PromptMix method consists of two steps: 1) generate challenging text augmentations near class boundaries; however, generating borderline examples increases the risk of false positives in the dataset, so we 2) relabel the text augmentations using a prompting-based LLM classifier to enhance the correctness of labels in the generated data. We evaluate the proposed method in challenging 2-shot and zero-shot settings on four text classification datasets: Banking77, TREC6, Subjectivity (SUBJ), and Twitter Complaints. Our experiments show that generating and, crucially, relabeling borderline examples facilitates the transfer of knowledge of a massive LLM like GPT3.5-turbo into smaller and cheaper classifiers like DistilBERT$_{base}$ and BERT$_{base}$. Furthermore, 2-shot PromptMix outperforms multiple 5-shot data augmentation methods on the four datasets. Our code is available at this https URL.
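
The two steps named in the abstract (generate borderline examples, then relabel them with a prompting-based classifier) can be illustrated with a minimal sketch. This is not the authors' released code: the prompt wording, the function names (`build_generation_prompt`, `build_relabel_prompt`, `relabel`, `promptmix_sketch`), and the `llm` callable are all hypothetical, and the sketch assumes the LLM is exposed as a plain function from a prompt string to a completion string.

```python
from typing import Callable, Dict, List, Tuple

# Assumption: the LLM is any function mapping a prompt string to a completion string.
LLM = Callable[[str], str]


def _format_shots(labels: List[str], seeds: Dict[str, List[str]]) -> str:
    """Render the available seed examples (possibly none, for zero-shot) as text."""
    return "\n".join(f"[{label}] {text}" for label in labels for text in seeds.get(label, []))


def build_generation_prompt(target: str, neighbor: str, seeds: Dict[str, List[str]]) -> str:
    """Step 1 prompt (hypothetical wording): ask for an example near the class boundary."""
    shots = _format_shots([target, neighbor], seeds)
    return (
        f"Examples:\n{shots}\n"
        f"Write one new example that belongs to '{target}' "
        f"but is easy to confuse with '{neighbor}'.\nExample:"
    )


def build_relabel_prompt(text: str, classes: List[str], seeds: Dict[str, List[str]]) -> str:
    """Step 2 prompt (hypothetical wording): few-shot classification of a generated text."""
    shots = _format_shots(classes, seeds)
    return f"Classes: {', '.join(classes)}\n{shots}\nText: {text}\nClass:"


def relabel(llm: LLM, text: str, classes: List[str],
            seeds: Dict[str, List[str]], fallback: str) -> str:
    """Return the LLM's predicted class, or `fallback` if the answer is not a known class."""
    answer = llm(build_relabel_prompt(text, classes, seeds)).strip()
    return answer if answer in classes else fallback


def promptmix_sketch(llm: LLM, classes: List[str], seeds: Dict[str, List[str]],
                     per_pair: int = 1) -> List[Tuple[str, str]]:
    """Generate borderline texts for every ordered class pair, then relabel each one."""
    augmented: List[Tuple[str, str]] = []
    for target in classes:
        for neighbor in classes:
            if neighbor == target:
                continue
            for _ in range(per_pair):
                text = llm(build_generation_prompt(target, neighbor, seeds)).strip()
                if not text:
                    continue  # skip empty generations
                label = relabel(llm, text, classes, seeds, fallback=target)
                augmented.append((text, label))
    return augmented
```

In this sketch the relabeled `(text, label)` pairs would then serve as training data for a smaller classifier; that training step is omitted here. Iterating over every ordered class pair is a simplification chosen for brevity and may be too expensive for datasets with many classes.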

Comments:	Accepted to EMNLP 2023 (Long paper)
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2310.14192 [cs.CL]
	(or arXiv:2310.14192v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2310.14192

Submission history

From: Gaurav Sahu
[v1] Sun, 22 Oct 2023 05:43:23 UTC (276 KB)
