Selective In-Context Data Augmentation for Intent Detection using Pointwise V-Information

Lin, Yen-Ting; Papangelis, Alexandros; Kim, Seokhwan; Lee, Sungjin; Hazarika, Devamanyu; Namazifar, Mahdi; Jin, Di; Liu, Yang; Hakkani-Tur, Dilek

arXiv:2302.05096 (cs)

[Submitted on 10 Feb 2023]

Abstract: This work focuses on in-context data augmentation for intent detection. Having found that augmentation via in-context prompting of large pre-trained language models (PLMs) alone does not improve performance, we introduce a novel approach based on PLMs and pointwise V-information (PVI), a metric that can measure the usefulness of a datapoint for training a model. Our method first fine-tunes a PLM on a small seed of training data and then synthesizes new datapoints - utterances that correspond to given intents. It then employs intent-aware filtering, based on PVI, to remove datapoints that are not helpful to the downstream intent classifier. Our method is thus able to leverage the expressive power of large language models to produce diverse training data. Empirical results demonstrate that our method can produce synthetic training data that achieve state-of-the-art performance on three challenging intent detection datasets under few-shot settings (1.28% absolute improvement in 5-shot and 1.18% absolute in 10-shot, on average) and perform on par with the state-of-the-art in full-shot settings (within 0.01% absolute, on average).
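The filtering step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the usual definition of PVI as the gain in log-probability (in bits) that a classifier assigns to the gold label when it sees the input, relative to a classifier trained on an empty input. The function names, the example utterances, the probabilities, and the per-intent thresholds are all hypothetical; in practice the probabilities would come from two fine-tuned classifiers, and the abstract does not state how the thresholds are chosen.

```python
import math


def pvi(p_with_input: float, p_null_input: float) -> float:
    """Pointwise V-information in bits (assumed definition).

    p_with_input: probability of the gold intent from a classifier that sees
                  the utterance.
    p_null_input: probability of the gold intent from a classifier trained
                  and evaluated on an empty input.
    """
    return math.log2(p_with_input) - math.log2(p_null_input)


def intent_aware_filter(candidates, thresholds):
    """Keep synthetic utterances whose PVI exceeds their intent's threshold.

    candidates: iterable of (utterance, intent, p_with_input, p_null_input).
    thresholds: dict mapping intent -> minimum PVI (hypothetical values).
    """
    kept = []
    for utterance, intent, p_with, p_null in candidates:
        if pvi(p_with, p_null) > thresholds[intent]:
            kept.append((utterance, intent))
    return kept


# Hypothetical synthetic utterances with made-up classifier probabilities.
candidates = [
    ("what's my checking balance", "check_balance", 0.90, 0.10),
    ("tell me a joke", "check_balance", 0.05, 0.10),
    ("move $50 to savings", "transfer", 0.60, 0.20),
]
# One threshold per intent, which is what makes the filter "intent-aware".
thresholds = {"check_balance": 0.0, "transfer": 2.0}

kept = intent_aware_filter(candidates, thresholds)
print(kept)
```

A negative PVI, as for the second candidate, means the utterance makes the labelled intent less likely than having no input at all, which is the kind of datapoint the filter is meant to drop. Using a separate threshold per intent lets the filter be stricter for some intents than for others.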

Comments:	Accepted at EACL 2023
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2302.05096 [cs.CL]
	(or arXiv:2302.05096v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2302.05096

Submission history

From: Yen-Ting Lin
[v1] Fri, 10 Feb 2023 07:37:49 UTC (7,935 KB)
