ConDA: Contrastive Domain Adaptation for AI-generated Text Detection

Bhattacharjee, Amrita; Kumarage, Tharindu; Moraffah, Raha; Liu, Huan

arXiv:2309.03992 (cs)

[Submitted on 7 Sep 2023 (v1), last revised 20 Sep 2023 (this version, v2)]

Abstract: Large language models (LLMs) are increasingly used to generate text in a variety of use cases, including journalistic news articles. Because these LLMs can be misused to generate disinformation at scale, it is important to build effective detectors for such AI-generated text. Given the surge in development of new LLMs, acquiring labeled training data for supervised detectors is a bottleneck. However, plenty of unlabeled text data may be available, without information on which generator produced it. In this work, we tackle this data problem in detecting AI-generated news text and frame it as an unsupervised domain adaptation task. Here the domains are the different text generators, i.e., LLMs, and we assume access only to labeled source data and unlabeled target data. We develop a Contrastive Domain Adaptation framework, called ConDA, that blends standard domain adaptation techniques with the representation power of contrastive learning to learn domain-invariant representations that are effective for the final unsupervised detection task. Our experiments demonstrate the effectiveness of our framework, resulting in average performance gains of 31.7% over the best-performing baselines, and performance within a 0.8% margin of a fully supervised detector. All our code and data are available at this https URL.
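
The abstract describes the objective only at a high level: a detector trained on labeled source data, combined with contrastive learning and a domain adaptation term over unlabeled target data. The following is a minimal sketch of how such a combined loss could be assembled, not the authors' implementation. The abstract does not name the contrastive loss or the domain discrepancy measure, so the InfoNCE-style loss and the linear-kernel MMD used here are assumptions, as are all function names, the weights lam_c and lam_d, and the idea that each text has a perturbed "view" serving as its positive pair.

```python
import math


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def cosine(u, v):
    return dot(u, v) / (math.sqrt(dot(u, u)) * math.sqrt(dot(v, v)))


def info_nce(anchors, positives, temperature=0.5):
    """Mean InfoNCE-style loss (assumed, not taken from the abstract).

    Each anchor should be more similar to its own positive (same index)
    than to the positives of the other examples in the batch.
    """
    total = 0.0
    for i, anchor in enumerate(anchors):
        logits = [cosine(anchor, p) / temperature for p in positives]
        m = max(logits)  # subtract the max for a stable log-sum-exp
        log_denominator = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_denominator - logits[i]
    return total / len(anchors)


def mean_vector(rows):
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def linear_mmd(source, target):
    """Squared distance between mean source and mean target embeddings.

    This is MMD with a linear kernel, used here as a stand-in (assumption)
    for whatever discrepancy measure the framework actually uses.
    """
    ms, mt = mean_vector(source), mean_vector(target)
    return sum((a - b) ** 2 for a, b in zip(ms, mt))


def combined_loss(src, src_view, tgt, tgt_view, src_classification_loss,
                  lam_c=1.0, lam_d=1.0):
    """Hypothetical total: supervised source loss + contrastive + discrepancy.

    src / tgt are embeddings of source and target texts; *_view are
    embeddings of perturbed versions of the same texts. Only the source
    side contributes a supervised loss, since target labels are unavailable.
    """
    contrastive = info_nce(src, src_view) + info_nce(tgt, tgt_view)
    discrepancy = linear_mmd(src, tgt)
    return src_classification_loss + lam_c * contrastive + lam_d * discrepancy
```

In this sketch the discrepancy term shrinks as source and target embeddings move toward the same mean, which is one simple way to encourage domain-invariant representations; the contrastive terms need no labels, so they can be computed on the unlabeled target data as well.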

Comments:	Camera-ready for IJCNLP-AACL 2023 main track
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2309.03992 [cs.CL]
	(or arXiv:2309.03992v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2309.03992

Submission history

From: Amrita Bhattacharjee
[v1] Thu, 7 Sep 2023 19:51:30 UTC (6,003 KB)
[v2] Wed, 20 Sep 2023 22:17:30 UTC (5,784 KB)
