Unmasking the Imposters: In-Domain Detection of Human vs. Machine-Generated Tweets

Tuck, Bryan E.; Verma, Rakesh M.

Computer Science > Computation and Language

arXiv:2406.17967 (cs)

[Submitted on 25 Jun 2024]

Title:Unmasking the Imposters: In-Domain Detection of Human vs. Machine-Generated Tweets

Authors:Bryan E. Tuck, Rakesh M. Verma

View PDF HTML (experimental)

Abstract:The rapid development of large language models (LLMs) has significantly improved the generation of fluent and convincing text, raising concerns about their misuse on social media platforms. We present a methodology using Twitter datasets to examine the generative capabilities of four LLMs: Llama 3, Mistral, Qwen2, and GPT4o. We evaluate 7B and 8B parameter base-instruction models of the three open-source LLMs and validate the impact of further fine-tuning and "uncensored" versions. Our findings show that "uncensored" models with additional in-domain fine-tuning dramatically reduce the effectiveness of automated detection methods. This study addresses a gap by exploring smaller open-source models and the effects of "uncensoring," providing insights into how fine-tuning and content moderation influence machine-generated text detection.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2406.17967 [cs.CL]
	(or arXiv:2406.17967v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2406.17967

Submission history

From: Bryan Tuck Tuck [view email]
[v1] Tue, 25 Jun 2024 22:49:17 UTC (2,660 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2024-06

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Computation and Language

Title:Unmasking the Imposters: In-Domain Detection of Human vs. Machine-Generated Tweets

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Unmasking the Imposters: In-Domain Detection of Human vs. Machine-Generated Tweets

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators