CAVE: Controllable Authorship Verification Explanations

Ramnath, Sahana; Pandey, Kartik; Boschee, Elizabeth; Ren, Xiang

Computer Science > Computation and Language

arXiv:2406.16672 (cs)

[Submitted on 24 Jun 2024]

Title:CAVE: Controllable Authorship Verification Explanations

Authors:Sahana Ramnath, Kartik Pandey, Elizabeth Boschee, Xiang Ren

View PDF HTML (experimental)

Abstract:Authorship Verification (AV) (do two documents have the same author?) is essential for many sensitive real-life applications. AV is often used in proprietary domains that require a private, offline model, making SOTA online models like ChatGPT undesirable. Other SOTA systems use methods, e.g. Siamese Networks, that are uninterpretable, and hence cannot be trusted in high-stakes applications. In this work, we take the first step to address the above challenges with our model CAVE (Controllable Authorship Verification Explanations): CAVE generates free-text AV explanations that are controlled to be 1) structured (can be decomposed into sub-explanations with respect to relevant linguistic features), and 2) easily verified for explanation-label consistency (via intermediate labels in sub-explanations). In this work, we train a Llama-3-8B as CAVE; since there are no human-written corpora for AV explanations, we sample silver-standard explanations from GPT-4-TURBO and distill them into a pretrained Llama-3-8B. Results on three difficult AV datasets IMdB2, Blog-Auth, and FanFiction show that CAVE generates high quality explanations (as measured by automatic and human evaluation) as well as competitive task accuracies.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2406.16672 [cs.CL]
	(or arXiv:2406.16672v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2406.16672

Submission history

From: Sahana Ramnath [view email]
[v1] Mon, 24 Jun 2024 14:27:54 UTC (332 KB)

Computer Science > Computation and Language

Title:CAVE: Controllable Authorship Verification Explanations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:CAVE: Controllable Authorship Verification Explanations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators