CAVE: Controllable Authorship Verification Explanations

Ramnath, Sahana; Pandey, Kartik; Boschee, Elizabeth; Ren, Xiang

arXiv:2406.16672 (cs)

[Submitted on 24 Jun 2024 (v1), last revised 5 Sep 2024 (this version, v2)]

Abstract: Authorship Verification (AV) (do two documents have the same author?) is essential in many sensitive real-life applications. AV is often used in proprietary domains that require a private, offline model, making SOTA online models like ChatGPT undesirable. Current offline models, however, have lower downstream utility due to low accuracy and scalability (e.g., traditional stylometry AV systems) and a lack of accessible post-hoc explanations. In this work, we take the first step to address the above challenges with our trained, offline Llama-3-8B model CAVE (Controllable Authorship Verification Explanations): CAVE generates free-text AV explanations that are controlled to be (1) structured (can be decomposed into sub-explanations in terms of relevant linguistic features), and (2) easily verified for explanation-label consistency (via intermediate labels in sub-explanations). We first engineer a prompt that can generate silver training data from a SOTA teacher model in the desired CAVE output format. We then filter and distill this data into a pretrained Llama-3-8B, our carefully selected student model. Results on three difficult AV datasets (IMDb62, Blog-Auth, and Fanfiction) show that CAVE generates high-quality explanations (as measured by automatic and human evaluation) as well as competitive task accuracies.
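
The abstract does not say how explanation-label consistency is computed. As an illustration only, and not the authors' implementation, the sketch below assumes each sub-explanation carries an intermediate label of "same" or "different" and treats an explanation as consistent when the majority of intermediate labels matches the final label. The class name `SubExplanation`, the functions `majority_label` and `is_consistent`, the label vocabulary, and the majority rule are all hypothetical.

```python
from collections import Counter
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class SubExplanation:
    """One piece of a structured explanation (hypothetical schema)."""
    feature: str  # linguistic feature discussed, e.g. "punctuation"
    text: str     # free-text reasoning about that feature
    label: str    # intermediate label: "same" or "different" (assumed vocabulary)


def majority_label(subs: List[SubExplanation]) -> Optional[str]:
    """Return the majority intermediate label, or None on a tie (assumed rule)."""
    counts = Counter(s.label for s in subs)
    same, different = counts["same"], counts["different"]
    if same == different:
        return None
    return "same" if same > different else "different"


def is_consistent(subs: List[SubExplanation], final_label: str) -> bool:
    """An explanation is consistent if the majority label equals the final label."""
    return majority_label(subs) == final_label


# Toy example with made-up sub-explanations.
example = [
    SubExplanation("punctuation", "Both texts favor ellipses.", "same"),
    SubExplanation("tone", "Both texts are informal.", "same"),
    SubExplanation("vocabulary", "One text uses far more jargon.", "different"),
]
consistent = is_consistent(example, "same")
```

Under this assumed rule a tie among intermediate labels never counts as consistent, which is a conservative choice when the check is used to filter training data.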

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2406.16672 [cs.CL]
	(or arXiv:2406.16672v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2406.16672

Submission history

From: Sahana Ramnath
[v1] Mon, 24 Jun 2024 14:27:54 UTC (332 KB)
[v2] Thu, 5 Sep 2024 06:44:24 UTC (335 KB)
