EAGLE: A Domain Generalization Framework for AI-generated Text Detection

Bhattacharjee, Amrita; Moraffah, Raha; Garland, Joshua; Liu, Huan

arXiv:2403.15690 (cs)

[Submitted on 23 Mar 2024]

Abstract: As Large Language Models (LLMs) grow more capable, detecting the text they generate is a major step toward their responsible and safe use. Supervised AI-generated text detectors perform well on text generated by older LLMs, but new LLMs are released frequently, and building supervised detectors for each new model would require new labeled training data, which is infeasible in practice. In this work, we tackle this problem and propose a domain generalization framework for detecting AI-generated text from unseen target generators. Our proposed framework, EAGLE, leverages the labeled data available so far from older language models and learns features that are invariant across these generators, in order to detect text generated by an unknown target generator. EAGLE learns such domain-invariant features by combining the representational power of self-supervised contrastive learning with domain adversarial training. Our experiments demonstrate that EAGLE performs well in detecting text generated by unseen target generators, including recent state-of-the-art ones such as GPT-4 and Claude, reaching detection scores within 4.7% of a fully supervised detector.
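
The abstract names two ingredients, self-supervised contrastive learning and domain adversarial training, without giving the loss. The following is a minimal sketch of how such terms are commonly combined, not the authors' implementation. The InfoNCE-style contrastive form, the weights `alpha` and `lam`, the temperature, and all function names are assumptions made for illustration. The "domain" is taken to be the source generator that produced a text.

```python
import math

def cross_entropy(probs, label):
    """Negative log-likelihood of the true label under a predicted distribution."""
    return -math.log(probs[label])

def cosine(u, v):
    """Cosine similarity between two feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def info_nce(anchor, positive, negatives, temperature=0.5):
    """InfoNCE-style contrastive loss (assumed form): low when the anchor is
    closer to its positive view than to the negatives."""
    pos = math.exp(cosine(anchor, positive) / temperature)
    neg = sum(math.exp(cosine(anchor, n) / temperature) for n in negatives)
    return -math.log(pos / (pos + neg))

def encoder_objective(task_loss, contrastive_loss, domain_loss, alpha=1.0, lam=1.0):
    """Quantity the feature encoder minimises in a domain-adversarial setup.
    The domain term is subtracted: the encoder is rewarded when the domain
    (generator) classifier does badly, which pushes features toward being
    generator-invariant. alpha and lam are hypothetical weights."""
    return task_loss + alpha * contrastive_loss - lam * domain_loss

# Toy values, invented for illustration only.
task = cross_entropy([0.2, 0.8], label=1)            # human-vs-AI classifier
contrast = info_nce([1.0, 0.0], [0.9, 0.1], [[0.0, 1.0]])
domain = cross_entropy([0.5, 0.5], label=0)          # which-generator classifier
total = encoder_objective(task, contrast, domain)
```

In an autograd framework the subtraction is usually realised with a gradient reversal layer between the encoder and the domain classifier, so that the domain classifier itself is still trained to minimise its own loss while the encoder receives the negated gradient.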

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2403.15690 [cs.CL]
	(or arXiv:2403.15690v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2403.15690

Submission history

From: Amrita Bhattacharjee
[v1] Sat, 23 Mar 2024 02:44:20 UTC (16,990 KB)
