Language Models May Verbatim Complete Text They Were Not Explicitly Trained On

Liu, Ken Ziyu; Choquette-Choo, Christopher A.; Jagielski, Matthew; Kairouz, Peter; Koyejo, Sanmi; Liang, Percy; Papernot, Nicolas

arXiv:2503.17514 (cs)

[Submitted on 21 Mar 2025 (v1), last revised 25 Mar 2025 (this version, v2)]

Abstract: An important question today is whether a given text was used to train a large language model (LLM). A completion test is often employed: check whether the LLM completes a sufficiently complex text. This, however, requires a ground-truth definition of membership; most commonly, a text is defined as a member based on the $n$-gram overlap between the target text and any text in the dataset. In this work, we demonstrate that this $n$-gram based membership definition can be effectively gamed. We study scenarios where sequences are non-members for a given $n$, and we find that completion tests still succeed. We find many natural cases of this phenomenon by retraining LLMs from scratch after removing all training samples that were completed; these cases include exact duplicates, near-duplicates, and even short overlaps. They show that it is difficult to find a single viable choice of $n$ for membership definitions. Using these insights, we design adversarial datasets that can cause a given target sequence to be completed without containing it, for any reasonable choice of $n$. Our findings highlight the inadequacy of $n$-gram membership, suggesting that membership definitions fail to account for auxiliary information available to the training algorithm.
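
To make the $n$-gram membership definition concrete, here is a minimal Python sketch. It is not the authors' code. It assumes whitespace tokenization and treats a target as a member if it shares at least one $n$-gram with any training text; the function names and the toy dataset are hypothetical. It shows how the same target flips from member to non-member as $n$ grows.

```python
def ngrams(tokens, n):
    """Return the set of all contiguous n-token tuples in `tokens`."""
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def is_ngram_member(target, dataset, n):
    """Assumed definition: member if the target shares at least one
    n-gram with any text in the dataset (whitespace tokens)."""
    target_grams = ngrams(target.split(), n)
    return any(target_grams & ngrams(doc.split(), n) for doc in dataset)


# Toy dataset (hypothetical): the target never appears whole,
# but its pieces are spread across two training texts.
dataset = [
    "the quick brown fox sleeps",
    "a fox jumps over the lazy dog",
]
target = "the quick brown fox jumps over the lazy dog"

for n in (3, 5, 7):
    print(n, is_ngram_member(target, dataset, n))
# 3 True
# 5 True
# 7 False
```

Under this assumed definition the target is a non-member for $n = 7$ even though every one of its tokens is covered by the two training texts, which is the kind of gap between membership and completion that the abstract describes.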

Comments:	Main text: 9 pages, 7 figures, 1 table. Appendix: 29 pages, 20 tables, 15 figures
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
Cite as:	arXiv:2503.17514 [cs.CL]
	(or arXiv:2503.17514v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2503.17514

Submission history

From: Christopher A. Choquette-Choo
[v1] Fri, 21 Mar 2025 19:57:04 UTC (2,911 KB)
[v2] Tue, 25 Mar 2025 04:43:33 UTC (2,911 KB)