Towards More Realistic Extraction Attacks: An Adversarial Perspective

More, Yash; Ganesh, Prakhar; Farnadi, Golnoosh

Computer Science > Cryptography and Security

arXiv:2407.02596 (cs)

[Submitted on 2 Jul 2024]

Abstract: Language models are prone to memorizing large parts of their training data, making them vulnerable to extraction attacks. Existing research on these attacks remains limited in scope, often studying isolated trends rather than the real-world interactions with these models. In this paper, we revisit extraction attacks from an adversarial perspective, exploiting the brittleness of language models. We find significant churn in extraction attack trends, i.e., even minor, unintuitive changes to the prompt, or targeting smaller models and older checkpoints, can exacerbate the risks of extraction by up to 2-4$\times$. Moreover, relying solely on the widely accepted verbatim match underestimates the extent of extracted information, and we provide various alternatives to more accurately capture the true risks of extraction. We conclude our discussion with data deduplication, a commonly suggested mitigation strategy, and find that while it addresses some memorization concerns, it remains vulnerable to the same escalation of extraction risks against a real-world adversary. Our findings highlight the necessity of acknowledging an adversary's true capabilities to avoid underestimating extraction risks.
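
The gap between verbatim matching and looser criteria mentioned in the abstract can be illustrated with a minimal sketch. The token-level similarity measure, the 0.9 threshold, and all function names below are assumptions made for illustration only; the abstract does not specify which alternatives the paper proposes, and this is not the authors' method.

```python
from difflib import SequenceMatcher


def verbatim_match(generated: str, reference: str) -> bool:
    """Count a sample as extracted only if it reproduces the reference exactly."""
    return generated == reference


def approximate_match(generated: str, reference: str, threshold: float = 0.9) -> bool:
    """Count a sample as extracted if token-level similarity reaches the threshold.

    The threshold of 0.9 is a hypothetical value chosen for this sketch.
    """
    gen_tokens = generated.split()
    ref_tokens = reference.split()
    # ratio() returns 2 * matched_tokens / total_tokens, a value in [0, 1].
    ratio = SequenceMatcher(None, gen_tokens, ref_tokens, autojunk=False).ratio()
    return ratio >= threshold


def extraction_rate(pairs, match_fn) -> float:
    """Fraction of (generated, reference) pairs that match_fn counts as extracted."""
    if not pairs:
        return 0.0
    return sum(match_fn(g, r) for g, r in pairs) / len(pairs)
```

Under these assumptions, a generation that differs from the training sequence by a single token is missed by `verbatim_match` but counted by `approximate_match`, so the measured extraction rate rises when the looser criterion is used.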

Comments:	To be presented at PrivateNLP@ACL2024
Subjects:	Cryptography and Security (cs.CR); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2407.02596 [cs.CR]
	(or arXiv:2407.02596v1 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2407.02596

Submission history

From: Prakhar Ganesh
[v1] Tue, 2 Jul 2024 18:33:49 UTC (2,683 KB)
