Raccoon: Prompt Extraction Benchmark of LLM-Integrated Applications

Wang, Junlin; Yang, Tianyi; Xie, Roy; Dhingra, Bhuwan

doi:10.18653/v1/2024.findings-acl.791

arXiv:2406.06737 (cs)

[Submitted on 10 Jun 2024 (v1), last revised 26 Oct 2024 (this version, v2)]

Abstract: With the proliferation of LLM-integrated applications such as GPT-s, millions are deployed, offering valuable services through proprietary instruction prompts. These systems, however, are prone to prompt extraction attacks through meticulously designed queries. To help mitigate this problem, we introduce the Raccoon benchmark which comprehensively evaluates a model's susceptibility to prompt extraction attacks. Our novel evaluation method assesses models under both defenseless and defended scenarios, employing a dual approach to evaluate the effectiveness of existing defenses and the resilience of the models. The benchmark encompasses 14 categories of prompt extraction attacks, with additional compounded attacks that closely mimic the strategies of potential attackers, alongside a diverse collection of defense templates. This array is, to our knowledge, the most extensive compilation of prompt theft attacks and defense mechanisms to date. Our findings highlight universal susceptibility to prompt theft in the absence of defenses, with OpenAI models demonstrating notable resilience when protected. This paper aims to establish a more systematic benchmark for assessing LLM robustness against prompt extraction attacks, offering insights into their causes and potential countermeasures. Resources of Raccoon are publicly available at this https URL.

Comments:	ACL 2024 Findings
Subjects:	Cryptography and Security (cs.CR); Computation and Language (cs.CL)
Cite as:	arXiv:2406.06737 [cs.CR]
	(or arXiv:2406.06737v2 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2406.06737
Related DOI:	https://doi.org/10.18653/v1/2024.findings-acl.791

Submission history

From: Junlin Wang
[v1] Mon, 10 Jun 2024 18:57:22 UTC (735 KB)
[v2] Sat, 26 Oct 2024 03:01:42 UTC (739 KB)
