Efficient but Vulnerable: Benchmarking and Defending LLM Batch Prompting Attack

Yue, Murong; Yao, Ziyu

Computer Science > Cryptography and Security

arXiv:2503.15551 (cs)

[Submitted on 18 Mar 2025]

Title:Efficient but Vulnerable: Benchmarking and Defending LLM Batch Prompting Attack

Authors:Murong Yue, Ziyu Yao

View PDF HTML (experimental)

Abstract:Batch prompting, which combines a batch of multiple queries sharing the same context in one inference, has emerged as a promising solution to reduce inference costs. However, our study reveals a significant security vulnerability in batch prompting: malicious users can inject attack instructions into a batch, leading to unwanted interference across all queries, which can result in the inclusion of harmful content, such as phishing links, or the disruption of logical reasoning. In this paper, we construct BATCHSAFEBENCH, a comprehensive benchmark comprising 150 attack instructions of two types and 8k batch instances, to study the batch prompting vulnerability systematically. Our evaluation of both closed-source and open-weight LLMs demonstrates that all LLMs are susceptible to batch-prompting attacks. We then explore multiple defending approaches. While the prompting-based defense shows limited effectiveness for smaller LLMs, the probing-based approach achieves about 95% accuracy in detecting attacks. Additionally, we perform a mechanistic analysis to understand the attack and identify attention heads that are responsible for it.

Subjects:	Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2503.15551 [cs.CR]
	(or arXiv:2503.15551v1 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2503.15551

Submission history

From: Murong Yue [view email]
[v1] Tue, 18 Mar 2025 15:16:10 UTC (1,286 KB)

Computer Science > Cryptography and Security

Title:Efficient but Vulnerable: Benchmarking and Defending LLM Batch Prompting Attack

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:Efficient but Vulnerable: Benchmarking and Defending LLM Batch Prompting Attack

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators