Casper: Prompt Sanitization for Protecting User Privacy in Web-Based Large Language Models

Chong, Chun Jie; Hou, Chenxi; Yao, Zhihao; Talebi, Seyed Mohammadjavad Seyed

Computer Science > Cryptography and Security

arXiv:2408.07004 (cs)

[Submitted on 13 Aug 2024]

Title:Casper: Prompt Sanitization for Protecting User Privacy in Web-Based Large Language Models

Authors:Chun Jie Chong, Chenxi Hou, Zhihao Yao, Seyed Mohammadjavad Seyed Talebi

View PDF HTML (experimental)

Abstract:Web-based Large Language Model (LLM) services have been widely adopted and have become an integral part of our Internet experience. Third-party plugins enhance the functionalities of LLM by enabling access to real-world data and services. However, the privacy consequences associated with these services and their third-party plugins are not well understood. Sensitive prompt data are stored, processed, and shared by cloud-based LLM providers and third-party plugins. In this paper, we propose Casper, a prompt sanitization technique that aims to protect user privacy by detecting and removing sensitive information from user inputs before sending them to LLM services. Casper runs entirely on the user's device as a browser extension and does not require any changes to the online LLM services. At the core of Casper is a three-layered sanitization mechanism consisting of a rule-based filter, a Machine Learning (ML)-based named entity recognizer, and a browser-based local LLM topic identifier. We evaluate Casper on a dataset of 4000 synthesized prompts and show that it can effectively filter out Personal Identifiable Information (PII) and privacy-sensitive topics with high accuracy, at 98.5% and 89.9%, respectively.

Subjects:	Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2408.07004 [cs.CR]
	(or arXiv:2408.07004v1 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2408.07004

Submission history

From: Zhihao Yao [view email]
[v1] Tue, 13 Aug 2024 16:08:37 UTC (314 KB)

Computer Science > Cryptography and Security

Title:Casper: Prompt Sanitization for Protecting User Privacy in Web-Based Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:Casper: Prompt Sanitization for Protecting User Privacy in Web-Based Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators