Less is More: Sparse Watermarking in LLMs with Enhanced Text Quality

Hoang, Duy C.; Le, Hung T. Q.; Chu, Rui; Li, Ping; Zhao, Weijie; Lao, Yingjie; Doan, Khoa D.

Computer Science > Cryptography and Security

arXiv:2407.13803 (cs)

[Submitted on 17 Jul 2024]

Title:Less is More: Sparse Watermarking in LLMs with Enhanced Text Quality

Authors:Duy C. Hoang, Hung T. Q. Le, Rui Chu, Ping Li, Weijie Zhao, Yingjie Lao, Khoa D. Doan

View PDF HTML (experimental)

Abstract:With the widespread adoption of Large Language Models (LLMs), concerns about potential misuse have emerged. To this end, watermarking has been adapted to LLM, enabling a simple and effective way to detect and monitor generated text. However, while the existing methods can differentiate between watermarked and unwatermarked text with high accuracy, they often face a trade-off between the quality of the generated text and the effectiveness of the watermarking process. In this work, we present a novel type of LLM watermark, Sparse Watermark, which aims to mitigate this trade-off by applying watermarks to a small subset of generated tokens distributed across the text. The key strategy involves anchoring watermarked tokens to words that have specific Part-of-Speech (POS) tags. Our experimental results demonstrate that the proposed watermarking scheme achieves high detectability while generating text that outperforms previous LLM watermarking methods in quality across various tasks

Subjects:	Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2407.13803 [cs.CR]
	(or arXiv:2407.13803v1 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2407.13803

Submission history

From: Cao Duy Hoang [view email]
[v1] Wed, 17 Jul 2024 18:52:12 UTC (1,566 KB)

Computer Science > Cryptography and Security

Title:Less is More: Sparse Watermarking in LLMs with Enhanced Text Quality

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:Less is More: Sparse Watermarking in LLMs with Enhanced Text Quality

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators