Improved Unbiased Watermark for Large Language Models

Chen, Ruibo; Wu, Yihan; Guo, Junfeng; Huang, Heng

arXiv:2502.11268 (cs)

[Submitted on 16 Feb 2025]

Abstract: As artificial intelligence surpasses human capabilities in text generation, the necessity to authenticate the origins of AI-generated content has become paramount. Unbiased watermarks offer a powerful solution by embedding statistical signals into language model-generated text without distorting its quality. In this paper, we introduce MCmark, a family of unbiased, Multi-Channel-based watermarks. MCmark works by partitioning the model's vocabulary into segments and promoting token probabilities within a selected segment based on a watermark key. We demonstrate that MCmark not only preserves the original distribution of the language model but also offers significant improvements in detectability and robustness over existing unbiased watermarks. Our experiments with widely used language models demonstrate an improvement in detectability of over 10% using MCmark, compared to existing state-of-the-art unbiased watermarks. This advancement underscores MCmark's potential in enhancing the practical application of watermarking in AI-generated texts.
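
The abstract describes the mechanism only at a high level, so the following is a minimal sketch of one way a segment-based reweighting can stay unbiased. It is not taken from the paper. The function names `select_channel` and `mc_reweight`, the HMAC-based key derivation, and the specific mass-redistribution rule are all assumptions made for illustration. The idea: a keyed hash picks one of `n_seg` segments (a "channel"), the chosen segment's probability mass is promoted up to at most 1, and any leftover mass goes to segments that are too heavy to be fully served by their own channel. Averaged over a uniformly chosen channel, each token keeps its original probability.

```python
import hashlib
import hmac
from typing import List, Sequence


def select_channel(key: bytes, context: Sequence[int], n_seg: int) -> int:
    """Hypothetical key schedule: derive a channel index from a secret key
    and the preceding token ids. The modulo step is only approximately
    uniform, which is negligible for a 256-bit digest."""
    msg = ",".join(str(t) for t in context).encode("utf-8")
    digest = hmac.new(key, msg, hashlib.sha256).digest()
    return int.from_bytes(digest, "big") % n_seg


def mc_reweight(probs: List[float], seg_of: List[int],
                n_seg: int, channel: int) -> List[float]:
    """Illustrative reweighting (an assumption, not the paper's exact rule).
    probs[i] is the model probability of token i; seg_of[i] is its segment."""
    # Original probability mass of each segment.
    mass = [0.0] * n_seg
    for p, s in zip(probs, seg_of):
        mass[s] += p

    # Mass each segment still needs from other channels so that the
    # average over all channels reproduces the original distribution.
    need = [max(0.0, n_seg * m - 1.0) for m in mass]
    total_need = sum(need)

    # Target segment masses under the selected channel: promote the
    # selected segment, then hand any spare mass to the heavy segments.
    target = [0.0] * n_seg
    target[channel] = min(1.0, n_seg * mass[channel])
    spare = 1.0 - target[channel]
    if spare > 0.0 and total_need > 0.0:
        for j in range(n_seg):
            target[j] += spare * need[j] / total_need

    # Inside a segment, tokens keep their relative proportions.
    return [p / mass[s] * target[s] if mass[s] > 0.0 else 0.0
            for p, s in zip(probs, seg_of)]


# Toy vocabulary of four tokens split into two segments.
probs = [0.5, 0.2, 0.2, 0.1]
seg_of = [0, 0, 1, 1]
channels = [mc_reweight(probs, seg_of, 2, k) for k in range(2)]
average = [sum(q[i] for q in channels) / 2 for i in range(len(probs))]
```

In this toy case `average` matches `probs` up to floating-point error, which is the unbiasedness property the abstract refers to. A detector that knows the key could recompute the channel for each position and count how often the observed token falls in the selected segment; the actual detection statistic used by MCmark is not given in the abstract.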

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2502.11268 [cs.CL]
	(or arXiv:2502.11268v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.11268

Submission history

From: Ruibo Chen
[v1] Sun, 16 Feb 2025 21:02:36 UTC (571 KB)
