Length Controlled Generation for Black-box LLMs

Gu, Yuxuan; Wang, Wenjie; Feng, Xiaocheng; Zhong, Weihong; Zhu, Kun; Huang, Lei; Chua, Tat-Seng; Qin, Bing

arXiv:2412.14656 (cs)

[Submitted on 19 Dec 2024]

Abstract: Large language models (LLMs) have demonstrated impressive instruction-following capabilities, yet they still struggle to accurately manage the length of generated text, a fundamental requirement in many real-world applications. Existing length control methods involve fine-tuning the parameters of LLMs, which is inefficient and suboptimal for practical use. In this paper, we propose a novel iterative sampling framework for text length control, integrating the Metropolis-Hastings algorithm with an importance sampling acceleration strategy. This framework efficiently and reliably regulates LLMs to generate length-constrained text without modifying the underlying parameters, thereby preserving the original capabilities of LLMs. Experimental results demonstrate that our framework achieves almost 100% success rates of length control on Llama3.1 for tasks such as length-controlled abstractive summarization and length-constrained instruction following, with minimal additional computational overhead. This also highlights the significant potential of our method for precise length control across a broader range of applications, without compromising the versatility of LLMs.
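
The abstract names Metropolis-Hastings as the core of the framework but gives no algorithmic detail, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes an independence sampler in which the black-box model itself is the proposal, so the acceptance ratio reduces to a ratio of length weights. The names `toy_generator`, `length_weight`, `mh_length_control`, and the `temperature` parameter are hypothetical, and the paper's importance sampling acceleration is not reproduced here.

```python
import math
import random


def length_weight(text, target_len, temperature=1.0):
    """Unnormalized preference for texts whose word count is near target_len.

    Assumption for illustration: an exponential penalty on the absolute
    word-count error. The paper's actual target distribution may differ.
    """
    return math.exp(-abs(len(text.split()) - target_len) / temperature)


def toy_generator(rng):
    """Hypothetical stand-in for a black-box LLM call: 1 to 40 filler words."""
    return " ".join(["word"] * rng.randint(1, 40))


def mh_length_control(generate, target_len, steps=200, temperature=1.0, seed=0):
    """Metropolis-Hastings independence sampler over generated texts.

    Target: p_model(x) * length_weight(x). Proposal: p_model(x), i.e. a fresh
    sample from the generator. The model probabilities cancel, so a candidate
    is accepted with probability min(1, w_new / w_cur).
    """
    rng = random.Random(seed)
    current = generate(rng)
    w_cur = length_weight(current, target_len, temperature)
    for _ in range(steps):
        candidate = generate(rng)
        w_new = length_weight(candidate, target_len, temperature)
        if rng.random() < min(1.0, w_new / w_cur):
            current, w_cur = candidate, w_new
    return current


if __name__ == "__main__":
    text = mh_length_control(toy_generator, target_len=20, steps=500, temperature=0.5)
    print(len(text.split()), "words")
```

Because only samples from the generator are needed, nothing in this sketch touches model parameters, which is the black-box property the abstract emphasizes. A lower `temperature` concentrates the chain more tightly around the target length at the cost of more rejected candidates.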

Comments:	Preprint
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2412.14656 [cs.CL]
	(or arXiv:2412.14656v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2412.14656

Submission history

From: Yuxuan Gu
[v1] Thu, 19 Dec 2024 09:07:38 UTC (77 KB)
