Gamma Sampling: Fine-grained Controlling Language Models without Training

Wu, Shangda; Sun, Maosong

arXiv:2205.06036v2 (cs)

[Submitted on 12 May 2022 (v1), revised 6 Sep 2022 (this version, v2), latest version 21 Feb 2023 (v5)]

Abstract: The dominant approaches to controlling language models excel at controlling high-level attributes (e.g., topic and sentiment). However, these methods often require condition-specific data or are computationally expensive. We propose Gamma Sampling, a simple new guided decoding method that achieves fine-grained controllable text generation without any training data while maintaining fast generation speed. Gamma Sampling introduces attribute-related information (provided by humans or by the language models themselves) into the sampling process to guide language models to generate texts with desired attributes. Since no training is involved, Gamma Sampling can be easily applied to any language model for controllable text generation. Through experiments, we show that GPT2-small (117M) steered by Gamma Sampling outperforms baselines such as PPLM (345M) and CTRL (1.6B) in diversity, attribute relevance, and overall quality of generated samples.
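
The abstract does not state the exact transform Gamma Sampling applies, so the following is only a minimal sketch of the general idea it describes: shifting probability mass toward a set of attribute-related tokens at sampling time, with no training. The function names, the `exponent` parameter, and the power-law rescaling are all assumptions made for illustration and may differ from the paper's actual formulation.

```python
import random


def reweight_attribute_tokens(probs, attribute_ids, exponent):
    """Rescale the total mass of attribute-related tokens (hypothetical transform).

    exponent < 1 increases the mass of the attribute tokens, exponent > 1
    decreases it. The result is still a normalized distribution.
    """
    attribute_ids = set(attribute_ids)
    mass_in = sum(probs[i] for i in attribute_ids)
    # Nothing to rescale if the attribute set holds none or all of the mass.
    if mass_in <= 0.0 or mass_in >= 1.0:
        return list(probs)
    mass_out = mass_in ** exponent  # assumed rescaling, stays in (0, 1)
    scale_attr = mass_out / mass_in
    scale_rest = (1.0 - mass_out) / (1.0 - mass_in)
    return [
        p * (scale_attr if i in attribute_ids else scale_rest)
        for i, p in enumerate(probs)
    ]


def sample_token(probs, rng=random):
    """Draw one token id from the (reweighted) distribution."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


# Toy next-token distribution over a 4-token vocabulary (made-up numbers).
probs = [0.5, 0.3, 0.1, 0.1]
# Tokens 2 and 3 are treated as attribute-related for this example.
steered = reweight_attribute_tokens(probs, attribute_ids=[2, 3], exponent=0.5)
next_token = sample_token(steered)
```

Because the adjustment acts only on the output distribution at each decoding step, a sketch like this can sit on top of any model that exposes next-token probabilities, which matches the abstract's claim that no training is involved.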

Comments:	20 pages, 5 figures
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2205.06036 [cs.CL]
	(or arXiv:2205.06036v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2205.06036

Submission history

From: Shangda Wu
[v1] Thu, 12 May 2022 11:48:11 UTC (1,724 KB)
[v2] Tue, 6 Sep 2022 12:12:50 UTC (3,672 KB)
[v3] Mon, 12 Sep 2022 15:12:59 UTC (3,671 KB)
[v4] Sat, 17 Sep 2022 03:14:34 UTC (3,671 KB)
[v5] Tue, 21 Feb 2023 07:48:50 UTC (10,535 KB)
