Bridging the Gap Between Training and Inference of Bayesian Controllable Language Models

Liu, Han; Wang, Bingning; Yao, Ting; Liang, Haijin; Xu, Jianjin; Hu, Xiaolin

arXiv:2206.05519 (cs)

[Submitted on 11 Jun 2022]

Abstract: Large-scale pre-trained language models have achieved great success on natural language generation tasks. However, it is difficult to control these models so that they generate sentences with a desired attribute, such as a given topic or sentiment. Recently, Bayesian Controllable Language Models (BCLMs) have been shown to be efficient in controllable language generation. Rather than fine-tuning the parameters of pre-trained language models, BCLMs use external discriminators to guide their generation. However, the mismatch between training and inference of BCLMs limits the performance of the models. To address this problem, we propose a "Gemini Discriminator" for controllable language generation, which alleviates the mismatch at a small computational cost. We tested our method on two controllable language generation tasks: sentiment control and topic control. On both tasks, our method achieved new state-of-the-art results in automatic and human evaluations.
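
Illustrative note: the abstract says that BCLMs steer a frozen language model with an external discriminator. A minimal sketch of the generic Bayes-rule reweighting behind that idea, p(token | prefix, attribute) ∝ p(token | prefix) · p(attribute | prefix + token), is given below. It is not the paper's Gemini Discriminator; the function name `reweight_next_token`, the three-word vocabulary, and all probabilities are hypothetical values chosen for illustration.

```python
import math


def reweight_next_token(lm_logprobs, attr_logprobs):
    """Combine language-model and discriminator scores with Bayes' rule.

    lm_logprobs[tok]   : log p(tok | prefix) from the frozen language model
    attr_logprobs[tok] : log p(attribute | prefix + tok) from the discriminator
    Returns the normalized distribution p(tok | prefix, attribute).
    """
    # Unnormalized log posterior: log prior + log likelihood.
    scores = {tok: lm_logprobs[tok] + attr_logprobs[tok] for tok in lm_logprobs}

    # Normalize with a numerically stable log-sum-exp.
    peak = max(scores.values())
    log_z = peak + math.log(sum(math.exp(s - peak) for s in scores.values()))
    return {tok: math.exp(s - log_z) for tok, s in scores.items()}


# Hypothetical next-token distribution from the language model.
lm = {"great": math.log(0.2), "bad": math.log(0.5), "okay": math.log(0.3)}
# Hypothetical discriminator scores for the attribute "positive sentiment".
attr = {"great": math.log(0.9), "bad": math.log(0.1), "okay": math.log(0.5)}

posterior = reweight_next_token(lm, attr)
best_token = max(posterior, key=posterior.get)
```

In this toy setup the language model alone prefers "bad", while the reweighted distribution prefers "great", which is the steering effect the abstract describes. The language model's parameters are never updated; only its output distribution is rescaled at each decoding step.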

Comments:	Submitted to NeurIPS 2022
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2206.05519 [cs.CL]
	(or arXiv:2206.05519v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2206.05519

Submission history

From: Han Liu
[v1] Sat, 11 Jun 2022 12:52:32 UTC (673 KB)
