Controllable Text Generation with Neurally-Decomposed Oracle

Meng, Tao; Lu, Sidi; Peng, Nanyun; Chang, Kai-Wei

Computer Science > Computation and Language

arXiv:2205.14219 (cs)

[Submitted on 27 May 2022 (v1), last revised 20 Oct 2022 (this version, v2)]

Title:Controllable Text Generation with Neurally-Decomposed Oracle

Authors:Tao Meng, Sidi Lu, Nanyun Peng, Kai-Wei Chang

View PDF

Abstract:We propose a general and efficient framework to control auto-regressive generation models with NeurAlly-Decomposed Oracle (NADO). Given a pre-trained base language model and a sequence-level boolean oracle function, we propose to decompose the oracle function into token-level guidance to steer the base model in text generation. Specifically, the token-level guidance is approximated by a neural model trained with examples sampled from the base model, demanding no additional auxiliary labeled data. Based on posterior regularization, we present the closed-form optimal solution to incorporate the token-level guidance into the base model for controllable generation. We further provide a theoretical analysis of how the approximation quality of NADO affects the controllable generation results. Experiments conducted on two applications: (1) text generation with lexical constraints and (2) machine translation with formality control demonstrate that our framework efficiently guides the base model towards the given oracle while maintaining high generation quality.

Comments:	Accepted by NeurIPS 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2205.14219 [cs.CL]
	(or arXiv:2205.14219v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2205.14219

Submission history

From: Tao Meng [view email]
[v1] Fri, 27 May 2022 20:17:53 UTC (8,631 KB)
[v2] Thu, 20 Oct 2022 19:40:02 UTC (8,737 KB)

Computer Science > Computation and Language

Title:Controllable Text Generation with Neurally-Decomposed Oracle

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Controllable Text Generation with Neurally-Decomposed Oracle

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators