A Block Metropolis-Hastings Sampler for Controllable Energy-based Text Generation

Forristal, Jarad; Mireshghallah, Niloofar; Durrett, Greg; Berg-Kirkpatrick, Taylor

Computer Science > Computation and Language

arXiv:2312.04510 (cs)

[Submitted on 7 Dec 2023]

Title:A Block Metropolis-Hastings Sampler for Controllable Energy-based Text Generation

Authors:Jarad Forristal, Niloofar Mireshghallah, Greg Durrett, Taylor Berg-Kirkpatrick

View PDF HTML (experimental)

Abstract:Recent work has shown that energy-based language modeling is an effective framework for controllable text generation because it enables flexible integration of arbitrary discriminators. However, because energy-based LMs are globally normalized, approximate techniques like Metropolis-Hastings (MH) are required for inference. Past work has largely explored simple proposal distributions that modify a single token at a time, like in Gibbs sampling. In this paper, we develop a novel MH sampler that, in contrast, proposes re-writes of the entire sequence in each step via iterative prompting of a large language model. Our new sampler (a) allows for more efficient and accurate sampling from a target distribution and (b) allows generation length to be determined through the sampling procedure rather than fixed in advance, as past work has required. We perform experiments on two controlled generation tasks, showing both downstream performance gains and more accurate target distribution sampling in comparison with single-token proposal techniques.

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2312.04510 [cs.CL]
	(or arXiv:2312.04510v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2312.04510

Submission history

From: Fatemehsadat Mireshghallah [view email]
[v1] Thu, 7 Dec 2023 18:30:15 UTC (751 KB)

Computer Science > Computation and Language

Title:A Block Metropolis-Hastings Sampler for Controllable Energy-based Text Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:A Block Metropolis-Hastings Sampler for Controllable Energy-based Text Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators