Can Unconditional Language Models Recover Arbitrary Sentences?

Subramani, Nishant; Bowman, Samuel R.; Cho, Kyunghyun

Computer Science > Computation and Language

arXiv:1907.04944 (cs)

[Submitted on 10 Jul 2019 (v1), last revised 9 Jan 2020 (this version, v2)]

Title:Can Unconditional Language Models Recover Arbitrary Sentences?

Authors:Nishant Subramani, Samuel R. Bowman, Kyunghyun Cho

View PDF

Abstract:Neural network-based generative language models like ELMo and BERT can work effectively as general purpose sentence encoders in text classification without further fine-tuning. Is it possible to adapt them in a similar way for use as general-purpose decoders? For this to be possible, it would need to be the case that for any target sentence of interest, there is some continuous representation that can be passed to the language model to cause it to reproduce that sentence. We set aside the difficult problem of designing an encoder that can produce such representations and, instead, ask directly whether such representations exist at all. To do this, we introduce a pair of effective, complementary methods for feeding representations into pretrained unconditional language models and a corresponding set of methods to map sentences into and out of this representation space, the reparametrized sentence space. We then investigate the conditions under which a language model can be made to generate a sentence through the identification of a point in such a space and find that it is possible to recover arbitrary sentences nearly perfectly with language models and representations of moderate size without modifying any model parameters.

Comments:	NeurIPS 2019
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:1907.04944 [cs.CL]
	(or arXiv:1907.04944v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1907.04944

Submission history

From: Nishant Subramani [view email]
[v1] Wed, 10 Jul 2019 22:13:48 UTC (6,742 KB)
[v2] Thu, 9 Jan 2020 23:03:30 UTC (8,448 KB)

Computer Science > Computation and Language

Title:Can Unconditional Language Models Recover Arbitrary Sentences?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Can Unconditional Language Models Recover Arbitrary Sentences?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators