Perceptual Generative Autoencoders

Zhang, Zijun; Zhang, Ruixiang; Li, Zongpeng; Bengio, Yoshua; Paull, Liam

Computer Science > Machine Learning

arXiv:1906.10335 (cs)

[Submitted on 25 Jun 2019 (v1), last revised 1 Jul 2020 (this version, v2)]

Title:Perceptual Generative Autoencoders

Authors:Zijun Zhang, Ruixiang Zhang, Zongpeng Li, Yoshua Bengio, Liam Paull

View PDF

Abstract:Modern generative models are usually designed to match target distributions directly in the data space, where the intrinsic dimension of data can be much lower than the ambient dimension. We argue that this discrepancy may contribute to the difficulties in training generative models. We therefore propose to map both the generated and target distributions to a latent space using the encoder of a standard autoencoder, and train the generator (or decoder) to match the target distribution in the latent space. Specifically, we enforce the consistency in both the data space and the latent space with theoretically justified data and latent reconstruction losses. The resulting generative model, which we call a perceptual generative autoencoder (PGA), is then trained with a maximum likelihood or variational autoencoder (VAE) objective. With maximum likelihood, PGAs generalize the idea of reversible generative models to unrestricted neural network architectures and arbitrary number of latent dimensions. When combined with VAEs, PGAs substantially improve over the baseline VAEs in terms of sample quality. Compared to other autoencoder-based generative models using simple priors, PGAs achieve state-of-the-art FID scores on CIFAR-10 and CelebA.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1906.10335 [cs.LG]
	(or arXiv:1906.10335v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1906.10335

Submission history

From: Zijun Zhang [view email]
[v1] Tue, 25 Jun 2019 06:03:14 UTC (5,344 KB)
[v2] Wed, 1 Jul 2020 04:52:04 UTC (9,069 KB)

Computer Science > Machine Learning

Title:Perceptual Generative Autoencoders

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Perceptual Generative Autoencoders

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators