How to train your VAE

Rivera, Mariano

Computer Science > Machine Learning

arXiv:2309.13160 (cs)

[Submitted on 22 Sep 2023 (v1), last revised 21 Jun 2024 (this version, v3)]

Title:How to train your VAE

Authors:Mariano Rivera

View PDF HTML (experimental)

Abstract:Variational Autoencoders (VAEs) have become a cornerstone in generative modeling and representation learning within machine learning. This paper explores a nuanced aspect of VAEs, focusing on interpreting the Kullback-Leibler (KL) Divergence, a critical component within the Evidence Lower Bound (ELBO) that governs the trade-off between reconstruction accuracy and regularization. Meanwhile, the KL Divergence enforces alignment between latent variable distributions and a prior imposing a structure on the overall latent space but leaves individual variable distributions unconstrained. The proposed method redefines the ELBO with a mixture of Gaussians for the posterior probability, introduces a regularization term to prevent variance collapse, and employs a PatchGAN discriminator to enhance texture realism. Implementation details involve ResNetV2 architectures for both the Encoder and Decoder. The experiments demonstrate the ability to generate realistic faces, offering a promising solution for enhancing VAE-based generative models.

Comments:	5 pages, 3 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
MSC classes:	68T07
ACM classes:	I.2.4; I.4.5
Cite as:	arXiv:2309.13160 [cs.LG]
	(or arXiv:2309.13160v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2309.13160

Submission history

From: Mariano Rivera [view email]
[v1] Fri, 22 Sep 2023 19:52:28 UTC (6,615 KB)
[v2] Thu, 8 Feb 2024 17:37:56 UTC (10,935 KB)
[v3] Fri, 21 Jun 2024 19:15:54 UTC (2,738 KB)

Computer Science > Machine Learning

Title:How to train your VAE

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:How to train your VAE

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators