LatentSwap3D: Semantic Edits on 3D Image GANs

Simsar, Enis; Tonioni, Alessio; Örnek, Evin Pınar; Tombari, Federico

Computer Science > Computer Vision and Pattern Recognition

arXiv:2212.01381 (cs)

[Submitted on 2 Dec 2022 (v1), last revised 4 Sep 2023 (this version, v2)]

Title:LatentSwap3D: Semantic Edits on 3D Image GANs

Authors:Enis Simsar, Alessio Tonioni, Evin Pınar Örnek, Federico Tombari

View PDF

Abstract:3D GANs have the ability to generate latent codes for entire 3D volumes rather than only 2D images. These models offer desirable features like high-quality geometry and multi-view consistency, but, unlike their 2D counterparts, complex semantic image editing tasks for 3D GANs have only been partially explored. To address this problem, we propose LatentSwap3D, a semantic edit approach based on latent space discovery that can be used with any off-the-shelf 3D or 2D GAN model and on any dataset. LatentSwap3D relies on identifying the latent code dimensions corresponding to specific attributes by feature ranking using a random forest classifier. It then performs the edit by swap** the selected dimensions of the image being edited with the ones from an automatically selected reference image. Compared to other latent space control-based edit methods, which were mainly designed for 2D GANs, our method on 3D GANs provides remarkably consistent semantic edits in a disentangled manner and outperforms others both qualitatively and quantitatively. We show results on seven 3D GANs (pi-GAN, GIRAFFE, StyleSDF, MVCGAN, EG3D, StyleNeRF, and VolumeGAN) and on five datasets (FFHQ, AFHQ, Cats, MetFaces, and CompCars).

Comments:	The paper has been accepted by ICCV'23 AI3DCC
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2212.01381 [cs.CV]
	(or arXiv:2212.01381v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2212.01381

Submission history

From: Enis Simsar [view email]
[v1] Fri, 2 Dec 2022 18:59:51 UTC (25,776 KB)
[v2] Mon, 4 Sep 2023 19:12:46 UTC (43,798 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:LatentSwap3D: Semantic Edits on 3D Image GANs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:LatentSwap3D: Semantic Edits on 3D Image GANs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators