3D-aware Image Synthesis via Learning Structural and Textural Representations

Xu, Yinghao; Peng, Sida; Yang, Ceyuan; Shen, Yujun; Zhou, Bolei

Computer Science > Computer Vision and Pattern Recognition

arXiv:2112.10759 (cs)

[Submitted on 20 Dec 2021 (v1), last revised 18 Apr 2022 (this version, v2)]

Title:3D-aware Image Synthesis via Learning Structural and Textural Representations

Authors:Yinghao Xu, Sida Peng, Ceyuan Yang, Yujun Shen, Bolei Zhou

View PDF

Abstract:Making generative models 3D-aware bridges the 2D image space and the 3D physical world yet remains challenging. Recent attempts equip a Generative Adversarial Network (GAN) with a Neural Radiance Field (NeRF), which maps 3D coordinates to pixel values, as a 3D prior. However, the implicit function in NeRF has a very local receptive field, making the generator hard to become aware of the global structure. Meanwhile, NeRF is built on volume rendering which can be too costly to produce high-resolution results, increasing the optimization difficulty. To alleviate these two problems, we propose a novel framework, termed as VolumeGAN, for high-fidelity 3D-aware image synthesis, through explicitly learning a structural representation and a textural representation. We first learn a feature volume to represent the underlying structure, which is then converted to a feature field using a NeRF-like model. The feature field is further accumulated into a 2D feature map as the textural representation, followed by a neural renderer for appearance synthesis. Such a design enables independent control of the shape and the appearance. Extensive experiments on a wide range of datasets show that our approach achieves sufficiently higher image quality and better 3D control than the previous methods.

Comments:	CVPR 2022 camera-ready, Project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2112.10759 [cs.CV]
	(or arXiv:2112.10759v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2112.10759

Submission history

From: Yinghao Xu [view email]
[v1] Mon, 20 Dec 2021 18:59:40 UTC (1,973 KB)
[v2] Mon, 18 Apr 2022 12:26:06 UTC (1,975 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:3D-aware Image Synthesis via Learning Structural and Textural Representations

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:3D-aware Image Synthesis via Learning Structural and Textural Representations

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators