3DGen: Triplane Latent Diffusion for Textured Mesh Generation

Gupta, Anchit; Xiong, Wenhan; Nie, Yixin; Jones, Ian; Oğuz, Barlas

arXiv:2303.05371 (cs)

[Submitted on 9 Mar 2023 (v1), last revised 27 Mar 2023 (this version, v2)]

Abstract: Latent diffusion models for image generation have crossed a quality threshold that enabled them to achieve mass adoption. Recently, a series of works has made advances toward replicating this success in the 3D domain, introducing techniques such as point cloud VAEs, triplane representations, neural implicit surfaces, and differentiable-rendering-based training. We take another step in this direction, combining these developments in a two-step pipeline consisting of 1) a triplane VAE that learns latent representations of textured meshes and 2) a conditional diffusion model that generates the triplane features. For the first time, this architecture allows conditional and unconditional generation of high-quality textured or untextured 3D meshes across multiple diverse categories in a few seconds on a single GPU. It substantially outperforms previous work on image-conditioned and unconditional generation, in both mesh quality and texture generation. Furthermore, we demonstrate that our model scales to large datasets for increased quality and diversity. We will release our code and trained models.
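
For readers unfamiliar with the triplane representation mentioned in the abstract, the following is a minimal sketch of how a 3D point is typically looked up in a triplane: the point is projected onto three axis-aligned feature planes (XY, XZ, YZ), each plane is sampled with bilinear interpolation, and the three feature vectors are combined. This is an illustration only and is not taken from the paper. The function names `bilinear_sample` and `query_triplane`, the nested-list plane layout, the assumption of square planes, the [-1, 1] coordinate range, and aggregation by summation are all assumptions made for this example; the abstract does not specify how 3DGen aggregates or decodes triplane features.

```python
from typing import List, Sequence

# Hypothetical layout: plane[row][col] is a feature vector of length C.
# Planes are assumed square (same number of rows and columns).
Plane = List[List[List[float]]]


def bilinear_sample(plane: Plane, u: float, v: float) -> List[float]:
    """Bilinearly interpolate a feature vector at (u, v) in [-1, 1]^2.

    u selects the column and v selects the row.
    """
    res = len(plane)
    # Clamp to [-1, 1], then map to continuous grid coordinates [0, res - 1].
    gx = (min(max(u, -1.0), 1.0) + 1.0) * 0.5 * (res - 1)
    gy = (min(max(v, -1.0), 1.0) + 1.0) * 0.5 * (res - 1)
    x0, y0 = int(gx), int(gy)
    x1, y1 = min(x0 + 1, res - 1), min(y0 + 1, res - 1)
    tx, ty = gx - x0, gy - y0
    channels = len(plane[0][0])
    out = []
    for c in range(channels):
        top = plane[y0][x0][c] * (1 - tx) + plane[y0][x1][c] * tx
        bottom = plane[y1][x0][c] * (1 - tx) + plane[y1][x1][c] * tx
        out.append(top * (1 - ty) + bottom * ty)
    return out


def query_triplane(planes: Sequence[Plane], point: Sequence[float]) -> List[float]:
    """Return the feature vector for a 3D point in [-1, 1]^3.

    planes is (xy_plane, xz_plane, yz_plane). Summation is an assumed
    aggregation; concatenation is another common choice.
    """
    x, y, z = point
    f_xy = bilinear_sample(planes[0], x, y)
    f_xz = bilinear_sample(planes[1], x, z)
    f_yz = bilinear_sample(planes[2], y, z)
    return [a + b + c for a, b, c in zip(f_xy, f_xz, f_yz)]
```

In a full pipeline the resulting feature vector would usually be passed to a small decoder network that predicts geometry and appearance at the query point; that decoder is omitted here because its form is not described in the abstract.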

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
Cite as:	arXiv:2303.05371 [cs.CV]
	(or arXiv:2303.05371v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2303.05371

Submission history

From: Anchit Gupta
[v1] Thu, 9 Mar 2023 16:18:14 UTC (26,310 KB)
[v2] Mon, 27 Mar 2023 18:04:20 UTC (26,310 KB)
