CoReNet: Coherent 3D scene reconstruction from a single RGB image

Popov, Stefan; Bauszat, Pablo; Ferrari, Vittorio

Computer Science > Computer Vision and Pattern Recognition

arXiv:2004.12989 (cs)

[Submitted on 27 Apr 2020 (v1), last revised 5 Aug 2020 (this version, v2)]

Title:CoReNet: Coherent 3D scene reconstruction from a single RGB image

Authors:Stefan Popov, Pablo Bauszat, Vittorio Ferrari

View PDF

Abstract:Advances in deep learning techniques have allowed recent work to reconstruct the shape of a single object given only one RBG image as input. Building on common encoder-decoder architectures for this task, we propose three extensions: (1) ray-traced skip connections that propagate local 2D information to the output 3D volume in a physically correct manner; (2) a hybrid 3D volume representation that enables building translation equivariant models, while at the same time encoding fine object details without an excessive memory footprint; (3) a reconstruction loss tailored to capture overall object geometry. Furthermore, we adapt our model to address the harder task of reconstructing multiple objects from a single image. We reconstruct all objects jointly in one pass, producing a coherent reconstruction, where all objects live in a single consistent 3D coordinate frame relative to the camera and they do not intersect in 3D space. We also handle occlusions and resolve them by hallucinating the missing object parts in the 3D volume. We validate the impact of our contributions experimentally both on synthetic data from ShapeNet as well as real images from Pix3D. Our method improves over the state-of-the-art single-object methods on both datasets. Finally, we evaluate performance quantitatively on multiple object reconstruction with synthetic scenes assembled from ShapeNet objects.

Comments:	ECCV 2020, camera ready, oral
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2004.12989 [cs.CV]
	(or arXiv:2004.12989v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2004.12989

Submission history

From: Stefan Popov [view email]
[v1] Mon, 27 Apr 2020 17:53:07 UTC (2,402 KB)
[v2] Wed, 5 Aug 2020 15:59:48 UTC (2,404 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:CoReNet: Coherent 3D scene reconstruction from a single RGB image

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:CoReNet: Coherent 3D scene reconstruction from a single RGB image

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators