A fully end-to-end deep learning approach for real-time simultaneous 3D reconstruction and material recognition

Zhao, Cheng; Sun, Li; Stolkin, Rustam

doi:10.1109/ICAR.2017.8023499

Abstract:This paper addresses the problem of simultaneous 3D reconstruction and material recognition and segmentation. Enabling robots to recognise different materials (concrete, metal etc.) in a scene is important for many tasks, e.g. robotic interventions in nuclear decommissioning. Previous work on 3D semantic reconstruction has predominantly focused on recognition of everyday domestic objects (tables, chairs etc.), whereas previous work on material recognition has largely been confined to single 2D images without any 3D reconstruction. Meanwhile, most 3D semantic reconstruction methods rely on computationally expensive post-processing, using Fully-Connected Conditional Random Fields (CRFs), to achieve consistent segmentations. In contrast, we propose a deep learning method which performs 3D reconstruction while simultaneously recognising different types of materials and labelling them at the pixel level. Unlike previous methods, we propose a fully end-to-end approach, which does not require hand-crafted features or CRF post-processing. Instead, we use only learned features, and the CRF segmentation constraints are incorporated inside the fully end-to-end learned system. We present the results of experiments, in which we trained our system to perform real-time 3D semantic reconstruction for 23 different materials in a real-world application. The run-time performance of the system can be boosted to around 10Hz, using a conventional GPU, which is enough to achieve real-time semantic reconstruction using a 30fps RGB-D camera. To the best of our knowledge, this work is the first real-time end-to-end system for simultaneous 3D reconstruction and material recognition.

Comments:	8 pages, 7 figures, 4 tables
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1703.04699 [cs.CV]
	(or arXiv:1703.04699v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1703.04699
Related DOI:	https://doi.org/10.1109/ICAR.2017.8023499

Computer Science > Computer Vision and Pattern Recognition

Title:A fully end-to-end deep learning approach for real-time simultaneous 3D reconstruction and material recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators