RigNet++: Semantic Assisted Repetitive Image Guided Network for Depth Completion

Yan, Zhiqiang; Li, Xiang; Hui, Le; Zhang, Zhenyu; Li, Jun; Yang, Jian

Computer Science > Computer Vision and Pattern Recognition

arXiv:2309.00655 (cs)

[Submitted on 1 Sep 2023 (v1), last revised 28 Feb 2024 (this version, v4)]

Title:RigNet++: Semantic Assisted Repetitive Image Guided Network for Depth Completion

Authors:Zhiqiang Yan, Xiang Li, Le Hui, Zhenyu Zhang, Jun Li, Jian Yang

View PDF HTML (experimental)

Abstract:Depth completion aims to recover dense depth maps from sparse ones, where color images are often used to facilitate this task. Recent depth methods primarily focus on image guided learning frameworks. However, blurry guidance in the image and unclear structure in the depth still impede their performance. To tackle these challenges, we explore a repetitive design in our image guided network to gradually and sufficiently recover depth values. Specifically, the repetition is embodied in both the image guidance branch and depth generation branch. In the former branch, we design a dense repetitive hourglass network (DRHN) to extract discriminative image features of complex environments, which can provide powerful contextual instruction for depth prediction. In the latter branch, we present a repetitive guidance (RG) module based on dynamic convolution, in which an efficient convolution factorization is proposed to reduce the complexity while modeling high-frequency structures progressively. Furthermore, in the semantic guidance branch, we utilize the well-known large vision model, i.e., segment anything (SAM), to supply RG with semantic prior. In addition, we propose a region-aware spatial propagation network (RASPN) for further depth refinement based on the semantic prior constraint. Finally, we collect a new dataset termed TOFDC for the depth completion task, which is acquired by the time-of-flight (TOF) sensor and the color camera on smartphones. Extensive experiments demonstrate that our method achieves state-of-the-art performance on KITTI, NYUv2, Matterport3D, 3D60, VKITTI, and our TOFDC.

Comments:	20 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2309.00655 [cs.CV]
	(or arXiv:2309.00655v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2309.00655

Submission history

From: Zhiqiang Yan [view email]
[v1] Fri, 1 Sep 2023 09:11:20 UTC (4,162 KB)
[v2] Thu, 14 Sep 2023 08:50:06 UTC (4,164 KB)
[v3] Fri, 15 Sep 2023 06:11:28 UTC (4,164 KB)
[v4] Wed, 28 Feb 2024 06:58:46 UTC (6,294 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:RigNet++: Semantic Assisted Repetitive Image Guided Network for Depth Completion

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:RigNet++: Semantic Assisted Repetitive Image Guided Network for Depth Completion

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators