ViewSynth: Learning Local Features from Depth using View Synthesis

Mahmud, Jisan; Singh, Rajat Vikram; Akiva, Peri; Kundu, Spondon; Peng, Kuan-Chuan; Frahm, Jan-Michael

arXiv:1911.10248 (cs)

[Submitted on 22 Nov 2019 (v1), last revised 1 Sep 2020 (this version, v4)]

Abstract: The rapid development of inexpensive commodity depth sensors has made keypoint detection and matching in the depth image modality an important problem in computer vision. Despite great improvements in recent RGB local feature learning methods, applying them directly to the depth modality leads to unsatisfactory performance. Most of these methods do not explicitly reason beyond the visible pixels in the images. To address these limitations, we propose a framework, ViewSynth, that jointly learns: (1) viewpoint-invariant keypoint descriptors from depth images using a proposed Contrastive Matching Loss, and (2) view synthesis of depth images from different viewpoints using the proposed View Synthesis Module and View Synthesis Loss. By learning view synthesis, we explicitly encourage the feature extractor to encode information about not only the visible but also the occluded parts of the scene. We demonstrate that, in the depth modality, ViewSynth outperforms state-of-the-art depth and RGB local feature extraction techniques in most scenarios on the 3D keypoint matching and camera localization tasks, evaluated on the RGB-D datasets 7-Scenes, TUM RGBD, and CoRBS. We also show that ViewSynth generalizes across different datasets in 3D keypoint matching.

Comments:	Accepted to BMVC 2020
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1911.10248 [cs.CV]
	(or arXiv:1911.10248v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1911.10248

Submission history

From: Jisan Mahmud
[v1] Fri, 22 Nov 2019 21:01:33 UTC (7,894 KB)
[v2] Thu, 7 May 2020 21:47:46 UTC (4,582 KB)
[v3] Thu, 20 Aug 2020 17:31:24 UTC (4,582 KB)
[v4] Tue, 1 Sep 2020 21:19:21 UTC (4,582 KB)
