SfM-TTR: Using Structure from Motion for Test-Time Refinement of Single-View Depth Networks

Izquierdo, Sergio; Civera, Javier

Computer Science > Computer Vision and Pattern Recognition

arXiv:2211.13551 (cs)

[Submitted on 24 Nov 2022 (v1), last revised 31 Mar 2023 (this version, v2)]

Title:SfM-TTR: Using Structure from Motion for Test-Time Refinement of Single-View Depth Networks

Authors:Sergio Izquierdo, Javier Civera

View PDF

Abstract:Estimating a dense depth map from a single view is geometrically ill-posed, and state-of-the-art methods rely on learning depth's relation with visual appearance using deep neural networks. On the other hand, Structure from Motion (SfM) leverages multi-view constraints to produce very accurate but sparse maps, as matching across images is typically limited by locally discriminative texture. In this work, we combine the strengths of both approaches by proposing a novel test-time refinement (TTR) method, denoted as SfM-TTR, that boosts the performance of single-view depth networks at test time using SfM multi-view cues. Specifically, and differently from the state of the art, we use sparse SfM point clouds as test-time self-supervisory signal, fine-tuning the network encoder to learn a better representation of the test scene. Our results show how the addition of SfM-TTR to several state-of-the-art self-supervised and supervised networks improves significantly their performance, outperforming previous TTR baselines mainly based on photometric multi-view consistency. The code is available at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2211.13551 [cs.CV]
	(or arXiv:2211.13551v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2211.13551

Submission history

From: Sergio Izquierdo [view email]
[v1] Thu, 24 Nov 2022 12:02:13 UTC (10,980 KB)
[v2] Fri, 31 Mar 2023 11:37:12 UTC (10,981 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:SfM-TTR: Using Structure from Motion for Test-Time Refinement of Single-View Depth Networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:SfM-TTR: Using Structure from Motion for Test-Time Refinement of Single-View Depth Networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators