Towards Multimodal Depth Estimation from Light Fields

Leistner, Titus; Mackowiak, Radek; Ardizzone, Lynton; Köthe, Ullrich; Rother, Carsten

arXiv:2203.16542v2 (cs)

[Submitted on 30 Mar 2022 (v1), last revised 1 Apr 2022 (this version, v2)]

Abstract: Light field applications, especially light field rendering and depth estimation, have developed rapidly in recent years. While state-of-the-art light field rendering methods handle semi-transparent and reflective objects well, depth estimation methods either ignore these cases altogether or deliver only weak performance. We argue that this is due to current methods considering only a single "true" depth, even when multiple objects at different depths contribute to the color of a single pixel. Based on the simple idea of outputting a posterior depth distribution instead of a single estimate, we develop and explore several deep-learning-based approaches to the problem. Additionally, we contribute the first "multimodal light field depth dataset", which contains the depths of all objects that contribute to the color of a pixel. This allows us to supervise the multimodal depth prediction and to validate all methods by measuring the KL divergence of the predicted posteriors. With our thorough analysis and novel dataset, we aim to start a new line of depth estimation research that overcomes some of the long-standing limitations of this field.
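
As a rough illustration of the evaluation idea in the abstract, the sketch below compares a predicted per-pixel depth posterior against a multimodal ground truth using the KL divergence. Everything concrete here is an assumption made for illustration and is not taken from the paper: the discretization into depth bins, the contribution weights, the direction KL(target || prediction), and the helper names `depth_histogram` and `kl_divergence`.

```python
import math


def depth_histogram(depths, weights, bin_edges):
    """Turn weighted per-pixel depths into a normalized histogram.

    Hypothetical helper: the binning scheme is an assumption for this sketch.
    """
    n_bins = len(bin_edges) - 1
    hist = [0.0] * n_bins
    for d, w in zip(depths, weights):
        for i in range(n_bins):
            is_last = i == n_bins - 1
            in_bin = bin_edges[i] <= d < bin_edges[i + 1]
            # The last bin also includes its right edge.
            if in_bin or (is_last and d == bin_edges[-1]):
                hist[i] += w
                break
    total = sum(hist)
    if total == 0:
        raise ValueError("no depth fell inside the bin range")
    return [h / total for h in hist]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions over the same bins.

    q is clamped to eps so that empty predicted bins do not divide by zero.
    """
    return sum(
        pi * math.log(pi / max(qi, eps)) for pi, qi in zip(p, q) if pi > 0
    )


# Assumed toy pixel: a semi-transparent surface at depth 0.5 contributes 70%
# of the color, an opaque surface behind it at depth 2.5 contributes 30%.
bin_edges = [0.0, 1.0, 2.0, 3.0, 4.0]
target = depth_histogram([0.5, 2.5], [0.7, 0.3], bin_edges)

# Two made-up predicted posteriors over the same four bins.
unimodal = [0.97, 0.01, 0.01, 0.01]  # commits to a single depth
bimodal = [0.65, 0.02, 0.31, 0.02]   # places mass on both surfaces

kl_uni = kl_divergence(target, unimodal)
kl_bi = kl_divergence(target, bimodal)
print(f"KL(target || unimodal) = {kl_uni:.3f}")
print(f"KL(target || bimodal)  = {kl_bi:.3f}")
```

In this toy setting the posterior that covers both surfaces scores a lower divergence than the one that commits to a single depth, which is the kind of difference a single-estimate error metric could not express.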

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2203.16542 [cs.CV]
	(or arXiv:2203.16542v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2203.16542

Submission history

From: Titus Leistner
[v1] Wed, 30 Mar 2022 18:00:00 UTC (34,598 KB)
[v2] Fri, 1 Apr 2022 10:55:33 UTC (34,778 KB)
