Neural Video Depth Stabilizer

Wang, Yiran; Shi, Min; Li, Jiaqi; Huang, Zihao; Cao, Zhiguo; Zhang, Jianming; Xian, Ke; Lin, Guosheng

arXiv:2307.08695 (cs)

[Submitted on 17 Jul 2023 (v1), last revised 10 Aug 2023 (this version, v2)]

Abstract: Video depth estimation aims to infer temporally consistent depth. Some methods achieve temporal consistency by finetuning a single-image depth model during test time using geometry and re-projection constraints, which is inefficient and not robust. An alternative approach is to learn how to enforce temporal consistency from data, but this requires well-designed models and sufficient video depth data. To address these challenges, we propose a plug-and-play framework called Neural Video Depth Stabilizer (NVDS) that stabilizes inconsistent depth estimations and can be applied to different single-image depth models without extra effort. We also introduce a large-scale dataset, Video Depth in the Wild (VDW), which consists of 14,203 videos with over two million frames, making it the largest natural-scene video depth dataset to our knowledge. We evaluate our method on the VDW dataset as well as two public benchmarks and demonstrate significant improvements in consistency, accuracy, and efficiency compared to previous approaches. Our work serves as a solid baseline and provides a data foundation for learning-based video depth models. We will release our dataset and code for future research.

Comments:	Accepted by ICCV2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2307.08695 [cs.CV]
	(or arXiv:2307.08695v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2307.08695

Submission history

From: Yiran Wang
[v1] Mon, 17 Jul 2023 17:57:01 UTC (18,729 KB)
[v2] Thu, 10 Aug 2023 09:36:06 UTC (18,760 KB)
