Learning Neural Volumetric Representations of Dynamic Humans in Minutes

Geng, Chen; Peng, Sida; Xu, Zhen; Bao, Hujun; Zhou, Xiaowei

arXiv:2302.12237 (cs)

[Submitted on 23 Feb 2023 (v1), last revised 24 Feb 2023 (this version, v2)]

Abstract: This paper addresses the challenge of quickly reconstructing free-viewpoint videos of dynamic humans from sparse multi-view videos. Some recent works represent the dynamic human as a canonical neural radiance field (NeRF) and a motion field, which are learned from videos through differentiable rendering. However, this per-scene optimization generally requires hours. Other generalizable NeRF models leverage priors learned from datasets and reduce optimization time by only fine-tuning on new scenes, at the cost of visual fidelity. In this paper, we propose a novel method for learning neural volumetric videos of dynamic humans from sparse-view videos in minutes with competitive visual quality. Specifically, we define a novel part-based voxelized human representation to better distribute the representational power of the network to different human parts. Furthermore, we propose a novel 2D motion parameterization scheme to increase the convergence rate of deformation field learning. Experiments demonstrate that our model can be learned 100 times faster than prior per-scene optimization methods while remaining competitive in rendering quality. Training our model on a $512 \times 512$ video with 100 frames typically takes about 5 minutes on a single RTX 3090 GPU. The code will be released on our project page: this https URL
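
The "canonical field plus motion field" formulation that the abstract attributes to prior work can be illustrated with a toy sketch. Everything in it is an assumption made for illustration: the motion field is reduced to a single rigid transform, the canonical field is a hard-coded sphere density instead of a learned network, and the function names are hypothetical. It does not reproduce the part-based voxelized representation or the 2D motion parameterization proposed in the paper.

```python
import numpy as np


def warp_to_canonical(points, rotation, translation):
    """Invert x_obs = R @ x_can + t for row-vector points of shape (N, 3).

    Hypothetical stand-in for a motion field: one rigid transform per frame.
    """
    return (points - translation) @ rotation  # equals (R^T (x_obs - t))^T


def canonical_density(points, radius=0.5):
    """Toy canonical field: constant density inside a sphere at the origin."""
    inside = np.linalg.norm(points, axis=-1) < radius
    return np.where(inside, 10.0, 0.0)


def render_opacity(origin, direction, rotation, translation,
                   near=0.0, far=4.0, n_samples=64):
    """Accumulate opacity along one ray with standard volume-rendering quadrature."""
    t = np.linspace(near, far, n_samples)
    points = origin + t[:, None] * direction           # samples in observation space
    sigma = canonical_density(warp_to_canonical(points, rotation, translation))
    delta = (far - near) / (n_samples - 1)              # uniform step size
    alpha = 1.0 - np.exp(-sigma * delta)                # per-sample opacity
    transmittance = np.cumprod(np.concatenate([[1.0], 1.0 - alpha[:-1]]))
    return float(np.sum(transmittance * alpha))


# The "body" (sphere) sits at z = 2 in the observed frame.
identity = np.eye(3)
offset = np.array([0.0, 0.0, 2.0])
cam = np.zeros(3)
hit = render_opacity(cam, np.array([0.0, 0.0, 1.0]), identity, offset)
miss = render_opacity(cam, np.array([1.0, 0.0, 0.0]), identity, offset)
```

Here `hit` is the accumulated opacity of a ray that passes through the displaced sphere and `miss` that of a ray that never enters it. In a learned setting both the warp and the density would be optimized through this same differentiable accumulation.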

Comments:	Project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
Cite as:	arXiv:2302.12237 [cs.CV]
	(or arXiv:2302.12237v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2302.12237

Submission history

From: Chen Geng
[v1] Thu, 23 Feb 2023 18:57:01 UTC (9,652 KB)
[v2] Fri, 24 Feb 2023 03:13:56 UTC (9,652 KB)
