One-shot Implicit Animatable Avatars with Model-based Priors

Huang, Yangyi; Yi, Hongwei; Liu, Weiyang; Wang, Haofan; Wu, Boxi; Wang, Wenxiao; Lin, Binbin; Zhang, Debing; Cai, Deng

Computer Science > Computer Vision and Pattern Recognition

arXiv:2212.02469v1 (cs)

[Submitted on 5 Dec 2022 (this version), latest version 27 Sep 2023 (v4)]

Title:One-shot Implicit Animatable Avatars with Model-based Priors

Authors:Yangyi Huang, Hongwei Yi, Weiyang Liu, Haofan Wang, Boxi Wu, Wenxiao Wang, Binbin Lin, Debing Zhang, Deng Cai

View PDF

Abstract:Existing neural rendering methods for creating human avatars typically either require dense input signals such as video or multi-view images, or leverage a learned prior from large-scale specific 3D human datasets such that reconstruction can be performed with sparse-view inputs. Most of these methods fail to achieve realistic reconstruction when only a single image is available. To enable the data-efficient creation of realistic animatable 3D humans, we propose ELICIT, a novel method for learning human-specific neural radiance fields from a single image. Inspired by the fact that humans can easily reconstruct the body geometry and infer the full-body clothing from a single image, we leverage two priors in ELICIT: 3D geometry prior and visual semantic prior. Specifically, ELICIT introduces the 3D body shape geometry prior from a skinned vertex-based template model (i.e., SMPL) and implements the visual clothing semantic prior with the CLIP-based pre-trained models. Both priors are used to jointly guide the optimization for creating plausible content in the invisible areas. In order to further improve visual details, we propose a segmentation-based sampling strategy that locally refines different parts of the avatar. Comprehensive evaluations on multiple popular benchmarks, including ZJU-MoCAP, Human3.6M, and DeepFashion, show that ELICIT has outperformed current state-of-the-art avatar creation methods when only a single image is available. Code will be public for reseach purpose at this https URL .

Comments:	Project website: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
Cite as:	arXiv:2212.02469 [cs.CV]
	(or arXiv:2212.02469v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2212.02469

Submission history

From: Yangyi Huang [view email]
[v1] Mon, 5 Dec 2022 18:24:06 UTC (32,809 KB)
[v2] Thu, 16 Mar 2023 09:59:52 UTC (34,253 KB)
[v3] Mon, 21 Aug 2023 08:59:06 UTC (28,844 KB)
[v4] Wed, 27 Sep 2023 05:04:23 UTC (30,369 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:One-shot Implicit Animatable Avatars with Model-based Priors

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:One-shot Implicit Animatable Avatars with Model-based Priors

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators