Spatio-temporal Tendency Reasoning for Human Body Pose and Shape Estimation from Videos

Zhang, Boyang; Wu, Su**; Cao, Hu; Ma, Kehua; Li, Pan; Lin, Lei

arXiv:2210.03659 (cs)

[Submitted on 7 Oct 2022 (v1), last revised 10 Oct 2022 (this version, v2)]

Abstract: In this paper, we present a spatio-temporal tendency reasoning (STR) network for recovering human body pose and shape from videos. Previous approaches have focused on extending 3D human datasets and on temporal-based learning to improve accuracy and temporal smoothness. In contrast, our STR aims to learn accurate and natural motion sequences in unconstrained environments through temporal and spatial tendencies, and to fully exploit the spatio-temporal features of existing video data. To this end, our STR learns feature representations in the temporal and spatial dimensions separately, in order to obtain a more robust spatio-temporal representation. More specifically, for efficient temporal modeling, we first propose a temporal tendency reasoning (TTR) module. TTR constructs a time-dimensional hierarchical residual connection representation within a video sequence to reason effectively about the tendencies of temporal sequences and to preserve the propagation of human information. Meanwhile, to enhance the spatial representation, we design a spatial tendency enhancing (STE) module that further learns to excite spatially time-frequency domain sensitive features in human motion information representations. Finally, we introduce integration strategies to integrate and refine the spatio-temporal feature representations. Extensive experiments on large-scale publicly available datasets show that our STR remains competitive with the state of the art on three datasets. Our code is available at this https URL.
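
The abstract does not specify how the "time-dimensional hierarchical residual connection" in TTR is built. As a rough illustration of the general idea only, the sketch below uses a Res2Net-style pattern, which is an assumption and not the authors' architecture: per-frame features are split into channel groups, each group is mixed with the previous group's output and then passed through a toy temporal operator, and an identity skip connection is added at the end. The names `temporal_smooth` and `hierarchical_temporal_residual`, the 3-tap averaging filter, and the group count are all hypothetical.

```python
def temporal_smooth(seq):
    """Toy temporal operator (assumption): 3-tap weighted average over
    neighbouring frames, with the first and last frames repeated at the edges."""
    T = len(seq)
    out = []
    for t in range(T):
        prev, nxt = seq[max(t - 1, 0)], seq[min(t + 1, T - 1)]
        out.append([0.25 * p + 0.5 * c + 0.25 * n
                    for p, c, n in zip(prev, seq[t], nxt)])
    return out


def hierarchical_temporal_residual(x, groups=4):
    """x: list of T frames, each a list of C floats (C divisible by groups).

    Hypothetical Res2Net-style hierarchy: group g receives the output of
    group g-1 before temporal mixing, so later groups see progressively
    wider temporal context. Returns a list with the same shape as x.
    """
    width = len(x[0]) // groups
    # Split every frame's feature vector into `groups` channel chunks.
    chunks = [[frame[g * width:(g + 1) * width] for frame in x]
              for g in range(groups)]
    outputs = [chunks[0]]  # the first group passes through unchanged
    for chunk in chunks[1:]:
        mixed = [[a + b for a, b in zip(cur, prev)]
                 for cur, prev in zip(chunk, outputs[-1])]
        outputs.append(temporal_smooth(mixed))
    # Concatenate the groups per frame and add the identity skip connection.
    result = []
    for t, frame in enumerate(x):
        merged = [v for out in outputs for v in out[t]]
        result.append([a + b for a, b in zip(frame, merged)])
    return result


features = [[1.0] * 8 for _ in range(16)]  # 16 frames, 8 channels
refined = hierarchical_temporal_residual(features, groups=4)
```

In a learned model the fixed averaging filter would be replaced by trainable temporal layers; it is used here only to keep the sketch dependency-free.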

Comments:	Accepted by BMVC2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2210.03659 [cs.CV]
	(or arXiv:2210.03659v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2210.03659

Submission history

From: Boyang Zhang
[v1] Fri, 7 Oct 2022 16:09:07 UTC (7,190 KB)
[v2] Mon, 10 Oct 2022 03:24:48 UTC (7,190 KB)
