Deep Dual Consecutive Network for Human Pose Estimation

Liu, Zhenguang; Chen, Haoming; Feng, Runyang; Wu, Shuang; Ji, Shouling; Yang, Bailin; Wang, Xun

Computer Science > Computer Vision and Pattern Recognition

arXiv:2103.07254 (cs)

[Submitted on 12 Mar 2021 (v1), last revised 19 Mar 2021 (this version, v3)]

Title:Deep Dual Consecutive Network for Human Pose Estimation

Authors:Zhenguang Liu, Haoming Chen, Runyang Feng, Shuang Wu, Shouling Ji, Bailin Yang, Xun Wang

View PDF

Abstract:Multi-frame human pose estimation in complicated situations is challenging. Although state-of-the-art human joints detectors have demonstrated remarkable results for static images, their performances come short when we apply these models to video sequences. Prevalent shortcomings include the failure to handle motion blur, video defocus, or pose occlusions, arising from the inability in capturing the temporal dependency among video frames. On the other hand, directly employing conventional recurrent neural networks incurs empirical difficulties in modeling spatial contexts, especially for dealing with pose occlusions. In this paper, we propose a novel multi-frame human pose estimation framework, leveraging abundant temporal cues between video frames to facilitate keypoint detection. Three modular components are designed in our framework. A Pose Temporal Merger encodes keypoint spatiotemporal context to generate effective searching scopes while a Pose Residual Fusion module computes weighted pose residuals in dual directions. These are then processed via our Pose Correction Network for efficient refining of pose estimations. Our method ranks No.1 in the Multi-frame Person Pose Estimation Challenge on the large-scale benchmark datasets PoseTrack2017 and PoseTrack2018. We have released our code, ho** to inspire future research.

Comments:	This paper is accepted by CVPR 2021
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2103.07254 [cs.CV]
	(or arXiv:2103.07254v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2103.07254

Submission history

From: Runyang Feng [view email]
[v1] Fri, 12 Mar 2021 13:11:27 UTC (5,650 KB)
[v2] Mon, 15 Mar 2021 02:24:12 UTC (5,656 KB)
[v3] Fri, 19 Mar 2021 11:49:02 UTC (5,651 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Deep Dual Consecutive Network for Human Pose Estimation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Deep Dual Consecutive Network for Human Pose Estimation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators