Synthetic Humans for Action Recognition from Unseen Viewpoints

Varol, Gül; Laptev, Ivan; Schmid, Cordelia; Zisserman, Andrew

doi:10.1007/s11263-021-01467-7

Computer Science > Computer Vision and Pattern Recognition

arXiv:1912.04070 (cs)

[Submitted on 9 Dec 2019 (v1), last revised 23 May 2021 (this version, v3)]

Title: Synthetic Humans for Action Recognition from Unseen Viewpoints

Authors: Gül Varol, Ivan Laptev, Cordelia Schmid, Andrew Zisserman

Abstract: Although synthetic training data has been shown to be beneficial for tasks such as human pose estimation, its use for RGB human action recognition is relatively unexplored. Our goal in this work is to answer the question of whether synthetic humans can improve the performance of human action recognition, with a particular focus on generalization to unseen viewpoints. We make use of recent advances in monocular 3D human body reconstruction from real action sequences to automatically render synthetic training videos for the action labels. We make the following contributions: (i) we investigate the extent of variations and augmentations that are beneficial to improving performance at new viewpoints. We consider changes in body shape and clothing for individuals, as well as more action-relevant augmentations such as non-uniform frame sampling and interpolating between the motion of individuals performing the same action; (ii) we introduce a new data generation methodology, SURREACT, that allows training of spatio-temporal CNNs for action classification; (iii) we substantially improve the state-of-the-art action recognition performance on the NTU RGB+D and UESTC standard human action multi-view benchmarks; and finally, (iv) we extend the augmentation approach to in-the-wild videos from a subset of the Kinetics dataset to investigate the case when only one-shot training data is available, and demonstrate improvements in this case as well.

Comments:	21 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1912.04070 [cs.CV]
	(or arXiv:1912.04070v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1912.04070
Journal reference:	International Journal of Computer Vision (2021)
Related DOI:	https://doi.org/10.1007/s11263-021-01467-7

Submission history

From: Gül Varol
[v1] Mon, 9 Dec 2019 14:17:03 UTC (8,561 KB)
[v2] Wed, 28 Oct 2020 14:14:16 UTC (8,897 KB)
[v3] Sun, 23 May 2021 14:08:35 UTC (8,895 KB)
