Video Action Detection: Analysing Limitations and Challenges

Modi, Rajat; Rana, Aayush Jung; Kumar, Akash; Tirupattur, Praveen; Vyas, Shruti; Rawat, Yogesh Singh; Shah, Mubarak

arXiv:2204.07892 (cs)

[Submitted on 17 Apr 2022]

Abstract: Beyond being large enough to feed data-hungry machines (e.g., transformers), what attributes measure the quality of a dataset? Assuming such attributes can be defined, how do we quantify their relative presence? Our work explores these questions for video action detection, a task that aims to spatio-temporally localize an actor and assign a relevant action class. We first analyze the existing datasets for video action detection and discuss their limitations. Next, we propose a new dataset, Multi Actor Multi Action (MAMA), which overcomes these limitations and is more suitable for real-world applications. In addition, we perform a bias study that analyzes a key property differentiating videos from static images: the temporal aspect. This reveals whether the actions in these datasets really require the motion information of an actor, or whether the occurrence of an action can be predicted from a single frame. Finally, we investigate the widely held assumption about the importance of temporal ordering: is temporal ordering important for detecting these actions? Such extreme experiments reveal biases that have crept into existing methods in spite of careful modeling.

Comments:	CVPRW'22
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2204.07892 [cs.CV]
	(or arXiv:2204.07892v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2204.07892

Submission history

From: Akash Kumar
[v1] Sun, 17 Apr 2022 00:42:14 UTC (10,441 KB)
