Towards Segmenting Anything That Moves

Dave, Achal; Tokmakov, Pavel; Ramanan, Deva

arXiv:1902.03715v2 (cs)

[Submitted on 11 Feb 2019 (v1), revised 25 Apr 2019 (this version, v2), latest version 1 Apr 2020 (v4)]

Abstract: For many applications such as action detection or robotic interaction, segmenting all moving objects is a crucial first step. While this problem has been well-studied under the formulation of spatio-temporal video segmentation, virtually none of the prior works use learning-based approaches, despite significant advances in single-frame instance segmentation. We propose the first deep-learning-based approach for spatio-temporal grouping. Our model extends the state-of-the-art Mask R-CNN architecture to the video domain. It takes a video frame together with its optical flow as input, and passes them through appearance and motion streams respectively. It then combines the motion cues, which provide a bottom-up signal for object detection, with appearance cues that allow capturing the full extent of the object via a joint RPN module. We show state-of-the-art results on the Freiburg Berkeley Motion Segmentation dataset by a wide margin. One potential worry with learning-based methods is that they might overfit to the particular type of objects that they have been trained on. While current recognition systems tend to be limited to a "closed world" of N objects on which they are trained, our model can segment almost anything that moves.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1902.03715 [cs.CV]
	(or arXiv:1902.03715v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1902.03715

Submission history

From: Achal Dave
[v1] Mon, 11 Feb 2019 03:40:48 UTC (9,074 KB)
[v2] Thu, 25 Apr 2019 20:18:11 UTC (3,178 KB)
[v3] Tue, 10 Sep 2019 21:15:14 UTC (2,994 KB)
[v4] Wed, 1 Apr 2020 01:19:41 UTC (2,993 KB)
