Towards Segmenting Everything That Moves

Dave, Achal; Tokmakov, Pavel; Ramanan, Deva

Computer Science > Computer Vision and Pattern Recognition

arXiv:1902.03715v1 (cs)

[Submitted on 11 Feb 2019 (this version), latest version 1 Apr 2020 (v4)]

Title:Towards Segmenting Everything That Moves

Authors:Achal Dave, Pavel Tokmakov, Deva Ramanan

View PDF

Abstract:Video analysis is the task of perceiving the world as it changes. Often, though, most of the world doesn't change all that much: it's boring. For many applications such as action detection or robotic interaction, segmenting all moving objects is a crucial first step. While this problem has been well-studied in the field of spatiotemporal segmentation, virtually none of the prior works use learning-based approaches, despite significant advances in single-frame instance segmentation. We propose the first deep-learning based approach for video instance segmentation. Our two-stream models' architecture is based on Mask R-CNN, but additionally takes optical flow as input to identify moving objects. It then combines the motion and appearance cues to correct motion estimation mistakes and capture the full extent of objects. We show state-of-the-art results on the Freiburg Berkeley Motion Segmentation dataset by a wide margin. One potential worry with learning-based methods is that they might overfit to the particular type of objects that they have been trained on. While current recognition systems tend to be limited to a "closed world" of N objects on which they are trained, our model seems to segment almost anything that moves.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1902.03715 [cs.CV]
	(or arXiv:1902.03715v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1902.03715

Submission history

From: Achal Dave [view email]
[v1] Mon, 11 Feb 2019 03:40:48 UTC (9,074 KB)
[v2] Thu, 25 Apr 2019 20:18:11 UTC (3,178 KB)
[v3] Tue, 10 Sep 2019 21:15:14 UTC (2,994 KB)
[v4] Wed, 1 Apr 2020 01:19:41 UTC (2,993 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Towards Segmenting Everything That Moves

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Towards Segmenting Everything That Moves

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators