Towards Segmenting Anything That Moves

Dave, Achal; Tokmakov, Pavel; Ramanan, Deva

Computer Science > Computer Vision and Pattern Recognition

arXiv:1902.03715v3 (cs)

[Submitted on 11 Feb 2019 (v1), revised 10 Sep 2019 (this version, v3), latest version 1 Apr 2020 (v4)]

Abstract: Detecting and segmenting individual objects, regardless of their category, is crucial for many applications such as action detection or robotic interaction. While this problem has been well-studied under the classic formulation of spatio-temporal grouping, state-of-the-art approaches do not make use of learning-based methods. To bridge this gap, we propose a simple learning-based approach for spatio-temporal grouping. Our approach leverages motion cues from optical flow as a bottom-up signal for separating objects from each other. Motion cues are then combined with appearance cues that provide a generic objectness prior for capturing the full extent of objects. We show that our approach outperforms all prior work on the benchmark FBMS dataset. One potential worry with learning-based methods is that they might overfit to the particular type of objects that they have been trained on. To address this concern, we propose two new benchmarks for generic, moving object detection, and show that our model matches top-down methods on common categories, while significantly outperforming both top-down and bottom-up methods on never-before-seen categories.

Comments:	Website: this http URL. Code: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1902.03715 [cs.CV]
	(or arXiv:1902.03715v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1902.03715

Submission history

From: Achal Dave [view email]
[v1] Mon, 11 Feb 2019 03:40:48 UTC (9,074 KB)
[v2] Thu, 25 Apr 2019 20:18:11 UTC (3,178 KB)
[v3] Tue, 10 Sep 2019 21:15:14 UTC (2,994 KB)
[v4] Wed, 1 Apr 2020 01:19:41 UTC (2,993 KB)
