3D Object Detection with a Self-supervised Lidar Scene Flow Backbone

Yurtsever, Ekim; Erçelik, Emeç; Liu, Mingyu; Yang, Zhijie; Zhang, Hanzhen; Topçam, Pınar; Listl, Maximilian; Çaylı, Yılmaz Kaan; Knoll, Alois

arXiv:2205.00705 (cs)

[Submitted on 2 May 2022 (v1), last revised 19 Jul 2022 (this version, v2)]

Abstract: State-of-the-art lidar-based 3D object detection methods rely on supervised learning and large labeled datasets. However, annotating lidar data is resource-consuming, and depending only on supervised learning limits the applicability of trained models. Self-supervised training strategies can alleviate these issues by learning a general point cloud backbone model for downstream 3D vision tasks. Against this backdrop, we show the relationship between self-supervised multi-frame flow representations and single-frame 3D detection hypotheses. Our main contribution leverages learned flow and motion representations and combines a self-supervised backbone with a supervised 3D detection head. First, a self-supervised scene flow estimation model is trained with cycle consistency. Then, the point cloud encoder of this model is used as the backbone of a single-frame 3D object detection head model. This second 3D object detection model learns to utilize motion representations to distinguish dynamic objects exhibiting different movement patterns. Experiments on KITTI and nuScenes benchmarks show that the proposed self-supervised pre-training increases 3D detection performance significantly.
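The abstract names cycle consistency as the self-supervised training signal but does not give the loss itself. As a rough illustration of the general idea only, the sketch below warps a point cloud forward with one predicted flow, warps the result back with a second predicted flow, and penalizes the distance to the starting points. All names here (`warp`, `cycle_consistency_loss`, the `shift_*` functions) are hypothetical, and the constant-shift flows are toy stand-ins for a learned flow network, not the paper's model.

```python
from typing import Callable, List, Tuple

Point = Tuple[float, float, float]
# Hypothetical interface: a flow predictor returns one flow vector per input point.
FlowFn = Callable[[List[Point]], List[Point]]


def warp(points: List[Point], flow: List[Point]) -> List[Point]:
    """Move each point by its predicted flow vector."""
    return [(p[0] + f[0], p[1] + f[1], p[2] + f[2]) for p, f in zip(points, flow)]


def cycle_consistency_loss(points: List[Point], forward: FlowFn, backward: FlowFn) -> float:
    """Mean squared distance between the points and their forward-then-backward warp."""
    warped = warp(points, forward(points))    # frame t -> estimated frame t+1
    cycled = warp(warped, backward(warped))   # estimated frame t+1 -> back to frame t
    sq_errors = [
        sum((c - p) ** 2 for c, p in zip(cycled_pt, start_pt))
        for cycled_pt, start_pt in zip(cycled, points)
    ]
    return sum(sq_errors) / len(points)


# Toy flow predictors (assumptions for illustration, not learned models).
def shift_forward(pts: List[Point]) -> List[Point]:
    return [(1.0, 0.0, 0.0) for _ in pts]


def shift_back(pts: List[Point]) -> List[Point]:
    return [(-1.0, 0.0, 0.0) for _ in pts]


def shift_back_biased(pts: List[Point]) -> List[Point]:
    return [(-0.5, 0.0, 0.0) for _ in pts]


cloud = [(0.0, 0.0, 0.0), (2.0, 1.0, 0.5)]
loss_consistent = cycle_consistency_loss(cloud, shift_forward, shift_back)
loss_inconsistent = cycle_consistency_loss(cloud, shift_forward, shift_back_biased)
```

With these toy flows, the backward shift that exactly undoes the forward shift yields zero loss, while the biased backward shift leaves a residual. No labels are involved at any point, which is what makes this kind of objective usable for pre-training an encoder before a supervised detection head is attached.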

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2205.00705 [cs.CV]
	(or arXiv:2205.00705v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2205.00705

Submission history

From: Emeç Erçelik
[v1] Mon, 2 May 2022 07:53:29 UTC (4,338 KB)
[v2] Tue, 19 Jul 2022 08:22:38 UTC (4,661 KB)
