Joint 3D Proposal Generation and Object Detection from View Aggregation

Ku, Jason; Mozifian, Melissa; Lee, Jungwook; Harakeh, Ali; Waslander, Steven

arXiv:1712.02294 (cs)

[Submitted on 6 Dec 2017 (v1), last revised 12 Jul 2018 (this version, v4)]

Abstract: We present AVOD, an Aggregate View Object Detection network for autonomous driving scenarios. The proposed neural network architecture uses LIDAR point clouds and RGB images to generate features that are shared by two subnetworks: a region proposal network (RPN) and a second-stage detector network. The proposed RPN uses a novel architecture capable of performing multimodal feature fusion on high-resolution feature maps to generate reliable 3D object proposals for multiple object classes in road scenes. Using these proposals, the second-stage detection network performs accurate oriented 3D bounding box regression and category classification to predict the extents, orientation, and classification of objects in 3D space. Our proposed architecture is shown to produce state-of-the-art results on the KITTI 3D object detection benchmark while running in real time with a low memory footprint, making it a suitable candidate for deployment on autonomous vehicles. Code is at: this https URL
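
To make "extents, orientation" concrete, the following is a minimal sketch of how an oriented 3D box could be expanded into its eight corner points. It is not taken from the AVOD code: the function name `oriented_box_corners`, the parameterization (center, length, width, height, yaw), and the choice of the z axis as the vertical rotation axis are all assumptions made for illustration, and the abstract does not state which box encoding the network actually regresses.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float, float]


def oriented_box_corners(
    center: Point, length: float, width: float, height: float, yaw: float
) -> List[Point]:
    """Return the 8 corners of a box rotated by `yaw` about the vertical axis.

    Hypothetical convention (an assumption, not the paper's encoding):
    length lies along the box's local x axis, width along local y,
    height along z, and yaw is a counter-clockwise rotation about z.
    """
    cx, cy, cz = center
    cos_y, sin_y = math.cos(yaw), math.sin(yaw)

    # Footprint corners in the box's local frame, before rotation.
    half_l, half_w, half_h = length / 2.0, width / 2.0, height / 2.0
    footprint = [
        (half_l, half_w),
        (half_l, -half_w),
        (-half_l, -half_w),
        (-half_l, half_w),
    ]

    corners: List[Point] = []
    for dz in (-half_h, half_h):  # bottom face first, then top face
        for dx, dy in footprint:
            # Standard 2D rotation of the footprint, then translate to center.
            x = cx + cos_y * dx - sin_y * dy
            y = cy + sin_y * dx + cos_y * dy
            corners.append((x, y, cz + dz))
    return corners


if __name__ == "__main__":
    # A car-sized box rotated 90 degrees: length and width swap axes.
    for corner in oriented_box_corners((10.0, 5.0, 0.0), 4.0, 2.0, 1.5, math.pi / 2):
        print(tuple(round(v, 3) for v in corner))
```

Corner points like these are typically what is compared against ground truth when computing 3D overlap; a compact (center, extents, yaw) encoding keeps the regression target small while still determining all eight corners.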

Comments:	For any inquiries contact aharakeh(at)uwaterloo(dot)ca
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1712.02294 [cs.CV]
	(or arXiv:1712.02294v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1712.02294

Submission history

From: Ali Harakeh
[v1] Wed, 6 Dec 2017 17:20:21 UTC (8,705 KB)
[v2] Fri, 22 Dec 2017 16:34:19 UTC (8,599 KB)
[v3] Tue, 6 Mar 2018 16:50:01 UTC (5,126 KB)
[v4] Thu, 12 Jul 2018 14:11:40 UTC (5,452 KB)
