Computer Science > Computer Vision and Pattern Recognition
[Submitted on 19 Sep 2016 (this version), latest version 13 Jul 2017 (v3)]
Title:Random Forests versus Neural Networks - What's Best for Camera Relocalization?
View PDFAbstract:This work addresses the task of camera localization in a known 3D scene, given a single input RGB image. State-of-the-art approaches accomplish this with two steps. Firstly, regressing for every pixel in the image its so-called 3D scene coordinate and, subsequently, using those coordinates to estimate the final 6D camera pose via RANSAC. To solve the first step, Random Forests (RFs) are typically used. On the other hand, Neural Networks (NNs) currently reign in many dense regression problems, but are not test-time efficient. We ask the question: Which of the two is the best choice for camera localization? To address this, we make two method contributions: (1) a test-time efficient NN architecture which we term a ForestNet that is derived and initialized from a RF, and (2) a new fully-differentiable robust averaging technique for regression ensembles which can be trained end-to-end with a NN architecture. Our experimental findings show that traditional NN architectures are superior to test-time efficient RFs and ForestNets in terms of scene coordinate regression, however, this does not translate to final 6D camera pose accuracy where ForestNets and RFs perform slightly better. To summarize, our best method, a ForestNet with a robust average, improves over the state-of-the-art for camera localization on the 7-Scenes dataset. While this work focuses on scene coordinate regression for camera localization, our innovations may also be applied to other continuous regression tasks.
Submission history
From: Daniela Massiceti [view email][v1] Mon, 19 Sep 2016 15:50:25 UTC (5,124 KB)
[v2] Wed, 1 Mar 2017 17:36:00 UTC (3,518 KB)
[v3] Thu, 13 Jul 2017 08:52:13 UTC (3,518 KB)
Bibliographic and Citation Tools
Bibliographic Explorer (What is the Explorer?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Code, Data and Media Associated with this Article
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)
Demos
Recommenders and Search Tools
Influence Flower (What are Influence Flowers?)
Connected Papers (What is Connected Papers?)
CORE Recommender (What is CORE?)
arXivLabs: experimental projects with community collaborators
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.