Skip to main content

Showing 1–26 of 26 results for author: Brachmann, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.14351  [pdf, other

    cs.CV

    Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer

    Authors: Eric Brachmann, Jamie Wynn, Shuai Chen, Tommaso Cavallari, Áron Monszpart, Daniyar Turmukhambetov, Victor Adrian Prisacariu

    Abstract: We address the task of estimating camera parameters from a set of images depicting a scene. Popular feature-based structure-from-motion (SfM) tools solve this task by incremental reconstruction: they repeat triangulation of sparse 3D points and registration of more camera views to the sparse point cloud. We re-interpret incremental structure-from-motion as an iterated application and refinement of… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: Project page: https://nianticlabs.github.io/acezero/

  2. arXiv:2404.09884  [pdf, other

    cs.CV cs.LG

    Map-Relative Pose Regression for Visual Re-Localization

    Authors: Shuai Chen, Tommaso Cavallari, Victor Adrian Prisacariu, Eric Brachmann

    Abstract: Pose regression networks predict the camera pose of a query image relative to a known environment. Within this family of methods, absolute pose regression (APR) has recently shown promising accuracy in the range of a few centimeters in position error. APR networks encode the scene geometry implicitly in their weights. To achieve high accuracy, they require vast amounts of training data that, reali… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2024, Highlight Paper

  3. arXiv:2404.06337  [pdf, other

    cs.CV

    Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences

    Authors: Axel Barroso-Laguna, Sowmya Munukutla, Victor Adrian Prisacariu, Eric Brachmann

    Abstract: Given two images, we can estimate the relative camera pose between them by establishing image-to-image correspondences. Usually, correspondences are 2D-to-2D and the pose we estimate is defined only up to scale. Some applications, aiming at instant augmented reality anywhere, require scale-metric pose estimates, and hence, they rely on external depth estimators to recover the scale. We present Mic… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  4. arXiv:2403.10452  [pdf, other

    cs.CV

    Robust Shape Fitting for 3D Scene Abstraction

    Authors: Florian Kluger, Eric Brachmann, Michael Ying Yang, Bodo Rosenhahn

    Abstract: Humans perceive and construct the world as an arrangement of simple parametric models. In particular, we can often describe man-made environments using volumetric primitives such as cuboids or cylinders. Inferring these primitives is important for attaining high-level, abstract scene descriptions. Previous approaches for primitive-based abstraction estimate shape parameters directly and are only a… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted for publication in Transactions on Pattern Analysis and Machine Intelligence (PAMI). arXiv admin note: substantial text overlap with arXiv:2105.02047

  5. arXiv:2403.09799  [pdf, other

    cs.CV cs.RO

    BOP Challenge 2023 on Detection, Segmentation and Pose Estimation of Seen and Unseen Rigid Objects

    Authors: Tomas Hodan, Martin Sundermeyer, Yann Labbe, Van Nguyen Nguyen, Gu Wang, Eric Brachmann, Bertram Drost, Vincent Lepetit, Carsten Rother, Jiri Matas

    Abstract: We present the evaluation methodology, datasets and results of the BOP Challenge 2023, the fifth in a series of public competitions organized to capture the state of the art in model-based 6D object pose estimation from an RGB/RGB-D image and related tasks. Besides the three tasks from 2022 (model-based 2D detection, 2D segmentation, and 6D localization of objects seen during training), the 2023 c… ▽ More

    Submitted 16 April, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2302.13075

  6. arXiv:2306.01596  [pdf, other

    cs.CV

    Two-View Geometry Scoring Without Correspondences

    Authors: Axel Barroso-Laguna, Eric Brachmann, Victor Adrian Prisacariu, Gabriel J. Brostow, Daniyar Turmukhambetov

    Abstract: Camera pose estimation for two-view geometry traditionally relies on RANSAC. Normally, a multitude of image correspondences leads to a pool of proposed hypotheses, which are then scored to find a winning model. The inlier count is generally regarded as a reliable indicator of "consensus". We examine this scoring heuristic, and find that it favors disappointing models under certain circumstances. A… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023

  7. arXiv:2305.14059  [pdf, other

    cs.CV cs.LG

    Accelerated Coordinate Encoding: Learning to Relocalize in Minutes using RGB and Poses

    Authors: Eric Brachmann, Tommaso Cavallari, Victor Adrian Prisacariu

    Abstract: Learning-based visual relocalizers exhibit leading pose accuracy, but require hours or days of training. Since training needs to happen on each new scene again, long training times make learning-based relocalization impractical for most applications, despite its promise of high accuracy. In this paper we show how such a system can actually achieve the same accuracy in less than 5 minutes. We start… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: CVPR 2023 Highlight

  8. arXiv:2302.13075  [pdf, other

    cs.CV

    BOP Challenge 2022 on Detection, Segmentation and Pose Estimation of Specific Rigid Objects

    Authors: Martin Sundermeyer, Tomas Hodan, Yann Labbe, Gu Wang, Eric Brachmann, Bertram Drost, Carsten Rother, Jiri Matas

    Abstract: We present the evaluation methodology, datasets and results of the BOP Challenge 2022, the fourth in a series of public competitions organized with the goal to capture the status quo in the field of 6D object pose estimation from an RGB/RGB-D image. In 2022, we witnessed another significant improvement in the pose estimation accuracy -- the state of the art, which was 56.9 AR$_C$ in 2019 (Vidal et… ▽ More

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2009.07378

  9. arXiv:2210.05494  [pdf, other

    cs.CV

    Map-free Visual Relocalization: Metric Pose Relative to a Single Image

    Authors: Eduardo Arnold, Jamie Wynn, Sara Vicente, Guillermo Garcia-Hernando, Áron Monszpart, Victor Adrian Prisacariu, Daniyar Turmukhambetov, Eric Brachmann

    Abstract: Can we relocalize in a scene represented by a single reference image? Standard visual relocalization requires hundreds of images and scale calibration to build a scene-specific 3D map. In contrast, we propose Map-free Relocalization, i.e., using only one photo of a scene to enable instant, metric scaled relocalization. Existing datasets are not suitable to benchmark map-free relocalization, due to… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: ECCV2022 camera-ready. 14 pages + 4 reference pages

  10. arXiv:2109.00524  [pdf, other

    cs.CV cs.LG

    On the Limits of Pseudo Ground Truth in Visual Camera Re-localisation

    Authors: Eric Brachmann, Martin Humenberger, Carsten Rother, Torsten Sattler

    Abstract: Benchmark datasets that measure camera pose accuracy have driven progress in visual re-localisation research. To obtain poses for thousands of images, it is common to use a reference algorithm to generate pseudo ground truth. Popular choices include Structure-from-Motion (SfM) and Simultaneous-Localisation-and-Map** (SLAM) using additional sensors like depth cameras if available. Re-localisation… ▽ More

    Submitted 1 September, 2021; originally announced September 2021.

    Comments: ICCV 2021

  11. arXiv:2105.02047  [pdf, other

    cs.CV

    Cuboids Revisited: Learning Robust 3D Shape Fitting to Single RGB Images

    Authors: Florian Kluger, Hanno Ackermann, Eric Brachmann, Michael Ying Yang, Bodo Rosenhahn

    Abstract: Humans perceive and construct the surrounding world as an arrangement of simple parametric models. In particular, man-made environments commonly consist of volumetric primitives such as cuboids or cylinders. Inferring these primitives is an important step to attain high-level, abstract scene descriptions. Previous approaches directly estimate shape parameters from a 2D or 3D input, and are only ab… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: CVPR 2021

  12. arXiv:2104.02538  [pdf, other

    cs.CV

    Visual Camera Re-Localization Using Graph Neural Networks and Relative Pose Supervision

    Authors: Mehmet Ozgur Turkoglu, Eric Brachmann, Konrad Schindler, Gabriel Brostow, Aron Monszpart

    Abstract: Visual re-localization means using a single image as input to estimate the camera's location and orientation relative to a pre-recorded environment. The highest-scoring methods are "structure based," and need the query camera's intrinsics as an input to the model, with careful geometric optimization. When intrinsics are absent, methods vie for accuracy by making various other assumptions. This yie… ▽ More

    Submitted 12 April, 2021; v1 submitted 6 April, 2021; originally announced April 2021.

  13. arXiv:2009.07378  [pdf, other

    cs.CV cs.GR cs.LG cs.RO

    BOP Challenge 2020 on 6D Object Localization

    Authors: Tomas Hodan, Martin Sundermeyer, Bertram Drost, Yann Labbe, Eric Brachmann, Frank Michel, Carsten Rother, Jiri Matas

    Abstract: This paper presents the evaluation methodology, datasets, and results of the BOP Challenge 2020, the third in a series of public competitions organized with the goal to capture the status quo in the field of 6D object pose estimation from an RGB-D image. In 2020, to reduce the domain gap between synthetic training and real test RGB images, the participants were provided 350K photorealistic trainin… ▽ More

    Submitted 13 October, 2020; v1 submitted 15 September, 2020; originally announced September 2020.

    Comments: In ECCV 2020 Workshops Proceedings

  14. arXiv:2002.12324  [pdf, other

    cs.CV cs.LG

    Visual Camera Re-Localization from RGB and RGB-D Images Using DSAC

    Authors: Eric Brachmann, Carsten Rother

    Abstract: We describe a learning-based system that estimates the camera position and orientation from a single input image relative to a known environment. The system is flexible w.r.t. the amount of information available at test and at training time, catering to different applications. Input images can be RGB-D or RGB, and a 3D model of the environment can be utilized for training but is not necessary. In… ▽ More

    Submitted 9 October, 2020; v1 submitted 27 February, 2020; originally announced February 2020.

  15. arXiv:2001.02643  [pdf, other

    cs.CV

    CONSAC: Robust Multi-Model Fitting by Conditional Sample Consensus

    Authors: Florian Kluger, Eric Brachmann, Hanno Ackermann, Carsten Rother, Michael Ying Yang, Bodo Rosenhahn

    Abstract: We present a robust estimator for fitting multiple parametric models of the same form to noisy measurements. Applications include finding multiple vanishing points in man-made scenes, fitting planes to architectural imagery, or estimating multiple rigid motions within the same sequence. In contrast to previous works, which resorted to hand-crafted search strategies for multiple model detection, we… ▽ More

    Submitted 25 March, 2020; v1 submitted 8 January, 2020; originally announced January 2020.

    Comments: CVPR 2020

  16. arXiv:1912.00623  [pdf, other

    cs.CV cs.LG

    Reinforced Feature Points: Optimizing Feature Detection and Description for a High-Level Task

    Authors: Aritra Bhowmik, Stefan Gumhold, Carsten Rother, Eric Brachmann

    Abstract: We address a core problem of computer vision: Detection and description of 2D feature points for image matching. For a long time, hand-crafted designs, like the seminal SIFT algorithm, were unsurpassed in accuracy and efficiency. Recently, learned feature detectors emerged that implement detection and description using neural networks. Training these networks usually resorts to optimizing low-leve… ▽ More

    Submitted 20 March, 2020; v1 submitted 2 December, 2019; originally announced December 2019.

    Comments: CVPR 2020 (oral)

  17. arXiv:1908.02484  [pdf, other

    cs.CV

    Expert Sample Consensus Applied to Camera Re-Localization

    Authors: Eric Brachmann, Carsten Rother

    Abstract: Fitting model parameters to a set of noisy data points is a common problem in computer vision. In this work, we fit the 6D camera pose to a set of noisy correspondences between the 2D input image and a known 3D environment. We estimate these correspondences from the image using a neural network. Since the correspondences often contain outliers, we utilize a robust estimator such as Random Sample C… ▽ More

    Submitted 7 August, 2019; originally announced August 2019.

    Comments: ICCV 2019. Supplementary materials included

  18. arXiv:1905.04132  [pdf, other

    cs.CV

    Neural-Guided RANSAC: Learning Where to Sample Model Hypotheses

    Authors: Eric Brachmann, Carsten Rother

    Abstract: We present Neural-Guided RANSAC (NG-RANSAC), an extension to the classic RANSAC algorithm from robust optimization. NG-RANSAC uses prior information to improve model hypothesis search, increasing the chance of finding outlier-free minimal sets. Previous works use heuristic side-information like hand-crafted descriptor distance to guide hypothesis search. In contrast, we learn hypothesis search in… ▽ More

    Submitted 31 July, 2019; v1 submitted 10 May, 2019; originally announced May 2019.

    Comments: ICCV 2019

  19. arXiv:1808.08319  [pdf, other

    cs.CV cs.AI cs.RO

    BOP: Benchmark for 6D Object Pose Estimation

    Authors: Tomas Hodan, Frank Michel, Eric Brachmann, Wadim Kehl, Anders Glent Buch, Dirk Kraft, Bertram Drost, Joel Vidal, Stephan Ihrke, Xenophon Zabulis, Caner Sahin, Fabian Manhardt, Federico Tombari, Tae-Kyun Kim, Jiri Matas, Carsten Rother

    Abstract: We propose a benchmark for 6D pose estimation of a rigid object from a single RGB-D input image. The training data consists of a texture-mapped 3D object model or images of the object in known 6D poses. The benchmark comprises of: i) eight datasets in a unified format that cover different practical scenarios, including two new datasets focusing on varying lighting conditions, ii) an evaluation met… ▽ More

    Submitted 24 August, 2018; originally announced August 2018.

    Comments: ECCV 2018

  20. arXiv:1712.01924  [pdf, other

    cs.CV

    iPose: Instance-Aware 6D Pose Estimation of Partly Occluded Objects

    Authors: Omid Hosseini Jafari, Siva Karthik Mustikovela, Karl Pertsch, Eric Brachmann, Carsten Rother

    Abstract: We address the task of 6D pose estimation of known rigid objects from single input images in scenarios where the objects are partly occluded. Recent RGB-D-based methods are robust to moderate degrees of occlusion. For RGB inputs, no previous method works well for partly occluded objects. Our main contribution is to present the first deep learning-based system that estimates accurate poses for part… ▽ More

    Submitted 18 June, 2018; v1 submitted 5 December, 2017; originally announced December 2017.

  21. arXiv:1711.10228  [pdf, other

    cs.CV

    Learning Less is More - 6D Camera Localization via 3D Surface Regression

    Authors: Eric Brachmann, Carsten Rother

    Abstract: Popular research areas like autonomous driving and augmented reality have renewed the interest in image-based camera localization. In this work, we address the task of predicting the 6D camera pose from a single RGB image in a given 3D environment. With the advent of neural networks, previous works have either learned the entire camera localization process, or multiple components of a camera local… ▽ More

    Submitted 27 March, 2018; v1 submitted 28 November, 2017; originally announced November 2017.

    Comments: CVPR 2018

  22. arXiv:1612.03779  [pdf, other

    cs.CV

    PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning

    Authors: Alexander Krull, Eric Brachmann, Sebastian Nowozin, Frank Michel, Jamie Shotton, Carsten Rother

    Abstract: State-of-the-art computer vision algorithms often achieve efficiency by making discrete choices about which hypotheses to explore next. This allows allocation of computational resources to promising candidates, however, such decisions are non-differentiable. As a result, these algorithms are hard to train in an end-to-end fashion. In this work we propose to learn an efficient algorithm for the tas… ▽ More

    Submitted 11 April, 2017; v1 submitted 12 December, 2016; originally announced December 2016.

  23. arXiv:1612.02287  [pdf, other

    cs.CV

    Global Hypothesis Generation for 6D Object Pose Estimation

    Authors: Frank Michel, Alexander Kirillov, Eric Brachmann, Alexander Krull, Stefan Gumhold, Bogdan Savchynskyy, Carsten Rother

    Abstract: This paper addresses the task of estimating the 6D pose of a known 3D object from a single RGB-D image. Most modern approaches solve this task in three steps: i) Compute local features; ii) Generate a pool of pose-hypotheses; iii) Select and refine a pose from the pool. This work focuses on the second step. While all existing approaches generate the hypotheses pool via local reasoning, e.g. RANSAC… ▽ More

    Submitted 2 January, 2017; v1 submitted 7 December, 2016; originally announced December 2016.

  24. arXiv:1611.05705  [pdf, other

    cs.CV

    DSAC - Differentiable RANSAC for Camera Localization

    Authors: Eric Brachmann, Alexander Krull, Sebastian Nowozin, Jamie Shotton, Frank Michel, Stefan Gumhold, Carsten Rother

    Abstract: RANSAC is an important algorithm in robust optimization and a central building block for many computer vision applications. In recent years, traditionally hand-crafted pipelines have been replaced by deep learning pipelines, which can be trained in an end-to-end fashion. However, RANSAC has so far not been used as part of such deep learning pipelines, because its hypothesis selection procedure is… ▽ More

    Submitted 21 March, 2018; v1 submitted 17 November, 2016; originally announced November 2016.

    Comments: CVPR 2017

  25. arXiv:1609.05797  [pdf, other

    cs.CV cs.RO

    Random Forests versus Neural Networks - What's Best for Camera Localization?

    Authors: Daniela Massiceti, Alexander Krull, Eric Brachmann, Carsten Rother, Philip H. S. Torr

    Abstract: This work addresses the task of camera localization in a known 3D scene given a single input RGB image. State-of-the-art approaches accomplish this in two steps: firstly, regressing for every pixel in the image its 3D scene coordinate and subsequently, using these coordinates to estimate the final 6D camera pose via RANSAC. To solve the first step, Random Forests (RFs) are typically used. On the o… ▽ More

    Submitted 13 July, 2017; v1 submitted 19 September, 2016; originally announced September 2016.

    Comments: 8 pages, 4 figures

  26. arXiv:1508.04546  [pdf, other

    cs.CV

    Learning Analysis-by-Synthesis for 6D Pose Estimation in RGB-D Images

    Authors: Alexander Krull, Eric Brachmann, Frank Michel, Michael Ying Yang, Stefan Gumhold, Carsten Rother

    Abstract: Analysis-by-synthesis has been a successful approach for many tasks in computer vision, such as 6D pose estimation of an object in an RGB-D image which is the topic of this work. The idea is to compare the observation with the output of a forward process, such as a rendered image of the object of interest in a particular pose. Due to occlusion or complicated sensor noise, it can be difficult to pe… ▽ More

    Submitted 19 August, 2015; originally announced August 2015.

    Comments: 16 pages, 8 figures

    MSC Class: 65-XX