Skip to main content

Showing 1–50 of 61 results for author: Sattler, T

.
  1. arXiv:2406.17345  [pdf, other

    cs.CV

    NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods

    Authors: Jonas Kulhanek, Torsten Sattler

    Abstract: Novel view synthesis is an important problem with many applications, including AR/VR, gaming, and simulations for robotics. With the recent rapid development of Neural Radiance Fields (NeRFs) and 3D Gaussian Splatting (3DGS) methods, it is becoming difficult to keep track of the current state of the art (SoTA) due to methods using different evaluation protocols, codebases being difficult to instal… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Web: https://jkulhanek.com/nerfbaselines

  2. arXiv:2406.08463  [pdf, other

    cs.CV

    Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement

    Authors: Maxime Pietrantoni, Gabriela Csurka, Martin Humenberger, Torsten Sattler

    Abstract: Visual localization techniques rely upon some underlying scene representation to localize against. These representations can be explicit such as 3D SFM map or implicit, such as a neural network that learns to encode the scene. The former requires sparse feature extractors and matchers to build the scene representation. The latter might lack geometric grounding not capturing the 3D structure of the… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Published in 3DV24 (highlight)

  3. arXiv:2404.10772  [pdf, other

    cs.CV

    Gaussian Opacity Fields: Efficient and Compact Surface Reconstruction in Unbounded Scenes

    Authors: Zehao Yu, Torsten Sattler, Andreas Geiger

    Abstract: Recently, 3D Gaussian Splatting (3DGS) has demonstrated impressive novel view synthesis results, while allowing the rendering of high-resolution images in real-time. However, leveraging 3D Gaussians for surface reconstruction poses significant challenges due to the explicit and disconnected nature of 3D Gaussians. In this work, we present Gaussian Opacity Fields (GOF), a novel approach for efficie… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Project page: https://niu**shuchong.github.io/gaussian-opacity-fields

  4. arXiv:2404.10438  [pdf, other

    cs.CV

    The Unreasonable Effectiveness of Pre-Trained Features for Camera Pose Refinement

    Authors: Gabriele Trivigno, Carlo Masone, Barbara Caputo, Torsten Sattler

    Abstract: Pose refinement is an interesting and practically relevant research direction. Pose refinement can be used to (1) obtain a more accurate pose estimate from an initial prior (e.g., from retrieval), (2) as pre-processing, i.e., to provide a better starting point to a more expensive pose estimator, (3) as post-processing of a more accurate localizer. Existing approaches focus on learning features / s… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR2024 (Highlight)

  5. arXiv:2312.01148  [pdf, other

    cs.CV

    Has Anything Changed? 3D Change Detection by 2D Segmentation Masks

    Authors: Aikaterini Adam, Konstantinos Karantzalos, Lazaros Grammatikopoulos, Torsten Sattler

    Abstract: As capturing devices become common, 3D scans of interior spaces are acquired on a daily basis. Through scene comparison over time, information about objects in the scene and their changes is inferred. This information is important for robots and AR and VR devices, in order to operate in an immersive virtual experience. We thus propose an unsupervised object discovery method that identifies added,… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  6. arXiv:2311.16493  [pdf, other

    cs.CV

    Mip-Splatting: Alias-free 3D Gaussian Splatting

    Authors: Zehao Yu, Anpei Chen, Binbin Huang, Torsten Sattler, Andreas Geiger

    Abstract: Recently, 3D Gaussian Splatting has demonstrated impressive novel view synthesis results, reaching high fidelity and efficiency. However, strong artifacts can be observed when changing the sampling rate, \eg, by changing focal length or camera distance. We find that the source for this phenomenon can be attributed to the lack of 3D frequency constraints and the usage of a 2D dilation filter. To ad… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: Project page: https://niu**shuchong.github.io/mip-splatting/

  7. arXiv:2305.04603  [pdf, other

    cs.CV

    Privacy-Preserving Representations are not Enough -- Recovering Scene Content from Camera Poses

    Authors: Kunal Chelani, Torsten Sattler, Fredrik Kahl, Zuzana Kukelova

    Abstract: Visual localization is the task of estimating the camera pose from which a given image was taken and is central to several 3D computer vision applications. With the rapid growth in the popularity of AR/VR/MR devices and cloud-based applications, privacy issues are becoming a very important aspect of the localization process. Existing work on privacy-preserving localization aims to defend against a… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

  8. arXiv:2304.09987  [pdf, other

    cs.CV cs.GR cs.LG

    Tetra-NeRF: Representing Neural Radiance Fields Using Tetrahedra

    Authors: Jonas Kulhanek, Torsten Sattler

    Abstract: Neural Radiance Fields (NeRFs) are a very recent and very popular approach for the problems of novel view synthesis and 3D reconstruction. A popular scene representation used by NeRFs is to combine a uniform, voxel-based subdivision of the scene with an MLP. Based on the observation that a (sparse) point cloud of the scene is often available, this paper proposes to use an adaptive representation b… ▽ More

    Submitted 20 August, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: ICCV 2023, Web: https://jkulhanek.com/tetra-nerf

  9. arXiv:2304.05947  [pdf, other

    cs.CV

    Visual Localization using Imperfect 3D Models from the Internet

    Authors: Vojtech Panek, Zuzana Kukelova, Torsten Sattler

    Abstract: Visual localization is a core component in many applications, including augmented reality (AR). Localization algorithms compute the camera pose of a query image w.r.t. a scene representation, which is typically built from images. This often requires capturing and storing large amounts of data, followed by running Structure-from-Motion (SfM) algorithms. An interesting, and underexplored, source of… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: to be presented at CVPR 2023

    ACM Class: I.2.10; I.4.8; I.4.9

  10. arXiv:2303.16078  [pdf, other

    cs.CV

    Relative pose of three calibrated and partially calibrated cameras from four points using virtual correspondences

    Authors: Charalambos Tzamos, Daniel Barath, Torsten Sattler, Zuzana Kukelova

    Abstract: We study challenging problems of estimating the relative pose of three cameras and propose novel efficient solutions to (1) the notoriously difficult configuration of four points in three calibrated views, known as the 4p3v problem, and (2) to the previously unsolved configuration of four points in three cameras with unknown shared focal length, i.e., the 4p3vf problem. Our solutions are based on… ▽ More

    Submitted 11 December, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

  11. arXiv:2209.15072  [pdf, other

    cs.CV

    Partially calibrated semi-generalized pose from hybrid point correspondences

    Authors: Snehal Bhayani, Viktor Larsson, Torsten Sattler, Janne Heikkila, Zuzana Kukelova

    Abstract: In this paper we study the problem of estimating the semi-generalized pose of a partially calibrated camera, i.e., the pose of a perspective camera with unknown focal length w.r.t. a generalized camera, from a hybrid set of 2D-2D and 2D-3D point correspondences. We study all possible camera configurations within the generalized camera system. To derive practical solvers to previously unsolved chal… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

  12. arXiv:2208.09870  [pdf, other

    cs.CV

    Objects Can Move: 3D Change Detection by Geometric Transformation Constistency

    Authors: Aikaterini Adam, Torsten Sattler, Konstantinos Karantzalos, Tomas Pajdla

    Abstract: AR/VR applications and robots need to know when the scene has changed. An example is when objects are moved, added, or removed from the scene. We propose a 3D object discovery method that is based only on scene changes. Our method does not need to encode any assumptions about what is an object, but rather discovers objects by exploiting their coherent move. Changes are initially detected as differ… ▽ More

    Submitted 21 August, 2022; originally announced August 2022.

  13. arXiv:2207.10762  [pdf, other

    cs.CV

    MeshLoc: Mesh-Based Visual Localization

    Authors: Vojtech Panek, Zuzana Kukelova, Torsten Sattler

    Abstract: Visual localization, i.e., the problem of camera pose estimation, is a central component of applications such as autonomous robots and augmented reality systems. A dominant approach in the literature, shown to scale to large scenes and to handle complex illumination and seasonal changes, is based on local features extracted from images. The scene representation is a sparse Structure-from-Motion po… ▽ More

    Submitted 25 July, 2022; v1 submitted 21 July, 2022; originally announced July 2022.

    Comments: to be published in the proceedings of ECCV 2022, code repository: https://github.com/tsattler/meshloc_release

    ACM Class: I.2.10; I.4.9

  14. arXiv:2206.00665  [pdf, other

    cs.CV

    MonoSDF: Exploring Monocular Geometric Cues for Neural Implicit Surface Reconstruction

    Authors: Zehao Yu, Songyou Peng, Michael Niemeyer, Torsten Sattler, Andreas Geiger

    Abstract: In recent years, neural implicit surface reconstruction methods have become popular for multi-view 3D reconstruction. In contrast to traditional multi-view stereo methods, these approaches tend to produce smoother and more complete reconstructions due to the inductive smoothness bias of neural networks. State-of-the-art neural implicit methods allow for high-quality reconstructions of simple scene… ▽ More

    Submitted 12 October, 2022; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: Project page: https://niu**shuchong.github.io/monosdf/

  15. Investigating the Role of Image Retrieval for Visual Localization -- An exhaustive benchmark

    Authors: Martin Humenberger, Yohann Cabon, Noé Pion, Philippe Weinzaepfel, Donghwan Lee, Nicolas Guérin, Torsten Sattler, Gabriela Csurka

    Abstract: Visual localization, i.e., camera pose estimation in a known scene, is a core component of technologies such as autonomous driving and augmented reality. State-of-the-art localization approaches often rely on image retrieval techniques for one of two purposes: (1) provide an approximate pose estimate or (2) determine which parts of the scene are potentially visible in a given query image. It is co… ▽ More

    Submitted 31 May, 2022; originally announced May 2022.

    Comments: International Journal of Computer Vision (2022). arXiv admin note: text overlap with arXiv:2011.11946

  16. arXiv:2205.02830  [pdf, other

    cs.CV

    Interaction Replica: Tracking Human-Object Interaction and Scene Changes From Human Motion

    Authors: Vladimir Guzov, Julian Chibane, Riccardo Marin, Yannan He, Yunus Saracoglu, Torsten Sattler, Gerard Pons-Moll

    Abstract: Our world is not static and humans naturally cause changes in their environments through interactions, e.g., opening doors or moving furniture. Modeling changes caused by humans is essential for building digital twins, e.g., in the context of shared physical-virtual spaces (metaverses) and robotics. In order for widespread adoption of such emerging applications, the sensor setup used to capture th… ▽ More

    Submitted 18 March, 2024; v1 submitted 5 May, 2022; originally announced May 2022.

    Comments: International Conference on 3D Vision 2024 (3DV'24)

  17. arXiv:2204.03444  [pdf, other

    cs.CV

    Deep Visual Geo-localization Benchmark

    Authors: Gabriele Berton, Riccardo Mereu, Gabriele Trivigno, Carlo Masone, Gabriela Csurka, Torsten Sattler, Barbara Caputo

    Abstract: In this paper, we propose a new open-source benchmarking framework for Visual Geo-localization (VG) that allows to build, train, and test a wide range of commonly used architectures, with the flexibility to change individual components of a geo-localization pipeline. The purpose of this framework is twofold: i) gaining insights into how different components and design choices in a VG pipeline impa… ▽ More

    Submitted 9 June, 2023; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: CVPR 2022 (Oral)

  18. arXiv:2203.10157  [pdf, other

    cs.CV cs.LG

    ViewFormer: NeRF-free Neural Rendering from Few Images Using Transformers

    Authors: Jonáš Kulhánek, Erik Derner, Torsten Sattler, Robert Babuška

    Abstract: Novel view synthesis is a long-standing problem. In this work, we consider a variant of the problem where we are given only a few context views sparsely covering a scene or an object. The goal is to predict novel viewpoints in the scene, which requires learning priors. The current state of the art is based on Neural Radiance Field (NeRF), and while achieving impressive results, the methods suffer… ▽ More

    Submitted 21 July, 2022; v1 submitted 18 March, 2022; originally announced March 2022.

    Comments: ECCV 2022 poster

  19. arXiv:2109.04527  [pdf, other

    cs.CV

    CrowdDriven: A New Challenging Dataset for Outdoor Visual Localization

    Authors: Ara Jafarzadeh, Manuel Lopez Antequera, Pau Gargallo, Yubin Kuang, Carl Toft, Fredrik Kahl, Torsten Sattler

    Abstract: Visual localization is the problem of estimating the position and orientation from which a given image (or a sequence of images) is taken in a known scene. It is an important part of a wide range of computer vision and robotics applications, from self-driving cars to augmented/virtual reality systems. Visual localization techniques should work reliably and robustly under a wide range of conditions… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

  20. arXiv:2109.00524  [pdf, other

    cs.CV cs.LG

    On the Limits of Pseudo Ground Truth in Visual Camera Re-localisation

    Authors: Eric Brachmann, Martin Humenberger, Carsten Rother, Torsten Sattler

    Abstract: Benchmark datasets that measure camera pose accuracy have driven progress in visual re-localisation research. To obtain poses for thousands of images, it is common to use a reference algorithm to generate pseudo ground truth. Popular choices include Structure-from-Motion (SfM) and Simultaneous-Localisation-and-Map** (SLAM) using additional sensors like depth cameras if available. Re-localisation… ▽ More

    Submitted 1 September, 2021; originally announced September 2021.

    Comments: ICCV 2021

  21. arXiv:2103.17265  [pdf, other

    cs.CV

    Human POSEitioning System (HPS): 3D Human Pose Estimation and Self-localization in Large Scenes from Body-Mounted Sensors

    Authors: Vladimir Guzov, Aymen Mir, Torsten Sattler, Gerard Pons-Moll

    Abstract: We introduce (HPS) Human POSEitioning System, a method to recover the full 3D pose of a human registered with a 3D scan of the surrounding environment using wearable sensors. Using IMUs attached at the body limbs and a head mounted camera looking outwards, HPS fuses camera based self-localization with IMU-based human body tracking. The former provides drift-free but noisy position and orientation… ▽ More

    Submitted 31 March, 2021; originally announced March 2021.

    Comments: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

  22. arXiv:2103.09213  [pdf, other

    cs.CV

    Back to the Feature: Learning Robust Camera Localization from Pixels to Pose

    Authors: Paul-Edouard Sarlin, Ajaykumar Unagar, Måns Larsson, Hugo Germain, Carl Toft, Viktor Larsson, Marc Pollefeys, Vincent Lepetit, Lars Hammarstrand, Fredrik Kahl, Torsten Sattler

    Abstract: Camera pose estimation in known scenes is a 3D geometry task recently tackled by multiple learning algorithms. Many regress precise geometric quantities, like poses or 3D points, from an input image. This either fails to generalize to new viewpoints or ties the model parameters to a specific scene. In this paper, we go Back to the Feature: we argue that deep networks should focus on learning robus… ▽ More

    Submitted 7 April, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: Accepted to CVPR 2021

  23. arXiv:2103.06535  [pdf, other

    cs.CV

    Calibrated and Partially Calibrated Semi-Generalized Homographies

    Authors: Snehal Bhayani, Torsten Sattler, Daniel Barath, Patrik Beliansky, Janne Heikkila, Zuzana Kukelova

    Abstract: In this paper, we propose the first minimal solutions for estimating the semi-generalized homography given a perspective and a generalized camera. The proposed solvers use five 2D-2D image point correspondences induced by a scene plane. One of them assumes the perspective camera to be fully calibrated, while the other solver estimates the unknown focal length together with the absolute pose parame… ▽ More

    Submitted 11 October, 2021; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: Accepted to ICCV 2021 and to appear in the conference proceedings

  24. arXiv:2103.05086  [pdf, other

    cs.CV

    How Privacy-Preserving are Line Clouds? Recovering Scene Details from 3D Lines

    Authors: Kunal Chelani, Fredrik Kahl, Torsten Sattler

    Abstract: Visual localization is the problem of estimating the camera pose of a given image with respect to a known scene. Visual localization algorithms are a fundamental building block in advanced computer vision applications, including Mixed and Virtual Reality systems. Many algorithms used in practice represent the scene through a Structure-from-Motion (SfM) point cloud and use 2D-3D matches between a q… ▽ More

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: Computer Vision and Pattern Recognition (CVPR) 2021

  25. arXiv:2012.01909  [pdf, other

    cs.CV

    Patch2Pix: Epipolar-Guided Pixel-Level Correspondences

    Authors: Qunjie Zhou, Torsten Sattler, Laura Leal-Taixe

    Abstract: The classical matching pipeline used for visual localization typically involves three steps: (i) local feature detection and description, (ii) feature matching, and (iii) outlier rejection. Recently emerged correspondence networks propose to perform those steps inside a single network but suffer from low matching resolution due to the memory bottleneck. In this work, we propose a new perspective t… ▽ More

    Submitted 26 March, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: CVPR2021 Camera Ready Version

  26. arXiv:2011.11946  [pdf, other

    cs.CV cs.LG

    Benchmarking Image Retrieval for Visual Localization

    Authors: Noé Pion, Martin Humenberger, Gabriela Csurka, Yohann Cabon, Torsten Sattler

    Abstract: Visual localization, i.e., camera pose estimation in a known scene, is a core component of technologies such as autonomous driving and augmented reality. State-of-the-art localization approaches often rely on image retrieval techniques for one of two tasks: (1) provide an approximate pose estimate or (2) determine which parts of the scene are potentially visible in a given query image. It is commo… ▽ More

    Submitted 1 December, 2020; v1 submitted 24 November, 2020; originally announced November 2020.

    Comments: International Conference on 3D Vision, 2020

  27. arXiv:2011.08790  [pdf, other

    cs.CV

    P1AC: Revisiting Absolute Pose From a Single Affine Correspondence

    Authors: Jonathan Ventura, Zuzana Kukelova, Torsten Sattler, Dániel Baráth

    Abstract: Affine correspondences have traditionally been used to improve feature matching over wide baselines. While recent work has successfully used affine correspondences to solve various relative camera pose estimation problems, less attention has been given to their use in absolute pose estimation. We introduce the first general solution to the problem of estimating the pose of a calibrated camera give… ▽ More

    Submitted 29 June, 2024; v1 submitted 17 November, 2020; originally announced November 2020.

    Comments: ICCV 2023 (with corrections in Eqs. 6 and 13 and Fig. 4)

  28. arXiv:2008.09497  [pdf, other

    cs.CV

    Single-Image Depth Prediction Makes Feature Matching Easier

    Authors: Carl Toft, Daniyar Turmukhambetov, Torsten Sattler, Fredrik Kahl, Gabriel Brostow

    Abstract: Good local features improve the robustness of many 3D re-localization and multi-view reconstruction pipelines. The problem is that viewing angle and distance severely impact the recognizability of a local feature. Attempts to improve appearance invariance by choosing better local feature points or by leveraging outside information, have come with pre-requisites that made some of them impractical.… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

    Comments: 14 pages, 7 figures, accepted for publication at the European conference on computer vision (ECCV) 2020

    ACM Class: I.4

  29. arXiv:2008.02004  [pdf, other

    cs.CV

    Beyond Controlled Environments: 3D Camera Re-Localization in Changing Indoor Scenes

    Authors: Johanna Wald, Torsten Sattler, Stuart Golodetz, Tommaso Cavallari, Federico Tombari

    Abstract: Long-term camera re-localization is an important task with numerous computer vision and robotics applications. Whilst various outdoor benchmarks exist that target lighting, weather and seasonal changes, far less attention has been paid to appearance changes that occur indoors. This has led to a mismatch between popular indoor benchmarks, which focus on static scenes, and indoor environments that a… ▽ More

    Submitted 5 August, 2020; originally announced August 2020.

    Comments: ECCV 2020, project website https://waldjohannau.github.io/RIO10

  30. Infrastructure-based Multi-Camera Calibration using Radial Projections

    Authors: Yukai Lin, Viktor Larsson, Marcel Geppert, Zuzana Kukelova, Marc Pollefeys, Torsten Sattler

    Abstract: Multi-camera systems are an important sensor platform for intelligent systems such as self-driving cars. Pattern-based calibration techniques can be used to calibrate the intrinsics of the cameras individually. However, extrinsic calibration of systems with little to no visual overlap between the cameras is a challenge. Given the camera intrinsics, infrastucture-based calibration techniques are ab… ▽ More

    Submitted 16 September, 2020; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: ECCV 2020

  31. arXiv:2007.10032  [pdf, other

    cs.CV

    Making Affine Correspondences Work in Camera Geometry Computation

    Authors: Daniel Barath, Michal Polic, Wolfgang Förstner, Torsten Sattler, Tomas Pajdla, Zuzana Kukelova

    Abstract: Local features e.g. SIFT and its affine and learned variants provide region-to-region rather than point-to-point correspondences. This has recently been exploited to create new minimal solvers for classical problems such as homography, essential and fundamental matrix estimation. The main advantage of such solvers is that their sample size is smaller, e.g., only two instead of four matches are req… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

  32. arXiv:2006.04250  [pdf, other

    cs.CV

    AdaLAM: Revisiting Handcrafted Outlier Detection

    Authors: Luca Cavalli, Viktor Larsson, Martin Ralf Oswald, Torsten Sattler, Marc Pollefeys

    Abstract: Local feature matching is a critical component of many computer vision pipelines, including among others Structure-from-Motion, SLAM, and Visual Localization. However, due to limitations in the descriptors, raw matches are often contaminated by a majority of outliers. As a result, outlier detection is a fundamental problem in computer vision, and a wide range of approaches have been proposed over… ▽ More

    Submitted 7 June, 2020; originally announced June 2020.

  33. Reference Pose Generation for Long-term Visual Localization via Learned Features and View Synthesis

    Authors: Zichao Zhang, Torsten Sattler, Davide Scaramuzza

    Abstract: Visual Localization is one of the key enabling technologies for autonomous driving and augmented reality. High quality datasets with accurate 6 Degree-of-Freedom (DoF) reference poses are the foundation for benchmarking and improving existing methods. Traditionally, reference poses have been obtained via Structure-from-Motion (SfM). However, SfM itself relies on local features which are prone to f… ▽ More

    Submitted 30 December, 2020; v1 submitted 11 May, 2020; originally announced May 2020.

    Comments: 25 pages, 16 figures. Int J Comput Vis (2020)

  34. Self-Supervised Linear Motion Deblurring

    Authors: Peidong Liu, Joel Janai, Marc Pollefeys, Torsten Sattler, Andreas Geiger

    Abstract: Motion blurry images challenge many computer vision algorithms, e.g, feature detection, motion estimation, or object recognition. Deep convolutional neural networks are state-of-the-art for image deblurring. However, obtaining training data with corresponding sharp and blurry image pairs can be difficult. In this paper, we present a differentiable reblur model for self-supervised motion deblurring… ▽ More

    Submitted 10 February, 2020; originally announced February 2020.

    Comments: Accepted by Robotics and Automation Letters (RA-L)

  35. arXiv:1912.02908  [pdf, other

    cs.CV

    Why Having 10,000 Parameters in Your Camera Model is Better Than Twelve

    Authors: Thomas Schöps, Viktor Larsson, Marc Pollefeys, Torsten Sattler

    Abstract: Camera calibration is an essential first step in setting up 3D Computer Vision systems. Commonly used parametric camera models are limited to a few degrees of freedom and thus often do not optimally fit to complex real lens distortion. In contrast, generic camera models allow for very accurate calibration due to their flexibility. Despite this, they have seen little use in practice. In this paper,… ▽ More

    Submitted 23 June, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: 15 pages, 12 figures, accepted to CVPR 2020 as an oral

  36. arXiv:1910.10518  [pdf

    cond-mat.mes-hall physics.optics quant-ph

    Generation of ultrashort (~10ps) spontaneous emission pulses by quantum dots in a switched optical microcavity

    Authors: Emanuel Peinke, Tobias Sattler, Guilherme Monteiro Torelly, Joël Bleuse, Julien Claudon, Willem L. Vos, Jean-Michel Gérard

    Abstract: We report on the generation of few-ps long spontaneous emission pulses by quantum dots (QDs) in a switched optical microcavity. We use a pulsed optical injection of free charge carriers to induce a large frequency shift of the fundamental mode of a GaAs/AlAs micropillar. We track in real time by time-resolved photoluminescence its fundamental mode during its relaxation, using the emission of the Q… ▽ More

    Submitted 23 October, 2019; originally announced October 2019.

    Comments: 11 pages, 8 figures; includes supplemental material

    Journal ref: Light Sci. Appl. 10, 215 (2021)

  37. arXiv:1908.06387  [pdf, other

    cs.CV

    Fine-Grained Segmentation Networks: Self-Supervised Segmentation for Improved Long-Term Visual Localization

    Authors: Måns Larsson, Erik Stenborg, Carl Toft, Lars Hammarstrand, Torsten Sattler, Fredrik Kahl

    Abstract: Long-term visual localization is the problem of estimating the camera pose of a given query image in a scene whose appearance changes over time. It is an important problem in practice, for example, encountered in autonomous driving. In order to gain robustness to such changes, long-term localization approaches often use segmantic segmentations as an invariant scene representation, as the semantic… ▽ More

    Submitted 18 August, 2019; originally announced August 2019.

    Comments: Accepted to ICCV 2019

    MSC Class: 68T45

  38. arXiv:1908.04598  [pdf, other

    cs.CV

    Is This The Right Place? Geometric-Semantic Pose Verification for Indoor Visual Localization

    Authors: Hajime Taira, Ignacio Rocco, Jiri Sedlar, Masatoshi Okutomi, Josef Sivic, Tomas Pajdla, Torsten Sattler, Akihiko Torii

    Abstract: Visual localization in large and complex indoor scenes, dominated by weakly textured rooms and repeating geometric patterns, is a challenging problem with high practical relevance for applications such as Augmented Reality and robotics. To handle the ambiguities arising in this scenario, a common strategy is, first, to generate multiple estimates for the camera pose from which a given query image… ▽ More

    Submitted 2 September, 2019; v1 submitted 13 August, 2019; originally announced August 2019.

  39. arXiv:1908.01293  [pdf, other

    cs.CV

    To Learn or Not to Learn: Visual Localization from Essential Matrices

    Authors: Qunjie Zhou, Torsten Sattler, Marc Pollefeys, Laura Leal-Taixe

    Abstract: Visual localization is the problem of estimating a camera within a scene and a key component in computer vision applications such as self-driving cars and Mixed Reality. State-of-the-art approaches for accurate visual localization use scene-specific representations, resulting in the overhead of constructing these models when applying the techniques to new scenes. Recently, deep learning-based appr… ▽ More

    Submitted 9 March, 2020; v1 submitted 4 August, 2019; originally announced August 2019.

    Comments: Accepted to ICRA 2020

  40. arXiv:1907.00338  [pdf, other

    cs.CV

    Large-scale, real-time visual-inertial localization revisited

    Authors: Simon Lynen, Bernhard Zeisl, Dror Aiger, Michael Bosse, Joel Hesch, Marc Pollefeys, Roland Siegwart, Torsten Sattler

    Abstract: The overarching goals in image-based localization are scale, robustness and speed. In recent years, approaches based on local features and sparse 3D point-cloud models have both dominated the benchmarks and seen successful realworld deployment. They enable applications ranging from robot navigation, autonomous driving, virtual and augmented reality to device geo-localization. Recently end-to-end l… ▽ More

    Submitted 30 June, 2019; originally announced July 2019.

  41. arXiv:1905.03561  [pdf, other

    cs.CV

    D2-Net: A Trainable CNN for Joint Detection and Description of Local Features

    Authors: Mihai Dusmanu, Ignacio Rocco, Tomas Pajdla, Marc Pollefeys, Josef Sivic, Akihiko Torii, Torsten Sattler

    Abstract: In this work we address the problem of finding reliable pixel-level correspondences under difficult imaging conditions. We propose an approach where a single convolutional neural network plays a dual role: It is simultaneously a dense feature descriptor and a feature detector. By postponing the detection to a later stage, the obtained keypoints are more stable than their traditional counterparts b… ▽ More

    Submitted 9 May, 2019; originally announced May 2019.

    Comments: Accepted at CVPR 2019

  42. arXiv:1903.07504  [pdf, other

    cs.CV

    Understanding the Limitations of CNN-based Absolute Camera Pose Regression

    Authors: Torsten Sattler, Qunjie Zhou, Marc Pollefeys, Laura Leal-Taixe

    Abstract: Visual localization is the task of accurate camera pose estimation in a known scene. It is a key problem in computer vision and robotics, with applications including self-driving cars, Structure-from-Motion, SLAM, and Mixed Reality. Traditionally, the localization problem has been tackled using 3D geometry. Recently, end-to-end approaches based on convolutional neural networks have become popular.… ▽ More

    Submitted 18 March, 2019; originally announced March 2019.

    Comments: Initial version of a paper accepted to CVPR 2019

  43. arXiv:1903.06916  [pdf, other

    cs.CV

    A Cross-Season Correspondence Dataset for Robust Semantic Segmentation

    Authors: Måns Larsson, Erik Stenborg, Lars Hammarstrand, Torsten Sattler, Mark Pollefeys, Fredrik Kahl

    Abstract: In this paper, we present a method to utilize 2D-2D point matches between images taken during different image conditions to train a convolutional neural network for semantic segmentation. Enforcing label consistency across the matches makes the final segmentation algorithm robust to seasonal changes. We describe how these 2D-2D matches can be generated with little human interaction by geometricall… ▽ More

    Submitted 16 August, 2019; v1 submitted 16 March, 2019; originally announced March 2019.

    Comments: In Proc. CVPR 2019

    MSC Class: 68T45

  44. arXiv:1903.01067  [pdf, other

    cs.CV cs.CG cs.RO

    Incremental Visual-Inertial 3D Mesh Generation with Structural Regularities

    Authors: Antoni Rosinol, Torsten Sattler, Marc Pollefeys, Luca Carlone

    Abstract: Visual-Inertial Odometry (VIO) algorithms typically rely on a point cloud representation of the scene that does not model the topology of the environment. A 3D mesh instead offers a richer, yet lightweight, model. Nevertheless, building a 3D mesh out of the sparse and noisy 3D landmarks triangulated by a VIO algorithm often results in a mesh that does not fit the real scene. In order to regularize… ▽ More

    Submitted 29 July, 2019; v1 submitted 3 March, 2019; originally announced March 2019.

    Comments: 7 pages, 5 figures, ICRA accepted

    Journal ref: IEEE Int. Conf. Robot. Autom. (ICRA), 2019

  45. arXiv:1901.03991  [pdf, other

    cs.CV

    RNN-based Generative Model for Fine-Grained Sketching

    Authors: Andrin Jenal, Nikolay Savinov, Torsten Sattler, Gaurav Chaurasia

    Abstract: Deep generative models have shown great promise when it comes to synthesising novel images. While they can generate images that look convincing on a higher-level, generating fine-grained details is still a challenge. In order to foster research on more powerful generative approaches, this paper proposes a novel task: generative modelling of 2D tree skeletons. Trees are an interesting shape class b… ▽ More

    Submitted 13 January, 2019; originally announced January 2019.

    Comments: Includes supplemental material. Link to datasets to be added shortly

  46. arXiv:1810.08393  [pdf, other

    cs.CV

    DGC-Net: Dense Geometric Correspondence Network

    Authors: Iaroslav Melekhov, Aleksei Tiulpin, Torsten Sattler, Marc Pollefeys, Esa Rahtu, Juho Kannala

    Abstract: This paper addresses the challenge of dense pixel correspondence estimation between two images. This problem is closely related to optical flow estimation task where ConvNets (CNNs) have recently achieved significant progress. While optical flow methods produce very accurate results for the small pixel translation and limited appearance variation scenarios, they hardly deal with the strong geometr… ▽ More

    Submitted 22 October, 2018; v1 submitted 19 October, 2018; originally announced October 2018.

    Comments: Supplementary material included; Affiliation section has been changed

  47. SurfelMeshing: Online Surfel-Based Mesh Reconstruction

    Authors: Thomas Schöps, Torsten Sattler, Marc Pollefeys

    Abstract: We address the problem of mesh reconstruction from live RGB-D video, assuming a calibrated camera and poses provided externally (e.g., by a SLAM system). In contrast to most existing approaches, we do not fuse depth measurements in a volume but in a dense surfel cloud. We asynchronously (re)triangulate the smoothed surfels to reconstruct a surface mesh. This novel approach enables to maintain a de… ▽ More

    Submitted 20 November, 2019; v1 submitted 1 October, 2018; originally announced October 2018.

    Comments: Version accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence

  48. arXiv:1809.09767  [pdf, other

    cs.CV

    Night-to-Day Image Translation for Retrieval-based Localization

    Authors: Asha Anoosheh, Torsten Sattler, Radu Timofte, Marc Pollefeys, Luc Van Gool

    Abstract: Visual localization is a key step in many robotics pipelines, allowing the robot to (approximately) determine its position and orientation in the world. An efficient and scalable approach to visual localization is to use image retrieval techniques. These approaches identify the image most similar to a query photo in a database of geo-tagged images and approximate the query's pose via the pose of t… ▽ More

    Submitted 4 March, 2019; v1 submitted 25 September, 2018; originally announced September 2018.

    Comments: Published in ICRA 2019

  49. arXiv:1809.06445  [pdf, other

    cs.RO

    Efficient 2D-3D Matching for Multi-Camera Visual Localization

    Authors: Marcel Geppert, Peidong Liu, Zhaopeng Cui, Marc Pollefeys, Torsten Sattler

    Abstract: Visual localization, i.e., determining the position and orientation of a vehicle with respect to a map, is a key problem in autonomous driving. We present a multicamera visual inertial localization algorithm for large scale environments. To efficiently and effectively match features against a pre-built global 3D map, we propose a prioritized feature matching scheme for multi-camera systems. In con… ▽ More

    Submitted 14 May, 2019; v1 submitted 17 September, 2018; originally announced September 2018.

    Comments: 7 pages, 5 figures

  50. arXiv:1809.06132  [pdf, other

    cs.RO

    Real-Time Dense Map** for Self-driving Vehicles using Fisheye Cameras

    Authors: Zhaopeng Cui, Lionel Heng, Ye Chuan Yeo, Andreas Geiger, Marc Pollefeys, Torsten Sattler

    Abstract: We present a real-time dense geometric map** algorithm for large-scale environments. Unlike existing methods which use pinhole cameras, our implementation is based on fisheye cameras which have larger field of view and benefit some other tasks including Visual-Inertial Odometry, localization and object detection around vehicles. Our algorithm runs on in-vehicle PCs at 15 Hz approximately, enabli… ▽ More

    Submitted 18 April, 2019; v1 submitted 17 September, 2018; originally announced September 2018.

    Comments: 7 pages, 10 figures