Search | arXiv e-print repository

arXiv:2007.08988 [pdf, other]

Online Invariance Selection for Local Feature Descriptors

Authors: Rémi Pautrat, Viktor Larsson, Martin R. Oswald, Marc Pollefeys

Abstract: To be invariant, or not to be invariant: that is the question formulated in this work about local descriptors. A limitation of current feature descriptors is the trade-off between generalization and discriminative power: more invariance means less informative descriptors. We propose to overcome this limitation with a disentanglement of invariance in local descriptors and with an online selection o… ▽ More To be invariant, or not to be invariant: that is the question formulated in this work about local descriptors. A limitation of current feature descriptors is the trade-off between generalization and discriminative power: more invariance means less informative descriptors. We propose to overcome this limitation with a disentanglement of invariance in local descriptors and with an online selection of the most appropriate invariance given the context. Our framework consists in a joint learning of multiple local descriptors with different levels of invariance and of meta descriptors encoding the regional variations of an image. The similarity of these meta descriptors across images is used to select the right invariance when matching the local descriptors. Our approach, named Local Invariance Selection at Runtime for Descriptors (LISRD), enables descriptors to adapt to adverse changes in images, while remaining discriminative when invariance is not required. We demonstrate that our method can boost the performance of current descriptors and outperforms state-of-the-art descriptors in several matching tasks, when evaluated on challenging datasets with day-night illumination as well as viewpoint changes. △ Less

Submitted 23 July, 2020; v1 submitted 17 July, 2020; originally announced July 2020.

Comments: 27 pages, Accepted at ECCV 2020 (Oral)

arXiv:2006.04250 [pdf, other]

AdaLAM: Revisiting Handcrafted Outlier Detection

Authors: Luca Cavalli, Viktor Larsson, Martin Ralf Oswald, Torsten Sattler, Marc Pollefeys

Abstract: Local feature matching is a critical component of many computer vision pipelines, including among others Structure-from-Motion, SLAM, and Visual Localization. However, due to limitations in the descriptors, raw matches are often contaminated by a majority of outliers. As a result, outlier detection is a fundamental problem in computer vision, and a wide range of approaches have been proposed over… ▽ More Local feature matching is a critical component of many computer vision pipelines, including among others Structure-from-Motion, SLAM, and Visual Localization. However, due to limitations in the descriptors, raw matches are often contaminated by a majority of outliers. As a result, outlier detection is a fundamental problem in computer vision, and a wide range of approaches have been proposed over the last decades. In this paper we revisit handcrafted approaches to outlier filtering. Based on best practices, we propose a hierarchical pipeline for effective outlier detection as well as integrate novel ideas which in sum lead to AdaLAM, an efficient and competitive approach to outlier rejection. AdaLAM is designed to effectively exploit modern parallel hardware, resulting in a very fast, yet very accurate, outlier filter. We validate AdaLAM on multiple large and diverse datasets, and we submit to the Image Matching Challenge (CVPR2020), obtaining competitive results with simple baseline descriptors. We show that AdaLAM is more than competitive to current state of the art, both in terms of efficiency and effectiveness. △ Less

Submitted 7 June, 2020; originally announced June 2020.

arXiv:2003.05266 [pdf, other]

doi 10.1109/IROS45743.2020.9341702

Accurate Map** and Planning for Autonomous Racing

Authors: Leiv Andresen, Adrian Brandemuehl, Alex Hönger, Benson Kuan, Niclas Vödisch, Hermann Blum, Victor Reijgwart, Lukas Bernreiter, Lukas Schaupp, Jen Jen Chung, Mathias Bürki, Martin R. Oswald, Roland Siegwart, Abel Gawel

Abstract: This paper presents the perception, map**, and planning pipeline implemented on an autonomous race car. It was developed by the 2019 AMZ driverless team for the Formula Student Germany (FSG) 2019 driverless competition, where it won 1st place overall. The presented solution combines early fusion of camera and LiDAR data, a layered map** approach, and a planning approach that uses Bayesian filt… ▽ More This paper presents the perception, map**, and planning pipeline implemented on an autonomous race car. It was developed by the 2019 AMZ driverless team for the Formula Student Germany (FSG) 2019 driverless competition, where it won 1st place overall. The presented solution combines early fusion of camera and LiDAR data, a layered map** approach, and a planning approach that uses Bayesian filtering to achieve high-speed driving on unknown race tracks while creating accurate maps. We benchmark the method against our team's previous solution, which won FSG 2018, and show improved accuracy when driving at the same speeds. Furthermore, the new pipeline makes it possible to reliably raise the maximum driving speed in unknown environments from 3~m/s to 12~m/s while still map** with an acceptable RMSE of 0.29~m. △ Less

Submitted 17 September, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

Journal ref: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2020, pp. 4743-4749

arXiv:2001.04775 [pdf, other]

Learned Multi-View Texture Super-Resolution

Authors: Audrey Richard, Ian Cherabier, Martin R. Oswald, Vagia Tsiminaki, Marc Pollefeys, Konrad Schindler

Abstract: We present a super-resolution method capable of creating a high-resolution texture map for a virtual 3D object from a set of lower-resolution images of that object. Our architecture unifies the concepts of (i) multi-view super-resolution based on the redundancy of overlap** views and (ii) single-view super-resolution based on a learned prior of high-resolution (HR) image structure. The principle… ▽ More We present a super-resolution method capable of creating a high-resolution texture map for a virtual 3D object from a set of lower-resolution images of that object. Our architecture unifies the concepts of (i) multi-view super-resolution based on the redundancy of overlap** views and (ii) single-view super-resolution based on a learned prior of high-resolution (HR) image structure. The principle of multi-view super-resolution is to invert the image formation process and recover the latent HR texture from multiple lower-resolution projections. We map that inverse problem into a block of suitably designed neural network layers, and combine it with a standard encoder-decoder network for learned single-image super-resolution. Wiring the image formation model into the network avoids having to learn perspective map** from textures to images, and elegantly handles a varying number of input views. Experiments demonstrate that the combination of multi-view observations and learned prior yields improved texture maps. △ Less

Submitted 14 January, 2020; originally announced January 2020.

Comments: 11 pages, 5 figures, 2019 International Conference on 3D Vision (3DV)

arXiv:2001.04388 [pdf, other]

RoutedFusion: Learning Real-time Depth Map Fusion

Authors: Silvan Weder, Johannes L. Schönberger, Marc Pollefeys, Martin R. Oswald

Abstract: The efficient fusion of depth maps is a key part of most state-of-the-art 3D reconstruction methods. Besides requiring high accuracy, these depth fusion methods need to be scalable and real-time capable. To this end, we present a novel real-time capable machine learning-based method for depth map fusion. Similar to the seminal depth map fusion approach by Curless and Levoy, we only update a local… ▽ More The efficient fusion of depth maps is a key part of most state-of-the-art 3D reconstruction methods. Besides requiring high accuracy, these depth fusion methods need to be scalable and real-time capable. To this end, we present a novel real-time capable machine learning-based method for depth map fusion. Similar to the seminal depth map fusion approach by Curless and Levoy, we only update a local group of voxels to ensure real-time capability. Instead of a simple linear fusion of depth information, we propose a neural network that predicts non-linear updates to better account for typical fusion errors. Our network is composed of a 2D depth routing network and a 3D depth fusion network which efficiently handle sensor-specific noise and outliers. This is especially useful for surface edges and thin objects for which the original approach suffers from thickening artifacts. Our method outperforms the traditional fusion approach and related learned approaches on both synthetic and real data. We demonstrate the performance of our method in reconstructing fine geometric details from noise and outlier contaminated data on various scenes. △ Less

Submitted 3 April, 2020; v1 submitted 13 January, 2020; originally announced January 2020.

Comments: 11 pages, 8 figures, accepted to CVPR 2020

arXiv:1909.00703 [pdf, other]

doi 10.1109/ICCVW.2019.00264

Learned Semantic Multi-Sensor Depth Map Fusion

Authors: Denys Rozumnyi, Ian Cherabier, Marc Pollefeys, Martin R. Oswald

Abstract: Volumetric depth map fusion based on truncated signed distance functions has become a standard method and is used in many 3D reconstruction pipelines. In this paper, we are generalizing this classic method in multiple ways: 1) Semantics: Semantic information enriches the scene representation and is incorporated into the fusion process. 2) Multi-Sensor: Depth information can originate from differen… ▽ More Volumetric depth map fusion based on truncated signed distance functions has become a standard method and is used in many 3D reconstruction pipelines. In this paper, we are generalizing this classic method in multiple ways: 1) Semantics: Semantic information enriches the scene representation and is incorporated into the fusion process. 2) Multi-Sensor: Depth information can originate from different sensors or algorithms with very different noise and outlier statistics which are considered during data fusion. 3) Scene denoising and completion: Sensors can fail to recover depth for certain materials and light conditions, or data is missing due to occlusions. Our method denoises the geometry, closes holes and computes a watertight surface for every semantic class. 4) Learning: We propose a neural network reconstruction method that unifies all these properties within a single powerful framework. Our method learns sensor or algorithm properties jointly with semantic depth fusion and scene completion and can also be used as an expert system, e.g. to unify the strengths of various photometric stereo algorithms. Our approach is the first to unify all these properties. Experimental evaluations on both synthetic and real data sets demonstrate clear improvements. △ Less

Submitted 2 September, 2019; originally announced September 2019.

Comments: 11 pages, 7 figures, 2 tables, accepted for the 2nd Workshop on 3D Reconstruction in the Wild (3DRW2019) in conjunction with ICCV2019

Journal ref: 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)

arXiv:1906.08650 [pdf, other]

3D Instance Segmentation via Multi-Task Metric Learning

Authors: Jean Lahoud, Bernard Ghanem, Marc Pollefeys, Martin R. Oswald

Abstract: We propose a novel method for instance label segmentation of dense 3D voxel grids. We target volumetric scene representations, which have been acquired with depth sensors or multi-view stereo methods and which have been processed with semantic 3D reconstruction or scene completion methods. The main task is to learn shape information about individual object instances in order to accurately separate… ▽ More We propose a novel method for instance label segmentation of dense 3D voxel grids. We target volumetric scene representations, which have been acquired with depth sensors or multi-view stereo methods and which have been processed with semantic 3D reconstruction or scene completion methods. The main task is to learn shape information about individual object instances in order to accurately separate them, including connected and incompletely scanned objects. We solve the 3D instance-labeling problem with a multi-task learning strategy. The first goal is to learn an abstract feature embedding, which groups voxels with the same instance label close to each other while separating clusters with different instance labels from each other. The second goal is to learn instance information by densely estimating directional information of the instance's center of mass for each voxel. This is particularly useful to find instance boundaries in the clustering post-processing step, as well as, for scoring the segmentation quality for the first goal. Both synthetic and real-world experiments demonstrate the viability and merits of our approach. In fact, it achieves state-of-the-art performance on the ScanNet 3D instance segmentation benchmark. △ Less

Submitted 31 October, 2019; v1 submitted 20 June, 2019; originally announced June 2019.

arXiv:1706.08336 [pdf, other]

Semantically Informed Multiview Surface Refinement

Authors: Maros Blaha, Mathias Rothermel, Martin R. Oswald, Torsten Sattler, Audrey Richard, Jan D. Wegner, Marc Pollefeys, Konrad Schindler

Abstract: We present a method to jointly refine the geometry and semantic segmentation of 3D surface meshes. Our method alternates between updating the shape and the semantic labels. In the geometry refinement step, the mesh is deformed with variational energy minimization, such that it simultaneously maximizes photo-consistency and the compatibility of the semantic segmentations across a set of calibrated… ▽ More We present a method to jointly refine the geometry and semantic segmentation of 3D surface meshes. Our method alternates between updating the shape and the semantic labels. In the geometry refinement step, the mesh is deformed with variational energy minimization, such that it simultaneously maximizes photo-consistency and the compatibility of the semantic segmentations across a set of calibrated images. Label-specific shape priors account for interactions between the geometry and the semantic labels in 3D. In the semantic segmentation step, the labels on the mesh are updated with MRF inference, such that they are compatible with the semantic segmentations in the input images. Also, this step includes prior assumptions about the surface shape of different semantic classes. The priors induce a tight coupling, where semantic information influences the shape update and vice versa. Specifically, we introduce priors that favor (i) adaptive smoothing, depending on the class label; (ii) straightness of class boundaries; and (iii) semantic labels that are consistent with the surface orientation. The novel mesh-based reconstruction is evaluated in a series of experiments with real and synthetic data. We compare both to state-of-the-art, voxel-based semantic 3D reconstruction, and to purely geometric mesh refinement, and demonstrate that the proposed scheme yields improved 3D geometry as well as an improved semantic segmentation. △ Less

Submitted 26 June, 2017; originally announced June 2017.

arXiv:1605.04146 [pdf, ps, other]

Geometrie der Zahlen. Ein Überblick

Authors: Nicola M. R. Oswald

Abstract: This article provides a historical overview of Geometry of Numbers. 1. Figures, 2. The circuit problem and its relatives, 3. Minkowski lattice point set, 4. The young Hermann Minkowski, 5. The geometry of numbers develops, 6. Minkowski geometry in the context of time, 7. Successive minima, 8. Minkowski space-time, 9. Voronoi and Blichfeldt, 10. The schools in Manchester and Vienna, 11. packing… ▽ More This article provides a historical overview of Geometry of Numbers. 1. Figures, 2. The circuit problem and its relatives, 3. Minkowski lattice point set, 4. The young Hermann Minkowski, 5. The geometry of numbers develops, 6. Minkowski geometry in the context of time, 7. Successive minima, 8. Minkowski space-time, 9. Voronoi and Blichfeldt, 10. The schools in Manchester and Vienna, 11. packing problems △ Less

Submitted 13 May, 2016; originally announced May 2016.

Comments: in German

MSC Class: 01A55; 01A60; 11H

arXiv:1506.00856 [pdf, ps, other]

Aspects of Zeta-Function Theory in the Mathematical Works of Adolf Hurwitz

Authors: Nicola M. R. Oswald, Jörn Steuding

Abstract: Adolf Hurwitz is rather famous for his celebrated contributions to Riemann surfaces, modular forms, diophantine equations and approximation as well as to certain aspects of algebra. His early work on an important generalization of Dirichlet's $L$-series, nowadays called Hurwitz zeta-function, is the only published work settled in the very active field of research around the Riemann zeta-function a… ▽ More Adolf Hurwitz is rather famous for his celebrated contributions to Riemann surfaces, modular forms, diophantine equations and approximation as well as to certain aspects of algebra. His early work on an important generalization of Dirichlet's $L$-series, nowadays called Hurwitz zeta-function, is the only published work settled in the very active field of research around the Riemann zeta-function and its relatives. His mathematical diaries, however, provide another picture, namely a lifelong interest in the development of zeta-function theory. In this note we shall investigate his early work, its origin and its reception, as well as Hurwitz's further studies of the Riemann zeta-function and allied Dirichlet series from his diaries. It turns out that Hurwitz already in 1889 knew about the essential analytic properties of the Epstein zeta-function (including its functional equation) 13 years before Paul Epstein. △ Less

Submitted 2 June, 2015; originally announced June 2015.

Comments: 32 pages, 2 figures

MSC Class: 01A55; 01A70; 11M06; 11M35

Showing 51–60 of 60 results for author: Oswald, M R