Skip to main content

Showing 151–180 of 180 results for author: Pollefeys, M

.
  1. arXiv:1812.00488  [pdf, other

    cs.CV

    DeepLiDAR: Deep Surface Normal Guided Depth Prediction for Outdoor Scene from Sparse LiDAR Data and Single Color Image

    Authors: Jiaxiong Qiu, Zhaopeng Cui, Yinda Zhang, Xingdi Zhang, Shuaicheng Liu, Bing Zeng, Marc Pollefeys

    Abstract: In this paper, we propose a deep learning architecture that produces accurate dense depth for the outdoor scene from a single color image and a sparse depth. Inspired by the indoor depth completion, our network estimates surface normals as the intermediate representation to produce dense depth, and can be trained end-to-end. With a modified encoder-decoder structure, our network effectively fuses… ▽ More

    Submitted 9 April, 2019; v1 submitted 2 December, 2018; originally announced December 2018.

    Comments: 10 pages, 8 figures

  2. arXiv:1810.08393  [pdf, other

    cs.CV

    DGC-Net: Dense Geometric Correspondence Network

    Authors: Iaroslav Melekhov, Aleksei Tiulpin, Torsten Sattler, Marc Pollefeys, Esa Rahtu, Juho Kannala

    Abstract: This paper addresses the challenge of dense pixel correspondence estimation between two images. This problem is closely related to optical flow estimation task where ConvNets (CNNs) have recently achieved significant progress. While optical flow methods produce very accurate results for the small pixel translation and limited appearance variation scenarios, they hardly deal with the strong geometr… ▽ More

    Submitted 22 October, 2018; v1 submitted 19 October, 2018; originally announced October 2018.

    Comments: Supplementary material included; Affiliation section has been changed

  3. arXiv:1810.02274  [pdf, other

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Episodic Curiosity through Reachability

    Authors: Nikolay Savinov, Anton Raichuk, Raphaël Marinier, Damien Vincent, Marc Pollefeys, Timothy Lillicrap, Sylvain Gelly

    Abstract: Rewards are sparse in the real world and most of today's reinforcement learning algorithms struggle with such sparsity. One solution to this problem is to allow the agent to create rewards for itself - thus making rewards dense and more suitable for learning. In particular, inspired by curious behaviour in animals, observing something novel could be rewarded with a bonus. Such bonus is summed up w… ▽ More

    Submitted 6 August, 2019; v1 submitted 4 October, 2018; originally announced October 2018.

    Comments: Accepted to ICLR 2019. Code at https://github.com/google-research/episodic-curiosity/. Videos at https://sites.google.com/view/episodic-curiosity/

  4. SurfelMeshing: Online Surfel-Based Mesh Reconstruction

    Authors: Thomas Schöps, Torsten Sattler, Marc Pollefeys

    Abstract: We address the problem of mesh reconstruction from live RGB-D video, assuming a calibrated camera and poses provided externally (e.g., by a SLAM system). In contrast to most existing approaches, we do not fuse depth measurements in a volume but in a dense surfel cloud. We asynchronously (re)triangulate the smoothed surfels to reconstruct a surface mesh. This novel approach enables to maintain a de… ▽ More

    Submitted 20 November, 2019; v1 submitted 1 October, 2018; originally announced October 2018.

    Comments: Version accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence

  5. arXiv:1809.09767  [pdf, other

    cs.CV

    Night-to-Day Image Translation for Retrieval-based Localization

    Authors: Asha Anoosheh, Torsten Sattler, Radu Timofte, Marc Pollefeys, Luc Van Gool

    Abstract: Visual localization is a key step in many robotics pipelines, allowing the robot to (approximately) determine its position and orientation in the world. An efficient and scalable approach to visual localization is to use image retrieval techniques. These approaches identify the image most similar to a query photo in a database of geo-tagged images and approximate the query's pose via the pose of t… ▽ More

    Submitted 4 March, 2019; v1 submitted 25 September, 2018; originally announced September 2018.

    Comments: Published in ICRA 2019

  6. arXiv:1809.06445  [pdf, other

    cs.RO

    Efficient 2D-3D Matching for Multi-Camera Visual Localization

    Authors: Marcel Geppert, Peidong Liu, Zhaopeng Cui, Marc Pollefeys, Torsten Sattler

    Abstract: Visual localization, i.e., determining the position and orientation of a vehicle with respect to a map, is a key problem in autonomous driving. We present a multicamera visual inertial localization algorithm for large scale environments. To efficiently and effectively match features against a pre-built global 3D map, we propose a prioritized feature matching scheme for multi-camera systems. In con… ▽ More

    Submitted 14 May, 2019; v1 submitted 17 September, 2018; originally announced September 2018.

    Comments: 7 pages, 5 figures

  7. arXiv:1809.06132  [pdf, other

    cs.RO

    Real-Time Dense Map** for Self-driving Vehicles using Fisheye Cameras

    Authors: Zhaopeng Cui, Lionel Heng, Ye Chuan Yeo, Andreas Geiger, Marc Pollefeys, Torsten Sattler

    Abstract: We present a real-time dense geometric map** algorithm for large-scale environments. Unlike existing methods which use pinhole cameras, our implementation is based on fisheye cameras which have larger field of view and benefit some other tasks including Visual-Inertial Odometry, localization and object detection around vehicles. Our algorithm runs on in-vehicle PCs at 15 Hz approximately, enabli… ▽ More

    Submitted 18 April, 2019; v1 submitted 17 September, 2018; originally announced September 2018.

    Comments: 7 pages, 10 figures

  8. arXiv:1809.05477  [pdf, other

    cs.RO

    Project AutoVision: Localization and 3D Scene Perception for an Autonomous Vehicle with a Multi-Camera System

    Authors: Lionel Heng, Benjamin Choi, Zhaopeng Cui, Marcel Geppert, Sixing Hu, Benson Kuan, Peidong Liu, Rang Nguyen, Ye Chuan Yeo, Andreas Geiger, Gim Hee Lee, Marc Pollefeys, Torsten Sattler

    Abstract: Project AutoVision aims to develop localization and 3D scene perception capabilities for a self-driving vehicle. Such capabilities will enable autonomous navigation in urban and rural environments, in day and night, and with cameras as the only exteroceptive sensors. The sensor suite employs many cameras for both 360-degree coverage and accurate multi-view stereo; the use of low-cost cameras keeps… ▽ More

    Submitted 4 March, 2019; v1 submitted 14 September, 2018; originally announced September 2018.

    Journal ref: 2019 IEEE International Conference on Robotics and Automation (ICRA)

  9. arXiv:1807.11272  [pdf, other

    cs.CV cs.AI cs.LG

    Uncertainty Quantification in CNN-Based Surface Prediction Using Shape Priors

    Authors: Katarína Tóthová, Sarah Parisot, Matthew C. H. Lee, Esther Puyol-Antón, Lisa M. Koch, Andrew P. King, Ender Konukoglu, Marc Pollefeys

    Abstract: Surface reconstruction is a vital tool in a wide range of areas of medical image analysis and clinical research. Despite the fact that many methods have proposed solutions to the reconstruction problem, most, due to their deterministic nature, do not directly address the issue of quantifying uncertainty associated with their predictions. We remedy this by proposing a novel probabilistic deep learn… ▽ More

    Submitted 30 July, 2018; originally announced July 2018.

    Comments: Accepted to ShapeMI MICCAI 2018: Workshop on Shape in Medical Imaging

  10. arXiv:1807.07512  [pdf, other

    cs.CV

    Hybrid Scene Compression for Visual Localization

    Authors: Federico Camposeco, Andrea Cohen, Marc Pollefeys, Torsten Sattler

    Abstract: Localizing an image wrt. a 3D scene model represents a core task for many computer vision applications. An increasing number of real-world applications of visual localization on mobile devices, e.g., Augmented Reality or autonomous robots such as drones or self-driving cars, demand localization approaches to minimize storage and bandwidth requirements. Compressing the 3D models used for localizati… ▽ More

    Submitted 22 April, 2019; v1 submitted 19 July, 2018; originally announced July 2018.

    Comments: Published at CVPR 2019

  11. arXiv:1804.01792  [pdf, other

    cs.RO

    TrimBot2020: an outdoor robot for automatic gardening

    Authors: Nicola Strisciuglio, Radim Tylecek, Michael Blaich, Nicolai Petkov, Peter Bieber, Jochen Hemming, Eldert van Henten, Torsten Sattler, Marc Pollefeys, Theo Gevers, Thomas Brox, Robert B. Fisher

    Abstract: Robots are increasingly present in modern industry and also in everyday life. Their applications range from health-related situations, for assistance to elderly people or in surgical operations, to automatic and driver-less vehicles (on wheels or flying) or for driving assistance. Recently, an interest towards robotics applied in agriculture and gardening has arisen, with applications to automatic… ▽ More

    Submitted 15 May, 2018; v1 submitted 5 April, 2018; originally announced April 2018.

    Comments: Accepted for publication at International Sympsium on Robotics 2018

  12. arXiv:1803.10368  [pdf, other

    cs.CV

    InLoc: Indoor Visual Localization with Dense Matching and View Synthesis

    Authors: Hajime Taira, Masatoshi Okutomi, Torsten Sattler, Mircea Cimpoi, Marc Pollefeys, Josef Sivic, Tomas Pajdla, Akihiko Torii

    Abstract: We seek to predict the 6 degree-of-freedom (6DoF) pose of a query photograph with respect to a large indoor 3D map. The contributions of this work are three-fold. First, we develop a new large-scale visual localization method targeted for indoor environments. The method proceeds along three steps: (i) efficient retrieval of candidate poses that ensures scalability to large-scale environments, (ii)… ▽ More

    Submitted 8 April, 2018; v1 submitted 27 March, 2018; originally announced March 2018.

  13. arXiv:1712.05773  [pdf, other

    cs.CV

    Semantic Visual Localization

    Authors: Johannes L. Schönberger, Marc Pollefeys, Andreas Geiger, Torsten Sattler

    Abstract: Robust visual localization under a wide range of viewing conditions is a fundamental problem in computer vision. Handling the difficult cases of this problem is not only very challenging but also of high practical relevance, e.g., in the context of life-long localization for augmented reality or autonomous robots. In this paper, we propose a novel approach based on a joint 3D geometric and semanti… ▽ More

    Submitted 16 April, 2018; v1 submitted 15 December, 2017; originally announced December 2017.

  14. arXiv:1709.04496  [pdf, other

    cs.CV

    An Exploration of 2D and 3D Deep Learning Techniques for Cardiac MR Image Segmentation

    Authors: Christian F. Baumgartner, Lisa M. Koch, Marc Pollefeys, Ender Konukoglu

    Abstract: Accurate segmentation of the heart is an important step towards evaluating cardiac function. In this paper, we present a fully automated framework for segmentation of the left (LV) and right (RV) ventricular cavities and the myocardium (Myo) on short-axis cardiac MR images. We investigate various 2D and 3D convolutional neural network architectures for this task. We investigate the suitability of… ▽ More

    Submitted 10 October, 2017; v1 submitted 13 September, 2017; originally announced September 2017.

    Comments: to appear in STACOM 2017 proceedings

  15. arXiv:1708.09839  [pdf, other

    cs.CV

    3D Visual Perception for Self-Driving Cars using a Multi-Camera System: Calibration, Map**, Localization, and Obstacle Detection

    Authors: Christian Häne, Lionel Heng, Gim Hee Lee, Friedrich Fraundorfer, Paul Furgale, Torsten Sattler, Marc Pollefeys

    Abstract: Cameras are a crucial exteroceptive sensor for self-driving cars as they are low-cost and small, provide appearance information about the environment, and work in various weather conditions. They can be used for multiple purposes such as visual navigation and obstacle detection. We can use a surround multi-camera system to cover the full 360-degree field-of-view around the car. In this way, we avo… ▽ More

    Submitted 31 August, 2017; originally announced August 2017.

  16. arXiv:1707.09092  [pdf, ps, other

    cs.CV

    Benchmarking 6DOF Outdoor Visual Localization in Changing Conditions

    Authors: Torsten Sattler, Will Maddern, Carl Toft, Akihiko Torii, Lars Hammarstrand, Erik Stenborg, Daniel Safari, Masatoshi Okutomi, Marc Pollefeys, Josef Sivic, Fredrik Kahl, Tomas Pajdla

    Abstract: Visual localization enables autonomous vehicles to navigate in their surroundings and augmented reality applications to link virtual to real worlds. Practical visual localization approaches need to be robust to a wide variety of viewing condition, including day-night changes, as well as weather and seasonal variations, while providing highly accurate 6 degree-of-freedom (6DOF) camera pose estimate… ▽ More

    Submitted 4 April, 2018; v1 submitted 27 July, 2017; originally announced July 2017.

    Comments: Accepted to CVPR 2018 as a spotlight

  17. arXiv:1707.05397  [pdf, other

    cs.CV

    Slanted Stixels: Representing San Francisco's Steepest Streets

    Authors: Daniel Hernandez-Juarez, Lukas Schneider, Antonio Espinosa, David Vázquez, Antonio M. López, Uwe Franke, Marc Pollefeys, Juan C. Moure

    Abstract: In this work we present a novel compact scene representation based on Stixels that infers geometric and semantic information. Our approach overcomes the previous rather restrictive geometric assumptions for Stixels by introducing a novel depth model to account for non-flat roads and slanted objects. Both semantic and depth cues are used jointly to infer the scene representation in a sound global e… ▽ More

    Submitted 17 July, 2017; originally announced July 2017.

    Comments: Accepted to BMVC 2017 as oral presentation

  18. arXiv:1707.05055  [pdf, other

    cs.CV

    Information-Flow Matting

    Authors: Yağız Aksoy, Tunç Ozan Aydın, Marc Pollefeys

    Abstract: We present a novel, purely affinity-based natural image matting algorithm. Our method relies on carefully defined pixel-to-pixel connections that enable effective use of information available in the image. We control the information flow from the known-opacity regions into the unknown region, as well as within the unknown region itself, by utilizing multiple definitions of pixel affinities. Among… ▽ More

    Submitted 12 April, 2019; v1 submitted 17 July, 2017; originally announced July 2017.

    Comments: 16 pages, 13 figures, extended version of CVPR 2017 publication titled "Designing Effective Inter-pixel Information Flow for Natural Image Matting"

    MSC Class: 68T45 ACM Class: I.4.6

  19. arXiv:1706.08336  [pdf, other

    cs.CV

    Semantically Informed Multiview Surface Refinement

    Authors: Maros Blaha, Mathias Rothermel, Martin R. Oswald, Torsten Sattler, Audrey Richard, Jan D. Wegner, Marc Pollefeys, Konrad Schindler

    Abstract: We present a method to jointly refine the geometry and semantic segmentation of 3D surface meshes. Our method alternates between updating the shape and the semantic labels. In the geometry refinement step, the mesh is deformed with variational energy minimization, such that it simultaneously maximizes photo-consistency and the compatibility of the semantic segmentations across a set of calibrated… ▽ More

    Submitted 26 June, 2017; originally announced June 2017.

  20. arXiv:1705.08272  [pdf, other

    cs.CV cs.LG cs.NE

    Matching neural paths: transfer from recognition to correspondence search

    Authors: Nikolay Savinov, Lubor Ladicky, Marc Pollefeys

    Abstract: Many machine learning tasks require finding per-part correspondences between objects. In this work we focus on low-level correspondences - a highly ambiguous matching problem. We propose to use a hierarchical semantic representation of the objects, coming from a convolutional neural network, to solve this ambiguity. Training it for low-level correspondence prediction directly might not be an optio… ▽ More

    Submitted 5 November, 2017; v1 submitted 19 May, 2017; originally announced May 2017.

    Comments: Accepted at NIPS 2017

  21. arXiv:1704.03847  [pdf, other

    cs.CV cs.LG cs.NE cs.RO

    Semantic3D.net: A new Large-scale Point Cloud Classification Benchmark

    Authors: Timo Hackel, Nikolay Savinov, Lubor Ladicky, Jan D. Wegner, Konrad Schindler, Marc Pollefeys

    Abstract: This paper presents a new 3D point cloud classification benchmark data set with over four billion manually labelled points, meant as input for data-hungry (deep) learning methods. We also discuss first submissions to the benchmark that use deep convolutional neural networks (CNNs) as a work horse, which already show remarkable performance improvements over state-of-the-art. CNNs have become the de… ▽ More

    Submitted 12 April, 2017; originally announced April 2017.

    Comments: Accepted to ISPRS Annals. The benchmark website is available at http://www.semantic3d.net/ . The baseline code is available at https://github.com/nsavinov/semantic3dnet

  22. The Stixel world: A medium-level representation of traffic scenes

    Authors: Marius Cordts, Timo Rehfeld, Lukas Schneider, David Pfeiffer, Markus Enzweiler, Stefan Roth, Marc Pollefeys, Uwe Franke

    Abstract: Recent progress in advanced driver assistance systems and the race towards autonomous vehicles is mainly driven by two factors: (1) increasingly sophisticated algorithms that interpret the environment around the vehicle and react accordingly, and (2) the continuous improvements of sensor technology itself. In terms of cameras, these improvements typically include higher spatial resolution, which a… ▽ More

    Submitted 2 April, 2017; originally announced April 2017.

    Comments: Accepted for publication in Image and Vision Computing

  23. arXiv:1611.07571  [pdf, other

    cs.CV cs.LG cs.NE

    Quad-networks: unsupervised learning to rank for interest point detection

    Authors: Nikolay Savinov, Akihito Seki, Lubor Ladicky, Torsten Sattler, Marc Pollefeys

    Abstract: Several machine learning tasks require to represent the data using only a sparse set of interest points. An ideal detector is able to find the corresponding interest points even if the data undergo a transformation typical for a given domain. Since the task is of high practical interest in computer vision, many hand-crafted solutions were proposed. In this paper, we ask a fundamental question: can… ▽ More

    Submitted 10 April, 2017; v1 submitted 22 November, 2016; originally announced November 2016.

    Comments: Accepted at CVPR 2017

  24. arXiv:1608.00753  [pdf, other

    cs.CV

    Semantically Guided Depth Upsampling

    Authors: Nick Schneider, Lukas Schneider, Peter **gera, Uwe Franke, Marc Pollefeys, Christoph Stiller

    Abstract: We present a novel method for accurate and efficient up- sampling of sparse depth data, guided by high-resolution imagery. Our approach goes beyond the use of intensity cues only and additionally exploits object boundary cues through structured edge detection and semantic scene labeling for guidance. Both cues are combined within a geodesic distance measure that allows for boundary-preserving dept… ▽ More

    Submitted 2 August, 2016; originally announced August 2016.

    Comments: German Conference on Pattern Recognition 2016 (Oral)

  25. arXiv:1604.06318  [pdf, other

    cs.CV

    TI-POOLING: transformation-invariant pooling for feature learning in Convolutional Neural Networks

    Authors: Dmitry Laptev, Nikolay Savinov, Joachim M. Buhmann, Marc Pollefeys

    Abstract: In this paper we present a deep neural network topology that incorporates a simple to implement transformation invariant pooling operator (TI-POOLING). This operator is able to efficiently handle prior knowledge on nuisance variations in the data, such as rotation or scale changes. Most current methods usually make use of dataset augmentation to address this issue, but this requires larger number… ▽ More

    Submitted 22 September, 2016; v1 submitted 21 April, 2016; originally announced April 2016.

    Comments: Accepted at CVPR 2016. The first two authors assert equal contribution and joint first authorship

  26. arXiv:1604.06258  [pdf, other

    cs.CV

    Automatic 3D Reconstruction of Manifold Meshes via Delaunay Triangulation and Mesh Swee**

    Authors: Andrea Romanoni, Amaël Delaunoy, Marc Pollefeys, Matteo Matteucci

    Abstract: In this paper we propose a new approach to incrementally initialize a manifold surface for automatic 3D reconstruction from images. More precisely we focus on the automatic initialization of a 3D mesh as close as possible to the final solution; indeed many approaches require a good initial solution for further refinement via multi-view stereo techniques. Our novel algorithm automatically estimates… ▽ More

    Submitted 21 April, 2016; originally announced April 2016.

    Comments: in IEEE Winter Conference on Applications of Computer Vision (WACV) 2016

  27. arXiv:1604.02885  [pdf, other

    cs.CV

    Semantic 3D Reconstruction with Continuous Regularization and Ray Potentials Using a Visibility Consistency Constraint

    Authors: Nikolay Savinov, Christian Haene, Lubor Ladicky, Marc Pollefeys

    Abstract: We propose an approach for dense semantic 3D reconstruction which uses a data term that is defined as potentials over viewing rays, combined with continuous surface area penalization. Our formulation is a convex relaxation which we augment with a crucial non-convex constraint that ensures exact handling of visibility. To tackle the non-convex minimization problem, we propose a majorize-minimize ty… ▽ More

    Submitted 26 August, 2019; v1 submitted 11 April, 2016; originally announced April 2016.

    Comments: Accepted as a spotlight oral paper by CVPR 2016. Code at https://github.com/nsavinov/ray_potentials/

  28. Capturing Hands in Action using Discriminative Salient Points and Physics Simulation

    Authors: Dimitrios Tzionas, Luca Ballan, Abhilash Srikantha, Pablo Aponte, Marc Pollefeys, Juergen Gall

    Abstract: Hand motion capture is a popular research field, recently gaining more attention due to the ubiquity of RGB-D sensors. However, even most recent approaches focus on the case of a single isolated hand. In this work, we focus on hands that interact with other hands or objects and present a framework that successfully captures motion in such interaction scenarios for both rigid and articulated object… ▽ More

    Submitted 7 March, 2016; v1 submitted 6 June, 2015; originally announced June 2015.

    Comments: Accepted for publication by the International Journal of Computer Vision (IJCV) on 16.02.2016 (submitted on 17.10.14). A combination into a single framework of an ECCV'12 multicamera-RGB and a monocular-RGBD GCPR'14 hand tracking paper with several extensions, additional experiments and details

  29. arXiv:1502.00652  [pdf, other

    cs.CV

    Learning the Matching Function

    Authors: Ľubor Ladický, Christian Häne, Marc Pollefeys

    Abstract: The matching function for the problem of stereo reconstruction or optical flow has been traditionally designed as a function of the distance between the features describing matched pixels. This approach works under assumption, that the appearance of pixels in two stereo cameras or in two consecutive video frames does not change dramatically. However, this might not be the case, if we try to match… ▽ More

    Submitted 2 February, 2015; originally announced February 2015.

    Comments: rejected from ACCV 2014 and probably from CVPR 2015

  30. arXiv:1206.6436  [pdf

    cs.LG stat.ML

    Efficient Structured Prediction with Latent Variables for General Graphical Models

    Authors: Alexander Schwing, Tamir Hazan, Marc Pollefeys, Raquel Urtasun

    Abstract: In this paper we propose a unified framework for structured prediction with latent variables which includes hidden conditional random fields and latent structured support vector machines as special cases. We describe a local entropy approximation for this general formulation using duality, and derive an efficient message passing algorithm that is guaranteed to converge. We demonstrate its effectiv… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)