Showing 1–10 of 10 results for author: Galliani, S

  1. arXiv:2406.04340  [pdf, other]

    cs.CV

    GLACE: Global Local Accelerated Coordinate Encoding

    Authors: Fangjinhua Wang, Xudong Jiang, Silvano Galliani, Christoph Vogel, Marc Pollefeys

    Abstract: Scene coordinate regression (SCR) methods are a family of visual localization methods that directly regress 2D-3D matches for camera pose estimation. They are effective in small-scale scenes but face significant challenges in large-scale scenes that are further amplified in the absence of ground truth 3D point clouds for supervision. Here, the model can only rely on reprojection constraints and ne…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Large-scale visual localization with a single optimizable MLP. CVPR 2024. Code: https://github.com/cvg/glace. Project page: https://xjiangan.github.io/glace

  2. arXiv:2112.05126  [pdf, other]

    cs.CV

    IterMVS: Iterative Probability Estimation for Efficient Multi-View Stereo

    Authors: Fangjinhua Wang, Silvano Galliani, Christoph Vogel, Marc Pollefeys

    Abstract: We present IterMVS, a new data-driven method for high-resolution multi-view stereo. We propose a novel GRU-based estimator that encodes pixel-wise probability distributions of depth in its hidden state. Ingesting multi-scale matching information, our model refines these distributions over multiple iterations and infers depth and confidence. To extract the depth maps, we combine traditional classif…

    Submitted 9 December, 2021; originally announced December 2021.

  3. arXiv:2012.02177  [pdf, other]

    cs.CV cs.LG

    DeepVideoMVS: Multi-View Stereo on Video with Recurrent Spatio-Temporal Fusion

    Authors: Arda Düzçeker, Silvano Galliani, Christoph Vogel, Pablo Speciale, Mihai Dusmanu, Marc Pollefeys

    Abstract: We propose an online multi-view depth prediction approach on posed video streams, where the scene geometry information computed in the previous time steps is propagated to the current time step in an efficient and geometrically plausible way. The backbone of our approach is a real-time capable, lightweight encoder-decoder that relies on cost volumes computed from pairs of images. We extend it by p…

    Submitted 21 July, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: CVPR 2021

  4. arXiv:2012.01411  [pdf, other]

    cs.CV

    PatchmatchNet: Learned Multi-View Patchmatch Stereo

    Authors: Fangjinhua Wang, Silvano Galliani, Christoph Vogel, Pablo Speciale, Marc Pollefeys

    Abstract: We present PatchmatchNet, a novel and learnable cascade formulation of Patchmatch for high-resolution multi-view stereo. With high computation speed and low memory requirement, PatchmatchNet can process higher resolution imagery and is more suited to run on resource limited devices than competitors that employ 3D cost volume regularization. For the first time we introduce an iterative multi-scale…

    Submitted 2 December, 2020; originally announced December 2020.

  5. arXiv:2008.11239  [pdf, other]

    cs.CV

    HoloLens 2 Research Mode as a Tool for Computer Vision Research

    Authors: Dorin Ungureanu, Federica Bogo, Silvano Galliani, Pooja Sama, Xin Duan, Casey Meekhof, Jan Stühmer, Thomas J. Cashman, Bugra Tekin, Johannes L. Schönberger, Pawel Olszta, Marc Pollefeys

    Abstract: Mixed reality headsets, such as the Microsoft HoloLens 2, are powerful sensing devices with integrated compute capabilities, which makes it an ideal platform for computer vision research. In this technical report, we present HoloLens 2 Research Mode, an API and a set of tools enabling access to the raw sensor streams. We provide an overview of the API and explain how it can be used to build mixed…

    Submitted 25 August, 2020; originally announced August 2020.

  6. Super-resolution of Sentinel-2 images: Learning a globally applicable deep neural network

    Authors: Charis Lanaras, José Bioucas-Dias, Silvano Galliani, Emmanuel Baltsavias, Konrad Schindler

    Abstract: The Sentinel-2 satellite mission delivers multi-spectral imagery with 13 spectral bands, acquired at three different spatial resolutions. The aim of this research is to super-resolve the lower-resolution (20 m and 60 m Ground Sampling Distance - GSD) bands to 10 m GSD, so as to obtain a complete data cube at the maximal sensor resolution. We employ a state-of-the-art convolutional neural network (…

    Submitted 1 October, 2018; v1 submitted 12 March, 2018; originally announced March 2018.

    Comments: 19 pages, 11 figures

    Journal ref: ISPRS Journal of Photogrammetry and Remote Sensing, 146 (2018), pp. 305-319

  7. Inference, Learning and Attention Mechanisms that Exploit and Preserve Sparsity in Convolutional Networks

    Authors: Timo Hackel, Mikhail Usvyatsov, Silvano Galliani, Jan D. Wegner, Konrad Schindler

    Abstract: While CNNs naturally lend themselves to densely sampled data, and sophisticated implementations are available, they lack the ability to efficiently process sparse data. In this work we introduce a suite of tools that exploit sparsity in both the feature maps and the filter weights, and thereby allow for significantly lower memory footprints and computation times than the conventional dense framewo…

    Submitted 12 March, 2020; v1 submitted 31 January, 2018; originally announced January 2018.

    Comments: Updated to IJCV version

  8. arXiv:1703.09470  [pdf, other]

    cs.CV cs.LG

    Learned Spectral Super-Resolution

    Authors: Silvano Galliani, Charis Lanaras, Dimitrios Marmanis, Emmanuel Baltsavias, Konrad Schindler

    Abstract: We describe a novel method for blind, single-image spectral super-resolution. While conventional super-resolution aims to increase the spatial resolution of an input image, our goal is to spectrally enhance the input, i.e., generate an image with the same spatial resolution, but a greatly increased number of narrow (hyper-spectral) wave-length bands. Just like the spatial statistics of natural ima…

    Submitted 28 March, 2017; originally announced March 2017.

    Comments: Submitted to ICCV 2017 (10 pages, 8 figures)

  9. arXiv:1703.08836  [pdf, other]

    cs.CV cs.LG

    Learned Multi-Patch Similarity

    Authors: Wilfried Hartmann, Silvano Galliani, Michal Havlena, Luc Van Gool, Konrad Schindler

    Abstract: Estimating a depth map from multiple views of a scene is a fundamental task in computer vision. As soon as more than two viewpoints are available, one faces the very basic question how to measure similarity across >2 image patches. Surprisingly, no direct solution exists, instead it is common to fall back to more or less robust averaging of two-view similarities. Encouraged by the success of machi…

    Submitted 21 August, 2017; v1 submitted 26 March, 2017; originally announced March 2017.

    Comments: 10 pages, 7 figures, Accepted at ICCV 2017

  10. arXiv:1612.01337  [pdf, other]

    cs.CV

    Classification With an Edge: Improving Semantic Image Segmentation with Boundary Detection

    Authors: Dimitrios Marmanis, Konrad Schindler, Jan Dirk Wegner, Silvano Galliani, Mihai Datcu, Uwe Stilla

    Abstract: We present an end-to-end trainable deep convolutional neural network (DCNN) for semantic segmentation with built-in awareness of semantically meaningful boundaries. Semantic segmentation is a fundamental remote sensing task, and most state-of-the-art methods rely on DCNNs as their workhorse. A major reason for their success is that deep networks learn to accumulate contextual information over very…

    Submitted 21 December, 2017; v1 submitted 5 December, 2016; originally announced December 2016.

    Journal ref: ISPRS Journal of Photogrammetry and Remote Sensing, Volume 135, January 2018, Pages 158-172, ISSN 0924-2716, https://doi.org/10.1016/j.isprsjprs.2017.11.009