Skip to main content

Showing 1–13 of 13 results for author: Cavallari, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.14351  [pdf, other

    cs.CV

    Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer

    Authors: Eric Brachmann, Jamie Wynn, Shuai Chen, Tommaso Cavallari, Áron Monszpart, Daniyar Turmukhambetov, Victor Adrian Prisacariu

    Abstract: We address the task of estimating camera parameters from a set of images depicting a scene. Popular feature-based structure-from-motion (SfM) tools solve this task by incremental reconstruction: they repeat triangulation of sparse 3D points and registration of more camera views to the sparse point cloud. We re-interpret incremental structure-from-motion as an iterated application and refinement of… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: Project page: https://nianticlabs.github.io/acezero/

  2. arXiv:2404.09884  [pdf, other

    cs.CV cs.LG

    Map-Relative Pose Regression for Visual Re-Localization

    Authors: Shuai Chen, Tommaso Cavallari, Victor Adrian Prisacariu, Eric Brachmann

    Abstract: Pose regression networks predict the camera pose of a query image relative to a known environment. Within this family of methods, absolute pose regression (APR) has recently shown promising accuracy in the range of a few centimeters in position error. APR networks encode the scene geometry implicitly in their weights. To achieve high accuracy, they require vast amounts of training data that, reali… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2024, Highlight Paper

  3. arXiv:2305.14059  [pdf, other

    cs.CV cs.LG

    Accelerated Coordinate Encoding: Learning to Relocalize in Minutes using RGB and Poses

    Authors: Eric Brachmann, Tommaso Cavallari, Victor Adrian Prisacariu

    Abstract: Learning-based visual relocalizers exhibit leading pose accuracy, but require hours or days of training. Since training needs to happen on each new scene again, long training times make learning-based relocalization impractical for most applications, despite its promise of high accuracy. In this paper we show how such a system can actually achieve the same accuracy in less than 5 minutes. We start… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: CVPR 2023 Highlight

  4. arXiv:2106.16129  [pdf, other

    cs.CV

    Recurrently Estimating Reflective Symmetry Planes from Partial Pointclouds

    Authors: Mihaela Cătălina Stoian, Tommaso Cavallari

    Abstract: Many man-made objects are characterised by a shape that is symmetric along one or more planar directions. Estimating the location and orientation of such symmetry planes can aid many tasks such as estimating the overall orientation of an object of interest or performing shape completion, where a partial scan of an object is reflected across the estimated symmetry plane in order to obtain a more de… ▽ More

    Submitted 30 June, 2021; originally announced June 2021.

    Comments: Presented at the CVPR 2021 Workshop on 3D Vision and Robotics

  5. arXiv:2008.02004  [pdf, other

    cs.CV

    Beyond Controlled Environments: 3D Camera Re-Localization in Changing Indoor Scenes

    Authors: Johanna Wald, Torsten Sattler, Stuart Golodetz, Tommaso Cavallari, Federico Tombari

    Abstract: Long-term camera re-localization is an important task with numerous computer vision and robotics applications. Whilst various outdoor benchmarks exist that target lighting, weather and seasonal changes, far less attention has been paid to appearance changes that occur indoors. This has led to a mismatch between popular indoor benchmarks, which focus on static scenes, and indoor environments that a… ▽ More

    Submitted 5 August, 2020; originally announced August 2020.

    Comments: ECCV 2020, project website https://waldjohannau.github.io/RIO10

  6. arXiv:1907.07745  [pdf, other

    cs.CV eess.IV eess.SP

    Real-Time Highly Accurate Dense Depth on a Power Budget using an FPGA-CPU Hybrid SoC

    Authors: Oscar Rahnama, Tommaso Cavallari, Stuart Golodetz, Alessio Tonioni, Thomas Joy, Luigi Di Stefano, Simon Walker, Philip H. S. Torr

    Abstract: Obtaining highly accurate depth from stereo images in real time has many applications across computer vision and robotics, but in some contexts, upper bounds on power consumption constrain the feasible hardware to embedded platforms such as FPGAs. Whilst various stereo algorithms have been deployed on these platforms, usually cut down to better match the embedded architecture, certain key parts of… ▽ More

    Submitted 17 July, 2019; originally announced July 2019.

    Comments: 6 pages, 7 figures, 2 tables, journal

    Journal ref: IEEE Transactions on Circuits and Systems II: Express Briefs, vol. 66, no. 5, pp. 773-777, May 2019

  7. arXiv:1906.08744  [pdf, other

    cs.CV cs.LG cs.RO

    Let's Take This Online: Adapting Scene Coordinate Regression Network Predictions for Online RGB-D Camera Relocalisation

    Authors: Tommaso Cavallari, Luca Bertinetto, Jishnu Mukhoti, Philip Torr, Stuart Golodetz

    Abstract: Many applications require a camera to be relocalised online, without expensive offline training on the target scene. Whilst both keyframe and sparse keypoint matching methods can be used online, the former often fail away from the training trajectory, and the latter can struggle in textureless regions. By contrast, scene coordinate regression (SCoRe) methods generalise to novel poses and can lever… ▽ More

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: Tommaso Cavallari and Stuart Golodetz contributed equally to this paper

  8. R$^3$SGM: Real-time Raster-Respecting Semi-Global Matching for Power-Constrained Systems

    Authors: Oscar Rahnama, Tommaso Cavallari, Stuart Golodetz, Simon Walker, Philip H. S. Torr

    Abstract: Stereo depth estimation is used for many computer vision applications. Though many popular methods strive solely for depth quality, for real-time mobile applications (e.g. prosthetic glasses or micro-UAVs), speed and power efficiency are equally, if not more, important. Many real-world systems rely on Semi-Global Matching (SGM) to achieve a good accuracy vs. speed balance, but power efficiency is… ▽ More

    Submitted 30 October, 2018; originally announced October 2018.

    Comments: Accepted in FPT 2018 as Oral presentation, 8 pages, 6 figures, 4 tables

    Journal ref: 2018 International Conference on Field-Programmable Technology (FPT)

  9. Real-Time RGB-D Camera Pose Estimation in Novel Scenes using a Relocalisation Cascade

    Authors: Tommaso Cavallari, Stuart Golodetz, Nicholas A. Lord, Julien Valentin, Victor A. Prisacariu, Luigi Di Stefano, Philip H. S. Torr

    Abstract: Camera pose estimation is an important problem in computer vision. Common techniques either match the current image against keyframes with known poses, directly regress the pose, or establish correspondences between keypoints in the image and points in the scene to estimate the pose. In recent years, regression forests have become a popular alternative to establish such correspondences. They achie… ▽ More

    Submitted 2 July, 2019; v1 submitted 29 October, 2018; originally announced October 2018.

    Comments: Tommaso Cavallari, Stuart Golodetz, Nicholas Lord and Julien Valentin assert joint first authorship

    MSC Class: 68T45

  10. Collaborative Large-Scale Dense 3D Reconstruction with Online Inter-Agent Pose Optimisation

    Authors: Stuart Golodetz, Tommaso Cavallari, Nicholas A Lord, Victor A Prisacariu, David W Murray, Philip H S Torr

    Abstract: Reconstructing dense, volumetric models of real-world 3D scenes is important for many tasks, but capturing large scenes can take significant time, and the risk of transient changes to the scene goes up as the capture time increases. These are good reasons to want instead to capture several smaller sub-scenes that can be joined to make the whole scene. Achieving this has traditionally been difficul… ▽ More

    Submitted 2 July, 2019; v1 submitted 25 January, 2018; originally announced January 2018.

    Comments: Stuart Golodetz, Tommaso Cavallari and Nicholas Lord assert joint first authorship

    MSC Class: 68T45

    Journal ref: IEEE Transactions on Visualization and Computer Graphics 24(11):2895-2905, 2018

  11. arXiv:1708.00783  [pdf, other

    cs.CV

    InfiniTAM v3: A Framework for Large-Scale 3D Reconstruction with Loop Closure

    Authors: Victor Adrian Prisacariu, Olaf Kähler, Stuart Golodetz, Michael Sapienza, Tommaso Cavallari, Philip H S Torr, David W Murray

    Abstract: Volumetric models have become a popular representation for 3D scenes in recent years. One breakthrough leading to their popularity was KinectFusion, which focuses on 3D reconstruction using RGB-D sensors. However, monocular SLAM has since also been tackled with very similar approaches. Representing the reconstruction volumetrically as a TSDF leads to most of the simplicity and efficiency that can… ▽ More

    Submitted 2 August, 2017; originally announced August 2017.

    Comments: This article largely supersedes arxiv:1410.0925 (it describes version 3 of the InfiniTAM framework)

  12. arXiv:1702.02779  [pdf, other

    cs.CV

    On-the-Fly Adaptation of Regression Forests for Online Camera Relocalisation

    Authors: Tommaso Cavallari, Stuart Golodetz, Nicholas A. Lord, Julien Valentin, Luigi Di Stefano, Philip H. S. Torr

    Abstract: Camera relocalisation is an important problem in computer vision, with applications in simultaneous localisation and map**, virtual/augmented reality and navigation. Common techniques either match the current image against keyframes with known poses coming from a tracker, or establish 2D-to-3D correspondences between keypoints in the current image and points in the scene in order to estimate the… ▽ More

    Submitted 26 June, 2017; v1 submitted 9 February, 2017; originally announced February 2017.

    Comments: To appear in the proceedings of CVPR 2017

  13. arXiv:1511.04242  [pdf, other

    cs.CV

    Volume-based Semantic Labeling with Signed Distance Functions

    Authors: Tommaso Cavallari, Luigi Di Stefano

    Abstract: Research works on the two topics of Semantic Segmentation and SLAM (Simultaneous Localization and Map**) have been following separate tracks. Here, we link them quite tightly by delineating a category label fusion technique that allows for embedding semantic information into the dense map created by a volume-based SLAM algorithm such as KinectFusion. Accordingly, our approach is the first to pro… ▽ More

    Submitted 13 November, 2015; originally announced November 2015.

    Comments: Submitted to PSIVT2015