Skip to main content

Showing 1–48 of 48 results for author: Baráth, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16204  [pdf, other

    cs.CV

    Breaking the Frame: Image Retrieval by Visual Overlap Prediction

    Authors: Tong Wei, Philipp Lindenberger, Jiri Matas, Daniel Barath

    Abstract: We propose a novel visual place recognition approach, VOP, that efficiently addresses occlusions and complex scenes by shifting from traditional reliance on global image similarities and local features to image overlap prediction. The proposed method enables the identification of visible image sections without requiring expensive feature detection and matching. By focusing on obtaining patch-level… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  2. arXiv:2406.05849  [pdf, other

    cs.RO

    MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps

    Authors: Jianhao Zheng, Daniel Barath, Marc Pollefeys, Iro Armeni

    Abstract: Creating 3D semantic reconstructions of environments is fundamental to many applications, especially when related to autonomous agent operation (e.g., goal-oriented navigation or object interaction and manipulation). Commonly, 3D semantic reconstruction systems capture the entire scene in the same level of detail. However, certain tasks (e.g., object interaction) require a fine-grained and high-re… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  3. arXiv:2404.14565  [pdf, other

    cs.CV

    "Where am I?" Scene Retrieval with Language

    Authors: Jiaqi Chen, Daniel Barath, Iro Armeni, Marc Pollefeys, Hermann Blum

    Abstract: Natural language interfaces to embodied AI are becoming more ubiquitous in our daily lives. This opens further opportunities for language-based interaction with embodied agents, such as a user instructing an agent to execute some task in a specific location. For example, "put the bowls back in the cupboard next to the fridge" or "meet me at the intersection under the red sign." As such, we need me… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  4. arXiv:2404.00469  [pdf, other

    cs.CV

    SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs

    Authors: Yang Miao, Francis Engelmann, Olga Vysotska, Federico Tombari, Marc Pollefeys, Dániel Béla Baráth

    Abstract: We introduce a novel problem, i.e., the localization of an input image within a multi-modal reference map represented by a database of 3D scene graphs. These graphs comprise multiple modalities, including object-level point clouds, images, attributes, and relationships between objects, offering a lightweight and efficient alternative to conventional methods that rely on extensive image databases.… ▽ More

    Submitted 8 July, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

  5. arXiv:2404.00429  [pdf, other

    cs.CV

    Multiway Point Cloud Mosaicking with Diffusion and Global Optimization

    Authors: Shengze **, Iro Armeni, Marc Pollefeys, Daniel Barath

    Abstract: We introduce a novel framework for multiway point cloud mosaicking (named Wednesday), designed to co-align sets of partially overlap** point clouds -- typically obtained from 3D scanners or moving RGB-D cameras -- into a unified coordinate system. At the core of our approach is ODIN, a learned pairwise registration algorithm that iteratively identifies overlaps and refines attention scores, empl… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  6. arXiv:2309.16040  [pdf, other

    cs.CV

    Handbook on Leveraging Lines for Two-View Relative Pose Estimation

    Authors: Petr Hruby, Shaohui Liu, Rémi Pautrat, Marc Pollefeys, Daniel Barath

    Abstract: We propose an approach for estimating the relative pose between calibrated image pairs by jointly exploiting points, lines, and their coincidences in a hybrid manner. We investigate all possible configurations where these data modalities can be used together and review the minimal solvers available in the literature. Our hybrid framework combines the advantages of all configurations, enabling robu… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 2 view relative pose from special configurations of line

    MSC Class: 68T45 ACM Class: I.4.5; I.4.8

  7. arXiv:2309.16023  [pdf, other

    cs.CV

    Q-REG: End-to-End Trainable Point Cloud Registration with Surface Curvature

    Authors: Shengze **, Daniel Barath, Marc Pollefeys, Iro Armeni

    Abstract: Point cloud registration has seen recent success with several learning-based methods that focus on correspondence matching and, as such, optimize only for this objective. Following the learning step of correspondence matching, they evaluate the estimated rigid transformation with a RANSAC-like framework. While it is an indispensable component of these methods, it prevents a fully end-to-end traini… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  8. arXiv:2309.14737  [pdf, other

    cs.RO cs.CV

    Volumetric Semantically Consistent 3D Panoptic Map**

    Authors: Yang Miao, Iro Armeni, Marc Pollefeys, Daniel Barath

    Abstract: We introduce an online 2D-to-3D semantic instance map** algorithm aimed at generating comprehensive, accurate, and efficient semantic 3D maps suitable for autonomous agents in unstructured environments. The proposed approach is based on a Voxel-TSDF representation used in recent algorithms. It introduces novel ways of integrating semantic prediction confidence during map**, producing semantic… ▽ More

    Submitted 8 July, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: 8 pages, 2 figures

  9. arXiv:2308.10694  [pdf, other

    cs.CV

    Vanishing Point Estimation in Uncalibrated Images with Prior Gravity Direction

    Authors: Rémi Pautrat, Shaohui Liu, Petr Hruby, Marc Pollefeys, Daniel Barath

    Abstract: We tackle the problem of estimating a Manhattan frame, i.e. three orthogonal vanishing points, and the unknown focal length of the camera, leveraging a prior vertical direction. The direction can come from an Inertial Measurement Unit that is a standard component of recent consumer devices, e.g., smartphones. We provide an exhaustive analysis of minimal line configurations and derive two new 2-lin… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted at ICCV 2023

  10. arXiv:2307.15381  [pdf, other

    cs.CV

    AffineGlue: Joint Matching and Robust Estimation

    Authors: Daniel Barath, Dmytro Mishkin, Luca Cavalli, Paul-Edouard Sarlin, Petr Hruby, Marc Pollefeys

    Abstract: We propose AffineGlue, a method for joint two-view feature matching and robust estimation that reduces the combinatorial complexity of the problem by employing single-point minimal solvers. AffineGlue selects potential matches from one-to-many correspondences to estimate minimal models. Guided matching is then used to find matches consistent with the model, suffering less from the ambiguities of o… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

  11. arXiv:2307.14030  [pdf, other

    cs.CV

    Consensus-Adaptive RANSAC

    Authors: Luca Cavalli, Daniel Barath, Marc Pollefeys, Viktor Larsson

    Abstract: RANSAC and its variants are widely used for robust estimation, however, they commonly follow a greedy approach to finding the highest scoring model while ignoring other model hypotheses. In contrast, Iteratively Reweighted Least Squares (IRLS) techniques gradually approach the model by iteratively updating the weight of each correspondence based on the residuals from previous iterations. Inspired… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

  12. arXiv:2306.12547  [pdf, other

    cs.CV

    DGC-GNN: Leveraging Geometry and Color Cues for Visual Descriptor-Free 2D-3D Matching

    Authors: Shuzhe Wang, Juho Kannala, Daniel Barath

    Abstract: Matching 2D keypoints in an image to a sparse 3D point cloud of the scene without requiring visual descriptors has garnered increased interest due to its low memory requirements, inherent privacy preservation, and reduced need for expensive 3D model maintenance compared to visual descriptor-based methods. However, existing algorithms often compromise on performance, resulting in a significant dete… ▽ More

    Submitted 24 March, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: CVPR 2024

  13. arXiv:2304.14880  [pdf, other

    cs.CV

    SGAligner : 3D Scene Alignment with Scene Graphs

    Authors: Sayan Deb Sarkar, Ondrej Miksik, Marc Pollefeys, Daniel Barath, Iro Armeni

    Abstract: Building 3D scene graphs has recently emerged as a topic in scene representation for several embodied AI applications to represent the world in a structured and rich manner. With their increased use in solving downstream tasks (eg, navigation and room rearrangement), can we leverage and recycle them for creating 3D maps of environments, a pivotal step in agent operation? We focus on the fundamenta… ▽ More

    Submitted 26 September, 2023; v1 submitted 28 April, 2023; originally announced April 2023.

    Comments: Accepted at ICCV 2023

  14. arXiv:2303.16078  [pdf, other

    cs.CV

    Relative pose of three calibrated and partially calibrated cameras from four points using virtual correspondences

    Authors: Charalambos Tzamos, Daniel Barath, Torsten Sattler, Zuzana Kukelova

    Abstract: We study challenging problems of estimating the relative pose of three cameras and propose novel efficient solutions to (1) the notoriously difficult configuration of four points in three calibrated views, known as the 4p3v problem, and (2) to the previously unsolved configuration of four points in three cameras with unknown shared focal length, i.e., the 4p3vf problem. Our solutions are based on… ▽ More

    Submitted 11 December, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

  15. arXiv:2303.05195  [pdf, other

    cs.CV

    Revisiting Rotation Averaging: Uncertainties and Robust Losses

    Authors: Ganlin Zhang, Viktor Larsson, Daniel Barath

    Abstract: In this paper, we revisit the rotation averaging problem applied in global Structure-from-Motion pipelines. We argue that the main problem of current methods is the minimized cost function that is only weakly connected with the input data via the estimated epipolar geometries.We propose to better model the underlying noise distributions by directly propagating the uncertainty from the point corres… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: submitted to CVPR2023

  16. arXiv:2302.09997  [pdf, other

    cs.CV

    A Large Scale Homography Benchmark

    Authors: Daniel Barath, Dmytro Mishkin, Michal Polic, Wolfgang Förstner, Jiri Matas

    Abstract: We present a large-scale dataset of Planes in 3D, Pi3D, of roughly 1000 planes observed in 10 000 images from the 1DSfM dataset, and HEB, a large-scale homography estimation benchmark leveraging Pi3D. The applications of the Pi3D dataset are diverse, e.g. training or evaluating monocular depth, surface normal estimation and image matching algorithms. The HEB dataset consists of 226 260 homographie… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

  17. arXiv:2212.13185  [pdf, other

    cs.CV

    Generalized Differentiable RANSAC

    Authors: Tong Wei, Yash Patel, Alexander Shekhovtsov, Jiri Matas, Daniel Barath

    Abstract: We propose $\nabla$-RANSAC, a generalized differentiable RANSAC that allows learning the entire randomized robust estimation pipeline. The proposed approach enables the use of relaxation techniques for estimating the gradients in the sampling distribution, which are then propagated through a differentiable solver. The trainable quality function marginalizes over the scores from all the models esti… ▽ More

    Submitted 8 September, 2023; v1 submitted 26 December, 2022; originally announced December 2022.

  18. arXiv:2212.07766  [pdf, other

    cs.CV

    DeepLSD: Line Segment Detection and Refinement with Deep Image Gradients

    Authors: Rémi Pautrat, Daniel Barath, Viktor Larsson, Martin R. Oswald, Marc Pollefeys

    Abstract: Line segments are ubiquitous in our human-made world and are increasingly used in vision tasks. They are complementary to feature points thanks to their spatial extent and the structural information they provide. Traditional line detectors based on the image gradient are extremely fast and accurate, but lack robustness in noisy images and challenging conditions. Their learned counterparts are more… ▽ More

    Submitted 28 March, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: Accepted at CVPR 2023

  19. arXiv:2207.07872  [pdf, other

    cs.CV cs.LG

    NeFSAC: Neurally Filtered Minimal Samples

    Authors: Luca Cavalli, Marc Pollefeys, Daniel Barath

    Abstract: Since RANSAC, a great deal of research has been devoted to improving both its accuracy and run-time. Still, only a few methods aim at recognizing invalid minimal samples early, before the often expensive model estimation and quality calculation are done. To this end, we propose NeFSAC, an efficient algorithm for neural filtering of motion-inconsistent and poorly-conditioned minimal samples. We tra… ▽ More

    Submitted 16 July, 2022; originally announced July 2022.

    Comments: Published in the 17th European Conference on Computer Vision (ECCV 2022)

  20. arXiv:2203.07930  [pdf, other

    cs.CV

    Relative Pose from SIFT Features

    Authors: Daniel Barath, Zuzana Kukelova

    Abstract: This paper proposes the geometric relationship of epipolar geometry and orientation- and scale-covariant, e.g., SIFT, features. We derive a new linear constraint relating the unknown elements of the fundamental matrix and the orientation and scale. This equation can be used together with the well-known epipolar constraint to, e.g., estimate the fundamental matrix from four SIFT correspondences, es… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

  21. arXiv:2111.14093  [pdf, other

    cs.CV

    Adaptive Reordering Sampler with Neurally Guided MAGSAC

    Authors: Tong Wei, Jiri Matas, Daniel Barath

    Abstract: We propose a new sampler for robust estimators that always selects the sample with the highest probability of consisting only of inliers. After every unsuccessful iteration, the inlier probabilities are updated in a principled way via a Bayesian approach. The probabilities obtained by the deep network are used as prior (so-called neural guidance) inside the sampler. Moreover, we introduce a new lo… ▽ More

    Submitted 8 September, 2023; v1 submitted 28 November, 2021; originally announced November 2021.

  22. arXiv:2111.12385  [pdf, other

    cs.CV

    Space-Partitioning RANSAC

    Authors: Daniel Barath, Gabor Valasek

    Abstract: A new algorithm is proposed to accelerate RANSAC model quality calculations. The method is based on partitioning the joint correspondence space, e.g., 2D-2D point correspondences, into a pair of regular grids. The grid cells are mapped by minimal sample models, estimated within RANSAC, to reject correspondences that are inconsistent with the model parameters early. The proposed technique is genera… ▽ More

    Submitted 20 July, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

  23. arXiv:2106.10240  [pdf, other

    cs.CV

    VSAC: Efficient and Accurate Estimator for H and F

    Authors: Maksym Ivashechkin, Daniel Barath, Jiri Matas

    Abstract: We present VSAC, a RANSAC-type robust estimator with a number of novelties. It benefits from the introduction of the concept of independent inliers that improves significantly the efficacy of the dominant plane handling and, also, allows near error-free rejection of incorrect models, without false positives. The local optimization process and its application is improved so that it is run on averag… ▽ More

    Submitted 13 September, 2021; v1 submitted 18 June, 2021; originally announced June 2021.

  24. arXiv:2104.05044  [pdf, other

    cs.CV

    USACv20: robust essential, fundamental and homography matrix estimation

    Authors: Maksym Ivashechkin, Daniel Barath, Jiri Matas

    Abstract: We review the most recent RANSAC-like hypothesize-and-verify robust estimators. The best performing ones are combined to create a state-of-the-art version of the Universal Sample Consensus (USAC) algorithm. A recent objective is to implement a modular and optimized framework, making future RANSAC modules easy to be included. The proposed method, USACv20, is tested on eight publicly available real-… ▽ More

    Submitted 11 April, 2021; originally announced April 2021.

    Comments: arXiv admin note: text overlap with arXiv:1912.05909

  25. arXiv:2103.13875  [pdf, other

    cs.CV

    Finding Geometric Models by Clustering in the Consensus Space

    Authors: Daniel Barath, Denys Rozumny, Ivan Eichhardt, Levente Hajder, Jiri Matas

    Abstract: We propose a new algorithm for finding an unknown number of geometric models, e.g., homographies. The problem is formalized as finding dominant model instances progressively without forming crisp point-to-model assignments. Dominant instances are found via a RANSAC-like sampling and a consolidation process driven by a model quality function considering previously proposed instances. New ones are f… ▽ More

    Submitted 17 April, 2023; v1 submitted 25 March, 2021; originally announced March 2021.

  26. arXiv:2103.06535  [pdf, other

    cs.CV

    Calibrated and Partially Calibrated Semi-Generalized Homographies

    Authors: Snehal Bhayani, Torsten Sattler, Daniel Barath, Patrik Beliansky, Janne Heikkila, Zuzana Kukelova

    Abstract: In this paper, we propose the first minimal solutions for estimating the semi-generalized homography given a perspective and a generalized camera. The proposed solvers use five 2D-2D image point correspondences induced by a scene plane. One of them assumes the perspective camera to be fully calibrated, while the other solver estimates the unknown focal length together with the absolute pose parame… ▽ More

    Submitted 11 October, 2021; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: Accepted to ICCV 2021 and to appear in the conference proceedings

  27. arXiv:2012.00465  [pdf, other

    cs.CV

    Minimal Solutions for Panoramic Stitching Given Gravity Prior

    Authors: Yaqing Ding, Daniel Barath, Zuzana Kukelova

    Abstract: When capturing panoramas, people tend to align their cameras with the vertical axis, i.e., the direction of gravity. Moreover, modern devices, such as smartphones and tablets, are equipped with an IMU (Inertial Measurement Unit) that can measure the gravity vector accurately. Using this prior, the y-axes of the cameras can be aligned or assumed to be already aligned, reducing their relative orient… ▽ More

    Submitted 1 December, 2020; originally announced December 2020.

  28. arXiv:2012.00458  [pdf, other

    cs.CV

    Globally Optimal Relative Pose Estimation with Gravity Prior

    Authors: Yaqing Ding, Daniel Barath, Jian Yang, Hui Kong, Zuzana Kukelova

    Abstract: Smartphones, tablets and camera systems used, e.g., in cars and UAVs, are typically equipped with IMUs (inertial measurement units) that can measure the gravity vector accurately. Using this additional information, the $y$-axes of the cameras can be aligned, reducing their relative orientation to a single degree-of-freedom. With this assumption, we propose a novel globally optimal solver, minimizi… ▽ More

    Submitted 4 February, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

  29. arXiv:2011.11986  [pdf, other

    cs.CV

    Efficient Initial Pose-graph Generation for Global SfM

    Authors: Daniel Barath, Dmytro Mishkin, Ivan Eichhardt, Ilia Shipachev, Jiri Matas

    Abstract: We propose ways to speed up the initial pose-graph generation for global Structure-from-Motion algorithms. To avoid forming tentative point correspondences by FLANN and geometric verification by RANSAC, which are the most time-consuming steps of the pose-graph creation, we propose two new methods - built on the fact that image pairs usually are matched consecutively. Thus, candidate relative poses… ▽ More

    Submitted 26 November, 2020; v1 submitted 24 November, 2020; originally announced November 2020.

    Comments: Added supplementary material

  30. arXiv:2011.08790  [pdf, other

    cs.CV

    P1AC: Revisiting Absolute Pose From a Single Affine Correspondence

    Authors: Jonathan Ventura, Zuzana Kukelova, Torsten Sattler, Dániel Baráth

    Abstract: Affine correspondences have traditionally been used to improve feature matching over wide baselines. While recent work has successfully used affine correspondences to solve various relative camera pose estimation problems, less attention has been given to their use in absolute pose estimation. We introduce the first general solution to the problem of estimating the pose of a calibrated camera give… ▽ More

    Submitted 29 June, 2024; v1 submitted 17 November, 2020; originally announced November 2020.

    Comments: ICCV 2023 (with corrections in Eqs. 6 and 13 and Fig. 4)

  31. arXiv:2008.05743  [pdf, other

    cs.CV

    Pose Estimation for Vehicle-mounted Cameras via Horizontal and Vertical Planes

    Authors: Istan Gergo Gal, Daniel Barath, Levente Hajder

    Abstract: We propose two novel solvers for estimating the egomotion of a calibrated camera mounted to a moving vehicle from a single affine correspondence via recovering special homographies. For the first class of solvers, the sought plane is expected to be perpendicular to one of the camera axes. For the second class, the plane is orthogonal to the ground with unknown normal, e.g., it is a building facade… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

  32. arXiv:2007.10700  [pdf, other

    cs.CV cs.RO

    Minimal Cases for Computing the Generalized Relative Pose using Affine Correspondences

    Authors: Banglei Guan, Ji Zhao, Daniel Barath, Friedrich Fraundorfer

    Abstract: We propose three novel solvers for estimating the relative pose of a multi-camera system from affine correspondences (ACs). A new constraint is derived interpreting the relationship of ACs and the generalized camera model. Using the constraint, we demonstrate efficient solvers for two types of motions assumed. Considering that the cameras undergo planar motion, we propose a minimal solution using… ▽ More

    Submitted 19 August, 2021; v1 submitted 21 July, 2020; originally announced July 2020.

    Comments: ICCV 2021

  33. arXiv:2007.10082  [pdf, other

    cs.CV

    Relative Pose from Deep Learned Depth and a Single Affine Correspondence

    Authors: Ivan Eichhardt, Daniel Barath

    Abstract: We propose a new approach for combining deep-learned non-metric monocular depth with affine correspondences (ACs) to estimate the relative pose of two calibrated cameras from a single correspondence. Considering the depth information and affine features, two new constraints on the camera pose are derived. The proposed solver is usable within 1-point RANSAC approaches. Thus, the processing time of… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

  34. arXiv:2007.10032  [pdf, other

    cs.CV

    Making Affine Correspondences Work in Camera Geometry Computation

    Authors: Daniel Barath, Michal Polic, Wolfgang Förstner, Torsten Sattler, Tomas Pajdla, Zuzana Kukelova

    Abstract: Local features e.g. SIFT and its affine and learned variants provide region-to-region rather than point-to-point correspondences. This has recently been exploited to create new minimal solvers for classical problems such as homography, essential and fundamental matrix estimation. The main advantage of such solvers is that their sample size is smaller, e.g., only two instead of four matches are req… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

  35. arXiv:2004.00605  [pdf, other

    cs.CV cs.LG cs.RO eess.IV

    EPOS: Estimating 6D Pose of Objects with Symmetries

    Authors: Tomas Hodan, Daniel Barath, Jiri Matas

    Abstract: We present a new method for estimating the 6D pose of rigid objects with available 3D models from a single RGB input image. The method is applicable to a broad range of objects, including challenging ones with global or partial symmetries. An object is represented by compact surface fragments which allow handling symmetries in a systematic manner. Correspondences between densely sampled pixels and… ▽ More

    Submitted 1 April, 2020; originally announced April 2020.

    Comments: Accepted to CVPR 2020

  36. arXiv:1912.06465  [pdf, other

    cs.CV

    Relative planar motion for vehicle-mounted cameras from a single affine correspondence

    Authors: Levente Hajder, Daniel Barath

    Abstract: Two solvers are proposed for estimating the extrinsic camera parameters from a single affine correspondence assuming general planar motion. In this case, the camera movement is constrained to a plane and the image plane is orthogonal to the ground. The algorithms do not assume other constraints, e.g.\ the non-holonomic one, to hold. A new minimal solver is proposed for the semi-calibrated case, i.… ▽ More

    Submitted 13 December, 2019; originally announced December 2019.

  37. arXiv:1912.06464  [pdf, other

    cs.CV

    Least-squares Optimal Relative Planar Motion for Vehicle-mounted Cameras

    Authors: Levente Hajder, Daniel Barath

    Abstract: A new closed-form solver is proposed minimizing the algebraic error optimally, in the least-squares sense, to estimate the relative planar motion of two calibrated cameras. The main objective is to solve the over-determined case, i.e., when a larger-than-minimal sample of point correspondences is given - thus, estimating the motion from at least three correspondences. The algorithm requires the ca… ▽ More

    Submitted 13 December, 2019; originally announced December 2019.

  38. arXiv:1912.05909  [pdf, other

    cs.CV

    MAGSAC++, a fast, reliable and accurate robust estimator

    Authors: Daniel Barath, Jana Noskova, Maksym Ivashechkin, Jiri Matas

    Abstract: A new method for robust estimation, MAGSAC++, is proposed. It introduces a new model quality (scoring) function that does not require the inlier-outlier decision, and a novel marginalization procedure formulated as an iteratively re-weighted least-squares approach. We also propose a new sampler, Progressive NAPSAC, for RANSAC-like robust estimators. Exploiting the fact that nearby points often ori… ▽ More

    Submitted 11 December, 2019; originally announced December 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1906.02295

  39. arXiv:1906.11927  [pdf, other

    cs.CV

    Homography from two orientation- and scale-covariant features

    Authors: Daniel Barath, Zuzana Kukelova

    Abstract: This paper proposes a geometric interpretation of the angles and scales which the orientation- and scale-covariant feature detectors, e.g. SIFT, provide. Two new general constraints are derived on the scales and rotations which can be used in any geometric model estimation tasks. Using these formulas, two new constraints on homography estimation are introduced. Exploiting the derived equations, a… ▽ More

    Submitted 27 June, 2019; originally announced June 2019.

  40. arXiv:1906.02295  [pdf, other

    cs.CV

    Progressive NAPSAC: sampling from gradually growing neighborhoods

    Authors: Daniel Barath, Maksym Ivashechkin, Jiri Matas

    Abstract: We propose Progressive NAPSAC, P-NAPSAC in short, which merges the advantages of local and global sampling by drawing samples from gradually growing neighborhoods. Exploiting the fact that nearby points are more likely to originate from the same geometric model, P-NAPSAC finds local structures earlier than global samplers. We show that the progressive spatial sampling in P-NAPSAC can be integrated… ▽ More

    Submitted 5 June, 2019; originally announced June 2019.

  41. arXiv:1906.02290  [pdf, other

    cs.CV

    Progressive-X: Efficient, Anytime, Multi-Model Fitting Algorithm

    Authors: Daniel Barath, Jiri Matas

    Abstract: The Progressive-X algorithm, Prog-X in short, is proposed for geometric multi-model fitting. The method interleaves sampling and consolidation of the current data interpretation via repetitive hypothesis proposal, fast rejection, and integration of the new hypothesis into the kept instance set by labeling energy minimization. Due to exploring the data progressively, the method has several benefici… ▽ More

    Submitted 5 June, 2019; originally announced June 2019.

  42. arXiv:1905.00519  [pdf, other

    cs.CV

    Optimal Multi-view Correction of Local Affine Frames

    Authors: Ivan Eichhardt, Daniel Barath

    Abstract: The technique requires the epipolar geometry to be pre-estimated between each image pair. It exploits the constraints which the camera movement implies, in order to apply a closed-form correction to the parameters of the input affinities. Also, it is shown that the rotations and scales obtained by partially affine-covariant detectors, e.g., AKAZE or SIFT, can be completed to be full affine frames… ▽ More

    Submitted 1 May, 2019; originally announced May 2019.

  43. arXiv:1807.03503  [pdf, other

    cs.CV

    Recovering affine features from orientation- and scale-invariant ones

    Authors: Daniel Barath

    Abstract: An approach is proposed for recovering affine correspondences (ACs) from orientation- and scale-invariant, e.g. SIFT, features. The method calculates the affine parameters consistent with a pre-estimated epipolar geometry from the point coordinates and the scales and rotations which the feature detector obtains. The closed-form solution is given as the roots of a quadratic polynomial equation, thu… ▽ More

    Submitted 10 July, 2018; originally announced July 2018.

  44. arXiv:1803.07469  [pdf, other

    cs.CV

    MAGSAC: marginalizing sample consensus

    Authors: Daniel Barath, Jana Noskova, Jiri Matas

    Abstract: A method called, sigma-consensus, is proposed to eliminate the need for a user-defined inlier-outlier threshold in RANSAC. Instead of estimating the noise sigma, it is marginalized over a range of noise scales. The optimized model is obtained by weighted least-squares fitting where the weights come from the marginalization over sigma of the point likelihoods of being inliers. A new quality functio… ▽ More

    Submitted 4 June, 2019; v1 submitted 20 March, 2018; originally announced March 2018.

  45. arXiv:1803.00260  [pdf, other

    cs.CV eess.IV

    Five-point Fundamental Matrix Estimation for Uncalibrated Cameras

    Authors: Daniel Barath

    Abstract: We aim at estimating the fundamental matrix in two views from five correspondences of rotation invariant features obtained by e.g.\ the SIFT detector. The proposed minimal solver first estimates a homography from three correspondences assuming that they are co-planar and exploiting their rotational components. Then the fundamental matrix is obtained from the homography and two additional point pai… ▽ More

    Submitted 1 March, 2018; originally announced March 2018.

  46. arXiv:1706.01649  [pdf, other

    cs.CV

    A Minimal Solution for Two-view Focal-length Estimation using Two Affine Correspondences

    Authors: Daniel Barath, Tekla Toth, Levente Hajder

    Abstract: A minimal solution using two affine correspondences is presented to estimate the common focal length and the fundamental matrix between two semi-calibrated cameras - known intrinsic parameters except a common focal length. To the best of our knowledge, this problem is unsolved. The proposed approach extends point correspondence-based techniques with linear constraints derived from local affine tra… ▽ More

    Submitted 6 June, 2017; originally announced June 2017.

  47. arXiv:1706.00984  [pdf, other

    cs.CV

    Graph-Cut RANSAC

    Authors: Daniel Barath, Jiri Matas

    Abstract: A novel method for robust estimation, called Graph-Cut RANSAC, GC-RANSAC in short, is introduced. To separate inliers and outliers, it runs the graph-cut algorithm in the local optimization (LO) step which is applied when a so-far-the-best model is found. The proposed LO step is conceptually simple, easy to implement, globally optimal and efficient. GC-RANSAC is shown experimentally, both on synth… ▽ More

    Submitted 16 November, 2017; v1 submitted 3 June, 2017; originally announced June 2017.

  48. arXiv:1706.00827  [pdf, other

    cs.CV

    Multi-Class Model Fitting by Energy Minimization and Mode-Seeking

    Authors: Daniel Barath, Jiri Matas

    Abstract: We propose a general formulation, called Multi-X, for multi-class multi-instance model fitting - the problem of interpreting the input data as a mixture of noisy observations originating from multiple instances of multiple classes. We extend the commonly used alpha-expansion-based technique with a new move in the label space. The move replaces a set of labels with the corresponding density mode in… ▽ More

    Submitted 16 November, 2017; v1 submitted 2 June, 2017; originally announced June 2017.