Showing 1–10 of 10 results for author: Bökman, G

Searching in archive cs.
  1. arXiv:2404.08928

    cs.CV

    DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector

    Authors: Johan Edstedt, Georg Bökman, Zhenjun Zhao

    Abstract: In this paper, we analyze and improve the recently proposed DeDoDe keypoint detector. We focus our analysis on some key issues. First, we find that DeDoDe keypoints tend to cluster together, which we fix by performing non-max suppression on the target distribution of the detector during training. Second, we address issues related to data augmentation. In particular, the DeDoDe detector is sen…

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: Accepted to Sixth Workshop on Image Matching - CVPRW 2024

  2. arXiv:2312.02152

    cs.CV

    Steerers: A framework for rotation equivariant keypoint descriptors

    Authors: Georg Bökman, Johan Edstedt, Michael Felsberg, Fredrik Kahl

    Abstract: Image keypoint descriptions that are discriminative and matchable over large changes in viewpoint are vital for 3D reconstruction. However, descriptions output by learned descriptors are typically not robust to camera rotation. While they can be made more robust by, e.g., data augmentation, this degrades performance on upright images. Another approach is test-time augmentation, which incurs a sign…

    Submitted 2 April, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: CVPR 2024 Camera ready

  3. arXiv:2310.01092

    cs.CV

    Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images

    Authors: Georg Bökman, Johan Edstedt

    Abstract: We present the top ranked solution for the AISG-SLA Visual Localisation Challenge benchmark (IJCAI 2023), where the task is to estimate relative motion between images taken in sequence by a camera mounted on a car driving through an urban scene. For matching images we use our recent deep learning based matcher RoMa. Matching image pairs sequentially and estimating relative motion from point corr…

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: Technical report for the top ranked solution to the AISG-SLA visual localization challenge at IJCAI 2023

  4. arXiv:2308.08479

    cs.CV

    DeDoDe: Detect, Don't Describe -- Describe, Don't Detect for Local Feature Matching

    Authors: Johan Edstedt, Georg Bökman, Mårten Wadenbäck, Michael Felsberg

    Abstract: Keypoint detection is a pivotal step in 3D reconstruction, whereby sets of (up to) K points are detected in each view of a scene. Crucially, the detected points need to be consistent between views, i.e., correspond to the same 3D point in the scene. One of the main challenges with keypoint detection is the formulation of the learning objective. Previous learning-based methods typically jointly lea…

    Submitted 11 December, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

    Comments: Accepted to 3DV 2024 (Oral)

  5. arXiv:2305.17017

    cs.LG

    Investigating how ReLU-networks encode symmetries

    Authors: Georg Bökman, Fredrik Kahl

    Abstract: Many data symmetries can be described in terms of group equivariance and the most common way of encoding group equivariances in neural networks is by building linear layers that are group equivariant. In this work we investigate whether equivariance of a network implies that all layers are equivariant. On the theoretical side we find cases where equivariance implies layerwise equivariance, but als…

    Submitted 8 December, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: NeurIPS camera ready

  6. arXiv:2305.15404

    cs.CV

    RoMa: Robust Dense Feature Matching

    Authors: Johan Edstedt, Qiyu Sun, Georg Bökman, Mårten Wadenbäck, Michael Felsberg

    Abstract: Feature matching is an important computer vision task that involves estimating correspondences between two images of a 3D scene, and dense methods estimate all such correspondences. The aim is to learn a robust model, i.e., a model able to match under challenging real-world changes. In this work, we propose such a model, leveraging frozen pretrained features from the foundation model DINOv2. Altho…

    Submitted 11 December, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  7. arXiv:2209.14719

    cs.CV

    In Search of Projectively Equivariant Networks

    Authors: Georg Bökman, Axel Flinth, Fredrik Kahl

    Abstract: Equivariance of linear neural network layers is well studied. In this work, we relax the equivariance condition to only be true in a projective sense. We propose a way to construct a projectively equivariant neural network through building a standard equivariant network where the linear group representations acting on each intermediate feature space are "multiplicatively modified lifts" of project…

    Submitted 20 December, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: v3: Another significant rewrite. Accepted for publication in TMLR. v2: Significant rewrite. The title has been changed: "neural network" -> "network". More general description of projectively equivariant linear layers, with new proposed architectures, and a completely new accompanying experiment section, as a result

    MSC Class: 68T07 (Primary) 20C35 (Secondary)

  8. arXiv:2204.10144

    cs.CV

    A case for using rotation invariant features in state of the art feature matchers

    Authors: Georg Bökman, Fredrik Kahl

    Abstract: The aim of this paper is to demonstrate that a state of the art feature matcher (LoFTR) can be made more robust to rotations by simply replacing the backbone CNN with a steerable CNN which is equivariant to translations and image rotations. It is experimentally shown that this boost is obtained without reducing performance on ordinary illumination and viewpoint matching sequences.

    Submitted 3 July, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

    Comments: CVPRW 2022, updated version

  9. arXiv:2201.13065

    cs.CV cs.LG

    Rigidity Preserving Image Transformations and Equivariance in Perspective

    Authors: Lucas Brynte, Georg Bökman, Axel Flinth, Fredrik Kahl

    Abstract: We characterize the class of image plane transformations which realize rigid camera motions and call these transformations 'rigidity preserving'. In particular, 2D translations of pinhole images are not rigidity preserving. Hence, when using CNNs for 3D inference tasks, it can be beneficial to modify the inductive bias from equivariance towards translations to equivariance towards rigidity preserv…

    Submitted 13 October, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: v2: Substantially revised version. Among other things, experiments with the PixLoc model added

  10. arXiv:2111.15341

    cs.CV cs.LG

    ZZ-Net: A Universal Rotation Equivariant Architecture for 2D Point Clouds

    Authors: Georg Bökman, Fredrik Kahl, Axel Flinth

    Abstract: In this paper, we are concerned with rotation equivariance on 2D point cloud data. We describe a particular set of functions able to approximate any continuous rotation equivariant and permutation invariant function. Based on this result, we propose a novel neural network architecture for processing 2D point clouds and we prove its universality for approximating functions exhibiting these symmetri…

    Submitted 28 March, 2022; v1 submitted 30 November, 2021; originally announced November 2021.

    Comments: CVPR 2022 camera ready