Showing 1–14 of 14 results for author: Turmukhambetov, D

  1. arXiv:2404.14351  [pdf, other]

    cs.CV

    Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer

    Authors: Eric Brachmann, Jamie Wynn, Shuai Chen, Tommaso Cavallari, Áron Monszpart, Daniyar Turmukhambetov, Victor Adrian Prisacariu

    Abstract: We address the task of estimating camera parameters from a set of images depicting a scene. Popular feature-based structure-from-motion (SfM) tools solve this task by incremental reconstruction: they repeat triangulation of sparse 3D points and registration of more camera views to the sparse point cloud. We re-interpret incremental structure-from-motion as an iterated application and refinement of…

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: Project page: https://nianticlabs.github.io/acezero/

  2. arXiv:2306.01596  [pdf, other]

    cs.CV

    Two-View Geometry Scoring Without Correspondences

    Authors: Axel Barroso-Laguna, Eric Brachmann, Victor Adrian Prisacariu, Gabriel J. Brostow, Daniyar Turmukhambetov

    Abstract: Camera pose estimation for two-view geometry traditionally relies on RANSAC. Normally, a multitude of image correspondences leads to a pool of proposed hypotheses, which are then scored to find a winning model. The inlier count is generally regarded as a reliable indicator of "consensus". We examine this scoring heuristic, and find that it favors disappointing models under certain circumstances. A…

    Submitted 2 June, 2023; originally announced June 2023.

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023

  3. arXiv:2302.12231  [pdf, other]

    cs.CV

    DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models

    Authors: Jamie Wynn, Daniyar Turmukhambetov

    Abstract: Under good conditions, Neural Radiance Fields (NeRFs) have shown impressive results on novel view synthesis tasks. NeRFs learn a scene's color and density fields by minimizing the photometric discrepancy between training views and differentiable renderings of the scene. Once trained from a sufficient set of views, NeRFs can generate novel views from arbitrary camera positions. However, the scene g…

    Submitted 8 November, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: CVPR 2023. Updated LPIPS scores in Table 1

  4. arXiv:2210.05494  [pdf, other]

    cs.CV

    Map-free Visual Relocalization: Metric Pose Relative to a Single Image

    Authors: Eduardo Arnold, Jamie Wynn, Sara Vicente, Guillermo Garcia-Hernando, Áron Monszpart, Victor Adrian Prisacariu, Daniyar Turmukhambetov, Eric Brachmann

    Abstract: Can we relocalize in a scene represented by a single reference image? Standard visual relocalization requires hundreds of images and scale calibration to build a scene-specific 3D map. In contrast, we propose Map-free Relocalization, i.e., using only one photo of a scene to enable instant, metric scaled relocalization. Existing datasets are not suitable to benchmark map-free relocalization, due to…

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: ECCV 2022 camera-ready. 14 pages + 4 reference pages

  5. arXiv:2106.02022  [pdf, other]

    cs.CV

    Single Image Depth Prediction with Wavelet Decomposition

    Authors: Michaël Ramamonjisoa, Michael Firman, Jamie Watson, Vincent Lepetit, Daniyar Turmukhambetov

    Abstract: We present a novel method for predicting accurate depths from monocular images with high efficiency. This optimal efficiency is achieved by exploiting wavelet decomposition, which is integrated in a fully differentiable encoder-decoder architecture. We demonstrate that we can reconstruct high-fidelity depth maps by predicting sparse wavelet coefficients. In contrast with previous works, we show th…

    Submitted 16 August, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: CVPR 2021

  6. arXiv:2105.03578  [pdf, other]

    cs.CV cs.RO

    Learning to Predict Repeatability of Interest Points

    Authors: Anh-Dzung Doan, Daniyar Turmukhambetov, Yasir Latif, Tat-Jun Chin, Soohyun Bae

    Abstract: Many robotics applications require interest points that are highly repeatable under varying viewpoints and lighting conditions. However, this requirement is very challenging as the environment changes continuously and indefinitely, leading to appearance changes of interest points with respect to time. This paper proposes to predict the repeatability of an interest point as a function of time, whic…

    Submitted 7 May, 2021; originally announced May 2021.

    Comments: Accepted at IEEE International Conference on Robotics and Automation (ICRA) 2021

  7. arXiv:2008.09497  [pdf, other]

    cs.CV

    Single-Image Depth Prediction Makes Feature Matching Easier

    Authors: Carl Toft, Daniyar Turmukhambetov, Torsten Sattler, Fredrik Kahl, Gabriel Brostow

    Abstract: Good local features improve the robustness of many 3D re-localization and multi-view reconstruction pipelines. The problem is that viewing angle and distance severely impact the recognizability of a local feature. Attempts to improve appearance invariance by choosing better local feature points or by leveraging outside information, have come with pre-requisites that made some of them impractical.…

    Submitted 21 August, 2020; originally announced August 2020.

    Comments: 14 pages, 7 figures, accepted for publication at the European Conference on Computer Vision (ECCV) 2020

    ACM Class: I.4

  8. arXiv:2008.06959  [pdf, other]

    cs.CV

    Image Stylization for Robust Features

    Authors: Iaroslav Melekhov, Gabriel J. Brostow, Juho Kannala, Daniyar Turmukhambetov

    Abstract: Local features that are robust to both viewpoint and appearance changes are crucial for many computer vision tasks. In this work we investigate if photorealistic image stylization improves robustness of local features to not only day-night, but also weather and season variations. We show that image stylization in addition to color augmentation is a powerful method of learning robust features. We e…

    Submitted 16 August, 2020; originally announced August 2020.

    Comments: v1.1

  9. arXiv:2008.05785  [pdf, other]

    cs.CV cs.LG

    Predicting Visual Overlap of Images Through Interpretable Non-Metric Box Embeddings

    Authors: Anita Rau, Guillermo Garcia-Hernando, Danail Stoyanov, Gabriel J. Brostow, Daniyar Turmukhambetov

    Abstract: To what extent are two images picturing the same 3D surfaces? Even when this is a known scene, the answer typically requires an expensive search across scale space, with matching and geometric verification of large sets of local features. This expense is further multiplied when a query image is evaluated against a gallery, e.g. in visual relocalization. While we don't obviate the need for geometri…

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: ECCV 2020

  10. arXiv:2008.01484  [pdf, other]

    cs.CV

    Learning Stereo from Single Images

    Authors: Jamie Watson, Oisin Mac Aodha, Daniyar Turmukhambetov, Gabriel J. Brostow, Michael Firman

    Abstract: Supervised deep networks are among the best methods for finding correspondences in stereo image pairs. Like all supervised approaches, these networks require ground truth data during training. However, collecting large quantities of accurate dense correspondence data is very challenging. We propose that it is unnecessary to have such a high reliance on ground truth depths or even corresponding ste…

    Submitted 20 August, 2020; v1 submitted 4 August, 2020; originally announced August 2020.

    Comments: Accepted as an oral presentation at ECCV 2020

  11. arXiv:1909.09051  [pdf, other]

    cs.CV

    Self-Supervised Monocular Depth Hints

    Authors: Jamie Watson, Michael Firman, Gabriel J. Brostow, Daniyar Turmukhambetov

    Abstract: Monocular depth estimators can be trained with various forms of self-supervision from binocular-stereo data to circumvent the need for high-quality laser scans or other ground-truth data. The disadvantage, however, is that the photometric reprojection losses used with self-supervised learning typically have multiple local minima. These plausible-looking alternatives to ground truth can restrict wh…

    Submitted 19 September, 2019; originally announced September 2019.

    Comments: Accepted to ICCV 2019

  12. arXiv:1710.07307  [pdf, other]

    cs.CV

    Interpretable Transformations with Encoder-Decoder Networks

    Authors: Daniel E. Worrall, Stephan J. Garbin, Daniyar Turmukhambetov, Gabriel J. Brostow

    Abstract: Deep feature spaces have the capacity to encode complex transformations of their input data. However, understanding the relative feature-space relationship between two transformed encoded images is difficult. For instance, what is the relative feature space relationship between two rotated images? What is decoded when we interpolate in feature space? Ideally, we want to disentangle confounding fac…

    Submitted 19 October, 2017; originally announced October 2017.

    Comments: Accepted at ICCV 2017

  13. arXiv:1612.04642  [pdf, other]

    cs.CV cs.LG stat.ML

    Harmonic Networks: Deep Translation and Rotation Equivariance

    Authors: Daniel E. Worrall, Stephan J. Garbin, Daniyar Turmukhambetov, Gabriel J. Brostow

    Abstract: Translating or rotating an input image should not affect the results of many computer vision tasks. Convolutional neural networks (CNNs) are already translation equivariant: input image translations produce proportionate feature map translations. This is not the case for rotations. Global rotation equivariance is typically sought through data augmentation, but patch-wise equivariance is more diffi…

    Submitted 11 April, 2017; v1 submitted 14 December, 2016; originally announced December 2016.

    Comments: Submitted to CVPR 2017

  14. arXiv:1611.03906  [pdf, other]

    cs.HC

    Help, It Looks Confusing: GUI Task Automation Through Demonstration and Follow-up Questions

    Authors: Thanapong Intharah, Daniyar Turmukhambetov, Gabriel J. Brostow

    Abstract: Non-programming users should be able to create their own customized scripts to perform computer-based tasks for them, just by demonstrating to the machine how it's done. To that end, we develop a system prototype which learns-by-demonstration called HILC (Help, It Looks Confusing). Users train HILC to synthesize a task script by demonstrating the task, which produces the needed screenshots and the…

    Submitted 13 January, 2017; v1 submitted 11 November, 2016; originally announced November 2016.

    Comments: Camera-ready version. Accepted for presentation at ACM IUI 2017