Skip to main content

Showing 1–23 of 23 results for author: Mishkin, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2307.15381  [pdf, other

    cs.CV

    AffineGlue: Joint Matching and Robust Estimation

    Authors: Daniel Barath, Dmytro Mishkin, Luca Cavalli, Paul-Edouard Sarlin, Petr Hruby, Marc Pollefeys

    Abstract: We propose AffineGlue, a method for joint two-view feature matching and robust estimation that reduces the combinatorial complexity of the problem by employing single-point minimal solvers. AffineGlue selects potential matches from one-to-many correspondences to estimate minimal models. Guided matching is then used to find matches consistent with the model, suffering less from the ambiguities of o… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

  2. arXiv:2302.09997  [pdf, other

    cs.CV

    A Large Scale Homography Benchmark

    Authors: Daniel Barath, Dmytro Mishkin, Michal Polic, Wolfgang Förstner, Jiri Matas

    Abstract: We present a large-scale dataset of Planes in 3D, Pi3D, of roughly 1000 planes observed in 10 000 images from the 1DSfM dataset, and HEB, a large-scale homography estimation benchmark leveraging Pi3D. The applications of the Pi3D dataset are diverse, e.g. training or evaluating monocular depth, surface normal estimation and image matching algorithms. The HEB dataset consists of 226 260 homographie… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

  3. arXiv:2207.14660  [pdf, other

    cs.CV

    Matching with AffNet based rectifications

    Authors: Václav Vávra, Dmytro Mishkin, Jiří Matas

    Abstract: We consider the problem of two-view matching under significant viewpoint changes with view synthesis. We propose two novel methods, minimizing the view synthesis overhead. The first one, named DenseAffNet, uses dense affine shapes estimates from AffNet, which allows it to partition the image, rectifying each partition with just a single affine map. The second one, named DepthAffNet, combines infor… ▽ More

    Submitted 29 July, 2022; originally announced July 2022.

    Comments: 13 pages, 9 figures

  4. arXiv:2204.08870  [pdf, other

    cs.CV

    OpenGlue: Open Source Graph Neural Net Based Pipeline for Image Matching

    Authors: Ostap Viniavskyi, Mariia Dobko, Dmytro Mishkin, Oles Dobosevych

    Abstract: We present OpenGlue: a free open-source framework for image matching, that uses a Graph Neural Network-based matcher inspired by SuperGlue \cite{sarlin20superglue}. We show that including additional geometrical information, such as local feature scale, orientation, and affine geometry, when available (e.g. for SIFT features), significantly improves the performance of the OpenGlue matcher. We study… ▽ More

    Submitted 19 April, 2022; originally announced April 2022.

  5. arXiv:2112.12027  [pdf, other

    cs.CV

    Learning and Crafting for the Wide Multiple Baseline Stereo

    Authors: Dmytro Mishkin

    Abstract: This thesis introduces the wide multiple baseline stereo (WxBS) problem. WxBS, a generalization of the standard wide baseline stereo problem, considers the matching of images that simultaneously differ in more than one image acquisition factor such as viewpoint, illumination, sensor type, or where object appearance changes significantly, e.g., over time. A new dataset with the ground truth, evalua… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.

    Comments: After-defence version with additional fixes based on reviewer commends. 144 pages

  6. HarrisZ$^+$: Harris Corner Selection for Next-Gen Image Matching Pipelines

    Authors: Fabio Bellavia, Dmytro Mishkin

    Abstract: Due to its role in many computer vision tasks, image matching has been subjected to an active investigation by researchers, which has lead to better and more discriminant feature descriptors and to more robust matching strategies, also thanks to the advent of the deep learning and the increased computational power of the modern hardware. Despite of these achievements, the keypoint extraction proce… ▽ More

    Submitted 1 May, 2022; v1 submitted 27 September, 2021; originally announced September 2021.

    Journal ref: Pattern Recognition Letters, 158 (2022) 141-147

  7. arXiv:2011.11986  [pdf, other

    cs.CV

    Efficient Initial Pose-graph Generation for Global SfM

    Authors: Daniel Barath, Dmytro Mishkin, Ivan Eichhardt, Ilia Shipachev, Jiri Matas

    Abstract: We propose ways to speed up the initial pose-graph generation for global Structure-from-Motion algorithms. To avoid forming tentative point correspondences by FLANN and geometric verification by RANSAC, which are the most time-consuming steps of the pose-graph creation, we propose two new methods - built on the fact that image pairs usually are matched consecutively. Thus, candidate relative poses… ▽ More

    Submitted 26 November, 2020; v1 submitted 24 November, 2020; originally announced November 2020.

    Comments: Added supplementary material

  8. arXiv:2011.09832  [pdf, other

    cs.CV

    Differentiable Data Augmentation with Kornia

    Authors: Jian Shi, Edgar Riba, Dmytro Mishkin, Francesc Moreno, Anguelos Nicolaou

    Abstract: In this paper we present a review of the Kornia differentiable data augmentation (DDA) module for both for spatial (2D) and volumetric (3D) tensors. This module leverages differentiable computer vision solutions from Kornia, with an aim of integrating data augmentation (DA) pipelines and strategies to existing PyTorch components (e.g. autograd for differentiability, optim for optimization). In add… ▽ More

    Submitted 19 November, 2020; originally announced November 2020.

  9. arXiv:2010.05365  [pdf, other

    cs.DL cs.LG

    ArXiving Before Submission Helps Everyone

    Authors: Dmytro Mishkin, Amy Tabb, Jiri Matas

    Abstract: We claim, and present evidence, that allowing ar**; it facilitates open research result distribution and reduces inequality… ▽ More

    Submitted 11 October, 2020; originally announced October 2020.

  10. arXiv:2009.10521  [pdf, other

    cs.CV

    A survey on Kornia: an Open Source Differentiable Computer Vision Library for PyTorch

    Authors: E. Riba, D. Mishkin, J. Shi, D. Ponsa, F. Moreno-Noguer, G. Bradski

    Abstract: This work presents Kornia, an open source computer vision library built upon a set of differentiable routines and modules that aims to solve generic computer vision problems. The package uses PyTorch as its main backend, not only for efficiency but also to take advantage of the reverse auto-differentiation engine to define and compute the gradient of complex functions. Inspired by OpenCV, Kornia i… ▽ More

    Submitted 21 September, 2020; originally announced September 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1910.02190

  11. Image Matching across Wide Baselines: From Paper to Practice

    Authors: Yuhe **, Dmytro Mishkin, Anastasiia Mishchuk, Jiri Matas, Pascal Fua, Kwang Moo Yi, Eduard Trulls

    Abstract: We introduce a comprehensive benchmark for local features and robust estimation algorithms, focusing on the downstream task -- the accuracy of the reconstructed camera pose -- as our primary metric. Our pipeline's modular structure allows easy integration, configuration, and combination of different methods and heuristics. This is demonstrated by embedding dozens of popular algorithms and evaluati… ▽ More

    Submitted 11 February, 2021; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: Added: KeyNet-SOSNet, AffNet-HardNet, TFeat, MKD from kornia

  12. arXiv:1910.02190  [pdf, other

    cs.CV

    Kornia: an Open Source Differentiable Computer Vision Library for PyTorch

    Authors: Edgar Riba, Dmytro Mishkin, Daniel Ponsa, Ethan Rublee, Gary Bradski

    Abstract: This work presents Kornia -- an open source computer vision library which consists of a set of differentiable routines and modules to solve generic computer vision problems. The package uses PyTorch as its main backend both for efficiency and to take advantage of the reverse-mode auto-differentiation to define and compute the gradient of complex functions. Inspired by OpenCV, Kornia is composed of… ▽ More

    Submitted 9 October, 2019; v1 submitted 4 October, 2019; originally announced October 2019.

    Comments: Updated adversarial attack example

  13. arXiv:1901.10915  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Benchmarking Classic and Learned Navigation in Complex 3D Environments

    Authors: Dmytro Mishkin, Alexey Dosovitskiy, Vladlen Koltun

    Abstract: Navigation research is attracting renewed interest with the advent of learning-based methods. However, this new line of work is largely disconnected from well-established classic navigation approaches. In this paper, we take a step towards coordinating these two directions of research. We set up classic and learning-based navigation systems in common simulated environments and thoroughly evaluate… ▽ More

    Submitted 28 March, 2019; v1 submitted 30 January, 2019; originally announced January 2019.

    Comments: Added CNN-Monodepth and OpenCV Stereo agents

  14. arXiv:1901.09780  [pdf, other

    cs.CV

    Leveraging Outdoor Webcams for Local Descriptor Learning

    Authors: Milan Pultar, Dmytro Mishkin, Jiří Matas

    Abstract: We present AMOS Patches, a large set of image cut-outs, intended primarily for the robustification of trainable local feature descriptors to illumination and appearance changes. Images contributing to AMOS Patches originate from the AMOS dataset of recordings from a large set of outdoor webcams. The semiautomatic method used to generate AMOS Patches is described. It includes camera selection, vi… ▽ More

    Submitted 1 January, 2021; v1 submitted 28 January, 2019; originally announced January 2019.

  15. arXiv:1711.07064  [pdf, other

    cs.CV

    DeblurGAN: Blind Motion Deblurring Using Conditional Adversarial Networks

    Authors: Orest Kupyn, Volodymyr Budzan, Mykola Mykhailych, Dmytro Mishkin, Jiri Matas

    Abstract: We present DeblurGAN, an end-to-end learned method for motion deblurring. The learning is based on a conditional GAN and the content loss . DeblurGAN achieves state-of-the art performance both in the structural similarity measure and visual appearance. The quality of the deblurring model is also evaluated in a novel way on a real-world problem -- object detection on (de-)blurred images. The method… ▽ More

    Submitted 3 April, 2018; v1 submitted 19 November, 2017; originally announced November 2017.

    Comments: CVPR 2018 camera-ready

  16. arXiv:1711.06704  [pdf, other

    cs.CV cs.NE

    Repeatability Is Not Enough: Learning Affine Regions via Discriminability

    Authors: Dmytro Mishkin, Filip Radenovic, Jiri Matas

    Abstract: A method for learning local affine-covariant regions is presented. We show that maximizing geometric repeatability does not lead to local regions, a.k.a features,that are reliably matched and this necessitates descriptor-based learning. We explore factors that influence such learning and registration: the loss function, descriptor type, geometric parametrization and the trade-off between matchabil… ▽ More

    Submitted 28 August, 2018; v1 submitted 17 November, 2017; originally announced November 2017.

    Comments: ECCV 2018 camera ready

  17. arXiv:1705.10872  [pdf, other

    cs.CV

    Working hard to know your neighbor's margins: Local descriptor learning loss

    Authors: Anastasiya Mishchuk, Dmytro Mishkin, Filip Radenovic, Jiri Matas

    Abstract: We introduce a novel loss for learning local feature descriptors which is inspired by the Lowe's matching criterion for SIFT. We show that the proposed loss that maximizes the distance between the closest positive and closest negative patch in the batch is better than complex regularization methods; it works well for both shallow and deep convolution network architectures. Applying the novel loss… ▽ More

    Submitted 12 January, 2018; v1 submitted 30 May, 2017; originally announced May 2017.

    Comments: Post-NIPS-2017 update. Better hyperparameters and better results on HPatches + Brown dataset, + couple of references

  18. arXiv:1608.06800  [pdf, other

    cs.CV

    In the Saddle: Chasing Fast and Repeatable Features

    Authors: Javier Aldana-Iuit, Dmytro Mishkin, Ondrej Chum, Jiri Matas

    Abstract: A novel similarity-covariant feature detector that extracts points whose neighbourhoods, when treated as a 3D intensity surface, have a saddle-like intensity profile. The saddle condition is verified efficiently by intensity comparisons on two concentric rings that must have exactly two dark-to-bright and two bright-to-dark transitions satisfying certain geometric constraints. Experiments show tha… ▽ More

    Submitted 24 August, 2016; originally announced August 2016.

  19. Systematic evaluation of CNN advances on the ImageNet

    Authors: Dmytro Mishkin, Nikolay Sergievskiy, Jiri Matas

    Abstract: The paper systematically studies the impact of a range of recent advances in CNN architectures and learning methods on the object categorization (ILSVRC) problem. The evalution tests the influence of the following choices of the architecture: non-linearity (ReLU, ELU, maxout, compatibility with batch normalization), pooling variants (stochastic, max, average, mixed), network width, classifier desi… ▽ More

    Submitted 13 June, 2016; v1 submitted 7 June, 2016; originally announced June 2016.

    Comments: Submitted to CVIU Special Issue on Deep Learning. Updated dataset quality experiment

  20. arXiv:1511.06422  [pdf, other

    cs.LG

    All you need is a good init

    Authors: Dmytro Mishkin, Jiri Matas

    Abstract: Layer-sequential unit-variance (LSUV) initialization - a simple method for weight initialization for deep net learning - is proposed. The method consists of the two steps. First, pre-initialize weights of each convolution or inner-product layer with orthonormal matrices. Second, proceed from the first to the final layer, normalizing the variance of the output of each layer to be equal to one. Ex… ▽ More

    Submitted 19 February, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: Published as a conference paper at ICLR 2016

  21. arXiv:1504.06603  [pdf, other

    cs.CV

    WxBS: Wide Baseline Stereo Generalizations

    Authors: Dmytro Mishkin, Jiri Matas, Michal Perdoch, Karel Lenc

    Abstract: We have presented a new problem -- the wide multiple baseline stereo (WxBS) -- which considers matching of images that simultaneously differ in more than one image acquisition factor such as viewpoint, illumination, sensor type or where object appearance changes significantly, e.g. over time. A new dataset with the ground truth for evaluation of matching algorithms has been introduced and will be… ▽ More

    Submitted 12 May, 2015; v1 submitted 24 April, 2015; originally announced April 2015.

    Comments: Descriptor and detector evaluation expanded

  22. MODS: Fast and Robust Method for Two-View Matching

    Authors: Dmytro Mishkin, Jiri Matas, Michal Perdoch

    Abstract: A novel algorithm for wide-baseline matching called MODS - Matching On Demand with view Synthesis - is presented. The MODS algorithm is experimentally shown to solve a broader range of wide-baseline problems than the state of the art while being nearly as fast as standard matchers on simple problems. The apparent robustness vs. speed trade-off is finessed by the use of progressively more time-cons… ▽ More

    Submitted 1 May, 2016; v1 submitted 9 March, 2015; originally announced March 2015.

    Comments: Version accepted to CVIU. arXiv admin note: text overlap with arXiv:1306.3855

  23. arXiv:1306.3855  [pdf, other

    cs.CV

    Two-View Matching with View Synthesis Revisited

    Authors: Dmytro Mishkin, Michal Perdoch, Jiri Matas

    Abstract: Wide-baseline matching focussing on problems with extreme viewpoint change is considered. We introduce the use of view synthesis with affine-covariant detectors to solve such problems and show that matching with the Hessian-Affine or MSER detectors outperforms the state-of-the-art ASIFT. To minimise the loss of speed caused by view synthesis, we propose the Matching On Demand with view Synthesis… ▽ More

    Submitted 11 November, 2013; v1 submitted 17 June, 2013; originally announced June 2013.

    Comments: 25 pages, 14 figures

    Report number: CTU--CMP--2013--15