Skip to main content

Showing 1–9 of 9 results for author: Rukhovich, D

.
  1. arXiv:2311.14405  [pdf, other

    cs.CV

    OneFormer3D: One Transformer for Unified Point Cloud Segmentation

    Authors: Maxim Kolodiazhnyi, Anna Vorontsova, Anton Konushin, Danila Rukhovich

    Abstract: Semantic, instance, and panoptic segmentation of 3D point clouds have been addressed using task-specific models of distinct design. Thereby, the similarity of all segmentation tasks and the implicit relationship between them have not been utilized effectively. This paper presents a unified, simple, and effective model addressing all these tasks jointly. The model, named OneFormer3D, performs insta… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

  2. arXiv:2302.02871  [pdf, other

    cs.CV

    Top-Down Beats Bottom-Up in 3D Instance Segmentation

    Authors: Maksim Kolodiazhnyi, Anna Vorontsova, Anton Konushin, Danila Rukhovich

    Abstract: Most 3D instance segmentation methods exploit a bottom-up strategy, typically including resource-exhaustive post-processing. For point grou**, bottom-up methods rely on prior assumptions about the objects in the form of hyperparameters, which are domain-specific and need to be carefully tuned. On the contrary, we address 3D instance segmentation with a TD3D: the pioneering cluster-free, fully-co… ▽ More

    Submitted 11 September, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  3. arXiv:2302.02858  [pdf, other

    cs.CV

    TR3D: Towards Real-Time Indoor 3D Object Detection

    Authors: Danila Rukhovich, Anna Vorontsova, Anton Konushin

    Abstract: Recently, sparse 3D convolutions have changed 3D object detection. Performing on par with the voting-based approaches, 3D CNNs are memory-efficient and scale to large scenes better. However, there is still room for improvement. With a conscious, practice-oriented approach to problem-solving, we analyze the performance of such methods and localize the weaknesses. Applying modifications that resolve… ▽ More

    Submitted 5 December, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  4. arXiv:2112.00322  [pdf, other

    cs.CV

    FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection

    Authors: Danila Rukhovich, Anna Vorontsova, Anton Konushin

    Abstract: Recently, promising applications in robotics and augmented reality have attracted considerable attention to 3D object detection from point clouds. In this paper, we present FCAF3D - a first-in-class fully convolutional anchor-free indoor 3D object detection method. It is a simple yet effective method that uses a voxel representation of a point cloud and processes voxels with sparse convolutions. F… ▽ More

    Submitted 24 March, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

  5. arXiv:2106.01178  [pdf, other

    cs.CV

    ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection

    Authors: Danila Rukhovich, Anna Vorontsova, Anton Konushin

    Abstract: In this paper, we introduce the task of multi-view RGB-based 3D object detection as an end-to-end optimization problem. To address this problem, we propose ImVoxelNet, a novel fully convolutional method of 3D object detection based on monocular or multi-view RGB images. The number of monocular images in each multi-view input can variate during training and inference; actually, this number might be… ▽ More

    Submitted 15 October, 2021; v1 submitted 2 June, 2021; originally announced June 2021.

  6. arXiv:2006.10451  [pdf, other

    cs.CV

    Learning High-Resolution Domain-Specific Representations with a GAN Generator

    Authors: Danil Galeev, Konstantin Sofiiuk, Danila Rukhovich, Mikhail Romanov, Olga Barinova, Anton Konushin

    Abstract: In recent years generative models of visual data have made a great progress, and now they are able to produce images of high quality and diversity. In this work we study representations learnt by a GAN generator. First, we show that these representations can be easily projected onto semantic segmentation map using a lightweight decoder. We find that such semantic projection can be learnt from just… ▽ More

    Submitted 18 June, 2020; originally announced June 2020.

  7. arXiv:2005.05708  [pdf, other

    cs.CV

    IterDet: Iterative Scheme for Object Detection in Crowded Environments

    Authors: Danila Rukhovich, Konstantin Sofiiuk, Danil Galeev, Olga Barinova, Anton Konushin

    Abstract: Deep learning-based detectors usually produce a redundant set of object bounding boxes including many duplicate detections of the same object. These boxes are then filtered using non-maximum suppression (NMS) in order to select exactly one bounding box per object of interest. This greedy scheme is simple and provides sufficient accuracy for isolated objects but often fails in crowded environments,… ▽ More

    Submitted 29 January, 2021; v1 submitted 12 May, 2020; originally announced May 2020.

  8. arXiv:1910.03903  [pdf, other

    cs.CV

    MixMatch Domain Adaptaion: Prize-winning solution for both tracks of VisDA 2019 challenge

    Authors: Danila Rukhovich, Danil Galeev

    Abstract: We present a domain adaptation (DA) system that can be used in multi-source and semi-supervised settings. Using the proposed method we achieved 2nd place on multi-source track and 3rd place on semi-supervised track of the VisDA 2019 challenge (http://ai.bu.edu/visda-2019/). The source code of the method is available at https://github.com/filaPro/visda2019.

    Submitted 9 October, 2019; originally announced October 2019.

    Comments: accepted at TASK-CV 2019 at ICCV

  9. arXiv:1909.00713  [pdf, other

    cs.CV cs.RO

    Estimation of Absolute Scale in Monocular SLAM Using Synthetic Data

    Authors: Danila Rukhovich, Daniel Mouritzen, Ralf Kaestner, Martin Rufli, Alexander Velizhev

    Abstract: This paper addresses the problem of scale estimation in monocular SLAM by estimating absolute distances between camera centers of consecutive image frames. These estimates would improve the overall performance of classical (not deep) SLAM systems and allow metric feature locations to be recovered from a single monocular camera. We propose several network architectures that lead to an improvement o… ▽ More

    Submitted 2 September, 2019; originally announced September 2019.