Showing 1–8 of 8 results for author: Zhukov, D

  1. arXiv:2309.14662  [pdf, other]

    cs.LG cs.CY cs.IR

    Transformer-based classification of user queries for medical consultancy with respect to expert specialization

    Authors: Dmitry Lyutkin, Andrey Soloviev, Dmitry Zhukov, Denis Pozdnyakov, Muhammad Shahid Iqbal Malik, Dmitry I. Ignatov

    Abstract: The need for skilled medical support is growing in the era of digital healthcare. This research presents an innovative strategy, utilizing the RuBERT model, for categorizing user inquiries in the field of medical consultation with a focus on expert specialization. By harnessing the capabilities of transformers, we fine-tuned the pre-trained RuBERT model on a varied dataset, which facilitates preci…

    Submitted 2 October, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: 16 pages, 5 figures

  2. arXiv:2109.04409  [pdf, other]

    cs.CV

    Reconstructing and grounding narrated instructional videos in 3D

    Authors: Dimitri Zhukov, Ignacio Rocco, Ivan Laptev, Josef Sivic, Johannes L. Schönberger, Bugra Tekin, Marc Pollefeys

    Abstract: Narrated instructional videos often show and describe manipulations of similar objects, e.g., repairing a particular model of a car or laptop. In this work we aim to reconstruct such objects and to localize associated narrations in 3D. Contrary to the standard scenario of instance-level 3D reconstruction, where identical objects or scenes are present in all views, objects in different instructiona…

    Submitted 10 September, 2021; v1 submitted 9 September, 2021; originally announced September 2021.

  3. arXiv:1912.05405  [pdf, other]

    cs.CV

    Training Deep SLAM on Single Frames

    Authors: Igor Slinko, Anna Vorontsova, Dmitry Zhukov, Olga Barinova, Anton Konushin

    Abstract: Learning-based visual odometry and SLAM methods demonstrate a steady improvement over past years. However, collecting ground truth poses to train these methods is difficult and expensive. This could be resolved by training in an unsupervised mode, but there is still a large gap between performance of unsupervised and supervised methods. In this work, we focus on generating synthetic data for deep…

    Submitted 11 December, 2019; originally announced December 2019.

  4. arXiv:1910.04755  [pdf, other]

    cs.CV cs.RO

    Measuring robustness of Visual SLAM

    Authors: David Prokhorov, Dmitry Zhukov, Olga Barinova, Anna Vorontsova, Anton Konushin

    Abstract: Simultaneous localization and mapping (SLAM) is an essential component of robotic systems. In this work we perform a feasibility study of RGB-D SLAM for the task of indoor robot navigation. Recent visual SLAM methods, e.g. ORBSLAM2 \cite{mur2017orb}, demonstrate really impressive accuracy, but the experiments in the papers are usually conducted on just a few sequences, that makes it difficult to r…

    Submitted 10 October, 2019; originally announced October 2019.

  5. arXiv:1909.12146  [pdf, other]

    cs.CV

    DISCOMAN: Dataset of Indoor SCenes for Odometry, Mapping And Navigation

    Authors: Pavel Kirsanov, Airat Gaskarov, Filipp Konokhov, Konstantin Sofiiuk, Anna Vorontsova, Igor Slinko, Dmitry Zhukov, Sergey Bykov, Olga Barinova, Anton Konushin

    Abstract: We present a novel dataset for training and benchmarking semantic SLAM methods. The dataset consists of 200 long sequences, each one containing 3000-5000 data frames. We generate the sequences using realistic home layouts. For that we sample trajectories that simulate motions of a simple home robot, and then render the frames along the trajectories. Each data frame contains a) RGB images generated…

    Submitted 26 September, 2019; originally announced September 2019.

    Comments: 8 pages, 7 figures

  6. arXiv:1906.03327  [pdf, other]

    cs.CV

    HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips

    Authors: Antoine Miech, Dimitri Zhukov, Jean-Baptiste Alayrac, Makarand Tapaswi, Ivan Laptev, Josef Sivic

    Abstract: Learning text-video embeddings usually requires a dataset of video clips with manually provided captions. However, such datasets are expensive and time consuming to create and therefore difficult to obtain on a large scale. In this work, we propose instead to learn such embeddings from video data with readily available natural language annotations in the form of automatically transcribed narration…

    Submitted 31 July, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

    Comments: Accepted at ICCV 2019

  7. arXiv:1903.08225  [pdf, other]

    cs.CV

    Cross-task weakly supervised learning from instructional videos

    Authors: Dimitri Zhukov, Jean-Baptiste Alayrac, Ramazan Gokberk Cinbis, David Fouhey, Ivan Laptev, Josef Sivic

    Abstract: In this paper we investigate learning visual models for the steps of ordinary tasks using weak supervision via instructional narrations and an ordered list of steps instead of strong supervision via temporal annotations. At the heart of our approach is the observation that weakly supervised learning may be easier if a model shares components while learning different steps: `pour egg' should be tra…

    Submitted 29 April, 2019; v1 submitted 19 March, 2019; originally announced March 2019.

    Comments: 18 pages, 17 figures, to be published in proceedings of the CVPR, 2019

  8. arXiv:1901.07281  [pdf, other]

    astro-ph.SR astro-ph.IM

    Full orbital solution for the binary system in the northern Galactic disc microlensing event Gaia16aye

    Authors: Łukasz Wyrzykowski, P. Mróz, K. A. Rybicki, M. Gromadzki, Z. Kołaczkowski, M. Zieliński, P. Zieliński, N. Britavskiy, A. Gomboc, K. Sokolovsky, S. T. Hodgkin, L. Abe, G. F. Aldi, A. AlMannaei, G. Altavilla, A. Al Qasim, G. C. Anupama, S. Awiphan, E. Bachelet, V. Bakıs, S. Baker, S. Bartlett, P. Bendjoya, K. Benson, I. F. Bikmaev, et al. (160 additional authors not shown)

    Abstract: Gaia16aye was a binary microlensing event discovered in the direction towards the northern Galactic disc and was one of the first microlensing events detected and alerted to by the Gaia space mission. Its light curve exhibited five distinct brightening episodes, reaching up to I=12 mag, and it was covered in great detail with almost 25,000 data points gathered by a network of telescopes. We presen…

    Submitted 28 October, 2019; v1 submitted 22 January, 2019; originally announced January 2019.

    Comments: accepted for publication in A&A, 24 pages, 10 figures, tables with the data will be available electronically

    Journal ref: A&A 633, A98 (2020)