Showing 1–7 of 7 results for author: Suvorov, R

  1. arXiv:2109.07161  [pdf, other]

    cs.CV eess.IV

    Resolution-robust Large Mask Inpainting with Fourier Convolutions

    Authors: Roman Suvorov, Elizaveta Logacheva, Anton Mashikhin, Anastasia Remizova, Arsenii Ashukha, Aleksei Silvestrov, Nae** Kong, Harshith Goka, Kiwoong Park, Victor Lempitsky

    Abstract: Modern image inpainting systems, despite the significant progress, often struggle with large missing areas, complex geometric structures, and high-resolution images. We find that one of the main reasons for that is the lack of an effective receptive field in both the inpainting network and the loss function. To alleviate this issue, we propose a new method called large mask inpainting (LaMa). LaMa…

    Submitted 10 November, 2021; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: Winter Conference on Applications of Computer Vision (WACV 2022)

  2. arXiv:2105.01957  [pdf, other]

    cs.LG cs.CV

    Perceptual Gradient Networks

    Authors: Dmitry Nikulin, Roman Suvorov, Aleksei Ivakhnenko, Victor Lempitsky

    Abstract: Many applications of deep learning for image generation use perceptual losses for either training or fine-tuning of the generator networks. The use of perceptual loss however incurs repeated forward-backward passes in a large image classification network as well as a considerable memory overhead required to store the activations of this network. It is therefore desirable or sometimes even critical…

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: 28 pages, 15 figures, 8 tables

  3. arXiv:2008.09655  [pdf, other]

    cs.CV cs.GR cs.LG

    DeepLandscape: Adversarial Modeling of Landscape Video

    Authors: Elizaveta Logacheva, Roman Suvorov, Oleg Khomenko, Anton Mashikhin, Victor Lempitsky

    Abstract: We build a new model of landscape videos that can be trained on a mixture of static landscape images as well as landscape animations. Our architecture extends StyleGAN model by augmenting it with parts that allow to model dynamic changes in a scene. Once trained, our model can be used to generate realistic time-lapse landscape videos with moving objects and time-of-the-day changes. Furthermore, by…

    Submitted 21 August, 2020; originally announced August 2020.

    Comments: Accepted at ECCV 2020

  4. arXiv:1811.11067  [pdf, other]

    cs.LG cs.AI stat.ML

    Learning State Representations in Complex Systems with Multimodal Data

    Authors: Pavel Solovev, Vladimir Aliev, Pavel Ostyakov, Gleb Sterkin, Elizaveta Logacheva, Stepan Troeshestov, Roman Suvorov, Anton Mashikhin, Oleg Khomenko, Sergey I. Nikolenko

    Abstract: Representation learning becomes especially important for complex systems with multimodal data sources such as cameras or sensors. Recent advances in reinforcement learning and optimal control make it possible to design control algorithms on these latent representations, but the field still lacks a large-scale standard dataset for unified comparison. In this work, we present a large-scale dataset a…

    Submitted 15 January, 2019; v1 submitted 27 November, 2018; originally announced November 2018.

    Comments: Fixed references

  5. arXiv:1811.07630  [pdf, other]

    cs.CV cs.LG cs.NE

    SEIGAN: Towards Compositional Image Generation by Simultaneously Learning to Segment, Enhance, and Inpaint

    Authors: Pavel Ostyakov, Roman Suvorov, Elizaveta Logacheva, Oleg Khomenko, Sergey I. Nikolenko

    Abstract: We present a novel approach to image manipulation and understanding by simultaneously learning to segment object masks, paste objects to another background image, and remove them from original images. For this purpose, we develop a novel generative model for compositional image generation, SEIGAN (Segment-Enhance-Inpaint Generative Adversarial Network), which learns these three operations together…

    Submitted 15 January, 2019; v1 submitted 19 November, 2018; originally announced November 2018.

  6. arXiv:1809.04403  [pdf, other]

    cs.CV cs.LG

    Label Denoising with Large Ensembles of Heterogeneous Neural Networks

    Authors: Pavel Ostyakov, Elizaveta Logacheva, Roman Suvorov, Vladimir Aliev, Gleb Sterkin, Oleg Khomenko, Sergey I. Nikolenko

    Abstract: Despite recent advances in computer vision based on various convolutional architectures, video understanding remains an important challenge. In this work, we present and discuss a top solution for the large-scale video classification (labeling) problem introduced as a Kaggle competition based on the YouTube-8M dataset. We show and compare different approaches to preprocessing, data augmentation, m…

    Submitted 15 January, 2019; v1 submitted 12 September, 2018; originally announced September 2018.

  7. arXiv:1806.02253  [pdf, ps, other]

    cs.CL

    The Limitations of Cross-language Word Embeddings Evaluation

    Authors: Amir Bakarov, Roman Suvorov, Ilya Sochenkov

    Abstract: The aim of this work is to explore the possible limitations of existing methods of cross-language word embeddings evaluation, addressing the lack of correlation between intrinsic and extrinsic cross-language evaluation methods. To prove this hypothesis, we construct English-Russian datasets for extrinsic and intrinsic evaluation tasks and compare performances of 5 different cross-language models o…

    Submitted 6 June, 2018; originally announced June 2018.

    Comments: In Proceedings of the 7th Joint Conference on Lexical and Computational Semantics (*SEM 2018)