Skip to main content

Showing 1–6 of 6 results for author: Logacheva, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2109.07161  [pdf, other

    cs.CV eess.IV

    Resolution-robust Large Mask Inpainting with Fourier Convolutions

    Authors: Roman Suvorov, Elizaveta Logacheva, Anton Mashikhin, Anastasia Remizova, Arsenii Ashukha, Aleksei Silvestrov, Nae** Kong, Harshith Goka, Kiwoong Park, Victor Lempitsky

    Abstract: Modern image inpainting systems, despite the significant progress, often struggle with large missing areas, complex geometric structures, and high-resolution images. We find that one of the main reasons for that is the lack of an effective receptive field in both the inpainting network and the loss function. To alleviate this issue, we propose a new method called large mask inpainting (LaMa). LaMa… ▽ More

    Submitted 10 November, 2021; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: Winter Conference on Applications of Computer Vision (WACV 2022)

  2. arXiv:2008.09655  [pdf, other

    cs.CV cs.GR cs.LG

    DeepLandscape: Adversarial Modeling of Landscape Video

    Authors: Elizaveta Logacheva, Roman Suvorov, Oleg Khomenko, Anton Mashikhin, Victor Lempitsky

    Abstract: We build a new model of landscape videos that can be trained on a mixture of static landscape images as well as landscape animations. Our architecture extends StyleGAN model by augmenting it with parts that allow to model dynamic changes in a scene. Once trained, our model can be used to generate realistic time-lapse landscape videos with moving objects and time-of-the-day changes. Furthermore, by… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

    Comments: Accepted at ECCV 2020

  3. AI2D-RST: A multimodal corpus of 1000 primary school science diagrams

    Authors: Tuomo Hiippala, Malihe Alikhani, Jonas Haverinen, Timo Kalliokoski, Evanfiya Logacheva, Serafina Orekhova, Aino Tuomainen, Matthew Stone, John A. Bateman

    Abstract: This article introduces AI2D-RST, a multimodal corpus of 1000 English-language diagrams that represent topics in primary school natural sciences, such as food webs, life cycles, moon phases and human physiology. The corpus is based on the Allen Institute for Artificial Intelligence Diagrams (AI2D) dataset, a collection of diagrams with crowd-sourced descriptions, which was originally developed to… ▽ More

    Submitted 20 March, 2020; v1 submitted 9 December, 2019; originally announced December 2019.

    Comments: 24 pages; revised version submitted to Language Resources & Evaluation

    Journal ref: Language Resources and Evaluation 55(3), 2021, pp. 661-688

  4. arXiv:1811.11067  [pdf, other

    cs.LG cs.AI stat.ML

    Learning State Representations in Complex Systems with Multimodal Data

    Authors: Pavel Solovev, Vladimir Aliev, Pavel Ostyakov, Gleb Sterkin, Elizaveta Logacheva, Stepan Troeshestov, Roman Suvorov, Anton Mashikhin, Oleg Khomenko, Sergey I. Nikolenko

    Abstract: Representation learning becomes especially important for complex systems with multimodal data sources such as cameras or sensors. Recent advances in reinforcement learning and optimal control make it possible to design control algorithms on these latent representations, but the field still lacks a large-scale standard dataset for unified comparison. In this work, we present a large-scale dataset a… ▽ More

    Submitted 15 January, 2019; v1 submitted 27 November, 2018; originally announced November 2018.

    Comments: Fixed references

  5. arXiv:1811.07630  [pdf, other

    cs.CV cs.LG cs.NE

    SEIGAN: Towards Compositional Image Generation by Simultaneously Learning to Segment, Enhance, and Inpaint

    Authors: Pavel Ostyakov, Roman Suvorov, Elizaveta Logacheva, Oleg Khomenko, Sergey I. Nikolenko

    Abstract: We present a novel approach to image manipulation and understanding by simultaneously learning to segment object masks, paste objects to another background image, and remove them from original images. For this purpose, we develop a novel generative model for compositional image generation, SEIGAN (Segment-Enhance-Inpaint Generative Adversarial Network), which learns these three operations together… ▽ More

    Submitted 15 January, 2019; v1 submitted 19 November, 2018; originally announced November 2018.

  6. arXiv:1809.04403  [pdf, other

    cs.CV cs.LG

    Label Denoising with Large Ensembles of Heterogeneous Neural Networks

    Authors: Pavel Ostyakov, Elizaveta Logacheva, Roman Suvorov, Vladimir Aliev, Gleb Sterkin, Oleg Khomenko, Sergey I. Nikolenko

    Abstract: Despite recent advances in computer vision based on various convolutional architectures, video understanding remains an important challenge. In this work, we present and discuss a top solution for the large-scale video classification (labeling) problem introduced as a Kaggle competition based on the YouTube-8M dataset. We show and compare different approaches to preprocessing, data augmentation, m… ▽ More

    Submitted 15 January, 2019; v1 submitted 12 September, 2018; originally announced September 2018.