Skip to main content

Showing 1–5 of 5 results for author: Sterkin, G

.
  1. arXiv:2201.05023  [pdf, other

    cs.CV cs.GR

    Stereo Magnification with Multi-Layer Images

    Authors: Taras Khakhulin, Denis Korzhenkov, Pavel Solovev, Gleb Sterkin, Timotei Ardelean, Victor Lempitsky

    Abstract: Representing scenes with multiple semi-transparent colored layers has been a popular and successful choice for real-time novel view synthesis. Existing approaches infer colors and transparency values over regularly-spaced layers of planar or spherical shape. In this work, we introduce a new view synthesis approach based on multiple semi-transparent layers with scene-adapted geometry. Our approach… ▽ More

    Submitted 29 March, 2022; v1 submitted 13 January, 2022; originally announced January 2022.

    Comments: CVPR 2022

  2. arXiv:2011.13775  [pdf, other

    cs.CV cs.AI cs.LG

    Image Generators with Conditionally-Independent Pixel Synthesis

    Authors: Ivan Anokhin, Kirill Demochkin, Taras Khakhulin, Gleb Sterkin, Victor Lempitsky, Denis Korzhenkov

    Abstract: Existing image generator networks rely heavily on spatial convolutions and, optionally, self-attention blocks in order to gradually synthesize images in a coarse-to-fine manner. Here, we present a new architecture for image generators, where the color value at each pixel is computed independently given the value of a random latent vector and the coordinate of that pixel. No spatial convolutions or… ▽ More

    Submitted 27 November, 2020; originally announced November 2020.

  3. arXiv:2003.08791  [pdf, other

    cs.CV cs.LG eess.IV

    High-Resolution Daytime Translation Without Domain Labels

    Authors: Ivan Anokhin, Pavel Solovev, Denis Korzhenkov, Alexey Kharlamov, Taras Khakhulin, Alexey Silvestrov, Sergey Nikolenko, Victor Lempitsky, Gleb Sterkin

    Abstract: Modeling daytime changes in high resolution photographs, e.g., re-rendering the same scene under different illuminations typical for day, night, or dawn, is a challenging image manipulation task. We present the high-resolution daytime translation (HiDT) model for this task. HiDT combines a generative image-to-image model and a new upsampling scheme that allows to apply image translation at high re… ▽ More

    Submitted 23 March, 2020; v1 submitted 19 March, 2020; originally announced March 2020.

    Comments: accepted to CVPR 2020

  4. arXiv:1811.11067  [pdf, other

    cs.LG cs.AI stat.ML

    Learning State Representations in Complex Systems with Multimodal Data

    Authors: Pavel Solovev, Vladimir Aliev, Pavel Ostyakov, Gleb Sterkin, Elizaveta Logacheva, Stepan Troeshestov, Roman Suvorov, Anton Mashikhin, Oleg Khomenko, Sergey I. Nikolenko

    Abstract: Representation learning becomes especially important for complex systems with multimodal data sources such as cameras or sensors. Recent advances in reinforcement learning and optimal control make it possible to design control algorithms on these latent representations, but the field still lacks a large-scale standard dataset for unified comparison. In this work, we present a large-scale dataset a… ▽ More

    Submitted 15 January, 2019; v1 submitted 27 November, 2018; originally announced November 2018.

    Comments: Fixed references

  5. arXiv:1809.04403  [pdf, other

    cs.CV cs.LG

    Label Denoising with Large Ensembles of Heterogeneous Neural Networks

    Authors: Pavel Ostyakov, Elizaveta Logacheva, Roman Suvorov, Vladimir Aliev, Gleb Sterkin, Oleg Khomenko, Sergey I. Nikolenko

    Abstract: Despite recent advances in computer vision based on various convolutional architectures, video understanding remains an important challenge. In this work, we present and discuss a top solution for the large-scale video classification (labeling) problem introduced as a Kaggle competition based on the YouTube-8M dataset. We show and compare different approaches to preprocessing, data augmentation, m… ▽ More

    Submitted 15 January, 2019; v1 submitted 12 September, 2018; originally announced September 2018.