Showing 1–5 of 5 results for author: Anokhin, I

  1. arXiv:2307.14993  [pdf, other]

    cs.AI cs.LG

    Thinker: Learning to Plan and Act

    Authors: Stephen Chung, Ivan Anokhin, David Krueger

    Abstract: We propose the Thinker algorithm, a novel approach that enables reinforcement learning agents to autonomously interact with and utilize a learned world model. The Thinker algorithm wraps the environment with a world model and introduces new actions designed for interacting with the world model. These model-interaction actions enable agents to perform planning by proposing alternative plans to the…

    Submitted 26 October, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

    Comments: 38 pages

    ACM Class: I.2.6; I.2.8; I.5.1

  2. arXiv:2202.12297  [pdf, other]

    stat.ML cs.LG

    Embedded Ensembles: Infinite Width Limit and Operating Regimes

    Authors: Maksim Velikanov, Roman Kail, Ivan Anokhin, Roman Vashurin, Maxim Panov, Alexey Zaytsev, Dmitry Yarotsky

    Abstract: A memory efficient approach to ensembling neural networks is to share most weights among the ensembled models by means of a single reference network. We refer to this strategy as Embedded Ensembling (EE); its particular examples are BatchEnsembles and Monte-Carlo dropout ensembles. In this paper we perform a systematic theoretical and empirical analysis of embedded ensembles with different number…

    Submitted 24 February, 2022; originally announced February 2022.

  3. arXiv:2011.13775  [pdf, other]

    cs.CV cs.AI cs.LG

    Image Generators with Conditionally-Independent Pixel Synthesis

    Authors: Ivan Anokhin, Kirill Demochkin, Taras Khakhulin, Gleb Sterkin, Victor Lempitsky, Denis Korzhenkov

    Abstract: Existing image generator networks rely heavily on spatial convolutions and, optionally, self-attention blocks in order to gradually synthesize images in a coarse-to-fine manner. Here, we present a new architecture for image generators, where the color value at each pixel is computed independently given the value of a random latent vector and the coordinate of that pixel. No spatial convolutions or…

    Submitted 27 November, 2020; originally announced November 2020.

  4. arXiv:2008.00741  [pdf, other]

    cs.LG cs.NE stat.ML

    Low-loss connection of weight vectors: distribution-based approaches

    Authors: Ivan Anokhin, Dmitry Yarotsky

    Abstract: Recent research shows that sublevel sets of the loss surfaces of overparameterized networks are connected, exactly or approximately. We describe and compare experimentally a panel of methods used to connect two low-loss points by a low-loss curve on this surface. Our methods vary in accuracy and complexity. Most of our methods are based on "macroscopic" distributional assumptions, and some are ins…

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: accepted to ICML 2020

  5. arXiv:2003.08791  [pdf, other]

    cs.CV cs.LG eess.IV

    High-Resolution Daytime Translation Without Domain Labels

    Authors: Ivan Anokhin, Pavel Solovev, Denis Korzhenkov, Alexey Kharlamov, Taras Khakhulin, Alexey Silvestrov, Sergey Nikolenko, Victor Lempitsky, Gleb Sterkin

    Abstract: Modeling daytime changes in high resolution photographs, e.g., re-rendering the same scene under different illuminations typical for day, night, or dawn, is a challenging image manipulation task. We present the high-resolution daytime translation (HiDT) model for this task. HiDT combines a generative image-to-image model and a new upsampling scheme that allows to apply image translation at high re…

    Submitted 23 March, 2020; v1 submitted 19 March, 2020; originally announced March 2020.

    Comments: accepted to CVPR 2020