Skip to main content

Showing 1–3 of 3 results for author: Shvechikov, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2005.04269  [pdf, other

    cs.LG cs.AI stat.ML

    Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics

    Authors: Arsenii Kuznetsov, Pavel Shvechikov, Alexander Grishin, Dmitry Vetrov

    Abstract: The overestimation bias is one of the major impediments to accurate off-policy learning. This paper investigates a novel way to alleviate the overestimation bias in a continuous control setting. Our method---Truncated Quantile Critics, TQC,---blends three ideas: distributional representation of a critic, truncation of critics prediction, and ensembling of multiple critics. Distributional represent… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: Under review by the International Conference on Machine Learning

  2. arXiv:1811.02783  [pdf, other

    cs.LG stat.ML

    YASENN: Explaining Neural Networks via Partitioning Activation Sequences

    Authors: Yaroslav Zharov, Denis Korzhenkov, Pavel Shvechikov, Alexander Tuzhilin

    Abstract: We introduce a novel approach to feed-forward neural network interpretation based on partitioning the space of sequences of neuron activations. In line with this approach, we propose a model-specific interpretation method, called YASENN. Our method inherits many advantages of model-agnostic distillation, such as an ability to focus on the particular input region and to express an explanation in te… ▽ More

    Submitted 7 November, 2018; originally announced November 2018.

  3. arXiv:1810.07151  [pdf, other

    stat.ML cs.AI cs.LG

    Metropolis-Hastings view on variational inference and adversarial training

    Authors: Kirill Neklyudov, Evgenii Egorov, Pavel Shvechikov, Dmitry Vetrov

    Abstract: A significant part of MCMC methods can be considered as the Metropolis-Hastings (MH) algorithm with different proposal distributions. From this point of view, the problem of constructing a sampler can be reduced to the question - how to choose a proposal for the MH algorithm? To address this question, we propose to learn an independent sampler that maximizes the acceptance rate of the MH algorithm… ▽ More

    Submitted 9 June, 2019; v1 submitted 16 October, 2018; originally announced October 2018.