Showing 1–8 of 8 results for author: Figurnov, M

Searching in archive cs.
  1. arXiv:1906.10652  [pdf, other]

    stat.ML cs.LG math.OC

    Monte Carlo Gradient Estimation in Machine Learning

    Authors: Shakir Mohamed, Mihaela Rosca, Michael Figurnov, Andriy Mnih

    Abstract: This paper is a broad and accessible survey of the methods we have at our disposal for Monte Carlo gradient estimation in machine learning and across the statistical sciences: the problem of computing the gradient of an expectation of a function with respect to parameters defining the distribution that is integrated; the problem of sensitivity analysis. In machine learning research, this gradient…

    Submitted 29 September, 2020; v1 submitted 25 June, 2019; originally announced June 2019.

    Comments: 62 pages

    Journal ref: Journal of Machine Learning Research, 21(132):1-62, 2020

  2. arXiv:1806.02382  [pdf, other]

    stat.ML cs.LG

    Variational Autoencoder with Arbitrary Conditioning

    Authors: Oleg Ivanov, Michael Figurnov, Dmitry Vetrov

    Abstract: We propose a single neural probabilistic model based on variational autoencoder that can be conditioned on an arbitrary subset of observed features and then sample the remaining features in "one shot". The features may be both real-valued and categorical. Training of the model is performed by stochastic variational Bayes. The experimental evaluation on synthetic data, as well as feature imputation…

    Submitted 27 June, 2019; v1 submitted 6 June, 2018; originally announced June 2018.

    Comments: Published as a conference paper at ICLR 2019

  3. arXiv:1805.08498  [pdf, other]

    cs.LG stat.ML

    Implicit Reparameterization Gradients

    Authors: Michael Figurnov, Shakir Mohamed, Andriy Mnih

    Abstract: By providing a simple and efficient way of computing low-variance gradients of continuous random variables, the reparameterization trick has become the technique of choice for training a variety of latent variable models. However, it is not applicable to a number of important continuous distributions. We introduce an alternative approach to computing reparameterization gradients based on implicit…

    Submitted 30 January, 2019; v1 submitted 22 May, 2018; originally announced May 2018.

    Comments: NeurIPS 2018

  4. arXiv:1801.01928  [pdf, ps, other]

    cs.MS math.NA

    Tensor Train decomposition on TensorFlow (T3F)

    Authors: Alexander Novikov, Pavel Izmailov, Valentin Khrulkov, Michael Figurnov, Ivan Oseledets

    Abstract: Tensor Train decomposition is used across many branches of machine learning. We present T3F -- a library for Tensor Train decomposition based on TensorFlow. T3F supports GPU execution, batch processing, automatic differentiation, and versatile functionality for the Riemannian optimization framework, which takes into account the underlying manifold structure to construct efficient optimization meth…

    Submitted 2 March, 2020; v1 submitted 5 January, 2018; originally announced January 2018.

  5. arXiv:1712.00386  [pdf, other]

    cs.LG cs.CV

    Probabilistic Adaptive Computation Time

    Authors: Michael Figurnov, Artem Sobolev, Dmitry Vetrov

    Abstract: We present a probabilistic model with discrete latent variables that control the computation time in deep learning models such as ResNets and LSTMs. A prior on the latent variables expresses the preference for faster computation. The amount of computation for an input is determined via amortized maximum a posteriori (MAP) inference. MAP inference is performed using a novel stochastic variational o…

    Submitted 1 December, 2017; originally announced December 2017.

  6. arXiv:1612.02297  [pdf, other]

    cs.CV cs.LG

    Spatially Adaptive Computation Time for Residual Networks

    Authors: Michael Figurnov, Maxwell D. Collins, Yukun Zhu, Li Zhang, Jonathan Huang, Dmitry Vetrov, Ruslan Salakhutdinov

    Abstract: This paper proposes a deep learning architecture based on Residual Network that dynamically adjusts the number of executed layers for the regions of the image. This architecture is end-to-end trainable, deterministic and problem-agnostic. It is therefore applicable without any modifications to a wide range of computer vision problems such as image classification, object detection and image segment…

    Submitted 2 July, 2017; v1 submitted 7 December, 2016; originally announced December 2016.

    Comments: CVPR 2017

  7. arXiv:1611.09226  [pdf, other]

    cs.LG stat.ML

    Robust Variational Inference

    Authors: Michael Figurnov, Kirill Struminsky, Dmitry Vetrov

    Abstract: Variational inference is a powerful tool for approximate inference. However, it mainly focuses on the evidence lower bound as variational objective and the development of other measures for variational inference is a promising area of research. This paper proposes a robust modification of evidence and a lower bound for the evidence, which is applicable when the majority of the training set samples…

    Submitted 28 November, 2016; originally announced November 2016.

    Comments: NIPS 2016 Workshop, Advances in Approximate Bayesian Inference

  8. arXiv:1504.08362  [pdf, other]

    cs.CV

    PerforatedCNNs: Acceleration through Elimination of Redundant Convolutions

    Authors: Michael Figurnov, Aijan Ibraimova, Dmitry Vetrov, Pushmeet Kohli

    Abstract: We propose a novel approach to reduce the computational cost of evaluation of convolutional neural networks, a factor that has hindered their deployment in low-power devices such as mobile phones. Inspired by the loop perforation technique from source code optimization, we speed up the bottleneck convolutional layers by skipping their evaluation in some of the spatial positions. We propose and ana…

    Submitted 15 October, 2016; v1 submitted 30 April, 2015; originally announced April 2015.

    Comments: NIPS 2016