Skip to main content

Showing 1–5 of 5 results for author: Danihelka, I

Searching in archive stat. Search in all archives.
.
  1. arXiv:2002.02836  [pdf, other

    cs.LG cs.AI stat.ML

    Causally Correct Partial Models for Reinforcement Learning

    Authors: Danilo J. Rezende, Ivo Danihelka, George Papamakarios, Nan Rosemary Ke, Ray Jiang, Theophane Weber, Karol Gregor, Hamza Merzic, Fabio Viola, Jane Wang, Jovana Mitrovic, Frederic Besse, Ioannis Antonoglou, Lars Buesing

    Abstract: In reinforcement learning, we can learn a model of future observations and rewards, and use it to plan the agent's next actions. However, jointly modeling future observations can be computationally expensive or even intractable if the observations are high-dimensional (e.g. images). For this reason, previous works have considered partial models, which model only part of the observation. In this pa… ▽ More

    Submitted 7 February, 2020; originally announced February 2020.

  2. arXiv:1705.10743  [pdf, other

    cs.LG stat.ML

    The Cramer Distance as a Solution to Biased Wasserstein Gradients

    Authors: Marc G. Bellemare, Ivo Danihelka, Will Dabney, Shakir Mohamed, Balaji Lakshminarayanan, Stephan Hoyer, Rémi Munos

    Abstract: The Wasserstein probability metric has received much attention from the machine learning community. Unlike the Kullback-Leibler divergence, which strictly measures change in probability, the Wasserstein metric reflects the underlying geometry between outcomes. The value of being sensitive to this geometry has been demonstrated, among others, in ordinal regression and generative modelling. In this… ▽ More

    Submitted 30 May, 2017; originally announced May 2017.

  3. arXiv:1604.08772  [pdf, other

    stat.ML cs.CV cs.LG

    Towards Conceptual Compression

    Authors: Karol Gregor, Frederic Besse, Danilo Jimenez Rezende, Ivo Danihelka, Daan Wierstra

    Abstract: We introduce a simple recurrent variational auto-encoder architecture that significantly improves image modeling. The system represents the state-of-the-art in latent variable models for both the ImageNet and Omniglot datasets. We show that it naturally separates global conceptual information from lower level details, thus addressing one of the fundamentally desired properties of unsupervised lear… ▽ More

    Submitted 29 April, 2016; originally announced April 2016.

    Comments: 14 pages, 13 figures

  4. arXiv:1603.05106  [pdf, other

    stat.ML cs.AI cs.LG

    One-Shot Generalization in Deep Generative Models

    Authors: Danilo Jimenez Rezende, Shakir Mohamed, Ivo Danihelka, Karol Gregor, Daan Wierstra

    Abstract: Humans have an impressive ability to reason about new concepts and experiences from just a single example. In particular, humans have an ability for one-shot generalization: an ability to encounter a new concept, understand its structure, and then be able to generate compelling alternative variations of the concept. We develop machine learning systems with this important capacity by develo** new… ▽ More

    Submitted 25 May, 2016; v1 submitted 16 March, 2016; originally announced March 2016.

    Comments: 8pgs, 1pg references, 1pg appendix, In Proceedings of the 33rd International Conference on Machine Learning, JMLR: W&CP volume 48, 2016

  5. arXiv:1310.8499  [pdf, other

    cs.LG stat.ML

    Deep AutoRegressive Networks

    Authors: Karol Gregor, Ivo Danihelka, Andriy Mnih, Charles Blundell, Daan Wierstra

    Abstract: We introduce a deep, generative autoencoder capable of learning hierarchies of distributed representations from data. Successive deep stochastic hidden layers are equipped with autoregressive connections, which enable the model to be sampled from quickly and exactly via ancestral sampling. We derive an efficient approximate parameter estimation method based on the minimum description length (MDL)… ▽ More

    Submitted 20 May, 2014; v1 submitted 31 October, 2013; originally announced October 2013.

    Comments: Appears in Proceedings of the 31st International Conference on Machine Learning (ICML), Bei**g, China, 2014

    Journal ref: Karol Gregor, Ivo Danihelka, Andriy Mnih, Charles Blundell, Daan Wierstra. Deep AutoRegressive Networks. In Proceedings of the 31st International Conference on Machine Learning (ICML), JMLR: W&CP volume 32, 2014