Skip to main content

Showing 1–11 of 11 results for author: Gregor, K

Searching in archive stat. Search in all archives.
.
  1. arXiv:2003.03124  [pdf, other

    cs.LG cs.NE stat.ML

    Finding online neural update rules by learning to remember

    Authors: Karol Gregor

    Abstract: We investigate learning of the online local update rules for neural activations (bodies) and weights (synapses) from scratch. We represent the states of each weight and activation by small vectors, and parameterize their updates using (meta-) neural networks. Different neuron types are represented by different embedding vectors which allows the same two functions to be used for all neurons. Instea… ▽ More

    Submitted 6 March, 2020; originally announced March 2020.

    Comments: 11 Pages, 1 figure

  2. arXiv:2002.02836  [pdf, other

    cs.LG cs.AI stat.ML

    Causally Correct Partial Models for Reinforcement Learning

    Authors: Danilo J. Rezende, Ivo Danihelka, George Papamakarios, Nan Rosemary Ke, Ray Jiang, Theophane Weber, Karol Gregor, Hamza Merzic, Fabio Viola, Jane Wang, Jovana Mitrovic, Frederic Besse, Ioannis Antonoglou, Lars Buesing

    Abstract: In reinforcement learning, we can learn a model of future observations and rewards, and use it to plan the agent's next actions. However, jointly modeling future observations can be computationally expensive or even intractable if the observations are high-dimensional (e.g. images). For this reason, previous works have considered partial models, which model only part of the observation. In this pa… ▽ More

    Submitted 7 February, 2020; originally announced February 2020.

  3. arXiv:1906.09237  [pdf, other

    cs.LG cs.AI stat.ML

    Sha** Belief States with Generative Environment Models for RL

    Authors: Karol Gregor, Danilo Jimenez Rezende, Frederic Besse, Yan Wu, Hamza Merzic, Aaron van den Oord

    Abstract: When agents interact with a complex environment, they must form and maintain beliefs about the relevant aspects of that environment. We propose a way to efficiently train expressive generative models in complex environments. We show that a predictive algorithm with an expressive generative model can form stable belief-states in visually rich and dynamic 3D environments. More precisely, we show tha… ▽ More

    Submitted 24 June, 2019; v1 submitted 21 June, 2019; originally announced June 2019.

    Comments: pre-print

  4. arXiv:1901.03559  [pdf, other

    cs.LG cs.AI stat.ML

    An investigation of model-free planning

    Authors: Arthur Guez, Mehdi Mirza, Karol Gregor, Rishabh Kabra, Sébastien Racanière, Théophane Weber, David Raposo, Adam Santoro, Laurent Orseau, Tom Eccles, Greg Wayne, David Silver, Timothy Lillicrap

    Abstract: The field of reinforcement learning (RL) is facing increasingly challenging domains with combinatorial complexity. For an RL agent to address these challenges, it is essential that it can plan effectively. Prior work has typically utilized an explicit model of the environment, combined with a specific planning algorithm (such as tree search). More recently, a new family of methods have been propos… ▽ More

    Submitted 20 May, 2019; v1 submitted 11 January, 2019; originally announced January 2019.

  5. arXiv:1811.09556  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Attractor Dynamics for Generative Memory

    Authors: Yan Wu, Greg Wayne, Karol Gregor, Timothy Lillicrap

    Abstract: A central challenge faced by memory systems is the robust retrieval of a stored pattern in the presence of interference due to other stored patterns and noise. A theoretically well-founded solution to robust retrieval is given by attractor dynamics, which iteratively clean up patterns during recall. However, incorporating attractor dynamics into modern deep learning systems poses difficulties: att… ▽ More

    Submitted 23 November, 2018; originally announced November 2018.

  6. arXiv:1806.03107  [pdf, other

    cs.LG stat.ML

    Temporal Difference Variational Auto-Encoder

    Authors: Karol Gregor, George Papamakarios, Frederic Besse, Lars Buesing, Theophane Weber

    Abstract: To act and plan in complex environments, we posit that agents should have a mental simulator of the world with three characteristics: (a) it should build an abstract state representing the condition of the world; (b) it should form a belief which represents uncertainty on the world; (c) it should go beyond simple step-by-step simulation, and exhibit temporal abstraction. Motivated by the absence o… ▽ More

    Submitted 2 January, 2019; v1 submitted 8 June, 2018; originally announced June 2018.

  7. arXiv:1604.08772  [pdf, other

    stat.ML cs.CV cs.LG

    Towards Conceptual Compression

    Authors: Karol Gregor, Frederic Besse, Danilo Jimenez Rezende, Ivo Danihelka, Daan Wierstra

    Abstract: We introduce a simple recurrent variational auto-encoder architecture that significantly improves image modeling. The system represents the state-of-the-art in latent variable models for both the ImageNet and Omniglot datasets. We show that it naturally separates global conceptual information from lower level details, thus addressing one of the fundamentally desired properties of unsupervised lear… ▽ More

    Submitted 29 April, 2016; originally announced April 2016.

    Comments: 14 pages, 13 figures

  8. arXiv:1603.05106  [pdf, other

    stat.ML cs.AI cs.LG

    One-Shot Generalization in Deep Generative Models

    Authors: Danilo Jimenez Rezende, Shakir Mohamed, Ivo Danihelka, Karol Gregor, Daan Wierstra

    Abstract: Humans have an impressive ability to reason about new concepts and experiences from just a single example. In particular, humans have an ability for one-shot generalization: an ability to encounter a new concept, understand its structure, and then be able to generate compelling alternative variations of the concept. We develop machine learning systems with this important capacity by develo** new… ▽ More

    Submitted 25 May, 2016; v1 submitted 16 March, 2016; originally announced March 2016.

    Comments: 8pgs, 1pg references, 1pg appendix, In Proceedings of the 33rd International Conference on Machine Learning, JMLR: W&CP volume 48, 2016

  9. arXiv:1502.03509  [pdf, other

    cs.LG cs.NE stat.ML

    MADE: Masked Autoencoder for Distribution Estimation

    Authors: Mathieu Germain, Karol Gregor, Iain Murray, Hugo Larochelle

    Abstract: There has been a lot of recent interest in designing neural network models to estimate a distribution from a set of examples. We introduce a simple modification for autoencoder neural networks that yields powerful generative models. Our method masks the autoencoder's parameters to respect autoregressive constraints: each input is reconstructed only from previous inputs in a given ordering. Constra… ▽ More

    Submitted 5 June, 2015; v1 submitted 11 February, 2015; originally announced February 2015.

    Comments: 9 pages and 1 page of supplementary material. Updated to match published version

    Journal ref: Proceedings of the 32nd International Conference on Machine Learning, JMLR W&CP 37:881-889, 2015

  10. arXiv:1402.0030  [pdf, ps, other

    cs.LG stat.ML

    Neural Variational Inference and Learning in Belief Networks

    Authors: Andriy Mnih, Karol Gregor

    Abstract: Highly expressive directed latent variable models, such as sigmoid belief networks, are difficult to train on large datasets because exact inference in them is intractable and none of the approximate inference methods that have been applied to them scale well. We propose a fast non-iterative approximate inference method that uses a feedforward network to implement efficient exact sampling from the… ▽ More

    Submitted 4 June, 2014; v1 submitted 31 January, 2014; originally announced February 2014.

    Journal ref: Proceedings of the 31st International Conference on Machine Learning (ICML), JMLR: W&CP volume 32, 2014 pgs 1791-1799

  11. arXiv:1310.8499  [pdf, other

    cs.LG stat.ML

    Deep AutoRegressive Networks

    Authors: Karol Gregor, Ivo Danihelka, Andriy Mnih, Charles Blundell, Daan Wierstra

    Abstract: We introduce a deep, generative autoencoder capable of learning hierarchies of distributed representations from data. Successive deep stochastic hidden layers are equipped with autoregressive connections, which enable the model to be sampled from quickly and exactly via ancestral sampling. We derive an efficient approximate parameter estimation method based on the minimum description length (MDL)… ▽ More

    Submitted 20 May, 2014; v1 submitted 31 October, 2013; originally announced October 2013.

    Comments: Appears in Proceedings of the 31st International Conference on Machine Learning (ICML), Bei**g, China, 2014

    Journal ref: Karol Gregor, Ivo Danihelka, Andriy Mnih, Charles Blundell, Daan Wierstra. Deep AutoRegressive Networks. In Proceedings of the 31st International Conference on Machine Learning (ICML), JMLR: W&CP volume 32, 2014