Showing 1–5 of 5 results for author: Dayan, P

  1. arXiv:1810.00555  [pdf, other]

    stat.ML cs.AI cs.LG

    Probabilistic Meta-Representations Of Neural Networks

    Authors: Theofanis Karaletsos, Peter Dayan, Zoubin Ghahramani

    Abstract: Existing Bayesian treatments of neural networks are typically characterized by weak prior and approximate posterior distributions according to which all the weights are drawn independently. Here, we consider a richer prior distribution in which units in the network are represented by latent variables, and the weights between units are drawn conditionally on the values of the collection of those va…

    Submitted 1 October, 2018; originally announced October 2018.

    Comments: Presented at UAI 2018 Uncertainty In Deep Learning Workshop (UDL Aug. 2018)

  2. arXiv:1803.10049  [pdf, other]

    cs.LG stat.ML

    Fast Parametric Learning with Activation Memorization

    Authors: Jack W Rae, Chris Dyer, Peter Dayan, Timothy P Lillicrap

    Abstract: Neural networks trained with backpropagation often struggle to identify classes that have been observed a small number of times. In applications where most class labels are rare, such as language modelling, this can become a performance bottleneck. One potential remedy is to augment the network with a fast-learning non-parametric model which stores recent activations and class labels into an exter…

    Submitted 27 March, 2018; originally announced March 2018.

  3. Monte Carlo Planning method estimates planning horizons during interactive social exchange

    Authors: Andreas Hula, P. Read Montague, Peter Dayan

    Abstract: Reciprocating interactions represent a central feature of all human exchanges. They have been the target of various recent experiments, with healthy participants and psychiatric populations engaging as dyads in multi-round exchanges such as a repeated trust task. Behaviour in such exchanges involves complexities related to each agent's preference for equity with their partner, beliefs about the pa…

    Submitted 26 May, 2015; v1 submitted 12 February, 2015; originally announced February 2015.

  4. arXiv:1402.1958  [pdf, other]

    cs.AI cs.LG stat.ML

    Better Optimism By Bayes: Adaptive Planning with Rich Models

    Authors: Arthur Guez, David Silver, Peter Dayan

    Abstract: The computational costs of inference and planning have confined Bayesian model-based reinforcement learning to one of two dismal fates: powerful Bayes-adaptive planning but only for simplistic models, or powerful, Bayesian non-parametric models but using simple, myopic planning strategies such as Thompson sampling. We ask whether it is feasible and truly beneficial to combine rich probabilistic mo…

    Submitted 9 February, 2014; originally announced February 2014.

    Comments: 11 pages, 11 figures

  5. arXiv:1205.3109  [pdf, other]

    cs.LG cs.AI stat.ML

    Efficient Bayes-Adaptive Reinforcement Learning using Sample-Based Search

    Authors: Arthur Guez, David Silver, Peter Dayan

    Abstract: Bayesian model-based reinforcement learning is a formally elegant approach to learning optimal behaviour under model uncertainty, trading off exploration and exploitation in an ideal way. Unfortunately, finding the resulting Bayes-optimal policies is notoriously taxing, since the search space becomes enormous. In this paper we introduce a tractable, sample-based method for approximate Bayes-optima…

    Submitted 18 December, 2013; v1 submitted 14 May, 2012; originally announced May 2012.

    Comments: 14 pages, 7 figures, includes supplementary material. Advances in Neural Information Processing Systems (NIPS) 2012

    Journal ref: (2012) Advances in Neural Information Processing Systems 25, pages 1034-1042