Skip to main content

Showing 1–3 of 3 results for author: Gadd, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2011.01226  [pdf, other

    stat.ML cs.LG

    Sample-efficient reinforcement learning using deep Gaussian processes

    Authors: Charles Gadd, Markus Heinonen, Harri Lähdesmäki, Samuel Kaski

    Abstract: Reinforcement learning provides a framework for learning to control which actions to take towards completing a task through trial-and-error. In many applications observing interactions is costly, necessitating sample-efficient learning. In model-based reinforcement learning efficiency is improved by learning to simulate the world dynamics. The challenge is that model inaccuracies rapidly accumulat… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

  2. arXiv:1905.12969  [pdf, other

    stat.ML cs.LG

    Enriched Mixtures of Gaussian Process Experts

    Authors: Charles W. L. Gadd, Sara Wade, Alexis Boukouvalas

    Abstract: Mixtures of experts probabilistically divide the input space into regions, where the assumptions of each expert, or conditional model, need only hold locally. Combined with Gaussian process (GP) experts, this results in a powerful and highly flexible model. We focus on alternative mixtures of GP experts, which model the joint distribution of the inputs and targets explicitly. We highlight issues o… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

  3. arXiv:1803.10746  [pdf, other

    stat.ML cs.LG

    Pseudo-marginal Bayesian inference for supervised Gaussian process latent variable models

    Authors: Charles Gadd, Sara Wade, Akeel Shah, Dimitris Grammatopoulos

    Abstract: We introduce a Bayesian framework for inference with a supervised version of the Gaussian process latent variable model. The framework overcomes the high correlations between latent variables and hyperparameters by using an unbiased pseudo estimate for the marginal likelihood that approximately integrates over the latent variables. This is used to construct a Markov Chain to explore the posterior… ▽ More

    Submitted 28 March, 2018; originally announced March 2018.

    Comments: 9 pages, 2 figures, working paper