
Showing 1–4 of 4 results for author: Diaconu, C

  1. arXiv:2406.13493  [pdf, other]

    cs.LG stat.ML

    In-Context In-Context Learning with Transformer Neural Processes

    Authors: Matthew Ashman, Cristiana Diaconu, Adrian Weller, Richard E. Turner

    Abstract: Neural processes (NPs) are a powerful family of meta-learning models that seek to approximate the posterior predictive map of the ground-truth stochastic process from which each dataset in a meta-dataset is sampled. There are many cases in which practitioners, besides having access to the dataset of interest, may also have access to other datasets that share similarities with it. In this case, int…

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2406.13488  [pdf, other]

    stat.ML cs.LG

    Approximately Equivariant Neural Processes

    Authors: Matthew Ashman, Cristiana Diaconu, Adrian Weller, Wessel Bruinsma, Richard E. Turner

    Abstract: Equivariant deep learning architectures exploit symmetries in learning problems to improve the sample efficiency of neural-network-based models and their ability to generalise. However, when modelling real-world data, learning problems are often not exactly equivariant, but only approximately. For example, when estimating the global temperature field from weather station observations, local topogr…

    Submitted 19 June, 2024; originally announced June 2024.

  3. arXiv:2406.12409  [pdf, other]

    stat.ML cs.LG

    Translation Equivariant Transformer Neural Processes

    Authors: Matthew Ashman, Cristiana Diaconu, Junhyuck Kim, Lakee Sivaraya, Stratis Markou, James Requeima, Wessel P. Bruinsma, Richard E. Turner

    Abstract: The effectiveness of neural processes (NPs) in modelling posterior prediction maps -- the mapping from data to posterior predictive distributions -- has significantly improved since their inception. This improvement can be attributed to two principal factors: (1) advancements in the architecture of permutation invariant set functions, which are intrinsic to all NPs; and (2) leveraging symmetries p…

    Submitted 18 June, 2024; originally announced June 2024.

  4. arXiv:2402.04384  [pdf, other]

    cs.LG stat.ML

    Denoising Diffusion Probabilistic Models in Six Simple Steps

    Authors: Richard E. Turner, Cristiana-Diana Diaconu, Stratis Markou, Aliaksandra Shysheya, Andrew Y. K. Foong, Bruno Mlodozeniec

    Abstract: Denoising Diffusion Probabilistic Models (DDPMs) are a very popular class of deep generative model that have been successfully applied to a diverse range of problems including image and video generation, protein and material synthesis, weather forecasting, and neural surrogates of partial differential equations. Despite their ubiquity it is hard to find an introduction to DDPMs which is simple, co…

    Submitted 10 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.