Skip to main content

Showing 1–13 of 13 results for author: Foong, A Y K

.
  1. arXiv:2402.04384  [pdf, other

    cs.LG stat.ML

    Denoising Diffusion Probabilistic Models in Six Simple Steps

    Authors: Richard E. Turner, Cristiana-Diana Diaconu, Stratis Markou, Aliaksandra Shysheya, Andrew Y. K. Foong, Bruno Mlodozeniec

    Abstract: Denoising Diffusion Probabilistic Models (DDPMs) are a very popular class of deep generative model that have been successfully applied to a diverse range of problems including image and video generation, protein and material synthesis, weather forecasting, and neural surrogates of partial differential equations. Despite their ubiquity it is hard to find an introduction to DDPMs which is simple, co… ▽ More

    Submitted 10 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  2. arXiv:2401.04082  [pdf, other

    q-bio.QM cs.LG stat.ML

    Improved motif-scaffolding with SE(3) flow matching

    Authors: Jason Yim, Andrew Campbell, Emile Mathieu, Andrew Y. K. Foong, Michael Gastegger, José Jiménez-Luna, Sarah Lewis, Victor Garcia Satorras, Bastiaan S. Veeling, Frank Noé, Regina Barzilay, Tommi S. Jaakkola

    Abstract: Protein design often begins with knowledge of a desired function from a motif which motif-scaffolding aims to construct a functional protein around. Recently, generative models have achieved breakthrough success in designing scaffolds for a diverse range of motifs. However, the generated scaffolds tend to lack structural diversity, which can hinder success in wet-lab validation. In this work, we e… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: Preprint. Code: https://github.com/ microsoft/frame-flow

  3. arXiv:2310.05297  [pdf, other

    q-bio.QM

    Fast protein backbone generation with SE(3) flow matching

    Authors: Jason Yim, Andrew Campbell, Andrew Y. K. Foong, Michael Gastegger, José Jiménez-Luna, Sarah Lewis, Victor Garcia Satorras, Bastiaan S. Veeling, Regina Barzilay, Tommi Jaakkola, Frank Noé

    Abstract: We present FrameFlow, a method for fast protein backbone generation using SE(3) flow matching. Specifically, we adapt FrameDiff, a state-of-the-art diffusion model, to the flow-matching generative modeling paradigm. We show how flow matching can be applied on SE(3) and propose modifications during training to effectively learn the vector field. Compared to FrameDiff, FrameFlow requires five times… ▽ More

    Submitted 10 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: Preprint

  4. arXiv:2303.14468  [pdf, other

    stat.ML cs.LG

    Autoregressive Conditional Neural Processes

    Authors: Wessel P. Bruinsma, Stratis Markou, James Requiema, Andrew Y. K. Foong, Tom R. Andersson, Anna Vaughan, Anthony Buonomo, J. Scott Hosking, Richard E. Turner

    Abstract: Conditional neural processes (CNPs; Garnelo et al., 2018a) are attractive meta-learning models which produce well-calibrated predictions and are trainable via a simple maximum likelihood procedure. Although CNPs have many advantages, they are unable to model dependencies in their predictions. Various works propose solutions to this, but these come at the cost of either requiring approximate infere… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: 57 pages; accepted to the 11th International Conference on Learning Representations (ICLR 2023)

  5. arXiv:2302.01170  [pdf, other

    stat.ML cond-mat.stat-mech cs.LG physics.chem-ph

    Timewarp: Transferable Acceleration of Molecular Dynamics by Learning Time-Coarsened Dynamics

    Authors: Leon Klein, Andrew Y. K. Foong, Tor Erlend Fjelde, Bruno Mlodozeniec, Marc Brockschmidt, Sebastian Nowozin, Frank Noé, Ryota Tomioka

    Abstract: Molecular dynamics (MD) simulation is a widely used technique to simulate molecular systems, most commonly at the all-atom resolution where equations of motion are integrated with timesteps on the order of femtoseconds ($1\textrm{fs}=10^{-15}\textrm{s}$). MD is often used to compute equilibrium properties, which requires sampling from an equilibrium distribution such as the Boltzmann distribution.… ▽ More

    Submitted 1 December, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

  6. arXiv:2205.07880  [pdf, ps, other

    stat.ML cs.LG

    A Note on the Chernoff Bound for Random Variables in the Unit Interval

    Authors: Andrew Y. K. Foong, Wessel P. Bruinsma, David R. Burt

    Abstract: The Chernoff bound is a well-known tool for obtaining a high probability bound on the expectation of a Bernoulli random variable in terms of its sample average. This bound is commonly used in statistical learning theory to upper bound the generalisation risk of a hypothesis in terms of its empirical risk on held-out data, for the case of a binary-valued loss function. However, the extension of thi… ▽ More

    Submitted 15 May, 2022; originally announced May 2022.

  7. arXiv:2106.03542  [pdf, other

    stat.ML cs.LG math.ST

    How Tight Can PAC-Bayes be in the Small Data Regime?

    Authors: Andrew Y. K. Foong, Wessel P. Bruinsma, David R. Burt, Richard E. Turner

    Abstract: In this paper, we investigate the question: Given a small number of datapoints, for example N = 30, how tight can PAC-Bayes and test set bounds be made? For such small datasets, test set bounds adversely affect generalisation performance by withholding data from the training procedure. In this setting, PAC-Bayes bounds are especially attractive, due to their ability to use all the data to simultan… ▽ More

    Submitted 13 January, 2022; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: Published at Neural Information Processing Systems 2021

  8. arXiv:2101.03606  [pdf, other

    stat.ML cs.LG

    The Gaussian Neural Process

    Authors: Wessel P. Bruinsma, James Requeima, Andrew Y. K. Foong, Jonathan Gordon, Richard E. Turner

    Abstract: Neural Processes (NPs; Garnelo et al., 2018a,b) are a rich class of models for meta-learning that map data sets directly to predictive stochastic processes. We provide a rigorous analysis of the standard maximum-likelihood objective used to train conditional NPs. Moreover, we propose a new member to the Neural Process family called the Gaussian Neural Process (GNP), which models predictive correla… ▽ More

    Submitted 10 January, 2021; originally announced January 2021.

    Comments: 34 pages; includes supplementary material; to appear in AABI 2020

  9. arXiv:2007.14235  [pdf, other

    cs.CV

    Structured Weight Priors for Convolutional Neural Networks

    Authors: Tim Pearce, Andrew Y. K. Foong, Alexandra Brintrup

    Abstract: Selection of an architectural prior well suited to a task (e.g. convolutions for image data) is crucial to the success of deep neural networks (NNs). Conversely, the weight priors within these architectures are typically left vague, e.g.~independent Gaussian distributions, which has led to debate over the utility of Bayesian deep learning. This paper explores the benefits of adding structure to we… ▽ More

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: Presented at the ICML 2020 Workshop on Uncertainty and Robustness in Deep Learning

  10. arXiv:2007.01332  [pdf, other

    stat.ML cs.LG

    Meta-Learning Stationary Stochastic Process Prediction with Convolutional Neural Processes

    Authors: Andrew Y. K. Foong, Wessel P. Bruinsma, Jonathan Gordon, Yann Dubois, James Requeima, Richard E. Turner

    Abstract: Stationary stochastic processes (SPs) are a key component of many probabilistic models, such as those for off-the-grid spatio-temporal data. They enable the statistical symmetry of underlying physical phenomena to be leveraged, thereby aiding generalization. Prediction in such models can be viewed as a translation equivariant map from observed data sets to predictive SPs, emphasizing the intimate… ▽ More

    Submitted 20 November, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: NeurIPS 2020

  11. arXiv:1910.13556  [pdf, other

    stat.ML cs.LG

    Convolutional Conditional Neural Processes

    Authors: Jonathan Gordon, Wessel P. Bruinsma, Andrew Y. K. Foong, James Requeima, Yann Dubois, Richard E. Turner

    Abstract: We introduce the Convolutional Conditional Neural Process (ConvCNP), a new member of the Neural Process family that models translation equivariance in the data. Translation equivariance is an important inductive bias for many learning problems including time series modelling, spatial data, and images. The model embeds data sets into an infinite-dimensional function space as opposed to a finite-dim… ▽ More

    Submitted 25 June, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: Accepted at International Conference on Learning Representations 2020

  12. arXiv:1909.00719  [pdf, other

    stat.ML cs.LG

    On the Expressiveness of Approximate Inference in Bayesian Neural Networks

    Authors: Andrew Y. K. Foong, David R. Burt, Yingzhen Li, Richard E. Turner

    Abstract: While Bayesian neural networks (BNNs) hold the promise of being flexible, well-calibrated statistical models, inference often requires approximations whose consequences are poorly understood. We study the quality of common variational methods in approximating the Bayesian predictive distribution. For single-hidden layer ReLU BNNs, we prove a fundamental limitation in function-space of two of the m… ▽ More

    Submitted 23 October, 2020; v1 submitted 2 September, 2019; originally announced September 2019.

    Comments: NeurIPS 2020 version

  13. arXiv:1906.11537  [pdf, other

    stat.ML cs.AI cs.LG

    'In-Between' Uncertainty in Bayesian Neural Networks

    Authors: Andrew Y. K. Foong, Yingzhen Li, José Miguel Hernández-Lobato, Richard E. Turner

    Abstract: We describe a limitation in the expressiveness of the predictive uncertainty estimate given by mean-field variational inference (MFVI), a popular approximate inference method for Bayesian neural networks. In particular, MFVI fails to give calibrated uncertainty estimates in between separated regions of observations. This can lead to catastrophically overconfident predictions when testing on out-of… ▽ More

    Submitted 27 June, 2019; originally announced June 2019.

    Comments: Presented at the ICML 2019 Workshop on Uncertainty and Robustness in Deep Learning