Showing 1–16 of 16 results for author: Matthews, A G d G

  1. arXiv:2305.02402  [pdf, other]

    hep-lat cond-mat.stat-mech cs.LG

    Normalizing flows for lattice gauge theory in arbitrary space-time dimension

    Authors: Ryan Abbott, Michael S. Albergo, Aleksandar Botev, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Gurtej Kanwar, Alexander G. D. G. Matthews, Sébastien Racanière, Ali Razavi, Danilo J. Rezende, Fernando Romero-López, Phiala E. Shanahan, Julian M. Urban

    Abstract: Applications of normalizing flows to the sampling of field configurations in lattice gauge theory have so far been explored almost exclusively in two space-time dimensions. We report new algorithmic developments of gauge-equivariant flow architectures facilitating the generalization to higher-dimensional lattice geometries. Specifically, we discuss masked autoregressive transformations with tracta…

    Submitted 3 May, 2023; originally announced May 2023.

  2. arXiv:2211.07541  [pdf, other]

    hep-lat cond-mat.stat-mech cs.LG

    Aspects of scaling and scalability for flow-based sampling of lattice QCD

    Authors: Ryan Abbott, Michael S. Albergo, Aleksandar Botev, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Alexander G. D. G. Matthews, Sébastien Racanière, Ali Razavi, Danilo J. Rezende, Fernando Romero-López, Phiala E. Shanahan, Julian M. Urban

    Abstract: Recent applications of machine-learned normalizing flows to sampling in lattice field theory suggest that such methods may be able to mitigate critical slowing down and topological freezing. However, these demonstrations have been at the scale of toy models, and it remains to be determined whether they can be applied to state-of-the-art lattice quantum chromodynamics calculations. Assessing the vi…

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: 22 pages, 8 figures

    Report number: MIT-CTP/5496

  3. arXiv:2208.07698  [pdf, other]

    stat.ML cs.LG

    Score-Based Diffusion meets Annealed Importance Sampling

    Authors: Arnaud Doucet, Will Grathwohl, Alexander G. D. G. Matthews, Heiko Strathmann

    Abstract: More than twenty years after its introduction, Annealed Importance Sampling (AIS) remains one of the most effective methods for marginal likelihood estimation. It relies on a sequence of distributions interpolating between a tractable initial distribution and the target distribution of interest which we simulate from approximately using a non-homogeneous Markov chain. To obtain an importance sampl…

    Submitted 24 October, 2022; v1 submitted 16 August, 2022; originally announced August 2022.

    Comments: Accepted at NeurIPS 2022

  4. arXiv:2208.03832  [pdf, other]

    hep-lat

    Sampling QCD field configurations with gauge-equivariant flow models

    Authors: Ryan Abbott, Michael S. Albergo, Aleksandar Botev, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Gurtej Kanwar, Alexander G. D. G. Matthews, Sébastien Racanière, Ali Razavi, Danilo J. Rezende, Fernando Romero-López, Phiala E. Shanahan, Julian M. Urban

    Abstract: Machine learning methods based on normalizing flows have been shown to address important challenges, such as critical slowing-down and topological freezing, in the sampling of gauge field configurations in simple lattice field theories. A critical question is whether this success will translate to studies of QCD. This Proceedings presents a status update on advances in this area. In particular, it…

    Submitted 20 August, 2022; v1 submitted 7 August, 2022; originally announced August 2022.

    Comments: Submitted as a proceedings to the 39th International Symposium on Lattice Field Theory (Lattice 2022)

  5. arXiv:2201.13117  [pdf, other]

    stat.ML cond-mat.stat-mech cs.LG hep-lat

    Continual Repeated Annealed Flow Transport Monte Carlo

    Authors: Alexander G. D. G. Matthews, Michael Arbel, Danilo J. Rezende, Arnaud Doucet

    Abstract: We propose Continual Repeated Annealed Flow Transport Monte Carlo (CRAFT), a method that combines a sequential Monte Carlo (SMC) sampler (itself a generalization of Annealed Importance Sampling) with variational inference using normalizing flows. The normalizing flows are directly trained to transport between annealing temperatures using a KL divergence for each transition. This optimization objec…

    Submitted 6 April, 2023; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: 21 pages, 6 figures. Published at the International Conference on Machine Learning (ICML) 2022

  6. arXiv:2102.07501  [pdf, other]

    stat.ML cond-mat.stat-mech cs.LG math.ST

    Annealed Flow Transport Monte Carlo

    Authors: Michael Arbel, Alexander G. D. G. Matthews, Arnaud Doucet

    Abstract: Annealed Importance Sampling (AIS) and its Sequential Monte Carlo (SMC) extensions are state-of-the-art methods for estimating normalizing constants of probability distributions. We propose here a novel Monte Carlo algorithm, Annealed Flow Transport (AFT), that builds upon AIS and SMC and combines them with normalizing flows (NFs) for improved performance. This method transports a set of particles…

    Submitted 9 July, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

  7. arXiv:1909.02487  [pdf, other]

    physics.chem-ph cs.LG physics.comp-ph

    Ab-Initio Solution of the Many-Electron Schrödinger Equation with Deep Neural Networks

    Authors: David Pfau, James S. Spencer, Alexander G. de G. Matthews, W. M. C. Foulkes

    Abstract: Given access to accurate solutions of the many-electron Schrödinger equation, nearly all chemistry could be derived from first principles. Exact wavefunctions of interesting chemical systems are out of reach because they are NP-hard to compute in general, but approximations can be found using polynomially-scaling algorithms. The key challenge for many of these algorithms is the choice of wavefunct…

    Submitted 25 March, 2021; v1 submitted 5 September, 2019; originally announced September 2019.

    Comments: Final proof for Physical Review Research

    Journal ref: Phys. Rev. Research 2, 033429 (2020)

  8. arXiv:1901.11356  [pdf, other]

    stat.ML cs.LG

    Functional Regularisation for Continual Learning with Gaussian Processes

    Authors: Michalis K. Titsias, Jonathan Schwarz, Alexander G. de G. Matthews, Razvan Pascanu, Yee Whye Teh

    Abstract: We introduce a framework for Continual Learning (CL) based on Bayesian inference over the function space rather than the parameters of a deep neural network. This method, referred to as functional regularisation for Continual Learning, avoids forgetting a previous task by constructing and memorising an approximate posterior belief over the underlying task-specific function. To achieve this we rely…

    Submitted 11 February, 2020; v1 submitted 31 January, 2019; originally announced January 2019.

    Comments: 17 pages, 7 figures

  9. arXiv:1807.01969  [pdf, other]

    stat.ML cs.LG

    Variational Bayesian dropout: pitfalls and fixes

    Authors: Jiri Hron, Alexander G. de G. Matthews, Zoubin Ghahramani

    Abstract: Dropout, a stochastic regularisation technique for training of neural networks, has recently been reinterpreted as a specific type of approximate inference algorithm for Bayesian neural networks. The main contribution of the reinterpretation is in providing a theoretical framework useful for analysing and extending the algorithm. We show that the proposed framework suffers from several issues; fro…

    Submitted 5 July, 2018; originally announced July 2018.

    Comments: Extended version of the paper accepted to ICML 2018: more details in the proofs, few minor modifications

  10. arXiv:1804.11271  [pdf, other]

    stat.ML cs.LG

    Gaussian Process Behaviour in Wide Deep Neural Networks

    Authors: Alexander G. de G. Matthews, Mark Rowland, Jiri Hron, Richard E. Turner, Zoubin Ghahramani

    Abstract: Whilst deep neural networks have shown great empirical success, there is still much work to be done to understand their theoretical properties. In this paper, we study the relationship between random, wide, fully connected, feedforward networks with more than one hidden layer and Gaussian processes with a recursive kernel definition. We show that, under broad conditions, as we make the architectur…

    Submitted 16 August, 2018; v1 submitted 30 April, 2018; originally announced April 2018.

    Comments: This work substantially extends the work of Matthews et al. (2018) published at the International Conference on Learning Representations (ICLR) 2018

  11. arXiv:1711.02989  [pdf, other]

    stat.ML

    Variational Gaussian Dropout is not Bayesian

    Authors: Jiri Hron, Alexander G. de G. Matthews, Zoubin Ghahramani

    Abstract: Gaussian multiplicative noise is commonly used as a stochastic regularisation technique in training of deterministic neural networks. A recent paper reinterpreted the technique as a specific algorithm for approximate inference in Bayesian neural networks; several extensions ensued. We show that the log-uniform prior used in all the above publications does not generally induce a proper posterior, a…

    Submitted 8 November, 2017; originally announced November 2017.

  12. arXiv:1707.02476  [pdf, other]

    stat.ML

    Adversarial Examples, Uncertainty, and Transfer Testing Robustness in Gaussian Process Hybrid Deep Networks

    Authors: John Bradshaw, Alexander G. de G. Matthews, Zoubin Ghahramani

    Abstract: Deep neural networks (DNNs) have excellent representative power and are state of the art classifiers on many tasks. However, they often do not capture their own uncertainties well making them less robust in the real world as they overconfidently extrapolate and do not notice domain shift. Gaussian processes (GPs) with RBF kernels on the other hand have better calibrated uncertainties and do not ov…

    Submitted 8 July, 2017; originally announced July 2017.

  13. arXiv:1610.08733  [pdf, other]

    stat.ML

    GPflow: A Gaussian process library using TensorFlow

    Authors: Alexander G. de G. Matthews, Mark van der Wilk, Tom Nickson, Keisuke Fujii, Alexis Boukouvalas, Pablo León-Villagrá, Zoubin Ghahramani, James Hensman

    Abstract: GPflow is a Gaussian process library that uses TensorFlow for its core computations and Python for its front end. The distinguishing features of GPflow are that it uses variational inference as the primary approximation method, provides concise code through the use of automatic differentiation, has been engineered with a particular emphasis on software testing and is able to exploit GPU hardware.

    Submitted 27 October, 2016; originally announced October 2016.

  14. arXiv:1506.04000  [pdf, other]

    stat.ML

    MCMC for Variationally Sparse Gaussian Processes

    Authors: James Hensman, Alexander G. de G. Matthews, Maurizio Filippone, Zoubin Ghahramani

    Abstract: Gaussian process (GP) models form a core part of probabilistic machine learning. Considerable research effort has been made into attacking three issues with GP models: how to compute efficiently when the number of data is large; how to approximate the posterior when the likelihood is not Gaussian and how to estimate covariance function parameter posteriors. This paper simultaneously addresses thes…

    Submitted 12 June, 2015; originally announced June 2015.

    Comments: 16 pages

  15. arXiv:1504.07027  [pdf, ps, other]

    stat.ML

    On Sparse variational methods and the Kullback-Leibler divergence between stochastic processes

    Authors: Alexander G. de G. Matthews, James Hensman, Richard E. Turner, Zoubin Ghahramani

    Abstract: The variational framework for learning inducing variables (Titsias, 2009a) has had a large impact on the Gaussian process literature. The framework may be interpreted as minimizing a rigorously defined Kullback-Leibler divergence between the approximating and posterior processes. To our knowledge this connection has thus far gone unremarked in the literature. In this paper we give a substantial ge…

    Submitted 4 December, 2015; v1 submitted 27 April, 2015; originally announced April 2015.

    Comments: 9 pages. No figures

  16. arXiv:1405.4141  [pdf, other]

    stat.ML stat.CO stat.ME

    Classification using log Gaussian Cox processes

    Authors: Alexander G. de G. Matthews, Zoubin Ghahramani

    Abstract: McCullagh and Yang (2006) suggest a family of classification algorithms based on Cox processes. We further investigate the log Gaussian variant which has a number of appealing properties. Conditioned on the covariates, the distribution over labels is given by a type of conditional Markov random field. In the supervised case, computation of the predictive probability of a single test point scales l…

    Submitted 20 June, 2014; v1 submitted 16 May, 2014; originally announced May 2014.

    Comments: 17 pages, 6 figures