Skip to main content

Showing 1–14 of 14 results for author: Nemeth, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19051  [pdf, other

    stat.ML cs.LG stat.CO

    Stochastic Gradient Piecewise Deterministic Monte Carlo Samplers

    Authors: Paul Fearnhead, Sebastiano Grazzi, Chris Nemeth, Gareth O. Roberts

    Abstract: Recent work has suggested using Monte Carlo methods based on piecewise deterministic Markov processes (PDMPs) to sample from target distributions of interest. PDMPs are non-reversible continuous-time processes endowed with momentum, and hence can mix better than standard reversible MCMC samplers. Furthermore, they can incorporate exact sub-sampling schemes which only require access to a single (ra… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    MSC Class: 62-08 62F15

  2. arXiv:2406.11664  [pdf, other

    stat.ML cs.LG stat.CO

    Diffusion Generative Modelling for Divide-and-Conquer MCMC

    Authors: C. Trojan, P. Fearnhead, C. Nemeth

    Abstract: Divide-and-conquer MCMC is a strategy for parallelising Markov Chain Monte Carlo sampling by running independent samplers on disjoint subsets of a dataset and merging their output. An ongoing challenge in the literature is to efficiently perform this merging without imposing distributional assumptions on the posteriors. We propose using diffusion generative modelling to fit density approximations… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 16 pages, 5 figures

  3. arXiv:2406.02296  [pdf, other

    cs.LG math.OC

    Learning-Rate-Free Stochastic Optimization over Riemannian Manifolds

    Authors: Daniel Dodd, Louis Sharrock, Christopher Nemeth

    Abstract: In recent years, interest in gradient-based optimization over Riemannian manifolds has surged. However, a significant challenge lies in the reliance on hyperparameters, especially the learning rate, which requires meticulous tuning by practitioners to ensure convergence at a suitable rate. In this work, we introduce innovative learning-rate-free algorithms for stochastic optimization over Riemanni… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  4. arXiv:2405.14392  [pdf, other

    stat.ME cs.LG stat.ML

    Markovian Flow Matching: Accelerating MCMC with Continuous Normalizing Flows

    Authors: Alberto Cabezas, Louis Sharrock, Christopher Nemeth

    Abstract: Continuous normalizing flows (CNFs) learn the probability path between a reference and a target density by modeling the vector field generating said path using neural networks. Recently, Lipman et al. (2022) introduced a simple and inexpensive method for training CNFs in generative modeling, termed flow matching (FM). In this paper, we re-purpose this method for probabilistic inference by incorpor… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  5. arXiv:2402.00809  [pdf, other

    cs.LG stat.ML

    Position: Bayesian Deep Learning is Needed in the Age of Large-Scale AI

    Authors: Theodore Papamarkou, Maria Skoularidou, Konstantina Palla, Laurence Aitchison, Julyan Arbel, David Dunson, Maurizio Filippone, Vincent Fortuin, Philipp Hennig, José Miguel Hernández-Lobato, Aliaksandr Hubin, Alexander Immer, Theofanis Karaletsos, Mohammad Emtiyaz Khan, Agustinus Kristiadi, Yingzhen Li, Stephan Mandt, Christopher Nemeth, Michael A. Osborne, Tim G. J. Rudner, David Rügamer, Yee Whye Teh, Max Welling, Andrew Gordon Wilson, Ruqi Zhang

    Abstract: In the current landscape of deep learning research, there is a predominant emphasis on achieving high predictive accuracy in supervised tasks involving large image and language datasets. However, a broader perspective reveals a multitude of overlooked metrics, tasks, and data types, such as uncertainty, active and continual learning, and scientific data, that demand attention. Bayesian deep learni… ▽ More

    Submitted 2 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  6. arXiv:2305.14943  [pdf, other

    stat.ML cs.LG stat.ME

    Learning Rate Free Sampling in Constrained Domains

    Authors: Louis Sharrock, Lester Mackey, Christopher Nemeth

    Abstract: We introduce a suite of new particle-based algorithms for sampling in constrained domains which are entirely learning rate free. Our approach leverages coin betting ideas from convex optimisation, and the viewpoint of constrained sampling as a mirrored optimisation problem on the space of probability measures. Based on this viewpoint, we also introduce a unifying framework for several existing con… ▽ More

    Submitted 26 December, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted at NeurIPS 2023

  7. arXiv:2305.14916  [pdf, other

    stat.ML cs.LG stat.ME

    Tuning-Free Maximum Likelihood Training of Latent Variable Models via Coin Betting

    Authors: Louis Sharrock, Daniel Dodd, Christopher Nemeth

    Abstract: We introduce two new particle-based algorithms for learning latent variable models via marginal maximum likelihood estimation, including one which is entirely tuning-free. Our methods are based on the perspective of marginal maximum likelihood estimation as an optimization problem: namely, as the minimization of a free energy functional. One way to solve this problem is via the discretization of a… ▽ More

    Submitted 1 March, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

  8. arXiv:2301.11294  [pdf, other

    stat.ML cs.LG

    Coin Sampling: Gradient-Based Bayesian Inference without Learning Rates

    Authors: Louis Sharrock, Christopher Nemeth

    Abstract: In recent years, particle-based variational inference (ParVI) methods such as Stein variational gradient descent (SVGD) have grown in popularity as scalable methods for Bayesian inference. Unfortunately, the properties of such methods invariably depend on hyperparameters such as the learning rate, which must be carefully tuned by the practitioner in order to ensure convergence to the target measur… ▽ More

    Submitted 1 June, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: ICML 2023

  9. arXiv:2210.16189  [pdf, ps, other

    stat.ML cs.LG stat.CO stat.ME

    Preferential Subsampling for Stochastic Gradient Langevin Dynamics

    Authors: Srshti Putcha, Christopher Nemeth, Paul Fearnhead

    Abstract: Stochastic gradient MCMC (SGMCMC) offers a scalable alternative to traditional MCMC, by constructing an unbiased estimate of the gradient of the log-posterior with a small, uniformly-weighted subsample of the data. While efficient to compute, the resulting gradient estimator may exhibit a high variance and impact sampler performance. The problem of variance control has been traditionally addressed… ▽ More

    Submitted 8 July, 2023; v1 submitted 28 October, 2022; originally announced October 2022.

    Comments: 22 pages, 5 figures. Appeared in the proceedings of AISTATS 2023

  10. arXiv:2106.01982  [pdf, other

    stat.ML cs.LG

    Gaussian Processes on Hypergraphs

    Authors: Thomas Pinder, Kathryn Turnbull, Christopher Nemeth, David Leslie

    Abstract: We derive a Matern Gaussian process (GP) on the vertices of a hypergraph. This enables estimation of regression models of observed or latent values associated with the vertices, in which the correlation and uncertainty estimates are informed by the hypergraph structure. We further present a framework for embedding the vertices of a hypergraph into a latent space using the hypergraph GP. Finally, w… ▽ More

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: 25 pages, 6 figures

  11. arXiv:2009.12141  [pdf, other

    stat.ML cs.LG

    Stein Variational Gaussian Processes

    Authors: Thomas Pinder, Christopher Nemeth, David Leslie

    Abstract: We show how to use Stein variational gradient descent (SVGD) to carry out inference in Gaussian process (GP) models with non-Gaussian likelihoods and large data volumes. Markov chain Monte Carlo (MCMC) is extremely computationally intensive for these situations, but the parametric assumptions required for efficient variational inference (VI) result in incorrect inference when they encounter the mu… ▽ More

    Submitted 19 January, 2022; v1 submitted 25 September, 2020; originally announced September 2020.

    Comments: 26 pages, 5 figures

  12. arXiv:1901.10568  [pdf, other

    stat.ML cs.LG stat.CO

    Stochastic Gradient MCMC for Nonlinear State Space Models

    Authors: Christopher Aicher, Srshti Putcha, Christopher Nemeth, Paul Fearnhead, Emily B. Fox

    Abstract: State space models (SSMs) provide a flexible framework for modeling complex time series via a latent stochastic process. Inference for nonlinear, non-Gaussian SSMs is often tackled with particle methods that do not scale well to long time series. The challenge is two-fold: not only do computations scale linearly with time, as in the linear case, but particle filters additionally suffer from increa… ▽ More

    Submitted 16 July, 2023; v1 submitted 29 January, 2019; originally announced January 2019.

    Comments: To appear in Bayesian Analysis

  13. arXiv:1806.07137  [pdf, other

    stat.CO cs.LG stat.ML

    Large-Scale Stochastic Sampling from the Probability Simplex

    Authors: Jack Baker, Paul Fearnhead, Emily B Fox, Christopher Nemeth

    Abstract: Stochastic gradient Markov chain Monte Carlo (SGMCMC) has become a popular method for scalable Bayesian inference. These methods are based on sampling a discrete-time approximation to a continuous time process, such as the Langevin diffusion. When applied to distributions defined on a constrained space the time-discretization error can dominate when we are near the boundary of the space. We demons… ▽ More

    Submitted 26 October, 2018; v1 submitted 19 June, 2018; originally announced June 2018.

    Comments: Accepted to Advances in Neural Information Processing Systems (2018)

  14. arXiv:1706.05439  [pdf, other

    stat.CO cs.LG stat.ML

    Control Variates for Stochastic Gradient MCMC

    Authors: Jack Baker, Paul Fearnhead, Emily B. Fox, Christopher Nemeth

    Abstract: It is well known that Markov chain Monte Carlo (MCMC) methods scale poorly with dataset size. A popular class of methods for solving this issue is stochastic gradient MCMC. These methods use a noisy estimate of the gradient of the log posterior, which reduces the per iteration computational cost of the algorithm. Despite this, there are a number of results suggesting that stochastic gradient Lange… ▽ More

    Submitted 14 December, 2017; v1 submitted 16 June, 2017; originally announced June 2017.