Showing 1–11 of 11 results for author: Vollmer, S J

  1. arXiv:2307.06431

    stat.ML cs.LG

    Energy Discrepancies: A Score-Independent Loss for Energy-Based Models

    Authors: Tobias Schröder, Zijing Ou, Jen Ning Lim, Yingzhen Li, Sebastian J. Vollmer, Andrew B. Duncan

    Abstract: Energy-based models are a simple yet powerful class of probabilistic models, but their widespread adoption has been limited by the computational burden of training them. We propose a novel loss function called Energy Discrepancy (ED) which does not rely on the computation of scores or expensive Markov chain Monte Carlo. We show that ED approaches the explicit score matching and negative log-likeli…

    Submitted 27 November, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: Camera Ready version for the 37th Conference on Neural Information Processing Systems (NeurIPS 2023). Changes in this revision: Appendix A1: Corrected proof of Theorem 1. Appendix D3: Added definition and numerical experiments for energy discrepancy on binary discrete spaces. Minor changes in the main text and correction of typos. Added new references

  2. arXiv:2010.11530

    stat.ML cs.LG

    Model updating after interventions paradoxically introduces bias

    Authors: James Liley, Samuel R Emerson, Bilal A Mateen, Catalina A Vallejos, Louis J M Aslett, Sebastian J Vollmer

    Abstract: Machine learning is increasingly being used to generate prediction models for use in a number of real-world settings, from credit risk assessment to clinical decision support. Recent discussions have highlighted potential problems in the updating of a predictive score for a binary outcome when an existing predictive score forms part of the standard workflow, driving interventions. In this setting,…

    Submitted 22 February, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: Sections of this preprint on 'Successive adjuvancy' (section 4, theorem 2, figures 4,5, and associated discussions) were not included in the originally submitted version of this paper due to length. This material does not appear in the published version of this manuscript, and the reader should be aware that these sections did not undergo peer review

  3. MLJ: A Julia package for composable machine learning

    Authors: Anthony D. Blaom, Franz Kiraly, Thibaut Lienart, Yiannis Simillides, Diego Arenas, Sebastian J. Vollmer

    Abstract: MLJ (Machine Learning in Julia) is an open source software package providing a common interface for interacting with machine learning models written in Julia and other languages. It provides tools and meta-algorithms for selecting, tuning, evaluating, composing and comparing those models, with a focus on flexible model composition. In this design overview we detail chief novelties of the framework,…

    Submitted 3 November, 2020; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: Shortened version of previous version

    Journal ref: Journal of Open Source Software, 2020, vol. 5(55), p. 2704

  4. arXiv:1706.02692

    stat.ME math.NA

    The True Cost of Stochastic Gradient Langevin Dynamics

    Authors: Tigran Nagapetyan, Andrew B. Duncan, Leonard Hasenclever, Sebastian J. Vollmer, Lukasz Szpruch, Konstantinos Zygalakis

    Abstract: The problem of posterior inference is central to Bayesian statistics and a wealth of Markov Chain Monte Carlo (MCMC) methods have been proposed to obtain asymptotically correct samples from the posterior. As datasets in applications grow larger and larger, scalability has emerged as a central problem for MCMC methods. Stochastic Gradient Langevin Dynamics (SGLD) and related stochastic gradient Mar…

    Submitted 8 June, 2017; originally announced June 2017.

    Comments: 6 Figures

    MSC Class: 65C05

  5. Piecewise Deterministic Markov Processes for Scalable Monte Carlo on Restricted Domains

    Authors: Joris Bierkens, Alexandre Bouchard-Côté, Arnaud Doucet, Andrew B. Duncan, Paul Fearnhead, Thibaut Lienart, Gareth Roberts, Sebastian J. Vollmer

    Abstract: Piecewise Deterministic Monte Carlo algorithms enable simulation from a posterior distribution, whilst only needing to access a sub-sample of data at each iteration. We show how they can be implemented in settings where the parameters live on a restricted domain.

    Submitted 17 February, 2018; v1 submitted 16 January, 2017; originally announced January 2017.

    Journal ref: Statistics & Probability Letters Volume 136, May 2018, Pages 148-154

  6. arXiv:1611.06972

    stat.ML cs.LG math.PR

    Measuring Sample Quality with Diffusions

    Authors: Jackson Gorham, Andrew B. Duncan, Sebastian J. Vollmer, Lester Mackey

    Abstract: Stein's method for measuring convergence to a continuous target distribution relies on an operator characterizing the target and Stein factor bounds on the solutions of an associated differential equation. While such operators and bounds are readily available for a diversity of univariate targets, few multivariate targets have been analyzed. We introduce a new class of characterizing operators bas…

    Submitted 12 November, 2018; v1 submitted 21 November, 2016; originally announced November 2016.

    MSC Class: 60J60; 62-04; 62E17; 60E15; 65C60 (Primary) 62-07; 65C05; 68T05 (Secondary)

  7. arXiv:1609.04388

    stat.ML

    Relativistic Monte Carlo

    Authors: Xiaoyu Lu, Valerio Perrone, Leonard Hasenclever, Yee Whye Teh, Sebastian J. Vollmer

    Abstract: Hamiltonian Monte Carlo (HMC) is a popular Markov chain Monte Carlo (MCMC) algorithm that generates proposals for a Metropolis-Hastings algorithm by simulating the dynamics of a Hamiltonian system. However, HMC is sensitive to large time discretizations and performs poorly if there is a mismatch between the spatial geometry of the target distribution and the scales of the momentum distribution. In…

    Submitted 14 September, 2016; originally announced September 2016.

  8. arXiv:1609.00691

    stat.CO math.PR

    Multilevel Monte Carlo for Reliability Theory

    Authors: Louis J. M. Aslett, Tigran Nagapetyan, Sebastian J. Vollmer

    Abstract: As the size of engineered systems grows, problems in reliability theory can become computationally challenging, often due to the combinatorial growth in the cut sets. In this paper we demonstrate how Multilevel Monte Carlo (MLMC) - a simulation approach which is typically used for stochastic differential equation models - can be applied in reliability problems by carefully controlling the bias-var…

    Submitted 11 March, 2017; v1 submitted 1 September, 2016; originally announced September 2016.

  9. arXiv:1510.02451

    stat.ME math.ST

    The Bouncy Particle Sampler: A Non-Reversible Rejection-Free Markov Chain Monte Carlo Method

    Authors: Alexandre Bouchard-Côté, Sebastian J. Vollmer, Arnaud Doucet

    Abstract: Markov chain Monte Carlo methods have become standard tools in statistics to sample from complex probability measures. Many available techniques rely on discrete-time reversible Markov chains whose transition kernels build up over the Metropolis-Hastings algorithm. We explore and propose several original extensions of an alternative approach introduced recently in Peters and de With (2012) where t…

    Submitted 17 February, 2017; v1 submitted 8 October, 2015; originally announced October 2015.

    Comments: 42 pages, 15 figures, reference in abstract is to arXiv:1112.1263v3

  10. arXiv:1501.00438

    stat.ME math.ST stat.ML

    (Non-) asymptotic properties of Stochastic Gradient Langevin Dynamics

    Authors: Sebastian J. Vollmer, Konstantinos C. Zygalakis, Yee Whye Teh

    Abstract: Applying standard Markov chain Monte Carlo (MCMC) algorithms to large data sets is computationally infeasible. The recently proposed stochastic gradient Langevin dynamics (SGLD) method circumvents this problem in three ways: it generates proposed moves using only a subset of the data, it skips the Metropolis-Hastings accept-reject step, and it uses sequences of decreasing step sizes. In \cite{TehT…

    Submitted 21 September, 2015; v1 submitted 2 January, 2015; originally announced January 2015.

    Comments: 42 pages, 7 figures

    MSC Class: 60J05; 65C05

  11. arXiv:1411.7713

    stat.ME math.ST

    Unbiased Monte Carlo: posterior estimation for intractable/infinite-dimensional models

    Authors: Sergios Agapiou, Gareth O. Roberts, Sebastian J. Vollmer

    Abstract: We provide a general methodology for unbiased estimation for intractable stochastic models. We consider situations where the target distribution can be written as an appropriate limit of distributions, and where conventional approaches require truncation of such a representation leading to a systematic bias. For example, the target distribution might be representable as the $L^2$-limit of a basis…

    Submitted 27 November, 2014; originally announced November 2014.

    Comments: 74 pages, 9 figures

    MSC Class: 60J05; 60J22; 62M05