Skip to main content

Showing 1–24 of 24 results for author: John, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.03337  [pdf, other

    cs.LG stat.ML

    Identifying latent state transition in non-linear dynamical systems

    Authors: Çağlar Hızlı, Çağatay Yıldız, Matthias Bethge, ST John, Pekka Marttinen

    Abstract: This work aims to improve generalization and interpretability of dynamical systems by recovering the underlying lower-dimensional latent states and their time evolutions. Previous work on disentangled representation learning within the realm of dynamical systems focused on the latent states, possibly with linear transition approximations. As such, they cannot identify nonlinear transition dynamics… ▽ More

    Submitted 6 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  2. arXiv:2311.03129  [pdf, other

    stat.ML cs.LG

    Nonparametric modeling of the composite effect of multiple nutrients on blood glucose dynamics

    Authors: Arina Odnoblyudova, Çağlar Hizli, ST John, Andrea Cognolato, Anne Juuti, Simo Särkkä, Kirsi Pietiläinen, Pekka Marttinen

    Abstract: In biomedical applications it is often necessary to estimate a physiological response to a treatment consisting of multiple components, and learn the separate effects of the components in addition to the joint effect. Here, we extend existing probabilistic nonparametric approaches to explicitly address this problem. We also develop a new convolution-based model for composite treatment-response cur… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  3. arXiv:2310.11527  [pdf, other

    stat.ML cs.LG

    Thin and Deep Gaussian Processes

    Authors: Daniel Augusto de Souza, Alexander Nikitin, ST John, Magnus Ross, Mauricio A. Álvarez, Marc Peter Deisenroth, João P. P. Gomes, Diego Mesquita, César Lincoln C. Mattos

    Abstract: Gaussian processes (GPs) can provide a principled approach to uncertainty quantification with easy-to-interpret kernel hyperparameters, such as the lengthscale, which controls the correlation distance of function values. However, selecting an appropriate kernel can be challenging. Deep GPs avoid manual kernel engineering by successively parameterizing kernels with GP layers, allowing them to learn… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Accepted at the Conference on Neural Information Processing Systems (NeurIPS) 2023

  4. arXiv:2307.03093  [pdf, other

    cs.LG stat.ML

    Beyond Intuition, a Framework for Applying GPs to Real-World Data

    Authors: Kenza Tazi, Jihao Andreas Lin, Ross Viljoen, Alex Gardner, ST John, Hong Ge, Richard E. Turner

    Abstract: Gaussian Processes (GPs) offer an attractive method for regression over small, structured and correlated datasets. However, their deployment is hindered by computational costs and limited guidelines on how to apply GPs beyond simple low-dimensional datasets. We propose a framework to identify the suitability of GPs to a given problem and how to set up a robust and well-specified GP model. The guid… ▽ More

    Submitted 17 July, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: Accepted at the ICML Workshop on Structured Probabilistic Inference and Generative Modelling (2023)

  5. arXiv:2306.10915  [pdf, other

    stat.ML cs.LG

    Practical Equivariances via Relational Conditional Neural Processes

    Authors: Daolang Huang, Manuel Haussmann, Ulpu Remes, ST John, Grégoire Clarté, Kevin Sebastian Luck, Samuel Kaski, Luigi Acerbi

    Abstract: Conditional Neural Processes (CNPs) are a class of metalearning models popular for combining the runtime efficiency of amortized inference with reliable uncertainty quantification. Many relevant machine learning tasks, such as in spatio-temporal modeling, Bayesian Optimization and continuous control, inherently contain equivariances -- for example to translation -- which the model can exploit for… ▽ More

    Submitted 5 November, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 38 pages, 8 figures. Accepted at the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  6. arXiv:2306.09656  [pdf, other

    cs.LG stat.ME

    Temporal Causal Mediation through a Point Process: Direct and Indirect Effects of Healthcare Interventions

    Authors: Çağlar Hızlı, ST John, Anne Juuti, Tuure Saarinen, Kirsi Pietiläinen, Pekka Marttinen

    Abstract: Deciding on an appropriate intervention requires a causal model of a treatment, the outcome, and potential mediators. Causal mediation analysis lets us distinguish between direct and indirect effects of the intervention, but has mostly been studied in a static setting. In healthcare, data come in the form of complex, irregularly sampled time-series, with dynamic interdependencies between a treatme… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

  7. arXiv:2306.04201  [pdf, other

    cs.LG stat.ML

    Improving Hyperparameter Learning under Approximate Inference in Gaussian Process Models

    Authors: Rui Li, ST John, Arno Solin

    Abstract: Approximate inference in Gaussian process (GP) models with non-conjugate likelihoods gets entangled with the learning of the model hyperparameters. We improve hyperparameter learning in GP models and focus on the interplay between variational inference (VI) and the learning target. While VI's lower bound to the marginal likelihood is a suitable objective for inferring the approximate posterior, we… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: International Conference on Machine Learning (ICML) 2023

  8. arXiv:2306.03566  [pdf, other

    cs.LG stat.ML

    Memory-Based Dual Gaussian Processes for Sequential Learning

    Authors: Paul E. Chang, Prakhar Verma, S. T. John, Arno Solin, Mohammad Emtiyaz Khan

    Abstract: Sequential learning with Gaussian processes (GPs) is challenging when access to past data is limited, for example, in continual and active learning. In such cases, errors can accumulate over time due to inaccuracies in the posterior, hyperparameters, and inducing points, making accurate learning challenging. Here, we present a method to keep all such errors in check using the recently proposed dua… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: International Conference on Machine Learning (ICML) 2023

  9. arXiv:2305.14120  [pdf, other

    cs.LG stat.ML

    Learning Relevant Contextual Variables Within Bayesian Optimization

    Authors: Julien Martinelli, Ayush Bharti, Armi Tiihonen, S. T. John, Louis Filstroff, Sabina J. Sloman, Patrick Rinke, Samuel Kaski

    Abstract: Contextual Bayesian Optimization (CBO) efficiently optimizes black-box functions with respect to design variables, while simultaneously integrating contextual information regarding the environment, such as experimental conditions. However, the relevance of contextual variables is not necessarily known beforehand. Moreover, contextual variables can sometimes be optimized themselves at an additional… ▽ More

    Submitted 24 May, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

  10. arXiv:2211.06260  [pdf, other

    cs.LG stat.ML

    Towards Improved Learning in Gaussian Processes: The Best of Two Worlds

    Authors: Rui Li, ST John, Arno Solin

    Abstract: Gaussian process training decomposes into inference of the (approximate) posterior and learning of the hyperparameters. For non-Gaussian (non-conjugate) likelihoods, two common choices for approximate inference are Expectation Propagation (EP) and Variational Inference (VI), which have complementary strengths and weaknesses. While VI's lower bound to the marginal likelihood is a suitable objective… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: In the 2022 NeurIPS Workshop on Gaussian Processes, Spatiotemporal Modeling, and Decision-making Systems

  11. arXiv:2211.01053  [pdf, other

    cs.LG stat.ML

    Fantasizing with Dual GPs in Bayesian Optimization and Active Learning

    Authors: Paul E. Chang, Prakhar Verma, ST John, Victor Picheny, Henry Moss, Arno Solin

    Abstract: Gaussian processes (GPs) are the main surrogate functions used for sequential modelling such as Bayesian Optimization and Active Learning. Their drawbacks are poor scaling with data and the need to run an optimization loop when using a non-Gaussian likelihood. In this paper, we focus on `fantasizing' batch acquisition functions that need the ability to condition on new fantasized data computationa… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: In the 2022 NeurIPS Workshop on Gaussian Processes, Spatiotemporal Modeling, and Decision-making Systems

  12. arXiv:2209.04142  [pdf, other

    cs.LG stat.ME

    Causal Modeling of Policy Interventions From Sequences of Treatments and Outcomes

    Authors: Çağlar Hızlı, ST John, Anne Juuti, Tuure Saarinen, Kirsi Pietiläinen, Pekka Marttinen

    Abstract: A treatment policy defines when and what treatments are applied to affect some outcome of interest. Data-driven decision-making requires the ability to predict what happens if a policy is changed. Existing methods that predict how the outcome evolves under different scenarios assume that the tentative sequences of future treatments are fixed in advance, while in practice the treatments are determi… ▽ More

    Submitted 20 June, 2023; v1 submitted 9 September, 2022; originally announced September 2022.

    Comments: Accepted at ICML 2023

  13. arXiv:2111.08524  [pdf, other

    cs.LG stat.ML

    Non-separable Spatio-temporal Graph Kernels via SPDEs

    Authors: Alexander Nikitin, ST John, Arno Solin, Samuel Kaski

    Abstract: Gaussian processes (GPs) provide a principled and direct approach for inference and learning on graphs. However, the lack of justified graph kernels for spatio-temporal modelling has held back their use in graph problems. We leverage an explicit link between stochastic partial differential equations (SPDEs) and GPs on graphs, introduce a framework for deriving graph kernels via SPDEs, and derive n… ▽ More

    Submitted 22 March, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

  14. arXiv:2104.05674  [pdf, ps, other

    stat.ML cs.LG

    GPflux: A Library for Deep Gaussian Processes

    Authors: Vincent Dutordoir, Hugh Salimbeni, Eric Hambro, John McLeod, Felix Leibfried, Artem Artemev, Mark van der Wilk, James Hensman, Marc P. Deisenroth, ST John

    Abstract: We introduce GPflux, a Python library for Bayesian deep learning with a strong emphasis on deep Gaussian processes (DGPs). Implementing DGPs is a challenging endeavour due to the various mathematical subtleties that arise when dealing with multivariate Gaussian distributions and the complex bookkee** of indices. To date, there are no actively maintained, open-sourced and extendable libraries ava… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

  15. arXiv:2012.13962  [pdf, other

    cs.LG stat.ML

    A Tutorial on Sparse Gaussian Processes and Variational Inference

    Authors: Felix Leibfried, Vincent Dutordoir, ST John, Nicolas Durrande

    Abstract: Gaussian processes (GPs) provide a framework for Bayesian inference that can offer principled uncertainty estimates for a large range of problems. For example, if we consider regression problems with Gaussian likelihoods, a GP model enjoys a posterior in closed form. However, identifying the posterior GP scales cubically with the number of training examples and requires to store all examples in me… ▽ More

    Submitted 18 December, 2022; v1 submitted 27 December, 2020; originally announced December 2020.

  16. arXiv:2003.04125  [pdf, other

    stat.ML cs.LG stat.ME

    Amortized variance reduction for doubly stochastic objectives

    Authors: Ayman Boustati, Sattar Vakili, James Hensman, ST John

    Abstract: Approximate inference in complex probabilistic models such as deep Gaussian processes requires the optimisation of doubly stochastic objective functions. These objectives incorporate randomness both from mini-batch subsampling of the data and from Monte Carlo estimation of expectations. If the gradient variance is high, the stochastic optimisation problem becomes difficult with a slow rate of conv… ▽ More

    Submitted 9 March, 2020; originally announced March 2020.

  17. arXiv:2003.01115  [pdf, other

    stat.ML cs.LG

    A Framework for Interdomain and Multioutput Gaussian Processes

    Authors: Mark van der Wilk, Vincent Dutordoir, ST John, Artem Artemev, Vincent Adam, James Hensman

    Abstract: One obstacle to the use of Gaussian processes (GPs) in large-scale problems, and as a component in deep learning system, is the need for bespoke derivations and implementations for small variations in the model or inference. In order to improve the utility of GPs we need a modular system that allows rapid implementation and testing, as seen in the neural network community. We present a mathematica… ▽ More

    Submitted 2 March, 2020; originally announced March 2020.

  18. arXiv:1911.02549  [pdf, other

    cs.LG cs.PF stat.ML

    MLPerf Inference Benchmark

    Authors: Vijay Janapa Reddi, Christine Cheng, David Kanter, Peter Mattson, Guenther Schmuelling, Carole-Jean Wu, Brian Anderson, Maximilien Breughe, Mark Charlebois, William Chou, Ramesh Chukka, Cody Coleman, Sam Davis, Pan Deng, Greg Diamos, Jared Duke, Dave Fick, J. Scott Gardner, Itay Hubara, Sachin Idgunji, Thomas B. Jablin, Jeff Jiao, Tom St. John, Pankaj Kanwar, David Lee , et al. (22 additional authors not shown)

    Abstract: Machine-learning (ML) hardware and software system demand is burgeoning. Driven by ML applications, the number of different ML inference systems has exploded. Over 100 organizations are building ML inference chips, and the systems that incorporate existing models span at least three orders of magnitude in power consumption and five orders of magnitude in performance; they range from embedded devic… ▽ More

    Submitted 9 May, 2020; v1 submitted 6 November, 2019; originally announced November 2019.

    Comments: ISCA 2020

  19. arXiv:1910.01500  [pdf, other

    cs.LG cs.PF stat.ML

    MLPerf Training Benchmark

    Authors: Peter Mattson, Christine Cheng, Cody Coleman, Greg Diamos, Paulius Micikevicius, David Patterson, Hanlin Tang, Gu-Yeon Wei, Peter Bailis, Victor Bittorf, David Brooks, Dehao Chen, Debojyoti Dutta, Udit Gupta, Kim Hazelwood, Andrew Hock, Xinyuan Huang, Atsushi Ike, Bill Jia, Daniel Kang, David Kanter, Naveen Kumar, Jeffery Liao, Guokai Ma, Deepak Narayanan , et al. (12 additional authors not shown)

    Abstract: Machine learning (ML) needs industry-standard performance benchmarks to support design and competitive evaluation of the many emerging software and hardware solutions for ML. But ML training presents three unique benchmarking challenges absent from other domains: optimizations that improve training throughput can increase the time to solution, training is stochastic and time to solution exhibits h… ▽ More

    Submitted 2 March, 2020; v1 submitted 2 October, 2019; originally announced October 2019.

    Comments: MLSys 2020

  20. arXiv:1902.10974  [pdf, other

    stat.ML cs.LG

    Gaussian Process Modulated Cox Processes under Linear Inequality Constraints

    Authors: Andrés F. López-Lopera, ST John, Nicolas Durrande

    Abstract: Gaussian process (GP) modulated Cox processes are widely used to model point patterns. Existing approaches require a map** (link function) between the unconstrained GP and the positive intensity function. This commonly yields solutions that do not have a closed form or that are restricted to specific covariance functions. We introduce a novel finite approximation of GP-modulated Cox processes wh… ▽ More

    Submitted 28 February, 2019; originally announced February 2019.

  21. arXiv:1812.11106  [pdf, other

    cs.LG stat.ML

    Scalable GAM using sparse variational Gaussian processes

    Authors: Vincent Adam, Nicolas Durrande, ST John

    Abstract: Generalized additive models (GAMs) are a widely used class of models of interest to statisticians as they provide a flexible way to design interpretable models of data beyond linear models. We here propose a scalable and well-calibrated Bayesian treatment of GAMs using Gaussian processes (GPs) and leveraging recent advances in variational inference. We use sparse GPs to represent each component an… ▽ More

    Submitted 28 December, 2018; originally announced December 2018.

    Journal ref: 1st Symposium on Advances in Approximate Bayesian Inference, 2018

  22. arXiv:1808.05563  [pdf, other

    cs.LG stat.ML

    Learning Invariances using the Marginal Likelihood

    Authors: Mark van der Wilk, Matthias Bauer, ST John, James Hensman

    Abstract: Generalising well in supervised learning tasks relies on correctly extrapolating the training data to a large region of the input space. One way to achieve this is to constrain the predictions to be invariant to transformations on the input that are known to be irrelevant (e.g. translation). Commonly, this is done through data augmentation, where the training set is enlarged by applying hand-craft… ▽ More

    Submitted 16 August, 2018; originally announced August 2018.

  23. arXiv:1807.10363  [pdf, other

    physics.comp-ph cs.LG stat.ML

    Message-passing neural networks for high-throughput polymer screening

    Authors: Peter C. St. John, Caleb Phillips, Travis W. Kemper, A. Nolan Wilson, Michael F. Crowley, Mark R. Nimlos, Ross E. Larsen

    Abstract: Machine learning methods have shown promise in predicting molecular properties, and given sufficient training data machine learning approaches can enable rapid high-throughput virtual screening of large libraries of compounds. Graph-based neural network architectures have emerged in recent years as the most successful approach for predictions based on molecular structure, and have consistently ach… ▽ More

    Submitted 5 April, 2019; v1 submitted 26 July, 2018; originally announced July 2018.

    Comments: 7 pages, 3 figures

  24. arXiv:1804.01016  [pdf, other

    stat.ML cs.LG

    Large-Scale Cox Process Inference using Variational Fourier Features

    Authors: S. T. John, James Hensman

    Abstract: Gaussian process modulated Poisson processes provide a flexible framework for modelling spatiotemporal point patterns. So far this had been restricted to one dimension, binning to a pre-determined grid, or small data sets of up to a few thousand data points. Here we introduce Cox process inference based on Fourier features. This sparse representation induces global rather than local constraints on… ▽ More

    Submitted 3 April, 2018; originally announced April 2018.