Skip to main content

Showing 1–50 of 56 results for author: Osborne, M A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2404.12219  [pdf, other

    cs.LG math.NA stat.ML

    A Quadrature Approach for General-Purpose Batch Bayesian Optimization via Probabilistic Lifting

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Saad Hamid, Harald Oberhauser, Michael A. Osborne

    Abstract: Parallelisation in Bayesian optimisation is a common strategy but faces several challenges: the need for flexibility in acquisition functions and kernel choices, flexibility dealing with discrete and continuous variables simultaneously, model misspecification, and lastly fast massive parallelisation. To address these challenges, we introduce a versatile and modular framework for batch Bayesian opt… ▽ More

    Submitted 19 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: This work is the journal extension of the workshop paper (arXiv:2301.11832) and AISTATS paper (arXiv:2306.05843). 48 pages, 11 figures

    MSC Class: 62C10; 62F15

  2. arXiv:2402.01632  [pdf, other

    cs.LG stat.ML

    Beyond Lengthscales: No-regret Bayesian Optimisation With Unknown Hyperparameters Of Any Type

    Authors: Juliusz Ziomek, Masaki Adachi, Michael A. Osborne

    Abstract: Bayesian optimisation requires fitting a Gaussian process model, which in turn requires specifying hyperparameters - most of the theoretical literature assumes those hyperparameters are known. The commonly used maximum likelihood estimator for hyperparameters of the Gaussian process is consistent only if the data fills the space uniformly, which does not have to be the case in Bayesian optimisatio… ▽ More

    Submitted 13 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  3. arXiv:2402.00809  [pdf, other

    cs.LG stat.ML

    Position: Bayesian Deep Learning is Needed in the Age of Large-Scale AI

    Authors: Theodore Papamarkou, Maria Skoularidou, Konstantina Palla, Laurence Aitchison, Julyan Arbel, David Dunson, Maurizio Filippone, Vincent Fortuin, Philipp Hennig, José Miguel Hernández-Lobato, Aliaksandr Hubin, Alexander Immer, Theofanis Karaletsos, Mohammad Emtiyaz Khan, Agustinus Kristiadi, Yingzhen Li, Stephan Mandt, Christopher Nemeth, Michael A. Osborne, Tim G. J. Rudner, David Rügamer, Yee Whye Teh, Max Welling, Andrew Gordon Wilson, Ruqi Zhang

    Abstract: In the current landscape of deep learning research, there is a predominant emphasis on achieving high predictive accuracy in supervised tasks involving large image and language datasets. However, a broader perspective reveals a multitude of overlooked metrics, tasks, and data types, such as uncertainty, active and continual learning, and scientific data, that demand attention. Bayesian deep learni… ▽ More

    Submitted 2 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  4. arXiv:2310.17273  [pdf, other

    cs.LG cs.HC stat.ML

    Loo** in the Human Collaborative and Explainable Bayesian Optimization

    Authors: Masaki Adachi, Brady Planden, David A. Howey, Michael A. Osborne, Sebastian Orbell, Natalia Ares, Krikamol Muandet, Siu Lun Chau

    Abstract: Like many optimizers, Bayesian optimization often falls short of gaining user trust due to opacity. While attempts have been made to develop human-centric optimizers, they typically assume user knowledge is well-specified and error-free, employing users mainly as supervisors of the optimization process. We relax these assumptions and propose a more balanced human-AI partnership with our Collaborat… ▽ More

    Submitted 29 February, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted at AISTATS 2024, 24 pages, 11 figures

    MSC Class: 62C10; 62F15

  5. arXiv:2306.05843  [pdf, other

    cs.LG cs.AI math.NA stat.CO stat.ML

    Adaptive Batch Sizes for Active Learning A Probabilistic Numerics Approach

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Xingchen Wan, Vu Nguyen, Harald Oberhauser, Michael A. Osborne

    Abstract: Active learning parallelization is widely used, but typically relies on fixing the batch size throughout experimentation. This fixed approach is inefficient because of a dynamic trade-off between cost and speed -- larger batches are more costly, smaller batches lead to slower wall-clock run-times -- and the trade-off may change over the run (larger batches are often preferable earlier). To address… ▽ More

    Submitted 21 February, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted at AISTATS 2024. 33 pages, 6 figures

    MSC Class: 62C10; 62F15

  6. arXiv:2306.05304  [pdf, other

    cs.LG cs.AI stat.ML

    Bayesian Optimisation of Functions on Graphs

    Authors: Xingchen Wan, Pierre Osselin, Henry Kenlay, Binxin Ru, Michael A. Osborne, Xiaowen Dong

    Abstract: The increasing availability of graph-structured data motivates the task of optimising over functions defined on the node set of graphs. Traditional graph search algorithms can be applied in this case, but they may be sample-inefficient and do not make use of information about the function values; on the other hand, Bayesian optimisation is a class of promising black-box solvers with superior sampl… ▽ More

    Submitted 29 October, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023. 11 pages, 11 figures, 1 table (29 pages, 31 figures, 1 table including references and appendices)

  7. arXiv:2301.11832  [pdf, other

    cs.LG math.NA stat.CO stat.ML

    SOBER: Highly Parallel Bayesian Optimization and Bayesian Quadrature over Discrete and Mixed Spaces

    Authors: Masaki Adachi, Satoshi Hayakawa, Saad Hamid, Martin Jørgensen, Harald Oberhauser, Micheal A. Osborne

    Abstract: Batch Bayesian optimisation and Bayesian quadrature have been shown to be sample-efficient methods of performing optimisation and quadrature where expensive-to-evaluate objective functions can be queried in parallel. However, current methods do not scale to large batch sizes -- a frequent desideratum in practice (e.g. drug discovery or simulation-based inference). We present a novel algorithm, SOB… ▽ More

    Submitted 5 July, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: 34 pages, 12 figures

    MSC Class: 62C10; 62F15

  8. arXiv:2212.13936  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    On Pathologies in KL-Regularized Reinforcement Learning from Expert Demonstrations

    Authors: Tim G. J. Rudner, Cong Lu, Michael A. Osborne, Yarin Gal, Yee Whye Teh

    Abstract: KL-regularized reinforcement learning from expert demonstrations has proved successful in improving the sample efficiency of deep reinforcement learning algorithms, allowing them to be applied to challenging physical real-world tasks. However, we show that KL-regularized reinforcement learning with behavioral reference policies derived from expert demonstrations can suffer from pathological traini… ▽ More

    Submitted 28 December, 2022; originally announced December 2022.

    Comments: Published in Advances in Neural Information Processing Systems 34 (NeurIPS 2021)

  9. arXiv:2210.17299  [pdf, other

    stat.ME cs.LG eess.SY physics.chem-ph

    Bayesian Model Selection of Lithium-Ion Battery Models via Bayesian Quadrature

    Authors: Masaki Adachi, Yannick Kuhn, Birger Horstmann, Arnulf Latz, Michael A. Osborne, David A. Howey

    Abstract: A wide variety of battery models are available, and it is not always obvious which model `best' describes a dataset. This paper presents a Bayesian model selection approach using Bayesian quadrature. The model evidence is adopted as the selection metric, choosing the simplest model that describes the data, in the spirit of Occam's razor. However, estimating this requires integral computations over… ▽ More

    Submitted 5 April, 2023; v1 submitted 28 October, 2022; originally announced October 2022.

    Comments: 11 pages, 2 figures, accepted at IFAC2023

    MSC Class: 62C10; 62F15

    Journal ref: IFAC-PapersOnLine, 56, 10521, 2023

  10. arXiv:2210.10199  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Bayesian Optimization over Discrete and Mixed Spaces via Probabilistic Reparameterization

    Authors: Samuel Daulton, Xingchen Wan, David Eriksson, Maximilian Balandat, Michael A. Osborne, Eytan Bakshy

    Abstract: Optimizing expensive-to-evaluate black-box functions of discrete (and potentially continuous) design parameters is a ubiquitous problem in scientific and engineering applications. Bayesian optimization (BO) is a popular, sample-efficient method that leverages a probabilistic surrogate model and an acquisition function (AF) to select promising designs to evaluate. However, maximizing the AF over mi… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: To appear in Advances in Neural Information Processing Systems 35, 2022. Code available at: https://github.com/facebookresearch/bo_pr

  11. arXiv:2209.00343  [pdf, other

    stat.ML cs.LG

    Bézier Gaussian Processes for Tall and Wide Data

    Authors: Martin Jørgensen, Michael A. Osborne

    Abstract: Modern approximations to Gaussian processes are suitable for "tall data", with a cost that scales well in the number of observations, but under-performs on ``wide data'', scaling poorly in the number of input features. That is, as the number of input features grows, good predictive performance requires the number of summarising variables, and their associated cost, to grow rapidly. We introduce a… ▽ More

    Submitted 13 October, 2022; v1 submitted 1 September, 2022; originally announced September 2022.

  12. arXiv:2206.04779  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations

    Authors: Cong Lu, Philip J. Ball, Tim G. J. Rudner, Jack Parker-Holder, Michael A. Osborne, Yee Whye Teh

    Abstract: Offline reinforcement learning has shown great promise in leveraging large pre-collected datasets for policy learning, allowing agents to forgo often-expensive online data collection. However, offline reinforcement learning from visual observations with continuous action spaces remains under-explored, with a limited understanding of the key challenges in this complex domain. In this paper, we esta… ▽ More

    Submitted 6 July, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: Published at TMLR, 2023

  13. arXiv:2206.04734  [pdf, other

    cs.LG math.NA stat.CO stat.ML

    Fast Bayesian Inference with Batch Bayesian Quadrature via Kernel Recombination

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Harald Oberhauser, Michael A. Osborne

    Abstract: Calculation of Bayesian posteriors and model evidences typically requires numerical integration. Bayesian quadrature (BQ), a surrogate-model-based approach to numerical integration, is capable of superb sample efficiency, but its lack of parallelisation has hindered its practical applications. In this work, we propose a parallelised (batch) BQ method, employing techniques from kernel quadrature, t… ▽ More

    Submitted 27 January, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 38 pages, 6 figures

    MSC Class: 62C10; 62F15

    Journal ref: NeurIPS 35, 16533--16547 (2022)

  14. arXiv:2202.07549  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Robust Multi-Objective Bayesian Optimization Under Input Noise

    Authors: Samuel Daulton, Sait Cakmak, Maximilian Balandat, Michael A. Osborne, Enlu Zhou, Eytan Bakshy

    Abstract: Bayesian optimization (BO) is a sample-efficient approach for tuning design parameters to optimize expensive-to-evaluate, black-box performance metrics. In many manufacturing processes, the design parameters are subject to random input noise, resulting in a product that is often less performant than expected. Although BO methods have been proposed for optimizing a single objective under input nois… ▽ More

    Submitted 3 June, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: To appear at ICML 2022. 36 pages. Code is available at https://github.com/facebookresearch/robust_mobo

  15. arXiv:2111.02842  [pdf, other

    stat.ML cs.AI cs.CR cs.LG

    Adversarial Attacks on Graph Classification via Bayesian Optimisation

    Authors: Xingchen Wan, Henry Kenlay, Binxin Ru, Arno Blaas, Michael A. Osborne, Xiaowen Dong

    Abstract: Graph neural networks, a popular class of models effective in a wide range of graph-based learning tasks, have been shown to be vulnerable to adversarial attacks. While the majority of the literature focuses on such vulnerability in node-level classification tasks, little effort has been dedicated to analysing adversarial attacks on graph-level classification, an important problem with numerous re… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021. 11 pages, 8 figures, 2 tables (24 pages, 17 figures, 8 tables including references and appendices)

  16. arXiv:2110.12087  [pdf, other

    cs.LG stat.ML

    Gaussian Process Sampling and Optimization with Approximate Upper and Lower Bounds

    Authors: Vu Nguyen, Marc Peter Deisenroth, Michael A. Osborne

    Abstract: Many functions have approximately-known upper and/or lower bounds, potentially aiding the modeling of such functions. In this paper, we introduce Gaussian process models for functions where such bounds are (approximately) known. More specifically, we propose the first use of such bounds to improve Gaussian process (GP) posterior sampling and Bayesian optimization (BO). That is, we transform a GP m… ▽ More

    Submitted 19 October, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: 20 pages

  17. arXiv:2107.01959  [pdf, other

    cs.LG stat.ML

    Universal Approximation of Functions on Sets

    Authors: Edward Wagstaff, Fabian B. Fuchs, Martin Engelcke, Michael A. Osborne, Ingmar Posner

    Abstract: Modelling functions of sets, or equivalently, permutation-invariant functions, is a long-standing challenge in machine learning. Deep Sets is a popular method which is known to be a universal approximator for continuous set functions. We provide a theoretical analysis of Deep Sets which shows that this universal approximation property is only guaranteed if the model's latent space is sufficiently… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: 54 pages, 13 figures

  18. arXiv:2106.07452  [pdf, other

    stat.ML cs.LG

    Marginalising over Stationary Kernels with Bayesian Quadrature

    Authors: Saad Hamid, Sebastian Schulze, Michael A. Osborne, Stephen J. Roberts

    Abstract: Marginalising over families of Gaussian Process kernels produces flexible model classes with well-calibrated uncertainty estimates. Existing approaches require likelihood evaluations of many kernels, rendering them prohibitively expensive for larger datasets. We propose a Bayesian Quadrature scheme to make this marginalisation more efficient and thereby more practical. Through use of the maximum m… ▽ More

    Submitted 15 March, 2023; v1 submitted 14 June, 2021; originally announced June 2021.

  19. arXiv:2102.07188  [pdf, other

    stat.ML cs.LG

    Think Global and Act Local: Bayesian Optimisation over High-Dimensional Categorical and Mixed Search Spaces

    Authors: Xingchen Wan, Vu Nguyen, Huong Ha, Binxin Ru, Cong Lu, Michael A. Osborne

    Abstract: High-dimensional black-box optimisation remains an important yet notoriously challenging problem. Despite the success of Bayesian optimisation methods on continuous domains, domains that are categorical, or that mix continuous and categorical variables, remain challenging. We propose a novel solution -- we combine local optimisation with a tailored kernel design, effectively handling high-dimensio… ▽ More

    Submitted 10 June, 2021; v1 submitted 14 February, 2021; originally announced February 2021.

    Comments: ICML 2021. 9 page, 6 figures (26 pages, 16 figures, 2 tables including references and appendices)

  20. arXiv:2006.07593  [pdf, other

    cs.LG cs.NE stat.ML

    Optimal Transport Kernels for Sequential and Parallel Neural Architecture Search

    Authors: Vu Nguyen, Tam Le, Makoto Yamada, Michael A Osborne

    Abstract: Neural architecture search (NAS) automates the design of deep neural networks. One of the main challenges in searching complex and non-continuous architectures is to compare the similarity of networks that the conventional Euclidean metric may fail to capture. Optimal transport (OT) is resilient to such complex structure by considering the minimal cost for transporting a network into another. Howe… ▽ More

    Submitted 10 June, 2021; v1 submitted 13 June, 2020; originally announced June 2020.

    Comments: 23 pages, camera ready ICML2021

  21. arXiv:1909.09593  [pdf, other

    cs.LG stat.ML

    Bayesian Optimization for Iterative Learning

    Authors: Vu Nguyen, Sebastian Schulze, Michael A Osborne

    Abstract: The performance of deep (reinforcement) learning systems crucially depends on the choice of hyperparameters. Their tuning is notoriously expensive, typically requiring an iterative training process to run for numerous steps to convergence. Traditional tuning algorithms only consider the final performance of hyperparameters acquired after many expensive iterations and ignore intermediate informatio… ▽ More

    Submitted 16 January, 2021; v1 submitted 20 September, 2019; originally announced September 2019.

    Comments: Camera ready NeurIPS 2020

  22. arXiv:1908.08258  [pdf, ps, other

    stat.ML cs.LG

    Adaptive Configuration Oracle for Online Portfolio Selection Methods

    Authors: Favour M. Nyikosa, Michael A. Osborne, Stephen J. Roberts

    Abstract: Financial markets are complex environments that produce enormous amounts of noisy and non-stationary data. One fundamental problem is online portfolio selection, the goal of which is to exploit this data to sequentially select portfolios of assets to achieve positive investment outcomes while managing risks. Various algorithms have been proposed for solving this problem in fields such as finance,… ▽ More

    Submitted 22 August, 2019; originally announced August 2019.

    MSC Class: 62P30 ACM Class: G.3

  23. arXiv:1906.08878  [pdf, other

    stat.ML cs.LG

    Bayesian Optimisation over Multiple Continuous and Categorical Inputs

    Authors: Binxin Ru, Ahsan S. Alvi, Vu Nguyen, Michael A. Osborne, Stephen J Roberts

    Abstract: Efficient optimisation of black-box problems that comprise both continuous and categorical inputs is important, yet poses significant challenges. We propose a new approach, Continuous and Categorical Bayesian Optimisation (CoCaBO), which combines the strengths of multi-armed bandits and Bayesian optimisation to select values for both categorical and continuous inputs. We model this mixed-type spac… ▽ More

    Submitted 9 August, 2020; v1 submitted 20 June, 2019; originally announced June 2019.

    Comments: 16 pages

  24. arXiv:1905.02685  [pdf, other

    stat.ML cs.LG

    Knowing The What But Not The Where in Bayesian Optimization

    Authors: Vu Nguyen, Michael A. Osborne

    Abstract: Bayesian optimization has demonstrated impressive success in finding the optimum input x* and output f* = f(x*) = max f(x) of a black-box function f. In some applications, however, the optimum output f* is known in advance and the goal is to find the corresponding optimum input x*. In this paper, we consider a new setting in BO in which the knowledge of the optimum output f* is available. Our goal… ▽ More

    Submitted 14 August, 2020; v1 submitted 7 May, 2019; originally announced May 2019.

    Comments: 16 pages

    Journal ref: International Conference on Machine Learning (ICML) 2020

  25. arXiv:1902.09724  [pdf, other

    cs.LG stat.ML

    Automated Model Selection with Bayesian Quadrature

    Authors: Henry Chai, Jean-Francois Ton, Roman Garnett, Michael A. Osborne

    Abstract: We present a novel technique for tailoring Bayesian quadrature (BQ) to model selection. The state-of-the-art for comparing the evidence of multiple models relies on Monte Carlo methods, which converge slowly and are unreliable for computationally expensive models. Previous research has shown that BQ offers sample efficiency superior to Monte Carlo in computing the evidence of an individual model.… ▽ More

    Submitted 1 March, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

    Comments: 10 pages, 5 figures. Currently in submission to ICML 2019

  26. arXiv:1902.08480  [pdf, other

    cs.LG math.DS stat.ML

    AReS and MaRS - Adversarial and MMD-Minimizing Regression for SDEs

    Authors: Gabriele Abbati, Philippe Wenk, Michael A Osborne, Andreas Krause, Bernhard Schölkopf, Stefan Bauer

    Abstract: Stochastic differential equations are an important modeling class in many disciplines. Consequently, there exist many methods relying on various discretization and numerical integration schemes. In this paper, we propose a novel, probabilistic model for estimating the drift and diffusion given noisy observations of the underlying stochastic system. Using state-of-the-art adversarial and moment mat… ▽ More

    Submitted 28 May, 2019; v1 submitted 22 February, 2019; originally announced February 2019.

    Comments: Published at the Thirty-sixth International Conference on Machine Learning (ICML 2019)

  27. arXiv:1902.06278  [pdf, other

    cs.LG math.DS stat.ML

    ODIN: ODE-Informed Regression for Parameter and State Inference in Time-Continuous Dynamical Systems

    Authors: Philippe Wenk, Gabriele Abbati, Michael A Osborne, Bernhard Schölkopf, Andreas Krause, Stefan Bauer

    Abstract: Parameter inference in ordinary differential equations is an important problem in many applied sciences and in engineering, especially in a data-scarce setting. In this work, we introduce a novel generative modeling approach based on constrained Gaussian processes and leverage it to build a computationally and data efficient algorithm for state and parameter inference. In an extensive set of exper… ▽ More

    Submitted 5 December, 2019; v1 submitted 17 February, 2019; originally announced February 2019.

    Comments: Published at the Thirty-fourth AAAI Conference on Artificial Intelligence

  28. arXiv:1901.10452  [pdf, other

    stat.ML cs.AI cs.LG

    Asynchronous Batch Bayesian Optimisation with Improved Local Penalisation

    Authors: Ahsan S. Alvi, Binxin Ru, Jan Calliess, Stephen J. Roberts, Michael A. Osborne

    Abstract: Batch Bayesian optimisation (BO) has been successfully applied to hyperparameter tuning using parallel computing, but it is wasteful of resources: workers that complete jobs ahead of others are left idle. We address this problem by develo** an approach, Penalising Locally for Asynchronous Bayesian Optimisation on $k$ workers (PLAyBOOK), for asynchronous parallel BO. We demonstrate empirically th… ▽ More

    Submitted 28 May, 2019; v1 submitted 29 January, 2019; originally announced January 2019.

    Comments: Camera-ready version after incorporating reviewers' suggestions

  29. arXiv:1811.10275  [pdf, ps, other

    stat.CO cs.LG math.NA stat.ML

    Rejoinder for "Probabilistic Integration: A Role in Statistical Computation?"

    Authors: Francois-Xavier Briol, Chris J. Oates, Mark Girolami, Michael A. Osborne, Dino Sejdinovic

    Abstract: This article is the rejoinder for the paper "Probabilistic Integration: A Role in Statistical Computation?" to appear in Statistical Science with discussion. We would first like to thank the reviewers and many of our colleagues who helped shape this paper, the editor for selecting our paper for discussion, and of course all of the discussants for their thoughtful, insightful and constructive comme… ▽ More

    Submitted 26 November, 2018; originally announced November 2018.

    Comments: Accepted to Statistical Science

  30. Battery health prediction under generalized conditions using a Gaussian process transition model

    Authors: Robert R. Richardson, Michael A. Osborne, David A. Howey

    Abstract: Accurately predicting the future health of batteries is necessary to ensure reliable operation, minimise maintenance costs, and calculate the value of energy storage investments. The complex nature of degradation renders data-driven approaches a promising alternative to mechanistic modelling. This study predicts the changes in battery capacity over time using a Bayesian non-parametric approach bas… ▽ More

    Submitted 17 July, 2018; originally announced July 2018.

  31. arXiv:1805.10662  [pdf, other

    cs.LG cs.AI stat.ML

    Fingerprint Policy Optimisation for Robust Reinforcement Learning

    Authors: Supratik Paul, Michael A. Osborne, Shimon Whiteson

    Abstract: Policy gradient methods ignore the potential value of adjusting environment variables: unobservable state features that are randomly determined by the environment in a physical setting, but are controllable in a simulator. This can lead to slow learning, or convergence to suboptimal policies, if the environment variable has a large impact on the transition dynamics. In this paper, we present finge… ▽ More

    Submitted 27 May, 2019; v1 submitted 27 May, 2018; originally announced May 2018.

    Comments: ICML 2019

  32. arXiv:1805.08610  [pdf, other

    stat.ML cs.LG

    Optimization, fast and slow: optimally switching between local and Bayesian optimization

    Authors: Mark McLeod, Michael A. Osborne, Stephen J. Roberts

    Abstract: We develop the first Bayesian Optimization algorithm, BLOSSOM, which selects between multiple alternative acquisition functions and traditional local optimization at each step. This is combined with a novel stop** condition based on expected regret. This pairing allows us to obtain the best characteristics of both local and Bayesian optimization, making efficient use of function evaluations whil… ▽ More

    Submitted 22 May, 2018; originally announced May 2018.

  33. arXiv:1803.10520  [pdf, ps, other

    quant-ph cs.LG stat.ML

    Quantum algorithms for training Gaussian Processes

    Authors: Zhikuan Zhao, Jack K. Fitzsimons, Michael A. Osborne, Stephen J. Roberts, Joseph F. Fitzsimons

    Abstract: Gaussian processes (GPs) are important models in supervised machine learning. Training in Gaussian processes refers to selecting the covariance functions and the associated parameters in order to improve the outcome of predictions, the core of which amounts to evaluating the logarithm of the marginal likelihood (LML) of a given model. LML gives a concrete measure of the quality of prediction that… ▽ More

    Submitted 28 March, 2018; originally announced March 2018.

    Comments: 5 pages. Comments welcome

    Journal ref: Phys. Rev. A 100, 012304 (2019)

  34. arXiv:1803.03432  [pdf, other

    stat.ML

    Bayesian Optimization for Dynamic Problems

    Authors: Favour M. Nyikosa, Michael A. Osborne, Stephen J. Roberts

    Abstract: We propose practical extensions to Bayesian optimization for solving dynamic problems. We model dynamic objective functions using spatiotemporal Gaussian process priors which capture all the instances of the functions over time. Our extensions to Bayesian optimization use the information learnt from this model to guide the tracking of a temporally evolving minimum. By exploiting temporal correlati… ▽ More

    Submitted 9 March, 2018; originally announced March 2018.

    Comments: 10 pages, 4 figures

  35. Gaussian Process Regression for In-situ Capacity Estimation of Lithium-ion Batteries

    Authors: Robert R. Richardson, Christoph R. Birkl, Michael A. Osborne, David A. Howey

    Abstract: Accurate on-board capacity estimation is of critical importance in lithium-ion battery applications. Battery charging/discharging often occurs under a constant current load, and hence voltage vs. time measurements under this condition may be accessible in practice. This paper presents a data-driven diagnostic technique, Gaussian Process regression for In-situ Capacity Estimation (GP-ICE), which es… ▽ More

    Submitted 18 December, 2017; v1 submitted 7 December, 2017; originally announced December 2017.

    Comments: 12 pages, 10 figures, submitted to IEEE Transactions on Industrial Informatics

    Report number: TII-17-1314 MSC Class: 62P30 ACM Class: J.2; G.3

  36. arXiv:1711.00673  [pdf, other

    stat.ML

    Fast Information-theoretic Bayesian Optimisation

    Authors: Binxin Ru, Mark McLeod, Diego Granziol, Michael A. Osborne

    Abstract: Information-theoretic Bayesian optimisation techniques have demonstrated state-of-the-art performance in tackling important global optimisation problems. However, current information-theoretic approaches require many approximations in implementation, introduce often-prohibitive computational overhead and limit the choice of kernels available to model the objective. We develop a fast information-th… ▽ More

    Submitted 6 June, 2018; v1 submitted 2 November, 2017; originally announced November 2017.

    Comments: Main Paper: 9 pages, 6 figures, 2 tables; Accepted by ICML 2018

  37. arXiv:1707.04314  [pdf, other

    stat.ML cs.AI cs.PL stat.CO

    Bayesian Optimization for Probabilistic Programs

    Authors: Tom Rainforth, Tuan Anh Le, Jan-Willem van de Meent, Michael A. Osborne, Frank Wood

    Abstract: We present the first general purpose framework for marginal maximum a posteriori estimation of probabilistic program variables. By using a series of code transformations, the evidence of any probabilistic program, and therefore of any graphical model, can be optimized with respect to an arbitrary subset of its sampled variables. To carry out this optimization, we develop the first Bayesian optimiz… ▽ More

    Submitted 13 July, 2017; originally announced July 2017.

  38. arXiv:1707.04191  [pdf, other

    stat.ML

    Distributionally Ambiguous Optimization Techniques for Batch Bayesian Optimization

    Authors: Nikitas Rontsis, Michael A. Osborne, Paul J. Goulart

    Abstract: We propose a novel, theoretically-grounded, acquisition function for Batch Bayesian optimization informed by insights from distributionally ambiguous optimization. Our acquisition function is a lower bound on the well-known Expected Improvement function, which requires evaluation of a Gaussian Expectation over a multivariate piecewise affine function. Our bound is computed instead by evaluating th… ▽ More

    Submitted 16 April, 2018; v1 submitted 13 July, 2017; originally announced July 2017.

  39. arXiv:1705.00891  [pdf, ps, other

    stat.ML cs.CE q-fin.ST

    A Novel Approach to Forecasting Financial Volatility with Gaussian Process Envelopes

    Authors: Syed Ali Asad Rizvi, Stephen J. Roberts, Michael A. Osborne, Favour Nyikosa

    Abstract: In this paper we use Gaussian Process (GP) regression to propose a novel approach for predicting volatility of financial returns by forecasting the envelopes of the time series. We provide a direct comparison of their performance to traditional approaches such as GARCH. We compare the forecasting power of three approaches: GP regression on the absolute and squared returns; regression on the envelo… ▽ More

    Submitted 2 May, 2017; originally announced May 2017.

    Comments: 16 pages, 8 figures, 6 tables

  40. arXiv:1703.08031  [pdf, other

    stat.ML

    Distribution of Gaussian Process Arc Lengths

    Authors: Justin D. Bewsher, Alessandra Tosi, Michael A. Osborne, Stephen J. Roberts

    Abstract: We present the first treatment of the arc length of the Gaussian Process (GP) with more than a single output dimension. GPs are commonly used for tasks such as trajectory modelling, where path length is a crucial quantity of interest. Previously, only paths in one dimension have been considered, with no theoretical consideration of higher dimensional problems. We fill the gap in the existing liter… ▽ More

    Submitted 23 March, 2017; originally announced March 2017.

    Comments: 10 pages, 4 figures, Accepted to The 20th International Conference on Artificial Intelligence and Statistics (AISTATS)

  41. Gaussian process regression for forecasting battery state of health

    Authors: Robert R. Richardson, Michael A. Osborne, David A. Howey

    Abstract: Accurately predicting the future capacity and remaining useful life of batteries is necessary to ensure reliable system operation and to minimise maintenance costs. The complex nature of battery degradation has meant that mechanistic modelling of capacity fade has thus far remained intractable; however, with the advent of cloud-connected devices, data from cells in various applications is becoming… ▽ More

    Submitted 31 May, 2017; v1 submitted 16 March, 2017; originally announced March 2017.

    Comments: 13 pages, 7 figures, published in the Journal of Power Sources, 2017

    MSC Class: 62P30 ACM Class: J.2; G.3

    Journal ref: Journal of Power Sources, Volume 357, 31 July 2017, Pages 209 to 219

  42. arXiv:1703.04335  [pdf, other

    stat.ML

    Practical Bayesian Optimization for Variable Cost Objectives

    Authors: Mark McLeod, Michael A. Osborne, Stephen J. Roberts

    Abstract: We propose a novel Bayesian Optimization approach for black-box functions with an environmental variable whose value determines the tradeoff between evaluation cost and the fidelity of the evaluations. Further, we use a novel approach to sampling support points, allowing faster construction of the acquisition function. This allows us to achieve optimization with lower overheads than previous appro… ▽ More

    Submitted 15 May, 2018; v1 submitted 13 March, 2017; originally announced March 2017.

    Comments: 8 pages, 7 figures

  43. arXiv:1605.07496  [pdf, other

    cs.LG cs.AI stat.ML

    Alternating Optimisation and Quadrature for Robust Control

    Authors: Supratik Paul, Konstantinos Chatzilygeroudis, Kamil Ciosek, Jean-Baptiste Mouret, Michael A. Osborne, Shimon Whiteson

    Abstract: Bayesian optimisation has been successfully applied to a variety of reinforcement learning problems. However, the traditional approach for learning optimal policies in simulators does not utilise the opportunity to improve learning by adjusting certain environment variables: state features that are unobservable and randomly determined by the environment in a physical setting but are controllable i… ▽ More

    Submitted 18 December, 2017; v1 submitted 24 May, 2016; originally announced May 2016.

    Comments: To appear in AAAI 2018. Video of policy learnt in simulation deployed on a real hexapod see https://youtu.be/ME90xtIPsKk

  44. arXiv:1602.06693  [pdf, other

    stat.ML stat.CO stat.ME

    Preconditioning Kernel Matrices

    Authors: Kurt Cutajar, Michael A. Osborne, John P. Cunningham, Maurizio Filippone

    Abstract: The computational and storage complexity of kernel machines presents the primary barrier to their scaling to large, modern, datasets. A common way to tackle the scalability issue is to use the conjugate gradient algorithm, which relieves the constraints on both storage (the kernel matrix need not be stored) and computation (both stochastic gradients and parallelization can be used). Even so, conju… ▽ More

    Submitted 25 May, 2016; v1 submitted 22 February, 2016; originally announced February 2016.

  45. arXiv:1512.00933  [pdf, other

    stat.ML math.NA math.ST stat.CO

    Probabilistic Integration: A Role in Statistical Computation?

    Authors: François-Xavier Briol, Chris. J. Oates, Mark Girolami, Michael A. Osborne, Dino Sejdinovic

    Abstract: A research frontier has emerged in scientific computation, wherein numerical error is regarded as a source of epistemic uncertainty that can be modelled. This raises several statistical challenges, including the design of statistical methods that enable the coherent propagation of probabilities through a (possibly deterministic) computational work-flow. This paper examines the case for probabilist… ▽ More

    Submitted 18 October, 2017; v1 submitted 2 December, 2015; originally announced December 2015.

    Comments: Several improvements suggested by reviewers, including additional experiments on uncertainty quantification properties. Change of title: previously "Probabilistic Integration: A Role for Statisticians in Numerical Analysis?"

  46. arXiv:1510.07965  [pdf, other

    stat.ML

    Blitzkriging: Kronecker-structured Stochastic Gaussian Processes

    Authors: Thomas Nickson, Tom Gunter, Chris Lloyd, Michael A Osborne, Stephen Roberts

    Abstract: We present Blitzkriging, a new approach to fast inference for Gaussian processes, applicable to regression, optimisation and classification. State-of-the-art (stochastic) inference for Gaussian processes on very large datasets scales cubically in the number of 'inducing inputs', variables introduced to factorise the model. Blitzkriging shares state-of-the-art scaling with data, but reduces the sca… ▽ More

    Submitted 31 October, 2015; v1 submitted 27 October, 2015; originally announced October 2015.

  47. arXiv:1509.02438  [pdf, other

    stat.ML

    A Variational Bayesian State-Space Approach to Online Passive-Aggressive Regression

    Authors: Arnold Salas, Stephen J. Roberts, Michael A. Osborne

    Abstract: Online Passive-Aggressive (PA) learning is a class of online margin-based algorithms suitable for a wide range of real-time prediction tasks, including classification and regression. PA algorithms are formulated in terms of deterministic point-estimation problems governed by a set of user-defined hyperparameters: the approach fails to capture model/prediction uncertainty and makes their performanc… ▽ More

    Submitted 8 September, 2015; originally announced September 2015.

  48. arXiv:1506.02681  [pdf, other

    stat.ML

    Frank-Wolfe Bayesian Quadrature: Probabilistic Integration with Theoretical Guarantees

    Authors: François-Xavier Briol, Chris J. Oates, Mark Girolami, Michael A. Osborne

    Abstract: There is renewed interest in formulating integration as an inference problem, motivated by obtaining a full distribution over numerical error that can be propagated through subsequent computation. Current methods, such as Bayesian Quadrature, demonstrate impressive empirical performance but lack theoretical analysis. An important challenge is to reconcile these probabilistic integrators with rigor… ▽ More

    Submitted 6 December, 2015; v1 submitted 8 June, 2015; originally announced June 2015.

    Journal ref: Advances in Neural Information Processing Systems 28, 1162--1170, 2015

  49. arXiv:1506.01326  [pdf, other

    math.NA cs.AI cs.LG stat.CO stat.ML

    Probabilistic Numerics and Uncertainty in Computations

    Authors: Philipp Hennig, Michael A Osborne, Mark Girolami

    Abstract: We deliver a call to arms for probabilistic numerical methods: algorithms for numerical tasks, including linear algebra, integration, optimization and solving differential equations, that return uncertainties in their calculations. Such uncertainties, arising from the loss of precision induced by numerical calculation with limited time or hardware, are important for much contemporary science and i… ▽ More

    Submitted 3 June, 2015; originally announced June 2015.

    Comments: Author Generated Postprint. 17 pages, 4 Figures, 1 Table

  50. arXiv:1411.0439  [pdf, other

    stat.ML

    Sampling for Inference in Probabilistic Models with Fast Bayesian Quadrature

    Authors: Tom Gunter, Michael A. Osborne, Roman Garnett, Philipp Hennig, Stephen J. Roberts

    Abstract: We propose a novel sampling framework for inference in probabilistic models: an active learning approach that converges more quickly (in wall-clock time) than Markov chain Monte Carlo (MCMC) benchmarks. The central challenge in probabilistic inference is numerical integration, to average over ensembles of models or unknown (hyper-)parameters (for example to compute the marginal likelihood or a par… ▽ More

    Submitted 3 November, 2014; originally announced November 2014.

    Journal ref: Advances in Neural Information Processing Systems (NIPS) 2014