Skip to main content

Showing 1–50 of 73 results for author: Osborne, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2404.12219  [pdf, other

    cs.LG math.NA stat.ML

    A Quadrature Approach for General-Purpose Batch Bayesian Optimization via Probabilistic Lifting

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Saad Hamid, Harald Oberhauser, Michael A. Osborne

    Abstract: Parallelisation in Bayesian optimisation is a common strategy but faces several challenges: the need for flexibility in acquisition functions and kernel choices, flexibility dealing with discrete and continuous variables simultaneously, model misspecification, and lastly fast massive parallelisation. To address these challenges, we introduce a versatile and modular framework for batch Bayesian opt… ▽ More

    Submitted 19 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: This work is the journal extension of the workshop paper (arXiv:2301.11832) and AISTATS paper (arXiv:2306.05843). 48 pages, 11 figures

    MSC Class: 62C10; 62F15

  2. arXiv:2402.01632  [pdf, other

    cs.LG stat.ML

    Beyond Lengthscales: No-regret Bayesian Optimisation With Unknown Hyperparameters Of Any Type

    Authors: Juliusz Ziomek, Masaki Adachi, Michael A. Osborne

    Abstract: Bayesian optimisation requires fitting a Gaussian process model, which in turn requires specifying hyperparameters - most of the theoretical literature assumes those hyperparameters are known. The commonly used maximum likelihood estimator for hyperparameters of the Gaussian process is consistent only if the data fills the space uniformly, which does not have to be the case in Bayesian optimisatio… ▽ More

    Submitted 13 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  3. arXiv:2402.00809  [pdf, other

    cs.LG stat.ML

    Position: Bayesian Deep Learning is Needed in the Age of Large-Scale AI

    Authors: Theodore Papamarkou, Maria Skoularidou, Konstantina Palla, Laurence Aitchison, Julyan Arbel, David Dunson, Maurizio Filippone, Vincent Fortuin, Philipp Hennig, José Miguel Hernández-Lobato, Aliaksandr Hubin, Alexander Immer, Theofanis Karaletsos, Mohammad Emtiyaz Khan, Agustinus Kristiadi, Yingzhen Li, Stephan Mandt, Christopher Nemeth, Michael A. Osborne, Tim G. J. Rudner, David Rügamer, Yee Whye Teh, Max Welling, Andrew Gordon Wilson, Ruqi Zhang

    Abstract: In the current landscape of deep learning research, there is a predominant emphasis on achieving high predictive accuracy in supervised tasks involving large image and language datasets. However, a broader perspective reveals a multitude of overlooked metrics, tasks, and data types, such as uncertainty, active and continual learning, and scientific data, that demand attention. Bayesian deep learni… ▽ More

    Submitted 2 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  4. arXiv:2310.17273  [pdf, other

    cs.LG cs.HC stat.ML

    Loo** in the Human Collaborative and Explainable Bayesian Optimization

    Authors: Masaki Adachi, Brady Planden, David A. Howey, Michael A. Osborne, Sebastian Orbell, Natalia Ares, Krikamol Muandet, Siu Lun Chau

    Abstract: Like many optimizers, Bayesian optimization often falls short of gaining user trust due to opacity. While attempts have been made to develop human-centric optimizers, they typically assume user knowledge is well-specified and error-free, employing users mainly as supervisors of the optimization process. We relax these assumptions and propose a more balanced human-AI partnership with our Collaborat… ▽ More

    Submitted 29 February, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted at AISTATS 2024, 24 pages, 11 figures

    MSC Class: 62C10; 62F15

  5. arXiv:2306.05843  [pdf, other

    cs.LG cs.AI math.NA stat.CO stat.ML

    Adaptive Batch Sizes for Active Learning A Probabilistic Numerics Approach

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Xingchen Wan, Vu Nguyen, Harald Oberhauser, Michael A. Osborne

    Abstract: Active learning parallelization is widely used, but typically relies on fixing the batch size throughout experimentation. This fixed approach is inefficient because of a dynamic trade-off between cost and speed -- larger batches are more costly, smaller batches lead to slower wall-clock run-times -- and the trade-off may change over the run (larger batches are often preferable earlier). To address… ▽ More

    Submitted 21 February, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted at AISTATS 2024. 33 pages, 6 figures

    MSC Class: 62C10; 62F15

  6. arXiv:2306.05304  [pdf, other

    cs.LG cs.AI stat.ML

    Bayesian Optimisation of Functions on Graphs

    Authors: Xingchen Wan, Pierre Osselin, Henry Kenlay, Binxin Ru, Michael A. Osborne, Xiaowen Dong

    Abstract: The increasing availability of graph-structured data motivates the task of optimising over functions defined on the node set of graphs. Traditional graph search algorithms can be applied in this case, but they may be sample-inefficient and do not make use of information about the function values; on the other hand, Bayesian optimisation is a class of promising black-box solvers with superior sampl… ▽ More

    Submitted 29 October, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023. 11 pages, 11 figures, 1 table (29 pages, 31 figures, 1 table including references and appendices)

  7. arXiv:2303.08874  [pdf, other

    stat.ML cs.LG

    Bayesian Quadrature for Neural Ensemble Search

    Authors: Saad Hamid, Xingchen Wan, Martin Jørgensen, Binxin Ru, Michael Osborne

    Abstract: Ensembling can improve the performance of Neural Networks, but existing approaches struggle when the architecture likelihood surface has dispersed, narrow peaks. Furthermore, existing methods construct equally weighted ensembles, and this is likely to be vulnerable to the failure modes of the weaker architectures. By viewing ensembling as approximately marginalising over architectures we construct… ▽ More

    Submitted 17 March, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

  8. arXiv:2301.11832  [pdf, other

    cs.LG math.NA stat.CO stat.ML

    SOBER: Highly Parallel Bayesian Optimization and Bayesian Quadrature over Discrete and Mixed Spaces

    Authors: Masaki Adachi, Satoshi Hayakawa, Saad Hamid, Martin Jørgensen, Harald Oberhauser, Micheal A. Osborne

    Abstract: Batch Bayesian optimisation and Bayesian quadrature have been shown to be sample-efficient methods of performing optimisation and quadrature where expensive-to-evaluate objective functions can be queried in parallel. However, current methods do not scale to large batch sizes -- a frequent desideratum in practice (e.g. drug discovery or simulation-based inference). We present a novel algorithm, SOB… ▽ More

    Submitted 5 July, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: 34 pages, 12 figures

    MSC Class: 62C10; 62F15

  9. arXiv:2212.13936  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    On Pathologies in KL-Regularized Reinforcement Learning from Expert Demonstrations

    Authors: Tim G. J. Rudner, Cong Lu, Michael A. Osborne, Yarin Gal, Yee Whye Teh

    Abstract: KL-regularized reinforcement learning from expert demonstrations has proved successful in improving the sample efficiency of deep reinforcement learning algorithms, allowing them to be applied to challenging physical real-world tasks. However, we show that KL-regularized reinforcement learning with behavioral reference policies derived from expert demonstrations can suffer from pathological traini… ▽ More

    Submitted 28 December, 2022; originally announced December 2022.

    Comments: Published in Advances in Neural Information Processing Systems 34 (NeurIPS 2021)

  10. arXiv:2210.17299  [pdf, other

    stat.ME cs.LG eess.SY physics.chem-ph

    Bayesian Model Selection of Lithium-Ion Battery Models via Bayesian Quadrature

    Authors: Masaki Adachi, Yannick Kuhn, Birger Horstmann, Arnulf Latz, Michael A. Osborne, David A. Howey

    Abstract: A wide variety of battery models are available, and it is not always obvious which model `best' describes a dataset. This paper presents a Bayesian model selection approach using Bayesian quadrature. The model evidence is adopted as the selection metric, choosing the simplest model that describes the data, in the spirit of Occam's razor. However, estimating this requires integral computations over… ▽ More

    Submitted 5 April, 2023; v1 submitted 28 October, 2022; originally announced October 2022.

    Comments: 11 pages, 2 figures, accepted at IFAC2023

    MSC Class: 62C10; 62F15

    Journal ref: IFAC-PapersOnLine, 56, 10521, 2023

  11. arXiv:2210.10199  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Bayesian Optimization over Discrete and Mixed Spaces via Probabilistic Reparameterization

    Authors: Samuel Daulton, Xingchen Wan, David Eriksson, Maximilian Balandat, Michael A. Osborne, Eytan Bakshy

    Abstract: Optimizing expensive-to-evaluate black-box functions of discrete (and potentially continuous) design parameters is a ubiquitous problem in scientific and engineering applications. Bayesian optimization (BO) is a popular, sample-efficient method that leverages a probabilistic surrogate model and an acquisition function (AF) to select promising designs to evaluate. However, maximizing the AF over mi… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: To appear in Advances in Neural Information Processing Systems 35, 2022. Code available at: https://github.com/facebookresearch/bo_pr

  12. arXiv:2209.00343  [pdf, other

    stat.ML cs.LG

    Bézier Gaussian Processes for Tall and Wide Data

    Authors: Martin Jørgensen, Michael A. Osborne

    Abstract: Modern approximations to Gaussian processes are suitable for "tall data", with a cost that scales well in the number of observations, but under-performs on ``wide data'', scaling poorly in the number of input features. That is, as the number of input features grows, good predictive performance requires the number of summarising variables, and their associated cost, to grow rapidly. We introduce a… ▽ More

    Submitted 13 October, 2022; v1 submitted 1 September, 2022; originally announced September 2022.

  13. arXiv:2206.04779  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations

    Authors: Cong Lu, Philip J. Ball, Tim G. J. Rudner, Jack Parker-Holder, Michael A. Osborne, Yee Whye Teh

    Abstract: Offline reinforcement learning has shown great promise in leveraging large pre-collected datasets for policy learning, allowing agents to forgo often-expensive online data collection. However, offline reinforcement learning from visual observations with continuous action spaces remains under-explored, with a limited understanding of the key challenges in this complex domain. In this paper, we esta… ▽ More

    Submitted 6 July, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: Published at TMLR, 2023

  14. arXiv:2206.04734  [pdf, other

    cs.LG math.NA stat.CO stat.ML

    Fast Bayesian Inference with Batch Bayesian Quadrature via Kernel Recombination

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Harald Oberhauser, Michael A. Osborne

    Abstract: Calculation of Bayesian posteriors and model evidences typically requires numerical integration. Bayesian quadrature (BQ), a surrogate-model-based approach to numerical integration, is capable of superb sample efficiency, but its lack of parallelisation has hindered its practical applications. In this work, we propose a parallelised (batch) BQ method, employing techniques from kernel quadrature, t… ▽ More

    Submitted 27 January, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 38 pages, 6 figures

    MSC Class: 62C10; 62F15

    Journal ref: NeurIPS 35, 16533--16547 (2022)

  15. arXiv:2202.07549  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Robust Multi-Objective Bayesian Optimization Under Input Noise

    Authors: Samuel Daulton, Sait Cakmak, Maximilian Balandat, Michael A. Osborne, Enlu Zhou, Eytan Bakshy

    Abstract: Bayesian optimization (BO) is a sample-efficient approach for tuning design parameters to optimize expensive-to-evaluate, black-box performance metrics. In many manufacturing processes, the design parameters are subject to random input noise, resulting in a product that is often less performant than expected. Although BO methods have been proposed for optimizing a single objective under input nois… ▽ More

    Submitted 3 June, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: To appear at ICML 2022. 36 pages. Code is available at https://github.com/facebookresearch/robust_mobo

  16. arXiv:2111.02842  [pdf, other

    stat.ML cs.AI cs.CR cs.LG

    Adversarial Attacks on Graph Classification via Bayesian Optimisation

    Authors: Xingchen Wan, Henry Kenlay, Binxin Ru, Arno Blaas, Michael A. Osborne, Xiaowen Dong

    Abstract: Graph neural networks, a popular class of models effective in a wide range of graph-based learning tasks, have been shown to be vulnerable to adversarial attacks. While the majority of the literature focuses on such vulnerability in node-level classification tasks, little effort has been dedicated to analysing adversarial attacks on graph-level classification, an important problem with numerous re… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021. 11 pages, 8 figures, 2 tables (24 pages, 17 figures, 8 tables including references and appendices)

  17. arXiv:2110.12087  [pdf, other

    cs.LG stat.ML

    Gaussian Process Sampling and Optimization with Approximate Upper and Lower Bounds

    Authors: Vu Nguyen, Marc Peter Deisenroth, Michael A. Osborne

    Abstract: Many functions have approximately-known upper and/or lower bounds, potentially aiding the modeling of such functions. In this paper, we introduce Gaussian process models for functions where such bounds are (approximately) known. More specifically, we propose the first use of such bounds to improve Gaussian process (GP) posterior sampling and Bayesian optimization (BO). That is, we transform a GP m… ▽ More

    Submitted 19 October, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: 20 pages

  18. arXiv:2107.01959  [pdf, other

    cs.LG stat.ML

    Universal Approximation of Functions on Sets

    Authors: Edward Wagstaff, Fabian B. Fuchs, Martin Engelcke, Michael A. Osborne, Ingmar Posner

    Abstract: Modelling functions of sets, or equivalently, permutation-invariant functions, is a long-standing challenge in machine learning. Deep Sets is a popular method which is known to be a universal approximator for continuous set functions. We provide a theoretical analysis of Deep Sets which shows that this universal approximation property is only guaranteed if the model's latent space is sufficiently… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: 54 pages, 13 figures

  19. arXiv:2106.07452  [pdf, other

    stat.ML cs.LG

    Marginalising over Stationary Kernels with Bayesian Quadrature

    Authors: Saad Hamid, Sebastian Schulze, Michael A. Osborne, Stephen J. Roberts

    Abstract: Marginalising over families of Gaussian Process kernels produces flexible model classes with well-calibrated uncertainty estimates. Existing approaches require likelihood evaluations of many kernels, rendering them prohibitively expensive for larger datasets. We propose a Bayesian Quadrature scheme to make this marginalisation more efficient and thereby more practical. Through use of the maximum m… ▽ More

    Submitted 15 March, 2023; v1 submitted 14 June, 2021; originally announced June 2021.

  20. arXiv:2102.07188  [pdf, other

    stat.ML cs.LG

    Think Global and Act Local: Bayesian Optimisation over High-Dimensional Categorical and Mixed Search Spaces

    Authors: Xingchen Wan, Vu Nguyen, Huong Ha, Binxin Ru, Cong Lu, Michael A. Osborne

    Abstract: High-dimensional black-box optimisation remains an important yet notoriously challenging problem. Despite the success of Bayesian optimisation methods on continuous domains, domains that are categorical, or that mix continuous and categorical variables, remain challenging. We propose a novel solution -- we combine local optimisation with a tailored kernel design, effectively handling high-dimensio… ▽ More

    Submitted 10 June, 2021; v1 submitted 14 February, 2021; originally announced February 2021.

    Comments: ICML 2021. 9 page, 6 figures (26 pages, 16 figures, 2 tables including references and appendices)

  21. arXiv:2006.07593  [pdf, other

    cs.LG cs.NE stat.ML

    Optimal Transport Kernels for Sequential and Parallel Neural Architecture Search

    Authors: Vu Nguyen, Tam Le, Makoto Yamada, Michael A Osborne

    Abstract: Neural architecture search (NAS) automates the design of deep neural networks. One of the main challenges in searching complex and non-continuous architectures is to compare the similarity of networks that the conventional Euclidean metric may fail to capture. Optimal transport (OT) is resilient to such complex structure by considering the minimal cost for transporting a network into another. Howe… ▽ More

    Submitted 10 June, 2021; v1 submitted 13 June, 2020; originally announced June 2020.

    Comments: 23 pages, camera ready ICML2021

  22. arXiv:2006.07556  [pdf, other

    cs.LG stat.ML

    Interpretable Neural Architecture Search via Bayesian Optimisation with Weisfeiler-Lehman Kernels

    Authors: Binxin Ru, Xingchen Wan, Xiaowen Dong, Michael Osborne

    Abstract: Current neural architecture search (NAS) strategies focus only on finding a single, good, architecture. They offer little insight into why a specific network is performing well, or how we should modify the architecture if we want further improvements. We propose a Bayesian optimisation (BO) approach for NAS that combines the Weisfeiler-Lehman graph kernel with a Gaussian process surrogate. Our met… ▽ More

    Submitted 19 February, 2021; v1 submitted 13 June, 2020; originally announced June 2020.

    Comments: ICLR 2021. 9 pages, 5 figures, 1 table (23 pages, 14 figures and 3 tables including references and appendices)

  23. arXiv:1912.09068  [pdf, other

    stat.ML cs.LG

    A Maximum Entropy approach to Massive Graph Spectra

    Authors: Diego Granziol, Robin Ru, Stefan Zohren, Xiaowen Dong, Michael Osborne, Stephen Roberts

    Abstract: Graph spectral techniques for measuring graph similarity, or for learning the cluster number, require kernel smoothing. The choice of kernel function and bandwidth are typically chosen in an ad-hoc manner and heavily affect the resulting output. We prove that kernel smoothing biases the moments of the spectral density. We propose an information theoretically optimal approach to learn a smooth grap… ▽ More

    Submitted 19 December, 2019; originally announced December 2019.

    Comments: 12 pages. 9 Figures

  24. arXiv:1909.09593  [pdf, other

    cs.LG stat.ML

    Bayesian Optimization for Iterative Learning

    Authors: Vu Nguyen, Sebastian Schulze, Michael A Osborne

    Abstract: The performance of deep (reinforcement) learning systems crucially depends on the choice of hyperparameters. Their tuning is notoriously expensive, typically requiring an iterative training process to run for numerous steps to convergence. Traditional tuning algorithms only consider the final performance of hyperparameters acquired after many expensive iterations and ignore intermediate informatio… ▽ More

    Submitted 16 January, 2021; v1 submitted 20 September, 2019; originally announced September 2019.

    Comments: Camera ready NeurIPS 2020

  25. arXiv:1908.08258  [pdf, ps, other

    stat.ML cs.LG

    Adaptive Configuration Oracle for Online Portfolio Selection Methods

    Authors: Favour M. Nyikosa, Michael A. Osborne, Stephen J. Roberts

    Abstract: Financial markets are complex environments that produce enormous amounts of noisy and non-stationary data. One fundamental problem is online portfolio selection, the goal of which is to exploit this data to sequentially select portfolios of assets to achieve positive investment outcomes while managing risks. Various algorithms have been proposed for solving this problem in fields such as finance,… ▽ More

    Submitted 22 August, 2019; originally announced August 2019.

    MSC Class: 62P30 ACM Class: G.3

  26. arXiv:1907.00865  [pdf, other

    stat.ML cs.LG

    Radial Bayesian Neural Networks: Beyond Discrete Support In Large-Scale Bayesian Deep Learning

    Authors: Sebastian Farquhar, Michael Osborne, Yarin Gal

    Abstract: We propose Radial Bayesian Neural Networks (BNNs): a variational approximate posterior for BNNs which scales well to large models while maintaining a distribution over weight-space with full support. Other scalable Bayesian deep learning methods, like MC dropout or deep ensembles, have discrete support-they assign zero probability to almost all of the weight-space. Unlike these discrete support me… ▽ More

    Submitted 31 May, 2021; v1 submitted 1 July, 2019; originally announced July 2019.

    Journal ref: AI Stats, PMLR 108:1352-1362, 2020

  27. arXiv:1906.08878  [pdf, other

    stat.ML cs.LG

    Bayesian Optimisation over Multiple Continuous and Categorical Inputs

    Authors: Binxin Ru, Ahsan S. Alvi, Vu Nguyen, Michael A. Osborne, Stephen J Roberts

    Abstract: Efficient optimisation of black-box problems that comprise both continuous and categorical inputs is important, yet poses significant challenges. We propose a new approach, Continuous and Categorical Bayesian Optimisation (CoCaBO), which combines the strengths of multi-armed bandits and Bayesian optimisation to select values for both categorical and continuous inputs. We model this mixed-type spac… ▽ More

    Submitted 9 August, 2020; v1 submitted 20 June, 2019; originally announced June 2019.

    Comments: 16 pages

  28. arXiv:1906.01101  [pdf, other

    stat.ML cs.LG

    MEMe: An Accurate Maximum Entropy Method for Efficient Approximations in Large-Scale Machine Learning

    Authors: Diego Granziol, Binxin Ru, Stefan Zohren, Xiaowen Doing, Michael Osborne, Stephen Roberts

    Abstract: Efficient approximation lies at the heart of large-scale machine learning problems. In this paper, we propose a novel, robust maximum entropy algorithm, which is capable of dealing with hundreds of moments and allows for computationally efficient approximations. We showcase the usefulness of the proposed method, its equivalence to constrained Bayesian variational inference and demonstrate its supe… ▽ More

    Submitted 3 June, 2019; originally announced June 2019.

    Comments: 18 pages, 3 figures, Published at Entropy 2019: Special Issue Entropy Based Inference and Optimization in Machine Learning

    Journal ref: MEMe: An Accurate Maximum Entropy Method for Efficient Approximations in Large-Scale Machine Learning. Entropy, 21(6), 551 (2019)

  29. arXiv:1905.02685  [pdf, other

    stat.ML cs.LG

    Knowing The What But Not The Where in Bayesian Optimization

    Authors: Vu Nguyen, Michael A. Osborne

    Abstract: Bayesian optimization has demonstrated impressive success in finding the optimum input x* and output f* = f(x*) = max f(x) of a black-box function f. In some applications, however, the optimum output f* is known in advance and the goal is to find the corresponding optimum input x*. In this paper, we consider a new setting in BO in which the knowledge of the optimum output f* is available. Our goal… ▽ More

    Submitted 14 August, 2020; v1 submitted 7 May, 2019; originally announced May 2019.

    Comments: 16 pages

    Journal ref: International Conference on Machine Learning (ICML) 2020

  30. arXiv:1902.09724  [pdf, other

    cs.LG stat.ML

    Automated Model Selection with Bayesian Quadrature

    Authors: Henry Chai, Jean-Francois Ton, Roman Garnett, Michael A. Osborne

    Abstract: We present a novel technique for tailoring Bayesian quadrature (BQ) to model selection. The state-of-the-art for comparing the evidence of multiple models relies on Monte Carlo methods, which converge slowly and are unreliable for computationally expensive models. Previous research has shown that BQ offers sample efficiency superior to Monte Carlo in computing the evidence of an individual model.… ▽ More

    Submitted 1 March, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

    Comments: 10 pages, 5 figures. Currently in submission to ICML 2019

  31. arXiv:1902.08480  [pdf, other

    cs.LG math.DS stat.ML

    AReS and MaRS - Adversarial and MMD-Minimizing Regression for SDEs

    Authors: Gabriele Abbati, Philippe Wenk, Michael A Osborne, Andreas Krause, Bernhard Schölkopf, Stefan Bauer

    Abstract: Stochastic differential equations are an important modeling class in many disciplines. Consequently, there exist many methods relying on various discretization and numerical integration schemes. In this paper, we propose a novel, probabilistic model for estimating the drift and diffusion given noisy observations of the underlying stochastic system. Using state-of-the-art adversarial and moment mat… ▽ More

    Submitted 28 May, 2019; v1 submitted 22 February, 2019; originally announced February 2019.

    Comments: Published at the Thirty-sixth International Conference on Machine Learning (ICML 2019)

  32. arXiv:1902.06278  [pdf, other

    cs.LG math.DS stat.ML

    ODIN: ODE-Informed Regression for Parameter and State Inference in Time-Continuous Dynamical Systems

    Authors: Philippe Wenk, Gabriele Abbati, Michael A Osborne, Bernhard Schölkopf, Andreas Krause, Stefan Bauer

    Abstract: Parameter inference in ordinary differential equations is an important problem in many applied sciences and in engineering, especially in a data-scarce setting. In this work, we introduce a novel generative modeling approach based on constrained Gaussian processes and leverage it to build a computationally and data efficient algorithm for state and parameter inference. In an extensive set of exper… ▽ More

    Submitted 5 December, 2019; v1 submitted 17 February, 2019; originally announced February 2019.

    Comments: Published at the Thirty-fourth AAAI Conference on Artificial Intelligence

  33. arXiv:1901.10452  [pdf, other

    stat.ML cs.AI cs.LG

    Asynchronous Batch Bayesian Optimisation with Improved Local Penalisation

    Authors: Ahsan S. Alvi, Binxin Ru, Jan Calliess, Stephen J. Roberts, Michael A. Osborne

    Abstract: Batch Bayesian optimisation (BO) has been successfully applied to hyperparameter tuning using parallel computing, but it is wasteful of resources: workers that complete jobs ahead of others are left idle. We address this problem by develo** an approach, Penalising Locally for Asynchronous Bayesian Optimisation on $k$ workers (PLAyBOOK), for asynchronous parallel BO. We demonstrate empirically th… ▽ More

    Submitted 28 May, 2019; v1 submitted 29 January, 2019; originally announced January 2019.

    Comments: Camera-ready version after incorporating reviewers' suggestions

  34. arXiv:1901.09006  [pdf, other

    cs.LG cs.AI cs.NE cs.RO stat.ML

    On the Limitations of Representing Functions on Sets

    Authors: Edward Wagstaff, Fabian B. Fuchs, Martin Engelcke, Ingmar Posner, Michael Osborne

    Abstract: Recent work on the representation of functions on sets has considered the use of summation in a latent space to enforce permutation invariance. In particular, it has been conjectured that the dimension of this latent space may remain fixed as the cardinality of the sets under consideration increases. However, we demonstrate that the analysis leading to this conjecture requires map**s which are h… ▽ More

    Submitted 7 October, 2019; v1 submitted 25 January, 2019; originally announced January 2019.

    Comments: Published at the International Conference on Machine Learning (2019)

  35. arXiv:1812.01553  [pdf, other

    stat.ML cs.LG stat.CO

    Batch Selection for Parallelisation of Bayesian Quadrature

    Authors: Ed Wagstaff, Saad Hamid, Michael Osborne

    Abstract: Integration over non-negative integrands is a central problem in machine learning (e.g. for model averaging, (hyper-)parameter marginalisation, and computing posterior predictive distributions). Bayesian Quadrature is a probabilistic numerical integration technique that performs promisingly when compared to traditional Markov Chain Monte Carlo methods. However, in contrast to easily-parallelised M… ▽ More

    Submitted 4 December, 2018; originally announced December 2018.

  36. arXiv:1811.10275  [pdf, ps, other

    stat.CO cs.LG math.NA stat.ML

    Rejoinder for "Probabilistic Integration: A Role in Statistical Computation?"

    Authors: Francois-Xavier Briol, Chris J. Oates, Mark Girolami, Michael A. Osborne, Dino Sejdinovic

    Abstract: This article is the rejoinder for the paper "Probabilistic Integration: A Role in Statistical Computation?" to appear in Statistical Science with discussion. We would first like to thank the reviewers and many of our colleagues who helped shape this paper, the editor for selecting our paper for discussion, and of course all of the discussants for their thoughtful, insightful and constructive comme… ▽ More

    Submitted 26 November, 2018; originally announced November 2018.

    Comments: Accepted to Statistical Science

  37. arXiv:1811.09960  [pdf, other

    stat.ML cs.AI cs.CY cs.LG

    Intersectionality: Multiple Group Fairness in Expectation Constraints

    Authors: Jack Fitzsimons, Michael Osborne, Stephen Roberts

    Abstract: Group fairness is an important concern for machine learning researchers, developers, and regulators. However, the strictness to which models must be constrained to be considered fair is still under debate. The focus of this work is on constraining the expected outcome of subpopulations in kernel regression and, in particular, decision tree regression, with application to random forests, boosted tr… ▽ More

    Submitted 25 November, 2018; originally announced November 2018.

    Comments: NeurIPS (previously NIPS) 2018, Workshop on Ethical, Social and Governance Issues in AI

  38. arXiv:1810.05041  [pdf, other

    cs.LG cs.AI stat.ML

    A General Framework for Fair Regression

    Authors: Jack Fitzsimons, AbdulRahman Al Ali, Michael Osborne, Stephen Roberts

    Abstract: Fairness, through its many forms and definitions, has become an important issue facing the machine learning community. In this work, we consider how to incorporate group fairness constraints in kernel regression methods, applicable to Gaussian processes, support vector machines, neural network regression and decision tree regression. Further, we focus on examining the effect of incorporating these… ▽ More

    Submitted 2 February, 2019; v1 submitted 10 October, 2018; originally announced October 2018.

    Comments: 8 pages, 4 figures, 2 pages references

  39. Battery health prediction under generalized conditions using a Gaussian process transition model

    Authors: Robert R. Richardson, Michael A. Osborne, David A. Howey

    Abstract: Accurately predicting the future health of batteries is necessary to ensure reliable operation, minimise maintenance costs, and calculate the value of energy storage investments. The complex nature of degradation renders data-driven approaches a promising alternative to mechanistic modelling. This study predicts the changes in battery capacity over time using a Bayesian non-parametric approach bas… ▽ More

    Submitted 17 July, 2018; originally announced July 2018.

  40. arXiv:1805.10662  [pdf, other

    cs.LG cs.AI stat.ML

    Fingerprint Policy Optimisation for Robust Reinforcement Learning

    Authors: Supratik Paul, Michael A. Osborne, Shimon Whiteson

    Abstract: Policy gradient methods ignore the potential value of adjusting environment variables: unobservable state features that are randomly determined by the environment in a physical setting, but are controllable in a simulator. This can lead to slow learning, or convergence to suboptimal policies, if the environment variable has a large impact on the transition dynamics. In this paper, we present finge… ▽ More

    Submitted 27 May, 2019; v1 submitted 27 May, 2018; originally announced May 2018.

    Comments: ICML 2019

  41. arXiv:1805.08610  [pdf, other

    stat.ML cs.LG

    Optimization, fast and slow: optimally switching between local and Bayesian optimization

    Authors: Mark McLeod, Michael A. Osborne, Stephen J. Roberts

    Abstract: We develop the first Bayesian Optimization algorithm, BLOSSOM, which selects between multiple alternative acquisition functions and traditional local optimization at each step. This is combined with a novel stop** condition based on expected regret. This pairing allows us to obtain the best characteristics of both local and Bayesian optimization, making efficient use of function evaluations whil… ▽ More

    Submitted 22 May, 2018; originally announced May 2018.

  42. arXiv:1804.06802  [pdf, other

    stat.ML cs.IT cs.LG

    Entropic Spectral Learning for Large-Scale Graphs

    Authors: Diego Granziol, Binxin Ru, Stefan Zohren, Xiaowen Dong, Michael Osborne, Stephen Roberts

    Abstract: Graph spectra have been successfully used to classify network types, compute the similarity between graphs, and determine the number of communities in a network. For large graphs, where an eigen-decomposition is infeasible, iterative moment matched approximations to the spectra and kernel smoothing are typically used. We show that the underlying moment information is lost when using kernel smoothi… ▽ More

    Submitted 25 March, 2019; v1 submitted 18 April, 2018; originally announced April 2018.

    Comments: 13 pages, 12 figures

  43. arXiv:1803.10520  [pdf, ps, other

    quant-ph cs.LG stat.ML

    Quantum algorithms for training Gaussian Processes

    Authors: Zhikuan Zhao, Jack K. Fitzsimons, Michael A. Osborne, Stephen J. Roberts, Joseph F. Fitzsimons

    Abstract: Gaussian processes (GPs) are important models in supervised machine learning. Training in Gaussian processes refers to selecting the covariance functions and the associated parameters in order to improve the outcome of predictions, the core of which amounts to evaluating the logarithm of the marginal likelihood (LML) of a given model. LML gives a concrete measure of the quality of prediction that… ▽ More

    Submitted 28 March, 2018; originally announced March 2018.

    Comments: 5 pages. Comments welcome

    Journal ref: Phys. Rev. A 100, 012304 (2019)

  44. arXiv:1803.03432  [pdf, other

    stat.ML

    Bayesian Optimization for Dynamic Problems

    Authors: Favour M. Nyikosa, Michael A. Osborne, Stephen J. Roberts

    Abstract: We propose practical extensions to Bayesian optimization for solving dynamic problems. We model dynamic objective functions using spatiotemporal Gaussian process priors which capture all the instances of the functions over time. Our extensions to Bayesian optimization use the information learnt from this model to guide the tracking of a temporally evolving minimum. By exploiting temporal correlati… ▽ More

    Submitted 9 March, 2018; originally announced March 2018.

    Comments: 10 pages, 4 figures

  45. arXiv:1802.08054  [pdf, other

    cs.LG cs.IT stat.ML

    VBALD - Variational Bayesian Approximation of Log Determinants

    Authors: Diego Granziol, Edward Wagstaff, Bin Xin Ru, Michael Osborne, Stephen Roberts

    Abstract: Evaluating the log determinant of a positive definite matrix is ubiquitous in machine learning. Applications thereof range from Gaussian processes, minimum-volume ellipsoids, metric learning, kernel learning, Bayesian neural networks, Determinental Point Processes, Markov random fields to partition functions of discrete graphical models. In order to avoid the canonical, yet prohibitive, Cholesky… ▽ More

    Submitted 21 February, 2018; originally announced February 2018.

  46. Gaussian Process Regression for In-situ Capacity Estimation of Lithium-ion Batteries

    Authors: Robert R. Richardson, Christoph R. Birkl, Michael A. Osborne, David A. Howey

    Abstract: Accurate on-board capacity estimation is of critical importance in lithium-ion battery applications. Battery charging/discharging often occurs under a constant current load, and hence voltage vs. time measurements under this condition may be accessible in practice. This paper presents a data-driven diagnostic technique, Gaussian Process regression for In-situ Capacity Estimation (GP-ICE), which es… ▽ More

    Submitted 18 December, 2017; v1 submitted 7 December, 2017; originally announced December 2017.

    Comments: 12 pages, 10 figures, submitted to IEEE Transactions on Industrial Informatics

    Report number: TII-17-1314 MSC Class: 62P30 ACM Class: J.2; G.3

  47. arXiv:1711.04308  [pdf, ps, other

    stat.ML eess.SP

    Sensor Selection and Random Field Reconstruction for Robust and Cost-effective Heterogeneous Weather Sensor Networks for the Develo** World

    Authors: Pengfei Zhang, Ido Nevat, Gareth W. Peters, Wolfgang Fruehwirt, Yongchao Huang, Ivonne Anders, Michael Osborne

    Abstract: We address the two fundamental problems of spatial field reconstruction and sensor selection in heterogeneous sensor networks: (i) how to efficiently perform spatial field reconstruction based on measurements obtained simultaneously from networks with both high and low quality sensors; and (ii) how to perform query based sensor set selection with predictive MSE performance guarantee. For the first… ▽ More

    Submitted 23 November, 2017; v1 submitted 12 November, 2017; originally announced November 2017.

    Comments: Presented at NIPS 2017 Workshop on Machine Learning for the Develo** World

  48. arXiv:1711.00673  [pdf, other

    stat.ML

    Fast Information-theoretic Bayesian Optimisation

    Authors: Binxin Ru, Mark McLeod, Diego Granziol, Michael A. Osborne

    Abstract: Information-theoretic Bayesian optimisation techniques have demonstrated state-of-the-art performance in tackling important global optimisation problems. However, current information-theoretic approaches require many approximations in implementation, introduce often-prohibitive computational overhead and limit the choice of kernels available to model the objective. We develop a fast information-th… ▽ More

    Submitted 6 June, 2018; v1 submitted 2 November, 2017; originally announced November 2017.

    Comments: Main Paper: 9 pages, 6 figures, 2 tables; Accepted by ICML 2018

  49. arXiv:1707.04314  [pdf, other

    stat.ML cs.AI cs.PL stat.CO

    Bayesian Optimization for Probabilistic Programs

    Authors: Tom Rainforth, Tuan Anh Le, Jan-Willem van de Meent, Michael A. Osborne, Frank Wood

    Abstract: We present the first general purpose framework for marginal maximum a posteriori estimation of probabilistic program variables. By using a series of code transformations, the evidence of any probabilistic program, and therefore of any graphical model, can be optimized with respect to an arbitrary subset of its sampled variables. To carry out this optimization, we develop the first Bayesian optimiz… ▽ More

    Submitted 13 July, 2017; originally announced July 2017.

  50. arXiv:1707.04191  [pdf, other

    stat.ML

    Distributionally Ambiguous Optimization Techniques for Batch Bayesian Optimization

    Authors: Nikitas Rontsis, Michael A. Osborne, Paul J. Goulart

    Abstract: We propose a novel, theoretically-grounded, acquisition function for Batch Bayesian optimization informed by insights from distributionally ambiguous optimization. Our acquisition function is a lower bound on the well-known Expected Improvement function, which requires evaluation of a Gaussian Expectation over a multivariate piecewise affine function. Our bound is computed instead by evaluating th… ▽ More

    Submitted 16 April, 2018; v1 submitted 13 July, 2017; originally announced July 2017.