Skip to main content

Showing 1–49 of 49 results for author: Osborne, M A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.12219  [pdf, other

    cs.LG math.NA stat.ML

    A Quadrature Approach for General-Purpose Batch Bayesian Optimization via Probabilistic Lifting

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Saad Hamid, Harald Oberhauser, Michael A. Osborne

    Abstract: Parallelisation in Bayesian optimisation is a common strategy but faces several challenges: the need for flexibility in acquisition functions and kernel choices, flexibility dealing with discrete and continuous variables simultaneously, model misspecification, and lastly fast massive parallelisation. To address these challenges, we introduce a versatile and modular framework for batch Bayesian opt… ▽ More

    Submitted 19 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: This work is the journal extension of the workshop paper (arXiv:2301.11832) and AISTATS paper (arXiv:2306.05843). 48 pages, 11 figures

    MSC Class: 62C10; 62F15

  2. arXiv:2403.08501  [pdf, other

    cs.CY

    Governing Through the Cloud: The Intermediary Role of Compute Providers in AI Regulation

    Authors: Lennart Heim, Tim Fist, Janet Egan, Sihao Huang, Stephen Zekany, Robert Trager, Michael A Osborne, Noa Zilberman

    Abstract: As jurisdictions around the world take their first steps toward regulating the most powerful AI systems, such as the EU AI Act and the US Executive Order 14110, there is a growing need for effective enforcement mechanisms that can verify compliance and respond to violations. We argue that compute providers should have legal obligations and ethical responsibilities associated with AI development an… ▽ More

    Submitted 26 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: v2: Fixing affiliations, formatting errors, and vector graphics

  3. arXiv:2402.01632  [pdf, other

    cs.LG stat.ML

    Beyond Lengthscales: No-regret Bayesian Optimisation With Unknown Hyperparameters Of Any Type

    Authors: Juliusz Ziomek, Masaki Adachi, Michael A. Osborne

    Abstract: Bayesian optimisation requires fitting a Gaussian process model, which in turn requires specifying hyperparameters - most of the theoretical literature assumes those hyperparameters are known. The commonly used maximum likelihood estimator for hyperparameters of the Gaussian process is consistent only if the data fills the space uniformly, which does not have to be the case in Bayesian optimisatio… ▽ More

    Submitted 13 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  4. arXiv:2402.00809  [pdf, other

    cs.LG stat.ML

    Position: Bayesian Deep Learning is Needed in the Age of Large-Scale AI

    Authors: Theodore Papamarkou, Maria Skoularidou, Konstantina Palla, Laurence Aitchison, Julyan Arbel, David Dunson, Maurizio Filippone, Vincent Fortuin, Philipp Hennig, José Miguel Hernández-Lobato, Aliaksandr Hubin, Alexander Immer, Theofanis Karaletsos, Mohammad Emtiyaz Khan, Agustinus Kristiadi, Yingzhen Li, Stephan Mandt, Christopher Nemeth, Michael A. Osborne, Tim G. J. Rudner, David Rügamer, Yee Whye Teh, Max Welling, Andrew Gordon Wilson, Ruqi Zhang

    Abstract: In the current landscape of deep learning research, there is a predominant emphasis on achieving high predictive accuracy in supervised tasks involving large image and language datasets. However, a broader perspective reveals a multitude of overlooked metrics, tasks, and data types, such as uncertainty, active and continual learning, and scientific data, that demand attention. Bayesian deep learni… ▽ More

    Submitted 2 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  5. arXiv:2310.17273  [pdf, other

    cs.LG cs.HC stat.ML

    Loo** in the Human Collaborative and Explainable Bayesian Optimization

    Authors: Masaki Adachi, Brady Planden, David A. Howey, Michael A. Osborne, Sebastian Orbell, Natalia Ares, Krikamol Muandet, Siu Lun Chau

    Abstract: Like many optimizers, Bayesian optimization often falls short of gaining user trust due to opacity. While attempts have been made to develop human-centric optimizers, they typically assume user knowledge is well-specified and error-free, employing users mainly as supervisors of the optimization process. We relax these assumptions and propose a more balanced human-AI partnership with our Collaborat… ▽ More

    Submitted 29 February, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted at AISTATS 2024, 24 pages, 11 figures

    MSC Class: 62C10; 62F15

  6. arXiv:2306.05843  [pdf, other

    cs.LG cs.AI math.NA stat.CO stat.ML

    Adaptive Batch Sizes for Active Learning A Probabilistic Numerics Approach

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Xingchen Wan, Vu Nguyen, Harald Oberhauser, Michael A. Osborne

    Abstract: Active learning parallelization is widely used, but typically relies on fixing the batch size throughout experimentation. This fixed approach is inefficient because of a dynamic trade-off between cost and speed -- larger batches are more costly, smaller batches lead to slower wall-clock run-times -- and the trade-off may change over the run (larger batches are often preferable earlier). To address… ▽ More

    Submitted 21 February, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted at AISTATS 2024. 33 pages, 6 figures

    MSC Class: 62C10; 62F15

  7. arXiv:2306.05304  [pdf, other

    cs.LG cs.AI stat.ML

    Bayesian Optimisation of Functions on Graphs

    Authors: Xingchen Wan, Pierre Osselin, Henry Kenlay, Binxin Ru, Michael A. Osborne, Xiaowen Dong

    Abstract: The increasing availability of graph-structured data motivates the task of optimising over functions defined on the node set of graphs. Traditional graph search algorithms can be applied in this case, but they may be sample-inefficient and do not make use of information about the function values; on the other hand, Bayesian optimisation is a class of promising black-box solvers with superior sampl… ▽ More

    Submitted 29 October, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023. 11 pages, 11 figures, 1 table (29 pages, 31 figures, 1 table including references and appendices)

  8. arXiv:2301.11832  [pdf, other

    cs.LG math.NA stat.CO stat.ML

    SOBER: Highly Parallel Bayesian Optimization and Bayesian Quadrature over Discrete and Mixed Spaces

    Authors: Masaki Adachi, Satoshi Hayakawa, Saad Hamid, Martin Jørgensen, Harald Oberhauser, Micheal A. Osborne

    Abstract: Batch Bayesian optimisation and Bayesian quadrature have been shown to be sample-efficient methods of performing optimisation and quadrature where expensive-to-evaluate objective functions can be queried in parallel. However, current methods do not scale to large batch sizes -- a frequent desideratum in practice (e.g. drug discovery or simulation-based inference). We present a novel algorithm, SOB… ▽ More

    Submitted 5 July, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: 34 pages, 12 figures

    MSC Class: 62C10; 62F15

  9. arXiv:2212.13936  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    On Pathologies in KL-Regularized Reinforcement Learning from Expert Demonstrations

    Authors: Tim G. J. Rudner, Cong Lu, Michael A. Osborne, Yarin Gal, Yee Whye Teh

    Abstract: KL-regularized reinforcement learning from expert demonstrations has proved successful in improving the sample efficiency of deep reinforcement learning algorithms, allowing them to be applied to challenging physical real-world tasks. However, we show that KL-regularized reinforcement learning with behavioral reference policies derived from expert demonstrations can suffer from pathological traini… ▽ More

    Submitted 28 December, 2022; originally announced December 2022.

    Comments: Published in Advances in Neural Information Processing Systems 34 (NeurIPS 2021)

  10. arXiv:2210.17299  [pdf, other

    stat.ME cs.LG eess.SY physics.chem-ph

    Bayesian Model Selection of Lithium-Ion Battery Models via Bayesian Quadrature

    Authors: Masaki Adachi, Yannick Kuhn, Birger Horstmann, Arnulf Latz, Michael A. Osborne, David A. Howey

    Abstract: A wide variety of battery models are available, and it is not always obvious which model `best' describes a dataset. This paper presents a Bayesian model selection approach using Bayesian quadrature. The model evidence is adopted as the selection metric, choosing the simplest model that describes the data, in the spirit of Occam's razor. However, estimating this requires integral computations over… ▽ More

    Submitted 5 April, 2023; v1 submitted 28 October, 2022; originally announced October 2022.

    Comments: 11 pages, 2 figures, accepted at IFAC2023

    MSC Class: 62C10; 62F15

    Journal ref: IFAC-PapersOnLine, 56, 10521, 2023

  11. arXiv:2210.10199  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Bayesian Optimization over Discrete and Mixed Spaces via Probabilistic Reparameterization

    Authors: Samuel Daulton, Xingchen Wan, David Eriksson, Maximilian Balandat, Michael A. Osborne, Eytan Bakshy

    Abstract: Optimizing expensive-to-evaluate black-box functions of discrete (and potentially continuous) design parameters is a ubiquitous problem in scientific and engineering applications. Bayesian optimization (BO) is a popular, sample-efficient method that leverages a probabilistic surrogate model and an acquisition function (AF) to select promising designs to evaluate. However, maximizing the AF over mi… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: To appear in Advances in Neural Information Processing Systems 35, 2022. Code available at: https://github.com/facebookresearch/bo_pr

  12. arXiv:2210.01633  [pdf, other

    cs.LG

    Log-Linear-Time Gaussian Processes Using Binary Tree Kernels

    Authors: Michael K. Cohen, Samuel Daulton, Michael A. Osborne

    Abstract: Gaussian processes (GPs) produce good probabilistic models of functions, but most GP kernels require $O((n+m)n^2)$ time, where $n$ is the number of data points and $m$ the number of predictive locations. We present a new kernel that allows for Gaussian process regression in $O((n+m)\log(n+m))$ time. Our "binary tree" kernel places all data points on the leaves of a binary tree, with the kernel dep… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022; 9 pages + appendices

    Journal ref: Adv.Neur.Info.Proc.Sys. 35 (2022) 8118-8129

  13. arXiv:2209.00343  [pdf, other

    stat.ML cs.LG

    Bézier Gaussian Processes for Tall and Wide Data

    Authors: Martin Jørgensen, Michael A. Osborne

    Abstract: Modern approximations to Gaussian processes are suitable for "tall data", with a cost that scales well in the number of observations, but under-performs on ``wide data'', scaling poorly in the number of input features. That is, as the number of input features grows, good predictive performance requires the number of summarising variables, and their associated cost, to grow rapidly. We introduce a… ▽ More

    Submitted 13 October, 2022; v1 submitted 1 September, 2022; originally announced September 2022.

  14. arXiv:2207.09405  [pdf, other

    cs.LG cs.AI

    Bayesian Generational Population-Based Training

    Authors: Xingchen Wan, Cong Lu, Jack Parker-Holder, Philip J. Ball, Vu Nguyen, Binxin Ru, Michael A. Osborne

    Abstract: Reinforcement learning (RL) offers the potential for training generally capable agents that can interact autonomously in the real world. However, one key limitation is the brittleness of RL algorithms to core hyperparameters and network architecture choice. Furthermore, non-stationarities such as evolving training data and increased agent complexity mean that different hyperparameters and architec… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: AutoML Conference 2022. 10 pages, 4 figure, 3 tables (28 pages, 10 figures, 7 tables including references and appendices)

  15. arXiv:2206.04779  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations

    Authors: Cong Lu, Philip J. Ball, Tim G. J. Rudner, Jack Parker-Holder, Michael A. Osborne, Yee Whye Teh

    Abstract: Offline reinforcement learning has shown great promise in leveraging large pre-collected datasets for policy learning, allowing agents to forgo often-expensive online data collection. However, offline reinforcement learning from visual observations with continuous action spaces remains under-explored, with a limited understanding of the key challenges in this complex domain. In this paper, we esta… ▽ More

    Submitted 6 July, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: Published at TMLR, 2023

  16. arXiv:2206.04734  [pdf, other

    cs.LG math.NA stat.CO stat.ML

    Fast Bayesian Inference with Batch Bayesian Quadrature via Kernel Recombination

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Harald Oberhauser, Michael A. Osborne

    Abstract: Calculation of Bayesian posteriors and model evidences typically requires numerical integration. Bayesian quadrature (BQ), a surrogate-model-based approach to numerical integration, is capable of superb sample efficiency, but its lack of parallelisation has hindered its practical applications. In this work, we propose a parallelised (batch) BQ method, employing techniques from kernel quadrature, t… ▽ More

    Submitted 27 January, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 38 pages, 6 figures

    MSC Class: 62C10; 62F15

    Journal ref: NeurIPS 35, 16533--16547 (2022)

  17. arXiv:2202.07549  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Robust Multi-Objective Bayesian Optimization Under Input Noise

    Authors: Samuel Daulton, Sait Cakmak, Maximilian Balandat, Michael A. Osborne, Enlu Zhou, Eytan Bakshy

    Abstract: Bayesian optimization (BO) is a sample-efficient approach for tuning design parameters to optimize expensive-to-evaluate, black-box performance metrics. In many manufacturing processes, the design parameters are subject to random input noise, resulting in a product that is often less performant than expected. Although BO methods have been proposed for optimizing a single objective under input nois… ▽ More

    Submitted 3 June, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: To appear at ICML 2022. 36 pages. Code is available at https://github.com/facebookresearch/robust_mobo

  18. arXiv:2111.11285  [pdf, other

    cond-mat.mes-hall cs.LG

    Bridging the reality gap in quantum devices with physics-aware machine learning

    Authors: D. L. Craig, H. Moon, F. Fedele, D. T. Lennon, B. Van Straaten, F. Vigneau, L. C. Camenzind, D. M. Zumbühl, G. A. D. Briggs, M. A. Osborne, D. Sejdinovic, N. Ares

    Abstract: The discrepancies between reality and simulation impede the optimisation and scalability of solid-state quantum devices. Disorder induced by the unpredictable distribution of material defects is one of the major contributions to the reality gap. We bridge this gap using physics-aware machine learning, in particular, using an approach combining a physical model, deep learning, Gaussian random field… ▽ More

    Submitted 22 November, 2021; originally announced November 2021.

  19. arXiv:2111.02842  [pdf, other

    stat.ML cs.AI cs.CR cs.LG

    Adversarial Attacks on Graph Classification via Bayesian Optimisation

    Authors: Xingchen Wan, Henry Kenlay, Binxin Ru, Arno Blaas, Michael A. Osborne, Xiaowen Dong

    Abstract: Graph neural networks, a popular class of models effective in a wide range of graph-based learning tasks, have been shown to be vulnerable to adversarial attacks. While the majority of the literature focuses on such vulnerability in node-level classification tasks, little effort has been dedicated to analysing adversarial attacks on graph-level classification, an important problem with numerous re… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021. 11 pages, 8 figures, 2 tables (24 pages, 17 figures, 8 tables including references and appendices)

  20. arXiv:2110.12087  [pdf, other

    cs.LG stat.ML

    Gaussian Process Sampling and Optimization with Approximate Upper and Lower Bounds

    Authors: Vu Nguyen, Marc Peter Deisenroth, Michael A. Osborne

    Abstract: Many functions have approximately-known upper and/or lower bounds, potentially aiding the modeling of such functions. In this paper, we introduce Gaussian process models for functions where such bounds are (approximately) known. More specifically, we propose the first use of such bounds to improve Gaussian process (GP) posterior sampling and Bayesian optimization (BO). That is, we transform a GP m… ▽ More

    Submitted 19 October, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: 20 pages

  21. arXiv:2110.04135  [pdf, other

    cs.LG cs.AI

    Revisiting Design Choices in Offline Model-Based Reinforcement Learning

    Authors: Cong Lu, Philip J. Ball, Jack Parker-Holder, Michael A. Osborne, Stephen J. Roberts

    Abstract: Offline reinforcement learning enables agents to leverage large pre-collected datasets of environment transitions to learn control policies, circumventing the need for potentially expensive or unsafe online data collection. Significant progress has been made recently in offline model-based reinforcement learning, approaches which leverage a learned dynamics model. This typically involves construct… ▽ More

    Submitted 16 March, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: Spotlight @ ICLR 2022; Spotlight @ RL4RealLife Workshop ICML2021

  22. arXiv:2107.12975  [pdf, other

    cond-mat.mes-hall cs.LG quant-ph

    Cross-architecture Tuning of Silicon and SiGe-based Quantum Devices Using Machine Learning

    Authors: B. Severin, D. T. Lennon, L. C. Camenzind, F. Vigneau, F. Fedele, D. Jirovec, A. Ballabio, D. Chrastina, G. Isella, M. de Kruijf, M. J. Carballido, S. Svab, A. V. Kuhlmann, F. R. Braakman, S. Geyer, F. N. M. Froning, H. Moon, M. A. Osborne, D. Sejdinovic, G. Katsaros, D. M. Zumbühl, G. A. D. Briggs, N. Ares

    Abstract: The potential of Si and SiGe-based devices for the scaling of quantum circuits is tainted by device variability. Each device needs to be tuned to operation conditions. We give a key step towards tackling this variability with an algorithm that, without modification, is capable of tuning a 4-gate Si FinFET, a 5-gate GeSi nanowire and a 7-gate SiGe heterostructure double quantum dot device from scra… ▽ More

    Submitted 27 July, 2021; originally announced July 2021.

  23. arXiv:2107.01959  [pdf, other

    cs.LG stat.ML

    Universal Approximation of Functions on Sets

    Authors: Edward Wagstaff, Fabian B. Fuchs, Martin Engelcke, Michael A. Osborne, Ingmar Posner

    Abstract: Modelling functions of sets, or equivalently, permutation-invariant functions, is a long-standing challenge in machine learning. Deep Sets is a popular method which is known to be a universal approximator for continuous set functions. We provide a theoretical analysis of Deep Sets which shows that this universal approximation property is only guaranteed if the model's latent space is sufficiently… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: 54 pages, 13 figures

  24. arXiv:2106.07452  [pdf, other

    stat.ML cs.LG

    Marginalising over Stationary Kernels with Bayesian Quadrature

    Authors: Saad Hamid, Sebastian Schulze, Michael A. Osborne, Stephen J. Roberts

    Abstract: Marginalising over families of Gaussian Process kernels produces flexible model classes with well-calibrated uncertainty estimates. Existing approaches require likelihood evaluations of many kernels, rendering them prohibitively expensive for larger datasets. We propose a Bayesian Quadrature scheme to make this marginalisation more efficient and thereby more practical. Through use of the maximum m… ▽ More

    Submitted 15 March, 2023; v1 submitted 14 June, 2021; originally announced June 2021.

  25. arXiv:2102.07188  [pdf, other

    stat.ML cs.LG

    Think Global and Act Local: Bayesian Optimisation over High-Dimensional Categorical and Mixed Search Spaces

    Authors: Xingchen Wan, Vu Nguyen, Huong Ha, Binxin Ru, Cong Lu, Michael A. Osborne

    Abstract: High-dimensional black-box optimisation remains an important yet notoriously challenging problem. Despite the success of Bayesian optimisation methods on continuous domains, domains that are categorical, or that mix continuous and categorical variables, remain challenging. We propose a novel solution -- we combine local optimisation with a tailored kernel design, effectively handling high-dimensio… ▽ More

    Submitted 10 June, 2021; v1 submitted 14 February, 2021; originally announced February 2021.

    Comments: ICML 2021. 9 page, 6 figures (26 pages, 16 figures, 2 tables including references and appendices)

  26. arXiv:2010.15750  [pdf, other

    cs.LG

    Gaussian Process Bandit Optimization of the Thermodynamic Variational Objective

    Authors: Vu Nguyen, Vaden Masrani, Rob Brekelmans, Michael A. Osborne, Frank Wood

    Abstract: Achieving the full promise of the Thermodynamic Variational Objective (TVO), a recently proposed variational lower bound on the log evidence involving a one-dimensional Riemann integral approximation, requires choosing a "schedule" of sorted discretization points. This paper introduces a bespoke Gaussian process bandit optimization method for automatically choosing these points. Our approach not o… ▽ More

    Submitted 20 November, 2020; v1 submitted 29 October, 2020; originally announced October 2020.

    Comments: NeurIPS 2020

  27. arXiv:2009.14825  [pdf, other

    cond-mat.mes-hall cs.LG quant-ph

    Deep Reinforcement Learning for Efficient Measurement of Quantum Devices

    Authors: V. Nguyen, S. B. Orbell, D. T. Lennon, H. Moon, F. Vigneau, L. C. Camenzind, L. Yu, D. M. Zumbühl, G. A. D. Briggs, M. A. Osborne, D. Sejdinovic, N. Ares

    Abstract: Deep reinforcement learning is an emerging machine learning approach which can teach a computer to learn from their actions and rewards similar to the way humans learn from experience. It offers many advantages in automating decision processes to navigate large parameter spaces. This paper proposes a novel approach to the efficient measurement of quantum devices based on deep reinforcement learnin… ▽ More

    Submitted 30 September, 2020; originally announced September 2020.

  28. arXiv:2006.07593  [pdf, other

    cs.LG cs.NE stat.ML

    Optimal Transport Kernels for Sequential and Parallel Neural Architecture Search

    Authors: Vu Nguyen, Tam Le, Makoto Yamada, Michael A Osborne

    Abstract: Neural architecture search (NAS) automates the design of deep neural networks. One of the main challenges in searching complex and non-continuous architectures is to compare the similarity of networks that the conventional Euclidean metric may fail to capture. Optimal transport (OT) is resilient to such complex structure by considering the minimal cost for transporting a network into another. Howe… ▽ More

    Submitted 10 June, 2021; v1 submitted 13 June, 2020; originally announced June 2020.

    Comments: 23 pages, camera ready ICML2021

  29. arXiv:2001.02589  [pdf, other

    cond-mat.mes-hall cs.LG quant-ph

    Machine learning enables completely automatic tuning of a quantum device faster than human experts

    Authors: H. Moon, D. T. Lennon, J. Kirkpatrick, N. M. van Esbroeck, L. C. Camenzind, Liuqi Yu, F. Vigneau, D. M. Zumbühl, G. A. D. Briggs, M. A Osborne, D. Sejdinovic, E. A. Laird, N. Ares

    Abstract: Device variability is a bottleneck for the scalability of semiconductor quantum devices. Increasing device control comes at the cost of a large parameter space that has to be explored in order to find the optimal operating conditions. We demonstrate a statistical tuning algorithm that navigates this entire parameter space, using just a few modelling assumptions, in the search for specific electron… ▽ More

    Submitted 8 January, 2020; originally announced January 2020.

  30. arXiv:1909.09593  [pdf, other

    cs.LG stat.ML

    Bayesian Optimization for Iterative Learning

    Authors: Vu Nguyen, Sebastian Schulze, Michael A Osborne

    Abstract: The performance of deep (reinforcement) learning systems crucially depends on the choice of hyperparameters. Their tuning is notoriously expensive, typically requiring an iterative training process to run for numerous steps to convergence. Traditional tuning algorithms only consider the final performance of hyperparameters acquired after many expensive iterations and ignore intermediate informatio… ▽ More

    Submitted 16 January, 2021; v1 submitted 20 September, 2019; originally announced September 2019.

    Comments: Camera ready NeurIPS 2020

  31. arXiv:1908.08258  [pdf, ps, other

    stat.ML cs.LG

    Adaptive Configuration Oracle for Online Portfolio Selection Methods

    Authors: Favour M. Nyikosa, Michael A. Osborne, Stephen J. Roberts

    Abstract: Financial markets are complex environments that produce enormous amounts of noisy and non-stationary data. One fundamental problem is online portfolio selection, the goal of which is to exploit this data to sequentially select portfolios of assets to achieve positive investment outcomes while managing risks. Various algorithms have been proposed for solving this problem in fields such as finance,… ▽ More

    Submitted 22 August, 2019; originally announced August 2019.

    MSC Class: 62P30 ACM Class: G.3

  32. arXiv:1906.08878  [pdf, other

    stat.ML cs.LG

    Bayesian Optimisation over Multiple Continuous and Categorical Inputs

    Authors: Binxin Ru, Ahsan S. Alvi, Vu Nguyen, Michael A. Osborne, Stephen J Roberts

    Abstract: Efficient optimisation of black-box problems that comprise both continuous and categorical inputs is important, yet poses significant challenges. We propose a new approach, Continuous and Categorical Bayesian Optimisation (CoCaBO), which combines the strengths of multi-armed bandits and Bayesian optimisation to select values for both categorical and continuous inputs. We model this mixed-type spac… ▽ More

    Submitted 9 August, 2020; v1 submitted 20 June, 2019; originally announced June 2019.

    Comments: 16 pages

  33. arXiv:1905.02685  [pdf, other

    stat.ML cs.LG

    Knowing The What But Not The Where in Bayesian Optimization

    Authors: Vu Nguyen, Michael A. Osborne

    Abstract: Bayesian optimization has demonstrated impressive success in finding the optimum input x* and output f* = f(x*) = max f(x) of a black-box function f. In some applications, however, the optimum output f* is known in advance and the goal is to find the corresponding optimum input x*. In this paper, we consider a new setting in BO in which the knowledge of the optimum output f* is available. Our goal… ▽ More

    Submitted 14 August, 2020; v1 submitted 7 May, 2019; originally announced May 2019.

    Comments: 16 pages

    Journal ref: International Conference on Machine Learning (ICML) 2020

  34. arXiv:1902.09724  [pdf, other

    cs.LG stat.ML

    Automated Model Selection with Bayesian Quadrature

    Authors: Henry Chai, Jean-Francois Ton, Roman Garnett, Michael A. Osborne

    Abstract: We present a novel technique for tailoring Bayesian quadrature (BQ) to model selection. The state-of-the-art for comparing the evidence of multiple models relies on Monte Carlo methods, which converge slowly and are unreliable for computationally expensive models. Previous research has shown that BQ offers sample efficiency superior to Monte Carlo in computing the evidence of an individual model.… ▽ More

    Submitted 1 March, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

    Comments: 10 pages, 5 figures. Currently in submission to ICML 2019

  35. arXiv:1902.08480  [pdf, other

    cs.LG math.DS stat.ML

    AReS and MaRS - Adversarial and MMD-Minimizing Regression for SDEs

    Authors: Gabriele Abbati, Philippe Wenk, Michael A Osborne, Andreas Krause, Bernhard Schölkopf, Stefan Bauer

    Abstract: Stochastic differential equations are an important modeling class in many disciplines. Consequently, there exist many methods relying on various discretization and numerical integration schemes. In this paper, we propose a novel, probabilistic model for estimating the drift and diffusion given noisy observations of the underlying stochastic system. Using state-of-the-art adversarial and moment mat… ▽ More

    Submitted 28 May, 2019; v1 submitted 22 February, 2019; originally announced February 2019.

    Comments: Published at the Thirty-sixth International Conference on Machine Learning (ICML 2019)

  36. arXiv:1902.06278  [pdf, other

    cs.LG math.DS stat.ML

    ODIN: ODE-Informed Regression for Parameter and State Inference in Time-Continuous Dynamical Systems

    Authors: Philippe Wenk, Gabriele Abbati, Michael A Osborne, Bernhard Schölkopf, Andreas Krause, Stefan Bauer

    Abstract: Parameter inference in ordinary differential equations is an important problem in many applied sciences and in engineering, especially in a data-scarce setting. In this work, we introduce a novel generative modeling approach based on constrained Gaussian processes and leverage it to build a computationally and data efficient algorithm for state and parameter inference. In an extensive set of exper… ▽ More

    Submitted 5 December, 2019; v1 submitted 17 February, 2019; originally announced February 2019.

    Comments: Published at the Thirty-fourth AAAI Conference on Artificial Intelligence

  37. arXiv:1901.10452  [pdf, other

    stat.ML cs.AI cs.LG

    Asynchronous Batch Bayesian Optimisation with Improved Local Penalisation

    Authors: Ahsan S. Alvi, Binxin Ru, Jan Calliess, Stephen J. Roberts, Michael A. Osborne

    Abstract: Batch Bayesian optimisation (BO) has been successfully applied to hyperparameter tuning using parallel computing, but it is wasteful of resources: workers that complete jobs ahead of others are left idle. We address this problem by develo** an approach, Penalising Locally for Asynchronous Bayesian Optimisation on $k$ workers (PLAyBOOK), for asynchronous parallel BO. We demonstrate empirically th… ▽ More

    Submitted 28 May, 2019; v1 submitted 29 January, 2019; originally announced January 2019.

    Comments: Camera-ready version after incorporating reviewers' suggestions

  38. arXiv:1811.10275  [pdf, ps, other

    stat.CO cs.LG math.NA stat.ML

    Rejoinder for "Probabilistic Integration: A Role in Statistical Computation?"

    Authors: Francois-Xavier Briol, Chris J. Oates, Mark Girolami, Michael A. Osborne, Dino Sejdinovic

    Abstract: This article is the rejoinder for the paper "Probabilistic Integration: A Role in Statistical Computation?" to appear in Statistical Science with discussion. We would first like to thank the reviewers and many of our colleagues who helped shape this paper, the editor for selecting our paper for discussion, and of course all of the discussants for their thoughtful, insightful and constructive comme… ▽ More

    Submitted 26 November, 2018; originally announced November 2018.

    Comments: Accepted to Statistical Science

  39. arXiv:1810.10042  [pdf, other

    quant-ph cond-mat.mes-hall cs.LG

    Efficiently measuring a quantum device using machine learning

    Authors: D. T. Lennon, H. Moon, L. C. Camenzind, Liuqi Yu, D. M. Zumbühl, G. A. D. Briggs, M. A. Osborne, E. A. Laird, N. Ares

    Abstract: Scalable quantum technologies will present challenges for characterizing and tuning quantum devices. This is a time-consuming activity, and as the size of quantum systems increases, this task will become intractable without the aid of automation. We present measurements on a quantum dot device performed by a machine learning algorithm. The algorithm selects the most informative measurements to per… ▽ More

    Submitted 23 October, 2018; originally announced October 2018.

  40. arXiv:1805.10662  [pdf, other

    cs.LG cs.AI stat.ML

    Fingerprint Policy Optimisation for Robust Reinforcement Learning

    Authors: Supratik Paul, Michael A. Osborne, Shimon Whiteson

    Abstract: Policy gradient methods ignore the potential value of adjusting environment variables: unobservable state features that are randomly determined by the environment in a physical setting, but are controllable in a simulator. This can lead to slow learning, or convergence to suboptimal policies, if the environment variable has a large impact on the transition dynamics. In this paper, we present finge… ▽ More

    Submitted 27 May, 2019; v1 submitted 27 May, 2018; originally announced May 2018.

    Comments: ICML 2019

  41. arXiv:1805.08610  [pdf, other

    stat.ML cs.LG

    Optimization, fast and slow: optimally switching between local and Bayesian optimization

    Authors: Mark McLeod, Michael A. Osborne, Stephen J. Roberts

    Abstract: We develop the first Bayesian Optimization algorithm, BLOSSOM, which selects between multiple alternative acquisition functions and traditional local optimization at each step. This is combined with a novel stop** condition based on expected regret. This pairing allows us to obtain the best characteristics of both local and Bayesian optimization, making efficient use of function evaluations whil… ▽ More

    Submitted 22 May, 2018; originally announced May 2018.

  42. arXiv:1803.10520  [pdf, ps, other

    quant-ph cs.LG stat.ML

    Quantum algorithms for training Gaussian Processes

    Authors: Zhikuan Zhao, Jack K. Fitzsimons, Michael A. Osborne, Stephen J. Roberts, Joseph F. Fitzsimons

    Abstract: Gaussian processes (GPs) are important models in supervised machine learning. Training in Gaussian processes refers to selecting the covariance functions and the associated parameters in order to improve the outcome of predictions, the core of which amounts to evaluating the logarithm of the marginal likelihood (LML) of a given model. LML gives a concrete measure of the quality of prediction that… ▽ More

    Submitted 28 March, 2018; originally announced March 2018.

    Comments: 5 pages. Comments welcome

    Journal ref: Phys. Rev. A 100, 012304 (2019)

  43. arXiv:1707.04314  [pdf, other

    stat.ML cs.AI cs.PL stat.CO

    Bayesian Optimization for Probabilistic Programs

    Authors: Tom Rainforth, Tuan Anh Le, Jan-Willem van de Meent, Michael A. Osborne, Frank Wood

    Abstract: We present the first general purpose framework for marginal maximum a posteriori estimation of probabilistic program variables. By using a series of code transformations, the evidence of any probabilistic program, and therefore of any graphical model, can be optimized with respect to an arbitrary subset of its sampled variables. To carry out this optimization, we develop the first Bayesian optimiz… ▽ More

    Submitted 13 July, 2017; originally announced July 2017.

  44. arXiv:1705.00891  [pdf, ps, other

    stat.ML cs.CE q-fin.ST

    A Novel Approach to Forecasting Financial Volatility with Gaussian Process Envelopes

    Authors: Syed Ali Asad Rizvi, Stephen J. Roberts, Michael A. Osborne, Favour Nyikosa

    Abstract: In this paper we use Gaussian Process (GP) regression to propose a novel approach for predicting volatility of financial returns by forecasting the envelopes of the time series. We provide a direct comparison of their performance to traditional approaches such as GARCH. We compare the forecasting power of three approaches: GP regression on the absolute and squared returns; regression on the envelo… ▽ More

    Submitted 2 May, 2017; originally announced May 2017.

    Comments: 16 pages, 8 figures, 6 tables

  45. arXiv:1608.00117  [pdf, other

    math.NA cs.DS quant-ph

    Improved stochastic trace estimation using mutually unbiased bases

    Authors: J. K. Fitzsimons, M. A. Osborne, S. J. Roberts, J. F. Fitzsimons

    Abstract: We examine the problem of estimating the trace of a matrix $A$ when given access to an oracle which computes $x^\dagger A x$ for an input vector $x$. We make use of the basis vectors from a set of mutually unbiased bases, widely studied in the field of quantum information processing, in the selection of probing vectors $x$. This approach offers a new state of the art single shot sampling variance… ▽ More

    Submitted 30 July, 2016; originally announced August 2016.

    Comments: 5 pages, 1 figure, 2 tables. Comments welcome

  46. arXiv:1605.07496  [pdf, other

    cs.LG cs.AI stat.ML

    Alternating Optimisation and Quadrature for Robust Control

    Authors: Supratik Paul, Konstantinos Chatzilygeroudis, Kamil Ciosek, Jean-Baptiste Mouret, Michael A. Osborne, Shimon Whiteson

    Abstract: Bayesian optimisation has been successfully applied to a variety of reinforcement learning problems. However, the traditional approach for learning optimal policies in simulators does not utilise the opportunity to improve learning by adjusting certain environment variables: state features that are unobservable and randomly determined by the environment in a physical setting but are controllable i… ▽ More

    Submitted 18 December, 2017; v1 submitted 24 May, 2016; originally announced May 2016.

    Comments: To appear in AAAI 2018. Video of policy learnt in simulation deployed on a real hexapod see https://youtu.be/ME90xtIPsKk

  47. arXiv:1506.01326  [pdf, other

    math.NA cs.AI cs.LG stat.CO stat.ML

    Probabilistic Numerics and Uncertainty in Computations

    Authors: Philipp Hennig, Michael A Osborne, Mark Girolami

    Abstract: We deliver a call to arms for probabilistic numerical methods: algorithms for numerical tasks, including linear algebra, integration, optimization and solving differential equations, that return uncertainties in their calculations. Such uncertainties, arising from the loss of precision induced by numerical calculation with limited time or hardware, are important for much contemporary science and i… ▽ More

    Submitted 3 June, 2015; originally announced June 2015.

    Comments: Author Generated Postprint. 17 pages, 4 Figures, 1 Table

  48. arXiv:1310.6740  [pdf, other

    stat.ML cs.LG

    Active Learning of Linear Embeddings for Gaussian Processes

    Authors: Roman Garnett, Michael A. Osborne, Philipp Hennig

    Abstract: We propose an active learning method for discovering low-dimensional structure in high-dimensional Gaussian process (GP) tasks. Such problems are increasingly frequent and important, but have hitherto presented severe practical difficulties. We further introduce a novel technique for approximately marginalizing GP hyperparameters, yielding marginal predictions robust to hyperparameter mis-specific… ▽ More

    Submitted 24 October, 2013; originally announced October 2013.

    MSC Class: 68T05 ACM Class: I.2.6; I.5.2; G.3

  49. arXiv:1310.5738  [pdf, ps, other

    stat.ML cs.LG

    A Kernel for Hierarchical Parameter Spaces

    Authors: Frank Hutter, Michael A. Osborne

    Abstract: We define a family of kernels for mixed continuous/discrete hierarchical parameter spaces and show that they are positive definite.

    Submitted 21 October, 2013; originally announced October 2013.