Showing 1–14 of 14 results for author: Ober, S W

  1. arXiv:2406.07263  [pdf, other]

    cs.LG q-bio.QM stat.ML

    Active learning for affinity prediction of antibodies

    Authors: Alexandra Gessner, Sebastian W. Ober, Owen Vickery, Dino Oglić, Talip Uçar

    Abstract: The primary objective of most lead optimization campaigns is to enhance the binding affinity of ligands. For large molecules such as antibodies, identifying mutations that enhance antibody affinity is particularly challenging due to the combinatorial explosion of potential mutations. When the structure of the antibody-antigen complex is available, relative binding free energy (RBFE) methods can of…

    Submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2402.09849  [pdf, other]

    cs.LG stat.ML

    Recommendations for Baselines and Benchmarking Approximate Gaussian Processes

    Authors: Sebastian W. Ober, Artem Artemev, Marcel Wagenländer, Rudolfs Grobins, Mark van der Wilk

    Abstract: Gaussian processes (GPs) are a mature and widely-used component of the ML toolbox. One of their desirable qualities is automatic hyperparameter selection, which allows for training without user intervention. However, in many realistic settings, approximations are typically needed, which typically do require tuning. We argue that this requirement for tuning complicates evaluation, which has led to…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: Preprint. 25 pages, 16 figures

  3. Towards Improved Variational Inference for Deep Bayesian Models

    Authors: Sebastian W. Ober

    Abstract: Deep learning has revolutionized the last decade, being at the forefront of extraordinary advances in a wide range of tasks including computer vision, natural language processing, and reinforcement learning, to name but a few. However, it is well-known that deep models trained via maximum likelihood estimation tend to be overconfident and give poorly-calibrated predictions. Bayesian deep learning…

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: PhD Thesis; University of Cambridge

  4. arXiv:2302.08436  [pdf, other]

    stat.ML cs.LG

    Trieste: Efficiently Exploring The Depths of Black-box Functions with TensorFlow

    Authors: Victor Picheny, Joel Berkeley, Henry B. Moss, Hrvoje Stojic, Uri Granta, Sebastian W. Ober, Artem Artemev, Khurram Ghani, Alexander Goodall, Andrei Paleyes, Sattar Vakili, Sergio Pascual-Diaz, Stratis Markou, Jixiang Qing, Nasrulloh R. B. S Loka, Ivo Couckuyt

    Abstract: We present Trieste, an open-source Python package for Bayesian optimization and active learning benefiting from the scalability and efficiency of TensorFlow. Our library enables the plug-and-play of popular TensorFlow-based models within sequential decision-making loops, e.g. Gaussian processes from GPflow or GPflux, or neural networks from Keras. This modular mindset is central to the package and…

    Submitted 16 February, 2023; originally announced February 2023.

  5. arXiv:2301.10123  [pdf, other]

    cs.LG stat.ML

    Inducing Point Allocation for Sparse Gaussian Processes in High-Throughput Bayesian Optimisation

    Authors: Henry B. Moss, Sebastian W. Ober, Victor Picheny

    Abstract: Sparse Gaussian Processes are a key component of high-throughput Bayesian Optimisation (BO) loops; however, we show that existing methods for allocating their inducing points severely hamper optimisation performance. By exploiting the quality-diversity decomposition of Determinantal Point Processes, we propose the first inducing point allocation strategy designed specifically for use in BO. Unlike…

    Submitted 23 February, 2023; v1 submitted 24 January, 2023; originally announced January 2023.

  6. arXiv:2206.02437  [pdf, other]

    cs.LG stat.ML

    Information-theoretic Inducing Point Placement for High-throughput Bayesian Optimisation

    Authors: Henry B. Moss, Sebastian W. Ober, Victor Picheny

    Abstract: Sparse Gaussian Processes are a key component of high-throughput Bayesian optimisation (BO) loops -- an increasingly common setting where evaluation budgets are large and highly parallelised. By using representative subsets of the available data to build approximate posteriors, sparse models dramatically reduce the computational costs of surrogate modelling by relying on a small set of pseudo-obse…

    Submitted 13 July, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

  7. arXiv:2107.10125  [pdf, other]

    stat.ML cs.LG

    A variational approximate posterior for the deep Wishart process

    Authors: Sebastian W. Ober, Laurence Aitchison

    Abstract: Recent work introduced deep kernel processes as an entirely kernel-based alternative to NNs (Aitchison et al. 2020). Deep kernel processes flexibly learn good top-layer representations by alternately sampling the kernel from a distribution over positive semi-definite matrices and performing nonlinear transformations. A particular deep kernel process, the deep Wishart process (DWP), is of particula…

    Submitted 3 December, 2021; v1 submitted 21 July, 2021; originally announced July 2021.

    Comments: Accepted for publication at the 35th Conference on Neural Information Processing Systems (NeurIPS 2021). 23 pages

  8. arXiv:2106.07512  [pdf, other]

    stat.ML cs.LG

    Last Layer Marginal Likelihood for Invariance Learning

    Authors: Pola Schwöbel, Martin Jørgensen, Sebastian W. Ober, Mark van der Wilk

    Abstract: Data augmentation is often used to incorporate inductive biases into models. Traditionally, these are hand-crafted and tuned with cross validation. The Bayesian paradigm for model selection provides a path towards end-to-end learning of invariances using only the training data, by optimising the marginal likelihood. Computing the marginal likelihood is hard for neural networks, but success with tr…

    Submitted 1 March, 2022; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: AISTATS '22

  9. arXiv:2102.12108  [pdf, other]

    stat.ML cs.LG

    The Promises and Pitfalls of Deep Kernel Learning

    Authors: Sebastian W. Ober, Carl E. Rasmussen, Mark van der Wilk

    Abstract: Deep kernel learning (DKL) and related techniques aim to combine the representational power of neural networks with the reliable uncertainty estimates of Gaussian processes. One crucial aspect of these models is an expectation that, because they are treated as Gaussian process models optimized using the marginal likelihood, they are protected from overfitting. However, we identify situations where…

    Submitted 7 July, 2021; v1 submitted 24 February, 2021; originally announced February 2021.

    Comments: Accepted for the 37th Conference on Uncertainty in Artificial Intelligence (UAI 2021), 20 pages

  10. arXiv:2102.06571  [pdf, other]

    stat.ML cs.LG

    Bayesian Neural Network Priors Revisited

    Authors: Vincent Fortuin, Adrià Garriga-Alonso, Sebastian W. Ober, Florian Wenzel, Gunnar Rätsch, Richard E. Turner, Mark van der Wilk, Laurence Aitchison

    Abstract: Isotropic Gaussian priors are the de facto standard for modern Bayesian neural network inference. However, it is unclear whether these priors accurately reflect our true beliefs about the weight distributions or give optimal performance. To find better priors, we study summary statistics of neural network weights in networks trained using stochastic gradient descent (SGD). We find that convolution…

    Submitted 16 March, 2022; v1 submitted 12 February, 2021; originally announced February 2021.

    Comments: Accepted at ICLR 2022

  11. arXiv:2011.09421  [pdf, other]

    stat.ML cs.LG

    Understanding Variational Inference in Function-Space

    Authors: David R. Burt, Sebastian W. Ober, Adrià Garriga-Alonso, Mark van der Wilk

    Abstract: Recent work has attempted to directly approximate the `function-space' or predictive posterior distribution of Bayesian models, without approximating the posterior distribution over the parameters. This is appealing in e.g. Bayesian neural networks, where we only need the former, and the latter is hard to represent. In this work, we highlight some advantages and limitations of employing the Kullba…

    Submitted 18 November, 2020; originally announced November 2020.

    Comments: 19 pages

  12. arXiv:2010.01590  [pdf, other]

    stat.ML cs.LG

    Deep kernel processes

    Authors: Laurence Aitchison, Adam X. Yang, Sebastian W. Ober

    Abstract: We define deep kernel processes in which positive definite Gram matrices are progressively transformed by nonlinear kernel functions and by sampling from (inverse) Wishart distributions. Remarkably, we find that deep Gaussian processes (DGPs), Bayesian neural networks (BNNs), infinite BNNs, and infinite BNNs with bottlenecks can all be written as deep kernel processes. For DGPs the equivalence ari…

    Submitted 30 May, 2021; v1 submitted 4 October, 2020; originally announced October 2020.

    Comments: 21 pages

  13. arXiv:2005.08140  [pdf, other]

    stat.ML cs.LG

    Global inducing point variational posteriors for Bayesian neural networks and deep Gaussian processes

    Authors: Sebastian W. Ober, Laurence Aitchison

    Abstract: We consider the optimal approximate posterior over the top-layer weights in a Bayesian neural network for regression, and show that it exhibits strong dependencies on the lower-layer weights. We adapt this result to develop a correlated approximate posterior over the weights at all layers in a Bayesian neural network. We extend this approach to deep Gaussian processes, unifying inference in the tw…

    Submitted 22 June, 2021; v1 submitted 16 May, 2020; originally announced May 2020.

    Comments: Accepted for publication at the 38th International Conference on Machine Learning (ICML 2021, PMLR 139), 33 pages

  14. arXiv:1912.08416  [pdf, other]

    stat.ML cs.LG

    Benchmarking the Neural Linear Model for Regression

    Authors: Sebastian W. Ober, Carl Edward Rasmussen

    Abstract: The neural linear model is a simple adaptive Bayesian linear regression method that has recently been used in a number of problems ranging from Bayesian optimization to reinforcement learning. Despite its apparent successes in these settings, to the best of our knowledge there has been no systematic exploration of its capabilities on simple regression tasks. In this work we characterize these on t…

    Submitted 18 December, 2019; originally announced December 2019.

    Comments: Advances in Approximate Bayesian Inference (AABI 2019)