Showing 1–50 of 96 results for author: Hernández-Lobato, J M

  1. arXiv:2406.07709  [pdf, other]

    cs.LG physics.chem-ph stat.ML

    Diagnosing and fixing common problems in Bayesian optimization for molecule design

    Authors: Austin Tripp, José Miguel Hernández-Lobato

    Abstract: Bayesian optimization (BO) is a principled approach to molecular design tasks. In this paper we explain three pitfalls of BO which can cause poor empirical performance: an incorrect prior width, over-smoothing, and inadequate acquisition function maximization. We show that with these issues addressed, even a basic BO setup is able to achieve the highest overall performance on the PMO benchmark for…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 8 pages, 4 figures. Code at: https://github.com/AustinT/basic-mol-bo-workshop2024

  2. arXiv:2406.05832  [pdf, other]

    q-bio.QM cs.LG q-bio.BM

    Improving Antibody Design with Force-Guided Sampling in Diffusion Models

    Authors: Paulina Kulytė, Francisco Vargas, Simon Valentin Mathis, Yu Guang Wang, José Miguel Hernández-Lobato, Pietro Liò

    Abstract: Antibodies, crucial for immune defense, primarily rely on complementarity-determining regions (CDRs) to bind and neutralize antigens, such as viruses. The design of these CDRs determines the antibody's affinity and specificity towards its target. Generative models, particularly denoising diffusion probabilistic models (DDPMs), have shown potential to advance the structure-based design of CDR regio…

    Submitted 9 June, 2024; originally announced June 2024.

  3. arXiv:2405.18457  [pdf, other]

    cs.LG stat.ML

    Improving Linear System Solvers for Hyperparameter Optimisation in Iterative Gaussian Processes

    Authors: Jihao Andreas Lin, Shreyas Padhy, Bruno Mlodozeniec, Javier Antorán, José Miguel Hernández-Lobato

    Abstract: Scaling hyperparameter optimisation to very large datasets remains an open problem in the Gaussian process community. This paper focuses on iterative methods, which use linear system solvers, like conjugate gradients, alternating projections or stochastic gradient descent, to construct an estimate of the marginal likelihood gradient. We discuss three key improvements which are applicable across so…

    Submitted 6 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Preprint. arXiv admin note: text overlap with arXiv:2405.18328

  4. arXiv:2405.18328  [pdf, other]

    cs.LG stat.ML

    Warm Start Marginal Likelihood Optimisation for Iterative Gaussian Processes

    Authors: Jihao Andreas Lin, Shreyas Padhy, Bruno Mlodozeniec, José Miguel Hernández-Lobato

    Abstract: Gaussian processes are a versatile probabilistic machine learning model whose effectiveness often depends on good hyperparameters, which are typically learned by maximising the marginal likelihood. In this work, we consider iterative methods, which use iterative linear system solvers to approximate marginal likelihood gradients up to a specified numerical precision, allowing a trade-off between co…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Advances in Approximate Bayesian Inference 2024

  5. arXiv:2405.12203  [pdf, other]

    cs.IT cs.LG

    Accelerating Relative Entropy Coding with Space Partitioning

    Authors: Jiajun He, Gergely Flamich, José Miguel Hernández-Lobato

    Abstract: Relative entropy coding (REC) algorithms encode a random sample following a target distribution $Q$, using a coding distribution $P$ shared between the sender and receiver. Sadly, general REC algorithms suffer from prohibitive encoding times, at least on the order of $2^{D_{\text{KL}}[Q||P]}$, and faster algorithms are limited to very specific settings. This work addresses this issue by introducin…

    Submitted 24 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 28 pages, 9 figures

  6. arXiv:2405.01616  [pdf, other]

    q-bio.BM cs.AI cs.LG

    Generative Active Learning for the Search of Small-molecule Protein Binders

    Authors: Maksym Korablyov, Cheng-Hao Liu, Moksh Jain, Almer M. van der Sloot, Eric Jolicoeur, Edward Ruediger, Andrei Cristian Nica, Emmanuel Bengio, Kostiantyn Lapchevskyi, Daniel St-Cyr, Doris Alexandra Schuetz, Victor Ion Butoi, Jarrid Rector-Brooks, Simon Blackburn, Leo Feng, Hadi Nekoei, SaiKrishna Gottipati, Priyesh Vijayan, Prateek Gupta, Ladislav Rampášek, Sasikanth Avancha, Pierre-Luc Bacon, William L. Hamilton, Brooks Paige, Sanchit Misra, et al. (9 additional authors not shown)

    Abstract: Despite substantial progress in machine learning for scientific discovery in recent years, truly de novo design of small molecules which exhibit a property of interest remains a significant challenge. We introduce LambdaZero, a generative active learning approach to search for synthesizable molecules. Powered by deep reinforcement learning, LambdaZero learns to search over the vast space of molecu…

    Submitted 2 May, 2024; originally announced May 2024.

  7. arXiv:2403.01946  [pdf, other]

    cs.LG

    A Generative Model of Symmetry Transformations

    Authors: James Urquhart Allingham, Bruno Kacper Mlodozeniec, Shreyas Padhy, Javier Antorán, David Krueger, Richard E. Turner, Eric Nalisnick, José Miguel Hernández-Lobato

    Abstract: Correctly capturing the symmetry transformations of data can lead to efficient models with strong generalization capabilities, though methods incorporating symmetries often require prior knowledge. While recent advancements have been made in learning those symmetries directly from the dataset, most of this work has focused on the discriminative setting. In this paper, we take inspiration from grou…

    Submitted 20 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  8. arXiv:2402.08845  [pdf, other]

    cs.LG stat.ME

    Feature Attribution with Necessity and Sufficiency via Dual-stage Perturbation Test for Causal Explanation

    Authors: Xuexin Chen, Ruichu Cai, Zhengting Huang, Yuxuan Zhu, Julien Horwood, Zhifeng Hao, Zijian Li, Jose Miguel Hernandez-Lobato

    Abstract: We investigate the problem of explainability for machine learning models, focusing on Feature Attribution Methods (FAMs) that evaluate feature importance through perturbation tests. Despite their utility, FAMs struggle to distinguish the contributions of different features, when their prediction changes are similar after perturbation. To enhance FAMs' discriminative power, we introduce Feature Att…

    Submitted 4 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: Accepted in the Proceedings of the 41st International Conference on Machine Learning (ICML2024)

  9. arXiv:2402.03008  [pdf, other]

    stat.ML cs.LG stat.CO

    Diffusive Gibbs Sampling

    Authors: Wenlin Chen, Mingtian Zhang, Brooks Paige, José Miguel Hernández-Lobato, David Barber

    Abstract: The inadequate mixing of conventional Markov Chain Monte Carlo (MCMC) methods for multi-modal distributions presents a significant challenge in practical applications such as Bayesian inference and molecular dynamics. Addressing this, we propose Diffusive Gibbs Sampling (DiGS), an innovative family of sampling methods designed for effective sampling from distributions characterized by distant and…

    Submitted 29 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted for publication at ICML 2024. Code available: https://github.com/Wenlin-Chen/DiGS

  10. arXiv:2402.00809  [pdf, other]

    cs.LG stat.ML

    Position: Bayesian Deep Learning is Needed in the Age of Large-Scale AI

    Authors: Theodore Papamarkou, Maria Skoularidou, Konstantina Palla, Laurence Aitchison, Julyan Arbel, David Dunson, Maurizio Filippone, Vincent Fortuin, Philipp Hennig, José Miguel Hernández-Lobato, Aliaksandr Hubin, Alexander Immer, Theofanis Karaletsos, Mohammad Emtiyaz Khan, Agustinus Kristiadi, Yingzhen Li, Stephan Mandt, Christopher Nemeth, Michael A. Osborne, Tim G. J. Rudner, David Rügamer, Yee Whye Teh, Max Welling, Andrew Gordon Wilson, Ruqi Zhang

    Abstract: In the current landscape of deep learning research, there is a predominant emphasis on achieving high predictive accuracy in supervised tasks involving large image and language datasets. However, a broader perspective reveals a multitude of overlooked metrics, tasks, and data types, such as uncertainty, active and continual learning, and scientific data, that demand attention. Bayesian deep learni…

    Submitted 2 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  11. arXiv:2310.20581  [pdf, other]

    cs.LG stat.ML

    Stochastic Gradient Descent for Gaussian Processes Done Right

    Authors: Jihao Andreas Lin, Shreyas Padhy, Javier Antorán, Austin Tripp, Alexander Terenin, Csaba Szepesvári, José Miguel Hernández-Lobato, David Janz

    Abstract: As is well known, both sampling from the posterior and computing the mean of the posterior in Gaussian process regression reduces to solving a large linear system of equations. We study the use of stochastic gradient descent for solving this linear system, and show that when \emph{done right} -- by which we mean using specific insights from the optimisation and kernel communities -- stochastic gra…

    Submitted 28 April, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

  12. Introducing instance label correlation in multiple instance learning. Application to cancer detection on histopathological images

    Authors: Pablo Morales-Álvarez, Arne Schmidt, José Miguel Hernández-Lobato, Rafael Molina

    Abstract: In the last years, the weakly supervised paradigm of multiple instance learning (MIL) has become very popular in many different areas. A paradigmatic example is computational pathology, where the lack of patch-level labels for whole-slide images prevents the application of supervised models. Probabilistic MIL methods based on Gaussian Processes (GPs) have obtained promising results due to their ex…

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 33 pages, 6 figures, 6 tables. Published at Pattern Recognition journal

  13. arXiv:2310.14963  [pdf, other]

    cs.LG stat.ML

    Studying K-FAC Heuristics by Viewing Adam through a Second-Order Lens

    Authors: Ross M. Clarke, José Miguel Hernández-Lobato

    Abstract: Research into optimisation for deep learning is characterised by a tension between the computational efficiency of first-order, gradient-based methods (such as SGD and Adam) and the theoretical efficiency of second-order, curvature-based methods (such as quasi-Newton methods and K-FAC). Noting that second-order methods often only function effectively with the addition of stabilising heuristics (su…

    Submitted 13 June, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: 33 pages, 21 figures, 7 tables. Published at ICML 2024

  14. arXiv:2310.14901  [pdf, other]

    cs.LG stat.ML

    Series of Hessian-Vector Products for Tractable Saddle-Free Newton Optimisation of Neural Networks

    Authors: Elre T. Oldewage, Ross M. Clarke, José Miguel Hernández-Lobato

    Abstract: Despite their popularity in the field of continuous optimisation, second-order quasi-Newton methods are challenging to apply in machine learning, as the Hessian matrix is intractably large. This computational burden is exacerbated by the need to address non-convexity, for instance by modifying the Hessian's eigenvalues as in Saddle-Free Newton methods. We propose an optimisation algorithm which ad…

    Submitted 27 February, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: 37 pages, 10 figures, 5 tables. To appear in TMLR. First two authors' order randomised

  15. arXiv:2310.09270  [pdf, other]

    cs.AI cs.LG

    Retro-fallback: retrosynthetic planning in an uncertain world

    Authors: Austin Tripp, Krzysztof Maziarz, Sarah Lewis, Marwin Segler, José Miguel Hernández-Lobato

    Abstract: Retrosynthesis is the task of planning a series of chemical reactions to create a desired molecule from simpler, buyable molecules. While previous works have proposed algorithms to find optimal solutions for a range of metrics (e.g. shortest, lowest-cost), these works generally overlook the fact that we have imperfect knowledge of the space of possible reactions, meaning plans created by algorithm…

    Submitted 13 April, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 camera ready version (https://openreview.net/forum?id=dl0u4ODCuW). 58 pages total. Code available at: https://github.com/AustinT/retro-fallback-iclr24. This version has 1) updated writing 2) updated figures 3) additional experimental results 4) more complete explanation of AND/OR graphs in the appendices 5) correct typos + error in fig G.5 caption

  16. arXiv:2310.09267  [pdf, ps, other]

    cs.NE cs.LG q-bio.QM

    Genetic algorithms are strong baselines for molecule generation

    Authors: Austin Tripp, José Miguel Hernández-Lobato

    Abstract: Generating molecules, both in a directed and undirected fashion, is a huge part of the drug discovery pipeline. Genetic algorithms (GAs) generate molecules by randomly modifying known molecules. In this paper we show that GAs are very strong algorithms for such tasks, outperforming many complicated machine learning methods: a result which many researchers may find surprising. We therefore propose…

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: Currently under review. Code will be made available at a later date

  17. arXiv:2309.17182  [pdf, other]

    cs.LG

    RECOMBINER: Robust and Enhanced Compression with Bayesian Implicit Neural Representations

    Authors: Jiajun He, Gergely Flamich, Zongyu Guo, José Miguel Hernández-Lobato

    Abstract: COMpression with Bayesian Implicit NEural Representations (COMBINER) is a recent data compression method that addresses a key inefficiency of previous Implicit Neural Representation (INR)-based approaches: it avoids quantization and enables direct optimization of the rate-distortion performance. However, COMBINER still has significant limitations: 1) it uses factorized priors and posterior approxi…

    Submitted 7 March, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: ICLR 2024 camera-ready version; 27 pages, 17 figures

  18. arXiv:2308.12316  [pdf, other]

    cs.LG

    Graph Neural Stochastic Differential Equations

    Authors: Richard Bergna, Felix Opolka, Pietro Liò, Jose Miguel Hernandez-Lobato

    Abstract: We present a novel model Graph Neural Stochastic Differential Equations (Graph Neural SDEs). This technique enhances the Graph Neural Ordinary Differential Equations (Graph Neural ODEs) by embedding randomness into data representation using Brownian motion. This inclusion allows for the assessment of prediction uncertainty, a crucial aspect frequently missed in current models. In our framework, we…

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: 9 main pages, 6 of appendix (15 in total), submitted for the Learning on Graph (LoG) conference

  19. arXiv:2308.10364  [pdf, other]

    cs.LG physics.comp-ph

    SE(3) Equivariant Augmented Coupling Flows

    Authors: Laurence I. Midgley, Vincent Stimper, Javier Antorán, Emile Mathieu, Bernhard Schölkopf, José Miguel Hernández-Lobato

    Abstract: Coupling normalizing flows allow for fast sampling and density evaluation, making them the tool of choice for probabilistic modeling of physical systems. However, the standard coupling architecture precludes endowing flows that operate on the Cartesian coordinates of atoms with the SE(3) and permutation invariances of physical systems. This work proposes a coupling flow that preserves SE(3) and pe…

    Submitted 5 March, 2024; v1 submitted 20 August, 2023; originally announced August 2023.

  20. arXiv:2307.07816  [pdf, other]

    cs.LG stat.ML

    Minimal Random Code Learning with Mean-KL Parameterization

    Authors: Jihao Andreas Lin, Gergely Flamich, José Miguel Hernández-Lobato

    Abstract: This paper studies the qualitative behavior and robustness of two variants of Minimal Random Code Learning (MIRACLE) used to compress variational Bayesian neural networks. MIRACLE implements a powerful, conditionally Gaussian variational approximation for the weight posterior $Q_{\mathbf{w}}$ and uses relative entropy coding to compress a weight sample from the posterior using a Gaussian coding di…

    Submitted 4 December, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

    Comments: ICML Neural Compression Workshop 2023

  21. arXiv:2307.06093  [pdf, other]

    cs.LG stat.ML

    Online Laplace Model Selection Revisited

    Authors: Jihao Andreas Lin, Javier Antorán, José Miguel Hernández-Lobato

    Abstract: The Laplace approximation provides a closed-form model selection objective for neural networks (NN). Online variants, which optimise NN parameters jointly with hyperparameters, like weight decay strength, have seen renewed interest in the Bayesian deep learning community. However, these methods violate Laplace's method's critical assumption that the approximation is performed around a mode of the…

    Submitted 9 January, 2024; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: Advances in Approximate Bayesian Inference 2023

  22. arXiv:2306.14861  [pdf, other]

    stat.ML cs.LG

    Leveraging Task Structures for Improved Identifiability in Neural Network Representations

    Authors: Wenlin Chen, Julien Horwood, Juyeon Heo, José Miguel Hernández-Lobato

    Abstract: This work extends the theory of identifiability in supervised learning by considering the consequences of having access to a distribution of tasks. In such cases, we show that identifiability is achievable even in the case of regression, extending prior work restricted to linear identifiability in the single-task classification case. Furthermore, we show that the existence of a task distribution w…

    Submitted 29 September, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: 18 pages, 4 figures, 5 tables, 1 algorithm

  23. arXiv:2306.14809  [pdf, other]

    cs.LG

    Tanimoto Random Features for Scalable Molecular Machine Learning

    Authors: Austin Tripp, Sergio Bacallado, Sukriti Singh, José Miguel Hernández-Lobato

    Abstract: The Tanimoto coefficient is commonly used to measure the similarity between molecules represented as discrete fingerprints, either as a distance metric or a positive definite kernel. While many kernel methods can be accelerated using random feature approximations, at present there is a lack of such approximations for the Tanimoto kernel. In this paper we propose two kinds of novel random features…

    Submitted 13 November, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: Camera-ready version presented at NeurIPS 2023. Updates include: notation changes, better description of features in section 4, updated experiments, link to code

  24. arXiv:2306.11589  [pdf, other]

    cs.LG stat.ML

    Sampling from Gaussian Process Posteriors using Stochastic Gradient Descent

    Authors: Jihao Andreas Lin, Javier Antorán, Shreyas Padhy, David Janz, José Miguel Hernández-Lobato, Alexander Terenin

    Abstract: Gaussian processes are a powerful framework for quantifying uncertainty and for sequential decision-making but are limited by the requirement of solving linear systems. In general, this has a cubic cost in dataset size and is sensitive to conditioning. We explore stochastic gradient algorithms as a computationally efficient method of approximately solving these linear systems: we develop low-varia…

    Submitted 15 January, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

    Journal ref: Advances in Neural Information Processing Systems, 2023

  25. arXiv:2305.19185  [pdf, other]

    cs.LG cs.IT stat.ML

    Compression with Bayesian Implicit Neural Representations

    Authors: Zongyu Guo, Gergely Flamich, Jiajun He, Zhibo Chen, José Miguel Hernández-Lobato

    Abstract: Many common types of data can be represented as functions that map coordinates to signal values, such as pixel locations to RGB values in the case of an image. Based on this view, data can be compressed by overfitting a compact neural network to its functional representation and then encoding the network weights. However, most current solutions for this are inefficient, as quantization to low-bit…

    Submitted 29 October, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: Accepted as a Spotlight paper in NeurIPS 2023. Updated camera-ready version

  26. normflows: A PyTorch Package for Normalizing Flows

    Authors: Vincent Stimper, David Liu, Andrew Campbell, Vincent Berenz, Lukas Ryll, Bernhard Schölkopf, José Miguel Hernández-Lobato

    Abstract: Normalizing flows model probability distributions through an expressive tractable density. They transform a simple base distribution, such as a Gaussian, through a sequence of invertible functions, which are referred to as layers. These layers typically use neural networks to become very expressive. Flows are ubiquitous in machine learning and have been applied to image generation, text modeling,…

    Submitted 26 June, 2023; v1 submitted 26 January, 2023; originally announced February 2023.

    Journal ref: Journal of Open Source Software, 8(86), 5361 (2023)

  27. arXiv:2302.10279  [pdf, other]

    cs.CV eess.IV

    Image Reconstruction via Deep Image Prior Subspaces

    Authors: Riccardo Barbano, Javier Antorán, Johannes Leuschner, José Miguel Hernández-Lobato, Bangti Jin, Željko Kereta

    Abstract: Deep learning has been widely used for solving image reconstruction tasks but its deployability has been held back due to the shortage of high-quality training data. Unsupervised learning methods, such as the deep image prior (DIP), naturally fill this gap, but bring a host of new issues: the susceptibility to overfitting due to a lack of robust early stopping strategies and unstable convergence.…

    Submitted 5 June, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

  28. arXiv:2210.04994  [pdf, other]

    stat.ML cs.AI cs.LG

    Sampling-based inference for large linear models, with application to linearised Laplace

    Authors: Javier Antorán, Shreyas Padhy, Riccardo Barbano, Eric Nalisnick, David Janz, José Miguel Hernández-Lobato

    Abstract: Large-scale linear models are ubiquitous throughout machine learning, with contemporary application as surrogate models for neural network uncertainty quantification; that is, the linearised Laplace method. Alas, the computational cost associated with Bayesian linear models constrains this method's application to small networks, small output spaces and small datasets. We address this limitation by…

    Submitted 16 March, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: Published at ICLR 2023. This latest Arxiv version is extended with a demonstration of the proposed methods on the Imagenet dataset

  29. arXiv:2208.01893  [pdf, other]

    cs.LG q-bio.QM stat.ML

    Flow Annealed Importance Sampling Bootstrap

    Authors: Laurence Illing Midgley, Vincent Stimper, Gregor N. C. Simm, Bernhard Schölkopf, José Miguel Hernández-Lobato

    Abstract: Normalizing flows are tractable density models that can approximate complicated target distributions, e.g. Boltzmann distributions of physical systems. However, current methods for training flows either suffer from mode-seeking behavior, use samples from the target generated beforehand by expensive MCMC methods, or use stochastic losses that have high variance. To avoid these problems, we augment…

    Submitted 7 March, 2023; v1 submitted 3 August, 2022; originally announced August 2022.

  30. arXiv:2207.05714  [pdf, other]

    cs.CV cs.LG

    Bayesian Experimental Design for Computed Tomography with the Linearised Deep Image Prior

    Authors: Riccardo Barbano, Johannes Leuschner, Javier Antorán, Bangti Jin, José Miguel Hernández-Lobato

    Abstract: We investigate adaptive design based on a single sparse pilot scan for generating effective scanning strategies for computed tomography reconstruction. We propose a novel approach using the linearised deep image prior. It allows incorporating information from the pilot measurements into the angle selection criteria, while maintaining the tractability of a conjugate Gaussian-linear model. On a synt…

    Submitted 11 July, 2022; originally announced July 2022.

  31. arXiv:2206.08900  [pdf, other]

    stat.ML cs.AI cs.LG

    Adapting the Linearised Laplace Model Evidence for Modern Deep Learning

    Authors: Javier Antorán, David Janz, James Urquhart Allingham, Erik Daxberger, Riccardo Barbano, Eric Nalisnick, José Miguel Hernández-Lobato

    Abstract: The linearised Laplace method for estimating model uncertainty has received renewed attention in the Bayesian deep learning community. The method provides reliable error bars and admits a closed-form expression for the model evidence, allowing for scalable selection of model hyperparameters. In this work, we examine the assumptions behind this method, particularly in conjunction with model selecti…

    Submitted 8 December, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: Paper appearing at ICML 2022

  32. arXiv:2205.02708  [pdf, other]

    cs.LG

    Meta-learning Adaptive Deep Kernel Gaussian Processes for Molecular Property Prediction

    Authors: Wenlin Chen, Austin Tripp, José Miguel Hernández-Lobato

    Abstract: We propose Adaptive Deep Kernel Fitting with Implicit Function Theorem (ADKF-IFT), a novel framework for learning deep kernel Gaussian processes (GPs) by interpolating between meta-learning and conventional deep kernel learning. Our approach employs a bilevel optimization objective where we meta-learn generally useful feature representations across tasks, in the sense that task-specific GP models…

    Submitted 16 February, 2023; v1 submitted 5 May, 2022; originally announced May 2022.

    Comments: Accepted for publication at The Eleventh International Conference on Learning Representations (ICLR 2023); code available at: https://github.com/Wenlin-Chen/ADKF-IFT

  33. arXiv:2203.00479  [pdf, other]

    eess.IV cs.LG stat.ML

    Uncertainty Estimation for Computed Tomography with a Linearised Deep Image Prior

    Authors: Javier Antorán, Riccardo Barbano, Johannes Leuschner, José Miguel Hernández-Lobato, Bangti Jin

    Abstract: Existing deep-learning based tomographic image reconstruction methods do not provide accurate estimates of reconstruction uncertainty, hindering their real-world deployment. This paper develops a method, termed as the linearised deep image prior (DIP), to estimate the uncertainty associated with reconstructions produced by the DIP with total variation regularisation (TV). Specifically, we endow th…

    Submitted 4 November, 2022; v1 submitted 28 February, 2022; originally announced March 2022.

  34. arXiv:2202.04599  [pdf, other]

    cs.LG stat.ML

    Missing Data Imputation and Acquisition with Deep Hierarchical Models and Hamiltonian Monte Carlo

    Authors: Ignacio Peis, Chao Ma, José Miguel Hernández-Lobato

    Abstract: Variational Autoencoders (VAEs) have recently been highly successful at imputing and acquiring heterogeneous missing data. However, within this specific application domain, existing VAE methods are restricted by using only one layer of latent variables and strictly Gaussian posterior approximations. To address these limitations, we present HH-VAEM, a Hierarchical VAE model for mixed-type incomplet…

    Submitted 22 December, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: Published at NeurIPS 2022

  35. arXiv:2201.12857  [pdf, other]

    cs.IT

    Fast Relative Entropy Coding with A* coding

    Authors: Gergely Flamich, Stratis Markou, José Miguel Hernández-Lobato

    Abstract: Relative entropy coding (REC) algorithms encode a sample from a target distribution $Q$ using a proposal distribution $P$, such that the expected codelength is $\mathcal{O}(D_{KL}[Q \,||\, P])$. REC can be seamlessly integrated with existing learned compression models since, unlike entropy coding, it does not assume discrete $Q$ or $P$, and does not require quantisation. However, general REC algor…

    Submitted 19 June, 2022; v1 submitted 30 January, 2022; originally announced January 2022.

    MSC Class: 68P30; 94A08; 94A20 ACM Class: E.4; G.3; H.1.1

  36. arXiv:2112.06926  [pdf, other]

    cs.LG stat.ML

    Addressing Bias in Active Learning with Depth Uncertainty Networks... or Not

    Authors: Chelsea Murray, James U. Allingham, Javier Antorán, José Miguel Hernández-Lobato

    Abstract: Farquhar et al. [2021] show that correcting for active learning bias with underparameterised models leads to improved downstream performance. For overparameterised models such as NNs, however, correction leads either to decreased or unchanged performance. They suggest that this is due to an "overfitting bias" which offsets the active learning bias. We show that depth uncertainty networks operate i…

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2112.06796

  37. arXiv:2112.06796  [pdf, other]

    cs.LG stat.ML

    Depth Uncertainty Networks for Active Learning

    Authors: Chelsea Murray, James U. Allingham, Javier Antorán, José Miguel Hernández-Lobato

    Abstract: In active learning, the size and complexity of the training dataset changes over time. Simple models that are well specified by the amount of data available at the start of active learning might suffer from bias as more points are actively sampled. Flexible models that might be well suited to the full dataset can suffer from overfitting towards the start of active learning. We tackle this problem…

    Submitted 4 May, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

  38. arXiv:2111.11510  [pdf, other]

    cs.LG cs.AI stat.ML

    Bootstrap Your Flow

    Authors: Laurence Illing Midgley, Vincent Stimper, Gregor N. C. Simm, José Miguel Hernández-Lobato

    Abstract: Normalizing flows are flexible, parameterized distributions that can be used to approximate expectations from intractable distributions via importance sampling. However, current flow-based approaches are limited on challenging targets where they either suffer from mode seeking behaviour or high variance in the training loss, or rely on samples from the target distribution, which may not be availab…

    Submitted 14 March, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

  39. arXiv:2110.15828  [pdf, other]

    stat.ML cs.AI cs.LG

    Resampling Base Distributions of Normalizing Flows

    Authors: Vincent Stimper, Bernhard Schölkopf, José Miguel Hernández-Lobato

    Abstract: Normalizing flows are a popular class of models for approximating probability distributions. However, their invertible nature limits their ability to model target distributions whose support have a complex topological structure, such as Boltzmann distributions. Several procedures have been proposed to solve this problem but many of them sacrifice invertibility and, thereby, tractability of the log…

    Submitted 24 February, 2022; v1 submitted 29 October, 2021; originally announced October 2021.

    Journal ref: Proceedings of the 25th International Conference on Artificial Intelligence and Statistics (AISTATS) 2022

  40. arXiv:2110.15486  [pdf, other]

    stat.ML cs.LG q-bio.BM

    DOCKSTRING: easy molecular docking yields better benchmarks for ligand design

    Authors: Miguel García-Ortegón, Gregor N. C. Simm, Austin J. Tripp, José Miguel Hernández-Lobato, Andreas Bender, Sergio Bacallado

    Abstract: The field of machine learning for drug discovery is witnessing an explosion of novel methods. These methods are often benchmarked on simple physicochemical properties such as solubility or general druglikeness, which can be readily computed. However, these properties are poor representatives of objective functions in drug design, mainly because they do not depend on the candidate's interaction wit…

    Submitted 28 October, 2021; originally announced October 2021.

  41. arXiv:2110.10461  [pdf, other]

    cs.LG stat.ML

    Scalable One-Pass Optimisation of High-Dimensional Weight-Update Hyperparameters by Implicit Differentiation

    Authors: Ross M. Clarke, Elre T. Oldewage, José Miguel Hernández-Lobato

    Abstract: Machine learning training methods depend plentifully and intricately on hyperparameters, motivating automated strategies for their optimisation. Many existing algorithms restart training for each new hyperparameter choice, at considerable computational cost. Some hypergradient-based one-pass methods exist, but these either cannot be applied to arbitrary optimiser hyperparameters (such as learning…

    Submitted 21 April, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

    Comments: 41 pages, 19 figures, 15 tables; minor CIFAR-10 normalisation updates from ICLR 2022 camera-ready version

  42. arXiv:2110.05721  [pdf, other]

    cs.LG cs.AI

    Action-Sufficient State Representation Learning for Control with Structural Constraints

    Authors: Biwei Huang, Chaochao Lu, Liu Leqi, José Miguel Hernández-Lobato, Clark Glymour, Bernhard Schölkopf, Kun Zhang

    Abstract: Perceived signals in real-world scenarios are usually high-dimensional and noisy, and finding and using their representation that contains essential and sufficient information required by downstream decision-making tasks will help improve computational efficiency and generalization ability in the tasks. In this paper, we focus on partially observable environments and propose to learn a minimal set…

    Submitted 19 June, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

  43. arXiv:2107.00096  [pdf, other]

    cs.LG stat.ML

    Improving black-box optimization in VAE latent space using decoder uncertainty

    Authors: Pascal Notin, José Miguel Hernández-Lobato, Yarin Gal

    Abstract: Optimization in the latent space of variational autoencoders is a promising approach to generate high-dimensional discrete objects that maximize an expensive black-box property (e.g., drug-likeness in molecular generation, function approximation with arithmetic expressions). However, existing methods lack robustness as they may decide to explore areas of the latent space for which no data was avai…

    Submitted 30 June, 2021; originally announced July 2021.

  44. arXiv:2104.05860  [pdf, other]

    cs.LG

    Contextual HyperNetworks for Novel Feature Adaptation

    Authors: Angus Lamb, Evgeny Saveliev, Yingzhen Li, Sebastian Tschiatschek, Camilla Longden, Simon Woodhead, José Miguel Hernández-Lobato, Richard E. Turner, Pashmina Cameron, Cheng Zhang

    Abstract: While deep learning has obtained state-of-the-art results in many applications, the adaptation of neural network architectures to incorporate new output features remains a challenge, as neural networks are commonly trained to produce a fixed output dimension. This issue is particularly severe in online learning settings, where new output features, such as items in a recommender system, are added c…

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 17 pages, 9 Figures, workshop paper at NeurIPS 2020 Meta-Learning Workshop

  45. arXiv:2104.04034  [pdf, other]

    cs.CY cs.HC

    Results and Insights from Diagnostic Questions: The NeurIPS 2020 Education Challenge

    Authors: Zichao Wang, Angus Lamb, Evgeny Saveliev, Pashmina Cameron, Yordan Zaykov, Jose Miguel Hernandez-Lobato, Richard E. Turner, Richard G. Baraniuk, Craig Barton, Simon Peyton Jones, Simon Woodhead, Cheng Zhang

    Abstract: This competition concerns educational diagnostic questions, which are pedagogically effective, multiple-choice questions (MCQs) whose distractors embody misconceptions. With a large and ever-increasing number of such questions, it becomes overwhelming for teachers to know which questions are the best ones to use for their students. We thus seek to answer the following question: how can we use data…

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: arXiv admin note: text overlap with arXiv:2007.12061

  46. arXiv:2102.12353  [pdf, other]

    cs.LG stat.ML

    Nonlinear Invariant Risk Minimization: A Causal Approach

    Authors: Chaochao Lu, Yuhuai Wu, José Miguel Hernández-Lobato, Bernhard Schölkopf

    Abstract: Due to spurious correlations, machine learning systems often fail to generalize to environments whose distributions differ from the ones used at training time. Prior work addressing this, either explicitly or implicitly, attempted to find a data representation that has an invariant relationship with the target. This is done by leveraging a diverse set of training environments to reduce the effect…

    Submitted 18 October, 2022; v1 submitted 24 February, 2021; originally announced February 2021.

  47. arXiv:2102.03159  [pdf, other]

    cs.LG cs.AI stat.ML

    Active Slices for Sliced Stein Discrepancy

    Authors: Wenbo Gong, Kaibo Zhang, Yingzhen Li, José Miguel Hernández-Lobato

    Abstract: Sliced Stein discrepancy (SSD) and its kernelized variants have demonstrated promising successes in goodness-of-fit tests and model learning in high dimensions. Despite their theoretical elegance, their empirical performance depends crucially on the search of optimal slicing directions to discriminate between two distributions. Unfortunately, previous gradient-based optimisation approaches for thi…

    Submitted 21 July, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: 22 pages, 7 figures, International Conference on Machine Learning (ICML) 2021

  48. arXiv:2012.11522  [pdf, other]

    cs.LG q-bio.BM q-bio.QM

    Barking up the right tree: an approach to search over molecule synthesis DAGs

    Authors: John Bradshaw, Brooks Paige, Matt J. Kusner, Marwin H. S. Segler, José Miguel Hernández-Lobato

    Abstract: When designing new molecules with particular properties, it is not only important what to make but crucially how to make it. These instructions form a synthesis directed acyclic graph (DAG), describing how a large vocabulary of simple building blocks can be recursively combined through chemical reactions to create more complicated molecules of interest. In contrast, many current deep generative mo…

    Submitted 21 December, 2020; originally announced December 2020.

    Comments: To appear in Advances in Neural Information Processing Systems 2020

  49. arXiv:2012.09092  [pdf, other]

    cs.LG stat.ML

    Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation

    Authors: Chaochao Lu, Biwei Huang, Ke Wang, José Miguel Hernández-Lobato, Kun Zhang, Bernhard Schölkopf

    Abstract: Reinforcement learning (RL) algorithms usually require a substantial amount of interaction data and perform well only for specific tasks in a fixed environment. In some scenarios such as healthcare, however, usually only few records are available for each patient, and patients may show different responses to the same treatment, impeding the application of current RL algorithms to learn optimal pol…

    Submitted 16 December, 2020; originally announced December 2020.

    Comments: Neural Information Processing Systems Workshop on Offline Reinforcement Learning

  50. BSODA: A Bipartite Scalable Framework for Online Disease Diagnosis

    Authors: Weijie He, Xiaohao Mao, Chao Ma, Yu Huang, José Miguel Hernández-Lobato, Ting Chen

    Abstract: A growing number of people are seeking healthcare advice online. Usually, they diagnose their medical conditions based on the symptoms they are experiencing, which is also known as self-diagnosis. From the machine learning perspective, online disease diagnosis is a sequential feature (symptom) selection and classification problem. Reinforcement learning (RL) methods are the standard approaches to…

    Submitted 20 January, 2022; v1 submitted 2 December, 2020; originally announced December 2020.

    Comments: The Web Conference 2022