Skip to main content

Showing 1–29 of 29 results for author: Bonilla, E V

.
  1. arXiv:2405.15991  [pdf, other

    cs.LG cs.AI stat.ML

    Rényi Neural Processes

    Authors: Xuesong Wang, He Zhao, Edwin V. Bonilla

    Abstract: Neural Processes (NPs) are variational frameworks that aim to represent stochastic processes with deep neural networks. Despite their obvious benefits in uncertainty estimation for complex distributions via data-driven priors, NPs enforce network parameter sharing between the conditional prior and posterior distributions, thereby risking introducing a misspecified prior. We hereby propose Rényi Ne… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2405.15167  [pdf, other

    stat.ML cs.LG

    ProDAG: Projection-induced variational inference for directed acyclic graphs

    Authors: Ryan Thompson, Edwin V. Bonilla, Robert Kohn

    Abstract: Directed acyclic graph (DAG) learning is a rapidly expanding field of research. Though the field has witnessed remarkable advances over the past few years, it remains statistically and computationally challenging to learn a single (point estimate) DAG from data, let alone provide uncertainty quantification. Our article addresses the difficult task of quantifying graph uncertainty by develo** a v… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  3. arXiv:2402.15255  [pdf, other

    cs.LG cs.AI

    Optimal Transport for Structure Learning Under Missing Data

    Authors: Vy Vo, He Zhao, Trung Le, Edwin V. Bonilla, Dinh Phung

    Abstract: Causal discovery in the presence of missing data introduces a chicken-and-egg dilemma. While the goal is to recover the true causal structure, robust imputation requires considering the dependencies or, preferably, causal relations among variables. Merely filling in missing values with existing imputation methods and subsequently applying structure learning on the complete data is empirically show… ▽ More

    Submitted 1 June, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  4. arXiv:2402.03614  [pdf, other

    cs.LG stat.ML

    Bayesian Vector AutoRegression with Factorised Granger-Causal Graphs

    Authors: He Zhao, Vassili Kitsios, Terence J. O'Kane, Edwin V. Bonilla

    Abstract: We study the problem of automatically discovering Granger causal relations from observational multivariate time-series data.Vector autoregressive (VAR) models have been time-tested for this problem, including Bayesian variants and more recent developments using deep neural networks. Most existing VAR methods for Granger causality use sparsity-inducing penalties/priors or post-hoc thresholds to int… ▽ More

    Submitted 23 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  5. arXiv:2402.02644  [pdf, other

    cs.LG stat.ML

    Variational DAG Estimation via State Augmentation With Stochastic Permutations

    Authors: Edwin V. Bonilla, Pantelis Elinas, He Zhao, Maurizio Filippone, Vassili Kitsios, Terry O'Kane

    Abstract: Estimating the structure of a Bayesian network, in the form of a directed acyclic graph (DAG), from observational data is a statistically and computationally hard problem with essential applications in areas such as causal discovery. Bayesian approaches are a promising direction for solving this task, as they allow for uncertainty quantification and deal with well-known identifiability issues. Fro… ▽ More

    Submitted 28 May, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

  6. arXiv:2310.15627  [pdf, other

    stat.ML cs.LG

    Contextual Directed Acyclic Graphs

    Authors: Ryan Thompson, Edwin V. Bonilla, Robert Kohn

    Abstract: Estimating the structure of directed acyclic graphs (DAGs) from observational data remains a significant challenge in machine learning. Most research in this area concentrates on learning a single DAG for the entire population. This paper considers an alternative setting where the graph structure varies across individuals based on available "contextual" features. We tackle this contextual DAG prob… ▽ More

    Submitted 20 February, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: To appear in the Proceedings of the 27th International Conference on Artificial Intelligence and Statistics

  7. arXiv:2305.18435  [pdf, other

    cs.LG stat.ME

    Statistically Efficient Bayesian Sequential Experiment Design via Reinforcement Learning with Cross-Entropy Estimators

    Authors: Tom Blau, Iadine Chades, Amir Dezfouli, Daniel Steinberg, Edwin V. Bonilla

    Abstract: Reinforcement learning can learn amortised design policies for designing sequences of experiments. However, current amortised methods rely on estimators of expected information gain (EIG) that require an exponential number of samples on the magnitude of the EIG to achieve an unbiased estimation. We propose the use of an alternative estimator based on the cross-entropy of the joint model distributi… ▽ More

    Submitted 4 February, 2024; v1 submitted 28 May, 2023; originally announced May 2023.

  8. arXiv:2302.09921  [pdf, other

    cs.LG stat.ML

    Free-Form Variational Inference for Gaussian Process State-Space Models

    Authors: Xuhui Fan, Edwin V. Bonilla, Terence J. O'Kane, Scott A. Sisson

    Abstract: Gaussian process state-space models (GPSSMs) provide a principled and flexible approach to modeling the dynamics of a latent state, which is observed at discrete-time points via a likelihood model. However, inference in GPSSMs is computationally and statistically challenging due to the large number of latent variables in the model and the strong temporal dependencies between them. In this paper, w… ▽ More

    Submitted 16 July, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: Updating to final version to appear in the proceedings

  9. arXiv:2211.00335  [pdf, other

    stat.ML cs.LG eess.SP math.OC

    Recurrent Neural Networks and Universal Approximation of Bayesian Filters

    Authors: Adrian N. Bishop, Edwin V. Bonilla

    Abstract: We consider the Bayesian optimal filtering problem: i.e. estimating some conditional statistics of a latent time-series signal from an observation sequence. Classical approaches often rely on the use of assumed or estimated transition and observation models. Instead, we formulate a generic recurrent neural network framework and seek to learn directly a recursive map** from observational inputs t… ▽ More

    Submitted 15 March, 2023; v1 submitted 1 November, 2022; originally announced November 2022.

    Journal ref: In Proceedings of the 26th International Conference on Artificial Intelligence and Statistics (AISTATS) 2023, Valencia, Spain. PMLR: Volume 206

  10. arXiv:2202.12508  [pdf, other

    cs.LG

    Addressing Over-Smoothing in Graph Neural Networks via Deep Supervision

    Authors: Pantelis Elinas, Edwin V. Bonilla

    Abstract: Learning useful node and graph representations with graph neural networks (GNNs) is a challenging task. It is known that deep GNNs suffer from over-smoothing where, as the number of layers increases, node representations become nearly indistinguishable and model performance on the downstream task degrades significantly. To address this problem, we propose deeply-supervised GNNs (DSGNNs), i.e., GNN… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

  11. arXiv:2202.00821  [pdf, other

    cs.LG stat.ML

    Optimizing Sequential Experimental Design with Deep Reinforcement Learning

    Authors: Tom Blau, Edwin V. Bonilla, Iadine Chades, Amir Dezfouli

    Abstract: Bayesian approaches developed to solve the optimal design of sequential experiments are mathematically elegant but computationally challenging. Recently, techniques using amortization have been proposed to make these Bayesian approaches practical, by training a parameterized policy that proposes designs efficiently at deployment time. However, these methods may not sufficiently explore the design… ▽ More

    Submitted 17 June, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Journal ref: International Conference on Machine Learning (2022)

  12. arXiv:2107.01650  [pdf, other

    cs.LG

    Learning ODEs via Diffeomorphisms for Fast and Robust Integration

    Authors: Weiming Zhi, Tin Lai, Lionel Ott, Edwin V. Bonilla, Fabio Ramos

    Abstract: Advances in differentiable numerical integrators have enabled the use of gradient descent techniques to learn ordinary differential equations (ODEs). In the context of machine learning, differentiable solvers are central for Neural ODEs (NODEs), a class of deep learning models with continuous depth, rather than discrete layers. However, these integrators can be unsatisfactorily slow and inaccurate… ▽ More

    Submitted 4 July, 2021; originally announced July 2021.

  13. arXiv:2106.06245  [pdf, other

    stat.ML cs.LG

    Model Selection for Bayesian Autoencoders

    Authors: Ba-Hien Tran, Simone Rossi, Dimitrios Milios, Pietro Michiardi, Edwin V. Bonilla, Maurizio Filippone

    Abstract: We develop a novel method for carrying out model selection for Bayesian autoencoders (BAEs) by means of prior hyper-parameter optimization. Inspired by the common practice of type-II maximum likelihood optimization and its equivalence to Kullback-Leibler divergence minimization, we propose to optimize the distributional sliced-Wasserstein distance (DSWD) between the output of the autoencoder and t… ▽ More

    Submitted 11 June, 2021; originally announced June 2021.

  14. arXiv:2105.04211  [pdf, other

    stat.ML cs.LG

    SigGPDE: Scaling Sparse Gaussian Processes on Sequential Data

    Authors: Maud Lemercier, Cristopher Salvi, Thomas Cass, Edwin V. Bonilla, Theodoros Damoulas, Terry Lyons

    Abstract: Making predictions and quantifying their uncertainty when the input data is sequential is a fundamental learning challenge, recently attracting increasing attention. We develop SigGPDE, a new scalable sparse variational inference framework for Gaussian Processes (GPs) on sequential data. Our contribution is twofold. First, we construct inducing variables underpinning the sparse approximation so th… ▽ More

    Submitted 12 October, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: Published at ICML 2021

    MSC Class: 60L10; 60L20

  15. arXiv:2102.09009  [pdf, other

    cs.LG stat.ML

    BORE: Bayesian Optimization by Density-Ratio Estimation

    Authors: Louis C. Tiao, Aaron Klein, Matthias Seeger, Edwin V. Bonilla, Cedric Archambeau, Fabio Ramos

    Abstract: Bayesian optimization (BO) is among the most effective and widely-used blackbox optimization methods. BO proposes solutions according to an explore-exploit trade-off criterion encoded in an acquisition function, many of which are computed from the posterior predictive of a probabilistic surrogate model. Prevalent among these is the expected improvement (EI) function. The need to ensure analytical… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

    Comments: preprint, under review

  16. arXiv:2006.05805  [pdf, other

    cs.LG stat.ML

    Distribution Regression for Sequential Data

    Authors: Maud Lemercier, Cristopher Salvi, Theodoros Damoulas, Edwin V. Bonilla, Terry Lyons

    Abstract: Distribution regression refers to the supervised learning problem where labels are only available for groups of inputs instead of individual inputs. In this paper, we develop a rigorous mathematical framework for distribution regression where inputs are complex data streams. Leveraging properties of the expected signature and a recent signature kernel trick for sequential data from stochastic anal… ▽ More

    Submitted 29 September, 2021; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: Published at AISTATS 2021

    MSC Class: 60L10; 60L20

  17. arXiv:2003.03080  [pdf, other

    stat.ML cs.LG

    Sparse Gaussian Processes Revisited: Bayesian Approaches to Inducing-Variable Approximations

    Authors: Simone Rossi, Markus Heinonen, Edwin V. Bonilla, Zheyang Shen, Maurizio Filippone

    Abstract: Variational inference techniques based on inducing variables provide an elegant framework for scalable posterior estimation in Gaussian process (GP) models. Besides enabling scalability, one of their main advantages over sparse approximations using direct marginal likelihood maximization is that they provide a robust alternative for point estimation of the inducing inputs, i.e. the location of the… ▽ More

    Submitted 23 February, 2021; v1 submitted 6 March, 2020; originally announced March 2020.

  18. arXiv:1912.10200  [pdf, other

    cs.LG stat.ML

    Quantile Propagation for Wasserstein-Approximate Gaussian Processes

    Authors: Rui Zhang, Christian J. Walder, Edwin V. Bonilla, Marian-Andrei Rizoiu, Lexing Xie

    Abstract: Approximate inference techniques are the cornerstone of probabilistic methods based on Gaussian process priors. Despite this, most work approximately optimizes standard divergence measures such as the Kullback-Leibler (KL) divergence, which lack the basic desiderata for the task at hand, while chiefly offering merely technical convenience. We develop a new approximate inference method for Gaussian… ▽ More

    Submitted 5 November, 2020; v1 submitted 21 December, 2019; originally announced December 2019.

    Comments: 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada

  19. arXiv:1906.03161  [pdf, other

    stat.ML cs.LG stat.AP

    Structured Variational Inference in Continuous Cox Process Models

    Authors: Virginia Aglietti, Edwin V. Bonilla, Theodoros Damoulas, Sally Cripps

    Abstract: We propose a scalable framework for inference in an inhomogeneous Poisson process modeled by a continuous sigmoidal Cox process that assumes the corresponding intensity function is given by a Gaussian process (GP) prior transformed with a scaled logistic sigmoid function. We present a tractable representation of the likelihood through augmentation with a superposition of Poisson processes. This vi… ▽ More

    Submitted 7 June, 2019; originally announced June 2019.

  20. arXiv:1906.01852  [pdf, other

    cs.LG stat.ML

    Variational Inference for Graph Convolutional Networks in the Absence of Graph Data and Adversarial Settings

    Authors: Pantelis Elinas, Edwin V. Bonilla, Louis Tiao

    Abstract: We propose a framework that lifts the capabilities of graph convolutional networks (GCNs) to scenarios where no input graph is given and increases their robustness to adversarial attacks. We formulate a joint probabilistic model that considers a prior distribution over graphs along with a GCN-based likelihood and develop a stochastic variational inference algorithm to estimate the graph posterior… ▽ More

    Submitted 20 October, 2020; v1 submitted 5 June, 2019; originally announced June 2019.

  21. arXiv:1903.03986  [pdf, other

    stat.ML cs.LG

    Scalable Grouped Gaussian Processes via Direct Cholesky Functional Representations

    Authors: Astrid Dahl, Edwin V. Bonilla

    Abstract: We consider multi-task regression models where observations are assumed to be a linear combination of several latent node and weight functions, all drawn from Gaussian process (GP) priors that allow nonzero covariance between grouped latent functions. We show that when these grouped functions are conditionally independent given a group-dependent pivot, it is possible to parameterize the prior thro… ▽ More

    Submitted 22 July, 2019; v1 submitted 10 March, 2019; originally announced March 2019.

    Comments: 14 pages, 4 figures

    MSC Class: 62G08 ACM Class: G.3; I.2.6; J.2

  22. arXiv:1806.02543  [pdf, other

    stat.ML cs.LG

    Grouped Gaussian Processes for Solar Power Prediction

    Authors: Astrid Dahl, Edwin V. Bonilla

    Abstract: We consider multi-task regression models where the observations are assumed to be a linear combination of several latent node functions and weight functions, which are both drawn from Gaussian process priors. Driven by the problem of develo** scalable methods for forecasting distributed solar and other renewable power generation, we propose coupled priors over groups of (node or weight) processe… ▽ More

    Submitted 3 December, 2018; v1 submitted 7 June, 2018; originally announced June 2018.

    Comments: 15 pages, 4 figures. Replacement extends last version with further experimental results and additional figures

    ACM Class: G.3; I.2.6; J.2

  23. arXiv:1806.01771  [pdf, other

    stat.ML cs.LG

    Cycle-Consistent Adversarial Learning as Approximate Bayesian Inference

    Authors: Louis C. Tiao, Edwin V. Bonilla, Fabio Ramos

    Abstract: We formalize the problem of learning interdomain correspondences in the absence of paired data as Bayesian inference in a latent variable model (LVM), where one seeks the underlying hidden representations of entities from one domain as entities from the other domain. First, we introduce implicit latent variable models, where the prior over hidden representations can be specified flexibly as an imp… ▽ More

    Submitted 24 August, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

    Comments: Presented at the ICML 2018 Workshop on Theoretical Foundations and Applications of Deep Generative Models. Stockholm, Sweden, 2018

  24. arXiv:1805.10522  [pdf, other

    stat.ML cs.LG

    Calibrating Deep Convolutional Gaussian Processes

    Authors: Gia-Lac Tran, Edwin V. Bonilla, John P. Cunningham, Pietro Michiardi, Maurizio Filippone

    Abstract: The wide adoption of Convolutional Neural Networks (CNNs) in applications where decision-making under uncertainty is fundamental, has brought a great deal of attention to the ability of these models to accurately quantify the uncertainty in their predictions. Previous work on combining CNNs with Gaussian processes (GPs) has been developed under the assumption that the predictive probabilities of t… ▽ More

    Submitted 26 May, 2018; originally announced May 2018.

    Comments: 12 pages

  25. arXiv:1702.08530  [pdf, ps, other

    cs.LG stat.ML

    Semi-parametric Network Structure Discovery Models

    Authors: Amir Dezfouli, Edwin V. Bonilla, Richard Nock

    Abstract: We propose a network structure discovery model for continuous observations that generalizes linear causal models by incorporating a Gaussian process (GP) prior on a network-independent component, and random sparsity and weight matrices as the network-dependent parameters. This approach provides flexible modeling of network-independent trends in the observations as well as uncertainty quantificatio… ▽ More

    Submitted 27 February, 2017; originally announced February 2017.

    ACM Class: I.2.6; I.5.1

  26. arXiv:1610.05392  [pdf, other

    stat.ML

    AutoGP: Exploring the Capabilities and Limitations of Gaussian Process Models

    Authors: Karl Krauth, Edwin V. Bonilla, Kurt Cutajar, Maurizio Filippone

    Abstract: We investigate the capabilities and limitations of Gaussian process models by jointly exploring three complementary directions: (i) scalable and statistically efficient inference; (ii) flexible kernels; and (iii) objective functions for hyperparameter learning alternative to the marginal likelihood. Our approach outperforms all previously reported GP methods on the standard MNIST dataset; performs… ▽ More

    Submitted 5 March, 2017; v1 submitted 17 October, 2016; originally announced October 2016.

    Comments: Edited results on RECTANGLES-IMAGE and related comments; minor additional edits

  27. arXiv:1610.04386  [pdf, other

    stat.ML stat.CO

    Random Feature Expansions for Deep Gaussian Processes

    Authors: Kurt Cutajar, Edwin V. Bonilla, Pietro Michiardi, Maurizio Filippone

    Abstract: The composition of multiple Gaussian Processes as a Deep Gaussian Process (DGP) enables a deep probabilistic nonparametric approach to flexibly tackle complex machine learning problems with sound quantification of uncertainty. Existing inference approaches for DGP models have limited scalability and are notoriously cumbersome to construct. In this work, we introduce a novel formulation of DGPs bas… ▽ More

    Submitted 1 March, 2017; v1 submitted 14 October, 2016; originally announced October 2016.

  28. arXiv:1609.04289  [pdf, other

    stat.ML

    Gray-box inference for structured Gaussian process models

    Authors: Pietro Galliani, Amir Dezfouli, Edwin V. Bonilla, Novi Quadrianto

    Abstract: We develop an automated variational inference method for Bayesian structured prediction problems with Gaussian process (GP) priors and linear-chain likelihoods. Our approach does not need to know the details of the structured likelihood model and can scale up to a large number of observations. Furthermore, we show that the required expected likelihood term and its gradients in the variational obje… ▽ More

    Submitted 14 September, 2016; originally announced September 2016.

  29. arXiv:1609.00577  [pdf, other

    stat.ML

    Generic Inference in Latent Gaussian Process Models

    Authors: Edwin V. Bonilla, Karl Krauth, Amir Dezfouli

    Abstract: We develop an automated variational method for inference in models with Gaussian process (GP) priors and general likelihoods. The method supports multiple outputs and multiple latent functions and does not require detailed knowledge of the conditional likelihood, only needing its evaluation as a black-box function. Using a mixture of Gaussians as the variational distribution, we show that the evid… ▽ More

    Submitted 5 November, 2018; v1 submitted 2 September, 2016; originally announced September 2016.

    Comments: 61 pages