Skip to main content

Showing 1–26 of 26 results for author: Damoulas, T

Searching in archive stat. Search in all archives.
.
  1. arXiv:2312.11158  [pdf, other

    cs.MA cs.LG stat.ML

    Interventionally Consistent Surrogates for Agent-based Simulators

    Authors: Joel Dyer, Nicholas Bishop, Yorgos Felekis, Fabio Massimo Zennaro, Anisoara Calinescu, Theodoros Damoulas, Michael Wooldridge

    Abstract: Agent-based simulators provide granular representations of complex intelligent systems by directly modelling the interactions of the system's constituent agents. Their high-fidelity nature enables hyper-local policy evaluation and testing of what-if scenarios, but is associated with large computational costs that inhibits their widespread use. Surrogate models can address these computational limit… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  2. arXiv:2312.08107  [pdf, other

    cs.LG cs.AI stat.ML

    Causal Optimal Transport of Abstractions

    Authors: Yorgos Felekis, Fabio Massimo Zennaro, Nicola Branchini, Theodoros Damoulas

    Abstract: Causal abstraction (CA) theory establishes formal criteria for relating multiple structural causal models (SCMs) at different levels of granularity by defining maps between them. These maps have significant relevance for real-world challenges such as synthesizing causal evidence from multiple experimental environments, learning causally consistent representations at different resolutions, and link… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  3. Table inference for combinatorial origin-destination choices in agent-based population synthesis

    Authors: Ioannis Zachos, Theodoros Damoulas, Mark Girolami

    Abstract: A key challenge in agent-based mobility simulations is the synthesis of individual agent socioeconomic profiles. Such profiles include locations of agent activities, which dictate the quality of the simulated travel patterns. These locations are typically represented in origin-destination matrices that are sampled using coarse travel surveys. This is because fine-grained trip profiles are scarce a… ▽ More

    Submitted 6 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: 17 pages, 8 figures, 2 tables

  4. arXiv:2306.01468  [pdf, other

    stat.ME stat.ML

    Robust Bayesian Inference for Berkson and Classical Measurement Error Models

    Authors: Charita Dellaporta, Theodoros Damoulas

    Abstract: Measurement error occurs when a covariate influencing a response variable is corrupted by noise. This can lead to misleading inference outcomes, particularly in problems where accurately estimating the relationship between covariates and response variables is crucial, such as causal effect estimation. Existing methods for dealing with measurement error often rely on strong assumptions such as know… ▽ More

    Submitted 29 April, 2024; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: 60 pages, 12 figures. v2: Updated version of paper

  5. arXiv:2208.10981  [pdf, ps, other

    cs.LG stat.ML

    Causal Entropy Optimization

    Authors: Nicola Branchini, Virginia Aglietti, Neil Dhir, Theodoros Damoulas

    Abstract: We study the problem of globally optimizing the causal effect on a target variable of an unknown causal graph in which interventions can be performed. This problem arises in many areas of science including biology, operations research and healthcare. We propose Causal Entropy Optimization (CEO), a framework that generalizes Causal Bayesian Optimization (CBO) to account for all sources of uncertain… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

  6. arXiv:2202.04744  [pdf, other

    stat.ME cs.LG stat.ML

    Robust Bayesian Inference for Simulator-based Models via the MMD Posterior Bootstrap

    Authors: Charita Dellaporta, Jeremias Knoblauch, Theodoros Damoulas, François-Xavier Briol

    Abstract: Simulator-based models are models for which the likelihood is intractable but simulation of synthetic data is possible. They are often used to describe complex real-world phenomena, and as such can often be misspecified in practice. Unfortunately, existing Bayesian approaches for simulators are known to perform poorly in those cases. In this paper, we propose a novel algorithm based on the posteri… ▽ More

    Submitted 19 December, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: Accepted for publication (with an oral presentation) at AISTATS 2022. A preliminary version of this paper was accepted in the NeurIPS 2021 workshop "Your Model is Wrong: Robustness and misspecification in probabilistic modeling". v2: added some references. v3: corrected small error in theorem 3

  7. arXiv:2111.01732  [pdf, other

    cs.LG stat.ML

    Spatio-Temporal Variational Gaussian Processes

    Authors: Oliver Hamelijnck, William J. Wilkinson, Niki A. Loppi, Arno Solin, Theodoros Damoulas

    Abstract: We introduce a scalable approach to Gaussian process inference that combines spatio-temporal filtering with natural gradient variational inference, resulting in a non-conjugate GP method for multivariate data that scales linearly with respect to time. Our natural gradient approach enables application of parallel filtering and smoothing, further reducing the temporal span complexity to be logarithm… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

  8. arXiv:2110.13891  [pdf, other

    stat.ML cs.LG

    Dynamic Causal Bayesian Optimization

    Authors: Virginia Aglietti, Neil Dhir, Javier González, Theodoros Damoulas

    Abstract: This paper studies the problem of performing a sequence of optimal interventions in a causal dynamical system where both the target variable of interest and the inputs evolve over time. This problem arises in a variety of domains e.g. system biology and operational research. Dynamic Causal Bayesian Optimization (DCBO) brings together ideas from sequential decision making, causal inference and Gaus… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

  9. arXiv:2109.03582  [pdf, other

    stat.ML cs.LG

    Higher Order Kernel Mean Embeddings to Capture Filtrations of Stochastic Processes

    Authors: Cristopher Salvi, Maud Lemercier, Chong Liu, Blanka Hovarth, Theodoros Damoulas, Terry Lyons

    Abstract: Stochastic processes are random variables with values in some space of paths. However, reducing a stochastic process to a path-valued random variable ignores its filtration, i.e. the flow of information carried by the process through time. By conditioning the process on its filtration, we introduce a family of higher order kernel mean embeddings (KMEs) that generalizes the notion of KME and captur… ▽ More

    Submitted 3 November, 2021; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: Published at NeurIPS 2021

    MSC Class: 60L10; 60L20

  10. arXiv:2108.02594  [pdf, other

    stat.ML cs.LG stat.AP

    A variational Bayesian spatial interaction model for estimating revenue and demand at business facilities

    Authors: Shanaka Perera, Virginia Aglietti, Theodoros Damoulas

    Abstract: We study the problem of estimating potential revenue or demand at business facilities and understanding its generating mechanism. This problem arises in different fields such as operation research or urban science, and more generally, it is crucial for businesses' planning and decision making. We develop a Bayesian spatial interaction model, henceforth BSIM, which provides probabilistic prediction… ▽ More

    Submitted 5 August, 2021; originally announced August 2021.

  11. arXiv:2105.04211  [pdf, other

    stat.ML cs.LG

    SigGPDE: Scaling Sparse Gaussian Processes on Sequential Data

    Authors: Maud Lemercier, Cristopher Salvi, Thomas Cass, Edwin V. Bonilla, Theodoros Damoulas, Terry Lyons

    Abstract: Making predictions and quantifying their uncertainty when the input data is sequential is a fundamental learning challenge, recently attracting increasing attention. We develop SigGPDE, a new scalable sparse variational inference framework for Gaussian Processes (GPs) on sequential data. Our contribution is twofold. First, we construct inducing variables underpinning the sparse approximation so th… ▽ More

    Submitted 12 October, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: Published at ICML 2021

    MSC Class: 60L10; 60L20

  12. arXiv:2012.07574  [pdf, other

    cs.LG physics.soc-ph stat.AP

    An Expectation-Based Network Scan Statistic for a COVID-19 Early Warning System

    Authors: Chance Haycock, Edward Thorpe-Woods, James Walsh, Patrick O'Hara, Oscar Giles, Neil Dhir, Theodoros Damoulas

    Abstract: One of the Greater London Authority's (GLA) response to the COVID-19 pandemic brings together multiple large-scale and heterogeneous datasets capturing mobility, transportation and traffic activity over the city of London to better understand 'busyness' and enable targeted interventions and effective policy-making. As part of Project Odysseus we describe an early-warning system and introduce an ex… ▽ More

    Submitted 8 December, 2020; originally announced December 2020.

  13. arXiv:2009.12821  [pdf, other

    stat.ML cs.LG

    Multi-task Causal Learning with Gaussian Processes

    Authors: Virginia Aglietti, Theodoros Damoulas, Mauricio Álvarez, Javier González

    Abstract: This paper studies the problem of learning the correlation structure of a set of intervention functions defined on the directed acyclic graph (DAG) of a causal model. This is useful when we are interested in jointly learning the causal effects of interventions on different subsets of variables in a DAG, which is common in field such as healthcare or operations research. We propose the first multi-… ▽ More

    Submitted 27 September, 2020; originally announced September 2020.

  14. arXiv:2006.15641  [pdf, other

    cs.LG stat.ML

    Variational Autoencoding of PDE Inverse Problems

    Authors: Daniel J. Tait, Theodoros Damoulas

    Abstract: Specifying a governing physical model in the presence of missing physics and recovering its parameters are two intertwined and fundamental problems in science. Modern machine learning allows one to circumvent these, via emulators and surrogates, but in doing so disregards prior knowledge and physical laws that are especially important for small data regimes, interpretability, and decision making.… ▽ More

    Submitted 28 June, 2020; originally announced June 2020.

  15. arXiv:2006.05805  [pdf, other

    cs.LG stat.ML

    Distribution Regression for Sequential Data

    Authors: Maud Lemercier, Cristopher Salvi, Theodoros Damoulas, Edwin V. Bonilla, Terry Lyons

    Abstract: Distribution regression refers to the supervised learning problem where labels are only available for groups of inputs instead of individual inputs. In this paper, we develop a rigorous mathematical framework for distribution regression where inputs are complex data streams. Leveraging properties of the expected signature and a recent signature kernel trick for sequential data from stochastic anal… ▽ More

    Submitted 29 September, 2021; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: Published at AISTATS 2021

    MSC Class: 60L10; 60L20

  16. arXiv:2002.09998  [pdf, other

    stat.ME cs.LG stat.CO stat.ML

    Generalized Bayesian Filtering via Sequential Monte Carlo

    Authors: Ayman Boustati, Ömer Deniz Akyildiz, Theodoros Damoulas, Adam M. Johansen

    Abstract: We introduce a framework for inference in general state-space hidden Markov models (HMMs) under likelihood misspecification. In particular, we leverage the loss-theoretic perspective of Generalized Bayesian Inference (GBI) to define generalised filtering recursions in HMMs, that can tackle the problem of inference under model misspecification. In doing so, we arrive at principled procedures for ro… ▽ More

    Submitted 21 October, 2020; v1 submitted 23 February, 2020; originally announced February 2020.

  17. arXiv:1910.03906  [pdf, other

    stat.ML cs.LG stat.CO

    Probabilistic sequential matrix factorization

    Authors: Ömer Deniz Akyildiz, Gerrit J. J. van den Burg, Theodoros Damoulas, Mark F. J. Steel

    Abstract: We introduce the probabilistic sequential matrix factorization (PSMF) method for factorizing time-varying and non-stationary datasets consisting of high-dimensional time-series. In particular, we consider nonlinear Gaussian state-space models where sequential approximate inference results in the factorization of a data matrix into a dictionary and time-varying coefficients with potentially nonline… ▽ More

    Submitted 18 March, 2021; v1 submitted 9 October, 2019; originally announced October 2019.

    Comments: Accepted for publication at AISTATS 2021

  18. arXiv:1910.02008  [pdf, ps, other

    math.ST cs.LG math.PR stat.ML

    Nonasymptotic estimates for Stochastic Gradient Langevin Dynamics under local conditions in nonconvex optimization

    Authors: Ying Zhang, Ömer Deniz Akyildiz, Theodoros Damoulas, Sotirios Sabanis

    Abstract: In this paper, we are concerned with a non-asymptotic analysis of sampling algorithms used in nonconvex optimization. In particular, we obtain non-asymptotic estimates in Wasserstein-1 and Wasserstein-2 distances for a popular class of algorithms called Stochastic Gradient Langevin Dynamics (SGLD). In addition, the aforementioned Wasserstein-2 convergence result can be applied to establish a non-a… ▽ More

    Submitted 14 October, 2022; v1 submitted 4 October, 2019; originally announced October 2019.

    Comments: 38 pages

    MSC Class: 60J20; 60J22; 65C05; 65C40; 62D05

  19. arXiv:1906.08344  [pdf, other

    stat.ML cs.LG

    Multi-resolution Multi-task Gaussian Processes

    Authors: Oliver Hamelijnck, Theodoros Damoulas, Kangrui Wang, Mark Girolami

    Abstract: We consider evidence integration from potentially dependent observation processes under varying spatio-temporal sampling resolutions and noise levels. We develop a multi-resolution multi-task (MRGP) framework while allowing for both inter-task and intra-task multi-resolution and multi-fidelity. We develop shallow Gaussian Process (GP) mixtures that approximate the difficult to estimate joint likel… ▽ More

    Submitted 5 November, 2019; v1 submitted 19 June, 2019; originally announced June 2019.

  20. arXiv:1906.03161  [pdf, other

    stat.ML cs.LG stat.AP

    Structured Variational Inference in Continuous Cox Process Models

    Authors: Virginia Aglietti, Edwin V. Bonilla, Theodoros Damoulas, Sally Cripps

    Abstract: We propose a scalable framework for inference in an inhomogeneous Poisson process modeled by a continuous sigmoidal Cox process that assumes the corresponding intensity function is given by a Gaussian process (GP) prior transformed with a scaled logistic sigmoid function. We present a tractable representation of the likelihood through augmentation with a superposition of Poisson processes. This vi… ▽ More

    Submitted 7 June, 2019; originally announced June 2019.

  21. arXiv:1905.12407  [pdf, other

    stat.ML cs.LG

    Non-linear Multitask Learning with Deep Gaussian Processes

    Authors: Ayman Boustati, Theodoros Damoulas, Richard S. Savage

    Abstract: We present a multi-task learning formulation for Deep Gaussian processes (DGPs), through non-linear mixtures of latent processes. The latent space is composed of private processes that capture within-task information and shared processes that capture across-task dependencies. We propose two different methods for segmenting the latent space: through hard coding shared and task-specific processes or… ▽ More

    Submitted 23 February, 2020; v1 submitted 29 May, 2019; originally announced May 2019.

  22. arXiv:1904.02063  [pdf, other

    stat.ML cs.AI cs.LG

    Generalized Variational Inference: Three arguments for deriving new Posteriors

    Authors: Jeremias Knoblauch, Jack Jewson, Theodoros Damoulas

    Abstract: We advocate an optimization-centric view on and introduce a novel generalization of Bayesian inference. Our inspiration is the representation of Bayes' rule as infinite-dimensional optimization problem (Csiszar, 1975; Donsker and Varadhan; 1975, Zellner; 1988). First, we use it to prove an optimality result of standard Variational Inference (VI): Under the proposed view, the standard Evidence Lowe… ▽ More

    Submitted 12 December, 2019; v1 submitted 3 April, 2019; originally announced April 2019.

    Comments: 103 pages, 23 figures (comprehensive revision of previous version)

  23. arXiv:1806.02261  [pdf, other

    stat.ML cs.LG

    Doubly Robust Bayesian Inference for Non-Stationary Streaming Data with $β$-Divergences

    Authors: Jeremias Knoblauch, Jack Jewson, Theodoros Damoulas

    Abstract: We present the very first robust Bayesian Online Changepoint Detection algorithm through General Bayesian Inference (GBI) with $β$-divergences. The resulting inference procedure is doubly robust for both the parameter and the changepoint (CP) posterior, with linear time and constant space complexity. We provide a construction for exponential models and demonstrate it on the Bayesian Linear Regress… ▽ More

    Submitted 27 November, 2018; v1 submitted 6 June, 2018; originally announced June 2018.

    Comments: 39 pages, 11 figures, published at Neural Information Processing Systems (NeurIPS) 2018

    Journal ref: Neural Information Processing Systems (NeurIPS) 2018

  24. arXiv:1805.09781  [pdf, other

    stat.ML cs.LG

    Efficient Inference in Multi-task Cox Process Models

    Authors: Virginia Aglietti, Theodoros Damoulas, Edwin Bonilla

    Abstract: We generalize the log Gaussian Cox process (LGCP) framework to model multiple correlated point data jointly. The observations are treated as realizations of multiple LGCPs, whose log intensities are given by linear combinations of latent functions drawn from Gaussian process priors. The combination coefficients are also drawn from Gaussian processes and can incorporate additional dependencies. We… ▽ More

    Submitted 15 March, 2019; v1 submitted 24 May, 2018; originally announced May 2018.

  25. arXiv:1805.05383  [pdf, other

    stat.ML cs.LG stat.ME

    Spatio-temporal Bayesian On-line Changepoint Detection with Model Selection

    Authors: Jeremias Knoblauch, Theodoros Damoulas

    Abstract: Bayesian On-line Changepoint Detection is extended to on-line model selection and non-stationary spatio-temporal processes. We propose spatially structured Vector Autoregressions (VARs) for modelling the process between changepoints (CPs) and give an upper bound on the approximation error of such models. The resulting algorithm performs prediction, model selection and CP detection on-line. Its tim… ▽ More

    Submitted 6 June, 2018; v1 submitted 14 May, 2018; originally announced May 2018.

    Comments: 10 pages, 7f figures, to appear in Proceedings of the 35th International Conference on Machine Learning 2018

  26. arXiv:1804.01431  [pdf, other

    stat.CO

    Posterior Inference for Sparse Hierarchical Non-stationary Models

    Authors: Karla Monterrubio-Gómez, Lassi Roininen, Sara Wade, Theo Damoulas, Mark Girolami

    Abstract: Gaussian processes are valuable tools for non-parametric modelling, where typically an assumption of stationarity is employed. While removing this assumption can improve prediction, fitting such models is challenging. In this work, hierarchical models are constructed based on Gaussian Markov random fields with stochastic spatially varying parameters. Importantly, this allows for non-stationarity w… ▽ More

    Submitted 1 May, 2019; v1 submitted 4 April, 2018; originally announced April 2018.