-
Investigating the price determinants of the European Emission Trading System: a non-parametric approach
Authors:
Cristiano Salvagnin,
Aldo Glielmo,
Maria Elena De Giuli,
Antonietta Mira
Abstract:
The European carbon market plays a pivotal role in the European Union's ambitious target of achieving carbon neutrality by 2050. Understanding the intricacies of factors influencing European Union Emission Trading System (EU ETS) market prices is paramount for effective policy making and strategy implementation. We propose the use of the Information Imbalance, a recently introduced non-parametric…
▽ More
The European carbon market plays a pivotal role in the European Union's ambitious target of achieving carbon neutrality by 2050. Understanding the intricacies of factors influencing European Union Emission Trading System (EU ETS) market prices is paramount for effective policy making and strategy implementation. We propose the use of the Information Imbalance, a recently introduced non-parametric measure quantifying the degree to which a set of variables is informative with respect to another one, to study the relationships among macroeconomic, economic, uncertainty, and energy variables concerning EU ETS prices. Our analysis shows that in Phase 3 commodity related variables such as the ERIX index are the most informative to explain the behaviour of the EU ETS market price. Transitioning to Phase 4, financial fluctuations take centre stage, with the uncertainty in the EUR/CHF exchange rate emerging as a crucial determinant. These results reflect the disruptive impacts of the COVID-19 pandemic and the energy crisis in resha** the importance of the different variables. Beyond variable analysis, we also propose to leverage the Information Imbalance to address the problem of mixed-frequency forecasting, and we identify the weekly time scale as the most informative for predicting the EU ETS price. Finally, we show how the Information Imbalance can be effectively combined with Gaussian Process regression for efficient nowcasting and forecasting using very small sets of highly informative predictors.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
Federated Learning for Non-factorizable Models using Deep Generative Prior Approximations
Authors:
Conor Hassan,
Joshua J Bon,
Elizaveta Semenova,
Antonietta Mira,
Kerrie Mengersen
Abstract:
Federated learning (FL) allows for collaborative model training across decentralized clients while preserving privacy by avoiding data sharing. However, current FL methods assume conditional independence between client models, limiting the use of priors that capture dependence, such as Gaussian processes (GPs). We introduce the Structured Independence via deep Generative Model Approximation (SIGMA…
▽ More
Federated learning (FL) allows for collaborative model training across decentralized clients while preserving privacy by avoiding data sharing. However, current FL methods assume conditional independence between client models, limiting the use of priors that capture dependence, such as Gaussian processes (GPs). We introduce the Structured Independence via deep Generative Model Approximation (SIGMA) prior which enables FL for non-factorizable models across clients, expanding the applicability of FL to fields such as spatial statistics, epidemiology, environmental science, and other domains where modeling dependencies is crucial. The SIGMA prior is a pre-trained deep generative model that approximates the desired prior and induces a specified conditional independence structure in the latent variables, creating an approximate model suitable for FL settings. We demonstrate the SIGMA prior's effectiveness on synthetic data and showcase its utility in a real-world example of FL for spatial data, using a conditional autoregressive prior to model spatial dependence across Australia. Our work enables new FL applications in domains where modeling dependent data is essential for accurate predictions and decision-making.
△ Less
Submitted 25 May, 2024;
originally announced May 2024.
-
Likelihood distortion and Bayesian local robustness
Authors:
Antonio Di Noia,
Fabrizio Ruggeri,
Antonietta Mira
Abstract:
Robust Bayesian analysis has been mainly devoted to detecting and measuring robustness to the prior distribution. Indeed, many contributions in the literature aim to define suitable classes of priors which allow the computation of variations of quantities of interest while the prior changes within those classes. The literature has devoted much less attention to the robustness of Bayesian methods t…
▽ More
Robust Bayesian analysis has been mainly devoted to detecting and measuring robustness to the prior distribution. Indeed, many contributions in the literature aim to define suitable classes of priors which allow the computation of variations of quantities of interest while the prior changes within those classes. The literature has devoted much less attention to the robustness of Bayesian methods to the likelihood function due to mathematical and computational complexity, and because it is often arguably considered a more objective choice compared to the prior. In this contribution, a new approach to Bayesian local robustness to the likelihood function is proposed and extended to robustness to the prior and to both. This approach is based on the notion of distortion function introduced in the literature on risk theory, and then successfully adopted to build suitable classes of priors for Bayesian global robustness to the prior. The novel robustness measure is a local sensitivity measure that turns out to be very tractable and easy to compute for certain classes of distortion functions. Asymptotic properties are derived and numerical experiments illustrate the theory and its applicability for modelling purposes.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
Beyond the noise: intrinsic dimension estimation with optimal neighbourhood identification
Authors:
Antonio Di Noia,
Iuri Macocco,
Aldo Glielmo,
Alessandro Laio,
Antonietta Mira
Abstract:
The Intrinsic Dimension (ID) is a key concept in unsupervised learning and feature selection, as it is a lower bound to the number of variables which are necessary to describe a system. However, in almost any real-world dataset the ID depends on the scale at which the data are analysed. Quite typically at a small scale, the ID is very large, as the data are affected by measurement errors. At large…
▽ More
The Intrinsic Dimension (ID) is a key concept in unsupervised learning and feature selection, as it is a lower bound to the number of variables which are necessary to describe a system. However, in almost any real-world dataset the ID depends on the scale at which the data are analysed. Quite typically at a small scale, the ID is very large, as the data are affected by measurement errors. At large scale, the ID can also be erroneously large, due to the curvature and the topology of the manifold containing the data. In this work, we introduce an automatic protocol to select the sweet spot, namely the correct range of scales in which the ID is meaningful and useful. This protocol is based on imposing that for distances smaller than the correct scale the density of the data is constant. Since to estimate the density it is necessary to know the ID, this condition is imposed self-consistently. We illustrate the usefulness and robustness of this procedure by benchmarks on artificial and real-world datasets.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
Scalable Vertical Federated Learning via Data Augmentation and Amortized Inference
Authors:
Conor Hassan,
Matthew Sutton,
Antonietta Mira,
Kerrie Mengersen
Abstract:
Vertical federated learning (VFL) has emerged as a paradigm for collaborative model estimation across multiple clients, each holding a distinct set of covariates. This paper introduces the first comprehensive framework for fitting Bayesian models in the VFL setting. We propose a novel approach that leverages data augmentation techniques to transform VFL problems into a form compatible with existin…
▽ More
Vertical federated learning (VFL) has emerged as a paradigm for collaborative model estimation across multiple clients, each holding a distinct set of covariates. This paper introduces the first comprehensive framework for fitting Bayesian models in the VFL setting. We propose a novel approach that leverages data augmentation techniques to transform VFL problems into a form compatible with existing Bayesian federated learning algorithms. We present an innovative model formulation for specific VFL scenarios where the joint likelihood factorizes into a product of client-specific likelihoods. To mitigate the dimensionality challenge posed by data augmentation, which scales with the number of observations and clients, we develop a factorized amortized variational approximation that achieves scalability independent of the number of observations. We showcase the efficacy of our framework through extensive numerical experiments on logistic regression, multilevel regression, and a novel hierarchical Bayesian split neural net model. Our work paves the way for privacy-preserving, decentralized Bayesian inference in vertically partitioned data scenarios, opening up new avenues for research and applications in various domains.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Searching for WIMPs with TREX-DM: achievements and challenges
Authors:
Juan F. Castel,
Susana Cebrián,
Theopisti Dafni,
David Díez-Ibáñez,
Álvaro Ezquerro,
Javier Galán,
Juan Antonio García,
Igor G. Irastorza,
María Jiménez,
Gloria Luzón,
Cristina Margalejo,
Ángel de Mira,
Hector Mirallas,
Luis Obis,
Alfonso Ortiz de Solórzano,
Oscar Pérez,
Jaime Ruz,
Julia Vogel
Abstract:
The TREX-DM detector, a low background chamber with microbulk Micromegas readout, was commissioned in the underground laboratory of Canfranc (LSC) in 2018. Since then, data taking campaigns have been carried out with Argon and Neon mixtures, at different pressures from 1 to 4 bar. By achieving a low energy threshold of 1 keV$_{ee}$ and a background level of 80 counts keV$^{-1}$ Kg$^{-1}$ day…
▽ More
The TREX-DM detector, a low background chamber with microbulk Micromegas readout, was commissioned in the underground laboratory of Canfranc (LSC) in 2018. Since then, data taking campaigns have been carried out with Argon and Neon mixtures, at different pressures from 1 to 4 bar. By achieving a low energy threshold of 1 keV$_{ee}$ and a background level of 80 counts keV$^{-1}$ Kg$^{-1}$ day$^{-1}$ in the region from 1 to 7 keV$_{ee}$, the experiment demonstrates its potential to search for low-mass WIMPs. Two of the most important challenges currently faced are the reduction of both, background level and energy threshold. With respect to the energy threshold, recently a new readout plane is being developed, based on the combination of Micromegas and GEM technologies, aiming to have a pre-amplification stage that would permit very low energy thresholds, close to the single-electron ionization energy. With respect to the background reduction, apart from studies to identify and minimize contamination population, a high sensitivity alpha detector is being developed in order to allow a proper material selection for the TREX-DM detector components. Both challenges, together with the optimization of the gas mixture used as target for the WIMP detection, will take TREX-DM to explore regions of WIMP's mass below 1 GeV c$^{-2}$.
△ Less
Submitted 19 December, 2023;
originally announced December 2023.
-
Algorithms for the Global Domination Problem
Authors:
Ernesto Parra Inza,
Nodari Vakhania,
Jose M. Sigarreta Almira,
Frank A. Hernández Mira
Abstract:
A dominating set D in a graph G is a subset of its vertices such that every vertex of the graph which does not belong to set D is adjacent to at least one vertex from set D. A set of vertices of graph G is a global dominating set if it is a dominating set for both, graph G and its complement. The objective is to find a global dominating set with the minimum cardinality. The problem is known to be…
▽ More
A dominating set D in a graph G is a subset of its vertices such that every vertex of the graph which does not belong to set D is adjacent to at least one vertex from set D. A set of vertices of graph G is a global dominating set if it is a dominating set for both, graph G and its complement. The objective is to find a global dominating set with the minimum cardinality. The problem is known to be NP-hard. Neither exact nor approximation algorithm existed . We propose two exact solution methods, one of them being based on an integer linear program (ILP) formulation, three heuristic algorithms and a special purification procedure that further reduces the size of a global dominated set delivered by any of our heuristic algorithms. We show that the problem remains NP-hard for restricted types of graphs and specify some families of graphs for which the heuristics guarantee the optimality. The second exact algorithm turned out to be about twice faster than ILP for graphs with more than 230 vertices and up to 1080 vertices, which were the largest benchmark instances that were solved optimally. The heuristics were tested for the existing 2284 benchmark problem instances with up to 14000 vertices and delivered solutions for the largest instances in less than one minute. Remarkably, for about 52% of the 1000 instances with the obtained optimal solutions, at least one of the heuristics generated an optimal solution, where the average approximation error for the remaining instances was 1.07%.
△ Less
Submitted 7 December, 2023;
originally announced December 2023.
-
Bayesian Poisson Regression and Tensor Train Decomposition Model for Learning Mortality Pattern Changes during COVID-19 Pandemic
Authors:
Wei Zhang,
Antonietta Mira,
Ernst C. Wit
Abstract:
COVID-19 has led to excess deaths around the world, however it remains unclear how the mortality of other causes of death has changed during the pandemic. Aiming at understanding the wider impact of COVID-19 on other death causes, we study Italian data set that consists of monthly mortality counts of different causes from January 2015 to December 2020. Due to the high dimensional nature of the dat…
▽ More
COVID-19 has led to excess deaths around the world, however it remains unclear how the mortality of other causes of death has changed during the pandemic. Aiming at understanding the wider impact of COVID-19 on other death causes, we study Italian data set that consists of monthly mortality counts of different causes from January 2015 to December 2020. Due to the high dimensional nature of the data, we develop a model which combines conventional Poisson regression with tensor train decomposition to explore the lower dimensional residual structure of the data. We take a Bayesian approach, impose priors on model parameters. Posterior inference is performed using an efficient Metropolis-Hastings within Gibbs algorithm. The validity of our approach is tested in simulation studies. Our method not only identifies differential effects of interventions on cause specific mortality rates through the Poisson regression component, but also offers informative interpretations of the relationship between COVID-19 and other causes of death as well as latent classes that underline demographic characteristics, temporal patterns and causes of death respectively.
△ Less
Submitted 11 July, 2023;
originally announced July 2023.
-
Exact and Heuristic Algorithms for the Domination Problem
Authors:
Ernesto Parra Inza,
Frank Angel Hernández Mira,
José María Sigarreta Almira,
Nodari Vakhania
Abstract:
In a simple connected graph $G=(V,E)$, a subset of vertices $S \subseteq V$ is a dominating set if any vertex $v \in V\setminus S$ is adjacent to some vertex $x$ from this subset. A number of real-life problems can be modeled using this problem which is known to be among the difficult NP-hard problems in its class. We formulate the problem as an integer liner program (ILP) and compare the performa…
▽ More
In a simple connected graph $G=(V,E)$, a subset of vertices $S \subseteq V$ is a dominating set if any vertex $v \in V\setminus S$ is adjacent to some vertex $x$ from this subset. A number of real-life problems can be modeled using this problem which is known to be among the difficult NP-hard problems in its class. We formulate the problem as an integer liner program (ILP) and compare the performance with the two earlier existing exact state-of-the-art algorithms and exact implicit enumeration and heuristic algorithms that we propose here. Our exact algorithm was able to find optimal solutions much faster than ILP and the above two exact algorithms for middle-dense instances. For graphs with a considerable size, our heuristic algorithm was much faster than both, ILP and our exact algorithm. It found an optimal solution for more than half of the tested instances, whereas it improved the earlier known state-of-the-art solutions for almost all the tested benchmark instances. Among the instances where the optimum was not found, it gave an average approximation error of $1.18$.
△ Less
Submitted 16 June, 2023; v1 submitted 13 November, 2022;
originally announced November 2022.
-
Investigating field-induced magnetic order in Han Purple by neutron scattering up to 25.9 T
Authors:
S. Allenspach,
A. Madsen,
A. Biffin,
M. Bartkowiak,
O. Prokhnenko,
A. Gazizulina,
X. Liu,
R. Wahle,
S. Gerischer,
S. Kempfer,
P. Heller,
P. Smeibidl,
A. Mira,
N. Laflorencie,
F. Mila,
B. Normand,
Ch. Rüegg
Abstract:
BaCuSi$_2$O$_6$ is a quasi-two-dimensional (2D) quantum antiferromagnet containing three different types of stacked, square-lattice bilayer hosting spin-1/2 dimers. Although this compound has been studied extensively over the last two decades, the critical applied magnetic field required to close the dimer spin gap and induce magnetic order, which exceeds 23 T, has to date precluded any kind of ne…
▽ More
BaCuSi$_2$O$_6$ is a quasi-two-dimensional (2D) quantum antiferromagnet containing three different types of stacked, square-lattice bilayer hosting spin-1/2 dimers. Although this compound has been studied extensively over the last two decades, the critical applied magnetic field required to close the dimer spin gap and induce magnetic order, which exceeds 23 T, has to date precluded any kind of neutron scattering investigation. However, the HFM/EXED instrument at the Helmholtz-Zentrum Berlin made this possible at magnetic fields up to 25.9 T. Thus we have used HFM/EXED to investigate the field-induced ordered phase, in particular to look for quasi-2D physics arising from the layered structure and from the different bilayer types. From neutron diffraction data, we determined the global dependence of the magnetic order parameter on both magnetic field and temperature, finding a form consistent with 3D quantum critical scaling; from this we deduce that the quasi-2D interactions and nonuniform layering of BaCuSi$_2$O$_6$ are not anisotropic enough to induce hallmarks of 2D physics. From neutron spectroscopy data, we measured the dispersion of the strongly Zeeman-split magnetic excitations, finding good agreement with the zero-field interaction parameters of BaCuSi$_2$O$_6$. We conclude that HFM/EXED allowed a significant extension in the application of neutron scattering techniques to the field range above 20 T and in particular opened new horizons in the study of field-induced magnetic quantum phase transitions.
△ Less
Submitted 17 September, 2022; v1 submitted 25 July, 2022;
originally announced July 2022.
-
Multiple hypothesis screening using mixtures of non-local distributions with applications to genomic studies
Authors:
Francesco Denti,
Stefano Peluso,
Michele Guindani,
Antonietta Mira
Abstract:
The analysis of large-scale datasets, especially in biomedical contexts, frequently involves a principled screening of multiple hypotheses. The celebrated two-group model jointly models the distribution of the test statistics with mixtures of two competing densities, the null and the alternative distributions. We investigate the use of weighted densities and, in particular, non-local densities as…
▽ More
The analysis of large-scale datasets, especially in biomedical contexts, frequently involves a principled screening of multiple hypotheses. The celebrated two-group model jointly models the distribution of the test statistics with mixtures of two competing densities, the null and the alternative distributions. We investigate the use of weighted densities and, in particular, non-local densities as working alternative distributions, to enforce separation from the null and thus refine the screening procedure. We show how these weighted alternatives improve various operating characteristics, such as the Bayesian False Discovery rate, of the resulting tests for a fixed mixture proportion with respect to a local, unweighted likelihood approach. Parametric and nonparametric model specifications are proposed, along with efficient samplers for posterior inference. By means of a simulation study, we exhibit how our model compares with both well-established and state-of-the-art alternatives in terms of various operating characteristics. Finally, to illustrate the versatility of our method, we conduct three differential expression analyses with publicly-available datasets from genomic studies of heterogeneous nature.
△ Less
Submitted 9 March, 2023; v1 submitted 2 May, 2022;
originally announced May 2022.
-
A predictive model for planning emergency events rescue during COVID-19 in Lombardy, Italy
Authors:
Angela Andreella,
Antonietta Mira,
Spyros Balafas,
Ernst C. Wit,
Fabrizio Ruggeri,
Giovanni Nattino,
Giulia Ghilardi,
Guido Bertolini
Abstract:
Italy, particularly the Lombardy region, was among the first countries outside of Asia to report cases of COVID-19. The emergency medical service called Regional Emergency Agency (AREU) coordinates the intra- and inter-regional non-hospital emergency network and the European emergency number service in Lombardy. AREU must deal with daily and seasonal variations of call volume. The number and type…
▽ More
Italy, particularly the Lombardy region, was among the first countries outside of Asia to report cases of COVID-19. The emergency medical service called Regional Emergency Agency (AREU) coordinates the intra- and inter-regional non-hospital emergency network and the European emergency number service in Lombardy. AREU must deal with daily and seasonal variations of call volume. The number and type of emergency calls changed dramatically during the COVID-19 pandemic. A model to predict incoming calls and how many of these turn into events, i.e., dispatch of transport and equipment until the rescue is completed, was developed to address the emergency period. We used the generalized additive model with a negative binomial family to predict the number of events one, two, five, and seven days ahead. The over-dispersion of the data was tackled by using the negative binomial family and the nonlinear relationship between the number of events and covariates (e.g., seasonal effects) by smoothing splines. The model coefficients show the effect of variables, e.g., the day of the week, on the number of events and how these effects change during the pre-COVID-19 period. The proposed model returns reasonable mean absolute errors for most of the 2020-2021 period.
△ Less
Submitted 27 March, 2022;
originally announced March 2022.
-
On the intrinsic dimensionality of Covid-19 data: a global perspective
Authors:
Abhishek Varghese,
Edgar Santos-Fernandez,
Francesco Denti,
Antonietta Mira,
Kerrie Mengersen
Abstract:
This paper aims to develop a global perspective of the complexity of the relationship between the standardised per-capita growth rate of Covid-19 cases, deaths, and the OxCGRT Covid-19 Stringency Index, a measure describing a country's stringency of lockdown policies. To achieve our goal, we use a heterogeneous intrinsic dimension estimator implemented as a Bayesian mixture model, called Hidalgo.…
▽ More
This paper aims to develop a global perspective of the complexity of the relationship between the standardised per-capita growth rate of Covid-19 cases, deaths, and the OxCGRT Covid-19 Stringency Index, a measure describing a country's stringency of lockdown policies. To achieve our goal, we use a heterogeneous intrinsic dimension estimator implemented as a Bayesian mixture model, called Hidalgo. We identify that the Covid-19 dataset may project onto two low-dimensional manifolds without significant information loss. The low dimensionality suggests strong dependency among the standardised growth rates of cases and deaths per capita and the OxCGRT Covid-19 Stringency Index for a country over 2020-2021. Given the low dimensional structure, it may be feasible to model observable Covid-19 dynamics with few parameters. Importantly, we identify spatial autocorrelation in the intrinsic dimension distribution worldwide. Moreover, we highlight that high-income countries are more likely to lie on low-dimensional manifolds, likely arising from aging populations, comorbidities, and increased per capita mortality burden from Covid-19. Finally, we temporally stratify the dataset to examine the intrinsic dimension at a more granular level throughout the Covid-19 pandemic.
△ Less
Submitted 8 March, 2022;
originally announced March 2022.
-
Learning Summary Statistics for Bayesian Inference with Autoencoders
Authors:
Carlo Albert,
Simone Ulzega,
Firat Ozdemir,
Fernando Perez-Cruz,
Antonietta Mira
Abstract:
For stochastic models with intractable likelihood functions, approximate Bayesian computation offers a way of approximating the true posterior through repeated comparisons of observations with simulated model outputs in terms of a small set of summary statistics. These statistics need to retain the information that is relevant for constraining the parameters but cancel out the noise. They can thus…
▽ More
For stochastic models with intractable likelihood functions, approximate Bayesian computation offers a way of approximating the true posterior through repeated comparisons of observations with simulated model outputs in terms of a small set of summary statistics. These statistics need to retain the information that is relevant for constraining the parameters but cancel out the noise. They can thus be seen as thermodynamic state variables, for general stochastic models. For many scientific applications, we need strictly more summary statistics than model parameters to reach a satisfactory approximation of the posterior. Therefore, we propose to use the inner dimension of deep neural network based Autoencoders as summary statistics. To create an incentive for the encoder to encode all the parameter-related information but not the noise, we give the decoder access to explicit or implicit information on the noise that has been used to generate the training data. We validate the approach empirically on two types of stochastic models.
△ Less
Submitted 23 May, 2022; v1 submitted 28 January, 2022;
originally announced January 2022.
-
A Bayesian Semiparametric Vector Multiplicative Error Model
Authors:
Nicola Donelli,
Stefano Peluso,
Antonietta Mira
Abstract:
Interactions among multiple time series of positive random variables are crucial in diverse financial applications, from spillover effects to volatility interdependence. A popular model in this setting is the vector Multiplicative Error Model (vMEM) which poses a linear iterative structure on the dynamics of the conditional mean, perturbed by a multiplicative innovation term. A main limitation of…
▽ More
Interactions among multiple time series of positive random variables are crucial in diverse financial applications, from spillover effects to volatility interdependence. A popular model in this setting is the vector Multiplicative Error Model (vMEM) which poses a linear iterative structure on the dynamics of the conditional mean, perturbed by a multiplicative innovation term. A main limitation of vMEM is however its restrictive assumption on the distribution of the random innovation term. A Bayesian semiparametric approach that models the innovation vector as an infinite location-scale mixture of multidimensional kernels with support on the positive orthant is used to address this major shortcoming of vMEM. Computational complications arising from the constraints to the positive orthant are avoided through the formulation of a slice sampler on the parameter-extended unconstrained version of the model. The method is applied to simulated and real data and a flexible specification is obtained that outperforms the classical ones in terms of fitting and predictive power.
△ Less
Submitted 9 July, 2021;
originally announced July 2021.
-
A Bayesian latent allocation model for clustering compositional data with application to the Great Barrier Reef
Authors:
Luiza Piancastelli,
Nial Friel,
Julie Vercelloni,
Kerrie Mengersen,
Antonietta Mira
Abstract:
Relative abundance is a common metric to estimate the composition of species in ecological surveys reflecting patterns of commonness and rarity of biological assemblages. Measurements of coral reef compositions formed by four communities along Australia's Great Barrier Reef (GBR) gathered between 2012 and 2017 are the focus of this paper. We undertake the task of finding clusters of transect locat…
▽ More
Relative abundance is a common metric to estimate the composition of species in ecological surveys reflecting patterns of commonness and rarity of biological assemblages. Measurements of coral reef compositions formed by four communities along Australia's Great Barrier Reef (GBR) gathered between 2012 and 2017 are the focus of this paper. We undertake the task of finding clusters of transect locations with similar community composition and investigate changes in clustering dynamics over time. During these years, an unprecedented sequence of extreme weather events (cyclones and coral bleaching) impacted the 58 surveyed locations. The dependence between constituent parts of a composition presents a challenge for existing multivariate clustering approaches. In this paper, we introduce a finite mixture of Dirichlet distributions with group-specific parameters, where cluster memberships are dictated by unobserved latent variables. The inference is carried in a Bayesian framework, where MCMC strategies are outlined to sample from the posterior model. Simulation studies are presented to illustrate the performance of the model in a controlled setting. The application of the model to the 2012 coral reef data reveals that clusters were spatially distributed in similar ways across reefs which indicates a potential influence of wave exposure at the origin of coral reef community composition. The number of clusters estimated by the model decreased from four in 2012 to two from 2014 until 2017. Posterior probabilities of transect allocations to the same cluster substantially increase through time showing a potential homogenization of community composition across the whole GBR. The Bayesian model highlights the diversity of coral reef community composition within a coral reef and rapid changes across large spatial scales that may contribute to undermining the future of the GBR's biodiversity.
△ Less
Submitted 5 May, 2021;
originally announced May 2021.
-
On the mathematical axiomatization of approximate Bayesian computation. A robust set for estimating mechanistic network models through optimal transport
Authors:
Marco Tarsia,
Antonietta Mira,
Daniele Cassani
Abstract:
We research relations between optimal transport theory (OTT) and approximate Bayesian computation (ABC) possibly connected to relevant metrics defined on probability measures. Those of ABC are computational methods based on Bayesian statistics and applicable to a given generative model to estimate its a posteriori distribution in case the likelihood function is intractable. The idea is therefore t…
▽ More
We research relations between optimal transport theory (OTT) and approximate Bayesian computation (ABC) possibly connected to relevant metrics defined on probability measures. Those of ABC are computational methods based on Bayesian statistics and applicable to a given generative model to estimate its a posteriori distribution in case the likelihood function is intractable. The idea is therefore to simulate sets of synthetic data from the model with respect to assigned parameters and, rather than comparing prospects of these data with the corresponding observed values as typically ABC requires, to employ just a distance between a chosen distribution associated to the synthetic data and another of the observed values. Our focus lies in theoretical and methodological aspects, although there would exist a remarkable part of algorithmic implementation, and more precisely issues regarding mathematical foundation and asymptotic properties are carefully analysed, inspired by an in-depth study of what is then our main bibliographic reference, that is Bernton et al. (2019), carrying out what follows: a rigorous formulation of the set-up for the ABC rejection algorithm, also to regain a transparent and general result of convergence as the ABC threshold goes to zero whereas the number n of samples from the prior stays fixed; general technical proposals about distances leaning on OTT; weak assumptions which lead to lower bounds for small values of threshold and as n goes to infinity, ultimately showing a reasonable possibility of lack of concentration which is contrary to what is proposed in Bernton et al. (2019) itself.
△ Less
Submitted 5 May, 2021;
originally announced May 2021.
-
Distributional Results for Model-Based Intrinsic Dimension Estimators
Authors:
Francesco Denti,
Diego Doimo,
Alessandro Laio,
Antonietta Mira
Abstract:
Modern datasets are characterized by a large number of features that may conceal complex dependency structures. To deal with this type of data, dimensionality reduction techniques are essential. Numerous dimensionality reduction methods rely on the concept of intrinsic dimension, a measure of the complexity of the dataset. In this article, we first review the TWO-NN model, a likelihood-based intri…
▽ More
Modern datasets are characterized by a large number of features that may conceal complex dependency structures. To deal with this type of data, dimensionality reduction techniques are essential. Numerous dimensionality reduction methods rely on the concept of intrinsic dimension, a measure of the complexity of the dataset. In this article, we first review the TWO-NN model, a likelihood-based intrinsic dimension estimator recently introduced in the literature. The TWO-NN estimator is based on the statistical properties of the ratio of the distances between a point and its first two nearest neighbors, assuming that the points are a realization from an homogeneous Poisson point process. We extend the TWO-NN theoretical framework by providing novel distributional results of consecutive and generic ratios of distances. These distributional results are then employed to derive intrinsic dimension estimators, called Cride and Gride. These novel estimators are more robust to noisy measurements than the TWO-NN and allow the study of the evolution of the intrinsic dimension as a function of the scale used to analyze the dataset. We discuss the properties of the different estimators with the help of simulation scenarios.
△ Less
Submitted 1 June, 2021; v1 submitted 28 April, 2021;
originally announced April 2021.
-
Revealing three-dimensional quantum criticality by Sr-substitution in Han Purple
Authors:
Stephan Allenspach,
Pascal Puphal,
Joosep Link,
Ivo Heinmaa,
Ekaterina Pomjakushina,
Cornelius Krellner,
Jakob Lass,
Gregory S. Tucker,
Christof Niedermayer,
Shusaku Imajo,
Yoshimitsu Kohama,
Koichi Kindo,
Steffen Krämer,
Mladen Horvatić,
Marcelo Jaime,
Alexander Madsen,
Antonietta Mira,
Nicolas Laflorencie,
Frédéric Mila,
Bruce Normand,
Christian Rüegg,
Raivo Stern,
Franziska Weickert
Abstract:
Classical and quantum phase transitions (QPTs), with their accompanying concepts of criticality and universality, are a cornerstone of statistical thermodynamics. An exemplary controlled QPT is the field-induced magnetic ordering of a gapped quantum magnet. Although numerous "quasi-one-dimensional" coupled spin-chain and -ladder materials are known whose ordering transition is three-dimensional (3…
▽ More
Classical and quantum phase transitions (QPTs), with their accompanying concepts of criticality and universality, are a cornerstone of statistical thermodynamics. An exemplary controlled QPT is the field-induced magnetic ordering of a gapped quantum magnet. Although numerous "quasi-one-dimensional" coupled spin-chain and -ladder materials are known whose ordering transition is three-dimensional (3D), quasi-2D systems are special for several physical reasons. Motivated by the ancient pigment Han Purple (BaCuSi$_{2}$O$_{6}$), a quasi-2D material displaying anomalous critical properties, we present a complete analysis of Ba$_{0.9}$Sr$_{0.1}$CuSi$_{2}$O$_{6}$. We measure the zero-field magnetic excitations by neutron spectroscopy and deduce the magnetic Hamiltonian. We probe the field-induced transition by combining magnetization, specific-heat, torque and magnetocalorimetric measurements with low-temperature nuclear magnetic resonance studies near the QPT. By a Bayesian statistical analysis and large-scale Quantum Monte Carlo simulations, we demonstrate unambiguously that observable 3D quantum critical scaling is restored by the structural simplification arising from light Sr-substitution in Han Purple.
△ Less
Submitted 9 June, 2021; v1 submitted 11 March, 2021;
originally announced March 2021.
-
Total Roman 2-domination in graphs
Authors:
Suitberto Cabrera Garcia,
Abel Cabrera Martinez,
Frank A. Hernandez Mira,
Ismael G. Yero
Abstract:
Given a graph $G=(V,E)$, a function $f:V\rightarrow \{0,1,2\}$ is a total Roman $\{2\}$-dominating function if: (1) every vertex $v\in V$ for which $f(v)=0$ satisfies that $\sum_{u\in N(v)}f(u)\geq 2$, where $N(v)$ represents the open neighborhood of $v$, and (2) every vertex $x\in V$ for which $f(x)\geq 1$ is adjacent to at least one vertex $y\in V$ such that $f(y)\geq 1$. The weight of the funct…
▽ More
Given a graph $G=(V,E)$, a function $f:V\rightarrow \{0,1,2\}$ is a total Roman $\{2\}$-dominating function if: (1) every vertex $v\in V$ for which $f(v)=0$ satisfies that $\sum_{u\in N(v)}f(u)\geq 2$, where $N(v)$ represents the open neighborhood of $v$, and (2) every vertex $x\in V$ for which $f(x)\geq 1$ is adjacent to at least one vertex $y\in V$ such that $f(y)\geq 1$. The weight of the function $f$ is defined as $ω(f)=\sum_{v\in V}f(v)$. The total Roman $\{2\}$-domination number, denoted by $γ_{t\{R2\}}(G)$, is the minimum weight among all total Roman $\{2\}$-dominating functions on $G$. In this article we introduce the concepts above and begin the study of its combinatorial and computational properties. For instance, we give several closed relationships between this parameter and other domination related parameters in graphs. In addition, we prove that the complexity of computing the value $γ_{t\{R2\}}(G)$ is NP-hard, even when restricted to bipartite or chordal graphs.
△ Less
Submitted 7 January, 2021;
originally announced January 2021.
-
Scalable Approximate Bayesian Computation for Growing Network Models via Extrapolated and Sampled Summaries
Authors:
Louis Raynal,
Sixing Chen,
Antonietta Mira,
Jukka-Pekka Onnela
Abstract:
Approximate Bayesian computation (ABC) is a simulation-based likelihood-free method applicable to both model selection and parameter estimation. ABC parameter estimation requires the ability to forward simulate datasets from a candidate model, but because the sizes of the observed and simulated datasets usually need to match, this can be computationally expensive. Additionally, since ABC inference…
▽ More
Approximate Bayesian computation (ABC) is a simulation-based likelihood-free method applicable to both model selection and parameter estimation. ABC parameter estimation requires the ability to forward simulate datasets from a candidate model, but because the sizes of the observed and simulated datasets usually need to match, this can be computationally expensive. Additionally, since ABC inference is based on comparisons of summary statistics computed on the observed and simulated data, using computationally expensive summary statistics can lead to further losses in efficiency. ABC has recently been applied to the family of mechanistic network models, an area that has traditionally lacked tools for inference and model choice. Mechanistic models of network growth repeatedly add nodes to a network until it reaches the size of the observed network, which may be of the order of millions of nodes. With ABC, this process can quickly become computationally prohibitive due to the resource intensive nature of network simulations and evaluation of summary statistics. We propose two methodological developments to enable the use of ABC for inference in models for large growing networks. First, to save time needed for forward simulating model realizations, we propose a procedure to extrapolate (via both least squares and Gaussian processes) summary statistics from small to large networks. Second, to reduce computation time for evaluating summary statistics, we use sample-based rather than census-based summary statistics. We show that the ABC posterior obtained through this approach, which adds two additional layers of approximation to the standard ABC, is similar to a classic ABC posterior. Although we deal with growing network models, both extrapolated summaries and sampled summaries are expected to be relevant in other ABC settings where the data are generated incrementally.
△ Less
Submitted 9 November, 2020;
originally announced November 2020.
-
Personalized pathology test for Cardio-vascular disease: Approximate Bayesian computation with discriminative summary statistics learning
Authors:
Ritabrata Dutta,
Karim Zouaoui-Boudjeltia,
Christos Kotsalos,
Alexandre Rousseau,
Daniel Ribeiro de Sousa,
Jean-Marc Desmet,
Alain Van Meerhaeghe,
Antonietta Mira,
Bastien Chopard
Abstract:
Cardio/cerebrovascular diseases (CVD) have become one of the major health issue in our societies. But recent studies show that the present pathology tests to detect CVD are ineffectual as they do not consider different stages of platelet activation or the molecular dynamics involved in platelet interactions and are incapable to consider inter-individual variability. Here we propose a stochastic pl…
▽ More
Cardio/cerebrovascular diseases (CVD) have become one of the major health issue in our societies. But recent studies show that the present pathology tests to detect CVD are ineffectual as they do not consider different stages of platelet activation or the molecular dynamics involved in platelet interactions and are incapable to consider inter-individual variability. Here we propose a stochastic platelet deposition model and an inferential scheme to estimate the biologically meaningful model parameters using approximate Bayesian computation with a summary statistic that maximally discriminates between different types of patients. Inferred parameters from data collected on healthy volunteers and different patient types help us to identify specific biological parameters and hence biological reasoning behind the dysfunction for each type of patients. This work opens up an unprecedented opportunity of personalized pathology test for CVD detection and medical treatment.
△ Less
Submitted 9 February, 2022; v1 submitted 13 October, 2020;
originally announced October 2020.
-
A Common Atom Model for the Bayesian Nonparametric Analysis of Nested Data
Authors:
Francesco Denti,
Federico Camerlenghi,
Michele Guindani,
Antonietta Mira
Abstract:
The use of high-dimensional data for targeted therapeutic interventions requires new ways to characterize the heterogeneity observed across subgroups of a specific population. In particular, models for partially exchangeable data are needed for inference on nested datasets, where the observations are assumed to be organized in different units and some sharing of information is required to learn di…
▽ More
The use of high-dimensional data for targeted therapeutic interventions requires new ways to characterize the heterogeneity observed across subgroups of a specific population. In particular, models for partially exchangeable data are needed for inference on nested datasets, where the observations are assumed to be organized in different units and some sharing of information is required to learn distinctive features of the units. In this manuscript, we propose a nested Common Atoms Model (CAM) that is particularly suited for the analysis of nested datasets where the distributions of the units are expected to differ only over a small fraction of the observations sampled from each unit. The proposed CAM allows a two-layered clustering at the distributional and observational level and is amenable to scalable posterior inference through the use of a computationally efficient nested slice-sampler algorithm. We further discuss how to extend the proposed modeling framework to handle discrete measurements, and we conduct posterior inference on a real microbiome dataset from a diet swap study to investigate how the alterations in intestinal microbiota composition are associated with different eating habits. We further investigate the performance of our model in capturing true distributional structures in the population by means of a simulation study.
△ Less
Submitted 17 August, 2020;
originally announced August 2020.
-
A note on total co-independent domination in trees
Authors:
Abel Cabrera Martínez,
Frank A. Hernández Mira,
José M. Sigarreta Almira,
Ismael G. Yero
Abstract:
A set $D$ of vertices of a graph $G$ is a total dominating set if every vertex of $G$ is adjacent to at least one vertex of $D$. The total domination number of $G$ is the minimum cardinality of any total dominating set of $G$ and is denoted by $γ_t(G)$. The total dominating set $D$ is called a total co-independent dominating set if $V(G)\setminus D$ is an independent set and has at least one verte…
▽ More
A set $D$ of vertices of a graph $G$ is a total dominating set if every vertex of $G$ is adjacent to at least one vertex of $D$. The total domination number of $G$ is the minimum cardinality of any total dominating set of $G$ and is denoted by $γ_t(G)$. The total dominating set $D$ is called a total co-independent dominating set if $V(G)\setminus D$ is an independent set and has at least one vertex. The minimum cardinality of any total co-independent dominating set is denoted by $γ_{t,coi}(G)$. In this paper, we show that, for any tree $T$ of order $n$ and diameter at least three, $n-β(T)\leq γ_{t,coi}(T)\leq n-|L(T)|$ where $β(T)$ is the maximum cardinality of any independent set and $L(T)$ is the set of leaves of $T$. We also characterize the families of trees attaining the extremal bounds above and show that the differences between the value of $γ_{t,coi}(T)$ and these bounds can be arbitrarily large for some classes of trees.
△ Less
Submitted 2 May, 2020;
originally announced May 2020.
-
The role of intrinsic dimension in high-resolution player tracking data -- Insights in basketball
Authors:
Edgar Santos-Fernandez,
Francesco Denti,
Kerrie Mengersen,
Antonietta Mira
Abstract:
A new range of statistical analysis has emerged in sports after the introduction of the high-resolution player tracking technology, specifically in basketball. However, this high dimensional data is often challenging for statistical inference and decision making. In this article, we employ Hidalgo, a state-of-the-art Bayesian mixture model that allows the estimation of heterogeneous intrinsic dime…
▽ More
A new range of statistical analysis has emerged in sports after the introduction of the high-resolution player tracking technology, specifically in basketball. However, this high dimensional data is often challenging for statistical inference and decision making. In this article, we employ Hidalgo, a state-of-the-art Bayesian mixture model that allows the estimation of heterogeneous intrinsic dimensions (ID) within a dataset and propose some theoretical enhancements. ID results can be interpreted as indicators of variability and complexity of basketball plays and games. This technique allows classification and clustering of NBA basketball player's movement and shot charts data. Analyzing movement data, Hidalgo identifies key stages of offensive actions such as creating space for passing, preparation/shooting and following through. We found that the ID value spikes reaching a peak between 4 and 8 seconds in the offensive part of the court after which it declines. In shot charts, we obtained groups of shots that produce substantially higher and lower successes. Overall, game-winners tend to have a larger intrinsic dimension which is an indication of more unpredictability and unique shot placements. Similarly, we found higher ID values in plays when the score margin is small compared to large margin ones. These outcomes could be exploited by coaches to obtain better offensive/defensive results.
△ Less
Submitted 10 February, 2020;
originally announced February 2020.
-
Estimating a novel stochastic model for within-field disease dynamics of banana bunchy top virus via approximate Bayesian computation
Authors:
Abhishek Varghese,
Christopher Drovandi,
Kerrie Mengersen,
Antonietta Mira
Abstract:
The Banana Bunchy Top Virus (BBTV) is one of the most economically important vector-borne banana diseases throughout the Asia-Pacific Basin and presents a significant challenge to the agricultural sector. Current models of BBTV are largely deterministic, limited by an incomplete understanding of interactions in complex natural systems, and the appropriate identification of parameters. A stochastic…
▽ More
The Banana Bunchy Top Virus (BBTV) is one of the most economically important vector-borne banana diseases throughout the Asia-Pacific Basin and presents a significant challenge to the agricultural sector. Current models of BBTV are largely deterministic, limited by an incomplete understanding of interactions in complex natural systems, and the appropriate identification of parameters. A stochastic network-based Susceptible-Infected model has been created which simulates the spread of BBTV across the subsections of a banana plantation, parameterising nodal recovery, neighbouring and distant infectivity across summer and winter. Findings from posterior results achieved through Markov Chain Monte Carlo approach to approximate Bayesian computation suggest seasonality in all parameters, which are influenced by correlated changes in inspection accuracy, temperatures and aphid activity. This paper demonstrates how the model may be used for monitoring and forecasting of various disease management strategies to support policy-level decision making.
△ Less
Submitted 16 March, 2020; v1 submitted 4 September, 2019;
originally announced September 2019.
-
Conditionally Gaussian Random Sequences for an Integrated Variance Estimator with Correlation between Noise and Returns
Authors:
Stefano Peluso,
Antonietta Mira,
Pietro Muliere
Abstract:
Correlation between microstructure noise and latent financial logarithmic returns is an empirically relevant phenomenon with sound theoretical justification. With few notable exceptions, all integrated variance estimators proposed in the financial literature are not designed to explicitly handle such a dependence, or handle it only in special settings. We provide an integrated variance estimator t…
▽ More
Correlation between microstructure noise and latent financial logarithmic returns is an empirically relevant phenomenon with sound theoretical justification. With few notable exceptions, all integrated variance estimators proposed in the financial literature are not designed to explicitly handle such a dependence, or handle it only in special settings. We provide an integrated variance estimator that is robust to correlated noise and returns. For this purpose, a generalization of the Forward Filtering Backward Sampling algorithm is proposed, to provide a sampling technique for a latent conditionally Gaussian random sequence. We apply our methodology to intra-day Microsoft prices, and compare it in a simulation study with established alternatives, showing an advantage in terms of root mean square error and dispersion.
△ Less
Submitted 28 May, 2019;
originally announced May 2019.
-
Data segmentation based on the local intrinsic dimension
Authors:
Michele Allegra,
Elena Facco,
Francesco Denti,
Alessandro Laio,
Antonietta Mira
Abstract:
One of the founding paradigms of machine learning is that a small number of variables is often sufficient to describe high-dimensional data. The minimum number of variables required is called the intrinsic dimension (ID) of the data. Contrary to common intuition, there are cases where the ID varies within the same data set. This fact has been highlighted in technical discussions, but seldom exploi…
▽ More
One of the founding paradigms of machine learning is that a small number of variables is often sufficient to describe high-dimensional data. The minimum number of variables required is called the intrinsic dimension (ID) of the data. Contrary to common intuition, there are cases where the ID varies within the same data set. This fact has been highlighted in technical discussions, but seldom exploited to analyze large data sets and obtain insight into their structure. Here we develop a robust approach to discriminate regions with different local IDs and segment the points accordingly. Our approach is computationally efficient and can be proficiently used even on large data sets. We find that many real-world data sets contain regions with widely heterogeneous dimensions. These regions host points differing in core properties: folded vs unfolded configurations in a protein molecular dynamics trajectory, active vs non-active regions in brain imaging data, and firms with different financial risk in company balance sheets. A simple topological feature, the local ID, is thus sufficient to achieve an unsupervised segmentation of high-dimensional data, complementary to the one given by clustering algorithms.
△ Less
Submitted 13 July, 2020; v1 submitted 27 February, 2019;
originally announced February 2019.
-
A biomechanical breast model evaluated with respect to MRI data collected in three different positions
Authors:
Anna Mîra,
Ann-Katherine Carton,
Serge Muller,
Yohan Payan
Abstract:
Background: Mammography is a specific type of breast imaging that uses low-dose X-rays to detect cancer in early stage. During the exam, the women breast is compressed between two plates in order to even out the breast thickness and to spread out the soft tissues. This technique improves exam quality but can be uncomfortable for the patient. The perceived discomfort can be assessed by the means of…
▽ More
Background: Mammography is a specific type of breast imaging that uses low-dose X-rays to detect cancer in early stage. During the exam, the women breast is compressed between two plates in order to even out the breast thickness and to spread out the soft tissues. This technique improves exam quality but can be uncomfortable for the patient. The perceived discomfort can be assessed by the means of a breast biomechanical model. Alternative breast compression techniques may be computationally investigated trough finite elements simulations.Methods: The aim of this work is to develop and evaluate a new biomechanical Finite Element (FE) breast model. The complex breast anatomy is considered including adipose and glandular tissues, muscle, skin, suspensory ligaments and pectoral fascias. Material hyper-elasticity is modeled using the Neo-Hookean material models. The stress-free breast geometry and subject-specific constitutive models are derived using tissues deformations measurements from MR images.Findings: The breast geometry in three breast configurations were computed using the breast stress-free geometry together with the estimated set of equivalent Young's modulus (Ebreast_r=0.3 kPa, Ebreast_l=0.2 kPa, Eskin=4 kPa, Efascia=120 kPa). The Hausdorff distance between estimated and measured breast geometries for prone, supine and supine tilted configurations is equal to 2.17 mm, 1.72mm and 5.90mm respectively.Interpretation: A subject-specific breast model allows a better characterization of breast mechanics. However, the model presents some limitations when estimating the supine tilted breast configuration. The results show clearly the difficulties to characterize soft tissues mechanics at large strain ranges with Neo-Hookean material models.
△ Less
Submitted 26 November, 2018;
originally announced November 2018.
-
Simulation of breast compression using a new biomechanical model
Authors:
Anna Mîra,
Yohan Payan,
Ann-Katherine Carton,
Pablo Milioni de Carvalho,
Zhi** Li,
Viviane Devauges,
Serge Muller
Abstract:
Mammography is currently the primary imaging modality for breast cancer screening and plays an important role in cancer diagnostics. A standard mammographic image acquisition always includes the compression of the breast prior x-ray exposure. The breast is compressed between two plates (the image receptor and the compression paddle) until a nearly uniform breast thickness is obtained. The breast f…
▽ More
Mammography is currently the primary imaging modality for breast cancer screening and plays an important role in cancer diagnostics. A standard mammographic image acquisition always includes the compression of the breast prior x-ray exposure. The breast is compressed between two plates (the image receptor and the compression paddle) until a nearly uniform breast thickness is obtained. The breast flattening improves diagnostic image quality 1 and reduces the absorbed dose 2. However, this technique can also be a source of discomfort and might deter some women from attending breast screening by mammography 3,4. Therefore, the characterization of the pain perceived during breast compression is of potential interest to compare different compression approaches. The aim of this work is to develop simulation tools enabling the characterization of existing breast compression techniques in terms of patient comfort, dose delivered to the patient and resulting image quality. A 3D biomechanical model of the breast was developed providing physics-based predictions of tissue motion and internal stress and strain intensity. The internal stress and strain intensity are assumed to be directly correlated with the patient discomfort. The resulting compressed breast model is integrated in an image simulation framework to assess both image quality and average glandular dose. We present the results of compression simulations on two breast geometries, under different compression paddles (flex or rigid).
△ Less
Submitted 22 November, 2018;
originally announced November 2018.
-
Regularized Zero-Variance Control Variates
Authors:
Leah F. South,
Chris J. Oates,
Antonietta Mira,
Christopher Drovandi
Abstract:
Zero-variance control variates (ZV-CV) are a post-processing method to reduce the variance of Monte Carlo estimators of expectations using the derivatives of the log target. Once the derivatives are available, the only additional computational effort lies in solving a linear regression problem. Significant variance reductions have been achieved with this method in low dimensional examples, but the…
▽ More
Zero-variance control variates (ZV-CV) are a post-processing method to reduce the variance of Monte Carlo estimators of expectations using the derivatives of the log target. Once the derivatives are available, the only additional computational effort lies in solving a linear regression problem. Significant variance reductions have been achieved with this method in low dimensional examples, but the number of covariates in the regression rapidly increases with the dimension of the target. In this paper, we present compelling empirical evidence that the use of penalized regression techniques in the selection of high-dimensional control variates provides performance gains over the classical least squares method. Another type of regularization based on using subsets of derivatives, or a priori regularization as we refer to it in this paper, is also proposed to reduce computational and storage requirements. Several examples showing the utility and limitations of regularized ZV-CV for Bayesian inference are given. The methods proposed in this paper are accessible through the R package ZVCV.
△ Less
Submitted 15 August, 2022; v1 submitted 12 November, 2018;
originally announced November 2018.
-
Marginal models with individual-specific effects for the analysis of longitudinal bipartite networks
Authors:
Francesco Bartolucci,
Antonietta Mira,
Stefano Peluso
Abstract:
A new modeling framework for bipartite social networks arising from a sequence of partially time-ordered relational events is proposed. We directly model the joint distribution of the binary variables indicating if each single actor is involved or not in an event. The adopted parametrization is based on first- and second-order effects, formulated as in marginal models for categorical data and free…
▽ More
A new modeling framework for bipartite social networks arising from a sequence of partially time-ordered relational events is proposed. We directly model the joint distribution of the binary variables indicating if each single actor is involved or not in an event. The adopted parametrization is based on first- and second-order effects, formulated as in marginal models for categorical data and free higher order effects. In particular, second-order effects are log-odds ratios with meaningful interpretation from the social perspective in terms of tendency to cooperate, in contrast to first-order effects interpreted in terms of tendency of each single actor to participate in an event. These effects are parametrized on the basis of the event times, so that suitable latent trajectories of individual behaviors may be represented. Inference is based on a composite likelihood function, maximized by an algorithm with numerical complexity proportional to the square of the number of units in the network. A classification composite likelihood is used to cluster the actors, simplifying the interpretation of the data structure. The proposed approach is illustrated on a dataset of scientific articles published in four top statistical journals from 2003 to 2012.
△ Less
Submitted 20 October, 2018;
originally announced October 2018.
-
Bayesian Calibration of Force-fields from Experimental Data: TIP4P Water
Authors:
Ritabrata Dutta,
Zacharias Faidon Brotzakis,
Antonietta Mira
Abstract:
Molecular dynamics (MD) simulations give access to equilibrium structures and dynamic properties given an ergodic sampling and an accurate force-field. The force-field parameters are calibrated to reproduce properties measured by experiments or simulations. The main contribution of this paper is an approximate Bayesian framework for the calibration and uncertainty quantification of the force-field…
▽ More
Molecular dynamics (MD) simulations give access to equilibrium structures and dynamic properties given an ergodic sampling and an accurate force-field. The force-field parameters are calibrated to reproduce properties measured by experiments or simulations. The main contribution of this paper is an approximate Bayesian framework for the calibration and uncertainty quantification of the force-field parameters, without assuming parameter uncertainty to be Gaussian. To this aim, since the likelihood function of the MD simulation models are intractable in absence of Gaussianity assumption, we use a likelihood-free inference scheme known as approximate Bayesian computation (ABC) and propose an adaptive population Monte Carlo ABC algorithm, which is illustrated to converge faster and scales better than previously used ABCsubsim algorithm for calibration of force-field of a helium system. The second contribution is the adaptation of ABC algorithms for High Performance Computing to MD simulation within the Python ecosystem ABCpy. We illustrate the performance of the developed methodology to learn posterior distribution and Bayesian estimates of Lennard-Jones force-field parameters of helium and TIP4P system of water implemented both for simulated and experimental datasets collected using Neutron and X-ray diffraction. For simulated data, the Bayesian estimate is in close agreement with the true parameter value used to generate the dataset. For experimental as well as for simulated data, the Bayesian posterior distribution shows a strong correlation pattern between the force-field parameters. Providing an estimate of the entire posterior distribution, our methodology also allows us to perform uncertainty quantification of model prediction. This research opens up the possibility to rigorously calibrate force-fields from available experimental datasets of any structural and dynamic property.
△ Less
Submitted 27 September, 2018; v1 submitted 8 April, 2018;
originally announced April 2018.
-
Likelihood-free parameter estimation for dynamic queueing networks: case study of passenger flow in an international airport terminal
Authors:
Anthony Ebert,
Ritabrata Dutta,
Kerrie Mengersen,
Antonietta Mira,
Fabrizio Ruggeri,
Paul Wu
Abstract:
Dynamic queueing networks (DQN) model queueing systems where demand varies strongly with time, such as airport terminals. With rapidly rising global air passenger traffic placing increasing pressure on airport terminals, efficient allocation of resources is more important than ever. Parameter inference and quantification of uncertainty are key challenges for develo** decision support tools. The…
▽ More
Dynamic queueing networks (DQN) model queueing systems where demand varies strongly with time, such as airport terminals. With rapidly rising global air passenger traffic placing increasing pressure on airport terminals, efficient allocation of resources is more important than ever. Parameter inference and quantification of uncertainty are key challenges for develo** decision support tools. The DQN likelihood function is, in general, intractable and current approaches to simulation make likelihood-free parameter inference methods, such as approximate Bayesian computation (ABC), infeasible since simulating from these models is computationally expensive. By leveraging a recent advance in computationally efficient queueing simulation, we develop the first parameter inference approach for DQNs. We demonstrate our approach with data of passenger flows in a real airport terminal, and we show that our model accurately recreates the behaviour of the system and is useful for decision support. Special care must be taken in develo** the distance for ABC since any useful output must vary with time. We use maximum mean discrepancy, a metric on probability measures, as the distance function for ABC. Prediction intervals of performance measures for decision support tools are easily constructed using draws from posterior samples, which we demonstrate with a scenario of a delayed flight.
△ Less
Submitted 22 March, 2019; v1 submitted 7 April, 2018;
originally announced April 2018.
-
Flexible model selection for mechanistic network models
Authors:
Sixing Chen,
Antonietta Mira,
Jukka-Pekka Onnela
Abstract:
Network models are applied across many domains where data can be represented as a network. Two prominent paradigms for modeling networks are statistical models (probabilistic models for the observed network) and mechanistic models (models for network growth and/or evolution). Mechanistic models are better suited for incorporating domain knowledge, to study effects of interventions (such as changes…
▽ More
Network models are applied across many domains where data can be represented as a network. Two prominent paradigms for modeling networks are statistical models (probabilistic models for the observed network) and mechanistic models (models for network growth and/or evolution). Mechanistic models are better suited for incorporating domain knowledge, to study effects of interventions (such as changes to specific mechanisms) and to forward simulate, but they typically have intractable likelihoods. As such, and in a stark contrast to statistical models, there is a relative dearth of research on model selection for such models despite the otherwise large body of extant work. In this paper, we propose a simulator-based procedure for mechanistic network model selection that borrows aspects from Approximate Bayesian Computation (ABC) along with a means to quantify the uncertainty in the selected model. To select the most suitable network model, we consider and assess the performance of several learning algorithms, most notably the so-called Super Learner, which makes our framework less sensitive to the choice of a particular learning algorithm. Our approach takes advantage of the ease to forward simulate from mechanistic network models to circumvent their intractable likelihoods. The overall process is flexible and widely applicable. Our simulation results demonstrate the approach's ability to accurately discriminate between competing mechanistic models. Finally, we showcase our approach with a protein-protein interaction network model from the literature for yeast (Saccharomyces cerevisiae).
△ Less
Submitted 19 June, 2019; v1 submitted 31 March, 2018;
originally announced April 2018.
-
Fast Maximum Likelihood estimation via Equilibrium Expectation for Large Network Data
Authors:
Maksym Byshkin,
Alex Stivala,
Antonietta Mira,
Garry Robins,
Alessandro Lomi
Abstract:
A major line of contemporary research on complex networks is based on the development of statistical models that specify the local motifs associated with macro-structural properties observed in actual networks. This statistical approach becomes increasingly problematic as network size increases. In the context of current research on efficient estimation of models for large network data sets, we pr…
▽ More
A major line of contemporary research on complex networks is based on the development of statistical models that specify the local motifs associated with macro-structural properties observed in actual networks. This statistical approach becomes increasingly problematic as network size increases. In the context of current research on efficient estimation of models for large network data sets, we propose a fast algorithm for maximum likelihood estimation (MLE) that afords a signifcant increase in the size of networks amenable to direct empirical analysis. The algorithm we propose in this paper relies on properties of Markov chains at equilibrium, and for this reason it is called equilibrium expectation (EE). We demonstrate the performance of the EE algorithm in the context of exponential random graphmodels (ERGMs) a family of statistical models commonly used in empirical research based on network data observed at a single period in time. Thus far, the lack of efcient computational strategies has limited the empirical scope of ERGMs to relatively small networks with a few thousand nodes. The approach we propose allows a dramatic increase in the size of networks that may be analyzed using ERGMs. This is illustrated in an analysis of several biological networks and one social network with 104,103 nodes
△ Less
Submitted 1 August, 2018; v1 submitted 28 February, 2018;
originally announced February 2018.
-
ABCpy: A High-Performance Computing Perspective to Approximate Bayesian Computation
Authors:
Ritabrata Dutta,
Marcel Schoengens,
Lorenzo Pacchiardi,
Avinash Ummadisingu,
Nicole Widmer,
Pierre Künzli,
Jukka-Pekka Onnela,
Antonietta Mira
Abstract:
ABCpy is a highly modular scientific library for Approximate Bayesian Computation (ABC) written in Python. The main contribution of this paper is to document a software engineering effort that enables domain scientists to easily apply ABC to their research without being ABC experts; using ABCpy they can easily run large parallel simulations without much knowledge about parallelization. Further, AB…
▽ More
ABCpy is a highly modular scientific library for Approximate Bayesian Computation (ABC) written in Python. The main contribution of this paper is to document a software engineering effort that enables domain scientists to easily apply ABC to their research without being ABC experts; using ABCpy they can easily run large parallel simulations without much knowledge about parallelization. Further, ABCpy enables ABC experts to easily develop new inference schemes and evaluate them in a standardized environment and to extend the library with new algorithms. These benefits come mainly from the modularity of ABCpy. We give an overview of the design of ABCpy and provide a performance evaluation concentrating on parallelization. This points us towards the inherent imbalance in some of the ABC algorithms. We develop a dynamic scheduling MPI implementation to mitigate this issue and evaluate the various ABC algorithms according to their adaptability towards high-performance computing.
△ Less
Submitted 17 December, 2021; v1 submitted 13 November, 2017;
originally announced November 2017.
-
Parameter estimation of platelets deposition: Approximate Bayesian computation with high performance computing
Authors:
Ritabrata Dutta,
Bastien Chopard,
Jonas Lätt,
Frank Dubois,
Karim Zouaoui Boudjeltia,
Antonietta Mira
Abstract:
Recent studies show the existing clinical tests to detect Cardio/cerebrovascular diseases (CVD) are ineffectual as they do not consider different stages of platelet activation or the molecular dynamics involved in platelet interactions. Further they are also incapable to consider inter-individual variability. A physical description of platelets deposition was introduced recently in Chopard et. al.…
▽ More
Recent studies show the existing clinical tests to detect Cardio/cerebrovascular diseases (CVD) are ineffectual as they do not consider different stages of platelet activation or the molecular dynamics involved in platelet interactions. Further they are also incapable to consider inter-individual variability. A physical description of platelets deposition was introduced recently in Chopard et. al. [2017], by integrating fundamental understandings of how platelets interact in a numerical model, parameterized by five parameters. These parameters specify the deposition process and are relevant for a biomedical understanding of the phenomena. One of the main intuition is that these parameters are precisely the information needed for a pathological test identifying CVD captured and that they capture the inter-individual variability. Following this intuition, here we devise a Bayesian inferential scheme for estimation of these parameters. As the likelihood function of the numerical model is intractable due to the complex stochastic nature of the model, we use a likelihood-free inference scheme approximate Bayesian computation (ABC) to calibrate the parameters in a data-driven manner. As ABC requires the generation of many pseudo-data by expensive simulation runs, we use a high performance computing (HPC) framework for ABC to make the inference possible for this model. We illustrate that our mean posterior prediction of platelet deposition pattern matches the experimental dataset closely with a tight posterior prediction error margin for a collective dataset of 7 volunteers. The present approach can be used to build a new generation of personalized platelet functionality tests for CVD detection, using numerical modeling of platelet deposition, Bayesian uncertainty quantification and High performance computing.
△ Less
Submitted 21 May, 2018; v1 submitted 3 October, 2017;
originally announced October 2017.
-
Bayesian Inference of Spreading Processes on Networks
Authors:
Ritabrata Dutta,
Antonietta Mira,
Jukka-Pekka Onnela
Abstract:
Infectious diseases are studied to understand their spreading mechanisms, to evaluate control strategies and to predict the risk and course of future outbreaks. Because people only interact with a small number of individuals, and because the structure of these interactions matters for spreading processes, the pairwise relationships between individuals in a population can be usefully represented by…
▽ More
Infectious diseases are studied to understand their spreading mechanisms, to evaluate control strategies and to predict the risk and course of future outbreaks. Because people only interact with a small number of individuals, and because the structure of these interactions matters for spreading processes, the pairwise relationships between individuals in a population can be usefully represented by a network. Although the underlying processes of transmission are different, the network approach can be used to study the spread of pathogens in a contact network or the spread of rumors in an online social network. We study simulated simple and complex epidemics on synthetic networks and on two empirical networks, a social / contact network in an Indian village and an online social network in the U.S. Our goal is to learn simultaneously about the spreading process parameters and the source node (first infected node) of the epidemic, given a fixed and known network structure, and observations about state of nodes at several points in time. Our inference scheme is based on approximate Bayesian computation (ABC), an inference technique for complex models with likelihood functions that are either expensive to evaluate or analytically intractable. ABC enables us to adopt a Bayesian approach to the problem despite the posterior distribution being very complex. Our method is agnostic about the topology of the network and the nature of the spreading process. It generally performs well and, somewhat counter-intuitively, the inference problem appears to be easier on more heterogeneous network topologies, which enhances its future applicability to real-world settings where few networks have homogeneous topologies.
△ Less
Submitted 21 May, 2018; v1 submitted 26 September, 2017;
originally announced September 2017.
-
On computational and combinatorial properties of the total co-independent domination number of graphs
Authors:
Abel Cabrera Martinez,
Frank A. Hernandez Mira,
Jose M. Sigarreta Almira,
Ismael G. Yero
Abstract:
A subset $D$ of vertices of a graph $G$ is a total dominating set if every vertex of $G$ is adjacent to at least one vertex of $D$. The total dominating set $D$ is called a total co-independent dominating set if the subgraph induced by $V-D$ is edgeless and has at least one vertex. The minimum cardinality of any total co-independent dominating set is the total co-independent domination number of…
▽ More
A subset $D$ of vertices of a graph $G$ is a total dominating set if every vertex of $G$ is adjacent to at least one vertex of $D$. The total dominating set $D$ is called a total co-independent dominating set if the subgraph induced by $V-D$ is edgeless and has at least one vertex. The minimum cardinality of any total co-independent dominating set is the total co-independent domination number of $G$ and is denoted by $γ_{t,coi}(G)$. In this work we study some complexity and combinatorial properties of $γ_{t,coi}(G)$. Specifically, we prove that deciding whether $γ_{t,coi}(G)\le k$ for a given integer $k$ is an NP-complete problem and give several bounds on $γ_{t,coi}(G)$. Also, since any total co-independent dominating set is also a total dominating set, we characterize all the trees having equal total co-independent domination number and total domination number.
△ Less
Submitted 29 August, 2017; v1 submitted 2 May, 2017;
originally announced May 2017.
-
Adaptive Incremental Mixture Markov chain Monte Carlo
Authors:
Florian Maire,
Nial Friel,
Antonietta Mira,
Adrian Raftery
Abstract:
We propose Adaptive Incremental Mixture Markov chain Monte Carlo (AIMM), a novel approach to sample from challenging probability distributions defined on a general state-space. While adaptive MCMC methods usually update a parametric proposal kernel with a global rule, AIMM locally adapts a semiparametric kernel. AIMM is based on an independent Metropolis-Hastings proposal distribution which takes…
▽ More
We propose Adaptive Incremental Mixture Markov chain Monte Carlo (AIMM), a novel approach to sample from challenging probability distributions defined on a general state-space. While adaptive MCMC methods usually update a parametric proposal kernel with a global rule, AIMM locally adapts a semiparametric kernel. AIMM is based on an independent Metropolis-Hastings proposal distribution which takes the form of a finite mixture of Gaussian distributions. Central to this approach is the idea that the proposal distribution adapts to the target by locally adding a mixture component when the discrepancy between the proposal mixture and the target is deemed to be too large. As a result, the number of components in the mixture proposal is not fixed in advance. Theoretically, we prove that there exists a process that can be made arbitrarily close to AIMM and that converges to the correct target distribution. We also illustrate that it performs well in practice in a variety of challenging situations, including high-dimensional and multimodal target distributions.
△ Less
Submitted 31 May, 2018; v1 submitted 27 April, 2016;
originally announced April 2016.
-
International Trade: a Reinforced Urn Network Model
Authors:
Stefano Peluso,
Antonietta Mira,
Pietro Muliere,
Alessandro Lomi
Abstract:
We propose a unified modelling framework that theoretically justifies the main empirical regularities characterizing the international trade network. Each country is associated to a Polya urn whose composition controls the propensity of the country to trade with other countries. The urn composition is updated through the walk of the Reinforced Urn Process of Muliere et al. (2000). The model implie…
▽ More
We propose a unified modelling framework that theoretically justifies the main empirical regularities characterizing the international trade network. Each country is associated to a Polya urn whose composition controls the propensity of the country to trade with other countries. The urn composition is updated through the walk of the Reinforced Urn Process of Muliere et al. (2000). The model implies a local preferential attachment scheme and a power law right tail behaviour of bilateral trade flows. Different assumptions on the urns' reinforcement parameters account for local clustering, path-shortening and sparsity. Likelihood-based estimation approaches are facilitated by feasible likelihood analytical derivation in various network settings. A simulated example and the empirical results on the international trade network are discussed.
△ Less
Submitted 12 January, 2016;
originally announced January 2016.
-
Exploiting Multi-Core Architectures for Reduced-Variance Estimation with Intractable Likelihoods
Authors:
Nial Friel,
Antonietta Mira,
Chris. J. Oates
Abstract:
Many popular statistical models for complex phenomena are intractable, in the sense that the likelihood function cannot easily be evaluated. Bayesian estimation in this setting remains challenging, with a lack of computational methodology to fully exploit modern processing capabilities. In this paper we introduce novel control variates for intractable likelihoods that can dramatically reduce the M…
▽ More
Many popular statistical models for complex phenomena are intractable, in the sense that the likelihood function cannot easily be evaluated. Bayesian estimation in this setting remains challenging, with a lack of computational methodology to fully exploit modern processing capabilities. In this paper we introduce novel control variates for intractable likelihoods that can dramatically reduce the Monte Carlo variance of Bayesian estimators. We prove that our control variates are well-defined and provide a positive variance reduction. Furthermore we show how to optimise these control variates for variance reduction. The methodology is highly parallel and offers a route to exploit multi-core processing architectures that complements recent research in this direction. Indeed, our work shows that it may not be necessary to parallelise the sampling process itself in order to harness the potential of massively multi-core architectures. Simulation results presented on the Ising model, exponential random graph models and non-linear stochastic differential equation models support our theoretical findings.
△ Less
Submitted 30 March, 2015; v1 submitted 20 August, 2014;
originally announced August 2014.
-
Efficient computational strategies for doubly intractable problems with applications to Bayesian social networks
Authors:
Alberto Caimo,
Antonietta Mira
Abstract:
Powerful ideas recently appeared in the literature are adjusted and combined to design improved samplers for Bayesian exponential random graph models. Different forms of adaptive Metropolis-Hastings proposals (vertical, horizontal and rectangular) are tested and combined with the Delayed rejection (DR) strategy with the aim of reducing the variance of the resulting Markov chain Monte Carlo estimat…
▽ More
Powerful ideas recently appeared in the literature are adjusted and combined to design improved samplers for Bayesian exponential random graph models. Different forms of adaptive Metropolis-Hastings proposals (vertical, horizontal and rectangular) are tested and combined with the Delayed rejection (DR) strategy with the aim of reducing the variance of the resulting Markov chain Monte Carlo estimators for a given computational time. In the examples treated in this paper the best combination, namely horizontal adaptation with delayed rejection, leads to a variance reduction that varies between 92% and 144% relative to the adaptive direction sampling approximate exchange algorithm of Caimo and Friel (2011). These results correspond to an increased performance which varies from 10% to 94% if we take simulation time into account. The highest improvements are obtained when highly correlated posterior distributions are considered.
△ Less
Submitted 17 September, 2014; v1 submitted 18 March, 2014;
originally announced March 2014.
-
Zero Variance Markov Chain Monte Carlo for Bayesian Estimators
Authors:
Antonietta Mira,
Reza Solgi,
Daniele Imparato
Abstract:
Interest is in evaluating, by Markov chain Monte Carlo (MCMC) simulation, the expected value of a function with respect to a, possibly unnormalized, probability distribution. A general purpose variance reduction technique for the MCMC estimator, based on the zero-variance principle introduced in the physics literature, is proposed. Conditions for asymptotic unbiasedness of the zero-variance estima…
▽ More
Interest is in evaluating, by Markov chain Monte Carlo (MCMC) simulation, the expected value of a function with respect to a, possibly unnormalized, probability distribution. A general purpose variance reduction technique for the MCMC estimator, based on the zero-variance principle introduced in the physics literature, is proposed. Conditions for asymptotic unbiasedness of the zero-variance estimator are derived. A central limit theorem is also proved under regularity conditions. The potential of the idea is illustrated with real applications to probit, logit and GARCH Bayesian models. For all these models, a central limit theorem and unbiasedness for the zero-variance estimator are proved (see the supplementary material available on-line).
△ Less
Submitted 26 June, 2012; v1 submitted 14 December, 2010;
originally announced December 2010.
-
Adaptive Multiple Importance Sampling
Authors:
Jean-Marie Cornuet,
Jean-Michel Marin,
Antonietta Mira,
Christian P. Robert
Abstract:
The Adaptive Multiple Importance Sampling (AMIS) algorithm is aimed at an optimal recycling of past simulations in an iterated importance sampling scheme. The difference with earlier adaptive importance sampling implementations like Population Monte Carlo is that the importance weights of all simulated values, past as well as present, are recomputed at each iteration, following the technique of th…
▽ More
The Adaptive Multiple Importance Sampling (AMIS) algorithm is aimed at an optimal recycling of past simulations in an iterated importance sampling scheme. The difference with earlier adaptive importance sampling implementations like Population Monte Carlo is that the importance weights of all simulated values, past as well as present, are recomputed at each iteration, following the technique of the deterministic multiple mixture estimator of Owen and Zhou (2000). Although the convergence properties of the algorithm cannot be fully investigated, we demonstrate through a challenging banana shape target distribution and a population genetics example that the improvement brought by this technique is substantial.
△ Less
Submitted 3 October, 2011; v1 submitted 7 July, 2009;
originally announced July 2009.
-
Delayed Rejection Variational Monte Carlo
Authors:
Dario Bressanini,
Gabriele Morosi,
Silvia Tarasco,
Antonietta Mira
Abstract:
A new acceleration algorithm to address the problem of multiple time scales in variational Monte Carlo simulations is presented. After a first attempted move has been rejected, the delayed rejection algorithm attempts a second move with a smaller time step, so that even moves of the core electrons can be accepted. Results on Be and Ne atoms as test cases are presented. Correlation time and both…
▽ More
A new acceleration algorithm to address the problem of multiple time scales in variational Monte Carlo simulations is presented. After a first attempted move has been rejected, the delayed rejection algorithm attempts a second move with a smaller time step, so that even moves of the core electrons can be accepted. Results on Be and Ne atoms as test cases are presented. Correlation time and both average accepted displacement and acceptance ratio as a function of the distance from the nucleus evidence the efficiency of the proposed algorithm in dealing with the multiple time scales problem.
△ Less
Submitted 20 July, 2004;
originally announced July 2004.