Doubly Adaptive Importance Sampling

Authors: Willem van den Boom, Andrea Cremaschi, Alexandre H. Thiery

Abstract: We propose an adaptive importance sampling scheme for Gaussian approximations of intractable posteriors. Optimization-based approximations like variational inference can be too inaccurate while existing Monte Carlo methods can be too slow. Therefore, we propose a hybrid where, at each iteration, the Monte Carlo effective sample size can be guaranteed at a fixed computational cost by interpolating between natural-gradient variational inference and importance sampling. The amount of damping in the updates adapts to the posterior and guarantees the effective sample size. Gaussianity enables the use of Stein's lemma to obtain gradient-based optimization in the highly damped variational inference regime and a reduction of Monte Carlo error for undamped adaptive importance sampling. The result is a generic, embarrassingly parallel and adaptive posterior approximation method. Numerical studies on simulated and real data show its competitiveness with other, less general methods.

Submitted 29 April, 2024; originally announced April 2024.

Comments: 30 pages, 8 figures

arXiv:2312.13790

From Past to Future: Digital Methods Towards Artefact Analysis

Authors: Andrew Harris, Andrea Cremaschi, Tse Siang Lim, Maria De Iorio, Kwa Chong Guan

Abstract: Over the past two decades, Digital Humanities has transformed the landscape of humanities and social sciences, enabling advanced computational analysis and interpretation of extensive datasets. Notably, recent initiatives in Southeast Asia, particularly in Singapore, focus on categorising and archiving historical data such as artwork, literature and, most notably, archaeological artefacts. This study illustrates the profound potential of Digital Humanities through the application of statistical methods on two distinct artefact datasets. Specifically, we present the results of an automated die study on mid-1st millennium AD "Rising Sun" coinage from mainland Southeast Asia, while subsequently utilising unsupervised statistical methods on 2D images of 13th-14th century earthenware ceramics excavated from the precolonial St. Andrew's Cathedral site in central Singapore. This research offers a comparative assessment showcasing the transformative impact of statistics-based approaches on the interpretation and analysis of diverse archaeological materials and within Digital Humanities overall.

Submitted 21 December, 2023; originally announced December 2023.

arXiv:2312.12396

A change-point random partition model for large spatio-temporal datasets

Authors: Andrea Cremaschi, Annalisa Cadonna, Alessandra Guglielmi, Fernando Quintana

Abstract: Spatio-temporal areal data can be seen as a collection of time series which are spatially correlated, according to a specific neighboring structure. Motivated by a dataset on mobile phone usage in the Metropolitan area of Milan, Italy, we propose a semi-parametric hierarchical Bayesian model allowing for time-varying as well as spatial model-based clustering. To accommodate changing patterns over work hours and weekdays/weekends, we incorporate a temporal change-point component that allows the specification of different hierarchical structures across time points. The model features a random partition prior that incorporates the desired spatial features and encourages co-clustering based on areal proximity. We explore properties of the model by way of extensive simulation studies from which we collect valuable information. Finally, we discuss the application to the motivating data, where the main goal is to spatially cluster population patterns of mobile phone usage.

Submitted 8 December, 2023; originally announced December 2023.

arXiv:2311.04408

Bayesian modelling of response to therapy and drug-sensitivity in acute lymphoblastic leukemia

Authors: Andrea Cremaschi, Wenjian Yang, Maria De Iorio, William E. Evans, Jun J. Yang, Gary L. Rosner

Abstract: Acute lymphoblastic leukemia (ALL) is a heterogeneous hematologic malignancy involving the abnormal proliferation of immature lymphocytes, accounting for most pediatric cancer cases. ALL management in children has seen great improvement in the last decades thanks to better understanding of the disease leading to improved treatment strategies evidenced through clinical trials. Commonly, a first course of chemotherapy (induction phase) is administered, followed by treatment with a combination of anti-leukemia drugs. A measure of the efficacy early in the course of therapy is minimal residual disease (MRD). MRD quantifies residual tumor cells and indicates the effectiveness of the treatment over the course of therapy. MRD positivity is defined for values of MRD greater than 0.01%, yielding left-censored observations. We propose a Bayesian model to study the relationship between patient features and MRD observed at two time points during the induction phase. Specifically, we model the observed MRD values via an auto-regressive model, accounting for left-censoring of the data and for the fact that some patients are already in remission after the induction phase. Patient characteristics are included in the model via linear regression terms. In particular, patient-specific drug sensitivity based on ex-vivo assays of patient samples is exploited to identify groups of subjects with similar profiles. We include this information as a covariate in the model for MRD. We adopt horseshoe priors for the regression coefficients to perform variable selection to identify important covariates. We fit the proposed approach to data from three prospective pediatric ALL clinical trials carried out at the St. Jude Children's Research Hospital. Our results highlight that drug sensitivity profiles and leukemic subtypes play an important role in the response to induction therapy as measured by serial MRD measures.

Submitted 7 November, 2023; originally announced November 2023.

arXiv:2306.10669

Repulsion, Chaos and Equilibrium in Mixture Models

Authors: Andrea Cremaschi, Timothy M. Wertz, Maria De Iorio

Abstract: Mixture models are commonly used in applications with heterogeneity and overdispersion in the population, as they allow the identification of subpopulations. In the Bayesian framework, this entails the specification of suitable prior distributions for the weights and location parameters of the mixture. Widely used are Bayesian semi-parametric models based on mixtures with infinite or random number of components, such as Dirichlet process mixtures or mixtures with random number of components. Key in this context is the choice of the kernel for cluster identification. Despite their popularity, the flexibility of these models and prior distributions often does not translate into interpretability of the identified clusters. To overcome this issue, clustering methods based on repulsive mixtures have been recently proposed. The basic idea is to include a repulsive term in the prior distribution of the atoms of the mixture, which favours mixture locations far apart. This approach is increasingly popular and allows one to produce well-separated clusters, thus facilitating the interpretation of the results. However, the resulting models are usually not easy to handle due to the introduction of unknown normalising constants. Exploiting results from statistical mechanics, we propose in this work a novel class of repulsive prior distributions based on Gibbs measures. Specifically, we use Gibbs measures associated to joint distributions of eigenvalues of random matrices, which naturally possess a repulsive property. The proposed framework greatly simplifies the computations needed for the use of repulsive mixtures due to the availability of the normalising constant in closed form. We investigate theoretical properties of this class of prior distributions, and illustrate the novel class of priors and their properties, as well as their clustering performance, on benchmark datasets.

Submitted 18 June, 2023; originally announced June 2023.

arXiv:2207.13418

Bayesian Dynamic Network Modelling: an application to metabolic associations in cardiovascular diseases

Authors: Marco Molinari, Andrea Cremaschi, Maria De Iorio, Nishi Chaturvedi, Alun Hughes, Therese Tillin

Abstract: We propose a novel approach to the estimation of multiple Graphical Models to analyse temporal patterns of association among a set of metabolites over different groups of patients. Our motivating application is the Southall And Brent REvisited (SABRE) study, a tri-ethnic cohort study conducted in the UK. We are interested in identifying potential ethnic differences in metabolite levels and associations as well as their evolution over time, with the aim of gaining a better understanding of different risk of cardio-metabolic disorders across ethnicities. Within a Bayesian framework, we employ a nodewise regression approach to infer the structure of the graphs, borrowing information across time as well as across ethnicities. The response variables of interest are metabolite levels measured at two time points and for two ethnic groups, Europeans and South-Asians. We use nodewise regression to estimate the high-dimensional precision matrices of the metabolites, imposing sparsity on the regression coefficients through the dynamic horseshoe prior, thus favouring sparser graphs. We provide the code to fit the proposed model using the software Stan, which performs posterior inference using Hamiltonian Monte Carlo sampling, as well as a detailed description of a block Gibbs sampling scheme.

Submitted 27 July, 2022; originally announced July 2022.

arXiv:2206.10509

doi 10.1016/j.spasta.2022.100715

Bayesian modeling and clustering for spatio-temporal areal data: An application to Italian unemployment

Authors: Alexander Mozdzen, Andrea Cremaschi, Annalisa Cadonna, Alessandra Guglielmi, Gregor Kastner

Abstract: Spatio-temporal areal data can be seen as a collection of time series which are spatially correlated according to a specific neighboring structure. Incorporating the temporal and spatial dimension into a statistical model poses challenges regarding the underlying theoretical framework as well as the implementation of efficient computational methods. We propose to include spatio-temporal random effects using a conditional autoregressive prior, where the temporal correlation is modeled through an autoregressive mean decomposition and the spatial correlation by the precision matrix inheriting the neighboring structure. Their joint distribution constitutes a Gaussian Markov random field, whose sparse precision matrix enables the usage of efficient sampling algorithms. We cluster the areal units using a nonparametric prior, thereby learning latent partitions of the areal units. The performance of the model is assessed via an application to study regional unemployment patterns in Italy. When compared to other spatial and spatio-temporal competitors, the proposed model shows more precise estimates and the additional information obtained from the clustering allows for an extended economic interpretation of the unemployment rates of the Italian provinces.

Submitted 19 November, 2022; v1 submitted 21 June, 2022; originally announced June 2022.

Journal ref: Spatial Statistics (2022)

arXiv:2205.05054

Bayesian clustering of multiple zero-inflated outcomes

Authors: Beatrice Franzolini, Andrea Cremaschi, Willem van den Boom, Maria De Iorio

Abstract: Several applications involving counts present a large proportion of zeros (excess-of-zeros data). A popular model for such data is the Hurdle model, which explicitly models the probability of a zero count, while assuming a sampling distribution on the positive integers. We consider data from multiple count processes. In this context, it is of interest to study the patterns of counts and cluster the subjects accordingly. We introduce a novel Bayesian nonparametric approach to cluster multiple, possibly related, zero-inflated processes. We propose a joint model for zero-inflated counts, specifying a Hurdle model for each process with a shifted Negative Binomial sampling distribution. Conditionally on the model parameters, the different processes are assumed independent, leading to a substantial reduction in the number of parameters as compared to traditional multivariate approaches. The subject-specific probabilities of zero-inflation and the parameters of the sampling distribution are flexibly modelled via an enriched finite mixture with random number of components. This induces a two-level clustering of the subjects based on the zero/non-zero patterns (outer clustering) and on the sampling distribution (inner clustering). Posterior inference is performed through tailored MCMC schemes. We demonstrate the proposed approach on an application involving the use of the messaging service WhatsApp.

Submitted 29 August, 2022; v1 submitted 10 May, 2022; originally announced May 2022.

arXiv:2111.06212

Joint modelling of association networks and longitudinal biomarkers: an application to child obesity

Authors: Andrea Cremaschi, Maria De Iorio, Narasimhan Kothandaraman, Fabian Yap, Mya Tway Tint, Johan Eriksson

Abstract: The prevalence of chronic non-communicable diseases such as obesity has noticeably increased in the last decade. The study of these diseases in early life is of paramount importance in determining their course in adult life and in supporting clinical interventions. Recently, attention has been drawn to approaches that study the alteration of metabolic pathways in obese children. In this work, we propose a novel joint modelling approach for the analysis of growth biomarkers and metabolite concentrations, to unveil metabolic pathways related to child obesity. Within a Bayesian framework, we flexibly model the temporal evolution of growth trajectories and metabolic associations through the specification of a joint non-parametric random effect distribution which also allows for clustering of the subjects, thus identifying risk sub-groups. Growth profiles as well as patterns of metabolic associations determine the clustering structure. Inclusion of risk factors is straightforward through the specification of a regression term. We demonstrate the proposed approach on data from the Growing Up in Singapore Towards healthy Outcomes (GUSTO) cohort study, based in Singapore. Posterior inference is obtained via a tailored MCMC algorithm, accommodating a nonparametric prior with mixed support. Our analysis has identified potential key pathways in obese children that allow for exploration of possible molecular mechanisms associated with child obesity.

Submitted 11 February, 2022; v1 submitted 11 November, 2021; originally announced November 2021.

arXiv:2106.03072

Seemingly Unrelated Multi-State processes: a Bayesian semiparametric approach

Authors: Andrea Cremaschi, Raffele Argiento, Maria De Iorio, Cai Shirong, Yap Seng Chong, Michael J. Meaney, Michelle Z. L. Kee

Abstract: Many applications in medical statistics as well as in other fields can be described by transitions between multiple states (e.g. from health to disease) experienced by individuals over time. In this context, multi-state models are a popular statistical technique, in particular when the exact transition times are not observed. The key quantities of interest are the transition rates, capturing the instantaneous risk of moving from one state to another. The main contribution of this work is to propose a joint semiparametric model for several possibly related multi-state processes (Seemingly Unrelated Multi-State, SUMS, processes), assuming a Markov structure for the transitions over time. The dependence between different processes is captured by specifying a joint random effect distribution on the transition rates of each process. We assume a flexible random effect distribution, which allows for clustering of the individuals, overdispersion and outliers. Moreover, we employ a graph structure to describe the dependence among processes, exploiting tools from the Gaussian Graphical model literature. It is also possible to include covariate effects. We use our approach to model disease progression in mental health. Posterior inference is performed through a specially devised MCMC algorithm.

Submitted 6 June, 2021; originally announced June 2021.

arXiv:1905.07172

Colombian Women's Life Patterns: A Multivariate Density Regression Approach

Authors: Sara Wade, Raffaella Piccarreta, Andrea Cremaschi, Isadora Antoniano-Villalobos

Abstract: Women in Colombia face difficulties related to the patriarchal traits of their societies and the well-known conflict afflicting the country since 1948. In this critical context, our aim is to study the relationship between baseline socio-demographic factors and variables associated to fertility, partnership patterns, and work activity. To best exploit the explanatory structure, we propose a Bayesian multivariate density regression model, which can accommodate mixed responses with censored, constrained, and binary traits. The flexible nature of the models allows for nonlinear regression functions and non-standard features in the errors, such as asymmetry or multi-modality. The model has interpretable covariate-dependent weights constructed through normalization, allowing for combinations of categorical and continuous covariates. Computational difficulties for inference are overcome through an adaptive truncation algorithm combining adaptive Metropolis-Hastings and sequential Monte Carlo to create a sequence of automatically truncated posterior mixtures. For our study on Colombian women's life patterns, a variety of quantities are visualised and described, and in particular, our findings highlight the detrimental impact of family violence on women's choices and behaviors.

Submitted 20 January, 2021; v1 submitted 17 May, 2019; originally announced May 2019.

Comments: to appear in Bayesian Analysis

arXiv:1904.04901

A Bayesian approach to study synergistic interaction effects in in-vitro drug combination experiments

Authors: Andrea Cremaschi, Arnoldo Frigessi, Kjetil Taskén, Manuela Zucknick

Abstract: In cancer translational research, increasing effort is devoted to the study of the combined effect of two drugs when they are administered simultaneously. In this paper, we introduce a new approach to estimate the part of the effect of the two drugs that is due to the interaction of the compounds, i.e. to synergistic or antagonistic effects of the two drugs, compared to a reference value representing the condition when the combined compounds do not interact, called zero-interaction. We describe an in-vitro cell viability experiment as a random experiment, by interpreting cell viability as the probability of a cell in the experiment to be viable after treatment, and including information related to different exposure conditions. We propose a flexible Bayesian spline regression framework for modelling the viability surface of two drugs combined as a function of the concentrations. Since the proposed approach is based on a statistical model, it allows one to include replicates of the experiments, to evaluate the uncertainty of the estimates, and to perform prediction. We test the model fit and prediction performance on a simulation study, and on an ovarian cancer cell dataset. Posterior estimates of the zero-interaction level and of the synergy term, obtained via adaptive MCMC algorithms, are used to compute interpretable measures of efficacy of the combined experiment, including relative volume under the surface (rVUS) measures to summarise the zero-interaction and synergy terms and a bi-variate alternative to the well-known EC50 measure.

Submitted 20 September, 2019; v1 submitted 1 April, 2019; originally announced April 2019.

Showing 1–12 of 12 results for author: Cremaschi, A