-
Bayesian Nonparametrics for Principal Stratification with Continuous Post-Treatment Variables
Authors:
Dafne Zorzetto,
Antonio Canale,
Fabrizia Mealli,
Francesca Dominici,
Falco J. Bargagli-Stoffi
Abstract:
Principal stratification provides a causal inference framework that allows adjustment for confounded post-treatment variables when comparing treatments. Although the literature has focused mainly on binary post-treatment variables, there is a growing interest in principal stratification involving continuous post-treatment variables. However, characterizing the latent principal strata with a contin…
▽ More
Principal stratification provides a causal inference framework that allows adjustment for confounded post-treatment variables when comparing treatments. Although the literature has focused mainly on binary post-treatment variables, there is a growing interest in principal stratification involving continuous post-treatment variables. However, characterizing the latent principal strata with a continuous post-treatment presents a significant challenge, which is further complicated in observational studies where the treatment is not randomized. In this paper, we introduce the Confounders-Aware SHared atoms BAyesian mixture (CASBAH), a novel approach for principal stratification with continuous post-treatment variables that can be directly applied to observational studies. CASBAH leverages a dependent Dirichlet process, utilizing shared atoms across treatment levels, to effectively control for measured confounders and facilitate information sharing between treatment groups in the identification of principal strata membership. CASBAH also offers a comprehensive quantification of uncertainty surrounding the membership of the principal strata. Through Monte Carlo simulations, we show that the proposed methodology has excellent performance in characterizing the latent principal strata and estimating the effects of treatment on post-treatment variables and outcomes. Finally, CASBAH is applied to a case study in which we estimate the causal effects of US national air quality regulations on pollution levels and health outcomes.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Structured factorization for single-cell gene expression data
Authors:
Antonio Canale,
Luisa Galtarossa,
Davide Risso,
Lorenzo Schiavon,
Giovanni Toto
Abstract:
Single-cell gene expression data are often characterized by large matrices, where the number of cells may be lower than the number of genes of interest. Factorization models have emerged as powerful tools to condense the available information through a sparse decomposition into lower rank matrices. In this work, we adapt and implement a recent Bayesian class of generalized factor models to count d…
▽ More
Single-cell gene expression data are often characterized by large matrices, where the number of cells may be lower than the number of genes of interest. Factorization models have emerged as powerful tools to condense the available information through a sparse decomposition into lower rank matrices. In this work, we adapt and implement a recent Bayesian class of generalized factor models to count data and, specifically, to model the covariance between genes. The developed methodology also allows one to include exogenous information within the prior, such that recognition of covariance structures between genes is favoured. In this work, we use biological pathways as external information to induce sparsity patterns within the loadings matrix. This approach facilitates the interpretation of loadings columns and the corresponding latent factors, which can be regarded as unobserved cell covariates. We demonstrate the effectiveness of our model on single-cell RNA sequencing data obtained from lung adenocarcinoma cell lines, revealing promising insights into the role of pathways in characterizing gene relationships and extracting valuable information about unobserved cell traits.
△ Less
Submitted 19 May, 2023;
originally announced May 2023.
-
Confounder-Dependent Bayesian Mixture Model: Characterizing Heterogeneity of Causal Effects in Air Pollution Epidemiology
Authors:
Dafne Zorzetto,
Falco J. Bargagli-Stoffi,
Antonio Canale,
Francesca Dominici
Abstract:
Several epidemiological studies have provided evidence that long-term exposure to fine particulate matter (PM2.5) increases mortality risk. Furthermore, some population characteristics (e.g., age, race, and socioeconomic status) might play a crucial role in understanding vulnerability to air pollution. To inform policy, it is necessary to identify groups of the population that are more or less vul…
▽ More
Several epidemiological studies have provided evidence that long-term exposure to fine particulate matter (PM2.5) increases mortality risk. Furthermore, some population characteristics (e.g., age, race, and socioeconomic status) might play a crucial role in understanding vulnerability to air pollution. To inform policy, it is necessary to identify groups of the population that are more or less vulnerable to air pollution. In causal inference literature, the Group Average Treatment Effect (GATE) is a distinctive facet of the conditional average treatment effect. This widely employed metric serves to characterize the heterogeneity of a treatment effect based on some population characteristics. In this work, we introduce a novel Confounder-Dependent Bayesian Mixture Model (CDBMM) to characterize causal effect heterogeneity. More specifically, our method leverages the flexibility of the dependent Dirichlet process to model the distribution of the potential outcomes conditionally to the covariates and the treatment levels, thus enabling us to: (i) identify heterogeneous and mutually exclusive population groups defined by similar GATEs in a data-driven way, and (ii) estimate and characterize the causal effects within each of the identified groups. Through simulations, we demonstrate the effectiveness of our method in uncovering key insights about treatment effects heterogeneity. We apply our method to claims data from Medicare enrollees in Texas. We found six mutually exclusive groups where the causal effects of PM2.5 on mortality are heterogeneous.
△ Less
Submitted 30 October, 2023; v1 submitted 22 February, 2023;
originally announced February 2023.
-
Multipolar Hardy inequalities and mutual interaction of the poles
Authors:
Anna Canale
Abstract:
In this paper we state the weighted Hardy inequality \begin{equation*} c\int_{{\mathbb R}^N}\sum_{i=1}^n \frac{\varphi^2 }{|x-a_i|^2}\, μ(x)dx\le \int_{{\mathbb R}^N} |\nabla\varphi|^2 \, μ(x)dx +k \int_{\mathbb{R}^N}\varphi^2 \, μ(x)dx \end{equation*} for any $ \varphi$ in a weighted Sobolev spaces, with $c\in]0,c_o[$ where $c_o=c_o(N,μ)$ is the optimal constant, $a_1,\dots,a_n\in \mathbb{R}^N$,…
▽ More
In this paper we state the weighted Hardy inequality \begin{equation*} c\int_{{\mathbb R}^N}\sum_{i=1}^n \frac{\varphi^2 }{|x-a_i|^2}\, μ(x)dx\le \int_{{\mathbb R}^N} |\nabla\varphi|^2 \, μ(x)dx +k \int_{\mathbb{R}^N}\varphi^2 \, μ(x)dx \end{equation*} for any $ \varphi$ in a weighted Sobolev spaces, with $c\in]0,c_o[$ where $c_o=c_o(N,μ)$ is the optimal constant, $a_1,\dots,a_n\in \mathbb{R}^N$, $k$ is a constant depending on $μ$. We show the relation between $c$ and the closeness to the single pole. To this aim we analyze in detail the difficulties to be overcome to get the inequality.
△ Less
Submitted 7 February, 2023;
originally announced February 2023.
-
Accelerated structured matrix factorization
Authors:
Lorenzo Schiavon,
Bernardo Nipoti,
Antonio Canale
Abstract:
Matrix factorization exploits the idea that, in complex high-dimensional data, the actual signal typically lies in lower-dimensional structures. These lower dimensional objects provide useful insight, with interpretability favored by sparse structures. Sparsity, in addition, is beneficial in terms of regularization and, thus, to avoid over-fitting. By exploiting Bayesian shrinkage priors, we devis…
▽ More
Matrix factorization exploits the idea that, in complex high-dimensional data, the actual signal typically lies in lower-dimensional structures. These lower dimensional objects provide useful insight, with interpretability favored by sparse structures. Sparsity, in addition, is beneficial in terms of regularization and, thus, to avoid over-fitting. By exploiting Bayesian shrinkage priors, we devise a computationally convenient approach for high-dimensional matrix factorization. The dependence between row and column entities is modeled by inducing flexible sparse patterns within factors. The availability of external information is accounted for in such a way that structures are allowed while not imposed. Inspired by boosting algorithms, we pair the the proposed approach with a numerical strategy relying on a sequential inclusion and estimation of low-rank contributions, with data-driven stop** rule. Practical advantages of the proposed approach are demonstrated by means of a simulation study and the analysis of soccer heatmaps obtained from new generation tracking data.
△ Less
Submitted 13 December, 2022;
originally announced December 2022.
-
Counterexample to a Boesch's Conjecture
Authors:
Nicole Rosenstock,
Eduardo A. Canale
Abstract:
A key issue in network reliability analysis. A graph with $n$ nodes and whose $e$ edges fail independently with probability $p$ is an \emph{Uniformly Most Reliable Graph} (UMRG) if it has the highest reliability among all graphs with the same order and size for every value of $p$. The \emph{all-terminal reliability} is a polynomial in $p$ which defines the probability of a network to remain connec…
▽ More
A key issue in network reliability analysis. A graph with $n$ nodes and whose $e$ edges fail independently with probability $p$ is an \emph{Uniformly Most Reliable Graph} (UMRG) if it has the highest reliability among all graphs with the same order and size for every value of $p$. The \emph{all-terminal reliability} is a polynomial in $p$ which defines the probability of a network to remain connected if some of its components fail. If the coefficients of the reliability polynomial are maximized by a graph, that graph is called \textit{Strong Uniformly Most Reliable Graph} (SUMRG) and it should be UMRG. An exhaustive computer search of the SUMRG with vertices up to 9 is done. Regular graphs with 10 to 14 vertices that maximize tree number are proposed as candidates to UMRG. As an outstanding result a UMRG with 9 vertices and 18 edges which has girth 3 is found, so smaller than the conjectured by Boesch in 1986. A new conjecture about UMRG's topology is posed here: the $(n,e)$-UMRG is $\overline{(k-1)C_3\cup C_{3+r}}$ whenever $n=3k+r$,$n\geq5$ and $e={n(n-3)}/{2}$. A reformulation of Boesch's conjecture is presented stating that if a $(n, {kn}/{2})$-UMRG exists and it has girth $g$, then it has maximum girth among all $k$-regular $(n,{kn}/{2})$ graphs and minimum number of $g$-cycles among those $k$-regular $(n,{kn}/{2})$ graphs with girth $g$.
△ Less
Submitted 7 December, 2022;
originally announced December 2022.
-
The Palindromic Trees
Authors:
Tadashi Akagi,
Eduardo A. Canale
Abstract:
The family of trees with palindromic characteristic polynomials is characterized. Large families of graphs with this property are found as well.
The family of trees with palindromic characteristic polynomials is characterized. Large families of graphs with this property are found as well.
△ Less
Submitted 8 December, 2022; v1 submitted 6 December, 2022;
originally announced December 2022.
-
Multipolar potentials and weighted Hardy inequalities
Authors:
A. Canale
Abstract:
\begin{abstract}
In this paper we state the following weighted Hardy type inequality for any functions $\varphi$ in a weighted Sobolev space and for weight functions $μ$ of a quite general type \begin{equation*} c_{N,μ} \int_{\R^N}V\,\varphi^2μ(x)dx\le \int_{\R^N}|\nabla \varphi|^2μ(x)dx +C_μ\int_{\R^N}W \varphi^2μ(x)dx, \end{equation*} where $V$ is a multipolar potential and $W$ is a bounded fu…
▽ More
\begin{abstract}
In this paper we state the following weighted Hardy type inequality for any functions $\varphi$ in a weighted Sobolev space and for weight functions $μ$ of a quite general type \begin{equation*} c_{N,μ} \int_{\R^N}V\,\varphi^2μ(x)dx\le \int_{\R^N}|\nabla \varphi|^2μ(x)dx +C_μ\int_{\R^N}W \varphi^2μ(x)dx, \end{equation*} where $V$ is a multipolar potential and $W$ is a bounded function from above depending on $μ$. The method to get the result is based on the introduction of a suitable vector value function and on an integral identity that we state in the paper. We prove that the constant $c_{N,μ}$ in the estimate is optimal by building a suitable sequence of functions.
\end{abstract}
△ Less
Submitted 1 December, 2022;
originally announced December 2022.
-
Improved Hardy inequalities with a class of weights
Authors:
Anna Canale
Abstract:
\begin{abstract} In the paper we state conditions on potentials $V$ to get the improved Hardy inequality with weight \begin{equation*} \begin{split} c_{N,μ}\int_{\R^N}\frac{\varphi^2}{|x|^2}μ(x)dx&+ \int_{\R^N}V\,\varphi^2μ(x)dx \\&\le \int_{\R^N}|\nabla \varphi|^2μ(x)dx +K_1 \int_{\R^N} \varphi^2μ(x)dx, \end{split} \end{equation*} for functions $\varphi$ in a weighted Sobolev space and for weight…
▽ More
\begin{abstract} In the paper we state conditions on potentials $V$ to get the improved Hardy inequality with weight \begin{equation*} \begin{split} c_{N,μ}\int_{\R^N}\frac{\varphi^2}{|x|^2}μ(x)dx&+ \int_{\R^N}V\,\varphi^2μ(x)dx \\&\le \int_{\R^N}|\nabla \varphi|^2μ(x)dx +K_1 \int_{\R^N} \varphi^2μ(x)dx, \end{split} \end{equation*} for functions $\varphi$ in a weighted Sobolev space and for weight functions $μ$ of a quite general type. Some local improved Hardy inequalities are also given. To get the results we use a generalized vector field method. \end{abstract}
△ Less
Submitted 23 November, 2022; v1 submitted 27 September, 2022;
originally announced September 2022.
-
From weighted to unweighted graphs in Synchronizing Graph Theory
Authors:
Eduardo A. Canale
Abstract:
A way to associate unweighted graphs from weighted ones is presented, such that linear stable equilibria of the Kuramoto homogeneous model associated to both graphs coincide, i.e., equilibria of the system $\dotθ_i = \sum_{j \sim i} \sin(θ_{j}-θ_j)$, where $i\sim j$ means vertices $i$ and $j$ are adjacent in the corresponding graph. As a consequence, the existence of linearly stable equilibrium is…
▽ More
A way to associate unweighted graphs from weighted ones is presented, such that linear stable equilibria of the Kuramoto homogeneous model associated to both graphs coincide, i.e., equilibria of the system $\dotθ_i = \sum_{j \sim i} \sin(θ_{j}-θ_j)$, where $i\sim j$ means vertices $i$ and $j$ are adjacent in the corresponding graph. As a consequence, the existence of linearly stable equilibrium is proved to be NP-Hard as conjectured by R. Taylor in 2015 and a new lower bound for the minimum degree that ensures synchronization is found.
△ Less
Submitted 3 October, 2022; v1 submitted 13 September, 2022;
originally announced September 2022.
-
A hierarchical Bayesian non-asymptotic extreme value model for spatial data
Authors:
Federica Stolf,
Antonio Canale
Abstract:
Spatial maps of extreme precipitation are crucial in flood protection. With the aim of producing maps of precipitation return levels, we propose a novel approach to model a collection of spatially distributed time series where the asymptotic assumption, typical of the traditional extreme value theory, is relaxed. We introduce a Bayesian hierarchical model that accounts for the possible underlying…
▽ More
Spatial maps of extreme precipitation are crucial in flood protection. With the aim of producing maps of precipitation return levels, we propose a novel approach to model a collection of spatially distributed time series where the asymptotic assumption, typical of the traditional extreme value theory, is relaxed. We introduce a Bayesian hierarchical model that accounts for the possible underlying variability in the distribution of event magnitudes and occurrences, which are described through latent temporal and spatial processes. Spatial dependence is characterized by geographical covariates and effects not fully described by the covariates are captured by spatial structure in the hierarchies. The performance of the approach is illustrated through simulation studies and an application to daily rainfall extremes across North Carolina (USA). The results show that we significantly reduce the estimation uncertainty with respect to state of the art techniques.
△ Less
Submitted 26 April, 2023; v1 submitted 3 May, 2022;
originally announced May 2022.
-
Numerical evaluation of dual norms via the MM algorithm
Authors:
Bernardi Mauro,
Marco Stefanucci,
Antonio Canale
Abstract:
We deal with the problem of numerically computing the dual norm, which is important to study sparsity-inducing regularizations (Jenatton et al. 2011,Bach et al. 2012). The dual norms find application in optimization and statistical learning, for example, in the design of working-set strategies, for characterizing dual gradient methods, for dual decompositions and in the definition of augmented Lag…
▽ More
We deal with the problem of numerically computing the dual norm, which is important to study sparsity-inducing regularizations (Jenatton et al. 2011,Bach et al. 2012). The dual norms find application in optimization and statistical learning, for example, in the design of working-set strategies, for characterizing dual gradient methods, for dual decompositions and in the definition of augmented Lagrangian functions. Nevertheless, the dual norm of some well-known sparsity-inducing regolarization methods are not analytically available. Examples are the overlap group $\ell_2$-norm of (Jenatton et al. 2011) and the elastic net norm of Zhou and Hastie (2005). Therefore we resort to the Majorization-Minimization principle of Lange (2016) to provide an efficient algorithm that leverages a reparametrization of the dual constrained optimization problem as unconstrained optimization with barrier. Extensive simulation experiments have been performed in order to verify the correctness of operation, and evaluate the performance of the proposed method. Our results demonstrate the effectiveness of the algorithm in retrieving the dual norm even for large dimensions.
△ Less
Submitted 14 April, 2022;
originally announced April 2022.
-
Locally Sparse Function on function Regression
Authors:
Mauro Bernardi,
Antonio Canale,
Marco Stefanucci
Abstract:
In functional data analysis, functional linear regression has attracted significant attention recently. Herein, we consider the case where both the response and covariates are functions. There are two available approaches for addressing such a situation: concurrent and nonconcurrent functional models. In the former, the value of the functional response at a given domain point depends only on the v…
▽ More
In functional data analysis, functional linear regression has attracted significant attention recently. Herein, we consider the case where both the response and covariates are functions. There are two available approaches for addressing such a situation: concurrent and nonconcurrent functional models. In the former, the value of the functional response at a given domain point depends only on the value of the functional regressors evaluated at the same domain point, whereas, in the latter, the functional covariates evaluated at each point of their domain have a non-null effect on the response at any point of its domain. To balance these two extremes, we propose a locally sparse functional regression model in which the functional regression coefficient is allowed (but not forced) to be exactly zero for a subset of its domain. This is achieved using a suitable basis representation of the functional regression coefficient and exploiting an overlap** group-Lasso penalty for its estimation. We introduce efficient computational strategies based on majorization-minimization algorithms and discuss appealing theoretical properties regarding the model support and consistency of the proposed estimator. We further illustrate the empirical performance of the method through simulations and two applications related to human mortality and bidding the energy market.
△ Less
Submitted 27 September, 2021;
originally announced September 2021.
-
Efficient posterior sampling for Bayesian Poisson regression
Authors:
Laura D'Angelo,
Antonio Canale
Abstract:
Poisson log-linear models are ubiquitous in many applications, and one of the most popular approaches for parametric count regression. In the Bayesian context, however, there are no sufficient specific computational tools for efficient sampling from the posterior distribution of parameters, and standard algorithms, such as random walk Metropolis-Hastings or Hamiltonian Monte Carlo algorithms, are…
▽ More
Poisson log-linear models are ubiquitous in many applications, and one of the most popular approaches for parametric count regression. In the Bayesian context, however, there are no sufficient specific computational tools for efficient sampling from the posterior distribution of parameters, and standard algorithms, such as random walk Metropolis-Hastings or Hamiltonian Monte Carlo algorithms, are typically used. Herein, we developed an efficient Metropolis-Hastings algorithm and importance sampler to simulate from the posterior distribution of the parameters of Poisson log-linear models under conditional Gaussian priors with superior performance with respect to the state-of-the-art alternatives. The key for both algorithms is the introduction of a proposal density based on a Gaussian approximation of the posterior distribution of parameters. Specifically, our result leverages the negative binomial approximation of the Poisson likelihood and the successful Pólya-gamma data augmentation scheme. Via simulation, we obtained that the time per independent sample of the proposed samplers is competitive with that obtained using the successful Hamiltonian Monte Carlo sampling, with the Metropolis-Hastings showing superior performance in all scenarios considered.
△ Less
Submitted 1 September, 2022; v1 submitted 20 September, 2021;
originally announced September 2021.
-
Semiparametric Functional Factor Models with Bayesian Rank Selection
Authors:
Daniel R. Kowal,
Antonio Canale
Abstract:
Functional data are frequently accompanied by a parametric template that describes the typical shapes of the functions. However, these parametric templates can incur significant bias, which undermines both utility and interpretability. To correct for model misspecification, we augment the parametric template with an infinite-dimensional nonparametric functional basis. The nonparametric basis funct…
▽ More
Functional data are frequently accompanied by a parametric template that describes the typical shapes of the functions. However, these parametric templates can incur significant bias, which undermines both utility and interpretability. To correct for model misspecification, we augment the parametric template with an infinite-dimensional nonparametric functional basis. The nonparametric basis functions are learned from the data and constrained to be orthogonal to the parametric template, which preserves distinctness between the parametric and nonparametric terms. This distinctness is essential to prevent functional confounding, which otherwise induces severe bias for the parametric terms. The nonparametric factors are regularized with an ordered spike-and-slab prior that provides consistent rank selection and satisfies several appealing theoretical properties. The versatility of the proposed approach is illustrated through applications to synthetic data, human motor control data, and dynamic yield curve data. Relative to parametric and semiparametric alternatives, the proposed semiparametric functional factor model eliminates bias, reduces excessive posterior and predictive uncertainty, and provides reliable inference on the effective number of nonparametric terms--all with minimal additional computational costs.
△ Less
Submitted 16 May, 2022; v1 submitted 4 August, 2021;
originally announced August 2021.
-
Inner spike and slab Bayesian nonparametric models
Authors:
Antonio Canale,
Antonio Lijoi,
Bernardo Nipoti,
Igor Prünster
Abstract:
Discrete Bayesian nonparametric models whose expectation is a convex linear combination of a point mass at some point of the support and a diffuse probability distribution allow to incorporate strong prior information, while still being extremely flexible. Recent contributions in the statistical literature have successfully implemented such a modelling strategy in a variety of applications, includ…
▽ More
Discrete Bayesian nonparametric models whose expectation is a convex linear combination of a point mass at some point of the support and a diffuse probability distribution allow to incorporate strong prior information, while still being extremely flexible. Recent contributions in the statistical literature have successfully implemented such a modelling strategy in a variety of applications, including density estimation, nonparametric regression and model-based clustering. We provide a thorough study of a large class of nonparametric models we call inner spike and slab hNRMI models, which are obtained by considering homogeneous normalized random measures with independent increments (hNRMI) with base measure given by a convex linear combination of a point mass and a diffuse probability distribution. In this paper we investigate the distributional properties of these models and our results include: i) the exchangeable partition probability function they induce, ii) the distribution of the number of distinct values in an exchangeable sample, iii) the posterior predictive distribution, and iv) the distribution of the number of elements that coincide with the only point of the support with positive probability. Our findings are the main building block for an actual implementation of Bayesian inner spike and slab hNRMI models by means of a generalized Pólya urn scheme.
△ Less
Submitted 21 July, 2021;
originally announced July 2021.
-
Generalized infinite factorization models
Authors:
Lorenzo Schiavon,
Antonio Canale,
David B. Dunson
Abstract:
Factorization models express a statistical object of interest in terms of a collection of simpler objects. For example, a matrix or tensor can be expressed as a sum of rank-one components. However, in practice, it can be challenging to infer the relative impact of the different components as well as the number of components. A popular idea is to include infinitely many components having impact dec…
▽ More
Factorization models express a statistical object of interest in terms of a collection of simpler objects. For example, a matrix or tensor can be expressed as a sum of rank-one components. However, in practice, it can be challenging to infer the relative impact of the different components as well as the number of components. A popular idea is to include infinitely many components having impact decreasing with the component index. This article is motivated by two limitations of existing methods: (1) lack of careful consideration of the within component sparsity structure; and (2) no accommodation for grouped variables and other non-exchangeable structures. We propose a general class of infinite factorization models that address these limitations. Theoretical support is provided, practical gains are shown in simulation studies, and an ecology application focusing on modelling bird species occurrence is discussed.
△ Less
Submitted 23 October, 2021; v1 submitted 18 March, 2021;
originally announced March 2021.
-
Bayesian nonparametric analysis for the detection of spikes in noisy calcium imaging data
Authors:
Laura D'Angelo,
Antonio Canale,
Zhaoxia Yu,
Michele Guindani
Abstract:
Recent advancements in miniaturized fluorescence microscopy have made it possible to investigate neuronal responses to external stimuli in awake behaving animals through the analysis of intra-cellular calcium signals. An on-going challenge is deconvolving the temporal signals to extract the spike trains from the noisy calcium signals' time-series. In this manuscript, we propose a nested Bayesian f…
▽ More
Recent advancements in miniaturized fluorescence microscopy have made it possible to investigate neuronal responses to external stimuli in awake behaving animals through the analysis of intra-cellular calcium signals. An on-going challenge is deconvolving the temporal signals to extract the spike trains from the noisy calcium signals' time-series. In this manuscript, we propose a nested Bayesian finite mixture specification that allows the estimation of spiking activity and, simultaneously, reconstructing the distributions of the calcium transient spikes' amplitudes under different experimental conditions. The proposed model leverages two nested layers of random discrete mixture priors to borrow information between experiments and discover similarities in the distributional patterns of neuronal responses to different stimuli. Furthermore, the spikes' intensity values are also clustered within and between experimental conditions to determine the existence of common (recurring) response amplitudes. Simulation studies and the analysis of a data set from the Allen Brain Observatory show the effectiveness of the method in clustering and detecting neuronal activities.
△ Less
Submitted 27 January, 2022; v1 submitted 18 February, 2021;
originally announced February 2021.
-
Tromino Tilings with Pegs via Flow Networks
Authors:
Javier T. Akagi,
Eduardo A. Canale,
Marcos Villagra
Abstract:
A tromino tiling problem is a packing puzzle where we are given a region of connected lattice squares and we want to decide whether there exists a tiling of the region using trominoes with the shape of an L. In this work we study a slight variation of the tromino tiling problem where some positions of the region have pegs and each tromino comes with a hole that can only be placed on top of the peg…
▽ More
A tromino tiling problem is a packing puzzle where we are given a region of connected lattice squares and we want to decide whether there exists a tiling of the region using trominoes with the shape of an L. In this work we study a slight variation of the tromino tiling problem where some positions of the region have pegs and each tromino comes with a hole that can only be placed on top of the pegs. We present a characterization of this tiling problem with pegs using flow networks and show that (i) there exists a linear-time parsimonious reduction to the maximum-flow problem, and (ii) counting the number of such tilings can be done in linear-time. The proofs of both results contain algorithms that can then be used to decide the tiling of a region with pegs in $O(n)$ time.
△ Less
Submitted 13 March, 2021; v1 submitted 24 July, 2020;
originally announced July 2020.
-
Esca** the curse of dimensionality in Bayesian model based clustering
Authors:
Noirrit Kiran Chandra,
Antonio Canale,
David B. Dunson
Abstract:
Bayesian mixture models are widely used for clustering of high-dimensional data with appropriate uncertainty quantification. However, as the dimension of the observations increases, posterior inference often tends to favor too many or too few clusters. This article explains this behavior by studying the random partition posterior in a non-standard setting with a fixed sample size and increasing da…
▽ More
Bayesian mixture models are widely used for clustering of high-dimensional data with appropriate uncertainty quantification. However, as the dimension of the observations increases, posterior inference often tends to favor too many or too few clusters. This article explains this behavior by studying the random partition posterior in a non-standard setting with a fixed sample size and increasing data dimensionality. We provide conditions under which the finite sample posterior tends to either assign every observation to a different cluster or all observations to the same cluster as the dimension grows. Interestingly, the conditions do not depend on the choice of clustering prior, as long as all possible partitions of observations into clusters have positive prior probabilities, and hold irrespective of the true data-generating model. We then propose a class of latent mixtures for Bayesian clustering (Lamb) on a set of low-dimensional latent variables inducing a partition on the observed data. The model is amenable to scalable posterior inference and we show that it can avoid the pitfalls of high-dimensionality under mild assumptions. The proposed approach is shown to have good performance in simulation studies and an application to inferring cell types based on scRNAseq.
△ Less
Submitted 20 November, 2022; v1 submitted 4 June, 2020;
originally announced June 2020.
-
Bayesian non-asymptotic extreme value models for environmental data
Authors:
Enrico Zorzetto,
Antonio Canale,
Marco Marani
Abstract:
Motivated by the analysis of extreme rainfall data, we introduce a general Bayesian hierarchical model for estimating the probability distribution of extreme values of intermittent random sequences, a common problem in geophysical and environmental science settings. The approach presented here relaxes the asymptotic assumption typical of the traditional extreme value (EV) theory, and accounts for…
▽ More
Motivated by the analysis of extreme rainfall data, we introduce a general Bayesian hierarchical model for estimating the probability distribution of extreme values of intermittent random sequences, a common problem in geophysical and environmental science settings. The approach presented here relaxes the asymptotic assumption typical of the traditional extreme value (EV) theory, and accounts for the possible underlying variability in the distribution of event magnitudes and occurrences, which are described through a latent temporal process. Focusing on daily rainfall extremes, the structure of the proposed model lends itself to incorporating prior geo-physical understanding of the rainfall process. By means of an extensive simulation study, we show that this methodology can significantly reduce estimation uncertainty with respect to Bayesian formulations of traditional asymptotic EV methods, particularly in the case of relatively small samples. The benefits of the approach are further illustrated with an application to a large data set of 479 long daily rainfall historical records from across the continental United States. By comparing measures of in-sample and out-of-sample predictive accuracy, we find that the model structure developed here, combined with the use of all available observations for inference, significantly improves robustness with respect to overfitting to the specific sample.
△ Less
Submitted 25 May, 2020;
originally announced May 2020.
-
Multiscale stick-breaking mixture models
Authors:
Marco Stefanucci,
Antonio Canale
Abstract:
We introduce a family of multiscale stick-breaking mixture models for Bayesian nonparametric density estimation. The Bayesian nonparametric literature is dominated by single scale methods, exception made for Pòlya trees and allied approaches. Our proposal is based on a mixture specification exploiting an infinitely-deep binary tree of random weights that grows according to a multiscale generalizat…
▽ More
We introduce a family of multiscale stick-breaking mixture models for Bayesian nonparametric density estimation. The Bayesian nonparametric literature is dominated by single scale methods, exception made for Pòlya trees and allied approaches. Our proposal is based on a mixture specification exploiting an infinitely-deep binary tree of random weights that grows according to a multiscale generalization of a large class of stick-breaking processes; this multiscale stick-breaking is paired with specific stochastic processes generating sequences of parameters that induce stochastically ordered kernel functions. Properties of this family of multiscale stick-breaking mixtures are described. Focusing on a Gaussian specification, a Markov Chain Montecarlo algorithm for posterior computation is introduced. The performance of the method is illustrated analyzing both synthetic and real data sets. The method is well-suited for data living in $\mathbb{R}$ and is able to detect densities with varying degree of smoothness and local features.
△ Less
Submitted 16 January, 2020;
originally announced January 2020.
-
Weighted multipolar Hardy inequalities and evolution problems with Kolmogorov operators perturbed by singular potentials
Authors:
Anna Canale,
Francesco Pappalardo,
Ciro Tarantino
Abstract:
The main results in the paper are the weighted multipolar Hardy inequalities \begin{equation*} c\int_{\R^N}\sum_{i=1}^n\frac{u^2}{|x-a_i|^2}\,dμ\leq\int_{\R^N}|\nabla u |^2dμ+ K\int_{\R^N} u^2dμ, \end{equation*} in $\R^N$ for any $u$ in a suitable weighted Sobolev space, with $0<c\le c_{o,μ}$, $a_1,\dots,a_n\in \R^N$, $K$ constant. The weight functions $μ$ are of a quite general type.
The paper…
▽ More
The main results in the paper are the weighted multipolar Hardy inequalities \begin{equation*} c\int_{\R^N}\sum_{i=1}^n\frac{u^2}{|x-a_i|^2}\,dμ\leq\int_{\R^N}|\nabla u |^2dμ+ K\int_{\R^N} u^2dμ, \end{equation*} in $\R^N$ for any $u$ in a suitable weighted Sobolev space, with $0<c\le c_{o,μ}$, $a_1,\dots,a_n\in \R^N$, $K$ constant. The weight functions $μ$ are of a quite general type.
The paper fits in the framework of the study of Kolmogorov operators \begin{equation*} Lu=Δu+\frac{\nabla μ}μ\cdot\nabla u, \end{equation*} perturbed by multipolar inverse square potentials, and of the related evolution problems.
The necessary and sufficient conditions for the existence of positive exponentially bounded in time solutions to the associated initial value problem are based on weighted Hardy inequalities. The optimality of the constant constant $c_{o,μ}$ allow us to state the nonexistence of positive solutions.
We follow the Cabré-Martel's approach. To this aim we state some properties of the operator $L$, of its corresponding $C_0$-semigroup and density results.
△ Less
Submitted 6 August, 2019;
originally announced August 2019.
-
Simultaneous Transformation and Rounding (STAR) Models for Integer-Valued Data
Authors:
Daniel R. Kowal,
Antonio Canale
Abstract:
We propose a simple yet powerful framework for modeling integer-valued data, such as counts, scores, and rounded data. The data-generating process is defined by Simultaneously Transforming and Rounding (STAR) a continuous-valued process, which produces a flexible family of integer-valued distributions capable of modeling zero-inflation, bounded or censored data, and over- or underdispersion. The t…
▽ More
We propose a simple yet powerful framework for modeling integer-valued data, such as counts, scores, and rounded data. The data-generating process is defined by Simultaneously Transforming and Rounding (STAR) a continuous-valued process, which produces a flexible family of integer-valued distributions capable of modeling zero-inflation, bounded or censored data, and over- or underdispersion. The transformation is modeled as unknown for greater distributional flexibility, while the rounding operation ensures a coherent integer-valued data-generating process. An efficient MCMC algorithm is developed for posterior inference and provides a mechanism for adaptation of successful Bayesian models and algorithms for continuous data to the integer-valued data setting. Using the STAR framework, we design a new Bayesian Additive Regression Tree (BART) model for integer-valued data, which demonstrates impressive predictive distribution accuracy for both synthetic data and a large healthcare utilization dataset. For interpretable regression-based inference, we develop a STAR additive model, which offers greater flexibility and scalability than existing integer-valued models. The STAR additive model is applied to study the recent decline in Amazon river dolphins.
△ Less
Submitted 3 September, 2019; v1 submitted 27 June, 2019;
originally announced June 2019.
-
Importance conditional sampling for Pitman-Yor mixtures
Authors:
Antonio Canale,
Riccardo Corradin,
Bernardo Nipoti
Abstract:
Nonparametric mixture models based on the Pitman-Yor process represent a flexible tool for density estimation and clustering. Natural generalization of the popular class of Dirichlet process mixture models, they allow for more robust inference on the number of components characterizing the distribution of the data. We propose a new sampling strategy for such models, named importance conditional sa…
▽ More
Nonparametric mixture models based on the Pitman-Yor process represent a flexible tool for density estimation and clustering. Natural generalization of the popular class of Dirichlet process mixture models, they allow for more robust inference on the number of components characterizing the distribution of the data. We propose a new sampling strategy for such models, named importance conditional sampling (ICS), which combines appealing properties of existing methods, including easy interpretability and a within-iteration parallelizable structure. An extensive simulation study highlights the efficiency of the proposed method which, unlike other conditional samplers, shows stable performances for different specifications of the parameters characterizing the Pitman-Yor process. We further show that the ICS approach can be naturally extended to other classes of computationally demanding models, such as nonparametric mixture models for partially exchangeable data.
△ Less
Submitted 23 October, 2021; v1 submitted 19 June, 2019;
originally announced June 2019.
-
Pairwise likelihood inference for the multivariate ordered probit model
Authors:
Martina Bravo,
Antonio Canale
Abstract:
This paper provides a closed form expression for the pairwise score vector for the multivariate ordered probit model. This result has several implications in likelihood-based inference. It is indeed used both to speed-up gradient based optimization routines for point estimation, and to provide a building block to compute standard errors and confidence intervals by means of the Godambe matrix.
This paper provides a closed form expression for the pairwise score vector for the multivariate ordered probit model. This result has several implications in likelihood-based inference. It is indeed used both to speed-up gradient based optimization routines for point estimation, and to provide a building block to compute standard errors and confidence intervals by means of the Godambe matrix.
△ Less
Submitted 29 January, 2019;
originally announced January 2019.
-
A class of weighted Hardy inequalities and applications to evolution problems
Authors:
Anna Canale,
Francesco Pappalardo,
Ciro Tarantino
Abstract:
\begin{abstract} We state the following weighted Hardy inequality \begin{equation*} c_{o, μ}\int_{{\R}^N}\frac{\varphi^2 }{|x|^2}\, dμ\le \int_{{\R}^N} |\nabla\varphi|^2 \, dμ+
K \int_{\R^N}\varphi^2 \, dμ\quad \forall\, \varphi \in H_μ^1 %\qquad c\le c_μ, \end{equation*} in the context of the study of the Kolmogorov operators \begin{equation*} Lu=Δu+\frac{\nabla μ}μ\cdot\nabla u \end{equation*}…
▽ More
\begin{abstract} We state the following weighted Hardy inequality \begin{equation*} c_{o, μ}\int_{{\R}^N}\frac{\varphi^2 }{|x|^2}\, dμ\le \int_{{\R}^N} |\nabla\varphi|^2 \, dμ+
K \int_{\R^N}\varphi^2 \, dμ\quad \forall\, \varphi \in H_μ^1 %\qquad c\le c_μ, \end{equation*} in the context of the study of the Kolmogorov operators \begin{equation*} Lu=Δu+\frac{\nabla μ}μ\cdot\nabla u \end{equation*} perturbed by inverse square potentials and of the related evolution problems. The function $μ$ in the drift term is a probability density on $\R^N$. We prove the optimality of the constant $c_{o, μ}$ and state existence and nonexistence results following the Cabré-Martel's approach \cite{CabreMartel} extended to Kolmogorov operators. \end{abstract}
△ Less
Submitted 29 April, 2019; v1 submitted 7 December, 2018;
originally announced December 2018.
-
Weighted Hardy inequalities and Ornstein-Uhlenbeck type operators perturbed by multipolar inverse square potentials
Authors:
A. Canale,
F. Pappalardo
Abstract:
We give necessary and sufficient conditions for the existence of weak solutions of a parabolic problem corresponding to the Kolmogorov operators perturbed by a multipolar inverse square potential with respect to the Gaussian probability measure which is the unique invariant measure for Ornstein-Uhlenbeck type operators. We state the optimality of the constant and, then, the nonexistence of positiv…
▽ More
We give necessary and sufficient conditions for the existence of weak solutions of a parabolic problem corresponding to the Kolmogorov operators perturbed by a multipolar inverse square potential with respect to the Gaussian probability measure which is the unique invariant measure for Ornstein-Uhlenbeck type operators. We state the optimality of the constant and, then, the nonexistence of positive exponentially bounded solutions to the parabolic problem.
△ Less
Submitted 2 August, 2017;
originally announced August 2017.
-
A nested expectation-maximization algorithm for latent class models with covariates
Authors:
Daniele Durante,
Antonio Canale,
Tommaso Rigon
Abstract:
We develop a nested EM routine for latent class models with covariates which allows maximization of the full-model log-likelihood and, differently from current methods, guarantees monotone log-likelihood sequences along with improved convergence rates.
We develop a nested EM routine for latent class models with covariates which allows maximization of the full-model log-likelihood and, differently from current methods, guarantees monotone log-likelihood sequences along with improved convergence rates.
△ Less
Submitted 2 August, 2018; v1 submitted 10 May, 2017;
originally announced May 2017.
-
Weighted Hardy's inequalities and Kolmogorov-type operators
Authors:
Anna Canale,
Federica Gregorio,
Abdelaziz Rhandi,
Cristian Tacelli
Abstract:
We give general conditions to state the weighted Hardy inequality \[ c\int_{\mathbb{R}^N}\frac{\varphi^2} {|x|^2}dμ\leq\int_{\mathbb{R}^N}|\nabla \varphi |^2 dμ+C\int_{\mathbb{R}^N} \varphi^2dμ,\quad \varphi\in C_c^{\infty}(\mathbb{R}^N),\,c\leq c_{0,μ}, \] with respect to a probability measure $dμ$. Moreover, the optimality of the constant $c_{0,μ}$ is given. The inequality is related to the foll…
▽ More
We give general conditions to state the weighted Hardy inequality \[ c\int_{\mathbb{R}^N}\frac{\varphi^2} {|x|^2}dμ\leq\int_{\mathbb{R}^N}|\nabla \varphi |^2 dμ+C\int_{\mathbb{R}^N} \varphi^2dμ,\quad \varphi\in C_c^{\infty}(\mathbb{R}^N),\,c\leq c_{0,μ}, \] with respect to a probability measure $dμ$. Moreover, the optimality of the constant $c_{0,μ}$ is given. The inequality is related to the following Kolmogorov equation perturbed by a singular potential \[ Lu+Vu=\left(Δu+\frac{\nabla μ}μ\cdot \nabla u\right)+\frac{c}{|x|^2}u \] for which the existence of positive solutions to the corresponding parabolic problem can be investigated. The hypotheses on $dμ$ allow the drift term to be of type $\frac{\nabla μ}μ= -|x|^{m-2}x$ with $m> 0$.
△ Less
Submitted 31 July, 2017; v1 submitted 30 March, 2017;
originally announced March 2017.
-
Convex Mixture Regression for Quantitative Risk Assessment
Authors:
Antonio Canale,
Daniele Durante,
David Dunson
Abstract:
There is wide interest in studying how the distribution of a continuous response changes with a predictor. We are motivated by environmental applications in which the predictor is the dose of an exposure and the response is a health outcome. A main focus in these studies is inference on dose levels associated with a given increase in risk relative to a baseline. Popular methods either dichotomize…
▽ More
There is wide interest in studying how the distribution of a continuous response changes with a predictor. We are motivated by environmental applications in which the predictor is the dose of an exposure and the response is a health outcome. A main focus in these studies is inference on dose levels associated with a given increase in risk relative to a baseline. Popular methods either dichotomize the continuous response or focus on modeling changes with the dose in the expectation of the outcome. Such choices may lead to information loss and provide inaccurate inference on dose-response relationships. We instead propose a Bayesian convex mixture regression model that allows the entire distribution of the health outcome to be unknown and changing with the dose. To balance flexibility and parsimony, we rely on a mixture model for the density at the extreme doses, and express the conditional density at each intermediate dose via a convex combination of these extremal densities. This representation generalizes classical dose-response models for quantitative outcomes, and provides a more parsimonious, but still powerful, formulation compared to nonparametric methods, thereby improving interpretability and efficiency in inference on risk functions. A Markov chain Monte Carlo algorithm for posterior inference is developed, and the benefits of our methods are outlined in simulations, along with a study on the impact of DDT exposure on gestational age.
△ Less
Submitted 9 May, 2018; v1 submitted 11 January, 2017;
originally announced January 2017.
-
Model based approach for household clustering with mixed scale variables
Authors:
Christian Carmona,
Luis Nieto-Barajas,
Antonio Canale
Abstract:
The Ministry of Social Development in Mexico is in charge of creating and assigning social programmes targeting specific needs in the population for the improvement of quality of life. To better target the social programmes, the Ministry is aimed to find clusters of households with the same needs based on demographic characteristics as well as poverty conditions of the household. Available data co…
▽ More
The Ministry of Social Development in Mexico is in charge of creating and assigning social programmes targeting specific needs in the population for the improvement of quality of life. To better target the social programmes, the Ministry is aimed to find clusters of households with the same needs based on demographic characteristics as well as poverty conditions of the household. Available data consists of continuous, ordinal, and nominal variables and the observations are not iid but come from a survey sample based on a complex design. We propose a Bayesian nonparametric mixture model that jointly models this mixed scale data and accommodates for the different sampling probabilities. The performance of the model is assessed via simulated data. A full analysis of socio-economic conditions in households in the State of Mexico is presented.
△ Less
Submitted 23 November, 2017; v1 submitted 30 November, 2016;
originally announced December 2016.
-
Bayesian nonparametric forecasting of monotonic functional time series
Authors:
Antonio Canale,
Matteo Ruggiero
Abstract:
We propose a Bayesian nonparametric approach to modelling and predicting a class of functional time series with application to energy markets, based on fully observed, noise-free functional data. Traders in such contexts conceive profitable strategies if they can anticipate the impact of their bidding actions on the aggregate demand and supply curves, which in turn need to be predicted reliably. H…
▽ More
We propose a Bayesian nonparametric approach to modelling and predicting a class of functional time series with application to energy markets, based on fully observed, noise-free functional data. Traders in such contexts conceive profitable strategies if they can anticipate the impact of their bidding actions on the aggregate demand and supply curves, which in turn need to be predicted reliably. Here we propose a simple Bayesian nonparametric method for predicting such curves, which take the form of monotonic bounded step functions. We borrow ideas from population genetics by defining a class of interacting particle systems to model the functional trajectory, and develop an implementation strategy which uses ideas from Markov chain Monte Carlo and approximate Bayesian computation techniques and allows to circumvent the intractability of the likelihood. Our approach shows great adaptation to the degree of smoothness of the curves and the volatility of the functional series, proves to be robust to an increase of the forecast horizon and yields an uncertainty quantification for the functional forecasts. We illustrate the model and discuss its performance with simulated datasets and on real data relative to the Italian natural gas market.
△ Less
Submitted 29 August, 2016;
originally announced August 2016.
-
Optimal kernel estimates for a Schrödinger type operator
Authors:
Anna Canale,
Cristian Tacelli
Abstract:
In the paper the principal result obtained is the estimate for the heat kernel associated to the Schrödinger type operator $(1+|x|^α)Δ-|x|^β$ \[ k(t,x,y)\leq Ct^{-\fracθ{2}}\frac {\varphi(x)\varphi(y)}{1+|x|^α}, \] where $\varphi=(1+|x|^α)^{\frac{2-θ}{4}+\frac{1}α\frac{θ-N}{2}}$, $θ\geq N$ and $0<t<1$, provided that $N>2$, $α> 2$ and $β>α-2$. This estimate improves a similar estimate in \cite {can…
▽ More
In the paper the principal result obtained is the estimate for the heat kernel associated to the Schrödinger type operator $(1+|x|^α)Δ-|x|^β$ \[ k(t,x,y)\leq Ct^{-\fracθ{2}}\frac {\varphi(x)\varphi(y)}{1+|x|^α}, \] where $\varphi=(1+|x|^α)^{\frac{2-θ}{4}+\frac{1}α\frac{θ-N}{2}}$, $θ\geq N$ and $0<t<1$, provided that $N>2$, $α> 2$ and $β>α-2$. This estimate improves a similar estimate in \cite {can-rhan-tac2} with respect to the dependence on spatial component.
△ Less
Submitted 13 April, 2016;
originally announced April 2016.
-
Kernel estimates for Schrödinger type operators with unbounded diffusion and potential terms
Authors:
Anna Canale,
Abdelaziz Rhandi,
Cristian Tacelli
Abstract:
We prove that the heat kernel associated to the Schrödinger type operator $A:=(1+|x|^α)Δ-|x|^β$ satisfies the estimate $$k(t,x,y)\leq c_1e^{λ_0t}e^{c_2t^{-b}}\frac{(|x||y|)^{-\frac{N-1}{2}-\frac{β-α}{4}}}{1+|y|^α} e^{-\frac{2}{β-α+2}|x|^{\frac{β-α+2}{2}}} e^{-\frac{2}{β-α+2}|y|^{\frac{β-α+2}{2}}} $$ for $t>0,|x|,|y|\ge 1$, where $c_1,c_2$ are positive constants and $b=\frac{β-α+2}{β+α-2}$ provided…
▽ More
We prove that the heat kernel associated to the Schrödinger type operator $A:=(1+|x|^α)Δ-|x|^β$ satisfies the estimate $$k(t,x,y)\leq c_1e^{λ_0t}e^{c_2t^{-b}}\frac{(|x||y|)^{-\frac{N-1}{2}-\frac{β-α}{4}}}{1+|y|^α} e^{-\frac{2}{β-α+2}|x|^{\frac{β-α+2}{2}}} e^{-\frac{2}{β-α+2}|y|^{\frac{β-α+2}{2}}} $$ for $t>0,|x|,|y|\ge 1$, where $c_1,c_2$ are positive constants and $b=\frac{β-α+2}{β+α-2}$ provided that $N>2,\,α\geq 2$ and $β>α-2$. We also obtain an estimate of the eigenfunctions of $A$.
△ Less
Submitted 23 November, 2016; v1 submitted 5 January, 2015;
originally announced January 2015.
-
Morrey type spaces and multiplication operator in Sobolev spaces
Authors:
A. Canale,
C. Tarantino
Abstract:
The paper deals with the operator $u\rightarrow gu$ defined in the Sobolev space $W^{r,p}(Ω)$ and which takes values in $L^p(Ω)$ when $Ω$ is an unbounded open subset in $R^n$. The functions $g$ belong to wider spaces of $L^p$ connected with the Morrey type spaces. $L^p$ estimates and compactness results are stated.
The paper deals with the operator $u\rightarrow gu$ defined in the Sobolev space $W^{r,p}(Ω)$ and which takes values in $L^p(Ω)$ when $Ω$ is an unbounded open subset in $R^n$. The functions $g$ belong to wider spaces of $L^p$ connected with the Morrey type spaces. $L^p$ estimates and compactness results are stated.
△ Less
Submitted 21 December, 2014;
originally announced December 2014.
-
Scalable multiscale density estimation
Authors:
Ye Wang,
Antonio Canale,
David Dunson
Abstract:
Although Bayesian density estimation using discrete mixtures has good performance in modest dimensions, there is a lack of statistical and computational scalability to high-dimensional multivariate cases. To combat the curse of dimensionality, it is necessary to assume the data are concentrated near a lower-dimensional subspace. However, Bayesian methods for learning this subspace along with the d…
▽ More
Although Bayesian density estimation using discrete mixtures has good performance in modest dimensions, there is a lack of statistical and computational scalability to high-dimensional multivariate cases. To combat the curse of dimensionality, it is necessary to assume the data are concentrated near a lower-dimensional subspace. However, Bayesian methods for learning this subspace along with the density of the data scale poorly computationally. To solve this problem, we propose an empirical Bayes approach, which estimates a multiscale dictionary using geometric multiresolution analysis in a first stage. We use this dictionary within a multiscale mixture model, which allows uncertainty in component allocation, mixture weights and scaling factors over a binary tree. A computational algorithm is proposed, which scales efficiently to massive dimensional problems. We provide some theoretical support for this geometric density estimation (GEODE) method, and illustrate the performance through simulated and real data examples.
△ Less
Submitted 28 October, 2014;
originally announced October 2014.
-
Multiscale Bernstein polynomials for densities
Authors:
Antonio Canale,
David B. Dunson
Abstract:
Our focus is on constructing a multiscale nonparametric prior for densities. The Bayes density estimation literature is dominated by single scale methods, with the exception of Polya trees, which favor overly-spiky densities even when the truth is smooth. We propose a multiscale Bernstein polynomial family of priors, which produce smooth realizations that do not rely on hard partitioning of the su…
▽ More
Our focus is on constructing a multiscale nonparametric prior for densities. The Bayes density estimation literature is dominated by single scale methods, with the exception of Polya trees, which favor overly-spiky densities even when the truth is smooth. We propose a multiscale Bernstein polynomial family of priors, which produce smooth realizations that do not rely on hard partitioning of the support. At each level in an infinitely-deep binary tree, we place a beta dictionary density; within a scale the densities are equivalent to Bernstein polynomials. Using a stick-breaking characterization, stochastically decreasing weights are allocated to the finer scale dictionary elements. A slice sampler is used for posterior computation, and properties are described. The method characterizes densities with locally-varying smoothness, and can produce a sequence of coarse to fine density estimates. An extension for Bayesian testing of group differences is introduced and applied to DNA methylation array data.
△ Less
Submitted 3 October, 2014;
originally announced October 2014.
-
Analitic approach to solve a degenerate parabolic PDE for the Heston model
Authors:
A. Canale,
R. M. Mininni,
A. Rhandi
Abstract:
We present an analytic approach to solve a degenerate parabolic problem associated to the Heston model, which is widely used in mathematical finance to derive the price of an European option on an risky asset with stochastic volatility. We give a variational formulation, involving weighted Sobolev spaces, of the second order degenerate elliptic operator of the parabolic PDE. We use this approach t…
▽ More
We present an analytic approach to solve a degenerate parabolic problem associated to the Heston model, which is widely used in mathematical finance to derive the price of an European option on an risky asset with stochastic volatility. We give a variational formulation, involving weighted Sobolev spaces, of the second order degenerate elliptic operator of the parabolic PDE. We use this approach to prove, under appropriate assumptions on some involved unknown parameters, the existence and uniqueness of weak solutions to the parabolic problem on unbounded subdomains of the half-plane.
△ Less
Submitted 9 June, 2014;
originally announced June 2014.
-
Schrödinger type operators with unbounded diffusion and potential terms
Authors:
Anna Canale,
Abdelaziz Rhandi,
Cristian Tacelli
Abstract:
We prove that the realization $A_p$ in $L^p(\mathbb{R}^N),\,1<p<\infty$, of the Schrödinger type operator $A=(1+|x|^α)Δ-|x|^β$ with domain $D(A_p)=\{u\in W^{2,p}(\mathbb{R}^N): Au\in L^p(\mathbb{R}^N)\}$ generates a strongly continuous analytic semigroup provided that $N>2,\,α>2$ and $β>α-2$. Moreover this semigroup is consistent, irreducible, immediately compact and ultracontractive.
We prove that the realization $A_p$ in $L^p(\mathbb{R}^N),\,1<p<\infty$, of the Schrödinger type operator $A=(1+|x|^α)Δ-|x|^β$ with domain $D(A_p)=\{u\in W^{2,p}(\mathbb{R}^N): Au\in L^p(\mathbb{R}^N)\}$ generates a strongly continuous analytic semigroup provided that $N>2,\,α>2$ and $β>α-2$. Moreover this semigroup is consistent, irreducible, immediately compact and ultracontractive.
△ Less
Submitted 2 June, 2014;
originally announced June 2014.
-
An embedding result
Authors:
Anna Canale
Abstract:
In unbounded subset $Ω$ in $R^n$ we study the operator $u\rightarrow gu$ as an operator defined in the Sobolev space $W^{r,p}(Ω)$ and which takes values in $L^p(Ω)$. The functions $g$ belong to wider spaces of $L^p$ connected with the Morrey type spaces. The main result is an embedding theorem from which we can deduce a Fefferman type inequality.
In unbounded subset $Ω$ in $R^n$ we study the operator $u\rightarrow gu$ as an operator defined in the Sobolev space $W^{r,p}(Ω)$ and which takes values in $L^p(Ω)$. The functions $g$ belong to wider spaces of $L^p$ connected with the Morrey type spaces. The main result is an embedding theorem from which we can deduce a Fefferman type inequality.
△ Less
Submitted 29 December, 2013;
originally announced December 2013.
-
Bayesian nonparametric location-scale-shape mixtures
Authors:
Antonio Canale,
Bruno Scarpa
Abstract:
Discrete mixture models are one of the most successful approaches for density estimation. Under a Bayesian nonparametric framework, Dirichlet process location-scale mixture of Gaussian kernels is the golden standard, both having nice theoretical properties and computational tractability. In this paper we explore the use of the skew-normal kernel, which can naturally accommodate several degrees of…
▽ More
Discrete mixture models are one of the most successful approaches for density estimation. Under a Bayesian nonparametric framework, Dirichlet process location-scale mixture of Gaussian kernels is the golden standard, both having nice theoretical properties and computational tractability. In this paper we explore the use of the skew-normal kernel, which can naturally accommodate several degrees of skewness by the use of a third parameter. The choice of this kernel function allows us to formulate nonparametric location-scale-shape mixture prior with large support and good performance in different applications. Asymptotically, we show that this modelling framework is consistent in frequentist sense. Efficient Gibbs sampling algorithms are also discussed and the performance of the methods are tested through simulations and applications to galaxy velocity and fertility data. Extensions to accommodate discrete data are also discussed.
△ Less
Submitted 29 November, 2013;
originally announced November 2013.
-
Nonparametric Bayes modeling of count processes
Authors:
Antonio Canale,
David B. Dunson
Abstract:
Data on count processes arise in a variety of applications, including longitudinal, spatial and imaging studies measuring count responses. The literature on statistical models for dependent count data is dominated by models built from hierarchical Poisson components. The Poisson assumption is not warranted in many applications, and hierarchical Poisson models make restrictive assumptions about ove…
▽ More
Data on count processes arise in a variety of applications, including longitudinal, spatial and imaging studies measuring count responses. The literature on statistical models for dependent count data is dominated by models built from hierarchical Poisson components. The Poisson assumption is not warranted in many applications, and hierarchical Poisson models make restrictive assumptions about over-dispersion in marginal distributions. This article proposes a class of nonparametric Bayes count process models, which are constructed through rounding real-valued underlying processes. The proposed class of models accommodates applications in which one observes separate count-valued functional data for each subject under study. Theoretical results on large support and posterior consistency are established, and computational algorithms are developed using Markov chain Monte Carlo. The methods are evaluated via simulation studies and illustrated through application to longitudinal tumor counts and asthma inhaler usage.
△ Less
Submitted 10 July, 2013;
originally announced July 2013.
-
Posterior asymptotics of nonparametric location-scale mixtures for multivariate density estimation
Authors:
Antonio Canale,
Pierpaolo De Blasi
Abstract:
Density estimation represents one of the most successful applications of Bayesian nonparametrics. In particular, Dirichlet process mixtures of normals are the gold standard for density estimation and their asymptotic properties have been studied extensively, especially in the univariate case. However a gap between practitioners and the current theoretical literature is present. So far, posterior a…
▽ More
Density estimation represents one of the most successful applications of Bayesian nonparametrics. In particular, Dirichlet process mixtures of normals are the gold standard for density estimation and their asymptotic properties have been studied extensively, especially in the univariate case. However a gap between practitioners and the current theoretical literature is present. So far, posterior asymptotic results in the multivariate case are available only for location mixtures of Gaussian kernels with independent prior on the common covariance matrix, while in practice as well as from a conceptual point of view a location-scale mixture is often preferable. In this paper we address posterior consistency for such general mixture models by adapting a convergence rate result which combines the usual low-entropy, high-mass sieve approach with a suitable summability condition. Specifically, we establish consistency for Dirichlet process mixtures of Gaussian kernels with various prior specifications on the covariance matrix. Posterior convergence rates are also discussed.
△ Less
Submitted 1 July, 2015; v1 submitted 11 June, 2013;
originally announced June 2013.
-
Informative Bayesian inference for the skew-normal distribution
Authors:
Antonio Canale,
Bruno Scarpa
Abstract:
Motivated by the analysis of the distribution of university grades, which is usually asymmetric, we discuss two informative priors for the shape parameter of the skew-normal distribution, showing that they lead to closed-form full-conditional posterior distributions, particularly useful in MCMC computation. Gibbs sampling algorithms are discussed for the joint vector of parameters, given independe…
▽ More
Motivated by the analysis of the distribution of university grades, which is usually asymmetric, we discuss two informative priors for the shape parameter of the skew-normal distribution, showing that they lead to closed-form full-conditional posterior distributions, particularly useful in MCMC computation. Gibbs sampling algorithms are discussed for the joint vector of parameters, given independent prior distributions for the location and scale parameters. Simulation studies are performed to assess the performance of Gibbs samplers and to compare the choice of informative priors against a non-informative one. The method is used to analyze the grades of the basic statistics examination of the first-year undergraduate students at the School of Economics, University of Padua, Italy.
△ Less
Submitted 14 May, 2013;
originally announced May 2013.
-
Bayesian multivariate mixed-scale density estimation
Authors:
Antonio Canale,
David B. Dunson
Abstract:
Although continuous density estimation has received abundant attention in the Bayesian nonparametrics literature, there is limited theory on multivariate mixed scale density estimation. In this note, we consider a general framework to jointly model continuous, count and categorical variables under a nonparametric prior, which is induced through rounding latent variables having an unknown density w…
▽ More
Although continuous density estimation has received abundant attention in the Bayesian nonparametrics literature, there is limited theory on multivariate mixed scale density estimation. In this note, we consider a general framework to jointly model continuous, count and categorical variables under a nonparametric prior, which is induced through rounding latent variables having an unknown density with respect to Lebesgue measure. For the proposed class of priors, we provide sufficient conditions for large support, strong consistency and rates of posterior contraction. These conditions allow one to convert sufficient conditions obtained in the setting of multivariate continuous density estimation to the mixed scale case. To illustrate the procedure a rounded multivariate nonparametric mixture of Gaussians is introduced and applied to a crime and communities dataset.
△ Less
Submitted 23 May, 2014; v1 submitted 6 October, 2011;
originally announced October 2011.