-
Composite likelihood inference for space-time point processes
Authors:
Abdollah Jalilian,
Francisco Cuevas-Pacheco,
Ganggang Xu,
Rasmus Waagepetersen
Abstract:
The dynamics of a rain forest is extremely complex involving births, deaths and growth of trees with complex interactions between trees, animals, climate, and environment. We consider the patterns of recruits (new trees) and dead trees between rain forest censuses. For a current census we specify regression models for the conditional intensity of recruits and the conditional probabilities of death…
▽ More
The dynamics of a rain forest is extremely complex involving births, deaths and growth of trees with complex interactions between trees, animals, climate, and environment. We consider the patterns of recruits (new trees) and dead trees between rain forest censuses. For a current census we specify regression models for the conditional intensity of recruits and the conditional probabilities of death given the current trees and spatial covariates. We estimate regression parameters using conditional composite likelihood functions that only involve the conditional first order properties of the data. When constructing assumption lean estimators of covariance matrices of parameter estimates we only need mild assumptions of decaying conditional correlations in space while assumptions regarding correlations over time are avoided by exploiting conditional centering of composite likelihood score functions. Time series of point patterns from rain forest censuses are quite short while each point pattern covers a fairly big spatial region. To obtain asymptotic results we therefore use a central limit theorem for the fixed timespan - increasing spatial domain asymptotic setting. This also allows us to handle the challenge of using stochastic covariates constructed from past point patterns. Conveniently, it suffices to impose weak dependence assumptions on the innovations of the space-time process. We investigate the proposed methodology by simulation studies and applications to rain forest data.
△ Less
Submitted 19 February, 2024;
originally announced February 2024.
-
A functional central limit theorem for the K-function with an estimated intensity function
Authors:
Anne Marie Svane,
Christophe Biscio,
Rasmus Waagepetersen
Abstract:
The $K$-function is arguably the most important functional summary statistic for spatial point processes. It is used extensively for goodness-of-fit testing and in connection with minimum contrast estimation for parametric spatial point process models. It is thus pertinent to understand the asymptotic properties of estimates of the $K$-function. In this paper we derive the functional asymptotic di…
▽ More
The $K$-function is arguably the most important functional summary statistic for spatial point processes. It is used extensively for goodness-of-fit testing and in connection with minimum contrast estimation for parametric spatial point process models. It is thus pertinent to understand the asymptotic properties of estimates of the $K$-function. In this paper we derive the functional asymptotic distribution for the $K$-function estimator. Contrary to previous papers on functional convergence we consider the case of an inhomogeneous intensity function. We moreover handle the fact that practical $K$-function estimators rely on plugging in an estimate of the intensity function. This removes two serious limitations of the existing literature.
△ Less
Submitted 22 September, 2023;
originally announced September 2023.
-
A central limit theorem for a sequence of conditionally centered and $α$-mixing random fields
Authors:
Abdollah Jalilian,
Ganggang Xu,
Arnaud Poinas,
Rasmus Waagepetersen
Abstract:
A central limit theorem is established for a sum of random variables belonging to a sequence of random fields. The fields are assumed to have zero mean conditional on the past history and to satisfy certain conditional $α$-mixing conditions in space or time. The limiting normal distribution is obtained for increasing spatial domain or increasing length of the sequence. The applicability of the the…
▽ More
A central limit theorem is established for a sum of random variables belonging to a sequence of random fields. The fields are assumed to have zero mean conditional on the past history and to satisfy certain conditional $α$-mixing conditions in space or time. The limiting normal distribution is obtained for increasing spatial domain or increasing length of the sequence. The applicability of the theorem is demonstrated by examples regarding estimating functions for a space-time point process and a space-time Markov process.
△ Less
Submitted 11 April, 2024; v1 submitted 21 January, 2023;
originally announced January 2023.
-
A $K$-function for inhomogeneous random measures with geometric features
Authors:
Anne Marie Svane,
Hans Jacob Teglbjærg Stephensen,
Rasmus Waagepetersen
Abstract:
This paper introduces a $K$-function for assessing second-order properties of inhomogeneous random measures generated by marked point processes. The marks can be geometric objects like fibers or sets of positive volume, and the presented $K$-function takes into account geometric features of the marks, such as tangent directions of fibers. The $K$-function requires an estimate of the inhomogeneous…
▽ More
This paper introduces a $K$-function for assessing second-order properties of inhomogeneous random measures generated by marked point processes. The marks can be geometric objects like fibers or sets of positive volume, and the presented $K$-function takes into account geometric features of the marks, such as tangent directions of fibers. The $K$-function requires an estimate of the inhomogeneous density function of the random measure. We introduce parametric estimates for the density function based on parametric models that represent large scale features of the inhomogeneous random measure. The proposed methodology is applied to simulated fiber patterns as well as a three-dimensional data set of steel fibers in concrete.
△ Less
Submitted 10 November, 2021;
originally announced November 2021.
-
Currents and K-functions for Fiber Point Processes
Authors:
Pernille EH. Hansen,
Rasmus Waagepetersen,
Anne Marie Svane,
Jon Sporring,
Hans JT. Stephensen,
Stine Hasselholt,
Stefan Sommer
Abstract:
Analysis of images of sets of fibers such as myelin sheaths or skeletal muscles must account for both the spatial distribution of fibers and differences in fiber shape. This necessitates a combination of point process and shape analysis methodology. In this paper, we develop a K-function for shape-valued point processes by embedding shapes as currents, thus equip** the point process domain with…
▽ More
Analysis of images of sets of fibers such as myelin sheaths or skeletal muscles must account for both the spatial distribution of fibers and differences in fiber shape. This necessitates a combination of point process and shape analysis methodology. In this paper, we develop a K-function for shape-valued point processes by embedding shapes as currents, thus equip** the point process domain with metric structure inherited from a reproducing kernel Hilbert space. We extend Ripley's K-function which measures deviations from spatial homogeneity of point processes to fiber data. The paper provides a theoretical account of the statistical foundation of the K-function and its extension to fiber data, and we test the developed K-function on simulated as well as real data sets. This includes a fiber data set consisting of myelin sheaths, visualizing the spatial and fiber shape behavior of myelin configurations at different debts.
△ Less
Submitted 10 February, 2021;
originally announced February 2021.
-
Second order semi-parametric inference for multivariate log Gaussian Cox processes
Authors:
Kristian Bjørn Hessellund,
Ganggang Xu,
Yongtao Guan,
Rasmus Waagepetersen
Abstract:
This paper introduces a new approach to inferring the second order properties of a multivariate log Gaussian Cox process (LGCP) with a complex intensity function. We assume a semi-parametric model for the multivariate intensity function containing an unspecified complex factor common to all types of points. Given this model we exploit the availability of several types of points to construct a seco…
▽ More
This paper introduces a new approach to inferring the second order properties of a multivariate log Gaussian Cox process (LGCP) with a complex intensity function. We assume a semi-parametric model for the multivariate intensity function containing an unspecified complex factor common to all types of points. Given this model we exploit the availability of several types of points to construct a second-order conditional composite likelihood to infer the pair correlation and cross pair correlation functions of the LGCP. Crucially this likelihood does not depend on the unspecified part of the intensity function. We also introduce a cross validation method for model selection and an algorithm for regularized inference that can be used to obtain sparse models for cross pair correlation functions. The methodology is applied to simulated data as well as data examples from microscopy and criminology. This shows how the new approach outperforms existing alternatives where the intensity functions are estimated non-parametrically.
△ Less
Submitted 3 January, 2022; v1 submitted 3 December, 2020;
originally announced December 2020.
-
Globally intensity-reweighted estimators for $K$- and pair correlation functions
Authors:
Thomas Shaw,
Jesper Møller,
Rasmus Waagepetersen
Abstract:
We introduce new estimators of the inhomogeneous $K$-function and the pair correlation function of a spatial point process as well as the cross $K$-function and the cross pair correlation function of a bivariate spatial point process under the assumption of second-order intensity-reweighted stationarity. These estimators rely on a 'global' normalization factor which depends on an aggregation of th…
▽ More
We introduce new estimators of the inhomogeneous $K$-function and the pair correlation function of a spatial point process as well as the cross $K$-function and the cross pair correlation function of a bivariate spatial point process under the assumption of second-order intensity-reweighted stationarity. These estimators rely on a 'global' normalization factor which depends on an aggregation of the intensity function, whilst the existing estimators depend 'locally' on the intensity function at the individual observed points. The advantages of our new global estimators over the existing local estimators are demonstrated by theoretical considerations and a simulation study.
△ Less
Submitted 2 October, 2020; v1 submitted 1 April, 2020;
originally announced April 2020.
-
Information criteria for inhomogeneous spatial point processes
Authors:
Achmad Choiruddin,
Jean-François Coeurjolly,
Rasmus Waagepetersen
Abstract:
The theoretical foundation for a number of model selection criteria is established in the context of inhomogeneous point processes and under various asymptotic settings: infill, increasing domain, and combinations of these. For inhomogeneous Poisson processes we consider Akaike information criterion and the Bayesian information criterion, and in particular we identify the point process analogue of…
▽ More
The theoretical foundation for a number of model selection criteria is established in the context of inhomogeneous point processes and under various asymptotic settings: infill, increasing domain, and combinations of these. For inhomogeneous Poisson processes we consider Akaike information criterion and the Bayesian information criterion, and in particular we identify the point process analogue of sample size needed for the Bayesian information criterion. Considering general inhomogeneous point processes we derive new composite likelihood and composite Bayesian information criteria for selecting a regression model for the intensity function. The proposed model selection criteria are evaluated using simulations of Poisson processes and cluster point processes.
△ Less
Submitted 8 March, 2020;
originally announced March 2020.
-
Regularized estimation for highly multivariate log Gaussian Cox processes
Authors:
Achmad Choiruddin,
Francisco Cuevas-Pacheco,
Jean-François Coeurjolly,
Rasmus Waagepetersen
Abstract:
Statistical inference for highly multivariate point pattern data is challenging due to complex models with large numbers of parameters. In this paper, we develop numerically stable and efficient parameter estimation and model selection algorithms for a class of multivariate log Gaussian Cox processes. The methodology is applied to a highly multivariate point pattern data set from tropical rain for…
▽ More
Statistical inference for highly multivariate point pattern data is challenging due to complex models with large numbers of parameters. In this paper, we develop numerically stable and efficient parameter estimation and model selection algorithms for a class of multivariate log Gaussian Cox processes. The methodology is applied to a highly multivariate point pattern data set from tropical rain forest ecology.
△ Less
Submitted 4 May, 2019;
originally announced May 2019.
-
Second-order variational equations for spatial point processes with a view to pair correlation function estimation
Authors:
Jean-François Coeurjolly,
Francisco Cuevas-Pacheco,
Rasmus Waagepetersen
Abstract:
Second-order variational type equations for spatial point processes are established. In case of log linear parametric models for pair correlation functions, it is demonstrated that the variational equations can be applied to construct estimating equations with closed form solutions for the parameter estimates. This result is used to fit orthogonal series expansions of log pair correlation function…
▽ More
Second-order variational type equations for spatial point processes are established. In case of log linear parametric models for pair correlation functions, it is demonstrated that the variational equations can be applied to construct estimating equations with closed form solutions for the parameter estimates. This result is used to fit orthogonal series expansions of log pair correlation functions of general form.
△ Less
Submitted 15 January, 2019;
originally announced January 2019.
-
Generalizations of Ripley's K-function with Application to Space Curves
Authors:
Jon Sporring,
Rasmus Waagepetersen,
Stefan Sommer
Abstract:
The intensity function and Ripley's K-function have been used extensively in the literature to describe the first and second moment structure of spatial point sets. This has many applications including describing the statistical structure of synaptic vesicles. Some attempts have been made to extend Ripley's K-function to curve pieces. Such an extension can be used to describe the statistical struc…
▽ More
The intensity function and Ripley's K-function have been used extensively in the literature to describe the first and second moment structure of spatial point sets. This has many applications including describing the statistical structure of synaptic vesicles. Some attempts have been made to extend Ripley's K-function to curve pieces. Such an extension can be used to describe the statistical structure of muscle fibers and brain fiber tracks. In this paper, we take a computational perspective and construct new and very general variants of Ripley's K-function for curves pieces, surface patches etc. We discuss the method from [Chiu, Stoyan, Kendall, & Mecke 2013] and compare it with our generalizations theoretically, and we give examples demonstrating the difference in their ability to separate sets of curve pieces.
△ Less
Submitted 17 December, 2018;
originally announced December 2018.
-
Adaptive estimating function inference for non-stationary determinantal point processes
Authors:
Frédéric Lavancier,
Arnaud Poinas,
Rasmus Waagepetersen
Abstract:
Estimating function inference is indispensable for many common point process models where the joint intensities are tractable while the likelihood function is not. In this paper we establish asymptotic normality of estimating function estimators in a very general setting of non-stationary point processes. We then adapt this result to the case of non-stationary determinantal point processes which a…
▽ More
Estimating function inference is indispensable for many common point process models where the joint intensities are tractable while the likelihood function is not. In this paper we establish asymptotic normality of estimating function estimators in a very general setting of non-stationary point processes. We then adapt this result to the case of non-stationary determinantal point processes which are an important class of models for repulsive point patterns. In practice often first and second order estimating functions are used. For the latter it is common practice to omit contributions for pairs of points separated by a distance larger than some truncation distance which is usually specified in an ad hoc manner. We suggest instead a data-driven approach where the truncation distance is adapted automatically to the point process being fitted and where the approach integrates seamlessly with our asymptotic framework. The good performance of the adaptive approach is illustrated via simulation studies for non-stationary determinantal point processes and by an application to a real dataset.
△ Less
Submitted 15 November, 2019; v1 submitted 16 June, 2018;
originally announced June 2018.
-
Orthogonal series estimation of the pair correlation function of a spatial point process
Authors:
Abdollah Jalilian,
Yongtao Guan,
Rasmus Waagepetersen
Abstract:
The pair correlation function is a fundamental spatial point process characteristic that, given the intensity function, determines second order moments of the point process. Non-parametric estimation of the pair correlation function is a typical initial step of a statistical analysis of a spatial point pattern. Kernel estimators are popular but especially for clustered point patterns suffer from b…
▽ More
The pair correlation function is a fundamental spatial point process characteristic that, given the intensity function, determines second order moments of the point process. Non-parametric estimation of the pair correlation function is a typical initial step of a statistical analysis of a spatial point pattern. Kernel estimators are popular but especially for clustered point patterns suffer from bias for small spatial lags. In this paper we introduce a new orthogonal series estimator. The new estimator is consistent and asymptotically normal according to our theoretical and simulation results. Our simulations further show that the new estimator can outperform the kernel estimators in particular for Poisson and clustered point processes.
△ Less
Submitted 6 February, 2017;
originally announced February 2017.
-
Some recent developments in statistics for spatial point patterns
Authors:
Jesper Møller,
Rasmus Waagepetersen
Abstract:
This paper reviews developments in statistics for spatial point processes obtained within roughly the last decade. These developments include new classes of spatial point process models such as determinantal point processes, models incorporating both regularity and aggregation, and models where points are randomly distributed around latent geometric structures. Regarding parametric inference the m…
▽ More
This paper reviews developments in statistics for spatial point processes obtained within roughly the last decade. These developments include new classes of spatial point process models such as determinantal point processes, models incorporating both regularity and aggregation, and models where points are randomly distributed around latent geometric structures. Regarding parametric inference the main focus is on various types of estimating functions derived from so-called innovation measures. Optimality of such estimating functions is discussed as well as computational issues. Maximum likelihood inference for determinantal point processes and Bayesian inference are briefly considered too. Concerning non-parametric inference, we consider extensions of functional summary statistics to the case of inhomogeneous point processes as well as new approaches to simulation based inference.
△ Less
Submitted 4 September, 2016;
originally announced September 2016.
-
Towards optimal Takacs--Fiksel estimation
Authors:
Jean-François Coeurjolly,
Yongtao Guan,
Mahdieh Khanmohammadi,
Rasmus Waagepetersen
Abstract:
The Takacs--Fiksel method is a general approach to estimate the parameters of a spatial Gibbs point process. This method embraces standard procedures such as the pseudolikelihood and is defined via weight functions. In this paper we propose a general procedure to find weight functions which reduce the Godambe information and thus outperform pseudolikelihood in certain situations. The new procedure…
▽ More
The Takacs--Fiksel method is a general approach to estimate the parameters of a spatial Gibbs point process. This method embraces standard procedures such as the pseudolikelihood and is defined via weight functions. In this paper we propose a general procedure to find weight functions which reduce the Godambe information and thus outperform pseudolikelihood in certain situations. The new procedure is applied to a standard dataset and to a recent neuroscience replicated point pattern dataset. Finally, the performance of the new procedure is investigated in a simulation study.
△ Less
Submitted 13 July, 2016; v1 submitted 21 December, 2015;
originally announced December 2015.
-
A tutorial on Palm distributions for spatial point processes
Authors:
Jean-François Coeurjolly,
Jesper Møller,
Rasmus Waagepetersen
Abstract:
This tutorial provides an introduction to Palm distributions for spatial point processes. Initially, in the context of finite point processes , we give an explicit definition of Palm distributions in terms of their density functions. Then we review Palm distributions in the general case. Finally we discuss some examples of Palm distributions for specific models and some applications.
This tutorial provides an introduction to Palm distributions for spatial point processes. Initially, in the context of finite point processes , we give an explicit definition of Palm distributions in terms of their density functions. Then we review Palm distributions in the general case. Finally we discuss some examples of Palm distributions for specific models and some applications.
△ Less
Submitted 17 June, 2016; v1 submitted 18 December, 2015;
originally announced December 2015.
-
Palm distributions for log Gaussian Cox processes
Authors:
Jean-François Coeurjolly,
Jesper Møller,
Rasmus Waagepetersen
Abstract:
This paper establishes a remarkable result regarding Palmdistributions for a log Gaussian Cox process: the reduced Palmdistribution for a log Gaussian Cox process is itself a log Gaussian Coxprocess which only differs from the original log Gaussian Cox processin the intensity function. This new result is used to study functionalsummaries for log Gaussian Cox processes.
This paper establishes a remarkable result regarding Palmdistributions for a log Gaussian Cox process: the reduced Palmdistribution for a log Gaussian Cox process is itself a log Gaussian Coxprocess which only differs from the original log Gaussian Cox processin the intensity function. This new result is used to study functionalsummaries for log Gaussian Cox processes.
△ Less
Submitted 8 June, 2016; v1 submitted 15 June, 2015;
originally announced June 2015.
-
Quasi-likelihood for Spatial Point Processes
Authors:
Yongtao Guan,
Abdollah Jalilian,
Rasmus Waagepetersen
Abstract:
Fitting regression models for intensity functions of spatial point processes is of great interest in ecological and epidemiological studies of association between spatially referenced events and geographical or environmental covariates. When Cox or cluster process models are used to accommodate clustering not accounted for by the available covariates, likelihood based inference becomes computation…
▽ More
Fitting regression models for intensity functions of spatial point processes is of great interest in ecological and epidemiological studies of association between spatially referenced events and geographical or environmental covariates. When Cox or cluster process models are used to accommodate clustering not accounted for by the available covariates, likelihood based inference becomes computationally cumbersome due to the complicated nature of the likelihood function and the associated score function. It is therefore of interest to consider alternative more easily computable estimating functions. We derive the optimal estimating function in a class of first-order estimating functions. The optimal estimating function depends on the solution of a certain Fredholm integral equation which in practice is solved numerically. The approximate solution is equivalent to a quasi-likelihood for binary spatial data and we therefore use the term quasi-likelihood for our optimal estimating function approach. We demonstrate in a simulation study and a data example that our quasi-likelihood method for spatial point processes is both statistically and computationally efficient.
△ Less
Submitted 1 March, 2013;
originally announced March 2013.
-
Reproducible probe-level analysis of the Affymetrix Exon 1.0 ST array with R/Bioconductor
Authors:
Maria Rodrigo-Domingo,
Rasmus Waagepetersen,
Julie Støve Bødker,
Steffen Falgreen,
Malene Krag Kjeldsen,
Hans Erik Johnsen,
Karen Dybkær,
Martin Bøgsted
Abstract:
The presence of different transcripts of a gene across samples can be analysed by whole-transcriptome microarrays. Reproducing results from published microarray data represents a challenge due to the vast amounts of data and the large variety of pre-processing and filtering steps employed before the actual analysis is carried out. To guarantee a firm basis for methodological development where resu…
▽ More
The presence of different transcripts of a gene across samples can be analysed by whole-transcriptome microarrays. Reproducing results from published microarray data represents a challenge due to the vast amounts of data and the large variety of pre-processing and filtering steps employed before the actual analysis is carried out. To guarantee a firm basis for methodological development where results with new methods are compared with previous results it is crucial to ensure that all analyses are completely reproducible for other researchers. We here give a detailed workflow on how to perform reproducible analysis of the GeneChip Human Exon 1.0 ST Array at probe and probeset level solely in R/Bioconductor, choosing packages based on their simplicity of use. To exemplify the use of the proposed workflow we analyse differential splicing and differential gene expression in a publicly available dataset using various statistical methods. We believe this study will provide other researchers with an easy way of accessing gene expression data at different annotation levels and with the sufficient details needed for develo** their own tools for reproducible analysis of the GeneChip Human Exon 1.0 ST Array.
△ Less
Submitted 18 February, 2013; v1 submitted 15 February, 2013;
originally announced February 2013.