-
Multidimensional spatiotemporal clustering -- An application to environmental sustainability scores in Europe
Authors:
Caterina Morelli,
Simone Boccaletti,
Paolo Maranzano,
Philipp Otto
Abstract:
The assessment of corporate sustainability performance is extremely relevant in facilitating the transition to a green and low-carbon intensity economy. However, companies located in different areas may be subject to different sustainability and environmental risks and policies. Hence, the main objective of this paper is to investigate the spatial and temporal pattern of the sustainability evaluations of European firms. We leverage a large dataset containing information about companies' sustainability performances, measured by MSCI ESG ratings, and geographical coordinates of firms in Western Europe between 2013 and 2023. By means of a modified version of the Chavent et al. (2018) hierarchical algorithm, we conduct a spatial clustering analysis, combining sustainability and spatial information, and a spatiotemporal clustering analysis, which combines the time dynamics of multiple sustainability features and spatial dissimilarities, to detect groups of firms with homogeneous sustainability performance. We are able to build cross-national and cross-industry clusters with remarkable differences in terms of sustainability scores. Among other results, in the spatiotemporal analysis, we observe a high degree of geographical overlap among clusters, indicating that the temporal dynamics in sustainability assessment are relevant within a multidimensional approach. Our findings help to capture the diversity of ESG ratings across Western Europe and may assist practitioners and policymakers in evaluating companies facing different sustainability-linked risks in different areas.
Submitted 30 May, 2024;
originally announced May 2024.
-
The Canadian VirusSeq Data Portal & Duotang: open resources for SARS-CoV-2 viral sequences and genomic epidemiology
Authors:
Erin E. Gill,
Baofeng Jia,
Carmen Lia Murall,
Raphaël Poujol,
Muhammad Zohaib Anwar,
Nithu Sara John,
Justin Richardsson,
Ashley Hobb,
Abayomi S. Olabode,
Alexandru Lepsa,
Ana T. Duggan,
Andrea D. Tyler,
Arnaud N'Guessan,
Atul Kachru,
Brandon Chan,
Catherine Yoshida,
Christina K. Yung,
David Bujold,
Dusan Andric,
Edmund Su,
Emma J. Griffiths,
Gary Van Domselaar,
Gordon W. Jolly,
Heather K. E. Ward,
Henrich Feher
, et al. (45 additional authors not shown)
Abstract:
The COVID-19 pandemic led to a large global effort to sequence SARS-CoV-2 genomes from patient samples to track viral evolution and inform public health response. Millions of SARS-CoV-2 genome sequences have been deposited in global public repositories. The Canadian COVID-19 Genomics Network (CanCOGeN - VirusSeq), a consortium tasked with coordinating expanded sequencing of SARS-CoV-2 genomes across Canada early in the pandemic, created the Canadian VirusSeq Data Portal, with associated data pipelines and procedures, to support these efforts. The goal of VirusSeq was to allow open access to Canadian SARS-CoV-2 genomic sequences and enhanced, standardized contextual data that were unavailable in other repositories and that meet FAIR standards (Findable, Accessible, Interoperable and Reusable). The Portal data submission pipeline contains data quality checking procedures and appropriate acknowledgement of data generators that encourages collaboration. Here we also highlight Duotang, a web platform that presents genomic epidemiology and modeling analyses on circulating and emerging SARS-CoV-2 variants in Canada. Duotang presents dynamic changes in variant composition of SARS-CoV-2 in Canada and by province, estimates variant growth, and displays complementary interactive visualizations, with a text overview of the current situation. The VirusSeq Data Portal and Duotang resources, alongside additional analyses and resources computed from the Portal (COVID-MVP, CoVizu), are all open-source and freely available. Together, they provide an updated picture of SARS-CoV-2 evolution to spur scientific discussions, inform public discourse, and support communication with and within public health authorities. They also serve as a framework for other jurisdictions interested in open, collaborative sequence data sharing and analyses.
Submitted 7 May, 2024;
originally announced May 2024.
-
Regression Trees for Fast and Adaptive Prediction Intervals
Authors:
Luben M. C. Cabezas,
Mateus P. Otto,
Rafael Izbicki,
Rafael B. Stern
Abstract:
Predictive models make mistakes. Hence, there is a need to quantify the uncertainty associated with their predictions. Conformal inference has emerged as a powerful tool to create statistically valid prediction regions around point predictions, but its naive application to regression problems yields non-adaptive regions. New conformal scores, often relying upon quantile regressors or conditional density estimators, aim to address this limitation. Although they are useful for creating prediction bands, these scores are detached from the original goal of quantifying the uncertainty around an arbitrary predictive model. This paper presents a new, model-agnostic family of methods to calibrate prediction intervals for regression problems with local coverage guarantees. Our approach is based on pursuing the coarsest partition of the feature space that approximates conditional coverage. We create this partition by training regression trees and Random Forests on conformity scores. Our proposal is versatile, as it applies to various conformity scores and prediction settings and demonstrates superior scalability and performance compared to established baselines in simulated and real-world datasets. We provide a Python package clover that implements our methods using the standard scikit-learn interface.
Submitted 13 February, 2024; v1 submitted 11 February, 2024;
originally announced February 2024.
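The idea of calibrating intervals separately on cells of a feature-space partition can be illustrated with a minimal sketch. It is not the clover package's API: the partition is a fixed, hypothetical split of a one-dimensional feature at 0.5 (standing in for the leaves of a regression tree trained on conformity scores), `predict` is a hypothetical pre-fitted model, and the conformity score is the absolute residual.

```python
import math
import random


def conformal_quantile(scores, alpha):
    """Finite-sample-corrected (1 - alpha) quantile of calibration scores."""
    n = len(scores)
    k = math.ceil((n + 1) * (1 - alpha))  # rank of the order statistic to use
    if k > n:
        return float("inf")  # too few calibration points for this alpha
    return sorted(scores)[k - 1]


def cell_index(x, edges):
    """Index of the partition cell containing x (edges sorted ascending)."""
    return sum(x >= e for e in edges)


def local_cutoffs(x_cal, scores, edges, alpha):
    """One conformal cutoff per cell of the partition."""
    cells = [[] for _ in range(len(edges) + 1)]
    for x, s in zip(x_cal, scores):
        cells[cell_index(x, edges)].append(s)
    return [conformal_quantile(c, alpha) for c in cells]


def predict(x):
    # Hypothetical pre-fitted point predictor; any model could stand in here.
    return 2.0 * x


def simulate(rng, n):
    # Heteroscedastic toy data: the noise is ten times larger for x >= 0.5.
    data = []
    for _ in range(n):
        x = rng.random()
        sd = 0.1 if x < 0.5 else 1.0
        data.append((x, 2.0 * x + rng.gauss(0.0, sd)))
    return data


rng = random.Random(0)
alpha = 0.1
edges = [0.5]  # assumed partition; the paper learns it from the scores
calibration = simulate(rng, 2000)
scores = [abs(y - predict(x)) for x, y in calibration]
cutoffs = local_cutoffs([x for x, _ in calibration], scores, edges, alpha)


def interval(x):
    c = cutoffs[cell_index(x, edges)]
    return predict(x) - c, predict(x) + c


test_set = simulate(rng, 2000)
covered = 0
for x, y in test_set:
    lo, hi = interval(x)
    covered += lo <= y <= hi
coverage = covered / len(test_set)
print(cutoffs, coverage)
```

A single global cutoff would give intervals that are too wide in the low-noise cell and too narrow in the high-noise cell; the per-cell cutoffs adapt the width to the local noise level.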
-
A review of regularised estimation methods and cross-validation in spatiotemporal statistics
Authors:
Philipp Otto,
Alessandro Fassò,
Paolo Maranzano
Abstract:
This review article focuses on regularised estimation procedures applicable to geostatistical and spatial econometric models. These methods are particularly relevant in the case of big geospatial data for dimensionality reduction or model selection. To structure the review, we initially consider the most general case of multivariate spatiotemporal processes (i.e., $g > 1$ dimensions of the spatial domain, a one-dimensional temporal domain, and $q \geq 1$ random variables). Then, the idea of regularised/penalised estimation procedures and different choices of shrinkage targets are discussed. Finally, guided by the elements of a mixed-effects model setup, which allows for a variety of spatiotemporal models, we show different regularisation procedures and how they can be used for the analysis of geo-referenced data, e.g. for selection of relevant regressors, dimensionality reduction of the covariance matrices, detection of conditionally independent locations, or the estimation of a full spatial interaction matrix.
Submitted 15 May, 2024; v1 submitted 31 January, 2024;
originally announced February 2024.
-
Statistical monitoring of European cross-border physical electricity flows using novel temporal edge network processes
Authors:
Anna Malinovskaya,
Rebecca Killick,
Kathryn Leeming,
Philipp Otto
Abstract:
Conventional modelling of networks evolving in time focuses on capturing variations in the network structure. However, the network might be static from the origin or experience only deterministic, regulated changes in its structure, providing either a physical infrastructure or a specified connection arrangement for some other processes. Thus, to detect change in its exploitation, we need to focus on the processes happening on the network. In this work, we present the concept of monitoring random Temporal Edge Network (TEN) processes that take place on the edges of a graph having a fixed structure. Our framework is based on the Generalized Network Autoregressive statistical models with time-dependent exogenous variables (GNARX models) and Cumulative Sum (CUSUM) control charts. To demonstrate its effective detection of various types of change, we conduct a simulation study and monitor the real-world data of cross-border physical electricity flows in Europe.
Submitted 26 December, 2023;
originally announced December 2023.
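The CUSUM part of such a monitoring scheme can be sketched as follows. This is a textbook two-sided tabular CUSUM applied to standardized residuals, not the authors' implementation; the GNARX fitting step is omitted, and the reference value `k` and decision limit `h` are conventional illustrative choices.

```python
import random


def cusum_alarm(residuals, k=0.5, h=5.0):
    """Return the first index at which a two-sided CUSUM signals, or None.

    residuals: standardized one-step-ahead residuals (mean 0, variance 1
    while the process is in control).
    k: reference value (allowance); h: decision limit.
    """
    upper = lower = 0.0
    for t, r in enumerate(residuals):
        upper = max(0.0, upper + r - k)  # accumulates upward deviations
        lower = max(0.0, lower - r - k)  # accumulates downward deviations
        if upper > h or lower > h:
            return t
    return None


# Toy stream: 100 in-control residuals followed by a mean shift of +2.
rng = random.Random(1)
stream = [rng.gauss(0.0, 1.0) for _ in range(100)]
stream += [rng.gauss(2.0, 1.0) for _ in range(100)]
alarm = cusum_alarm(stream)
print(alarm)
```

In a network setting, one such chart could be run on the residuals of each edge process, or on a summary statistic across edges; which of the two suits a given application is a design decision outside this sketch.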
-
Dynamic Spatiotemporal ARCH Models: Small and Large Sample Results
Authors:
Philipp Otto,
Osman Doğan,
Süleyman Taşpınar
Abstract:
This paper explores the estimation of a dynamic spatiotemporal autoregressive conditional heteroscedasticity (ARCH) model. The log-volatility term in this model can depend on (i) the spatial lag of the log-squared outcome variable, (ii) the time-lag of the log-squared outcome variable, (iii) the spatiotemporal lag of the log-squared outcome variable, (iv) exogenous variables, and (v) the unobserved heterogeneity across regions and time, i.e., the regional and time fixed effects. We examine the small and large sample properties of two quasi-maximum likelihood estimators and a generalized method of moments estimator for this model. We first summarize the theoretical properties of these estimators and then compare their finite sample properties through Monte Carlo simulations.
Submitted 10 December, 2023;
originally announced December 2023.
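For orientation, the log-volatility equation described in this abstract can be sketched as below. The notation is assumed here for illustration and may differ from the paper's: $w_{ij}$ are spatial weights, $\rho$, $\gamma$ and $\delta$ are the spatial, temporal and spatiotemporal coefficients, $x_{it}$ are exogenous variables, and $\mu_i$ and $\alpha_t$ are the regional and time fixed effects.

$$
Y_{it} = h_{it}^{1/2}\,\varepsilon_{it}, \qquad
\ln h_{it} = \rho \sum_{j} w_{ij} \ln Y_{jt}^{2} + \gamma \ln Y_{i,t-1}^{2} + \delta \sum_{j} w_{ij} \ln Y_{j,t-1}^{2} + x_{it}^{\top}\beta + \mu_i + \alpha_t .
$$

Taking logs of the squared outcome gives $\ln Y_{it}^{2} = \ln h_{it} + \ln \varepsilon_{it}^{2}$, so the transformed series follows a linear dynamic spatial panel structure, which is the kind of form that quasi-maximum likelihood and generalized method of moments estimators can work with.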
-
A Markov-switching spatio-temporal ARCH model
Authors:
Tzung Hsuen Khoo,
Dharini Pathmanathan,
Philipp Otto,
Sophie Dabo-Niang
Abstract:
Stock market indices are volatile by nature, and sudden shocks are known to affect volatility patterns. The autoregressive conditional heteroskedasticity (ARCH) and generalized ARCH (GARCH) models neglect structural breaks triggered by sudden shocks that may lead to an overestimation of persistence, causing an upward bias in the estimates. Different regime-switching models that have abrupt regime-switching governed by a Markov chain were developed to model volatility in financial time series data. Volatility modelling was also extended to spatially interconnected time series, resulting in spatial variants of ARCH models. This inspired us to propose a Markov switching framework of the spatio-temporal log-ARCH model. In this article, we discuss the Markov-switching extension of the model, the estimation procedure and the smooth inferences of the regimes. The Monte-Carlo simulation studies show that the maximum likelihood estimation method for our proposed model has good finite sample properties. The proposed model was applied to 28 stock indices data that were presumably affected by the 2015-2016 Chinese stock market crash. The results showed that our model is a better fit compared to that of the one-regime counterpart. Furthermore, the smoothed inference of the data indicated the approximate periods where structural breaks occurred. This model can capture structural breaks that simultaneously occur in nearby locations.
Submitted 4 October, 2023;
originally announced October 2023.
-
Spatiotemporal modelling of PM$_{2.5}$ concentrations in Lombardy (Italy) -- A comparative study
Authors:
Philipp Otto,
Alessandro Fusta Moro,
Jacopo Rodeschini,
Qendrim Shaboviq,
Rosaria Ignaccolo,
Natalia Golini,
Michela Cameletti,
Paolo Maranzano,
Francesco Finazzi,
Alessandro Fassò
Abstract:
This study presents a comparative analysis of three predictive models with an increasing degree of flexibility: hidden dynamic geostatistical models (HDGM), generalised additive mixed models (GAMM), and the random forest spatiotemporal kriging models (RFSTK). These models are evaluated for their effectiveness in predicting PM$_{2.5}$ concentrations in Lombardy (North Italy) from 2016 to 2020. Despite differing methodologies, all models demonstrate proficient capture of spatiotemporal patterns within air pollution data with similar out-of-sample performance. Furthermore, the study delves into station-specific analyses, revealing variable model performance contingent on localised conditions. Model interpretation, facilitated by parametric coefficient analysis and partial dependence plots, unveils consistent associations between predictor variables and PM$_{2.5}$ concentrations. Despite nuanced variations in modelling spatiotemporal correlations, all models effectively accounted for the underlying dependence. In summary, this study underscores the efficacy of conventional techniques in modelling correlated spatiotemporal data, concurrently highlighting the complementary potential of Machine Learning and classical statistical approaches.
Submitted 13 September, 2023;
originally announced September 2023.
-
Spatial autoregressive fractionally integrated moving average model
Authors:
Philipp Otto,
Philipp Sibbertsen
Abstract:
In this paper, we introduce the concept of fractional integration for spatial autoregressive models. We show that the range of the dependence can be spatially extended or diminished by introducing a further fractional integration parameter to spatial autoregressive moving average models (SARMA). This new model is called the spatial autoregressive fractionally integrated moving average model, or sp-ARFIMA for short. We show the relation to time-series ARFIMA models and also to (higher-order) spatial autoregressive models. Moreover, an estimation procedure based on the maximum-likelihood principle is introduced and analysed in a series of simulation studies. Finally, the use of the model is illustrated by an empirical example of atmospheric fine particles, so-called aerosol optical thickness, which is important in weather, climate and environmental science.
Submitted 13 September, 2023;
originally announced September 2023.
-
Spatial and Spatiotemporal Volatility Models: A Review
Authors:
Philipp Otto,
Osman Doğan,
Süleyman Taşpınar,
Wolfgang Schmid,
Anil K. Bera
Abstract:
Spatial and spatiotemporal volatility models are a class of models designed to capture spatial dependence in the volatility of spatial and spatiotemporal data. Spatial dependence in the volatility may arise due to spatial spillovers among locations; that is, if two locations are in close proximity, they can exhibit similar volatilities. In this paper, we aim to provide a comprehensive review of the recent literature on spatial and spatiotemporal volatility models. We first briefly review time series volatility models and their multivariate extensions to motivate their spatial and spatiotemporal counterparts. We then review various spatial and spatiotemporal volatility specifications proposed in the literature along with their underlying motivations and estimation strategies. Through this analysis, we effectively compare all models and provide practical recommendations for their appropriate usage. We highlight possible extensions and conclude by outlining directions for future research.
Submitted 24 August, 2023;
originally announced August 2023.
-
Network log-ARCH models for forecasting stock market volatility
Authors:
Raffaele Mattera,
Philipp Otto
Abstract:
This paper presents a novel dynamic network autoregressive conditional heteroscedasticity (ARCH) model based on spatiotemporal ARCH models to forecast volatility in the US stock market. To improve the forecasting accuracy, the model integrates temporally lagged volatility information and information from adjacent nodes, which may instantaneously spill across the entire network. The model is also suitable for high-dimensional cases where multivariate ARCH models are typically no longer applicable. We adopt the theoretical foundations from spatiotemporal statistics and transfer the dynamic ARCH model for processes to networks. This new approach is compared with independent univariate log-ARCH models. We could quantify the improvements due to the instantaneous network ARCH effects, which are studied for the first time in this paper. The edges are determined based on various distance and correlation measures between the time series. The performances of the alternative networks' definitions are compared in terms of out-of-sample accuracy. Furthermore, we consider ensemble forecasts based on different network definitions.
Submitted 20 March, 2023;
originally announced March 2023.
-
RFFNet: Large-Scale Interpretable Kernel Methods via Random Fourier Features
Authors:
Mateus P. Otto,
Rafael Izbicki
Abstract:
Kernel methods provide a flexible and theoretically grounded approach to nonlinear and nonparametric learning. While memory and run-time requirements hinder their applicability to large datasets, many low-rank kernel approximations, such as random Fourier features, were recently developed to scale up such kernel methods. However, these scalable approaches are based on approximations of isotropic kernels, which cannot remove the influence of irrelevant features. In this work, we design random Fourier features for a family of automatic relevance determination (ARD) kernels, and introduce RFFNet, a new large-scale kernel method that learns the kernel relevances on the fly via first-order stochastic optimization. We present an effective initialization scheme for the method's non-convex objective function, evaluate whether hard-thresholding RFFNet's learned relevances yields a sensible rule for variable selection, and perform an extensive ablation study of RFFNet's components. Numerical validation on simulated and real-world data shows that our approach has a small memory footprint and run-time, achieves low prediction error, and effectively identifies relevant features, thus leading to more interpretable solutions. We supply users with an efficient, PyTorch-based library that adheres to the scikit-learn standard API, along with code for fully reproducing our results.
Submitted 12 April, 2024; v1 submitted 11 November, 2022;
originally announced November 2022.
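A minimal sketch of random Fourier features for an ARD Gaussian kernel is given below. It is not the RFFNet library: the relevances are fixed by hand rather than learned by stochastic optimization, the function names are hypothetical, and the kernel is assumed to be $k(x, y) = \exp(-\tfrac{1}{2}\lVert \lambda \circ (x - y) \rVert^2)$ with relevance vector $\lambda$.

```python
import math
import random


def ard_rff(relevances, n_features, seed=0):
    """Build a random Fourier feature map for an ARD Gaussian kernel."""
    rng = random.Random(seed)
    d = len(relevances)
    # Frequencies are drawn from the spectral density of the unit Gaussian kernel.
    freqs = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(n_features)]
    phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(n_features)]
    scale = math.sqrt(2.0 / n_features)

    def feature_map(x):
        # Relevances rescale each input coordinate before the random projection.
        scaled = [lam * xi for lam, xi in zip(relevances, x)]
        return [
            scale * math.cos(sum(w * s for w, s in zip(row, scaled)) + b)
            for row, b in zip(freqs, phases)
        ]

    return feature_map


def ard_gaussian_kernel(x, y, relevances):
    """Exact ARD Gaussian kernel, used here only to check the approximation."""
    sq = sum((lam * (xi - yi)) ** 2 for lam, xi, yi in zip(relevances, x, y))
    return math.exp(-0.5 * sq)


relevances = [1.0, 0.0]  # the second input is treated as irrelevant
phi = ard_rff(relevances, n_features=2000)
x, y = [0.3, 5.0], [-0.4, -7.0]
approx = sum(a * b for a, b in zip(phi(x), phi(y)))
exact = ard_gaussian_kernel(x, y, relevances)
print(approx, exact)
```

Because a zero relevance multiplies the corresponding coordinate by zero before projection, that input cannot influence the features, which is the mechanism that lets learned relevances act as a variable-selection signal.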
-
A Dynamic Spatiotemporal Stochastic Volatility Model with an Application to Environmental Risks
Authors:
Philipp Otto,
Osman Doğan,
Süleyman Taşpınar
Abstract:
This article introduces a dynamic spatiotemporal stochastic volatility (SV) model with explicit terms for the spatial, temporal, and spatiotemporal spillover effects. Moreover, the model includes time-invariant site-specific constant log-volatility terms. Thus, this formulation makes it possible to distinguish between spatial and temporal interactions, while each location may have a different volatility level. We study the statistical properties of an outcome variable under this process and show that it introduces spatial dependence in the outcome variable. Further, we present a Bayesian estimation procedure based on the Markov Chain Monte Carlo (MCMC) approach using a suitable data transformation. After providing simulation evidence on the proposed Bayesian estimator's performance, we apply the model in a highly relevant field, namely environmental risk modeling. Even though there are only a few empirical studies on environmental risks, previous literature undoubtedly demonstrated the importance of climate variation studies. For example, for local air quality in Northern Italy in 2021, we show pronounced spatial and temporal spillovers and larger uncertainties/risks during the winter season compared to the summer season.
Submitted 6 November, 2022;
originally announced November 2022.
-
Agrimonia: a dataset on livestock, meteorology and air quality in the Lombardy region, Italy
Authors:
Alessandro Fassò,
Jacopo Rodeschini,
Alessandro Fusta Moro,
Qendrim Shaboviq,
Paolo Maranzano,
Michela Cameletti,
Francesco Finazzi,
Natalia Golini,
Rosaria Ignaccolo,
Philipp Otto
Abstract:
The air in the Lombardy region, Italy, is one of the most polluted in Europe because of limited air circulation and high emission levels. There is a large scientific consensus that the agricultural sector has a significant impact on air quality. To support studies quantifying the role of the agricultural and livestock sectors on the Lombardy air quality, this paper presents a harmonised dataset containing daily values of air quality, weather, emissions, livestock, and land and soil use in the years 2016 - 2021, for the Lombardy region. The pollutant data come from the European Environmental Agency and the Lombardy Regional Environment Protection Agency, weather and emissions data from the European Copernicus programme, livestock data from the Italian zootechnical registry, and land and soil use data from the CORINE Land Cover project. The resulting dataset is designed to be used as is by those using air quality data for research.
Submitted 19 October, 2022;
originally announced October 2022.
-
Statistical process monitoring of artificial neural networks
Authors:
Anna Malinovskaya,
Pavlo Mozharovskyi,
Philipp Otto
Abstract:
The rapid advancement of models based on artificial intelligence demands innovative monitoring techniques which can operate in real time with low computational costs. In machine learning, especially if we consider artificial neural networks (ANNs), the models are often trained in a supervised manner. Consequently, the learned relationship between the input and the output must remain valid during the model's deployment. If this stationarity assumption holds, we can conclude that the ANN provides accurate predictions. Otherwise, the retraining or rebuilding of the model is required. We propose considering the latent feature representation of the data (called "embedding") generated by the ANN to determine the time when the data stream starts being nonstationary. In particular, we monitor embeddings by applying multivariate control charts based on the data depth calculation and normalized ranks. The performance of the introduced method is compared with benchmark approaches for various ANN architectures and different underlying data formats.
Submitted 27 July, 2023; v1 submitted 15 September, 2022;
originally announced September 2022.
-
Adaptive LASSO estimation for functional hidden dynamic geostatistical model
Authors:
Paolo Maranzano,
Philipp Otto,
Alessandro Fassò
Abstract:
We propose a novel model selection algorithm based on a penalized maximum likelihood estimator (PMLE) for functional hidden dynamic geostatistical models (f-HDGM). These models employ a classic mixed-effect regression structure with embedded spatiotemporal dynamics to model georeferenced data observed in a functional domain. Thus, the parameters of interest are functions across this domain. The algorithm simultaneously selects the relevant spline basis functions and regressors that are used to model the fixed-effects relationship between the response variable and the covariates. In this way, it automatically shrinks to zero irrelevant parts of the functional coefficients or the entire effect of irrelevant regressors. The algorithm is based on iterative optimisation and uses an adaptive least absolute shrinkage and selection operator (LASSO) penalty function, wherein the weights are obtained by the unpenalised f-HDGM maximum-likelihood estimators. The computational burden of maximisation is drastically reduced by a local quadratic approximation of the likelihood. Through a Monte Carlo simulation study, we analysed the performance of the algorithm under different scenarios, including strong correlations among the regressors. We showed that the penalised estimator outperformed the unpenalised estimator in all the cases we considered. We applied the algorithm to a real case study in which the recording of the hourly nitrogen dioxide concentrations in the Lombardy region in Italy was modelled as a functional process with several weather and land cover covariates.
Submitted 10 August, 2022;
originally announced August 2022.
-
A Multivariate Spatial and Spatiotemporal ARCH Model
Authors:
Philipp Otto
Abstract:
This paper introduces a multivariate spatiotemporal autoregressive conditional heteroscedasticity (ARCH) model based on a vec-representation. The model includes instantaneous spatial autoregressive spill-over effects in the conditional variance, as they are usually present in spatial econometric applications. Furthermore, spatial and temporal cross-variable effects are explicitly modelled. We transform the model to a multivariate spatiotemporal autoregressive model using a log-squared transformation and derive a consistent quasi-maximum-likelihood estimator (QMLE). For finite samples and different error distributions, the performance of the QMLE is analysed in a series of Monte-Carlo simulations. In addition, we illustrate the practical usage of the new model with a real-world example. We analyse the monthly real-estate price returns for three different property types in Berlin from 2002 to 2014. We find weak (instantaneous) spatial interactions, while the temporal autoregressive structure in the market risks is of higher importance. Interactions between the different property types only occur in the temporally lagged variables. Thus, we see mainly temporal volatility clusters and weak spatial volatility spill-overs.
Submitted 26 April, 2022;
originally announced April 2022.
-
Dynamic Spatiotemporal ARCH Models
Authors:
Philipp Otto,
Osman Doğan,
Süleyman Taşpınar
Abstract:
Geo-referenced data are characterized by an inherent spatial dependence due to the geographical proximity. In this paper, we introduce a dynamic spatiotemporal autoregressive conditional heteroscedasticity (ARCH) process to describe the effects of (i) the log-squared time-lagged outcome variable, i.e., the temporal effect, (ii) the spatial lag of the log-squared outcome variable, i.e., the spatial effect, and (iii) the spatial lag of the log-squared time-lagged outcome variable, i.e., the spatiotemporal effect, on the volatility of an outcome variable. Furthermore, our suggested process allows for the fixed effects over time and space to account for the unobserved heterogeneity. For this dynamic spatiotemporal ARCH model, we derive a generalized method of moments (GMM) estimator based on the linear and quadratic moment conditions of a specific transformation. We show the consistency and asymptotic normality of the GMM estimator, and determine the best set of moment functions. We investigate the finite-sample properties of the proposed GMM estimator in a series of Monte-Carlo simulations with different model specifications and error distributions. Our simulation results show that our suggested GMM estimator has good finite sample properties. In an empirical application, we use monthly log-returns of the average condominium prices of each postcode of Berlin from 1995 to 2015 (190 spatial units, 240 time points) to demonstrate the use of our suggested model. Our estimation results show that the temporal, spatial and spatiotemporal lags of the log-squared returns have statistically significant effects on the volatility of the log-returns.
Submitted 28 February, 2022;
originally announced February 2022.
-
The best of both worlds: Combining population genetic and quantitative genetic models
Authors:
Léonard Dekens,
Sarah P. Otto,
Vincent Calvez
Abstract:
Numerous traits under migration-selection balance are shown to exhibit complex patterns of genetic architecture with large variance in effect sizes. However, the conditions under which such genetic architectures are stable have yet to be investigated, because studying the influence of a large number of small allelic effects on the maintenance of spatial polymorphism is mathematically challenging, due to the high complexity of the systems that arise. In particular, in the simplest case of a haploid population in a two-patch environment, while it is known from population genetics that polymorphism at a single major-effect locus is stable in the symmetric case, there exist no analytical predictions on how this polymorphism holds when a polygenic background also contributes to the trait. Here we propose to answer this question by introducing a new eco-evo methodology that allows us to take into account the combined contributions of a major-effect locus and of a quantitative background resulting from small-effect loci, where inheritance is encoded according to an extension to the infinitesimal model. In a regime of small variance contributed by the quantitative loci, we justify that traits are concentrated around the major alleles, according to a normal distribution, using new convex analysis arguments. This allows a reduction in the complexity of the system using a separation of time scales approach. We predict an undocumented phenomenon of loss of polymorphism at the major-effect locus despite strong selection for local adaptation, because the quantitative background slowly disrupts the rapidly established polymorphism at the major-effect locus, which is confirmed by individual-based simulations. Our study highlights how segregation of a quantitative background can greatly impact the dynamics of major-effect loci by provoking migrational meltdowns.
Submitted 31 October, 2022; v1 submitted 22 November, 2021;
originally announced November 2021.
-
Multidimensional Lambert-Euler inversion and vector-multiplicative coalescent processes
Authors:
Yevgeniy Kovchegov,
Peter T. Otto
Abstract:
In this paper we show the existence of the minimal solution to the multidimensional Lambert-Euler inversion, a multidimensional generalization of the $[-e^{-1},0)$ branch of the Lambert W function $W_0(x)$. Specifically, for a given nonnegative irreducible symmetric matrix $V \in \mathbb{R}^{k \times k}$, we show that for ${\bf u}\in(0,\infty)^k$, if the equation $$y_j \exp\{-{\bf e}_j^T V {\bf y} \} = u_j ~~~~~~\forall j=1,...,k,$$ has at least one solution, it must have a minimal solution ${\bf y}^*$, where the minimum is achieved in all coordinates $y_j$ simultaneously. Moreover, such ${\bf y}^*$ is the unique solution satisfying $ρ\left(V D[y^*_j] \right) \leq 1$, where $D[y^*_j]={\sf diag}(y_j^*)$ is the diagonal matrix with entries $y^*_j$ and $ρ$ denotes the spectral radius.
Our main application is in the vector-multiplicative coalescent process. It is a coalescent process with $k$ types of particles and vector-valued weights that begins with $α_1n+...+α_k n$ particles partitioned into types of respective sizes, and in which two clusters of weights ${\bf x}$ and ${\bf y}$ would merge with rate $({\bf x}^{\sf T} V {\bf y})/n$. We use combinatorics to solve the corresponding modified Smoluchowski equations, obtained as a hydrodynamic limit of vector-multiplicative coalescent as $n \to \infty$, and use multidimensional Lambert-Euler inversion to establish gelation and find a closed form expression for the gelation time.
We also find the asymptotic length of the minimal spanning tree for a broad range of graphs equipped with random edge lengths.
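For intuition, the minimal solution ${\bf y}^*$ described in this abstract can be approximated numerically. The following is a minimal Python sketch, not the authors' method: it rewrites the equation as $y_j = u_j \exp\{{\bf e}_j^T V {\bf y}\}$ and iterates this map from ${\bf y} = 0$. Because $V$ is nonnegative, the iterates increase and stay below every solution, so when a solution exists they converge to the minimal one. The function name `minimal_solution`, the example matrix, the vector `u`, and the tolerance are all illustrative assumptions.

```python
import math

def minimal_solution(V, u, tol=1e-12, max_iter=100_000):
    """Approximate the minimal solution of y_j * exp(-(V y)_j) = u_j.

    Hypothetical helper: iterates y <- u_j * exp((V y)_j) starting at y = 0.
    """
    k = len(u)
    y = [0.0] * k
    for _ in range(max_iter):
        Vy = [sum(V[j][i] * y[i] for i in range(k)) for j in range(k)]
        if max(Vy) > 700.0:
            # The iterates are diverging, which suggests no solution exists.
            raise ValueError("iteration diverged; the system may have no solution")
        y_new = [u[j] * math.exp(Vy[j]) for j in range(k)]
        if max(abs(a - b) for a, b in zip(y_new, y)) < tol:
            return y_new
        y = y_new
    raise RuntimeError("no convergence within max_iter iterations")

# Illustrative 2x2 example: V is nonnegative, symmetric and irreducible.
V = [[0.0, 1.0], [1.0, 0.0]]
u = [0.2, 0.1]
y_star = minimal_solution(V, u)

# Residual of the original equation y_j * exp(-(V y)_j) = u_j.
residual = max(
    abs(y_star[j] * math.exp(-sum(V[j][i] * y_star[i] for i in range(2))) - u[j])
    for j in range(2)
)

# For this V, the matrix V D[y*] is [[0, y2], [y1, 0]], whose spectral
# radius is sqrt(y1 * y2); the abstract states it should be at most 1.
spectral_radius = math.sqrt(y_star[0] * y_star[1])
```

For larger `u` the iteration may converge slowly near the boundary case where the spectral radius equals 1, so a production solver would likely use a Newton-type method instead.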
Submitted 28 July, 2021;
originally announced July 2021.
-
Generalized Spatial and Spatiotemporal ARCH Models
Authors:
Philipp Otto,
Wolfgang Schmid
Abstract:
In time-series analyses, particularly for finance, generalized autoregressive conditional heteroscedasticity (GARCH) models are widely applied statistical tools for modelling volatility clusters (i.e., periods of increased or decreased risk). In contrast, it has not been considered to be of critical importance until now to model spatial dependence in the conditional second moments. Only a few models have been proposed for modelling local clusters of increased risks. In this paper, we introduce a novel spatial GARCH process in a unified spatial and spatiotemporal GARCH framework, which also covers all previously proposed spatial ARCH models, exponential spatial GARCH, and time-series GARCH models. In contrast to previous spatiotemporal and time series models, this spatial GARCH allows for instantaneous spill-overs across all spatial units. For this common modelling framework, estimators are derived based on a non-linear least-squares approach. Eventually, the use of the model is demonstrated by a Monte Carlo simulation study and by an empirical example that focuses on real estate prices from 1995 to 2014 across the ZIP-Code areas of Berlin. A spatial autoregressive model is applied to the data to illustrate how locally varying model uncertainties (e.g., due to latent regressors) can be captured by the spatial GARCH-type models.
Submitted 19 June, 2021;
originally announced June 2021.
-
A Spatiotemporal Functional Model for Bike-Sharing Systems -- An Example based on the City of Helsinki
Authors:
Andreas Piter,
Philipp Otto,
Hamza Alkhatib
Abstract:
Understanding the usage patterns for bike-sharing systems is essential in terms of supporting and enhancing operational planning for such schemes. Studies have demonstrated how factors such as weather conditions influence the number of bikes that should be available at bike-sharing stations at certain times during the day. However, the influences of these factors usually vary over the course of a day, and if there is good temporal resolution, there could also be significant effects only for some hours/minutes (rush hours, the hours when shops are open, and so forth). Thus, in this paper, an analysis of Helsinki's bike-sharing data from 2017 is conducted that considers full temporal and spatial resolutions. Moreover, the data are available at a very high frequency. Hence, the station hire data are analysed in a spatiotemporal functional setting, where the number of bikes at a station is defined as a continuous function of the time of day. For this completely novel approach, we apply a functional spatiotemporal hierarchical model to investigate the effect of environmental factors and the magnitude of the spatial and temporal dependence. Challenges in computational complexity are faced using a bootstrapping approach. The results show the necessity of splitting the bike-sharing stations into two clusters based on the similarity of their spatiotemporal functional observations in order to model the station hire data of Helsinki's bike-sharing system effectively. The estimated functional influences of the proposed factors are different for the two clusters. Moreover, the estimated parameters reveal high random effects in the data that are not explained by the mean of the process. In this random-effects model, the temporal autoregressive parameter dominates the spatial dependence.
Submitted 19 December, 2020;
originally announced December 2020.
-
Statistical learning for change point and anomaly detection in graphs
Authors:
Anna Malinovskaya,
Philipp Otto,
Torben Peters
Abstract:
Complex systems which can be represented in the form of static and dynamic graphs arise in different fields, e.g. communication, engineering and industry. One of the interesting problems in analysing dynamic network structures is to monitor changes in their development. Statistical learning, which encompasses both methods based on artificial intelligence and traditional statistics, can be used to progress in this research area. However, the majority of approaches apply only one or the other framework. In this paper, we discuss the possibility of bringing together both disciplines in order to create enhanced network monitoring procedures focussing on the example of combining statistical process control and deep learning algorithms. Together with the presentation of change point and anomaly detection in network data, we propose to monitor the response times of ambulance services, applying jointly the control chart for quantile function values and a graph convolutional network.
Submitted 10 November, 2020;
originally announced November 2020.
-
Online network monitoring
Authors:
Anna Malinovskaya,
Philipp Otto
Abstract:
The application of network analysis has found great success in a wide variety of disciplines; however, the popularity of these approaches has revealed the difficulty in handling networks whose complexity scales rapidly. One of the main interests in network analysis is the online detection of anomalous behaviour. To overcome the curse of dimensionality, we introduce a network surveillance method bringing together network modelling and statistical process control. Our approach is to apply multivariate control charts based on exponential smoothing and cumulative sums in order to monitor networks determined by temporal exponential random graph models (TERGM). This allows us to account for temporal dependence, while simultaneously reducing the number of parameters to be monitored. The performance of the proposed charts is evaluated by calculating the average run length for both simulated and real data. To prove the appropriateness of the TERGM to describe network data, some measures of goodness of fit are inspected. We demonstrate the effectiveness of the proposed approach by an empirical application, monitoring daily flights in the United States to detect anomalous patterns.
Submitted 14 May, 2021; v1 submitted 19 October, 2020;
originally announced October 2020.
-
Extension to the Beraha-Kahane-Weiss Theorem with Applications
Authors:
Jason Brown,
Peter T. Otto
Abstract:
The beautiful Beraha-Kahane-Weiss theorem has found many applications within graph theory, allowing for the determination of the limits of roots of graph polynomials in settings as vast as chromatic polynomials, network reliability, and generating polynomials related to independence and domination. Here we extend the class of functions to which the BKW theorem can be applied, and provide some applications in combinatorics.
Submitted 4 August, 2020;
originally announced August 2020.
-
Estimation of the spatial weighting matrix for regular lattice data -- An adaptive lasso approach with cross-sectional resampling
Authors:
Miryam S. Merk,
Philipp Otto
Abstract:
Spatial econometric research typically relies on the assumption that the spatial dependence structure is known in advance and is represented by a deterministic spatial weights matrix. Contrary to classical approaches, we investigate the estimation of sparse spatial dependence structures for regular lattice data. In particular, an adaptive least absolute shrinkage and selection operator (lasso) is used to select and estimate the individual connections of the spatial weights matrix. To recover the spatial dependence structure, we propose cross-sectional resampling, assuming that the random process is exchangeable. The estimation procedure is based on a two-step approach to circumvent simultaneity issues that typically arise from endogenous spatial autoregressive dependencies. The two-step adaptive lasso approach with cross-sectional resampling is verified using Monte Carlo simulations. Eventually, we apply the procedure to model nitrogen dioxide ($\mathrm{NO_2}$) concentrations and show that estimating the spatial dependence structure contrary to using prespecified weights matrices improves the prediction accuracy considerably.
Submitted 6 January, 2020;
originally announced January 2020.
-
Spatial and Spatiotemporal GARCH Models -- A Unified Approach
Authors:
Philipp Otto,
Wolfgang Schmid
Abstract:
In time-series analyses, particularly for finance, generalized autoregressive conditional heteroscedasticity (GARCH) models are widely applied statistical tools for modelling volatility clusters (i.e., periods of increased or decreased risk). In contrast, it has not been considered to be of critical importance until now to model spatial dependence in the conditional second moments. Only a few models have been proposed for modelling local clusters of increased risks. In this paper, we introduce novel spatial GARCH and exponential GARCH processes in a unified spatial and spatiotemporal GARCH-type model, which also covers all previously proposed spatial ARCH models as well as time-series GARCH models. For this common modelling framework, estimators are derived based on nonlinear least squares and on the maximum-likelihood approach. In addition to the theoretical contributions of this paper, we suggest a model selection strategy that is verified by a series of Monte Carlo simulation studies. Eventually, the use of the unified model is demonstrated by an empirical example that focuses on real estate prices from 1995 to 2014 across the ZIP-Code areas of Berlin. A spatial autoregressive model is applied to the data to illustrate how locally varying model uncertainties can be captured by the spatial GARCH-type models.
Submitted 19 October, 2020; v1 submitted 22 August, 2019;
originally announced August 2019.
-
spGARCH: An R-Package for Spatial and Spatiotemporal ARCH models
Authors:
Philipp Otto
Abstract:
In this paper, a general overview on spatial and spatiotemporal ARCH models is provided. In particular, we distinguish between three different spatial ARCH-type models. In addition to the original definition of Otto et al. (2016), we introduce an exponential spatial ARCH model in this paper. For this new model, maximum-likelihood estimators for the parameters are proposed. In addition, we consider a new complex-valued definition of the spatial ARCH process. From a practical point of view, the use of the R-package spGARCH is demonstrated. To be precise, we show how the proposed spatial ARCH models can be simulated and summarize the variety of spatial models, which can be estimated by the estimation functions provided in the package. Eventually, we apply all procedures to a real-data example.
Submitted 5 December, 2018;
originally announced December 2018.
-
Estimation of the Spatial Weighting Matrix for Spatiotemporal Data under the Presence of Structural Breaks
Authors:
Philipp Otto,
Rick Steinert
Abstract:
In this paper, we propose a two-step lasso estimation approach to estimate the full spatial weights matrix of spatiotemporal autoregressive models. In addition, we allow for an unknown number of structural breaks in the local means of each spatial location. The proposed approach jointly estimates the spatial dependence, all structural breaks, and the local mean levels. In addition, it is easy to compute the suggested estimators, because of a convex objective function resulting from a slight simplification. Via simulation studies, we show the finite-sample performance of the estimators and provide practical guidance on when the approach can be applied. Eventually, the proposed method is illustrated by an empirical example of regional monthly real-estate prices in Berlin from 1995 to 2014. The spatial units are defined by the respective ZIP codes. In particular, we can estimate local mean levels and quantify the deviation of the observed prices from these levels due to spatial spill-over effects.
Submitted 10 August, 2022; v1 submitted 16 October, 2018;
originally announced October 2018.
-
Cross-Multiplicative Coalescent Processes and Applications
Authors:
Yevgeniy Kovchegov,
Peter T. Otto,
Anatoly Yambartsev
Abstract:
We introduce and analyze a novel type of coalescent processes called cross-multiplicative coalescent that models a system with two types of particles, $A$ and $B$. The bonds are formed only between the pairs of particles of opposite types with the same rate for each bond, producing connected components made of particles of both types. We analyze and solve the Smoluchowski coagulation system of equations obtained as a hydrodynamic limit of the corresponding Marcus-Lushnikov process. We establish that the cross-multiplicative kernel is a gelling kernel, and find the gelation time. As an application, we derive the limiting mean length of a minimal spanning tree on a complete bipartite graph $K_{α[n], β[n]}$ with partitions of sizes $α[n]=αn +o(\sqrt{n})$ and $β[n]=βn +o(\sqrt{n})$ and independent edge weights, distributed uniformly over $[0, 1]$.
Submitted 26 September, 2019; v1 submitted 24 February, 2017;
originally announced February 2017.
-
Generalized Spatial and Spatiotemporal Autoregressive Conditional Heteroscedasticity
Authors:
Philipp Otto,
Wolfgang Schmid,
Robert Garthoff
Abstract:
In this paper, we introduce a new spatial model that incorporates heteroscedastic variance depending on neighboring locations. The proposed process is regarded as the spatial equivalent to the temporal autoregressive conditional heteroscedasticity (ARCH) model. We show additionally how the introduced spatial ARCH model can be used in spatiotemporal settings. In contrast to the temporal ARCH model, in which the distribution is known given the full information set of the prior periods, the distribution is not straightforward in the spatial and spatiotemporal setting. However, it is possible to estimate the parameters of the model using the maximum-likelihood approach. Via Monte Carlo simulations, we demonstrate the performance of the estimator for a specific spatial weighting matrix. Moreover, we combine the known spatial autoregressive model with the spatial ARCH model assuming heteroscedastic errors. Eventually, the proposed autoregressive process is illustrated using an empirical example. Specifically, we model lung cancer mortality in 3108 U.S. counties and compare the introduced model with two benchmark approaches.
Submitted 2 September, 2016;
originally announced September 2016.
-
The aggregate path coupling method for the Potts model on bipartite graph
Authors:
Jose C. Hernandez,
Yevgeniy Kovchegov,
Peter T. Otto
Abstract:
In this paper, we derive the large deviations principle for the Potts model on the complete bipartite graph $K_{n,n}$ as $n$ increases to infinity. Next, for the Potts model on $K_{n,n}$, we provide an extension of the method of aggregate path coupling that was originally developed in Kovchegov et al 2011 for the mean-field Blume-Capel model and in Kovchegov and Otto 2015 for a general mean-field setting that included the Generalized Curie-Weiss-Potts model analyzed in Cuff et al 2012. We use the aggregate path coupling method to identify and prove the interface value $β_s$ separating the rapid and slow mixing regimes for the Glauber dynamics of the Potts model on $K_{n,n}$.
Submitted 14 January, 2017; v1 submitted 9 July, 2016;
originally announced July 2016.
-
Polynomial representation for the expected length of minimal spanning trees
Authors:
Jared Nishikawa,
Peter T. Otto,
Colin Starr
Abstract:
In this paper, we investigate the polynomial integrand of an integral formula that yields the expected length of the minimal spanning tree of a graph whose edge lengths are uniformly distributed over the interval [0, 1]. In particular, we derive a general formula for the coefficients of the polynomial and apply it to express the first few coefficients in terms of the structure of the underlying graph, e.g., the number of vertices, edges, and cycles.
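The quantity studied in this abstract can be checked by simulation for small graphs. The sketch below is only a Monte Carlo estimate using Kruskal's algorithm, not the integral formula from the paper, and it assumes independent edge lengths drawn uniformly from [0, 1]. The helper names `mst_length` and `expected_mst_length`, the trial count, and the seed are illustrative. For a triangle the minimal spanning tree uses the two shortest of three edges, so the exact expectation is 1/4 + 2/4 = 3/4, which gives a simple sanity check.

```python
import random

def mst_length(n, weighted_edges):
    """Total weight of a minimal spanning tree via Kruskal's algorithm.

    n is the number of vertices (labelled 0..n-1) and weighted_edges is a
    list of (weight, a, b) tuples for a connected graph.
    """
    parent = list(range(n))

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    total = 0.0
    for w, a, b in sorted(weighted_edges):
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[ra] = rb  # merge the two components
            total += w
    return total

def expected_mst_length(n, edges, trials=20_000, seed=0):
    """Monte Carlo estimate with i.i.d. Uniform[0, 1] edge lengths (assumed)."""
    rng = random.Random(seed)
    acc = 0.0
    for _ in range(trials):
        acc += mst_length(n, [(rng.random(), a, b) for a, b in edges])
    return acc / trials

# Triangle graph: the estimate should be close to the exact value 3/4.
triangle = [(0, 1), (1, 2), (0, 2)]
estimate = expected_mst_length(3, triangle)
```

Swapping in a different edge list gives an estimate for any other small connected graph.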
Submitted 15 January, 2015;
originally announced January 2015.
-
Path Coupling and Aggregate Path Coupling
Authors:
Yevgeniy Kovchegov,
Peter T. Otto
Abstract:
In this survey paper, we describe and characterize an extension to the classical path coupling method applied to statistical mechanical models, referred to as aggregate path coupling. In conjunction with large deviations estimates, we use this aggregate path coupling method to prove rapid mixing of Glauber dynamics for a large class of statistical mechanical models, including models that exhibit discontinuous phase transitions which have traditionally been more difficult to analyze rigorously. The parameter region for rapid mixing for the generalized Curie-Weiss-Potts model is derived as a new application of the aggregate path coupling method.
Submitted 13 January, 2015;
originally announced January 2015.
-
Rapid Mixing of Glauber Dynamics of Gibbs Ensembles via Aggregate Path Coupling and Large Deviations Methods
Authors:
Yevgeniy Kovchegov,
Peter T. Otto
Abstract:
In this paper, we present a novel extension to the classical path coupling method to statistical mechanical models which we refer to as aggregate path coupling. In conjunction with large deviations estimates, we use this aggregate path coupling method to prove rapid mixing of Glauber dynamics for a large class of statistical mechanical models, including models that exhibit discontinuous phase transitions which have traditionally been more difficult to analyze rigorously. The parameter region for rapid mixing for the generalized Curie-Weiss-Potts model is derived as a new application of the aggregate path coupling method.
Submitted 11 August, 2015; v1 submitted 23 December, 2013;
originally announced December 2013.
-
Mixing Times for the Mean-Field Blume-Capel Model via Aggregate Path Coupling
Authors:
Yevgeniy Kovchegov,
Peter T. Otto,
Mathew Titus
Abstract:
In this paper we investigate the relationship between the mixing times of the Glauber dynamics of a statistical mechanical system and its thermodynamic equilibrium structure. For this we consider the mean-field Blume-Capel model, one of the simplest statistical mechanical models that exhibits the following intricate phase transition structure: within a two-dimensional parameter space there exists a curve at which the model undergoes a second-order, continuous phase transition, a curve where the model undergoes a first-order, discontinuous phase transition, and a tricritical point which separates the two curves. We determine the interface between the regions of slow and rapid mixing. In order to completely determine the region of rapid mixing, we employ a novel extension of the path coupling method, successfully proving rapid mixing even in the absence of contraction between neighboring states.
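To make the dynamics mentioned in this abstract concrete, here is a minimal Python sketch of heat-bath Glauber dynamics. It is not taken from the paper. It assumes the commonly used mean-field Blume-Capel Hamiltonian $H_{n,K}(ω) = \sum_j ω_j^2 - (K/n)(\sum_j ω_j)^2$ with spins in $\{-1, 0, 1\}$ and Gibbs weights proportional to $\exp(-βH)$. The function names and the parameter values `n`, `beta`, `K`, and `steps` are illustrative assumptions only.

```python
import math
import random

SPIN_VALUES = (-1, 0, 1)

def glauber_step(spins, total, beta, K, rng):
    """One heat-bath update: resample a uniformly chosen spin from its
    conditional Gibbs distribution given all the other spins.

    Assumed Hamiltonian: H = sum_j s_j^2 - (K / n) * (sum_j s_j)^2.
    Returns the updated sum of all spins.
    """
    n = len(spins)
    j = rng.randrange(n)
    rest = total - spins[j]  # sum of all spins except site j
    # Terms of H that do not involve site j cancel in the conditional law.
    weights = [
        math.exp(-beta * (s * s - (K / n) * (rest + s) ** 2))
        for s in SPIN_VALUES
    ]
    new_spin = rng.choices(SPIN_VALUES, weights=weights)[0]
    spins[j] = new_spin
    return rest + new_spin

def run_chain(n=50, beta=1.0, K=1.0, steps=5_000, seed=0):
    """Run the chain from a uniformly random start (illustrative defaults)."""
    rng = random.Random(seed)
    spins = [rng.choice(SPIN_VALUES) for _ in range(n)]
    total = sum(spins)
    for _ in range(steps):
        total = glauber_step(spins, total, beta, K, rng)
    return spins, total

spins, total = run_chain()
magnetization = total / len(spins)  # empirical magnetization per spin
```

Tracking how quickly `magnetization` forgets its starting value for different `beta` and `K` gives an informal feel for the slow and rapid mixing regimes, although it proves nothing about mixing times.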
Submitted 16 February, 2011;
originally announced February 2011.
-
Asymptotic behavior of the finite-size magnetization as a function of the speed of approach to criticality
Authors:
Richard S. Ellis,
Jonathan Machta,
Peter Tak-Hun Otto
Abstract:
The main focus of this paper is to determine whether the thermodynamic magnetization is a physically relevant estimator of the finite-size magnetization. This is done by comparing the asymptotic behaviors of these two quantities along parameter sequences converging to either a second-order point or the tricritical point in the mean-field Blume-Capel model. We show that the thermodynamic magnetization and the finite-size magnetization are asymptotic when the parameter $α$ governing the speed at which the sequence approaches criticality is below a certain threshold $α_0$. However, when $α$ exceeds $α_0$, the thermodynamic magnetization converges to 0 much faster than the finite-size magnetization. The asymptotic behavior of the finite-size magnetization is proved via a moderate deviation principle when $0<α<α_0$ and via a weak-convergence limit when $α>α_0$. To the best of our knowledge, our results are the first rigorous confirmation of the statistical mechanical theory of finite-size scaling for a mean-field model.
Submitted 12 November, 2010; v1 submitted 7 August, 2009;
originally announced August 2009.
-
Ginzburg-Landau Polynomials and the Asymptotic Behavior of the Magnetization Near Critical and Tricritical Points
Authors:
Richard S. Ellis,
Jonathan Machta,
Peter Tak-Hun Otto
Abstract:
For the mean-field version of an important lattice-spin model due to Blume and Capel, we prove unexpected connections among the asymptotic behavior of the magnetization, the structure of the phase transitions, and a class of polynomials that we call the Ginzburg-Landau polynomials. The model depends on the parameters n, beta, and K, which represent, respectively, the number of spins, the inverse temperature, and the interaction strength. Our main focus is on the asymptotic behavior of the magnetization m(beta_n,K_n) for appropriate sequences (beta_n,K_n) that converge to a second-order point or to the tricritical point of the model and that lie inside various subsets of the phase-coexistence region. The main result states that as (beta_n,K_n) converges to one of these points (beta,K), m(beta_n,K_n) ~ c |beta - beta_n|^gamma --> 0. In this formula gamma is a positive constant, and c is the unique positive, global minimum point of a certain polynomial g that we call the Ginzburg-Landau polynomial. This polynomial arises as a limit of appropriately scaled free-energy functionals, the global minimum points of which define the phase-transition structure of the model. For each sequence (beta_n,K_n) under study, the structure of the global minimum points of the associated Ginzburg-Landau polynomial mirrors the structure of the global minimum points of the free-energy functional in the region through which (beta_n,K_n) passes and thus reflects the phase-transition structure of the model in that region. The properties of the Ginzburg-Landau polynomials make rigorous the predictions of the Ginzburg-Landau phenomenology of critical phenomena, and the asymptotic formula for m(beta_n,K_n) makes rigorous the heuristic scaling theory of the tricritical point.
Submitted 8 May, 2008; v1 submitted 3 March, 2008;
originally announced March 2008.
-
Multiple critical behavior of probabilistic limit theorems in the neighborhood of a tricritical point
Authors:
Marius Costeniuc,
Richard S. Ellis,
Peter Tak-Hun Otto
Abstract:
We derive probabilistic limit theorems that reveal the intricate structure of the phase transitions in a mean-field version of the Blume-Emery-Griffiths model. These probabilistic limit theorems consist of scaling limits for the total spin and moderate deviation principles (MDPs) for the total spin. The model under study is defined by a probability distribution that depends on the parameters $n$, $β$, and $K$, which represent, respectively, the number of spins, the inverse temperature, and the interaction strength. The intricate structure of the phase transitions is revealed by the existence of 18 scaling limits and 18 MDPs for the total spin. These limit results are obtained as $(β,K)$ converges along appropriate sequences to points belonging to various subsets of the phase diagram, which include a curve of second-order points and a tricritical point. The forms of the limiting densities in the scaling limits and of the rate functions in the MDPs reflect the influence of one or more sets that lie in neighborhoods of the critical points and the tricritical point. Of all the scaling limits, the structure of those near the tricritical point is by far the most complex, exhibiting new types of critical behavior when observed in a limit-theorem phase diagram in the space of the two parameters that parametrize the scaling limits.
Submitted 19 September, 2006;
originally announced September 2006.
-
Analysis of phase transitions in the mean-field Blume-Emery-Griffiths model
Authors:
Richard S. Ellis,
Peter T. Otto,
Hugo Touchette
Abstract:
In this paper we give a complete analysis of the phase transitions in the mean-field Blume-Emery-Griffiths lattice-spin model with respect to the canonical ensemble, showing both a second-order, continuous phase transition and a first-order, discontinuous phase transition for appropriate values of the thermodynamic parameters that define the model. These phase transitions are analyzed both in terms of the empirical measure and the spin per site by studying bifurcation phenomena of the corresponding sets of canonical equilibrium macrostates, which are defined via large deviation principles. Analogous phase transitions with respect to the microcanonical ensemble are also studied via a combination of rigorous analysis and numerical calculations. Finally, probabilistic limit theorems for appropriately scaled values of the total spin are proved with respect to the canonical ensemble. These limit theorems include both central-limit-type theorems, when the thermodynamic parameters are not equal to critical values, and noncentral-limit-type theorems, when these parameters equal critical values.
Submitted 25 August, 2005;
originally announced August 2005.
-
Analysis of phase transitions in the mean-field Blume-Emery-Griffiths model
Authors:
R. S. Ellis,
P. Otto,
H. Touchette
Abstract:
In this paper we give a complete analysis of the phase transitions in the mean-field Blume-Emery-Griffiths lattice-spin model with respect to the canonical ensemble, showing both a second-order, continuous phase transition and a first-order, discontinuous phase transition for appropriate values of the thermodynamic parameters that define the model. These phase transitions are analyzed both in terms of the empirical measure and the spin per site by studying bifurcation phenomena of the corresponding sets of canonical equilibrium macrostates, which are defined via large deviation principles. Analogous phase transitions with respect to the microcanonical ensemble are also studied via a combination of rigorous analysis and numerical calculations. Finally, probabilistic limit theorems for appropriately scaled values of the total spin are proved with respect to the canonical ensemble. These limit theorems include both central-limit-type theorems when the thermodynamic parameters are not equal to critical values and non-central-limit-type theorems when these parameters equal critical values.
Submitted 2 September, 2004;
originally announced September 2004.