-
The trivariate wrapped Cauchy copula -- a multi-purpose model for angular data
Authors:
Shogo Kato,
Christophe Ley,
Sophia Loizidou
Abstract:
In this paper, we will present a new flexible distribution for three-dimensional angular data, or data on the three-dimensional torus. Our trivariate wrapped Cauchy copula has the following benefits: (i) simple form of density, (ii) adjustable degree of dependence between every pair of variables, (iii) interpretable and well-estimable parameters, (iv) well-known conditional distributions, (v) a simple data generating mechanism, (vi) unimodality. Moreover, our construction allows for linear marginals, implying that our copula can also model cylindrical data. Parameter estimation via maximum likelihood is explained, a comparison with the competitors in the existing literature is given, and two real datasets are considered, one concerning protein dihedral angles and another about data obtained by a buoy in the Adriatic Sea.
Submitted 19 January, 2024;
originally announced January 2024.
-
Traffic Count Data Analysis Using Mixtures of Kato--Jones Distributions
Authors:
Kota Nagasaki,
Shogo Kato,
Wataru Nakanishi,
M. C. Jones
Abstract:
We discuss the modelling of traffic count data that show the variation of traffic volume within a day. For the modelling, we apply mixtures of Kato-Jones distributions in which each component is unimodal and affords a wide range of skewness and kurtosis. We consider two methods for parameter estimation, namely, a modified method of moments and the maximum likelihood method. These methods were seen to be useful for fitting the proposed mixtures to our data. As a result, the variation in traffic volume was classified into the morning and evening traffic whose distributions have different shapes, particularly different degrees of skewness and kurtosis.
Submitted 9 July, 2024; v1 submitted 2 June, 2022;
originally announced June 2022.
-
Copula-based measures of asymmetry between the lower and upper tail probabilities
Authors:
Shogo Kato,
Toshinao Yoshiba,
Shinto Eguchi
Abstract:
We propose a copula-based measure of asymmetry between the lower and upper tail probabilities of bivariate distributions. The proposed measure has a simple form and possesses some desirable properties as a measure of asymmetry. The limit of the proposed measure as the index goes to the boundary of its domain can be expressed in a simple form under certain conditions on copulas. A sample analogue of the proposed measure for a sample from a copula is presented and its weak convergence to a Gaussian process is shown. Another sample analogue of the presented measure, which is based on a sample from a distribution on $\mathbb{R}^2$, is given. Simple methods for interval estimation and nonparametric testing based on the two sample analogues are presented. As an example, the presented measure is applied to daily returns of S&P500 and Nikkei225.
Submitted 3 August, 2020;
originally announced August 2020.
-
Flexible random-effects distribution models for meta-analysis
Authors:
Hisashi Noma,
Kengo Nagashima,
Shogo Kato,
Satoshi Teramukai,
Toshi A. Furukawa
Abstract:
In meta-analysis, random-effects models are standard tools to address between-study heterogeneity in evidence synthesis analyses. For the random-effects distribution models, the normal distribution model has been adopted in most systematic reviews due to its computational and conceptual simplicity. However, the restrictive model assumption might have serious influences on the overall conclusions in practice. In this article, we first provide two examples of real-world evidence that clearly show that the normal distribution assumption is unsuitable. To address the model restriction problem, we propose alternative flexible random-effects models that can flexibly regulate skewness, kurtosis and tailweight: skew normal distribution, skew t-distribution, asymmetric Subbotin distribution, Jones-Faddy distribution, and sinh-arcsinh distribution. We also developed an R package, flexmeta, that can easily perform these methods. Using the flexible random-effects distribution models, the results of the two meta-analyses were markedly altered, potentially influencing the overall conclusions of these systematic reviews. The flexible methods and computational tools can provide more precise evidence, and these methods would be recommended at least as sensitivity analysis tools to assess the influence of the normal distribution assumption of the random-effects model.
Submitted 1 August, 2020; v1 submitted 10 March, 2020;
originally announced March 2020.
-
Inequality Constrained Multilevel Models
Authors:
Bernet S. Kato,
Carel F. W. Peeters
Abstract:
Multilevel or hierarchical data structures can occur in many areas of research, including economics, psychology, sociology, agriculture, medicine, and public health. Over the last 25 years, there has been increasing interest in developing suitable techniques for the statistical analysis of multilevel data, and this has resulted in a broad class of models known under the generic name of multilevel models. Generally, multilevel models are useful for exploring how relationships vary across higher-level units taking into account the within and between cluster variations. Research scientists often have substantive theories in mind when evaluating data with statistical models. Substantive theories often involve inequality constraints among the parameters to translate a theory into a model. This chapter shows how the inequality constrained multilevel linear model can be given a Bayesian formulation, how the model parameters can be estimated using a so-called augmented Gibbs sampler, and how posterior probabilities can be computed to assist the researcher in model selection.
Submitted 4 January, 2018;
originally announced January 2018.
-
Spatial Clustering of Curves with Functional Covariates: A Bayesian Partitioning Model with Application to Spectra Radiance in Climate Study
Authors:
Zhen Zhang,
Chae Young Lim,
Tapabrata Maiti,
Seiji Kato
Abstract:
In climate change studies, the infrared spectral signatures of climate change have recently been conceptually adopted, and widely applied to identifying and attributing atmospheric composition change. We propose a Bayesian hierarchical model for spatial clustering of the high-dimensional functional data based on the effects of functional covariates and local features. We couple the functional mixed-effects model with a generalized spatial partitioning method for: (1) producing spatially contiguous clusters for the high-dimensional spatio-functional data; (2) improving the computational efficiency via parallel computing over subregions or multi-level partitions; and (3) capturing the near-boundary ambiguity and uncertainty for data-driven partitions. We propose a generalized partitioning method which places fewer constraints on the shape of spatial clusters. Dimension reduction in the parameter space is also achieved via Bayesian wavelets to alleviate the increasing model complexity introduced by clusters. The model captures the regional effects of the atmospheric and cloud properties on the spectral radiance measurements well. The results highlight the importance of exploiting spatially contiguous partitions for identifying regional effects and small-scale variability.
Submitted 19 March, 2016;
originally announced April 2016.
-
A flexible family of distributions on the cylinder
Authors:
Shonosuke Sugasawa,
Kunio Shimizu,
Shogo Kato
Abstract:
We propose a flexible family of distributions, generalized $t$-distributions, on the cylinder which is obtained as a conditional distribution of a trivariate $t$ distribution. The new distribution can be unimodal or bimodal, and symmetric or asymmetric, depending on the parameter values, and it fits cylindrical data flexibly. The circular marginal of this distribution is distributed as a generalized $t$-distribution on the circle. Some other properties are also investigated. The proposed distribution is applied to real cylindrical data.
Submitted 17 July, 2015; v1 submitted 26 January, 2015;
originally announced January 2015.
-
Robust estimation of location and concentration parameters for the von Mises-Fisher distribution
Authors:
Shogo Kato,
Shinto Eguchi
Abstract:
Robust estimation of location and concentration parameters for the von Mises-Fisher distribution is discussed. A key reparametrisation is achieved by expressing the two parameters as one vector in Euclidean space. With this representation, we first show that the maximum likelihood estimator for the von Mises-Fisher distribution is not robust in some situations. Then we propose two families of robust estimators which can be derived as minimisers of two density power divergences. The presented families enable us to estimate both location and concentration parameters simultaneously. Some properties of the estimators are explored. Simple iterative algorithms are suggested to find the estimates numerically. A comparison with the existing robust estimators is given, as well as a discussion of the differences and similarities between the two proposed estimators. A simulation study is made to evaluate the finite sample performance of the estimators. We consider a sea star dataset and discuss the selection of the tuning parameters and outlier detection.
Submitted 31 January, 2012;
originally announced January 2012.