-
Dynamical hypothesis tests and Decision Theory for Gibbs distributions
Authors:
M. Denker,
A. O. Lopes,
S. R. C. Lopes
Abstract:
We consider the problem of testing for two Gibbs probabilities $μ_0$ and $μ_1$ defined for a dynamical system $(Ω,T)$. Due to the fact that in general full orbits are not observable or computable, one needs to restrict to subclasses of tests defined by a finite time series $h(x_0), h(x_1)=h(T(x_0)),..., h(x_n)=h(T^n(x_0))$, $x_0\in Ω$, $n\ge 0$, where $h:Ω\to\mathbb R$ denotes a suitable measurable function. We determine in each class the Neyman-Pearson tests, the minimax tests, and the Bayes solutions, and show the asymptotic decay of their risk functions, as $n\to\infty$. In the case of $Ω$ being a symbolic space, for each $n\in \mathbb{N}$, these optimal tests rely on the information of the measures for cylinder sets of size $n$.
Submitted 15 September, 2022; v1 submitted 1 December, 2021;
originally announced December 2021.
-
A Generalization of the Ornstein-Uhlenbeck Process: Theoretical Results, Simulations and Parameter Estimation
Authors:
J. Stein,
S. R. C. Lopes,
A. V. Medino
Abstract:
In this work, we study the class of stochastic processes that generalizes the Ornstein-Uhlenbeck processes, hereafter called \emph{Generalized Ornstein-Uhlenbeck Type Process} and denoted by GOU type process. We consider them driven by noise processes such as Brownian motion, symmetric $α$-stable Lévy process, a Lévy process, and even a Poisson process. We give necessary and sufficient conditions on the memory kernel function for the time-stationary and the Markov properties for these processes. When the GOU type process is driven by a Lévy noise we prove that it is infinitely divisible, showing its generating triplet. Several examples derived from the GOU type process are illustrated showing some of their basic properties as well as some time series realizations. These examples also present their theoretical and empirical autocorrelation or normalized codifference functions depending on whether the process has a finite or infinite second moment. We also present the maximum likelihood estimation as well as the Bayesian estimation procedures for the so-called \emph{Cosine process}, a particular process in the class of GOU type processes. For the Bayesian estimation method, we consider the power series representation of Fox's H-function to better approximate the density function of an $α$-stable distributed random variable. We consider four goodness-of-fit tests to help decide which \emph{Cosine process} (driven by a Gaussian or an $α$-stable noise) best fits real data sets. Two applications of the GOU type model are presented: one based on the Apple company stock market price data and the other based on the cardiovascular mortality in Los Angeles County data.
Submitted 13 August, 2021;
originally announced August 2021.
-
Pentadiagonal Matrices and an Application to the Centered MA(1) Stationary Gaussian Process
Authors:
Maicon J. Karling,
Artur O. Lopes,
Silvia R. C. Lopes
Abstract:
In this work, we study the properties of a pentadiagonal symmetric matrix with perturbed corners. More specifically, we present explicit expressions for characterizing when this matrix is non-negative and positive definite in two special and important cases. We also give a closed expression for the determinant of such matrices. Previous works present the determinant in a recurrence form but not in an explicit one. As an application of these results, we also study the limiting cumulant generating function associated to the bivariate sequence of random vectors $\left(n^{-1}(\sum_{k=1}^n X_k^2, \sum_{k=2}^n X_k X_{k-1})\right)_{n \in \mathbb{N}}$, when $(X_n)_{n \in \mathbb{N}}$ is the centered stationary moving average process of first order with Gaussian innovations. We exhibit the explicit expression of this limiting cumulant generating function. Finally, we present three examples illustrating the techniques studied here.
Submitted 22 April, 2021;
originally announced April 2021.
-
Explicit Bivariate Rate Functions for Large Deviations in AR(1) and MA(1) Processes with Gaussian Innovations
Authors:
M. J. Karling,
A. O. Lopes,
S. R. C. Lopes
Abstract:
We investigate large deviations properties for centered stationary AR(1) and MA(1) processes with independent Gaussian innovations, by giving the explicit bivariate rate functions for the sequence of random vectors $(\boldsymbol{S}_n)_{n \in \N} = \left(n^{-1}(\sum_{k=1}^n X_k, \sum_{k=1}^n X_k^2)\right)_{n \in \N}$. In the AR(1) case, we also give the explicit rate function for the bivariate random sequence $(\W_n)_{n \geq 2} = \left(n^{-1}(\sum_{k=1}^n X_k^2, \sum_{k=2}^n X_k X_{k+1})\right)_{n \geq 2}$. Via Contraction Principle, we provide explicit rate functions for the sequences $(n^{-1} \sum_{k=1}^n X_k)_{n \in \N}$, $(n^{-1} \sum_{k=1}^n X_k^2)_{n \geq 2}$ and $(n^{-1} \sum_{k=2}^n X_k X_{k+1})_{n \geq 2}$, as well. In the AR(1) case, we present a new proof for an already known result on the explicit deviation function for the Yule-Walker estimator.
Submitted 18 February, 2021;
originally announced February 2021.
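To make the objects named in the abstract above concrete, here is a minimal Python sketch, not taken from the paper. It simulates a centered stationary AR(1) path with Gaussian innovations, evaluates the empirical vector $(n^{-1}\sum X_k, n^{-1}\sum X_k^2)$, and computes a lag-one Yule-Walker estimate. The function names, the unit innovation variance, and the parameter name `theta` are all assumptions made for illustration.

```python
import math
import random


def simulate_ar1(theta, n, seed=0):
    """Centered stationary AR(1): X_k = theta * X_{k-1} + e_k, with e_k ~ N(0, 1).

    Unit innovation variance is an illustrative assumption.
    """
    if not -1.0 < theta < 1.0:
        raise ValueError("stationarity requires |theta| < 1")
    rng = random.Random(seed)
    # Draw X_1 from the stationary law N(0, 1 / (1 - theta^2)).
    x = rng.gauss(0.0, 1.0 / math.sqrt(1.0 - theta * theta))
    path = [x]
    for _ in range(n - 1):
        x = theta * x + rng.gauss(0.0, 1.0)
        path.append(x)
    return path


def empirical_means(path):
    """Return (n^{-1} sum X_k, n^{-1} sum X_k^2) for the given path."""
    n = len(path)
    return sum(path) / n, sum(a * a for a in path) / n


def yule_walker(path):
    """Lag-one Yule-Walker estimate: sum_k X_k X_{k-1} / sum_k X_k^2."""
    num = sum(a * b for a, b in zip(path[1:], path[:-1]))
    den = sum(a * a for a in path)
    return num / den


if __name__ == "__main__":
    sample = simulate_ar1(theta=0.5, n=20000, seed=1)
    print(empirical_means(sample))
    print(yule_walker(sample))
```

For long paths the estimate should concentrate near the true `theta`; the rate functions studied in the paper describe how fast the probability of a deviation from that limit decays, which this sketch does not compute.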
-
Decision Theory and Large Deviations for Dynamical Hypotheses Tests: Neyman-Pearson, Min-Max and Bayesian Tests
Authors:
Hermes H. Ferreira,
Artur O. Lopes,
Silvia R. C. Lopes
Abstract:
We analyze hypotheses tests using classical results on large deviations to compare two models, each one described by a different Hölder Gibbs probability measure. One main difference to the classical hypothesis tests in Decision Theory is that here the two measures are singular with respect to each other. Among other objectives, we are interested in the decay rate of the probability of wrong decisions, when the sample size $n$ goes to infinity. We show a dynamical version of the Neyman-Pearson Lemma displaying the ideal test within a certain class of similar tests. This test becomes exponentially better, compared to other alternative tests, when the sample size goes to infinity. We are able to present the explicit exponential decay rate. We also consider both the Min-Max and a certain type of Bayesian hypotheses tests. We shall consider these tests in the log likelihood framework by using several tools of Thermodynamic Formalism. Versions of Stein's Lemma and Chernoff's information are also presented.
Submitted 27 December, 2021; v1 submitted 20 January, 2021;
originally announced January 2021.
-
Bayes posterior convergence for loss functions via almost additive Thermodynamic Formalism
Authors:
Artur O. Lopes,
Silvia R. C. Lopes,
Paulo Varandas
Abstract:
Statistical inference can be seen as information processing involving input information and output information that updates belief about some unknown parameters. We consider the Bayesian framework for making inferences about dynamical systems from ergodic observations, where the Bayesian procedure is based on the Gibbs posterior inference, a decision process generalization of standard Bayesian inference where the likelihood is replaced by the exponential of a loss function. In the case of direct observation and almost-additive loss functions, we prove an exponential convergence of the a posteriori measures to a limit measure. Our estimates on the Bayes posterior convergence for direct observation are related to and extend those in a recent paper by K. McGoff, S. Mukherjee and A. Nobel. Our approach makes use of non-additive thermodynamic formalism and large deviation properties instead of joinings.
Submitted 13 January, 2022; v1 submitted 10 December, 2020;
originally announced December 2020.
-
Fractionally Integrated Moving Average Stable Processes With Long-Range Dependence
Authors:
G. L. Feltes,
S. R. C. Lopes
Abstract:
Long memory processes driven by Lévy noise with finite second-order moments have been well studied in the literature. They form a very rich class of processes presenting an autocovariance function which decays like a power function. Here, we study a class of Lévy processes whose second-order moments are infinite, the so-called $α$-stable processes. Based on Samorodnitsky and Taqqu (2000), we construct an isometry that allows us to define stochastic integrals concerning the linear fractional stable motion using Riemann-Liouville fractional integrals. An integration by parts formula follows naturally from this construction. We then present a family of stationary $SαS$ processes with the property of long-range dependence, using a generalized measure to investigate its dependence structure. Finally, a law of large numbers for a time sample of the process is shown as an application of the isometry and the integration by parts formula.
Submitted 18 April, 2022; v1 submitted 11 November, 2020;
originally announced November 2020.
-
Amazon Forest Fires Between 2001 and 2006 and Birth Weight in Porto Velho
Authors:
Taiane Schaedler Prass,
Sílvia Regina Costa Lopes,
José G. Dórea,
Rejane C. Marques,
Katiane G. Brandão
Abstract:
Birth weight data (22,012 live-births) from a public hospital in Porto Velho (Amazon) was used in multiple statistical models to assess the effects of forest-fire smoke on human reproductive outcome. Mean birth weights for girls (3,139 g) and boys (3,393 g) were considered statistically different (p-value < 2.2e-16). Among all models analyzed, the means were considered statistically different only when treated as a function of month and year (p-value = 0.0989, girls and 0.0079, boys). The $R^2$ statistics indicate that the regression models considered are able to explain 65% (girls) and 54% (boys) of the variation of the mean birth weight.
Submitted 24 April, 2019; v1 submitted 22 April, 2019;
originally announced April 2019.
-
Seasonal FIEGARCH Processes
Authors:
Sílvia Regina Costa Lopes,
Taiane Schaedler Prass
Abstract:
Here we develop the theory of seasonal FIEGARCH processes, denoted by SFIEGARCH, establishing conditions for the existence, the invertibility, the stationarity and the ergodicity of these processes. We analyze their asymptotic dependence structure by means of the autocovariance and autocorrelation functions. We also present some properties regarding their spectral representation. All properties are illustrated through graphical examples and an application of SFIEGARCH models to describe the volatility of the S&P500 US stock index log-return time series in the period from December 13, 2004 to October 10, 2009 is provided.
Submitted 22 April, 2019;
originally announced April 2019.
-
Clustering and Classification of Genetic Data Through U-Statistics
Authors:
Gabriela Bettella Cybis,
Marcio Valk,
Silvia Regina Costa Lopes
Abstract:
Genetic data are frequently categorical and have complex dependence structures that are not always well understood. For this reason, clustering and classification based on genetic data, while highly relevant, are challenging statistical problems. Here we consider a highly versatile U-statistics based approach built on dissimilarities between pairs of data points for nonparametric clustering. In this work we propose statistical tests to assess group homogeneity taking into account the multiple testing issues, and a clustering algorithm based on dissimilarities within and between groups that highly speeds up the homogeneity test. We also propose a test to verify classification significance of a sample in one of two groups. A Monte Carlo simulation study is presented to evaluate power of the classification test, considering different group sizes and degree of separation. Size and power of the homogeneity test are also analyzed through simulations that compare it to competing methods. Finally, the methodology is applied to three different genetic datasets: global human genetic diversity, breast tumor gene expression and Dengue virus serotypes. These applications showcase this statistical framework's ability to answer diverse biological questions while adapting to the specificities of the different datatypes.
Submitted 10 June, 2016;
originally announced June 2016.
-
Risk Measure Estimation On Fiegarch Processes
Authors:
Taiane S. Prass,
Sílvia R. C. Lopes
Abstract:
We consider the Fractionally Integrated Exponential Generalized Autoregressive Conditional Heteroskedasticity process, denoted by FIEGARCH(p,d,q), introduced by Bollerslev and Mikkelsen (1996). We present a simulation study regarding the estimation of the risk measure $VaR_p$ on FIEGARCH processes. We consider the distribution function of the portfolio log-returns (univariate case) and the multivariate distribution function of the risk-factor changes (multivariate case). We also compare the performance of the risk measures $VaR_p$, $ES_p$ and MaxLoss for a portfolio composed of stocks of four Brazilian companies.
Submitted 22 May, 2013;
originally announced May 2013.
-
A Semiparametric Estimator for Long-Range Dependent Multivariate Processes
Authors:
Guilherme Pumi,
Sílvia R. C. Lopes
Abstract:
In this paper we propose a generalization of a class of Gaussian Semiparametric Estimators (GSE) of the fractional differencing parameter for long-range dependent multivariate time series. We generalize a known GSE-type estimator by introducing some modifications at the objective function level regarding the process' spectral density matrix estimator. We study large sample properties of the estimator without assuming Gaussianity as well as hypothesis testing. The class of models considered here satisfies simple conditions on the spectral density function, restricted to a small neighborhood of the zero frequency. This includes, but is not limited to, the class of VARFIMA models. A simulation study to assess the finite sample properties of the proposed estimator is presented and supports its competitiveness. We also present an empirical application to exchange rate data.
Submitted 22 May, 2013;
originally announced May 2013.
-
MCMC Bayesian Estimation in FIEGARCH Models
Authors:
Taiane S. Prass,
Sílvia R. C. Lopes,
Jorge A. Achcar
Abstract:
Bayesian inference for fractionally integrated exponential generalized autoregressive conditional heteroskedastic (FIEGARCH) models using Markov Chain Monte Carlo (MCMC) methods is described. A simulation study is presented to assess the performance of the procedure, under the presence of long-memory in the volatility. Samples from FIEGARCH processes are obtained upon considering the generalized error distribution (GED) for the innovation process. Different values for the tail-thickness parameter ν are considered, covering both scenarios: innovation processes with heavier (ν<2) and lighter (ν>2) tails than the Gaussian distribution (ν=2). A sensitivity analysis is performed by considering different prior density functions and by integrating (or not) the knowledge on the true parameter values to select the hyperparameter values.
Submitted 15 April, 2013; v1 submitted 5 April, 2013;
originally announced April 2013.
-
A Generalization of a Gaussian Semiparametric Estimator on Multivariate Long-Range Dependent Processes
Authors:
Guilherme Pumi,
Sílvia R. C. Lopes
Abstract:
In this paper we propose and study a general class of Gaussian Semiparametric Estimators (GSE) of the fractional differencing parameter in the context of long-range dependent multivariate time series. We establish large sample properties of the estimator without assuming Gaussianity. The class of models considered here satisfies simple conditions on the spectral density function, restricted to a small neighborhood of the zero frequency, and includes the important class of VARFIMA processes. We also present a simulation study to assess the finite sample properties of the proposed estimator based on a smoothed version of the GSE, which supports its competitiveness.
Submitted 22 May, 2013; v1 submitted 3 May, 2012;
originally announced May 2012.
-
Parameterization of Copulas and Covariance Decay of Stochastic Processes
Authors:
Guilherme Pumi,
Sílvia R. C. Lopes
Abstract:
In this work we study the problem of constructing stochastic processes with a predetermined covariance decay by parameterizing their marginals and a given family of copulas. We show that the proposed methodology is compatibility-free and present several examples to illustrate the theory, including the important Gaussian and Euclidean families of copulas. We associate the theory to common applied time series models.
Submitted 2 March, 2023; v1 submitted 15 April, 2012;
originally announced April 2012.
-
Theoretical Results on FIEGARCH Processes
Authors:
Sílvia R. C. Lopes,
Taiane S. Prass
Abstract:
Here we present a theoretical study on the main properties of Fractionally Integrated Exponential Generalized Autoregressive Conditional Heteroskedastic (FIEGARCH) processes. We analyze the conditions for the existence, the invertibility, the stationarity and the ergodicity of these processes. We prove that, if $\{X_t\}_{t \in \mathds{Z}}$ is a FIEGARCH$(p,d,q)$ process then, under mild conditions, $\{\ln(X_t^2)\}_{t\in\mathds{Z}}$ is an ARFIMA$(q,d,0)$, that is, an autoregressive fractionally integrated moving average process. The convergence order for the polynomial coefficients that describe the volatility is presented and results related to the spectral representation and to the covariance structure of both processes $\{\ln(X_t^2)\}_{t\in\mathds{Z}}$ and $\{\ln(σ_t^2)\}_{t\in\mathds{Z}}$ are also discussed. Expressions for the kurtosis and the asymmetry measures for any stationary FIEGARCH$(p,d,q)$ process are also derived. The $h$-step ahead forecasts for the processes $\{X_t\}_{t \in \mathds{Z}}$, $\{\ln(σ_t^2)\}_{t\in\mathds{Z}}$ and $\{\ln(X_t^2)\}_{t\in\mathds{Z}}$ are given with their respective mean square forecast errors. The work also presents a Monte Carlo simulation study showing how to generate, estimate and forecast based on six different FIEGARCH models. The forecasting performance of six models belonging to the class of autoregressive conditional heteroskedastic models (namely, ARCH-type models) and radial basis models is compared through an empirical application to the Brazilian stock market exchange index.
Submitted 25 March, 2013; v1 submitted 19 January, 2012;
originally announced January 2012.
-
Copulas Related to Manneville-Pomeau Processes
Authors:
Sílvia R. C. Lopes,
Guilherme Pumi
Abstract:
In this work we derive the copulas related to Manneville-Pomeau processes. We examine both bidimensional and multidimensional cases and derive some properties for the related copulas. Computational issues, approximations and random variate generation problems are addressed and simple numerical experiments to test the approximations developed are also performed. In particular, we propose an approximation to the copulas derived which we show to converge uniformly to the true copula. To illustrate the usefulness of the theory, we derive a fast procedure to estimate the underlying parameter in Manneville-Pomeau processes.
Submitted 18 January, 2012; v1 submitted 12 September, 2011;
originally announced September 2011.
-
Parameter Estimation in Manneville-Pomeau Processes
Authors:
B. P. Olbermann,
Silvia R. C. Lopes,
Artur O. Lopes
Abstract:
In this work we study a class of stochastic processes $\{X_t\}_{t\in\N}$, where $X_t = (φ\circ T_s^t)(X_0)$ is obtained from the iterations of the transformation $T_s$, invariant for an ergodic probability $μ_s$ on $[0,1]$, and a piecewise continuous function $φ:[0,1] \to \R$. We consider here $T_s:[0,1]\to [0,1]$ the Manneville-Pomeau transformation. The autocorrelation function of the resulting process decays hyperbolically (or polynomially) and we obtain efficient methods to estimate the parameter $s$ from a finite time series. As a consequence we also estimate the rate of convergence of the autocorrelation decay of these processes. We compare different estimation methods based on the periodogram function, on the smoothed periodogram function, on the variance of the partial sum and on the wavelet theory.
Submitted 11 July, 2007;
originally announced July 2007.
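As an illustration of the process $X_t = (φ\circ T_s^t)(X_0)$ described in the abstract above, the following minimal Python sketch generates a finite time series. The abstract does not state the map, so the commonly used form $T_s(x) = x + x^{1+s} \pmod 1$ is an assumption here, as are the function names and the default choice of $φ$ as the identity.

```python
def manneville_pomeau(x, s):
    """One step of the map T_s(x) = x + x^(1+s) mod 1 (assumed form)."""
    return (x + x ** (1.0 + s)) % 1.0


def mp_series(x0, s, n, phi=lambda x: x):
    """Return [phi(x0), phi(T_s(x0)), ..., phi(T_s^n(x0))].

    phi defaults to the identity; any real-valued function on [0, 1]
    can be passed instead.
    """
    if not 0.0 <= x0 < 1.0:
        raise ValueError("x0 must lie in [0, 1)")
    x = x0
    series = [phi(x)]
    for _ in range(n):
        x = manneville_pomeau(x, s)
        series.append(phi(x))
    return series


if __name__ == "__main__":
    # The orbit lingers near the indifferent fixed point at 0, which is
    # what produces the slow decay of correlations.
    print(mp_series(x0=0.3, s=0.7, n=10))
```

Floating-point orbits are only approximations of the true dynamics, and the starting point `x0 = 0` is fixed by the map, so a sketch like this is suited to visual exploration rather than to the estimation procedures compared in the paper.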