-
Asymptotics of Yule's nonsense correlation for Ornstein-Uhlenbeck paths: a Wiener chaos approach
Authors:
Soukaina Douissi,
Frederi G. Viens,
Khalifa Es-Sebaiy
Abstract:
In this paper, we study the distribution of the so-called "Yule's nonsense correlation statistic" on a time interval $[0,T]$ for a time horizon $T>0$ , when $T$ is large, for a pair $(X_{1},X_{2})$ of independent Ornstein-Uhlenbeck processes. This statistic is by definition equal to : \begin{equation*} ρ(T):=\frac{Y_{12}(T)}{\sqrt{Y_{11}(T)}\sqrt{Y_{22}(T)}}, \end{equation*} where the random varia…
▽ More
In this paper, we study the distribution of the so-called "Yule's nonsense correlation statistic" on a time interval $[0,T]$ for a time horizon $T>0$ , when $T$ is large, for a pair $(X_{1},X_{2})$ of independent Ornstein-Uhlenbeck processes. This statistic is by definition equal to : \begin{equation*} ρ(T):=\frac{Y_{12}(T)}{\sqrt{Y_{11}(T)}\sqrt{Y_{22}(T)}}, \end{equation*} where the random variables $Y_{ij}(T)$, $i,j=1,2$ are defined as \begin{equation*} Y_{ij}(T):=\int_{0}^{T}X_{i}(u)X_{j}(u)du-T\bar{X}_{i}\bar{X_{j}}, \bar{X}_{i}:=\frac{1}{T}\int_{0}^{T}X_{i}(u)du. \end{equation*} We assume $X_{1}$ and $X_{2}$ have the same drift parameter $θ>0$. We also study the asymptotic law of a discrete-type version of $ρ(T)$, where $Y_{ij}(T)$ above are replaced by their Riemann-sum discretizations. In this case, conditions are provided for how the discretization (in-fill) step relates to the long horizon $T$. We establish identical normal asymptotics for standardized $ρ(T)$ and its discrete-data version. The asymptotic variance of $ρ(T)T^{1/2}$ is $θ^{-1}$. We also establish speeds of convergence in the Kolmogorov distance, which are of Berry-Esséen-type (constant*$T^{-1/2}$) except for a $\ln T$ factor. Our method is to use the properties of Wiener-chaos variables, since $ρ(T)$ and its discrete version are comprised of ratios involving three such variables in the 2nd Wiener chaos. This methodology accesses the Kolmogorov distance thanks to a relation which stems from the connection between the Malliavin calculus and Stein's method on Wiener space.
△ Less
Submitted 5 August, 2021;
originally announced August 2021.
-
Risk, Agricultural Production, and Weather Index Insurance in Village India
Authors:
Jeffrey D. Michler,
Frederi G. Viens,
Gerald E. Shively
Abstract:
We investigate the sources of variability in agricultural production and their relative importance in the context of weather index insurance for smallholder farmers in India. Using parcel-level panel data, multilevel modeling, and Bayesian methods we measure how large a role seasonal variation in weather plays in explaining yield variance. Seasonal variation in weather accounts for 19-20 percent o…
▽ More
We investigate the sources of variability in agricultural production and their relative importance in the context of weather index insurance for smallholder farmers in India. Using parcel-level panel data, multilevel modeling, and Bayesian methods we measure how large a role seasonal variation in weather plays in explaining yield variance. Seasonal variation in weather accounts for 19-20 percent of total variance in crop yields. Motivated by this result, we derive pricing and payout schedules for actuarially fair index insurance. These calculations shed light on the low uptake rates of index insurance and provide direction for designing more suitable index insurance.
△ Less
Submitted 19 March, 2021;
originally announced March 2021.
-
Yule's "nonsense correlation" for Gaussian random walks
Authors:
Philip A. Ernst,
Dongzhou Huang,
Frederi G. Viens
Abstract:
The purpose of this paper is to provide an exact formula for the second moment of the empirical correlation of two independent Gaussian random walks as well as implicit formulas for higher moments. The proofs are based on a symbolically tractable integro-differential representation formula for the moments of any order in a class of empirical correlations, first established by Ernst et al. (2019) a…
▽ More
The purpose of this paper is to provide an exact formula for the second moment of the empirical correlation of two independent Gaussian random walks as well as implicit formulas for higher moments. The proofs are based on a symbolically tractable integro-differential representation formula for the moments of any order in a class of empirical correlations, first established by Ernst et al. (2019) and investigated previously in Ernst et al. (2017). We also provide rates of convergence of the empirical correlation of two independent Gaussian random walks to the empirical correlation of two independent Wiener processes, by exploiting the explicit nature of the computations used for the moments. At the level of distributions, in Wasserstein distance, the convergence rate is the inverse $n^{-1}$ of the number of data points $n$. This holds because we represent and couple the discrete and continuous correlations on a common probability space, where we establish convergence in $L^1$ at the rate $n^{-1}$.
△ Less
Submitted 27 September, 2021; v1 submitted 10 March, 2021;
originally announced March 2021.
-
Get on the BAND Wagon: A Bayesian Framework for Quantifying Model Uncertainties in Nuclear Dynamics
Authors:
D. R. Phillips,
R. J. Furnstahl,
U. Heinz,
T. Maiti,
W. Nazarewicz,
F. M. Nunes,
M. Plumlee,
M. T. Pratola,
S. Pratt,
F. G. Viens,
S. M. Wild
Abstract:
We describe the Bayesian Analysis of Nuclear Dynamics (BAND) framework, a cyberinfrastructure that we are develo** which will unify the treatment of nuclear models, experimental data, and associated uncertainties. We overview the statistical principles and nuclear-physics contexts underlying the BAND toolset, with an emphasis on Bayesian methodology's ability to leverage insight from multiple mo…
▽ More
We describe the Bayesian Analysis of Nuclear Dynamics (BAND) framework, a cyberinfrastructure that we are develo** which will unify the treatment of nuclear models, experimental data, and associated uncertainties. We overview the statistical principles and nuclear-physics contexts underlying the BAND toolset, with an emphasis on Bayesian methodology's ability to leverage insight from multiple models. In order to facilitate understanding of these tools we provide a simple and accessible example of the BAND framework's application. Four case studies are presented to highlight how elements of the framework will enable progress on complex, far-ranging problems in nuclear physics. By collecting notation and terminology, providing illustrative examples, and giving an overview of the associated techniques, this paper aims to open paths through which the nuclear physics and statistics communities can contribute to and build upon the BAND framework.
△ Less
Submitted 21 May, 2021; v1 submitted 14 December, 2020;
originally announced December 2020.
-
AR(1) processes driven by second-chaos white noise: Berry-Esséen bounds for quadratic variation and parameter estimation
Authors:
Soukaina Douissi,
Khalifa Es-Sebaiy,
Fatimah Alshahrani,
Frederi G. Viens
Abstract:
In this paper, we study the asymptotic behavior of the quadratic variation for the class of AR(1) processes driven by white noise in the second Wiener chaos. Using tools from the analysis on Wiener space, we give an upper bound for the total-variation speed of convergence to the normal law, which we apply to study the estimation of the model's mean-reversion. Simulations are performed to illustrat…
▽ More
In this paper, we study the asymptotic behavior of the quadratic variation for the class of AR(1) processes driven by white noise in the second Wiener chaos. Using tools from the analysis on Wiener space, we give an upper bound for the total-variation speed of convergence to the normal law, which we apply to study the estimation of the model's mean-reversion. Simulations are performed to illustrate the theoretical results.
△ Less
Submitted 15 July, 2019;
originally announced July 2019.
-
Berry-Esséen bounds for parameter estimation of general Gaussian processes
Authors:
Soukaina Douissi,
Khalifa Es-Sebaiy,
Frederi G. Viens
Abstract:
We study rates of convergence in central limit theorems for the partial sum of squares of general Gaussian sequences, using tools from analysis on Wiener space. No assumption of stationarity, asymptotically or otherwise, is made. The main theoretical tool is the so-called Optimal Fourth Moment Theorem \cite{NP2015}, which provides a sharp quantitative estimate of the total variation distance on Wi…
▽ More
We study rates of convergence in central limit theorems for the partial sum of squares of general Gaussian sequences, using tools from analysis on Wiener space. No assumption of stationarity, asymptotically or otherwise, is made. The main theoretical tool is the so-called Optimal Fourth Moment Theorem \cite{NP2015}, which provides a sharp quantitative estimate of the total variation distance on Wiener chaos to the normal law. The only assumptions made on the sequence are the existence of an asymptotic variance, that a least-squares-type estimator for this variance parameter has a bias and a variance which can be controlled, and that the sequence's auto-correlation function, which may exhibit long memory, has a no-worse memory than that of fractional Brownian motion with Hurst parameter }$H<3/4$.{\ \ Our main result is explicit, exhibiting the trade-off between bias, variance, and memory. We apply our result to study drift parameter estimation problems for subfractional Ornstein-Uhlenbeck and bifractional Ornstein-Uhlenbeck processes with fixed-time-step observations. These are processes which fail to be stationary or self-similar, but for which detailed calculations result in explicit formulas for the estimators' asymptotic normality.
△ Less
Submitted 7 June, 2017;
originally announced June 2017.
-
Parameter Estimation of Gaussian Stationary Processes using the Generalized Method of Moments
Authors:
Luis A. Barboza,
Frederi G. Viens
Abstract:
We consider the class of all stationary Gaussian process with explicit parametric spectral density. Under some conditions on the autocovariance function, we defined a GMM estimator that satisfies consistency and asymptotic normality, using the Breuer-Major theorem and previous results on ergodicity. This result is applied to the joint estimation of the three parameters of a stationary Ornstein-Uhl…
▽ More
We consider the class of all stationary Gaussian process with explicit parametric spectral density. Under some conditions on the autocovariance function, we defined a GMM estimator that satisfies consistency and asymptotic normality, using the Breuer-Major theorem and previous results on ergodicity. This result is applied to the joint estimation of the three parameters of a stationary Ornstein-Uhlenbeck (fOU) process driven by a fractional Brownian motion. The asymptotic normality of its GMM estimator applies for any H in (0,1) and under some restrictions on the remaining parameters. A numerical study is performed in the fOU case, to illustrate the estimator's practical performance when the number of datapoints is moderate.
△ Less
Submitted 16 January, 2017; v1 submitted 21 April, 2016;
originally announced April 2016.
-
Anderson polymer in a fractional Brownian environment: asymptotic behavior of the partition function
Authors:
Kamran Kalbasi,
Thomas S. Mountford,
Frederi G. Viens
Abstract:
We consider the Anderson polymer partition function $$ u(t):=\mathbb{E}^X\Bigl[e^{\int_0^t \mathrm{d}B^{X(s)}_s}\Bigr]\,, $$ where $\{B^{x}_t\,;\, t\geq0\}_{x\in\mathbb{Z}^d}$ is a family of independent fractional Brownian motions all with Hurst parameter $H\in(0,1)$, and $\{X(t)\}_{t\in \mathbb{R}^{\geq 0}}$ is a continuous-time simple symmetric random walk on $\mathbb{Z}^d$ with jump rate $κ$ an…
▽ More
We consider the Anderson polymer partition function $$ u(t):=\mathbb{E}^X\Bigl[e^{\int_0^t \mathrm{d}B^{X(s)}_s}\Bigr]\,, $$ where $\{B^{x}_t\,;\, t\geq0\}_{x\in\mathbb{Z}^d}$ is a family of independent fractional Brownian motions all with Hurst parameter $H\in(0,1)$, and $\{X(t)\}_{t\in \mathbb{R}^{\geq 0}}$ is a continuous-time simple symmetric random walk on $\mathbb{Z}^d$ with jump rate $κ$ and started from the origin. $\mathbb{E}^X$ is the expectation with respect to this random walk.
We prove that when $H\leq 1/2$, the function $u(t)$ almost surely grows asymptotically like $e^{l t}$, where $l>0$ is a deterministic number. More precisely, we show that as $t$ approaches $+\infty$, the expression $\{\frac{1}{t}\log u(t)\}_{t\in \mathbb{R}^{>0}}$ converges both almost surely and in the $\mathcal{L}^1$ sense to some deterministic number $l>0$.
For $H>1/2$, we first show that $\lim_{t\rightarrow \infty} \frac{1}{t}\log u(t)$ exists both almost surely and in the $\mathcal{L}^1$ sense, and equals a strictly positive deterministic number (possibly $+\infty$); hence almost surely $u(t)$ grows asymptotically at least like $e^{a t}$ for some deterministic constant $a>0$. On the other hand, we also show that almost surely and in the $\mathcal{L}^1$ sense, $\limsup_{t\rightarrow \infty} \frac{1}{t\sqrt{\log t}}\log u(t)$ is a deterministic finite real number (possibly zero), hence proving that almost surely $u(t)$ grows asymptotically at most like $e^{b t\sqrt{\log t}}$ for some deterministic positive constant $b$.
Finally, for $H>1/2$ when $\mathbb{Z}^d$ is replaced by a circle endowed with a Hölder continuous covariance function, we show that $\limsup_{t\rightarrow \infty} \frac{1}{t}\log u(t)$ is a finite deterministic positive number, hence proving that almost surely $u(t)$ grows asymptotically at most like $e^{c t}$ for some deterministic positive constant $c$.
△ Less
Submitted 24 March, 2017; v1 submitted 17 February, 2016;
originally announced February 2016.
-
Parameter Estimation for a partially observed Ornstein-Uhlenbeck process with long-memory noise
Authors:
Brahim El Onsy,
Khalifa Es-Sebaiy,
Frederi G. Viens
Abstract:
\noindent \textbf{Abstract}: We consider the parameter estimation problem for the Ornstein-Uhlenbeck process $X$ driven by a fractional Ornstein-Uhlenbeck process $V$, i.e. the pair of processes defined by the non-Markovian continuous-time long-memory dynamics $dX_{t}=-θX_{t}dt+dV_{t};\ t\geq 0$, with $dV_{t}=-ρV_{t}dt+dB_{t}^{H};\ t\geq 0$, where $θ>0$ and $ρ>0$ are unknown parameters, and…
▽ More
\noindent \textbf{Abstract}: We consider the parameter estimation problem for the Ornstein-Uhlenbeck process $X$ driven by a fractional Ornstein-Uhlenbeck process $V$, i.e. the pair of processes defined by the non-Markovian continuous-time long-memory dynamics $dX_{t}=-θX_{t}dt+dV_{t};\ t\geq 0$, with $dV_{t}=-ρV_{t}dt+dB_{t}^{H};\ t\geq 0$, where $θ>0$ and $ρ>0$ are unknown parameters, and $B^{H}$ is a fractional Brownian motion of Hurst index $H\in (\frac{1}{2},1)$. We study the strong consistency as well as the asymptotic normality of the joint least squares estimator $(\hatθ_{T},\widehat{ρ}% _{T}) $ of the pair $( θ,ρ) $, based either on continuous or discrete observations of $\{X_{s};\ s\in \lbrack 0,T]\}$ as the horizon $T$ increases to +$\infty $. Both cases qualify formally as partial-hbobservation questions since $V$ is unobserved. In the latter case, several discretization options are considered. Our proofs of asymptotic normality based on discrete data, rely on increasingly strict restrictions on the sampling frequency as one reduces the extent of sources of observation. The strategy for proving the asymptotic properties is to study the case of continuous-time observations using the Malliavin calculus, and then to exploit the fact that each discrete-data estimator can be considered as a perturbation of the continuous one in a mathematically precise way, despite the fact that the implementation of the discrete-time estimators is distant from the continuous estimator. In this sense, we contend that the continuous-time estimator cannot be implemented in practice in any naïve way, and serves only as a mathematical tool in the study of the discrete-time estimators' asymptotics.
△ Less
Submitted 12 October, 2016; v1 submitted 20 January, 2015;
originally announced January 2015.
-
Parameter estimation for SDEs related to stationary Gaussian processes
Authors:
Khalifa Es-Sebaiy,
Frederi G. Viens
Abstract:
In this paper, we study central and non-central limit theorems for partial sum of functionals of general stationary Gaussian fields. We apply our result to study drift parameter estimation problems for some stochastic differential equations related to stationary Gaussian processes.
In this paper, we study central and non-central limit theorems for partial sum of functionals of general stationary Gaussian fields. We apply our result to study drift parameter estimation problems for some stochastic differential equations related to stationary Gaussian processes.
△ Less
Submitted 20 January, 2015;
originally announced January 2015.
-
Reconstructing past temperatures from natural proxies and estimated climate forcings using short- and long-memory models
Authors:
Luis Barboza,
Bo Li,
Martin P. Tingley,
Frederi G. Viens
Abstract:
We produce new reconstructions of Northern Hemisphere annually averaged temperature anomalies back to 1000 AD, and explore the effects of including external climate forcings within the reconstruction and of accounting for short-memory and long-memory features. Our reconstructions are based on two linear models, with the first linking the latent temperature series to three main external forcings (s…
▽ More
We produce new reconstructions of Northern Hemisphere annually averaged temperature anomalies back to 1000 AD, and explore the effects of including external climate forcings within the reconstruction and of accounting for short-memory and long-memory features. Our reconstructions are based on two linear models, with the first linking the latent temperature series to three main external forcings (solar irradiance, greenhouse gas concentration and volcanism), and the second linking the observed temperature proxy data (tree rings, sediment record, ice cores, etc.) to the unobserved temperature series. Uncertainty is captured with additive noise, and a rigorous statistical investigation of the correlation structure in the regression errors is conducted through systematic comparisons between reconstructions that assume no memory, short-memory autoregressive models, and long-memory fractional Gaussian noise models. We use Bayesian estimation to fit the model parameters and to perform separate reconstructions of land-only and combined land-and-marine temperature anomalies. For model formulations that include forcings, both exploratory and Bayesian data analysis provide evidence against models with no memory. Model assessments indicate that models with no memory underestimate uncertainty. However, no single line of evidence is sufficient to favor short-memory models over long-memory ones, or to favor the opposite choice. When forcings are not included, the long-memory models appear to be necessary. While including external climate forcings substantially improves the reconstruction, accurate reconstructions that exclude these forcings are vital for testing the fidelity of climate models used for future projections.
△ Less
Submitted 4 March, 2015; v1 submitted 13 March, 2014;
originally announced March 2014.
-
Stein's lemma, Malliavin calculus, and tail bounds, with application to polymer fluctuation exponent
Authors:
Frederi G. Viens
Abstract:
We consider a random variable X satisfying almost-sure conditions involving G:=<DX,-DL^{-1}X> where DX is X's Malliavin derivative and L^{-1} is the inverse Ornstein-Uhlenbeck operator. A lower- (resp. upper-) bound condition on G is proved to imply a Gaussian-type lower (resp. upper) bound on the tail P[X>z]. Bounds of other natures are also given. A key ingredient is the use of Stein's lemma,…
▽ More
We consider a random variable X satisfying almost-sure conditions involving G:=<DX,-DL^{-1}X> where DX is X's Malliavin derivative and L^{-1} is the inverse Ornstein-Uhlenbeck operator. A lower- (resp. upper-) bound condition on G is proved to imply a Gaussian-type lower (resp. upper) bound on the tail P[X>z]. Bounds of other natures are also given. A key ingredient is the use of Stein's lemma, including the explicit form of the solution of Stein's equation relative to the function 1_{x>z}, and its relation to G. Another set of comparable results is established, without the use of Stein's lemma, using instead a formula for the density of a random variable based on G, recently devised by the author and Ivan Nourdin. As an application, via a Mehler-type formula for G, we show that the Brownian polymer in a Gaussian environment which is white-noise in time and positively correlated in space has deviations of Gaussian type and a fluctuation exponent χ=1/2. We also show this exponent remains 1/2 after a non-linear transformation of the polymer's Hamiltonian.
△ Less
Submitted 4 January, 2009;
originally announced January 2009.
-
Density estimates and concentration inequalities with Malliavin calculus
Authors:
Ivan Nourdin,
Frederi G. Viens
Abstract:
We show how to use the Malliavin calculus to obtain density estimates of the law of general centered random variables. In particular, under a non-degeneracy condition, we prove and use a new formula for the density of a random variable which is measurable and differentiable with respect to a given isonormal Gaussian process. Among other results, we apply our techniques to bound the density of th…
▽ More
We show how to use the Malliavin calculus to obtain density estimates of the law of general centered random variables. In particular, under a non-degeneracy condition, we prove and use a new formula for the density of a random variable which is measurable and differentiable with respect to a given isonormal Gaussian process. Among other results, we apply our techniques to bound the density of the maximum of a general Gaussian process from above and below; several new results ensue, including improvements on the so-called Borell-Sudakov inequality. We then explain what can be done when one is only interested in or capable of deriving concentration inequalities, i.e. tail bounds from above or below but not necessarily both simultaneously.
△ Less
Submitted 15 August, 2008; v1 submitted 14 August, 2008;
originally announced August 2008.
-
Statistical aspects of the fractional stochastic calculus
Authors:
Ciprian A. Tudor,
Frederi G. Viens
Abstract:
We apply the techniques of stochastic integration with respect to fractional Brownian motion and the theory of regularity and supremum estimation for stochastic processes to study the maximum likelihood estimator (MLE) for the drift parameter of stochastic processes satisfying stochastic equations driven by a fractional Brownian motion with any level of Hölder-regularity (any Hurst parameter). W…
▽ More
We apply the techniques of stochastic integration with respect to fractional Brownian motion and the theory of regularity and supremum estimation for stochastic processes to study the maximum likelihood estimator (MLE) for the drift parameter of stochastic processes satisfying stochastic equations driven by a fractional Brownian motion with any level of Hölder-regularity (any Hurst parameter). We prove existence and strong consistency of the MLE for linear and nonlinear equations. We also prove that a version of the MLE using only discrete observations is still a strongly consistent estimator.
△ Less
Submitted 17 August, 2007; v1 submitted 11 September, 2006;
originally announced September 2006.