-
Hill estimator and extreme quantile estimator for functionals of approximated stochastic processes
Authors:
Jaakko Pere,
Benny Avelin,
Valentin Garino,
Pauliina Ilmonen,
Lauri Viitasaari
Abstract:
We study the effect of approximation errors in assessing the extreme behaviour of univariate functionals of random objects. We build our framework into a general setting where estimation of the extreme value index and extreme quantiles of the functional is based on some approximated value instead of the true one. As an example, we consider the effect of discretisation errors in computation of the…
▽ More
We study the effect of approximation errors in assessing the extreme behaviour of univariate functionals of random objects. We build our framework into a general setting where estimation of the extreme value index and extreme quantiles of the functional is based on some approximated value instead of the true one. As an example, we consider the effect of discretisation errors in computation of the norms of paths of stochastic processes. In particular, we quantify connections between the sample size $n$ (the number of observed paths), the number of the discretisation points $m$, and the modulus of continuity function $φ$ describing the path continuity of the underlying stochastic process. As an interesting example fitting into our framework, we consider processes of form $Y(t) = \mathcal{R}Z(t)$, where $\mathcal{R}$ is a heavy-tailed random variable and the increments of the process $Z$ have lighter tails compared to $\mathcal{R}$.
△ Less
Submitted 7 July, 2023;
originally announced July 2023.
-
On Lamperti transformation and characterisations of discrete random fields
Authors:
Marko Voutilainen,
Lauri Viitasaari,
Pauliina Ilmonen
Abstract:
In this article we characterise discrete time stationary fields by difference equations involving stationary increment fields and self-similar fields. This gives connections between stationary fields, stationary increment fields and, through Lamperti transformation, self-similar fields. Our contribution is a natural generalisation of recently proved results covering the case of stationary processe…
▽ More
In this article we characterise discrete time stationary fields by difference equations involving stationary increment fields and self-similar fields. This gives connections between stationary fields, stationary increment fields and, through Lamperti transformation, self-similar fields. Our contribution is a natural generalisation of recently proved results covering the case of stationary processes.
△ Less
Submitted 4 January, 2023;
originally announced January 2023.
-
On sharp rate of convergence for discretisation of integrals driven by fractional Brownian motions and related processes with discontinuous integrands
Authors:
Ehsan Azmoodeh,
Pauliina Ilmonen,
Nourhan Shafik,
Tommi Sottinen,
Lauri Viitasaari
Abstract:
We consider equidistant approximations of stochastic integrals driven by Hölder continuous Gaussian processes of order $H>\frac12$ with discontinuous integrands involving bounded variation functions. We give exact rate of convergence in the $L^1$-distance and provide examples with different drivers. It turns out that the exact rate of convergence is proportional to $n^{1-2H}$ that is twice better…
▽ More
We consider equidistant approximations of stochastic integrals driven by Hölder continuous Gaussian processes of order $H>\frac12$ with discontinuous integrands involving bounded variation functions. We give exact rate of convergence in the $L^1$-distance and provide examples with different drivers. It turns out that the exact rate of convergence is proportional to $n^{1-2H}$ that is twice better compared to the best known results in the case of discontinuous integrands, and corresponds to the known rate in the case of smooth integrands. The novelty of our approach is that, instead of using multiplicative estimates for the integrals involved, we apply change of variables formula together with some facts on convex functions allowing us to compute expectations explicitly.
△ Less
Submitted 14 September, 2022;
originally announced September 2022.
-
Flexible transition probability model for assessing cost-effectiveness of breast cancer screening
Authors:
Nourhan Shafik,
Pauliina Ilmonen,
Lauri Viitasaari,
Tytti Sarkeala,
Sirpa Heinävaara
Abstract:
Breast cancer is the most common cancer among Western women. Fortunately, organized screening has reduced breast cancer mortality and, consequently, the European Union has recommended screening with mammography for 50-69-year-old women. This recommendation is followed well in Europe. Widening the screening target age further is supported by conditional recommendations for 45-49- and 70-74-year-old…
▽ More
Breast cancer is the most common cancer among Western women. Fortunately, organized screening has reduced breast cancer mortality and, consequently, the European Union has recommended screening with mammography for 50-69-year-old women. This recommendation is followed well in Europe. Widening the screening target age further is supported by conditional recommendations for 45-49- and 70-74-year-old women. However, before extending screening to new age groups, it's essential to carefully consider the benefits and costs locally as circumstances vary between different regions and/or countries. We propose a new approach to assess cost-effectiveness of breast cancer screening for a long-ongoing program with incomplete historical screening data. The new model is called flexible stage distribution model. It is based on estimating the stage distributions of breast cancer cases under different screening strategies. In the model, an ongoing screening strategy may be used as a baseline and other screening strategies may be incorporated by changes in the incidence rates. The model is flexible, as it enables to apply different approaches for estimating the altered stage distributions. Thus, if randomized data is available, one may rely on that. On the other hand, if randomized data is not available, altered stage distributions may be estimated by extrapolating the stage distributions of the youngest and oldest screened/non-screened age groups. We apply the proposed flexible stage distribution model for assessing incremental cost of extending the current biennial breast cancer screening to younger and older target ages in Finland.
△ Less
Submitted 2 September, 2022;
originally announced September 2022.
-
On optimal prediction of missing functional data with memory
Authors:
Pauliina Ilmonen,
Nourhan Shafik,
Tommi Sottinen,
Germain Van Bever,
Lauri Viitasaari
Abstract:
This paper considers the problem of reconstructing missing parts of functions based on their observed segments. It provides, for Gaussian processes and arbitrary bijective transformations thereof, theoretical expressions for the $L^2$-optimal reconstruction of the missing parts. These functions are obtained as solutions of explicit integral equations. In the discrete case, approximations of the so…
▽ More
This paper considers the problem of reconstructing missing parts of functions based on their observed segments. It provides, for Gaussian processes and arbitrary bijective transformations thereof, theoretical expressions for the $L^2$-optimal reconstruction of the missing parts. These functions are obtained as solutions of explicit integral equations. In the discrete case, approximations of the solutions provide consistent expressions of all missing values of the processes. Rates of convergence of these approximations, under extra assumptions on the transformation function, are provided. In the case of Gaussian processes with a parametric covariance structure, the estimation can be conducted separately for each function, and yields nonlinear solutions in presence of memory. Simulated examples show that the proposed reconstruction indeed fares better than the conventional interpolation methods in various situations.
△ Less
Submitted 21 August, 2022;
originally announced August 2022.
-
Integrated shape-sensitive functional metrics
Authors:
Sami Helander,
Petra Laketa,
Pauliina Ilmonen,
Stanislav Nagy,
Germain Van Bever,
Lauri Viitasaari
Abstract:
This paper develops a new integrated ball (pseudo)metric which provides an intermediary between a chosen starting (pseudo)metric d and the L_p distance in general function spaces. Selecting d as the Hausdorff or Fréchet distances, we introduce integrated shape-sensitive versions of these supremum-based metrics. The new metrics allow for finer analyses in functional settings, not attainable applyin…
▽ More
This paper develops a new integrated ball (pseudo)metric which provides an intermediary between a chosen starting (pseudo)metric d and the L_p distance in general function spaces. Selecting d as the Hausdorff or Fréchet distances, we introduce integrated shape-sensitive versions of these supremum-based metrics. The new metrics allow for finer analyses in functional settings, not attainable applying the non-integrated versions directly. Moreover, convergent discrete approximations make computations feasible in practice.
△ Less
Submitted 14 June, 2021;
originally announced July 2021.
-
Latent Model Extreme Value Index Estimation
Authors:
Joni Virta,
Niko Lietzén,
Lauri Viitasaari,
Pauliina Ilmonen
Abstract:
We propose a novel strategy for multivariate extreme value index estimation. In applications such as finance, volatility and risk present in the components of a multivariate time series are often driven by the same underlying factors, such as the subprime crisis in the US. To estimate the latent risk, we apply a two-stage procedure. First, a set of independent latent series is estimated using a me…
▽ More
We propose a novel strategy for multivariate extreme value index estimation. In applications such as finance, volatility and risk present in the components of a multivariate time series are often driven by the same underlying factors, such as the subprime crisis in the US. To estimate the latent risk, we apply a two-stage procedure. First, a set of independent latent series is estimated using a method of latent variable analysis. Then, univariate risk measures are estimated individually for the latent series to assess their contribution to the overall risk. As our main theoretical contribution, we derive conditions under which the effect of the first step to the asymptotic behavior of the risk estimators is negligible. Simulations demonstrate the theory under both i.i.d. and dependent data, and an application into financial data illustrates the usefulness of the method in extracting joint sources of risk in practice.
△ Less
Submitted 23 March, 2020;
originally announced March 2020.
-
Modeling temporally uncorrelated components for complex-valued stationary processes
Authors:
Niko Lietzén,
Lauri Viitasaari,
Pauliina Ilmonen
Abstract:
We consider a complex-valued linear mixture model, under discrete weakly stationary processes. We recover latent components of interest, which have undergone a linear mixing. We study asymptotic properties of a classical unmixing estimator, that is based on simultaneous diagonalization of the covariance matrix and an autocovariance matrix with lag $τ$. Our main contribution is that our asymptotic…
▽ More
We consider a complex-valued linear mixture model, under discrete weakly stationary processes. We recover latent components of interest, which have undergone a linear mixing. We study asymptotic properties of a classical unmixing estimator, that is based on simultaneous diagonalization of the covariance matrix and an autocovariance matrix with lag $τ$. Our main contribution is that our asymptotic results can be applied to a large class of processes. In related literature, the processes are typically assumed to have weak correlations. We extend this class and consider the unmixing estimator under stronger dependency structures. In particular, we analyze the asymptotic behavior of the unmixing estimator under both, long- and short-range dependent complex-valued processes. Consequently, our theory covers unmixing estimators that converge slower than the usual $\sqrt{T}$ and unmixing estimators that produce non-Gaussian asymptotic distributions. The presented methodology is a powerful prepossessing tool and highly applicable in several fields of statistics. Complex-valued processes are frequently encountered in, for example, biomedical applications and signal processing. In addition, our approach can be applied to model real-valued problems that involve temporally uncorrelated pairs. These are encountered in, for example, applications in finance.
△ Less
Submitted 11 March, 2020; v1 submitted 9 March, 2020;
originally announced March 2020.
-
Vector-valued Generalised Ornstein-Uhlenbeck Processes
Authors:
Marko Voutilainen,
Lauri Viitasaari,
Pauliina Ilmonen,
Soledad Torres,
Ciprian Tudor
Abstract:
Generalisations of the Ornstein-Uhlenbeck process defined through Langevin equation $dU_t = - ΘU_t dt + dG_t,$ such as fractional Ornstein-Uhlenbeck processes, have recently received a lot of attention in the literature. In particular, estimation of the unknown parameter $Θ$ is widely studied under Gaussian stationary increment noise $G$. Langevin equation is well-known for its connections to phys…
▽ More
Generalisations of the Ornstein-Uhlenbeck process defined through Langevin equation $dU_t = - ΘU_t dt + dG_t,$ such as fractional Ornstein-Uhlenbeck processes, have recently received a lot of attention in the literature. In particular, estimation of the unknown parameter $Θ$ is widely studied under Gaussian stationary increment noise $G$. Langevin equation is well-known for its connections to physics. In addition to that, motivation for studying Langevin equation with a general noise $G$ stems from the fact that the equation characterises all univariate stationary processes. Most of the literature on the topic focuses on the one-dimensional case with Gaussian noise $G$. In this article, we consider estimation of the unknown model parameter in the multidimensional version of the Langevin equation, where the parameter $Θ$ is a matrix and $G$ is a general, not necessarily Gaussian, vector-valued process with stationary increments. Based on algebraic Riccati equations, we construct an estimator for the matrix $Θ$. Moreover, we prove the consistency of the estimator and derive its limiting distribution under natural assumptions. In addition, to motivate our work, we prove that the Langevin equation characterises all stationary processes in a multidimensional setting as well.
△ Less
Submitted 19 November, 2020; v1 submitted 5 September, 2019;
originally announced September 2019.
-
Oscillating Gaussian Processes
Authors:
Pauliina Ilmonen,
Soledad Torres,
Lauri Viitasaari
Abstract:
In this article we introduce and study oscillating Gaussian processes defined by $X_t = α_+ Y_t {\bf 1}_{Y_t >0} + α_- Y_t{\bf 1}_{Y_t<0}$, where $α_+,α_->0$ are free parameters and $Y$ is either stationary or self-similar Gaussian process. We study the basic properties of $X$ and we consider estimation of the model parameters. In particular, we show that the moment estimators converge in $L^p$ an…
▽ More
In this article we introduce and study oscillating Gaussian processes defined by $X_t = α_+ Y_t {\bf 1}_{Y_t >0} + α_- Y_t{\bf 1}_{Y_t<0}$, where $α_+,α_->0$ are free parameters and $Y$ is either stationary or self-similar Gaussian process. We study the basic properties of $X$ and we consider estimation of the model parameters. In particular, we show that the moment estimators converge in $L^p$ and are, when suitably normalised, asymptotically normal.
△ Less
Submitted 28 May, 2019;
originally announced May 2019.
-
Continuous time Gaussian process dynamical models in gene regulatory network inference
Authors:
Atte Aalto,
Lauri Viitasaari,
Pauliina Ilmonen,
Laurent Mombaerts,
Jorge Goncalves
Abstract:
One of the focus areas of modern scientific research is to reveal mysteries related to genes and their interactions. The dynamic interactions between genes can be encoded into a gene regulatory network (GRN), which can be used to gain understanding on the genetic mechanisms behind observable phenotypes. GRN inference from time series data has recently been a focus area of systems biology. Due to l…
▽ More
One of the focus areas of modern scientific research is to reveal mysteries related to genes and their interactions. The dynamic interactions between genes can be encoded into a gene regulatory network (GRN), which can be used to gain understanding on the genetic mechanisms behind observable phenotypes. GRN inference from time series data has recently been a focus area of systems biology. Due to low sampling frequency of the data, this is a notoriously difficult problem. We tackle the challenge by introducing the so-called continuous-time Gaussian process dynamical model, based on Gaussian process framework that has gained popularity in nonlinear regression problems arising in machine learning. The model dynamics are governed by a stochastic differential equation, where the dynamics function is modelled as a Gaussian process. We prove the existence and uniqueness of solutions of the stochastic differential equation. We derive the probability distribution for the Euler discretised trajectories and establish the convergence of the discretisation. We develop a GRN inference method called BINGO, based on the developed framework. BINGO is based on MCMC sampling of trajectories of the GPDM and estimating the hyperparameters of the covariance function of the Gaussian process. Using benchmark data examples, we show that BINGO is superior in dealing with poor time resolution and it is computationally feasible.
△ Less
Submitted 15 July, 2020; v1 submitted 24 August, 2018;
originally announced August 2018.
-
Fast tensorial JADE
Authors:
Joni Virta,
Niko Lietzén,
Pauliina Ilmonen,
Klaus Nordhausen
Abstract:
In this work, we propose a novel method for tensorial independent component analysis. Our approach is based on TJADE and $ k $-JADE, two recently proposed generalizations of the classical JADE algorithm. Our novel method achieves the consistency and the limiting distribution of TJADE under mild assumptions, and at the same time offers notable improvement in computational speed. Detailed mathematic…
▽ More
In this work, we propose a novel method for tensorial independent component analysis. Our approach is based on TJADE and $ k $-JADE, two recently proposed generalizations of the classical JADE algorithm. Our novel method achieves the consistency and the limiting distribution of TJADE under mild assumptions, and at the same time offers notable improvement in computational speed. Detailed mathematical proofs of the statistical properties of our method are given and, as a special case, a conjecture on the properties of $ k $-JADE is resolved. Simulations and timing comparisons demonstrate remarkable gain in speed. Moreover, the desired efficiency is obtained approximately for finite samples. The method is applied successfully to large-scale video data, for which neither TJADE nor $ k $-JADE is feasible. Finally, an experimental procedure is proposed to select the values of a set of tuning parameters.
△ Less
Submitted 17 January, 2020; v1 submitted 2 August, 2018;
originally announced August 2018.
-
On generalized ARCH model with stationary liquidity
Authors:
Pauliina Ilmonen,
Soledad Torres,
Ciprian Tudor,
Lauri Viitasaari,
Marko Voutilainen
Abstract:
We study a generalized ARCH model with liquidity given by a general stationary process. We provide minimal assumptions that ensure the existence and uniqueness of the stationary solution. In addition, we provide consistent estimators for the model parameters by using AR(1) type characterisation. We illustrate our results with several examples and simulation studies.
We study a generalized ARCH model with liquidity given by a general stationary process. We provide minimal assumptions that ensure the existence and uniqueness of the stationary solution. In addition, we provide consistent estimators for the model parameters by using AR(1) type characterisation. We illustrate our results with several examples and simulation studies.
△ Less
Submitted 22 June, 2018;
originally announced June 2018.
-
Note on AR(1)-characterisation of stationary processes and model fitting
Authors:
Marko Voutilainen,
Lauri Viitasaari,
Pauliina Ilmonen
Abstract:
It was recently proved that any strictly stationary stochastic process can be viewed as an autoregressive process of order one with coloured noise. Furthermore, it was proved that, using this characterisation, one can define closed form estimators for the model parameter based on autocovariance estimators for several different lags. However, this estimation procedure may fail in some special cases…
▽ More
It was recently proved that any strictly stationary stochastic process can be viewed as an autoregressive process of order one with coloured noise. Furthermore, it was proved that, using this characterisation, one can define closed form estimators for the model parameter based on autocovariance estimators for several different lags. However, this estimation procedure may fail in some special cases. In this article we provide a detailed analysis of these special cases. In particular, we prove that these cases correspond to degenerate processes.
△ Less
Submitted 28 May, 2018;
originally announced May 2018.
-
Positive definite functions on semilattices
Authors:
Vesa Kaarnioja,
Pentti Haukkanen,
Pauliina Ilmonen,
Mika Mattila
Abstract:
We introduce a notion of positive definiteness for functions $f\!:P\to\mathbb{R}$ defined on meet semilattices $(P,\preceq,\wedge)$ and prove several properties for these functions. In addition, we utilize the $LDL^{\rm T}$ decomposition of meet matrices in order to explore the properties of multivariate positive definite arithmetic functions $f\!:\mathbb{Z}_+^d\to\mathbb{R}$. Finally, we give a s…
▽ More
We introduce a notion of positive definiteness for functions $f\!:P\to\mathbb{R}$ defined on meet semilattices $(P,\preceq,\wedge)$ and prove several properties for these functions. In addition, we utilize the $LDL^{\rm T}$ decomposition of meet matrices in order to explore the properties of multivariate positive definite arithmetic functions $f\!:\mathbb{Z}_+^d\to\mathbb{R}$. Finally, we give a series of examples and counterexamples of positive definite functions.
△ Less
Submitted 27 April, 2020; v1 submitted 9 April, 2018;
originally announced April 2018.
-
On model fitting and estimation of strictly stationary processes
Authors:
Marko Voutilainen,
Lauri Viitasaari,
Pauliina Ilmonen
Abstract:
Stationary processes have been extensively studied in the literature. Their applications include modeling and forecasting numerous real life phenomena such as natural disasters, sales and market movements. When stationary processes are considered, modeling is traditionally based on fitting an autoregressive moving average (ARMA) process. However, we challenge this conventional approach. Instead of…
▽ More
Stationary processes have been extensively studied in the literature. Their applications include modeling and forecasting numerous real life phenomena such as natural disasters, sales and market movements. When stationary processes are considered, modeling is traditionally based on fitting an autoregressive moving average (ARMA) process. However, we challenge this conventional approach. Instead of fitting an ARMA model, we apply an AR(1) characterization in modeling any strictly stationary processes. Moreover, we derive consistent and asymptotically normal estimators of the corresponding model parameter.
△ Less
Submitted 9 January, 2018; v1 submitted 24 August, 2017;
originally announced August 2017.
-
On modeling weakly stationary processes
Authors:
Lauri Viitasaari,
Pauliina Ilmonen
Abstract:
In this article, we show that a general class of weakly stationary time series can be modeled applying Gaussian subordinated processes. We show that, for any given weakly stationary time series $(z_t)_{z\in\mathbb{N}}$ with given equal one-dimensional marginal distribution, one can always construct a function $f$ and a Gaussian process $(X_t)_{t\in\mathbb{N}}$ such that…
▽ More
In this article, we show that a general class of weakly stationary time series can be modeled applying Gaussian subordinated processes. We show that, for any given weakly stationary time series $(z_t)_{z\in\mathbb{N}}$ with given equal one-dimensional marginal distribution, one can always construct a function $f$ and a Gaussian process $(X_t)_{t\in\mathbb{N}}$ such that $\left(f(X_t)\right)_{t\in\mathbb{N}}$ has the same marginal distributions and, asymptotically, the same autocovariance function as $(z_t)_{t\in\mathbb{N}}$. Consequently, we obtain asymptotic distributions for the mean and autocovariance estimators by using the rich theory on limit theorems for Gaussian subordinated processes. This highlights the role of Gaussian subordinated processes in modeling general weakly stationary time series. We compare our approach to standard linear models, and show that our model is more flexible and requires weaker assumptions.
△ Less
Submitted 23 October, 2019; v1 submitted 29 July, 2017;
originally announced July 2017.
-
Generalized eigenvalue problems for meet and join matrices on semilattices
Authors:
Pauliina Ilmonen,
Vesa Kaarnioja
Abstract:
We study generalized eigenvalue problems for meet and join matrices with respect to incidence functions on semilattices. We provide new bounds for generalized eigenvalues of meet matrices with respect to join matrices under very general assumptions. The applied methodology is flexible, and it is shown in the case of GCD and LCM matrices that even sharper bounds can be obtained by applying the know…
▽ More
We study generalized eigenvalue problems for meet and join matrices with respect to incidence functions on semilattices. We provide new bounds for generalized eigenvalues of meet matrices with respect to join matrices under very general assumptions. The applied methodology is flexible, and it is shown in the case of GCD and LCM matrices that even sharper bounds can be obtained by applying the known properties of the divisor lattice. These results can also be easily modified for the dual problem of eigenvalues of join matrices with respect to meet matrices, which we briefly consider as well. We investigate the effectiveness of the obtained bounds for select examples involving number-theoretical lattices.
△ Less
Submitted 13 June, 2017; v1 submitted 15 May, 2017;
originally announced May 2017.
-
Computation of extremal eigenvalues of high-dimensional lattice-theoretic tensors via tensor-train decompositions
Authors:
Harri Hakula,
Pauliina Ilmonen,
Vesa Kaarnioja
Abstract:
This paper lies in the intersection of several fields: number theory, lattice theory, multilinear algebra, and scientific computing. We adapt existing solution algorithms for tensor eigenvalue problems to the tensor-train framework. As an application, we consider eigenvalue problems associated with a class of lattice-theoretic meet and join tensors, which may be regarded as multidimensional extens…
▽ More
This paper lies in the intersection of several fields: number theory, lattice theory, multilinear algebra, and scientific computing. We adapt existing solution algorithms for tensor eigenvalue problems to the tensor-train framework. As an application, we consider eigenvalue problems associated with a class of lattice-theoretic meet and join tensors, which may be regarded as multidimensional extensions of the classically studied meet and join matrices such as GCD and LCM matrices, respectively. In order to effectively apply the solution algorithms, we show that meet tensors have an explicit low-rank tensor-train decomposition with sparse tensor-train cores with respect to the dimension. Moreover, this representation is independent of tensor order, which eliminates the so-called curse of dimensionality from the numerical analysis of these objects and makes the solution of tensor eigenvalue problems tractable with increasing dimensionality and order. For LCM tensors it is shown that a tensor-train decomposition with an a priori known TT rank exists under certain assumptions. We present a series of easily reproducible numerical examples covering tensor eigenvalue and generalized eigenvalue problems that serve as future benchmarks. The numerical results are used to assess the sharpness of existing theoretical estimates.
△ Less
Submitted 4 October, 2017; v1 submitted 15 May, 2017;
originally announced May 2017.
-
On Asymptotic Properties of the Separating Hill Estimator
Authors:
Matias Heikkilä,
Yves Dominicy,
Pauliina Ilmonen
Abstract:
Modeling and understanding multivariate extreme events is challenging, but of great importance in various applications - e.g. in biostatistics, climatology, and finance. The separating Hill estimator can be used in estimating the extreme value index of a heavy tailed multivariate elliptical distribution. We consider the asymptotic behavior of the separating Hill estimator under estimated location…
▽ More
Modeling and understanding multivariate extreme events is challenging, but of great importance in various applications - e.g. in biostatistics, climatology, and finance. The separating Hill estimator can be used in estimating the extreme value index of a heavy tailed multivariate elliptical distribution. We consider the asymptotic behavior of the separating Hill estimator under estimated location and scatter. The asymptotic properties of the separating Hill estimator are known under elliptical distribution with known location and scatter. However, the effect of estimation of the location and scatter has previously been examined only in a simulation study. We show, analytically, that the separating Hill estimator is consistent and asymptotically normal under estimated location and scatter, when certain mild conditions are met.
△ Less
Submitted 12 January, 2016; v1 submitted 27 November, 2015;
originally announced November 2015.
-
Semiparametrically efficient inference based on signed ranks in symmetric independent component models
Authors:
Pauliina Ilmonen,
Davy Paindaveine
Abstract:
We consider semiparametric location-scatter models for which the $p$-variate observation is obtained as $X=ΛZ+μ$, where $μ$ is a $p$-vector, $Λ$ is a full-rank $p\times p$ matrix and the (unobserved) random $p$-vector $Z$ has marginals that are centered and mutually independent but are otherwise unspecified. As in blind source separation and independent component analysis (ICA), the parameter of i…
▽ More
We consider semiparametric location-scatter models for which the $p$-variate observation is obtained as $X=ΛZ+μ$, where $μ$ is a $p$-vector, $Λ$ is a full-rank $p\times p$ matrix and the (unobserved) random $p$-vector $Z$ has marginals that are centered and mutually independent but are otherwise unspecified. As in blind source separation and independent component analysis (ICA), the parameter of interest throughout the paper is $Λ$. On the basis of $n$ i.i.d. copies of $X$, we develop, under a symmetry assumption on $Z$, signed-rank one-sample testing and estimation procedures for $Λ$. We exploit the uniform local and asymptotic normality (ULAN) of the model to define signed-rank procedures that are semiparametrically efficient under correctly specified densities. Yet, as is usual in rank-based inference, the proposed procedures remain valid (correct asymptotic size under the null, for hypothesis testing, and root-$n$ consistency, for point estimation) under a very broad range of densities. We derive the asymptotic properties of the proposed procedures and investigate their finite-sample behavior through simulations.
△ Less
Submitted 23 February, 2012;
originally announced February 2012.