Search | arXiv e-print repository

Hill estimator and extreme quantile estimator for functionals of approximated stochastic processes

Authors: Jaakko Pere, Benny Avelin, Valentin Garino, Pauliina Ilmonen, Lauri Viitasaari

Abstract: We study the effect of approximation errors in assessing the extreme behaviour of univariate functionals of random objects. We build our framework into a general setting where estimation of the extreme value index and extreme quantiles of the functional is based on some approximated value instead of the true one. As an example, we consider the effect of discretisation errors in computation of the… ▽ More We study the effect of approximation errors in assessing the extreme behaviour of univariate functionals of random objects. We build our framework into a general setting where estimation of the extreme value index and extreme quantiles of the functional is based on some approximated value instead of the true one. As an example, we consider the effect of discretisation errors in computation of the norms of paths of stochastic processes. In particular, we quantify connections between the sample size $n$ (the number of observed paths), the number of the discretisation points $m$, and the modulus of continuity function $φ$ describing the path continuity of the underlying stochastic process. As an interesting example fitting into our framework, we consider processes of form $Y(t) = \mathcal{R}Z(t)$, where $\mathcal{R}$ is a heavy-tailed random variable and the increments of the process $Z$ have lighter tails compared to $\mathcal{R}$. △ Less

Submitted 7 July, 2023; originally announced July 2023.

MSC Class: 62G32; 60G70

arXiv:2301.01639 [pdf, ps, other]

On Lamperti transformation and characterisations of discrete random fields

Authors: Marko Voutilainen, Lauri Viitasaari, Pauliina Ilmonen

Abstract: In this article we characterise discrete time stationary fields by difference equations involving stationary increment fields and self-similar fields. This gives connections between stationary fields, stationary increment fields and, through Lamperti transformation, self-similar fields. Our contribution is a natural generalisation of recently proved results covering the case of stationary processe… ▽ More In this article we characterise discrete time stationary fields by difference equations involving stationary increment fields and self-similar fields. This gives connections between stationary fields, stationary increment fields and, through Lamperti transformation, self-similar fields. Our contribution is a natural generalisation of recently proved results covering the case of stationary processes. △ Less

Submitted 4 January, 2023; originally announced January 2023.

arXiv:2209.06708 [pdf, ps, other]

On sharp rate of convergence for discretisation of integrals driven by fractional Brownian motions and related processes with discontinuous integrands

Authors: Ehsan Azmoodeh, Pauliina Ilmonen, Nourhan Shafik, Tommi Sottinen, Lauri Viitasaari

Abstract: We consider equidistant approximations of stochastic integrals driven by Hölder continuous Gaussian processes of order $H>\frac12$ with discontinuous integrands involving bounded variation functions. We give exact rate of convergence in the $L^1$-distance and provide examples with different drivers. It turns out that the exact rate of convergence is proportional to $n^{1-2H}$ that is twice better… ▽ More We consider equidistant approximations of stochastic integrals driven by Hölder continuous Gaussian processes of order $H>\frac12$ with discontinuous integrands involving bounded variation functions. We give exact rate of convergence in the $L^1$-distance and provide examples with different drivers. It turns out that the exact rate of convergence is proportional to $n^{1-2H}$ that is twice better compared to the best known results in the case of discontinuous integrands, and corresponds to the known rate in the case of smooth integrands. The novelty of our approach is that, instead of using multiplicative estimates for the integrals involved, we apply change of variables formula together with some facts on convex functions allowing us to compute expectations explicitly. △ Less

Submitted 14 September, 2022; originally announced September 2022.

Comments: 21 pages

MSC Class: 60G15; 60G22; 60H05

arXiv:2209.01216 [pdf, other]

doi 10.1371/journal.pone.0287486

Flexible transition probability model for assessing cost-effectiveness of breast cancer screening

Authors: Nourhan Shafik, Pauliina Ilmonen, Lauri Viitasaari, Tytti Sarkeala, Sirpa Heinävaara

Abstract: Breast cancer is the most common cancer among Western women. Fortunately, organized screening has reduced breast cancer mortality and, consequently, the European Union has recommended screening with mammography for 50-69-year-old women. This recommendation is followed well in Europe. Widening the screening target age further is supported by conditional recommendations for 45-49- and 70-74-year-old… ▽ More Breast cancer is the most common cancer among Western women. Fortunately, organized screening has reduced breast cancer mortality and, consequently, the European Union has recommended screening with mammography for 50-69-year-old women. This recommendation is followed well in Europe. Widening the screening target age further is supported by conditional recommendations for 45-49- and 70-74-year-old women. However, before extending screening to new age groups, it's essential to carefully consider the benefits and costs locally as circumstances vary between different regions and/or countries. We propose a new approach to assess cost-effectiveness of breast cancer screening for a long-ongoing program with incomplete historical screening data. The new model is called flexible stage distribution model. It is based on estimating the stage distributions of breast cancer cases under different screening strategies. In the model, an ongoing screening strategy may be used as a baseline and other screening strategies may be incorporated by changes in the incidence rates. The model is flexible, as it enables to apply different approaches for estimating the altered stage distributions. Thus, if randomized data is available, one may rely on that. On the other hand, if randomized data is not available, altered stage distributions may be estimated by extrapolating the stage distributions of the youngest and oldest screened/non-screened age groups. We apply the proposed flexible stage distribution model for assessing incremental cost of extending the current biennial breast cancer screening to younger and older target ages in Finland. △ Less

Submitted 2 September, 2022; originally announced September 2022.

arXiv:2208.09925 [pdf, other]

On optimal prediction of missing functional data with memory

Authors: Pauliina Ilmonen, Nourhan Shafik, Tommi Sottinen, Germain Van Bever, Lauri Viitasaari

Abstract: This paper considers the problem of reconstructing missing parts of functions based on their observed segments. It provides, for Gaussian processes and arbitrary bijective transformations thereof, theoretical expressions for the $L^2$-optimal reconstruction of the missing parts. These functions are obtained as solutions of explicit integral equations. In the discrete case, approximations of the so… ▽ More This paper considers the problem of reconstructing missing parts of functions based on their observed segments. It provides, for Gaussian processes and arbitrary bijective transformations thereof, theoretical expressions for the $L^2$-optimal reconstruction of the missing parts. These functions are obtained as solutions of explicit integral equations. In the discrete case, approximations of the solutions provide consistent expressions of all missing values of the processes. Rates of convergence of these approximations, under extra assumptions on the transformation function, are provided. In the case of Gaussian processes with a parametric covariance structure, the estimation can be conducted separately for each function, and yields nonlinear solutions in presence of memory. Simulated examples show that the proposed reconstruction indeed fares better than the conventional interpolation methods in various situations. △ Less

Submitted 21 August, 2022; originally announced August 2022.

MSC Class: 62R10; 60G15; 60G25

arXiv:2107.08917 [pdf, other]

doi 10.1016/j.jmva.2021.104880

Integrated shape-sensitive functional metrics

Authors: Sami Helander, Petra Laketa, Pauliina Ilmonen, Stanislav Nagy, Germain Van Bever, Lauri Viitasaari

Abstract: This paper develops a new integrated ball (pseudo)metric which provides an intermediary between a chosen starting (pseudo)metric d and the L_p distance in general function spaces. Selecting d as the Hausdorff or Fréchet distances, we introduce integrated shape-sensitive versions of these supremum-based metrics. The new metrics allow for finer analyses in functional settings, not attainable applyin… ▽ More This paper develops a new integrated ball (pseudo)metric which provides an intermediary between a chosen starting (pseudo)metric d and the L_p distance in general function spaces. Selecting d as the Hausdorff or Fréchet distances, we introduce integrated shape-sensitive versions of these supremum-based metrics. The new metrics allow for finer analyses in functional settings, not attainable applying the non-integrated versions directly. Moreover, convergent discrete approximations make computations feasible in practice. △ Less

Submitted 14 June, 2021; originally announced July 2021.

MSC Class: 62R10; 62R20

Journal ref: J. Multivariate Anal. 189, 104880 (2022)

arXiv:2003.10330 [pdf, other]

Latent Model Extreme Value Index Estimation

Authors: Joni Virta, Niko Lietzén, Lauri Viitasaari, Pauliina Ilmonen

Abstract: We propose a novel strategy for multivariate extreme value index estimation. In applications such as finance, volatility and risk present in the components of a multivariate time series are often driven by the same underlying factors, such as the subprime crisis in the US. To estimate the latent risk, we apply a two-stage procedure. First, a set of independent latent series is estimated using a me… ▽ More We propose a novel strategy for multivariate extreme value index estimation. In applications such as finance, volatility and risk present in the components of a multivariate time series are often driven by the same underlying factors, such as the subprime crisis in the US. To estimate the latent risk, we apply a two-stage procedure. First, a set of independent latent series is estimated using a method of latent variable analysis. Then, univariate risk measures are estimated individually for the latent series to assess their contribution to the overall risk. As our main theoretical contribution, we derive conditions under which the effect of the first step to the asymptotic behavior of the risk estimators is negligible. Simulations demonstrate the theory under both i.i.d. and dependent data, and an application into financial data illustrates the usefulness of the method in extracting joint sources of risk in practice. △ Less

Submitted 23 March, 2020; originally announced March 2020.

Comments: 47 pages, 8 figures

arXiv:2003.04199 [pdf, other]

Modeling temporally uncorrelated components for complex-valued stationary processes

Authors: Niko Lietzén, Lauri Viitasaari, Pauliina Ilmonen

Abstract: We consider a complex-valued linear mixture model, under discrete weakly stationary processes. We recover latent components of interest, which have undergone a linear mixing. We study asymptotic properties of a classical unmixing estimator, that is based on simultaneous diagonalization of the covariance matrix and an autocovariance matrix with lag $τ$. Our main contribution is that our asymptotic… ▽ More We consider a complex-valued linear mixture model, under discrete weakly stationary processes. We recover latent components of interest, which have undergone a linear mixing. We study asymptotic properties of a classical unmixing estimator, that is based on simultaneous diagonalization of the covariance matrix and an autocovariance matrix with lag $τ$. Our main contribution is that our asymptotic results can be applied to a large class of processes. In related literature, the processes are typically assumed to have weak correlations. We extend this class and consider the unmixing estimator under stronger dependency structures. In particular, we analyze the asymptotic behavior of the unmixing estimator under both, long- and short-range dependent complex-valued processes. Consequently, our theory covers unmixing estimators that converge slower than the usual $\sqrt{T}$ and unmixing estimators that produce non-Gaussian asymptotic distributions. The presented methodology is a powerful prepossessing tool and highly applicable in several fields of statistics. Complex-valued processes are frequently encountered in, for example, biomedical applications and signal processing. In addition, our approach can be applied to model real-valued problems that involve temporally uncorrelated pairs. These are encountered in, for example, applications in finance. △ Less

Submitted 11 March, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

Comments: 41 pages, 4 figures

MSC Class: 62H12; 60F05; 60G15; 60G10; 94A12; 94A08

arXiv:1909.02376 [pdf, ps, other]

Vector-valued Generalised Ornstein-Uhlenbeck Processes

Authors: Marko Voutilainen, Lauri Viitasaari, Pauliina Ilmonen, Soledad Torres, Ciprian Tudor

Abstract: Generalisations of the Ornstein-Uhlenbeck process defined through Langevin equation $dU_t = - ΘU_t dt + dG_t,$ such as fractional Ornstein-Uhlenbeck processes, have recently received a lot of attention in the literature. In particular, estimation of the unknown parameter $Θ$ is widely studied under Gaussian stationary increment noise $G$. Langevin equation is well-known for its connections to phys… ▽ More Generalisations of the Ornstein-Uhlenbeck process defined through Langevin equation $dU_t = - ΘU_t dt + dG_t,$ such as fractional Ornstein-Uhlenbeck processes, have recently received a lot of attention in the literature. In particular, estimation of the unknown parameter $Θ$ is widely studied under Gaussian stationary increment noise $G$. Langevin equation is well-known for its connections to physics. In addition to that, motivation for studying Langevin equation with a general noise $G$ stems from the fact that the equation characterises all univariate stationary processes. Most of the literature on the topic focuses on the one-dimensional case with Gaussian noise $G$. In this article, we consider estimation of the unknown model parameter in the multidimensional version of the Langevin equation, where the parameter $Θ$ is a matrix and $G$ is a general, not necessarily Gaussian, vector-valued process with stationary increments. Based on algebraic Riccati equations, we construct an estimator for the matrix $Θ$. Moreover, we prove the consistency of the estimator and derive its limiting distribution under natural assumptions. In addition, to motivate our work, we prove that the Langevin equation characterises all stationary processes in a multidimensional setting as well. △ Less

Submitted 19 November, 2020; v1 submitted 5 September, 2019; originally announced September 2019.

Comments: Alignment of equations has been changed so that they fit inside the margins. Updated reference list

MSC Class: 60G10; 62M10; 62H12; 62G05

arXiv:1905.12031 [pdf, ps, other]

Oscillating Gaussian Processes

Authors: Pauliina Ilmonen, Soledad Torres, Lauri Viitasaari

Abstract: In this article we introduce and study oscillating Gaussian processes defined by $X_t = α_+ Y_t {\bf 1}_{Y_t >0} + α_- Y_t{\bf 1}_{Y_t<0}$, where $α_+,α_->0$ are free parameters and $Y$ is either stationary or self-similar Gaussian process. We study the basic properties of $X$ and we consider estimation of the model parameters. In particular, we show that the moment estimators converge in $L^p$ an… ▽ More In this article we introduce and study oscillating Gaussian processes defined by $X_t = α_+ Y_t {\bf 1}_{Y_t >0} + α_- Y_t{\bf 1}_{Y_t<0}$, where $α_+,α_->0$ are free parameters and $Y$ is either stationary or self-similar Gaussian process. We study the basic properties of $X$ and we consider estimation of the model parameters. In particular, we show that the moment estimators converge in $L^p$ and are, when suitably normalised, asymptotically normal. △ Less

Submitted 28 May, 2019; originally announced May 2019.

MSC Class: 60G15 (primary); 60F05; 60F25; 62F10; 62F12

arXiv:1808.08161 [pdf, other]

doi 10.1038/s41467-020-17217-1

Continuous time Gaussian process dynamical models in gene regulatory network inference

Authors: Atte Aalto, Lauri Viitasaari, Pauliina Ilmonen, Laurent Mombaerts, Jorge Goncalves

Abstract: One of the focus areas of modern scientific research is to reveal mysteries related to genes and their interactions. The dynamic interactions between genes can be encoded into a gene regulatory network (GRN), which can be used to gain understanding on the genetic mechanisms behind observable phenotypes. GRN inference from time series data has recently been a focus area of systems biology. Due to l… ▽ More One of the focus areas of modern scientific research is to reveal mysteries related to genes and their interactions. The dynamic interactions between genes can be encoded into a gene regulatory network (GRN), which can be used to gain understanding on the genetic mechanisms behind observable phenotypes. GRN inference from time series data has recently been a focus area of systems biology. Due to low sampling frequency of the data, this is a notoriously difficult problem. We tackle the challenge by introducing the so-called continuous-time Gaussian process dynamical model, based on Gaussian process framework that has gained popularity in nonlinear regression problems arising in machine learning. The model dynamics are governed by a stochastic differential equation, where the dynamics function is modelled as a Gaussian process. We prove the existence and uniqueness of solutions of the stochastic differential equation. We derive the probability distribution for the Euler discretised trajectories and establish the convergence of the discretisation. We develop a GRN inference method called BINGO, based on the developed framework. BINGO is based on MCMC sampling of trajectories of the GPDM and estimating the hyperparameters of the covariance function of the Gaussian process. Using benchmark data examples, we show that BINGO is superior in dealing with poor time resolution and it is computationally feasible. △ Less

Submitted 15 July, 2020; v1 submitted 24 August, 2018; originally announced August 2018.

Comments: Preprint version of a published article. NOTE: The published version title is different from the preprint

Journal ref: "Gene regulatory network inference from sparsely sampled noisy data", Nature Communications 11: 3493 (2020)

arXiv:1808.00791 [pdf, other]

doi 10.1111/sjos.12445

Fast tensorial JADE

Authors: Joni Virta, Niko Lietzén, Pauliina Ilmonen, Klaus Nordhausen

Abstract: In this work, we propose a novel method for tensorial independent component analysis. Our approach is based on TJADE and $ k $-JADE, two recently proposed generalizations of the classical JADE algorithm. Our novel method achieves the consistency and the limiting distribution of TJADE under mild assumptions, and at the same time offers notable improvement in computational speed. Detailed mathematic… ▽ More In this work, we propose a novel method for tensorial independent component analysis. Our approach is based on TJADE and $ k $-JADE, two recently proposed generalizations of the classical JADE algorithm. Our novel method achieves the consistency and the limiting distribution of TJADE under mild assumptions, and at the same time offers notable improvement in computational speed. Detailed mathematical proofs of the statistical properties of our method are given and, as a special case, a conjecture on the properties of $ k $-JADE is resolved. Simulations and timing comparisons demonstrate remarkable gain in speed. Moreover, the desired efficiency is obtained approximately for finite samples. The method is applied successfully to large-scale video data, for which neither TJADE nor $ k $-JADE is feasible. Finally, an experimental procedure is proposed to select the values of a set of tuning parameters. △ Less

Submitted 17 January, 2020; v1 submitted 2 August, 2018; originally announced August 2018.

Comments: 44 pages, 11 figures. Note: the title of the manuscript was earlier "Asymptotically and computationally efficient tensorial JADE"

Journal ref: Scandinavian Journal of Statistics, 48, 164-187, 2021

arXiv:1806.08608 [pdf, other]

On generalized ARCH model with stationary liquidity

Authors: Pauliina Ilmonen, Soledad Torres, Ciprian Tudor, Lauri Viitasaari, Marko Voutilainen

Abstract: We study a generalized ARCH model with liquidity given by a general stationary process. We provide minimal assumptions that ensure the existence and uniqueness of the stationary solution. In addition, we provide consistent estimators for the model parameters by using AR(1) type characterisation. We illustrate our results with several examples and simulation studies. We study a generalized ARCH model with liquidity given by a general stationary process. We provide minimal assumptions that ensure the existence and uniqueness of the stationary solution. In addition, we provide consistent estimators for the model parameters by using AR(1) type characterisation. We illustrate our results with several examples and simulation studies. △ Less

Submitted 22 June, 2018; originally announced June 2018.

MSC Class: 60G10; 62M10; 62G05

arXiv:1805.10948 [pdf, other]

doi 10.15559/19-VMSTA132

Note on AR(1)-characterisation of stationary processes and model fitting

Authors: Marko Voutilainen, Lauri Viitasaari, Pauliina Ilmonen

Abstract: It was recently proved that any strictly stationary stochastic process can be viewed as an autoregressive process of order one with coloured noise. Furthermore, it was proved that, using this characterisation, one can define closed form estimators for the model parameter based on autocovariance estimators for several different lags. However, this estimation procedure may fail in some special cases… ▽ More It was recently proved that any strictly stationary stochastic process can be viewed as an autoregressive process of order one with coloured noise. Furthermore, it was proved that, using this characterisation, one can define closed form estimators for the model parameter based on autocovariance estimators for several different lags. However, this estimation procedure may fail in some special cases. In this article we provide a detailed analysis of these special cases. In particular, we prove that these cases correspond to degenerate processes. △ Less

Submitted 28 May, 2018; originally announced May 2018.

Report number: VTeX-VMSTA-VMSTA91 MSC Class: 60G10; 62M10

Journal ref: Modern Stochastics: Theory and Applications 2019, Vol. 6, No. 2, 195-207

arXiv:1804.03047 [pdf, other]

Positive definite functions on semilattices

Authors: Vesa Kaarnioja, Pentti Haukkanen, Pauliina Ilmonen, Mika Mattila

Abstract: We introduce a notion of positive definiteness for functions $f\!:P\to\mathbb{R}$ defined on meet semilattices $(P,\preceq,\wedge)$ and prove several properties for these functions. In addition, we utilize the $LDL^{\rm T}$ decomposition of meet matrices in order to explore the properties of multivariate positive definite arithmetic functions $f\!:\mathbb{Z}_+^d\to\mathbb{R}$. Finally, we give a s… ▽ More We introduce a notion of positive definiteness for functions $f\!:P\to\mathbb{R}$ defined on meet semilattices $(P,\preceq,\wedge)$ and prove several properties for these functions. In addition, we utilize the $LDL^{\rm T}$ decomposition of meet matrices in order to explore the properties of multivariate positive definite arithmetic functions $f\!:\mathbb{Z}_+^d\to\mathbb{R}$. Finally, we give a series of examples and counterexamples of positive definite functions. △ Less

Submitted 27 April, 2020; v1 submitted 9 April, 2018; originally announced April 2018.

Comments: 12 pages, 1 figure

MSC Class: 06A12; 11A25; 11C20; 15A69; 15B36

arXiv:1708.07446 [pdf, ps, other]

doi 10.15559/17-VMSTA91

On model fitting and estimation of strictly stationary processes

Authors: Marko Voutilainen, Lauri Viitasaari, Pauliina Ilmonen

Abstract: Stationary processes have been extensively studied in the literature. Their applications include modeling and forecasting numerous real life phenomena such as natural disasters, sales and market movements. When stationary processes are considered, modeling is traditionally based on fitting an autoregressive moving average (ARMA) process. However, we challenge this conventional approach. Instead of… ▽ More Stationary processes have been extensively studied in the literature. Their applications include modeling and forecasting numerous real life phenomena such as natural disasters, sales and market movements. When stationary processes are considered, modeling is traditionally based on fitting an autoregressive moving average (ARMA) process. However, we challenge this conventional approach. Instead of fitting an ARMA model, we apply an AR(1) characterization in modeling any strictly stationary processes. Moreover, we derive consistent and asymptotically normal estimators of the corresponding model parameter. △ Less

Submitted 9 January, 2018; v1 submitted 24 August, 2017; originally announced August 2017.

Comments: Published at https://doi.org/10.15559/17-VMSTA91 in the Modern Stochastics: Theory and Applications (https://www.i-journals.org/vtxpp/VMSTA) by VTeX (http://www.vtex.lt/)

Report number: VTeX-VMSTA-VMSTA91

Journal ref: Modern Stochastics: Theory and Applications 2017, Vol. 4, No. 4, 381-406

arXiv:1707.09490 [pdf, ps, other]

On modeling weakly stationary processes

Authors: Lauri Viitasaari, Pauliina Ilmonen

Abstract: In this article, we show that a general class of weakly stationary time series can be modeled applying Gaussian subordinated processes. We show that, for any given weakly stationary time series $(z_t)_{z\in\mathbb{N}}$ with given equal one-dimensional marginal distribution, one can always construct a function $f$ and a Gaussian process $(X_t)_{t\in\mathbb{N}}$ such that… ▽ More In this article, we show that a general class of weakly stationary time series can be modeled applying Gaussian subordinated processes. We show that, for any given weakly stationary time series $(z_t)_{z\in\mathbb{N}}$ with given equal one-dimensional marginal distribution, one can always construct a function $f$ and a Gaussian process $(X_t)_{t\in\mathbb{N}}$ such that $\left(f(X_t)\right)_{t\in\mathbb{N}}$ has the same marginal distributions and, asymptotically, the same autocovariance function as $(z_t)_{t\in\mathbb{N}}$. Consequently, we obtain asymptotic distributions for the mean and autocovariance estimators by using the rich theory on limit theorems for Gaussian subordinated processes. This highlights the role of Gaussian subordinated processes in modeling general weakly stationary time series. We compare our approach to standard linear models, and show that our model is more flexible and requires weaker assumptions. △ Less

Submitted 23 October, 2019; v1 submitted 29 July, 2017; originally announced July 2017.

MSC Class: 60G10; 62M10; 60F05

arXiv:1705.05169 [pdf, other]

doi 10.1016/j.laa.2017.09.023

Generalized eigenvalue problems for meet and join matrices on semilattices

Authors: Pauliina Ilmonen, Vesa Kaarnioja

Abstract: We study generalized eigenvalue problems for meet and join matrices with respect to incidence functions on semilattices. We provide new bounds for generalized eigenvalues of meet matrices with respect to join matrices under very general assumptions. The applied methodology is flexible, and it is shown in the case of GCD and LCM matrices that even sharper bounds can be obtained by applying the know… ▽ More We study generalized eigenvalue problems for meet and join matrices with respect to incidence functions on semilattices. We provide new bounds for generalized eigenvalues of meet matrices with respect to join matrices under very general assumptions. The applied methodology is flexible, and it is shown in the case of GCD and LCM matrices that even sharper bounds can be obtained by applying the known properties of the divisor lattice. These results can also be easily modified for the dual problem of eigenvalues of join matrices with respect to meet matrices, which we briefly consider as well. We investigate the effectiveness of the obtained bounds for select examples involving number-theoretical lattices. △ Less

Submitted 13 June, 2017; v1 submitted 15 May, 2017; originally announced May 2017.

Comments: 18 pages, 2 figures

MSC Class: 06A12; 15A18; 15B36; 11C20

arXiv:1705.05163 [pdf, ps, other]

Computation of extremal eigenvalues of high-dimensional lattice-theoretic tensors via tensor-train decompositions

Authors: Harri Hakula, Pauliina Ilmonen, Vesa Kaarnioja

Abstract: This paper lies in the intersection of several fields: number theory, lattice theory, multilinear algebra, and scientific computing. We adapt existing solution algorithms for tensor eigenvalue problems to the tensor-train framework. As an application, we consider eigenvalue problems associated with a class of lattice-theoretic meet and join tensors, which may be regarded as multidimensional extens… ▽ More This paper lies in the intersection of several fields: number theory, lattice theory, multilinear algebra, and scientific computing. We adapt existing solution algorithms for tensor eigenvalue problems to the tensor-train framework. As an application, we consider eigenvalue problems associated with a class of lattice-theoretic meet and join tensors, which may be regarded as multidimensional extensions of the classically studied meet and join matrices such as GCD and LCM matrices, respectively. In order to effectively apply the solution algorithms, we show that meet tensors have an explicit low-rank tensor-train decomposition with sparse tensor-train cores with respect to the dimension. Moreover, this representation is independent of tensor order, which eliminates the so-called curse of dimensionality from the numerical analysis of these objects and makes the solution of tensor eigenvalue problems tractable with increasing dimensionality and order. For LCM tensors it is shown that a tensor-train decomposition with an a priori known TT rank exists under certain assumptions. We present a series of easily reproducible numerical examples covering tensor eigenvalue and generalized eigenvalue problems that serve as future benchmarks. The numerical results are used to assess the sharpness of existing theoretical estimates. △ Less

Submitted 4 October, 2017; v1 submitted 15 May, 2017; originally announced May 2017.

Comments: 23 pages, 7 figures

MSC Class: 15A69; 15A18; 15B36; 11C20; 06A12

arXiv:1511.08627 [pdf, ps, other]

On Asymptotic Properties of the Separating Hill Estimator

Authors: Matias Heikkilä, Yves Dominicy, Pauliina Ilmonen

Abstract: Modeling and understanding multivariate extreme events is challenging, but of great importance in various applications - e.g. in biostatistics, climatology, and finance. The separating Hill estimator can be used in estimating the extreme value index of a heavy tailed multivariate elliptical distribution. We consider the asymptotic behavior of the separating Hill estimator under estimated location… ▽ More Modeling and understanding multivariate extreme events is challenging, but of great importance in various applications - e.g. in biostatistics, climatology, and finance. The separating Hill estimator can be used in estimating the extreme value index of a heavy tailed multivariate elliptical distribution. We consider the asymptotic behavior of the separating Hill estimator under estimated location and scatter. The asymptotic properties of the separating Hill estimator are known under elliptical distribution with known location and scatter. However, the effect of estimation of the location and scatter has previously been examined only in a simulation study. We show, analytically, that the separating Hill estimator is consistent and asymptotically normal under estimated location and scatter, when certain mild conditions are met. △ Less

Submitted 12 January, 2016; v1 submitted 27 November, 2015; originally announced November 2015.

MSC Class: 60G70; 62H12

arXiv:1202.5159 [pdf, ps, other]

doi 10.1214/11-AOS906

Semiparametrically efficient inference based on signed ranks in symmetric independent component models

Authors: Pauliina Ilmonen, Davy Paindaveine

Abstract: We consider semiparametric location-scatter models for which the $p$-variate observation is obtained as $X=ΛZ+μ$, where $μ$ is a $p$-vector, $Λ$ is a full-rank $p\times p$ matrix and the (unobserved) random $p$-vector $Z$ has marginals that are centered and mutually independent but are otherwise unspecified. As in blind source separation and independent component analysis (ICA), the parameter of i… ▽ More We consider semiparametric location-scatter models for which the $p$-variate observation is obtained as $X=ΛZ+μ$, where $μ$ is a $p$-vector, $Λ$ is a full-rank $p\times p$ matrix and the (unobserved) random $p$-vector $Z$ has marginals that are centered and mutually independent but are otherwise unspecified. As in blind source separation and independent component analysis (ICA), the parameter of interest throughout the paper is $Λ$. On the basis of $n$ i.i.d. copies of $X$, we develop, under a symmetry assumption on $Z$, signed-rank one-sample testing and estimation procedures for $Λ$. We exploit the uniform local and asymptotic normality (ULAN) of the model to define signed-rank procedures that are semiparametrically efficient under correctly specified densities. Yet, as is usual in rank-based inference, the proposed procedures remain valid (correct asymptotic size under the null, for hypothesis testing, and root-$n$ consistency, for point estimation) under a very broad range of densities. We derive the asymptotic properties of the proposed procedures and investigate their finite-sample behavior through simulations. △ Less

Submitted 23 February, 2012; originally announced February 2012.

Comments: Published in at http://dx.doi.org/10.1214/11-AOS906 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOS-AOS906

Journal ref: Annals of Statistics 2011, Vol. 39, No. 5, 2448-2476

Showing 1–21 of 21 results for author: Ilmonen, P