Search | arXiv e-print repository

Estimating Factor-Based Spot Volatility Matrices with Noisy and Asynchronous High-Frequency Data

Authors: Degui Li, Oliver Linton, Haoxuan Zhang

Abstract: We propose a new estimator of high-dimensional spot volatility matrices satisfying a low-rank plus sparse structure from noisy and asynchronous high-frequency data collected for an ultra-large number of assets. The noise processes are allowed to be temporally correlated, heteroskedastic, asymptotically vanishing and dependent on the efficient prices. We define a kernel-weighted pre-averaging metho… ▽ More We propose a new estimator of high-dimensional spot volatility matrices satisfying a low-rank plus sparse structure from noisy and asynchronous high-frequency data collected for an ultra-large number of assets. The noise processes are allowed to be temporally correlated, heteroskedastic, asymptotically vanishing and dependent on the efficient prices. We define a kernel-weighted pre-averaging method to jointly tackle the microstructure noise and asynchronicity issues, and we obtain uniformly consistent estimates for latent prices. We impose a continuous-time factor model with time-varying factor loadings on the price processes, and estimate the common factors and loadings via a local principal component analysis. Assuming a uniform sparsity condition on the idiosyncratic volatility structure, we combine the POET and kernel-smoothing techniques to estimate the spot volatility matrices for both the latent prices and idiosyncratic errors. Under some mild restrictions, the estimated spot volatility matrices are shown to be uniformly consistent under various matrix norms. We provide Monte-Carlo simulation and empirical studies to examine the numerical performance of the developed estimation methodology. △ Less

Submitted 10 March, 2024; originally announced March 2024.

arXiv:2307.01348 [pdf, other]

Nonparametric Estimation of Large Spot Volatility Matrices for High-Frequency Financial Data

Authors: Ruijun Bu, Degui Li, Oliver Linton, Hanchao Wang

Abstract: In this paper, we consider estimating spot/instantaneous volatility matrices of high-frequency data collected for a large number of assets. We first combine classic nonparametric kernel-based smoothing with a generalised shrinkage technique in the matrix estimation for noise-free data under a uniform sparsity assumption, a natural extension of the approximate sparsity commonly used in the literatu… ▽ More In this paper, we consider estimating spot/instantaneous volatility matrices of high-frequency data collected for a large number of assets. We first combine classic nonparametric kernel-based smoothing with a generalised shrinkage technique in the matrix estimation for noise-free data under a uniform sparsity assumption, a natural extension of the approximate sparsity commonly used in the literature. The uniform consistency property is derived for the proposed spot volatility matrix estimator with convergence rates comparable to the optimal minimax one. For the high-frequency data contaminated by microstructure noise, we introduce a localised pre-averaging estimation method that reduces the effective magnitude of the noise. We then use the estimation tool developed in the noise-free scenario, and derive the uniform convergence rates for the developed spot volatility matrix estimator. We further combine the kernel smoothing with the shrinkage technique to estimate the time-varying volatility matrix of the high-dimensional noise vector. In addition, we consider large spot volatility matrix estimation in time-varying factor models with observable risk factors and derive the uniform convergence property. We provide numerical studies including simulation and empirical application to examine the performance of the proposed estimation methods in finite samples. △ Less

Submitted 3 July, 2023; originally announced July 2023.

arXiv:2302.02476 [pdf, other]

Estimating Time-Varying Networks for High-Dimensional Time Series

Authors: Jia Chen, Degui Li, Yuning Li, Oliver Linton

Abstract: We explore time-varying networks for high-dimensional locally stationary time series, using the large VAR model framework with both the transition and (error) precision matrices evolving smoothly over time. Two types of time-varying graphs are investigated: one containing directed edges of Granger causality linkages, and the other containing undirected edges of partial correlation linkages. Under… ▽ More We explore time-varying networks for high-dimensional locally stationary time series, using the large VAR model framework with both the transition and (error) precision matrices evolving smoothly over time. Two types of time-varying graphs are investigated: one containing directed edges of Granger causality linkages, and the other containing undirected edges of partial correlation linkages. Under the sparse structural assumption, we propose a penalised local linear method with time-varying weighted group LASSO to jointly estimate the transition matrices and identify their significant entries, and a time-varying CLIME method to estimate the precision matrices. The estimated transition and precision matrices are then used to determine the time-varying network structures. Under some mild conditions, we derive the theoretical properties of the proposed estimates including the consistency and oracle properties. In addition, we extend the methodology and theory to cover highly-correlated large-scale time series, for which the sparsity assumption becomes invalid and we allow for common factors before estimating the factor-adjusted time-varying networks. We provide extensive simulation studies and an empirical application to a large U.S. macroeconomic dataset to illustrate the finite-sample performance of our methods. △ Less

Submitted 5 February, 2023; originally announced February 2023.

arXiv:2210.08228 [pdf, other]

Nonparametric Estimation of Mediation Effects with A General Treatment

Authors: Lukang Huang, Wei Huang, Oliver Linton, Zheng Zhang

Abstract: To investigate causal mechanisms, causal mediation analysis decomposes the total treatment effect into the natural direct and indirect effects. This paper examines the estimation of the direct and indirect effects in a general treatment effect model, where the treatment can be binary, multi-valued, continuous, or a mixture. We propose generalized weighting estimators with weights estimated by solv… ▽ More To investigate causal mechanisms, causal mediation analysis decomposes the total treatment effect into the natural direct and indirect effects. This paper examines the estimation of the direct and indirect effects in a general treatment effect model, where the treatment can be binary, multi-valued, continuous, or a mixture. We propose generalized weighting estimators with weights estimated by solving an expanding set of equations. Under some sufficient conditions, we show that the proposed estimators are consistent and asymptotically normal. Specifically, when the treatment is discrete, the proposed estimators attain the semiparametric efficiency bounds. Meanwhile, when the treatment is continuous, the convergence rates of the proposed estimators are slower than $N^{-1/2}$; however, they are still more efficient than that constructed from the true weighting function. A simulation study reveals that our estimators exhibit a satisfactory finite-sample performance, while an application shows their practical value △ Less

Submitted 22 January, 2024; v1 submitted 15 October, 2022; originally announced October 2022.

arXiv:2201.13004 [pdf, ps, other]

Improving Estimation Efficiency via Regression-Adjustment in Covariate-Adaptive Randomizations with Imperfect Compliance

Authors: Liang Jiang, Oliver B. Linton, Haihan Tang, Yichong Zhang

Abstract: We investigate how to improve efficiency using regression adjustments with covariates in covariate-adaptive randomizations (CARs) with imperfect subject compliance. Our regression-adjusted estimators, which are based on the doubly robust moment for local average treatment effects, are consistent and asymptotically normal even with heterogeneous probability of assignment and misspecified regression… ▽ More We investigate how to improve efficiency using regression adjustments with covariates in covariate-adaptive randomizations (CARs) with imperfect subject compliance. Our regression-adjusted estimators, which are based on the doubly robust moment for local average treatment effects, are consistent and asymptotically normal even with heterogeneous probability of assignment and misspecified regression adjustments. We propose an optimal but potentially misspecified linear adjustment and its further improvement via a nonlinear adjustment, both of which lead to more efficient estimators than the one without adjustments. We also provide conditions for nonparametric and regularized adjustments to achieve the semiparametric efficiency bound under CARs. △ Less

Submitted 16 June, 2023; v1 submitted 31 January, 2022; originally announced January 2022.

Comments: The paper is previously circulated under the title "Regression Adjustments under Covariate-Adaptive Randomizations with Imperfect Compliance." 94 pages

arXiv:1903.01459 [pdf, other]

Multiscale clustering of nonparametric regression curves

Authors: Michael Vogt, Oliver Linton

Abstract: In a wide range of modern applications, we observe a large number of time series rather than only a single one. It is often natural to suppose that there is some group structure in the observed time series. When each time series is modelled by a nonparametric regression equation, one may in particular assume that the observed time series can be partitioned into a small number of groups whose membe… ▽ More In a wide range of modern applications, we observe a large number of time series rather than only a single one. It is often natural to suppose that there is some group structure in the observed time series. When each time series is modelled by a nonparametric regression equation, one may in particular assume that the observed time series can be partitioned into a small number of groups whose members share the same nonparametric regression function. We develop a bandwidth-free clustering method to estimate the unknown group structure from the data. More precisely speaking, we construct multiscale estimators of the unknown groups and their unknown number which are free of classical bandwidth or smoothing parameters. In the theoretical part of the paper, we analyze the statistical properties of our estimators. Our theoretical results are derived under general conditions which allow the data to be dependent both in time series direction and across different time series. The technical analysis of the paper is complemented by a simulation study and a real-data application. △ Less

Submitted 4 March, 2019; originally announced March 2019.

MSC Class: 62G08; 62G20; 62H30

arXiv:1801.04202 [pdf, other]

A Simple and Efficient Estimation Method for Models with Nonignorable Missing Data

Authors: Chunrong Ai, Oliver Linton, Zheng Zhang

Abstract: This paper proposes a simple and efficient estimation procedure for the model with non-ignorable missing data studied by Morikawa and Kim (2016). Their semiparametrically efficient estimator requires explicit nonparametric estimation and so suffers from the curse of dimensionality and requires a bandwidth selection. We propose an estimation method based on the Generalized Method of Moments (hereaf… ▽ More This paper proposes a simple and efficient estimation procedure for the model with non-ignorable missing data studied by Morikawa and Kim (2016). Their semiparametrically efficient estimator requires explicit nonparametric estimation and so suffers from the curse of dimensionality and requires a bandwidth selection. We propose an estimation method based on the Generalized Method of Moments (hereafter GMM). Our method is consistent and asymptotically normal regardless of the number of moments chosen. Furthermore, if the number of moments increases appropriately our estimator can achieve the semiparametric efficiency bound derived in Morikawa and Kim (2016), but under weaker regularity conditions. Moreover, our proposed estimator and its consistent covariance matrix are easily computed with the widely available GMM package. We propose two data-based methods for selection of the number of moments. A small scale simulation study reveals that the proposed estimation indeed out-performs the existing alternatives in finite samples. △ Less

Submitted 12 January, 2018; originally announced January 2018.

arXiv:1708.09507 [pdf, ps, other]

Estimation in Semiparametric Quantile Factor Models

Authors: Shujie Ma, Oliver Linton, Jiti Gao

Abstract: We propose an estimation methodology for a semiparametric quantile factor panel model. We provide tools for inference that are robust to the existence of moments and to the form of weak cross-sectional dependence in the idiosyncratic error term. We apply our method to daily stock return data. We propose an estimation methodology for a semiparametric quantile factor panel model. We provide tools for inference that are robust to the existence of moments and to the form of weak cross-sectional dependence in the idiosyncratic error term. We apply our method to daily stock return data. △ Less

Submitted 30 August, 2017; originally announced August 2017.

arXiv:1604.06380 [pdf, ps, other]

Asymptotic properties of a Nadaraya-Watson type estimator for regression functions of infinite order

Authors: Seok Young Hong, Oliver Linton

Abstract: We consider a class of nonparametric time series regression models in which the regressor takes values in a sequence space. Technical challenges that hampered theoretical advances in these models include the lack of associated Lebesgue density and difficulties with regard to the choice of dependence structure in the autoregressive framework. We propose an infinite-dimensional Nadaraya-Watson type… ▽ More We consider a class of nonparametric time series regression models in which the regressor takes values in a sequence space. Technical challenges that hampered theoretical advances in these models include the lack of associated Lebesgue density and difficulties with regard to the choice of dependence structure in the autoregressive framework. We propose an infinite-dimensional Nadaraya-Watson type estimator, and investigate its asymptotic properties in detail under both static regressive and autoregressive contexts, aiming to answer the open questions left by Linton and Sancetta (2009). First we show pointwise consistency of the estimator under a set of mild regularity conditions. Furthermore, the asymptotic normality of the estimator is established, and then its uniform strong consistency is shown over a compact set of logarithmically increasing dimension with respect to $α$-mixing and near epoch dependent (NED) samples. We specify the explicit rates of convergence in terms of the Lambert W function, and show that the optimal rate is of logarithmic order, confirming the existence of the curse of infinite dimensionality. △ Less

Submitted 21 April, 2016; originally announced April 2016.

Comments: 36 pages

MSC Class: 62G

arXiv:1402.1937 [pdf, ps, other]

The Cross-Quantilogram: Measuring Quantile Dependence and Testing Directional Predictability between Time Series

Authors: Heejoon Han, Oliver Linton, Tatsushi Oka, Yoon-Jae Whang

Abstract: This paper proposes the cross-quantilogram to measure the quantile dependence between two time series. We apply it to test the hypothesis that one time series has no directional predictability to another time series. We establish the asymptotic distribution of the cross quantilogram and the corresponding test statistic. The limiting distributions depend on nuisance parameters. To construct consist… ▽ More This paper proposes the cross-quantilogram to measure the quantile dependence between two time series. We apply it to test the hypothesis that one time series has no directional predictability to another time series. We establish the asymptotic distribution of the cross quantilogram and the corresponding test statistic. The limiting distributions depend on nuisance parameters. To construct consistent confidence intervals we employ the stationary bootstrap procedure; we show the consistency of this bootstrap. Also, we consider the self-normalized approach, which is shown to be asymptotically pivotal under the null hypothesis of no predictability. We provide simulation studies and two empirical applications. First, we use the cross-quantilogram to detect predictability from stock variance to excess stock return. Compared to existing tools used in the literature of stock return predictability, our method provides a more complete relationship between a predictor and stock return. Second, we investigate the systemic risk of individual financial institutions, such as JP Morgan Chase, Goldman Sachs and AIG. This article has supplementary materials online. △ Less

Submitted 20 January, 2018; v1 submitted 9 February, 2014; originally announced February 2014.

Journal ref: Journal of Econometrics, 2016, 193, 251-270

arXiv:0709.1663 [pdf, ps, other]

Uniform Bahadur Representation for Local Polynomial Estimates of M-Regression and Its Application to The Additive Model

Authors: Efang Kong, Oliver Linton, Yingcun Xia

Abstract: We use local polynomial fitting to estimate the nonparametric M-regression function for strongly mixing stationary processes $\{(Y_{i},\underline{X}_{i})\}$. We establish a strong uniform consistency rate for the Bahadur representation of estimators of the regression function and its derivatives. These results are fundamental for statistical inference and for applications that involve plugging i… ▽ More We use local polynomial fitting to estimate the nonparametric M-regression function for strongly mixing stationary processes $\{(Y_{i},\underline{X}_{i})\}$. We establish a strong uniform consistency rate for the Bahadur representation of estimators of the regression function and its derivatives. These results are fundamental for statistical inference and for applications that involve plugging in such estimators into other functionals where some control over higher order terms are required. We apply our results to the estimation of an additive M-regression model. △ Less

Submitted 29 November, 2007; v1 submitted 11 September, 2007; originally announced September 2007.

Comments: 40 pages

Showing 1–11 of 11 results for author: Linton, O