-
Provably Convergent and Robust Newton-Raphson Method: A New Dawn in Primitive Variable Recovery for Relativistic MHD
Authors:
Chaoyi Cai,
Jianxian Qiu,
Kailiang Wu
Abstract:
A long-standing and formidable challenge faced by all conservative schemes for relativistic magnetohydrodynamics (RMHD) is the recovery of primitive variables from conservative ones. This process involves solving highly nonlinear equations subject to physical constraints. An ideal solver should be "robust, accurate, and fast -- it is at the heart of all conservative RMHD schemes," as emphasized in [S.C. Noble et al., ApJ, 641:626-637, 2006]. Despite over three decades of research, seeking efficient solvers that can provably guarantee stability and convergence remains an open problem.
This paper presents the first theoretical analysis for designing a robust, physical-constraint-preserving (PCP), and provably (quadratically) convergent Newton-Raphson (NR) method for primitive variable recovery in RMHD. Our key innovation is a unified approach for the initial guess, devised based on sophisticated analysis. It ensures that the NR iteration consistently converges and adheres to physical constraints. Given the extreme nonlinearity and complexity of the iterative function, the theoretical analysis is highly nontrivial and technical. We discover a pivotal inequality for delineating the convexity and concavity of the iterative function and establish theories to guarantee the PCP property and convergence. We also develop theories to determine a computable initial guess within a theoretical "safe" interval. Intriguingly, we find that the unique positive root of a cubic polynomial always falls within this interval. Our PCP NR method is versatile and can be seamlessly integrated into any RMHD scheme that requires the recovery of primitive variables, potentially leading to a broad impact in this field. As an application, we incorporate it into a discontinuous Galerkin method, resulting in fully PCP schemes. Several numerical experiments demonstrate the efficiency and robustness of the PCP NR method.
Submitted 8 April, 2024;
originally announced April 2024.
-
Scaling limit of heavy tailed nearly unstable cumulative INAR($\infty$) processes and rough fractional diffusions
Authors:
Yingli Wang,
Chunhao Cai,
** He
Abstract:
In this paper, we investigate the scaling limit of heavy-tailed unstable cumulative INAR($\infty$) processes. These processes exhibit a power-law tail of the form $n^{-(1+α)}$, with $α\in (\frac{1}{2}, 1)$, where the $\ell^1$ norm of the kernel vector is close to $1$. The result is in contrast to the scaling limits of continuous-time heavy-tailed unstable Hawkes processes and of INAR($p$) processes. We show that the discrete-time scaling limit also has the long-memory property and can be seen as an integrated fractional Cox-Ingersoll-Ross process.
Submitted 16 April, 2024; v1 submitted 18 March, 2024;
originally announced March 2024.
-
Provably convergent Newton-Raphson methods for recovering primitive variables with applications to physical-constraint-preserving Hermite WENO schemes for relativistic hydrodynamics
Authors:
Chaoyi Cai,
Jianxian Qiu,
Kailiang Wu
Abstract:
The relativistic hydrodynamics (RHD) equations have three crucial intrinsic physical constraints on the primitive variables: positivity of pressure and density, and subluminal fluid velocity. However, numerical simulations can violate these constraints, leading to nonphysical results or even simulation failure. Designing genuinely physical-constraint-preserving (PCP) schemes is very difficult, as the primitive variables cannot be explicitly reformulated using conservative variables due to relativistic effects. In this paper, we propose three efficient Newton--Raphson (NR) methods for robustly recovering primitive variables from conservative variables. Importantly, we rigorously prove that these NR methods are always convergent and PCP, meaning they preserve the physical constraints throughout the NR iterations. The discovery of these robust NR methods and their PCP convergence analyses are highly nontrivial and technical. As an application, we apply the proposed NR methods to design PCP finite volume Hermite weighted essentially non-oscillatory (HWENO) schemes for solving the RHD equations. Our PCP HWENO schemes incorporate high-order HWENO reconstruction, a PCP limiter, and strong-stability-preserving time discretization. We rigorously prove the PCP property of the fully discrete schemes using convex decomposition techniques. Moreover, we suggest the characteristic decomposition with rescaled eigenvectors and scale-invariant nonlinear weights to enhance the performance of the HWENO schemes in simulating large-scale RHD problems. Several demanding numerical tests are conducted to demonstrate the robustness, accuracy, and high resolution of the proposed PCP HWENO schemes and to validate the efficiency of our NR methods.
Submitted 24 May, 2023;
originally announced May 2023.
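To make the recovery problem in this abstract concrete, here is a minimal sketch of a textbook pressure-based Newton-Raphson recovery for 1D special-relativistic hydrodynamics with an ideal-gas equation of state. It is not one of the three NR methods proposed in the paper and carries none of their convergence guarantees. The function names, the starting guess $p=0$, and the clamping of the iterate to nonnegative values are assumptions made for illustration.

```python
import math


def conservative_from_primitive(rho, v, p, gamma):
    """Map primitive (rho, v, p) to conservative (D, m, E) in 1D, with c = 1."""
    W = 1.0 / math.sqrt(1.0 - v * v)             # Lorentz factor
    h = 1.0 + gamma / (gamma - 1.0) * p / rho    # specific enthalpy (ideal gas)
    return rho * W, rho * h * W * W * v, rho * h * W * W - p


def recover_primitive(D, m, E, gamma, tol=1e-13, max_iter=50):
    """Newton-Raphson on f(p) = (gamma - 1) * rho * eps(p) - p.

    Assumes an admissible state with E > |m|, so that |v| < 1 at p = 0.
    """
    p = 0.0  # assumed starting guess, not the paper's analysed initial value
    for _ in range(max_iter):
        v = m / (E + p)
        one_minus_v2 = 1.0 - v * v
        W = 1.0 / math.sqrt(one_minus_v2)
        # Internal energy density rho * eps implied by the trial pressure.
        rho_eps = (E + p) * one_minus_v2 - D / W - p
        f = (gamma - 1.0) * rho_eps - p
        df = (gamma - 1.0) * v * v * (1.0 - D * W / (E + p)) - 1.0
        p_new = max(p - f / df, 0.0)  # crude guard against negative pressure
        converged = abs(p_new - p) <= tol * max(1.0, p_new)
        p = p_new
        if converged:
            break
    v = m / (E + p)
    rho = D * math.sqrt(1.0 - v * v)
    return rho, v, p


# Round trip: primitive -> conservative -> primitive.
D, m, E = conservative_from_primitive(1.0, 0.5, 1.0, 5.0 / 3.0)
rho, v, p = recover_primitive(D, m, E, 5.0 / 3.0)
```

The round trip at the end is the usual sanity check for such a solver: the recovered triple should match the primitive state that generated the conservative variables.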
-
Transfer Learning for Contextual Multi-armed Bandits
Authors:
Changxiao Cai,
T. Tony Cai,
Hongzhe Li
Abstract:
Motivated by a range of applications, we study in this paper the problem of transfer learning for nonparametric contextual multi-armed bandits under the covariate shift model, where we have data collected on source bandits before the start of the target bandit learning. The minimax rate of convergence for the cumulative regret is established and a novel transfer learning algorithm that attains the minimax regret is proposed. The results quantify the contribution of the data from the source domains for learning in the target domain in the context of nonparametric contextual multi-armed bandits.
In view of the general impossibility of adaptation to unknown smoothness, we develop a data-driven algorithm that achieves near-optimal statistical guarantees (up to a logarithmic factor) while automatically adapting to the unknown parameters over a large collection of parameter spaces under an additional self-similarity assumption. A simulation study is carried out to illustrate the benefits of utilizing the data from the auxiliary source domains for learning in the target domain.
Submitted 24 January, 2024; v1 submitted 22 November, 2022;
originally announced November 2022.
-
Asymptotics of Karhunen-Loève Eigenvalues for sub-fractional Brownian motion and its application
Authors:
Jun-Qi Hu,
Ying-Li Wang,
Chun-Hao Cai
Abstract:
In the present paper, the Karhunen-Loève eigenvalues for a sub-fractional Brownian motion are considered in the case of $H>\frac12$. Rigorous large $n$ asymptotics for those eigenvalues are shown, based on a functional analysis method. By virtue of these asymptotics, along with some standard large deviations results, asymptotic estimates for the closely related problem of small $L^2$-ball probabilities for a sub-fractional Brownian motion are derived. In addition, asymptotic analysis of the Karhunen-Loève eigenvalues for the corresponding "derivative" process is also established.
Submitted 13 October, 2021; v1 submitted 7 October, 2021;
originally announced October 2021.
-
On the continuity of optimal stopping surfaces for jump-diffusions
Authors:
Cheng Cai,
Tiziano De Angelis,
Jan Palczewski
Abstract:
We show that optimal stopping surfaces $(t,y)\mapsto x_*(t,y)$ arising from time-inhomogeneous optimal stopping problems on two-dimensional jump-diffusions $(X,Y)$ are continuous (jointly in time and space) under mild monotonicity and regularity assumptions of local nature.
Submitted 7 June, 2022; v1 submitted 22 September, 2021;
originally announced September 2021.
-
The American put with finite-time maturity and stochastic interest rate
Authors:
Cheng Cai,
Tiziano De Angelis,
Jan Palczewski
Abstract:
In this paper we study pricing of American put options on the Black and Scholes market with a stochastic interest rate and finite-time maturity. We prove that the option value is a $C^1$ function of the initial time, interest rate and stock price. By means of Itô calculus we rigorously derive the option value's early exercise premium formula and the associated hedging portfolio. We prove the existence of an optimal exercise boundary splitting the state space into continuation and stopping regions. The boundary has a parametrisation as a jointly continuous function of time and stock price, and it is the unique solution to an integral equation which we compute numerically. Our results hold for a large class of interest rate models including CIR and Vasicek models. We show a numerical study of the option price and the optimal exercise boundary for the Vasicek model.
Submitted 5 February, 2024; v1 submitted 17 April, 2021;
originally announced April 2021.
-
A change of variable formula with applications to multi-dimensional optimal stopping problems
Authors:
Cheng Cai,
Tiziano De Angelis
Abstract:
We derive a change of variable formula for $C^1$ functions $U:\mathbb{R}_+\times\mathbb{R}^m\to\mathbb{R}$ whose second order spatial derivatives may explode and not be integrable in the neighbourhood of a surface $b:\mathbb{R}_+\times\mathbb{R}^{m-1}\to \mathbb{R}$ that splits the state space into two sets $\mathcal{C}$ and $\mathcal{D}$. The formula is tailored for applications in problems of optimal stopping where it is generally very hard to control the second order derivatives of the value function near the optimal stopping boundary. Unlike other existing papers on similar topics, we only require that the surface $b$ be monotonic in each variable, and we formally obtain the same expression as the classical Itô's formula.
Submitted 6 July, 2023; v1 submitted 12 April, 2021;
originally announced April 2021.
-
Minimax Estimation of Linear Functions of Eigenvectors in the Face of Small Eigen-Gaps
Authors:
Gen Li,
Changxiao Cai,
H. Vincent Poor,
Yuxin Chen
Abstract:
Eigenvector perturbation analysis plays a vital role in various data science applications. A large body of prior work, however, has focused on establishing $\ell_{2}$ eigenvector perturbation bounds, which are often highly inadequate in addressing tasks that rely on fine-grained behavior of an eigenvector. This paper makes progress on this by studying the perturbation of linear functions of an unknown eigenvector. Focusing on two fundamental problems -- matrix denoising and principal component analysis -- in the presence of Gaussian noise, we develop a suite of statistical theory that characterizes the perturbation of arbitrary linear functions of an unknown eigenvector. In order to mitigate a non-negligible bias issue inherent to the natural "plug-in" estimator, we develop de-biased estimators that (1) achieve minimax lower bounds for a family of scenarios (modulo some logarithmic factor), and (2) can be computed in a data-driven manner without sample splitting. Notably, the proposed estimators are nearly minimax optimal even when the associated eigen-gap is substantially smaller than what is required in prior statistical theory.
Submitted 4 July, 2022; v1 submitted 7 April, 2021;
originally announced April 2021.
-
Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis
Authors:
Gen Li,
Changxiao Cai,
Yuxin Chen,
Yuting Wei,
Yuejie Chi
Abstract:
Q-learning, which seeks to learn the optimal Q-function of a Markov decision process (MDP) in a model-free fashion, lies at the heart of reinforcement learning. When it comes to the synchronous setting (such that independent samples for all state-action pairs are drawn from a generative model in each iteration), substantial progress has been made towards understanding the sample efficiency of Q-learning. Consider a $γ$-discounted infinite-horizon MDP with state space $\mathcal{S}$ and action space $\mathcal{A}$: to yield an entrywise $\varepsilon$-approximation of the optimal Q-function, state-of-the-art theory for Q-learning requires a sample size exceeding the order of $\frac{|\mathcal{S}||\mathcal{A}|}{(1-γ)^5\varepsilon^{2}}$, which fails to match existing minimax lower bounds. This gives rise to natural questions: what is the sharp sample complexity of Q-learning? Is Q-learning provably sub-optimal? This paper addresses these questions for the synchronous setting: (1) when $|\mathcal{A}|=1$ (so that Q-learning reduces to TD learning), we prove that the sample complexity of TD learning is minimax optimal and scales as $\frac{|\mathcal{S}|}{(1-γ)^3\varepsilon^2}$ (up to log factor); (2) when $|\mathcal{A}|\geq 2$, we settle the sample complexity of Q-learning to be on the order of $\frac{|\mathcal{S}||\mathcal{A}|}{(1-γ)^4\varepsilon^2}$ (up to log factor). Our theory unveils the strict sub-optimality of Q-learning when $|\mathcal{A}|\geq 2$, and rigorizes the negative impact of over-estimation in Q-learning. Finally, we extend our analysis to accommodate asynchronous Q-learning (i.e., the case with Markovian samples), sharpening the horizon dependency of its sample complexity to be $\frac{1}{(1-γ)^4}$.
Submitted 17 March, 2023; v1 submitted 12 February, 2021;
originally announced February 2021.
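As an illustration of the synchronous setting described in this abstract, the following sketch runs Q-learning with a generative model on a tiny MDP and compares the result with the optimal Q-function from value iteration. The two-state MDP (`P`, `R`), the discount factor, and the rescaled linear step size $η_t = 1/(1+(1-γ)t)$ are hypothetical choices made for the example; they are not taken from the paper.

```python
import random

GAMMA = 0.5
# Hypothetical 2-state, 2-action MDP: P[s][a] is the next-state distribution.
P = [[[0.9, 0.1], [0.2, 0.8]],
     [[0.5, 0.5], [0.1, 0.9]]]
R = [[0.0, 0.5],
     [1.0, 0.2]]
N_STATES, N_ACTIONS = 2, 2


def optimal_q(iters=200):
    """Reference solution: Q-value iteration with the known model."""
    q = [[0.0] * N_ACTIONS for _ in range(N_STATES)]
    for _ in range(iters):
        q = [[R[s][a] + GAMMA * sum(P[s][a][t] * max(q[t]) for t in range(N_STATES))
              for a in range(N_ACTIONS)] for s in range(N_STATES)]
    return q


def synchronous_q_learning(num_iters, seed=0):
    """Each iteration draws one fresh sample for every state-action pair."""
    rng = random.Random(seed)
    q = [[0.0] * N_ACTIONS for _ in range(N_STATES)]
    for t in range(1, num_iters + 1):
        eta = 1.0 / (1.0 + (1.0 - GAMMA) * t)  # assumed step-size schedule
        q_prev = [row[:] for row in q]          # all updates use the old iterate
        for s in range(N_STATES):
            for a in range(N_ACTIONS):
                s_next = rng.choices(range(N_STATES), weights=P[s][a])[0]
                target = R[s][a] + GAMMA * max(q_prev[s_next])
                q[s][a] = (1.0 - eta) * q_prev[s][a] + eta * target
    return q


q_star = optimal_q()
q_hat = synchronous_q_learning(20000)
# Entrywise error, the quantity the epsilon-approximation refers to.
max_err = max(abs(q_hat[s][a] - q_star[s][a])
              for s in range(N_STATES) for a in range(N_ACTIONS))
```

The `max` over next-state values in the target is the step associated with over-estimation; with a single action it disappears and the update reduces to TD learning.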
-
A Note On Inference for the Mixed Fractional Ornstein-Uhlenbeck Process with Drift
Authors:
Chunhao Cai,
Min Zhang
Abstract:
This paper is devoted to parameter estimation of the mixed fractional Ornstein-Uhlenbeck process with a drift. Large-sample asymptotic properties of the maximum likelihood estimator are deduced using Laplace transform computations or the Cameron-Martin formula with an extra part from \cite{CK19}.
Submitted 16 January, 2021; v1 submitted 8 September, 2020;
originally announced September 2020.
-
Uncertainty quantification for nonconvex tensor completion: Confidence intervals, heteroscedasticity and optimality
Authors:
Changxiao Cai,
H. Vincent Poor,
Yuxin Chen
Abstract:
We study the distribution and uncertainty of nonconvex optimization for noisy tensor completion -- the problem of estimating a low-rank tensor given incomplete and corrupted observations of its entries. Focusing on a two-stage estimation algorithm proposed by Cai et al. (2019), we characterize the distribution of this nonconvex estimator down to fine scales. This distributional theory in turn allows one to construct valid and short confidence intervals for both the unseen tensor entries and the unknown tensor factors. The proposed inferential procedure enjoys several important features: (1) it is fully adaptive to noise heteroscedasticity, and (2) it is data-driven and automatically adapts to unknown noise distributions. Furthermore, our findings unveil the statistical optimality of nonconvex tensor completion: it attains un-improvable $\ell_{2}$ accuracy -- including both the rates and the pre-constants -- when estimating both the unknown tensor and the underlying tensor factors.
Submitted 15 June, 2020;
originally announced June 2020.
-
Maximum likelihood estimation for mixed fractional Vasicek processes
Authors:
Chunhao Cai,
Yinzhong Huang,
Weilin Xiao
Abstract:
The mixed fractional Vasicek model, which extends the traditional Vasicek model, has been widely used in modelling volatility, interest rates and exchange rates. If some phenomena are modeled by the mixed fractional Vasicek model, statistical inference for this process is of great interest. Based on continuous-time observations, this paper considers the problem of estimating the drift parameters in the mixed fractional Vasicek model. We propose maximum likelihood estimators of the drift parameters using the Radon-Nikodym derivative for a mixed fractional Brownian motion. Using the fundamental martingale and the Laplace transform, both the strong consistency and the asymptotic normality of the maximum likelihood estimators are established for all $H\in(0,1)$, $H\neq 1/2$.
Submitted 14 July, 2020; v1 submitted 30 March, 2020;
originally announced March 2020.
-
Optimal hedging of a perpetual American put with a single trade
Authors:
Cheng Cai,
Tiziano De Angelis,
Jan Palczewski
Abstract:
It is well-known that using delta hedging to hedge financial options is not feasible in practice. Traders often rely on discrete-time hedging strategies based on fixed trading times or fixed trading prices (i.e., trades only occur if the underlying asset's price reaches some predetermined values). Motivated by this insight and with the aim of obtaining explicit solutions, we consider the seller of a perpetual American put option who can hedge her portfolio once until the underlying stock price leaves a certain range of values $(a,b)$. We determine optimal trading boundaries as functions of the initial stock holding, and an optimal hedging strategy for a bond/stock portfolio. Optimality here refers to the variance of the hedging error at the (random) time when the stock leaves the interval $(a,b)$. Our study leads to analytical expressions for both the optimal boundaries and the optimal stock holding, which can be evaluated numerically with no effort.
Submitted 23 September, 2020; v1 submitted 13 March, 2020;
originally announced March 2020.
-
Elder-Rule-Staircodes for Augmented Metric Spaces
Authors:
Chen Cai,
Woojin Kim,
Facundo Memoli,
Yusu Wang
Abstract:
An augmented metric space is a metric space $(X, d_X)$ equipped with a function $f_X: X \to \mathbb{R}$. This type of data arises commonly in practice, e.g., a point cloud $X$ in $\mathbb{R}^d$ where each point $x\in X$ has a density function value $f_X(x)$ associated to it. An augmented metric space $(X, d_X, f_X)$ naturally gives rise to a 2-parameter filtration $\mathcal{K}$. However, the resulting 2-parameter persistent homology $\mathrm{H}_{\bullet}(\mathcal{K})$ could still be of wild representation type, and may not have simple indecomposables. In this paper, motivated by the elder-rule for the zeroth homology of a 1-parameter filtration, we propose a barcode-like summary, called the elder-rule-staircode, as a way to encode $\mathrm{H}_0(\mathcal{K})$. Specifically, if $n = |X|$, the elder-rule-staircode consists of $n$ staircase-like blocks in the plane. We show that if $\mathrm{H}_0(\mathcal{K})$ is interval decomposable, then the barcode of $\mathrm{H}_0(\mathcal{K})$ is equal to the elder-rule-staircode. Furthermore, regardless of the interval decomposability, the fibered barcode, the dimension function (a.k.a. the Hilbert function), and the graded Betti numbers of $\mathrm{H}_0(\mathcal{K})$ can all be efficiently computed once the elder-rule-staircode is given. Finally, we develop and implement an efficient algorithm to compute the elder-rule-staircode in $O(n^2\log n)$ time, which can be improved to $O(n^2α(n))$ if $X$ is from a fixed dimensional Euclidean space $\mathbb{R}^d$, where $α(n)$ is the inverse Ackermann function.
Submitted 12 July, 2020; v1 submitted 9 March, 2020;
originally announced March 2020.
-
KoPA: Automated Kronecker Product Approximation
Authors:
Chencheng Cai,
Rong Chen,
Han Xiao
Abstract:
We consider the problem of matrix approximation and denoising induced by the Kronecker product decomposition. Specifically, we propose to approximate a given matrix by the sum of a few Kronecker products of matrices, which we refer to as the Kronecker product approximation (KoPA). Because the Kronecker product is an extension of the outer product from vectors to matrices, KoPA extends the low rank matrix approximation, and includes it as a special case. Compared with the latter, KoPA also offers greater flexibility, since it allows the user to choose the configuration, that is, the dimensions of the two smaller matrices forming the Kronecker product. On the other hand, the configuration to be used is usually unknown, and needs to be determined from the data in order to achieve the optimal balance between accuracy and parsimony. We propose to use extended information criteria to select the configuration. Under the paradigm of high dimensional analysis, we show that the proposed procedure is able to select the true configuration with probability tending to one, under suitable conditions on the signal-to-noise ratio. We demonstrate the superiority of KoPA over the low rank approximations through numerical studies, and several benchmark image examples.
Submitted 26 August, 2020; v1 submitted 5 December, 2019;
originally announced December 2019.
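The link between Kronecker products and outer products mentioned in this abstract can be sketched for a single Kronecker term with a known configuration. Rearranging the matrix so that each block becomes one row turns $A\otimes B$ into the rank-one matrix $\mathrm{vec}(A)\,\mathrm{vec}(B)^\top$, so the best single-term fit in Frobenius norm is a leading rank-one approximation of the rearranged matrix (the classical rearrangement argument). The function names and the power iteration are assumptions for illustration; the configuration selection by extended information criteria proposed in the paper is not implemented here.

```python
import math


def kron(A, B):
    """Kronecker product of two matrices given as lists of rows."""
    m2, n2 = len(B), len(B[0])
    return [[A[i // m2][j // n2] * B[i % m2][j % n2]
             for j in range(len(A[0]) * n2)] for i in range(len(A) * m2)]


def rearrange(M, m1, n1, m2, n2):
    """Row (i1, j1) holds the flattened (m2 x n2) block of M at block position (i1, j1)."""
    return [[M[i1 * m2 + i2][j1 * n2 + j2] for i2 in range(m2) for j2 in range(n2)]
            for i1 in range(m1) for j1 in range(n1)]


def leading_rank_one(Rm, iters=100):
    """Power iteration: returns (u, v) with |u| = 1 and Rm approximately u v^T.

    Assumes the all-ones start is not orthogonal to the leading singular vector.
    """
    rows, cols = len(Rm), len(Rm[0])
    v = [1.0] * cols
    for _ in range(iters):
        u = [sum(Rm[i][j] * v[j] for j in range(cols)) for i in range(rows)]
        norm_u = math.sqrt(sum(x * x for x in u))
        u = [x / norm_u for x in u]
        v = [sum(Rm[i][j] * u[i] for i in range(rows)) for j in range(cols)]
    return u, v


def fit_single_kronecker(M, m1, n1, m2, n2):
    """Fit M by A_hat (m1 x n1) kron B_hat (m2 x n2) for a given configuration."""
    u, v = leading_rank_one(rearrange(M, m1, n1, m2, n2))
    A_hat = [u[i * n1:(i + 1) * n1] for i in range(m1)]
    B_hat = [v[i * n2:(i + 1) * n2] for i in range(m2)]
    return A_hat, B_hat


A = [[1.0, 2.0], [3.0, 4.0]]
B = [[0.5, -1.0, 2.0], [1.0, 0.0, 3.0]]
M = kron(A, B)
A_hat, B_hat = fit_single_kronecker(M, 2, 2, 2, 3)
M_hat = kron(A_hat, B_hat)  # reproduces M up to rounding in this noiseless case
```

The factors are only identified up to a scalar, so `A_hat` and `B_hat` need not equal `A` and `B` individually; only their Kronecker product is compared.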
-
Nonconvex Low-Rank Tensor Completion from Noisy Data
Authors:
Changxiao Cai,
Gen Li,
H. Vincent Poor,
Yuxin Chen
Abstract:
We study a noisy tensor completion problem of broad practical interest, namely, the reconstruction of a low-rank tensor from highly incomplete and randomly corrupted observations of its entries. While a variety of prior work has been dedicated to this problem, prior algorithms either are computationally too expensive for large-scale applications, or come with sub-optimal statistical guarantees. Focusing on "incoherent" and well-conditioned tensors of a constant CP rank, we propose a two-stage nonconvex algorithm -- (vanilla) gradient descent following a rough initialization -- that achieves the best of both worlds. Specifically, the proposed nonconvex algorithm faithfully completes the tensor and retrieves all individual tensor factors within nearly linear time, while at the same time enjoying near-optimal statistical guarantees (i.e. minimal sample complexity and optimal estimation accuracy). The estimation errors are evenly spread out across all entries, thus achieving optimal $\ell_{\infty}$ statistical accuracy. We have also discussed how to extend our approach to accommodate asymmetric tensors. The insight conveyed through our analysis of nonconvex optimization might have implications for other tensor estimation problems.
Submitted 2 June, 2021; v1 submitted 11 November, 2019;
originally announced November 2019.
-
Subspace Estimation from Unbalanced and Incomplete Data Matrices: $\ell_{2,\infty}$ Statistical Guarantees
Authors:
Changxiao Cai,
Gen Li,
Yuejie Chi,
H. Vincent Poor,
Yuxin Chen
Abstract:
This paper is concerned with estimating the column space of an unknown low-rank matrix $\boldsymbol{A}^{\star}\in\mathbb{R}^{d_{1}\times d_{2}}$, given noisy and partial observations of its entries. There is no shortage of scenarios where the observations -- while being too noisy to support faithful recovery of the entire matrix -- still convey sufficient information to enable reliable estimation of the column space of interest. This is particularly evident and crucial for the highly unbalanced case where the column dimension $d_{2}$ far exceeds the row dimension $d_{1}$, which is the focal point of the current paper. We investigate an efficient spectral method, which operates upon the sample Gram matrix with diagonal deletion. While this algorithmic idea has been studied before, we establish new statistical guarantees for this method in terms of both $\ell_{2}$ and $\ell_{2,\infty}$ estimation accuracy, which improve upon prior results if $d_{2}$ is substantially larger than $d_{1}$. To illustrate the effectiveness of our findings, we derive matching minimax lower bounds with respect to the noise levels, and develop consequences of our general theory for three applications of practical importance: (1) tensor completion from noisy data, (2) covariance estimation / principal component analysis with missing data, and (3) community recovery in bipartite graphs. Our theory leads to improved performance guarantees for all three cases.
Submitted 15 November, 2020; v1 submitted 9 October, 2019;
originally announced October 2019.
-
Group Representation Theory for Knowledge Graph Embedding
Authors:
Chen Cai
Abstract:
Knowledge graph embedding has recently become a popular way to model relations and infer missing links. In this paper, we present a group theoretical perspective of knowledge graph embedding, connecting previous methods with different group actions. Furthermore, by utilizing Schur's lemma from group representation theory, we show that the state of the art embedding method RotatE can model relations from any finite Abelian group.
Submitted 27 November, 2019; v1 submitted 11 September, 2019;
originally announced September 2019.
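The claim about finite Abelian groups can be illustrated with a small sketch. RotatE represents a relation as an element-wise rotation in the complex plane, $t = h \circ r$ with $|r_i| = 1$. A finite Abelian group is a product of cyclic groups, and each cyclic factor $\mathbb{Z}_n$ can be mapped to rotations by multiples of $2\pi/n$ in one complex coordinate. The example group $\mathbb{Z}_2\times\mathbb{Z}_3$, the function names, and the entity vector below are hypothetical; this shows the construction only and is not the paper's proof via Schur's lemma.

```python
import cmath

ORDERS = (2, 3)  # hypothetical example group Z_2 x Z_3, one coordinate per factor


def relation_embedding(g):
    """Map a group element (k_1, k_2) to unit-modulus rotations exp(2*pi*i*k/n)."""
    return tuple(cmath.exp(2j * cmath.pi * k / n) for k, n in zip(g, ORDERS))


def apply_relation(h, r):
    """RotatE-style relation application: element-wise complex product."""
    return tuple(hi * ri for hi, ri in zip(h, r))


def group_add(g1, g2):
    """Group operation in Z_2 x Z_3: component-wise addition modulo the orders."""
    return tuple((a + b) % n for a, b, n in zip(g1, g2, ORDERS))


def distance(x, y):
    """L1 distance between complex vectors, as used in a RotatE-style score."""
    return sum(abs(a - b) for a, b in zip(x, y))


h = (1 + 0j, 0.6 + 0.8j)  # an arbitrary entity embedding
g1, g2 = (1, 2), (1, 2)
# Applying g1 then g2 matches applying their group sum in a single step.
two_steps = apply_relation(apply_relation(h, relation_embedding(g1)), relation_embedding(g2))
one_step = apply_relation(h, relation_embedding(group_add(g1, g2)))
gap = distance(two_steps, one_step)
```

Because complex multiplication commutes, composed relations commute as well, which is consistent with the abstract's restriction to Abelian groups.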
-
Mixed sub-fractional Brownian motion and drift estimation of related Ornstein-Uhlenbeck process
Authors:
Chunhao Cai,
Qinghua Wang,
Weilin Xiao
Abstract:
In this paper, we first give a numerical simulation of the sub-fractional Brownian motion through its relation to fractional Brownian motion rather than through its random-walk representation. To verify the rationality of this simulation, we propose a practical estimator associated with the LSE of the drift parameter of the mixed sub-fractional Ornstein-Uhlenbeck process, and illustrate the asymptotic properties according to our method of simulation when the Hurst parameter $H>1/2$.
Submitted 8 January, 2021; v1 submitted 5 September, 2018;
originally announced September 2018.
-
Malliavin Derivative for the Unknown Parameter in surplus process with mixed fractional Brownian motion
Authors:
Chunhao Cai,
Yingzhong Huang
Abstract:
In this paper, we construct the Malliavin derivative and the stochastic integral with respect to the mixed fractional Brownian motion (mfBm) for $H > 1/2$. As an application, we try to estimate the drift parameter via the Malliavin derivative for a surplus process with mixed fractional Brownian motion.
Submitted 7 July, 2021; v1 submitted 3 February, 2018;
originally announced February 2018.
-
Controlled Mean-Reverting Estimation for The AR(1) Model with Stationary Gaussian Noise
Authors:
Chunhao Cai
Abstract:
This paper deals with the maximum likelihood estimator for the mean-reverting parameter of a first-order autoregressive model with exogenous variables, which are stationary Gaussian noises (colored noise). Using the method of the Laplace transform, both the asymptotic properties and the asymptotic design problem of the maximum likelihood estimator are investigated. The numerical simulation results confirm the theoretical analysis and show that the proposed maximum likelihood estimator performs well in finite samples.
Submitted 17 November, 2020; v1 submitted 26 October, 2017;
originally announced October 2017.
-
Simulation of Integro-Differential Equation and Application in Estimation of Ruin Probability with Mixed Fractional Brownian Motion
Authors:
Chunhao Cai,
Weilin Xiao
Abstract:
In this paper, we are concerned with the numerical solution of one type of integro-differential equation by a probabilistic method based on the fundamental martingale of mixed Gaussian processes. As an application, we try to simulate the estimation of ruin probability with an unknown parameter driven not by the classical Lévy process but by the mixed fractional Brownian motion.
Submitted 7 May, 2020; v1 submitted 11 September, 2017;
originally announced September 2017.
-
Experiment design for controlled partially observed fractional diffusion process
Authors:
Chunhao Cai,
Wujun LV
Abstract:
We consider a controlled second-order differential equation which is partially observed with an additional fractional noise. We study the asymptotic (for large observation time) design problem of the input and give an efficient estimator of the unknown signal drift parameter. When the input depends on the unknown parameter, we try the one-step estimation procedure using the Newton-Raphson method.
Submitted 8 April, 2019; v1 submitted 28 September, 2016;
originally announced September 2016.
-
Non-parametric threshold estimation for classical risk process perturbed by diffusion
Authors:
Chunhao Cai,
Junyi Guo,
Honglong You
Abstract:
In this paper, we consider a macro approximation of the flow of a risk reserve. The process is observed at discrete time points. Because we cannot directly observe each jump time and size, we make use of a technique for identifying the times when jumps larger than a suitably defined threshold occurred. We estimate the jump size and survival probability of our risk process from discrete observations.
Submitted 21 June, 2016;
originally announced June 2016.
-
Occupation times of intervals until last passage times for spectrally negative Levy processes
Authors:
Bo Li,
Chunhao Cai
Abstract:
In this paper, we derive the joint Laplace transforms of occupation times until its last passage times as well as its positions. Motivated by Baurdoux [2], the last times before an independent exponential variable are studied. By applying dual arguments, explicit formulas are derived in terms of new analytical identities from Loeffen et al. [12].
Submitted 22 September, 2016; v1 submitted 24 May, 2016;
originally announced May 2016.
-
Asymptotic properties of the MLE for the autoregressive process coefficients under stationary Gaussian noise
Authors:
Alexandre Brouste,
Chunhao Cai,
Marina Kleptsyna
Abstract:
In this paper we are interested in the Maximum Likelihood Estimator (MLE) of the vector parameter of an autoregressive process of order $p$ with regular stationary Gaussian noise. We exhibit the large sample asymptotical properties of the MLE under very mild conditions. Simulations are done for fractional Gaussian noise (fGn), autoregressive noise (AR(1)) and moving average noise (MA(1)).
Submitted 22 April, 2013;
originally announced April 2013.
-
Mixed Gaussian processes: A filtering approach
Authors:
Chunhao Cai,
Pavel Chigansky,
Marina Kleptsyna
Abstract:
This paper presents a new approach to the analysis of mixed processes \[X_t=B_t+G_t,\qquad t\in[0,T],\] where $B_t$ is a Brownian motion and $G_t$ is an independent centered Gaussian process. We obtain a new canonical innovation representation of $X$, using linear filtering theory. When the kernel \[K(s,t)=\frac{\partial^2}{\partial s\,\partial t}\mathbb{E}G_tG_s,\qquad s\ne t\] has a weak singularity on the diagonal, our results generalize the classical innovation formulas beyond the square integrable setting. For kernels with stronger singularity, our approach is applicable to processes with additional "fractional" structure, including the mixed fractional Brownian motion from mathematical finance. We show how previously-known measure equivalence relations and semimartingale properties follow from our canonical representation in a unified way, and complement them with new formulas for Radon-Nikodym densities.
Submitted 2 September, 2016; v1 submitted 30 August, 2012;
originally announced August 2012.