Search | arXiv e-print repository

Provably Convergent and Robust Newton-Raphson Method: A New Dawn in Primitive Variable Recovery for Relativistic MHD

Authors: Chaoyi Cai, Jianxian Qiu, Kailiang Wu

Abstract: A long-standing and formidable challenge faced by all conservative schemes for relativistic magnetohydrodynamics (RMHD) is the recovery of primitive variables from conservative ones. This process involves solving highly nonlinear equations subject to physical constraints. An ideal solver should be "robust, accurate, and fast -- it is at the heart of all conservative RMHD schemes," as emphasized in… ▽ More A long-standing and formidable challenge faced by all conservative schemes for relativistic magnetohydrodynamics (RMHD) is the recovery of primitive variables from conservative ones. This process involves solving highly nonlinear equations subject to physical constraints. An ideal solver should be "robust, accurate, and fast -- it is at the heart of all conservative RMHD schemes," as emphasized in [S.C. Noble et al., ApJ, 641:626-637, 2006]. Despite over three decades of research, seeking efficient solvers that can provably guarantee stability and convergence remains an open problem. This paper presents the first theoretical analysis for designing a robust, physical-constraint-preserving (PCP), and provably (quadratically) convergent Newton-Raphson (NR) method for primitive variable recovery in RMHD. Our key innovation is a unified approach for the initial guess, devised based on sophisticated analysis. It ensures that the NR iteration consistently converges and adheres to physical constraints. Given the extreme nonlinearity and complexity of the iterative function, the theoretical analysis is highly nontrivial and technical. We discover a pivotal inequality for delineating the convexity and concavity of the iterative function and establish theories to guarantee the PCP property and convergence. We also develop theories to determine a computable initial guess within a theoretical "safe" interval. Intriguingly, we find that the unique positive root of a cubic polynomial always falls within this interval. Our PCP NR method is versatile and can be seamlessly integrated into any RMHD scheme that requires the recovery of primitive variables, potentially leading to a broad impact in this field. As an application, we incorporate it into a discontinuous Galerkin method, resulting in fully PCP schemes. Several numerical experiments demonstrate the efficiency and robustness of the PCP NR method. △ Less

Submitted 8 April, 2024; originally announced April 2024.

Comments: 26 pages, 7 figures

arXiv:2403.11773 [pdf, other]

Scaling limit of heavy tailed nearly unstable cumulative INAR($\infty$) processes and rough fractional diffusions

Authors: Yingli Wang, Chunhao Cai, ** He

Abstract: In this paper, we investigated the scaling limit of heavy-tailed unstable cumulative INAR($\infty$) processes. These processes exhibit a power law tail of the form $n^{-(1+α)}$, with $α\in (\frac{1}{2}, 1)$, where the $\ell^1$ norm of the kernel vector is close to $1$. The result is in contrast to scaling limit of the continuous-time heavy tailed unstable Hawkes processes and the one of INAR($p$)… ▽ More In this paper, we investigated the scaling limit of heavy-tailed unstable cumulative INAR($\infty$) processes. These processes exhibit a power law tail of the form $n^{-(1+α)}$, with $α\in (\frac{1}{2}, 1)$, where the $\ell^1$ norm of the kernel vector is close to $1$. The result is in contrast to scaling limit of the continuous-time heavy tailed unstable Hawkes processes and the one of INAR($p$) processes. We show that the discrete-time scaling limit also has long-memory property and can also be seen as an integrated fractional Cox-Ingersoll-Ross process. △ Less

Submitted 16 April, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

Comments: arXiv admin note: text overlap with arXiv:1504.03100 by other authors

MSC Class: 60G22; 60F05

arXiv:2305.14805 [pdf, other]

Provably convergent Newton-Raphson methods for recovering primitive variables with applications to physical-constraint-preserving Hermite WENO schemes for relativistic hydrodynamics

Authors: Chaoyi Cai, Jianxian Qiu, Kailiang Wu

Abstract: The relativistic hydrodynamics (RHD) equations have three crucial intrinsic physical constraints on the primitive variables: positivity of pressure and density, and subluminal fluid velocity. However, numerical simulations can violate these constraints, leading to nonphysical results or even simulation failure. Designing genuinely physical-constraint-preserving (PCP) schemes is very difficult, as… ▽ More The relativistic hydrodynamics (RHD) equations have three crucial intrinsic physical constraints on the primitive variables: positivity of pressure and density, and subluminal fluid velocity. However, numerical simulations can violate these constraints, leading to nonphysical results or even simulation failure. Designing genuinely physical-constraint-preserving (PCP) schemes is very difficult, as the primitive variables cannot be explicitly reformulated using conservative variables due to relativistic effects. In this paper, we propose three efficient Newton--Raphson (NR) methods for robustly recovering primitive variables from conservative variables. Importantly, we rigorously prove that these NR methods are always convergent and PCP, meaning they preserve the physical constraints throughout the NR iterations. The discovery of these robust NR methods and their PCP convergence analyses are highly nontrivial and technical. As an application, we apply the proposed NR methods to design PCP finite volume Hermite weighted essentially non-oscillatory (HWENO) schemes for solving the RHD equations. Our PCP HWENO schemes incorporate high-order HWENO reconstruction, a PCP limiter, and strong-stability-preserving time discretization. We rigorously prove the PCP property of the fully discrete schemes using convex decomposition techniques. Moreover, we suggest the characteristic decomposition with rescaled eigenvectors and scale-invariant nonlinear weights to enhance the performance of the HWENO schemes in simulating large-scale RHD problems. Several demanding numerical tests are conducted to demonstrate the robustness, accuracy, and high resolution of the proposed PCP HWENO schemes and to validate the efficiency of our NR methods. △ Less

Submitted 24 May, 2023; originally announced May 2023.

Comments: 49 pages

arXiv:2211.12612 [pdf, ps, other]

Transfer Learning for Contextual Multi-armed Bandits

Authors: Changxiao Cai, T. Tony Cai, Hongzhe Li

Abstract: Motivated by a range of applications, we study in this paper the problem of transfer learning for nonparametric contextual multi-armed bandits under the covariate shift model, where we have data collected on source bandits before the start of the target bandit learning. The minimax rate of convergence for the cumulative regret is established and a novel transfer learning algorithm that attains the… ▽ More Motivated by a range of applications, we study in this paper the problem of transfer learning for nonparametric contextual multi-armed bandits under the covariate shift model, where we have data collected on source bandits before the start of the target bandit learning. The minimax rate of convergence for the cumulative regret is established and a novel transfer learning algorithm that attains the minimax regret is proposed. The results quantify the contribution of the data from the source domains for learning in the target domain in the context of nonparametric contextual multi-armed bandits. In view of the general impossibility of adaptation to unknown smoothness, we develop a data-driven algorithm that achieves near-optimal statistical guarantees (up to a logarithmic factor) while automatically adapting to the unknown parameters over a large collection of parameter spaces under an additional self-similarity assumption. A simulation study is carried out to illustrate the benefits of utilizing the data from the auxiliary source domains for learning in the target domain. △ Less

Submitted 24 January, 2024; v1 submitted 22 November, 2022; originally announced November 2022.

Comments: Accepted to the Annals of Statistics

arXiv:2110.03203 [pdf, ps, other]

Asymptotics of Karhunen-Lo{è}ve Eigenvalues for sub-fractional Brownian motion and its application

Authors: Jun-Qi Hu, Ying-Li Wang, Chun-Hao Cai

Abstract: In the present paper, the Karhunen-Lo{è}ve eigenvalues for a sub-fractional Brownian motion are considered in the case of $H>\frac12$. Rigorous large $n$ asymptotics for those eigenvalues are shown, based on functional analysis method. By virtue of these asymptotics, along with some standard large deviations results, asymptotically estimates for the closely related problem of small $L^2$-ball prob… ▽ More In the present paper, the Karhunen-Lo{è}ve eigenvalues for a sub-fractional Brownian motion are considered in the case of $H>\frac12$. Rigorous large $n$ asymptotics for those eigenvalues are shown, based on functional analysis method. By virtue of these asymptotics, along with some standard large deviations results, asymptotically estimates for the closely related problem of small $L^2$-ball probabilities for a sub-fractional Brownian motion are derived. By the way, asymptotic analysis on the Karhunen-Lo{è}ve eigenvalues for the corresponding "derivative" process is also established. △ Less

Submitted 13 October, 2021; v1 submitted 7 October, 2021; originally announced October 2021.

MSC Class: 60G15; 60G22; 47B40

arXiv:2109.10810 [pdf, ps, other]

doi 10.1137/21M1448094

On the continuity of optimal stop** surfaces for jump-diffusions

Authors: Cheng Cai, Tiziano De Angelis, Jan Palczewski

Abstract: We show that optimal stop** surfaces $(t,y)\mapsto x_*(t,y)$ arising from time-inhomogeneous optimal stop** problems on two-dimensional jump-diffusions $(X,Y)$ are continuous (jointly in time and space) under mild monotonicity and regularity assumptions of local nature. We show that optimal stop** surfaces $(t,y)\mapsto x_*(t,y)$ arising from time-inhomogeneous optimal stop** problems on two-dimensional jump-diffusions $(X,Y)$ are continuous (jointly in time and space) under mild monotonicity and regularity assumptions of local nature. △ Less

Submitted 7 June, 2022; v1 submitted 22 September, 2021; originally announced September 2021.

Comments: 18 pages, strengthened discussion of related literature

MSC Class: 60G40; 35R35; 60J60; 60J76

arXiv:2104.08502 [pdf, other]

doi 10.1111/mafi.12361

The American put with finite-time maturity and stochastic interest rate

Authors: Cheng Cai, Tiziano De Angelis, Jan Palczewski

Abstract: In this paper we study pricing of American put options on the Black and Scholes market with a stochastic interest rate and finite-time maturity. We prove that the option value is a $C^1$ function of the initial time, interest rate and stock price. By means of Ito calculus we rigorously derive the option value's early exercise premium formula and the associated hedging portfolio. We prove the exist… ▽ More In this paper we study pricing of American put options on the Black and Scholes market with a stochastic interest rate and finite-time maturity. We prove that the option value is a $C^1$ function of the initial time, interest rate and stock price. By means of Ito calculus we rigorously derive the option value's early exercise premium formula and the associated hedging portfolio. We prove the existence of an optimal exercise boundary splitting the state space into continuation and stop** region. The boundary has a parametrisation as a jointly continuous function of time and stock price, and it is the unique solution to an integral equation which we compute numerically. Our results hold for a large class of interest rate models including CIR and Vasicek models. We show a numerical study of the option price and the optimal exercise boundary for Vasicek model. △ Less

Submitted 5 February, 2024; v1 submitted 17 April, 2021; originally announced April 2021.

Comments: Corrections in proofs of Propositions 3.3 and 3.11

MSC Class: 91G20; 91G30; 93E20; 60J60; 35R35

arXiv:2104.05835 [pdf, ps, other]

A change of variable formula with applications to multi-dimensional optimal stop** problems

Authors: Cheng Cai, Tiziano De Angelis

Abstract: We derive a change of variable formula for $C^1$ functions $U:\R_+\times\R^m\to\R$ whose second order spatial derivatives may explode and not be integrable in the neighbourhood of a surface $b:\R_+\times\R^{m-1}\to \R$ that splits the state space into two sets $\cC$ and $\cD$. The formula is tailored for applications in problems of optimal stop** where it is generally very hard to control the se… ▽ More We derive a change of variable formula for $C^1$ functions $U:\R_+\times\R^m\to\R$ whose second order spatial derivatives may explode and not be integrable in the neighbourhood of a surface $b:\R_+\times\R^{m-1}\to \R$ that splits the state space into two sets $\cC$ and $\cD$. The formula is tailored for applications in problems of optimal stop** where it is generally very hard to control the second order derivatives of the value function near the optimal stop** boundary. Differently to other existing papers on similar topics we only require that the surface $b$ be monotonic in each variable and we formally obtain the same expression as the classical Itô's formula. △ Less

Submitted 6 July, 2023; v1 submitted 12 April, 2021; originally announced April 2021.

Comments: 26 pages; final version accepted for publication

MSC Class: 60H05; 60G44; 60J60; 60J65; 60G40; 35R35

arXiv:2104.03298 [pdf, ps, other]

Minimax Estimation of Linear Functions of Eigenvectors in the Face of Small Eigen-Gaps

Authors: Gen Li, Changxiao Cai, H. Vincent Poor, Yuxin Chen

Abstract: Eigenvector perturbation analysis plays a vital role in various data science applications. A large body of prior works, however, focused on establishing $\ell_{2}$ eigenvector perturbation bounds, which are often highly inadequate in addressing tasks that rely on fine-grained behavior of an eigenvector. This paper makes progress on this by studying the perturbation of linear functions of an unknow… ▽ More Eigenvector perturbation analysis plays a vital role in various data science applications. A large body of prior works, however, focused on establishing $\ell_{2}$ eigenvector perturbation bounds, which are often highly inadequate in addressing tasks that rely on fine-grained behavior of an eigenvector. This paper makes progress on this by studying the perturbation of linear functions of an unknown eigenvector. Focusing on two fundamental problems -- matrix denoising and principal component analysis -- in the presence of Gaussian noise, we develop a suite of statistical theory that characterizes the perturbation of arbitrary linear functions of an unknown eigenvector. In order to mitigate a non-negligible bias issue inherent to the natural ``plug-in'' estimator, we develop de-biased estimators that (1) achieve minimax lower bounds for a family of scenarios (modulo some logarithmic factor), and (2) can be computed in a data-driven manner without sample splitting. Noteworthily, the proposed estimators are nearly minimax optimal even when the associated eigen-gap is {\em substantially smaller} than what is required in prior statistical theory. △ Less

Submitted 4 July, 2022; v1 submitted 7 April, 2021; originally announced April 2021.

arXiv:2102.06548 [pdf, other]

Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis

Authors: Gen Li, Changxiao Cai, Yuxin Chen, Yuting Wei, Yuejie Chi

Abstract: Q-learning, which seeks to learn the optimal Q-function of a Markov decision process (MDP) in a model-free fashion, lies at the heart of reinforcement learning. When it comes to the synchronous setting (such that independent samples for all state-action pairs are drawn from a generative model in each iteration), substantial progress has been made towards understanding the sample efficiency of Q-le… ▽ More Q-learning, which seeks to learn the optimal Q-function of a Markov decision process (MDP) in a model-free fashion, lies at the heart of reinforcement learning. When it comes to the synchronous setting (such that independent samples for all state-action pairs are drawn from a generative model in each iteration), substantial progress has been made towards understanding the sample efficiency of Q-learning. Consider a $γ$-discounted infinite-horizon MDP with state space $\mathcal{S}$ and action space $\mathcal{A}$: to yield an entrywise $\varepsilon$-approximation of the optimal Q-function, state-of-the-art theory for Q-learning requires a sample size exceeding the order of $\frac{|\mathcal{S}||\mathcal{A}|}{(1-γ)^5\varepsilon^{2}}$, which fails to match existing minimax lower bounds. This gives rise to natural questions: what is the sharp sample complexity of Q-learning? Is Q-learning provably sub-optimal? This paper addresses these questions for the synchronous setting: (1) when $|\mathcal{A}|=1$ (so that Q-learning reduces to TD learning), we prove that the sample complexity of TD learning is minimax optimal and scales as $\frac{|\mathcal{S}|}{(1-γ)^3\varepsilon^2}$ (up to log factor); (2) when $|\mathcal{A}|\geq 2$, we settle the sample complexity of Q-learning to be on the order of $\frac{|\mathcal{S}||\mathcal{A}|}{(1-γ)^4\varepsilon^2}$ (up to log factor). Our theory unveils the strict sub-optimality of Q-learning when $|\mathcal{A}|\geq 2$, and rigorizes the negative impact of over-estimation in Q-learning. Finally, we extend our analysis to accommodate asynchronous Q-learning (i.e., the case with Markovian samples), sharpening the horizon dependency of its sample complexity to be $\frac{1}{(1-γ)^4}$. △ Less

Submitted 17 March, 2023; v1 submitted 12 February, 2021; originally announced February 2021.

Comments: accepted to Operations Research

arXiv:2009.03757 [pdf, ps, other]

A Note On Inference for the Mixed Fractional Ornstein-Uhlenbeck Process with Drift

Authors: Chunhao Cai, Min Zhang

Abstract: This paper is devoted to parameter estimation of the mixed fractional Ornstein-Uhlenbeck process with a drift. Large sample asymptotical properties of the Maximum Likelihood Estimator is deduced using the Laplace transform computations or the Cameron-Martin formula with extra part from \cite{CK19} This paper is devoted to parameter estimation of the mixed fractional Ornstein-Uhlenbeck process with a drift. Large sample asymptotical properties of the Maximum Likelihood Estimator is deduced using the Laplace transform computations or the Cameron-Martin formula with extra part from \cite{CK19} △ Less

Submitted 16 January, 2021; v1 submitted 8 September, 2020; originally announced September 2020.

arXiv:2006.08580 [pdf, ps, other]

Uncertainty quantification for nonconvex tensor completion: Confidence intervals, heteroscedasticity and optimality

Authors: Changxiao Cai, H. Vincent Poor, Yuxin Chen

Abstract: We study the distribution and uncertainty of nonconvex optimization for noisy tensor completion -- the problem of estimating a low-rank tensor given incomplete and corrupted observations of its entries. Focusing on a two-stage estimation algorithm proposed by Cai et al. (2019), we characterize the distribution of this nonconvex estimator down to fine scales. This distributional theory in turn allo… ▽ More We study the distribution and uncertainty of nonconvex optimization for noisy tensor completion -- the problem of estimating a low-rank tensor given incomplete and corrupted observations of its entries. Focusing on a two-stage estimation algorithm proposed by Cai et al. (2019), we characterize the distribution of this nonconvex estimator down to fine scales. This distributional theory in turn allows one to construct valid and short confidence intervals for both the unseen tensor entries and the unknown tensor factors. The proposed inferential procedure enjoys several important features: (1) it is fully adaptive to noise heteroscedasticity, and (2) it is data-driven and automatically adapts to unknown noise distributions. Furthermore, our findings unveil the statistical optimality of nonconvex tensor completion: it attains un-improvable $\ell_{2}$ accuracy -- including both the rates and the pre-constants -- when estimating both the unknown tensor and the underlying tensor factors. △ Less

Submitted 15 June, 2020; originally announced June 2020.

Comments: Accepted in part to ICML 2020

Journal ref: IEEE Transactions on Information Theory, vol. 69, no. 1, pp. 407-452, Jan. 2023

arXiv:2003.13351 [pdf, ps, other]

Maximum likelihood estimation for mixed fractional Vasicek processes

Authors: Chunhao Cai, Yinzhong Huang, Weilin Xiao

Abstract: The mixed fractional Vasicek model, which is an extended model of the traditional Vasicek model, has been widely used in modelling volatility, interest rate and exchange rate. Obviously, if some phenomenon are modeled by the mixed fractional Vasicek model, statistical inference for this process is of great interest. Based on continuous time observations, this paper considers the problem of estimat… ▽ More The mixed fractional Vasicek model, which is an extended model of the traditional Vasicek model, has been widely used in modelling volatility, interest rate and exchange rate. Obviously, if some phenomenon are modeled by the mixed fractional Vasicek model, statistical inference for this process is of great interest. Based on continuous time observations, this paper considers the problem of estimating the drift parameters in the mixed fractional Vasicek model. We will propose the maximum likelihood estimators of the drift parameters in the mixed fractional Vasicek model with the Radon-Nikodym derivative for a mixed fractional Brownian motion. Using the fundamental martingale and the Laplace transform, both the strong consistency and the asymptotic normality of the maximum likelihood estimators have been established for all $H\in(0,1)$, $H\neq 1/2$. △ Less

Submitted 14 July, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

arXiv:2003.06249 [pdf, other]

doi 10.1137/20M1325265

Optimal hedging of a perpetual American put with a single trade

Authors: Cheng Cai, Tiziano De Angelis, Jan Palczewski

Abstract: It is well-known that using delta hedging to hedge financial options is not feasible in practice. Traders often rely on discrete-time hedging strategies based on fixed trading times or fixed trading prices (i.e., trades only occur if the underlying asset's price reaches some predetermined values). Motivated by this insight and with the aim of obtaining explicit solutions, we consider the seller of… ▽ More It is well-known that using delta hedging to hedge financial options is not feasible in practice. Traders often rely on discrete-time hedging strategies based on fixed trading times or fixed trading prices (i.e., trades only occur if the underlying asset's price reaches some predetermined values). Motivated by this insight and with the aim of obtaining explicit solutions, we consider the seller of a perpetual American put option who can hedge her portfolio once until the underlying stock price leaves a certain range of values $(a,b)$. We determine optimal trading boundaries as functions of the initial stock holding, and an optimal hedging strategy for a bond/stock portfolio. Optimality here refers to the variance of the hedging error at the (random) time when the stock leaves the interval $(a,b)$. Our study leads to analytical expressions for both the optimal boundaries and the optimal stock holding, which can be evaluated numerically with no effort. △ Less

Submitted 23 September, 2020; v1 submitted 13 March, 2020; originally announced March 2020.

Comments: Section 6 added and Section 7 expanded

MSC Class: 91G10; 91G80; 60J60; 35R35

arXiv:2003.04523 [pdf, other]

doi 10.1137/20M1353605

Elder-Rule-Staircodes for Augmented Metric Spaces

Authors: Chen Cai, Woo** Kim, Facundo Memoli, Yusu Wang

Abstract: An augmented metric space is a metric space $(X, d_X)$ equipped with a function $f_X: X \to \mathbb{R}$. This type of data arises commonly in practice, e.g, a point cloud $X$ in $\mathbb{R}^d$ where each point $x\in X$ has a density function value $f_X(x)$ associated to it. An augmented metric space $(X, d_X, f_X)$ naturally gives rise to a 2-parameter filtration $\mathcal{K}$. However, the result… ▽ More An augmented metric space is a metric space $(X, d_X)$ equipped with a function $f_X: X \to \mathbb{R}$. This type of data arises commonly in practice, e.g, a point cloud $X$ in $\mathbb{R}^d$ where each point $x\in X$ has a density function value $f_X(x)$ associated to it. An augmented metric space $(X, d_X, f_X)$ naturally gives rise to a 2-parameter filtration $\mathcal{K}$. However, the resulting 2-parameter persistent homology $\mathrm{H}_{\bullet}(\mathcal{K})$ could still be of wild representation type, and may not have simple indecomposables. In this paper, motivated by the elder-rule for the zeroth homology of 1-parameter filtration, we propose a barcode-like summary, called the elder-rule-staircode, as a way to encode $\mathrm{H}_0(\mathcal{K})$. Specifically, if $n = |X|$, the elder-rule-staircode consists of $n$ number of staircase-like blocks in the plane. We show that if $\mathrm{H}_0(\mathcal{K})$ is interval decomposable, then the barcode of $\mathrm{H}_0(\mathcal{K})$ is equal to the elder-rule-staircode. Furthermore, regardless of the interval decomposability, the fibered barcode, the dimension function (a.k.a. the Hilbert function), and the graded Betti numbers of $\mathrm{H}_0(\mathcal{K})$ can all be efficiently computed once the elder-rule-staircode is given. Finally, we develop and implement an efficient algorithm to compute the elder-rule-staircode in $O(n^2\log n)$ time, which can be improved to $O(n^2α(n))$ if $X$ is from a fixed dimensional Euclidean space $\mathbb{R}^d$, where $α(n)$ is the inverse Ackermann function. △ Less

Submitted 12 July, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

Comments: A few important questions considered in the previous version have been settled; see Example 4.12 and Section 4.3 in particular. The paper has been reorganized. This is the full version of the paper in the Proceedings of the 36th International Symposium on Computational Geometry (SoCG 2020); 41 pages, 17 figures

Journal ref: SIAM Journal on Applied Algebra and Geometry (2021)

arXiv:1912.02392 [pdf, other]

KoPA: Automated Kronecker Product Approximation

Authors: Chencheng Cai, Rong Chen, Han Xiao

Abstract: We consider the problem of matrix approximation and denoising induced by the Kronecker product decomposition. Specifically, we propose to approximate a given matrix by the sum of a few Kronecker products of matrices, which we refer to as the Kronecker product approximation (KoPA). Because the Kronecker product is an extension of the outer product from vectors to matrices, KoPA extends the low rank… ▽ More We consider the problem of matrix approximation and denoising induced by the Kronecker product decomposition. Specifically, we propose to approximate a given matrix by the sum of a few Kronecker products of matrices, which we refer to as the Kronecker product approximation (KoPA). Because the Kronecker product is an extension of the outer product from vectors to matrices, KoPA extends the low rank matrix approximation, and includes it as a special case. Comparing with the latter, KoPA also offers a greater flexibility, since it allows the user to choose the configuration, which are the dimensions of the two smaller matrices forming the Kronecker product. On the other hand, the configuration to be used is usually unknown, and needs to be determined from the data in order to achieve the optimal balance between accuracy and parsimony. We propose to use extended information criteria to select the configuration. Under the paradigm of high dimensional analysis, we show that the proposed procedure is able to select the true configuration with probability tending to one, under suitable conditions on the signal-to-noise ratio. We demonstrate the superiority of KoPA over the low rank approximations through numerical studies, and several benchmark image examples. △ Less

Submitted 26 August, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

arXiv:1911.04436 [pdf, ps, other]

Nonconvex Low-Rank Tensor Completion from Noisy Data

Authors: Changxiao Cai, Gen Li, H. Vincent Poor, Yuxin Chen

Abstract: We study a noisy tensor completion problem of broad practical interest, namely, the reconstruction of a low-rank tensor from highly incomplete and randomly corrupted observations of its entries. While a variety of prior work has been dedicated to this problem, prior algorithms either are computationally too expensive for large-scale applications, or come with sub-optimal statistical guarantees. Fo… ▽ More We study a noisy tensor completion problem of broad practical interest, namely, the reconstruction of a low-rank tensor from highly incomplete and randomly corrupted observations of its entries. While a variety of prior work has been dedicated to this problem, prior algorithms either are computationally too expensive for large-scale applications, or come with sub-optimal statistical guarantees. Focusing on "incoherent" and well-conditioned tensors of a constant CP rank, we propose a two-stage nonconvex algorithm -- (vanilla) gradient descent following a rough initialization -- that achieves the best of both worlds. Specifically, the proposed nonconvex algorithm faithfully completes the tensor and retrieves all individual tensor factors within nearly linear time, while at the same time enjoying near-optimal statistical guarantees (i.e. minimal sample complexity and optimal estimation accuracy). The estimation errors are evenly spread out across all entries, thus achieving optimal $\ell_{\infty}$ statistical accuracy. We have also discussed how to extend our approach to accommodate asymmetric tensors. The insight conveyed through our analysis of nonconvex optimization might have implications for other tensor estimation problems. △ Less

Submitted 2 June, 2021; v1 submitted 11 November, 2019; originally announced November 2019.

Comments: Accepted to Operations Research

Journal ref: Operations Research, vol. 70, no. 2, pp. 1219-1237, 2022

arXiv:1910.04267 [pdf, ps, other]

Subspace Estimation from Unbalanced and Incomplete Data Matrices: $\ell_{2,\infty}$ Statistical Guarantees

Authors: Changxiao Cai, Gen Li, Yuejie Chi, H. Vincent Poor, Yuxin Chen

Abstract: This paper is concerned with estimating the column space of an unknown low-rank matrix $\boldsymbol{A}^{\star}\in\mathbb{R}^{d_{1}\times d_{2}}$, given noisy and partial observations of its entries. There is no shortage of scenarios where the observations -- while being too noisy to support faithful recovery of the entire matrix -- still convey sufficient information to enable reliable estimation… ▽ More This paper is concerned with estimating the column space of an unknown low-rank matrix $\boldsymbol{A}^{\star}\in\mathbb{R}^{d_{1}\times d_{2}}$, given noisy and partial observations of its entries. There is no shortage of scenarios where the observations -- while being too noisy to support faithful recovery of the entire matrix -- still convey sufficient information to enable reliable estimation of the column space of interest. This is particularly evident and crucial for the highly unbalanced case where the column dimension $d_{2}$ far exceeds the row dimension $d_{1}$, which is the focal point of the current paper. We investigate an efficient spectral method, which operates upon the sample Gram matrix with diagonal deletion. While this algorithmic idea has been studied before, we establish new statistical guarantees for this method in terms of both $\ell_{2}$ and $\ell_{2,\infty}$ estimation accuracy, which improve upon prior results if $d_{2}$ is substantially larger than $d_{1}$. To illustrate the effectiveness of our findings, we derive matching minimax lower bounds with respect to the noise levels, and develop consequences of our general theory for three applications of practical importance: (1) tensor completion from noisy data, (2) covariance estimation / principal component analysis with missing data, and (3) community recovery in bipartite graphs. Our theory leads to improved performance guarantees for all three cases. △ Less

Submitted 15 November, 2020; v1 submitted 9 October, 2019; originally announced October 2019.

Comments: Accepted to Annals of Statistics

Journal ref: Annals of Statistics, vol. 49, no. 2, pp. 944-967, 2021

arXiv:1909.05100

Group Representation Theory for Knowledge Graph Embedding

Authors: Chen Cai

Abstract: Knowledge graph embedding has recently become a popular way to model relations and infer missing links. In this paper, we present a group theoretical perspective of knowledge graph embedding, connecting previous methods with different group actions. Furthermore, by utilizing Schur's lemma from group representation theory, we show that the state of the art embedding method RotatE can model relation… ▽ More Knowledge graph embedding has recently become a popular way to model relations and infer missing links. In this paper, we present a group theoretical perspective of knowledge graph embedding, connecting previous methods with different group actions. Furthermore, by utilizing Schur's lemma from group representation theory, we show that the state of the art embedding method RotatE can model relations from any finite Abelian group. △ Less

Submitted 27 November, 2019; v1 submitted 11 September, 2019; originally announced September 2019.

Comments: Paper withdrawn due to company policy

arXiv:1809.02038 [pdf, other]

Mixed sub-fractional Brownian motion and drift estimation of related Ornstein-Uhlenbeck process

Authors: Chunhao Cai, Qinghua Wang, Weilin Xiao

Abstract: In this paper, we will first give the numerical simulation of the sub-fractional Brownian motion through the relation of fractional Brownian motion instead of its representation of random walk. In order to verify the rationality of this simulation, we propose a practical estimator associated with the LSE of the drift parameter of mixed sub-fractional Ornstein-Uhlenbeck process, and illustrate the… ▽ More In this paper, we will first give the numerical simulation of the sub-fractional Brownian motion through the relation of fractional Brownian motion instead of its representation of random walk. In order to verify the rationality of this simulation, we propose a practical estimator associated with the LSE of the drift parameter of mixed sub-fractional Ornstein-Uhlenbeck process, and illustrate the asymptotical properties according to our method of simulation when the Hurst parameter $H>1/2$. △ Less

Submitted 8 January, 2021; v1 submitted 5 September, 2018; originally announced September 2018.

arXiv:1802.00982

Malliavin Derivative for the Unknown Parameter in surplus process with mixed fractional Brownian motion

Authors: Chunhao Cai, Yingzhong Huang

Abstract: In this paper, we will construct the Malliavin derivative and the stochastic integral with respect to the Mixed fractional Brownian motion (mfbm) for H > 1/2. As an application, we try to estimate the drift parameter via Malliavin derivative for surplus process with mixed fractional Brownian motion In this paper, we will construct the Malliavin derivative and the stochastic integral with respect to the Mixed fractional Brownian motion (mfbm) for H > 1/2. As an application, we try to estimate the drift parameter via Malliavin derivative for surplus process with mixed fractional Brownian motion △ Less

Submitted 7 July, 2021; v1 submitted 3 February, 2018; originally announced February 2018.

Comments: In the proof of Theorem 3.1 of the V3 and V4 we do not give the suitable citation and also there exists mistakes in this Theorem

arXiv:1710.09610 [pdf, ps, other]

Controlled Mean-Reverting Estimation for The AR(1) Model with Stationary Gaussian Noise

Authors: Chunhao Cai

Abstract: This paper deals with the maximum likelihood estimator for the mean-reverting parameter of a first order autoregressive models with exogenous variables, which are stationary Gaussian noises (Colored noise). Using the method of the Laplace transform, both the asymptotic properties and the asymptotic design problem of the maximum likelihood estimator are investigated. The numerical simulation result… ▽ More This paper deals with the maximum likelihood estimator for the mean-reverting parameter of a first order autoregressive models with exogenous variables, which are stationary Gaussian noises (Colored noise). Using the method of the Laplace transform, both the asymptotic properties and the asymptotic design problem of the maximum likelihood estimator are investigated. The numerical simulation results confirm the theoretical analysis and show that the proposed maximum likelihood estimator performs well in finite sample. △ Less

Submitted 17 November, 2020; v1 submitted 26 October, 2017; originally announced October 2017.

arXiv:1709.03418 [pdf, other]

Simulation of Integro-Differential Equation and Application in Estimation of Ruin Probability with Mixed Fractional Brownian Motion

Authors: Chunhao Cai, Weilin Xiao

Abstract: In this paper, we are concerned with the numerical solution of one type integro-differential equation by a probability method based on the fundamental martingale of mixed Gaussian processes. As an application, we will try to simulate the estimation of ruin probability with an unknown parameter driven not by the classical Lévy process but by the mixed fractional Brownian motion. In this paper, we are concerned with the numerical solution of one type integro-differential equation by a probability method based on the fundamental martingale of mixed Gaussian processes. As an application, we will try to simulate the estimation of ruin probability with an unknown parameter driven not by the classical Lévy process but by the mixed fractional Brownian motion. △ Less

Submitted 7 May, 2020; v1 submitted 11 September, 2017; originally announced September 2017.

arXiv:1609.08948 [pdf, ps, other]

Experiment design for controlled partially observed fractional diffusion process

Authors: Chunhao Cai, Wujun LV

Abstract: We consider a controlled second order differential equation which is partially observed with an additional fractional noise. we study the asymptotic (for large observation time) design problem of the input and give an efficient estimator of the unknown signal drift parameter. When the input depends on the unknow parameter, we will try the one-step estimation procedure using the Newton-Raphson meth… ▽ More We consider a controlled second order differential equation which is partially observed with an additional fractional noise. we study the asymptotic (for large observation time) design problem of the input and give an efficient estimator of the unknown signal drift parameter. When the input depends on the unknow parameter, we will try the one-step estimation procedure using the Newton-Raphson method. △ Less

Submitted 8 April, 2019; v1 submitted 28 September, 2016; originally announced September 2016.

arXiv:1606.06459 [pdf, ps, other]

Non-parametric threshold estimation for classical risk process perturbed by diffusion

Authors: Chunhao Cai, Junyi Guo, Honglong You

Abstract: In this paper,we consider a macro approximation of the flow of a risk reserve, The process is observed at discrete time points. Because we cannot directly observe each jump time and size then we will make use of a technique for identifying the times when jumps larger than a suitably defined threshold occurred. We estimate the jump size and survival probability of our risk process from discrete obs… ▽ More In this paper,we consider a macro approximation of the flow of a risk reserve, The process is observed at discrete time points. Because we cannot directly observe each jump time and size then we will make use of a technique for identifying the times when jumps larger than a suitably defined threshold occurred. We estimate the jump size and survival probability of our risk process from discrete observations. △ Less

Submitted 21 June, 2016; originally announced June 2016.

arXiv:1605.07709 [pdf, ps, other]

doi 10.1007/s10959-017-0782-0

Occupation times of intervals until last passage times for spectrally negative Levy processes

Authors: Bo Li, Chunhao Cai

Abstract: In this paper, we derive the joint Laplace transforms of occupation times until its last passage times as well as its positions. Motivated by Baurdoux [2], the last times before an independent exponential variable are studied. By applying dual arguments, explicit formulas are derived in terms of new analytical identities from Loeffen et al. [12]. In this paper, we derive the joint Laplace transforms of occupation times until its last passage times as well as its positions. Motivated by Baurdoux [2], the last times before an independent exponential variable are studied. By applying dual arguments, explicit formulas are derived in terms of new analytical identities from Loeffen et al. [12]. △ Less

Submitted 22 September, 2016; v1 submitted 24 May, 2016; originally announced May 2016.

MSC Class: 60G51; 60J55

arXiv:1304.5929 [pdf, other]

Asymptotic properties of the MLE for the autoregressive process coefficients under stationary Gaussian noise

Authors: Alexandre Brouste, Chunhao Cai, Marina Kleptsyna

Abstract: In this paper we are interested in the Maximum Likelihood Estimator (MLE) of the vector parameter of an autoregressive process of order $p$ with regular stationary Gaussian noise. We exhibit the large sample asymptotical properties of the MLE under very mild conditions. Simulations are done for fractional Gaussian noise (fGn), autoregressive noise (AR(1)) and moving average noise (MA(1)). In this paper we are interested in the Maximum Likelihood Estimator (MLE) of the vector parameter of an autoregressive process of order $p$ with regular stationary Gaussian noise. We exhibit the large sample asymptotical properties of the MLE under very mild conditions. Simulations are done for fractional Gaussian noise (fGn), autoregressive noise (AR(1)) and moving average noise (MA(1)). △ Less

Submitted 22 April, 2013; originally announced April 2013.

arXiv:1208.6253 [pdf, ps, other]

doi 10.1214/15-AOP1041

Mixed Gaussian processes: A filtering approach

Authors: Chunhao Cai, Pavel Chigansky, Marina Kleptsyna

Abstract: This paper presents a new approach to the analysis of mixed processes \[X_t=B_t+G_t,\qquad t\in[0,T],\] where $B_t$ is a Brownian motion and $G_t$ is an independent centered Gaussian process. We obtain a new canonical innovation representation of $X$, using linear filtering theory. When the kernel \[K(s,t)=\frac{\partial^2}{\partial s\,\partial t}\mathbb{E}G_tG_s,\qquad s\ne t\] has a weak singula… ▽ More This paper presents a new approach to the analysis of mixed processes \[X_t=B_t+G_t,\qquad t\in[0,T],\] where $B_t$ is a Brownian motion and $G_t$ is an independent centered Gaussian process. We obtain a new canonical innovation representation of $X$, using linear filtering theory. When the kernel \[K(s,t)=\frac{\partial^2}{\partial s\,\partial t}\mathbb{E}G_tG_s,\qquad s\ne t\] has a weak singularity on the diagonal, our results generalize the classical innovation formulas beyond the square integrable setting. For kernels with stronger singularity, our approach is applicable to processes with additional "fractional" structure, including the mixed fractional Brownian motion from mathematical finance. We show how previously-known measure equivalence relations and semimartingale properties follow from our canonical representation in a unified way, and complement them with new formulas for Radon-Nikodym densities. △ Less

Submitted 2 September, 2016; v1 submitted 30 August, 2012; originally announced August 2012.

Comments: Published at http://dx.doi.org/10.1214/15-AOP1041 in the Annals of Probability (http://www.imstat.org/aop/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOP-AOP1041

Journal ref: Annals of Probability 2016, Vol. 44, No. 4, 3032-3075

Showing 1–28 of 28 results for author: Cai, C