Search | arXiv e-print repository

Entropic Optimal Transport Eigenmaps for Nonlinear Alignment and Joint Embedding of High-Dimensional Datasets

Authors: Boris Landa, Yuval Kluger, Rong Ma

Abstract: Embedding high-dimensional data into a low-dimensional space is an indispensable component of data analysis. In numerous applications, it is necessary to align and jointly embed multiple datasets from different studies or experimental conditions. Such datasets may share underlying structures of interest but exhibit individual distortions, resulting in misaligned embeddings using traditional techniques. In this work, we propose \textit{Entropic Optimal Transport (EOT) eigenmaps}, a principled approach for aligning and jointly embedding a pair of datasets with theoretical guarantees. Our approach leverages the leading singular vectors of the EOT plan matrix between two datasets to extract their shared underlying structure and align the datasets accordingly in a common embedding space. We interpret our approach as an inter-data variant of the classical Laplacian eigenmaps and diffusion maps embeddings, showing that it enjoys many favorable analogous properties. We then analyze a data-generative model where two observed high-dimensional datasets share latent variables on a common low-dimensional manifold, but each dataset is subject to data-specific translation, scaling, nuisance structures, and noise. We show that in a high-dimensional asymptotic regime, the EOT plan recovers the shared manifold structure by approximating a kernel function evaluated at the locations of the latent variables. Subsequently, we provide a geometric interpretation of our embedding by relating it to the eigenfunctions of population-level operators encoding the density and geometry of the shared manifold. Finally, we showcase the performance of our approach for data integration and embedding through simulations and analyses of real-world biological data, demonstrating its advantages over alternative methods in challenging scenarios.

Submitted 1 July, 2024; originally announced July 2024.

arXiv:2406.00961 [pdf, other]

Kronecker-product random matrices and a matrix least squares problem

Authors: Zhou Fan, Renyuan Ma

Abstract: We study the eigenvalue distribution and resolvent of a Kronecker-product random matrix model $A \otimes I_{n \times n}+I_{n \times n} \otimes B+Θ\otimes Ξ\in \mathbb{C}^{n^2 \times n^2}$, where $A,B$ are independent Wigner matrices and $Θ,Ξ$ are deterministic and diagonal. For fixed spectral arguments, we establish a quantitative approximation for the Stieltjes transform by that of an approximating free operator, and a diagonal deterministic equivalent approximation for the resolvent. We further obtain sharp estimates in operator norm for the $n \times n$ resolvent blocks, and show that off-diagonal resolvent entries fall on two differing scales of $n^{-1/2}$ and $n^{-1}$ depending on their locations in the Kronecker structure. Our study is motivated by consideration of a matrix-valued least-squares optimization problem $\min_{X \in \mathbb{R}^{n \times n}} \frac{1}{2}\|XA+BX\|_F^2+\frac{1}{2}\sum_{ij} ξ_iθ_j x_{ij}^2$ subject to a linear constraint. For random instances of this problem defined by Wigner inputs $A,B$, our analyses imply an asymptotic characterization of the minimizer $X$ and its associated minimum objective value as $n \to \infty$.

Submitted 2 June, 2024; originally announced June 2024.

arXiv:2403.18217 [pdf, ps, other]

Mixed Variational Formulation of Coupled Plates

Authors: Jun Hu, Zhen Liu, Rui Ma, Ruishu Wang

Abstract: This paper proposes a mixed variational formulation for the problem of two coupled plates with a rigid junction. The proposed mixed formulation introduces the union of stresses and moments as an auxiliary variable, which are commonly of great interest in practical applications. The primary challenge lies in determining a suitable space involving both boundary and junction conditions of the auxiliary variable. The theory of densely defined operators in Hilbert spaces is employed to define a nonstandard Sobolev space without the use of trace operators. The well-posedness is established for the mixed formulation. Based on these conditions, this paper provides a framework of conforming mixed finite element methods. Numerical experiments are given to validate the theoretical results.

Submitted 26 March, 2024; originally announced March 2024.

arXiv:2403.18065 [pdf, ps, other]

Primitive elements in the Hall algebra of a cyclic quiver

Authors: Renda Ma

Abstract: We provide an explicit formula for primitive elements in the Hall algebras of nilpotent representations of cyclic quivers.

Submitted 26 March, 2024; originally announced March 2024.

Comments: PhD thesis

arXiv:2401.03276 [pdf, ps, other]

Robust Discrete Choice Model for Travel Behavior Prediction With Data Uncertainties

Authors: Baichuan Mo, Yunhan Zheng, Xiaotong Guo, Ruoyun Ma, Jinhua Zhao

Abstract: Discrete choice models (DCMs) are the canonical methods for travel behavior modeling and prediction. However, in many scenarios, the collected data for DCMs are subject to measurement errors. Previous studies on measurement errors mostly focus on "better estimating model parameters" with training data. In this study, we focus on "better predicting new samples' behavior" when there are measurement errors in testing data. To this end, we propose a robust discrete choice model framework that is able to account for data uncertainties in both features and labels. The model is based on robust optimization theory that minimizes the worst-case loss over a set of uncertainty data scenarios. Specifically, for feature uncertainties, we assume that the $\ell_p$-norm of the measurement errors in features is smaller than a pre-established threshold. We model label uncertainties by limiting the number of mislabeled choices to at most $Γ$. Based on these assumptions, we derive a tractable robust counterpart for robust-feature and robust-label DCM models. The derived robust-feature binary logit (BNL) and the robust-label multinomial logit (MNL) models are exact. However, the formulation for the robust-feature MNL model is an approximation of the exact robust optimization problem. The proposed models are validated in a binary choice data set and a multinomial choice data set, respectively. Results show that the robust models (both features and labels) can outperform the conventional BNL and MNL models in prediction accuracy and log-likelihood. We show that the robustness works like "regularization" and thus has better generalizability.

Submitted 6 January, 2024; originally announced January 2024.

arXiv:2312.00854 [pdf, other]

A Probabilistic Neural Twin for Treatment Planning in Peripheral Pulmonary Artery Stenosis

Authors: John D. Lee, Jakob Richter, Martin R. Pfaller, Jason M. Szafron, Karthik Menon, Andrea Zanoni, Michael R. Ma, Jeffrey A. Feinstein, Jacqueline Kreutzer, Alison L. Marsden, Daniele E. Schiavazzi

Abstract: The substantial computational cost of high-fidelity models in numerical hemodynamics has, so far, relegated their use mainly to offline treatment planning. New breakthroughs in data-driven architectures and optimization techniques for fast surrogate modeling provide an exciting opportunity to overcome these limitations, enabling the use of such technology for time-critical decisions. We discuss an application to the repair of multiple stenosis in peripheral pulmonary artery disease through either transcatheter pulmonary artery rehabilitation or surgery, where it is of interest to achieve desired pressures and flows at specific locations in the pulmonary artery tree, while minimizing the risk for the patient. Since different degrees of success can be achieved in practice during treatment, we formulate the problem in probability, and solve it through a sample-based approach. We propose a new offline-online pipeline for probabilistic real-time treatment planning which combines offline assimilation of boundary conditions, model reduction, and training dataset generation with online estimation of marginal probabilities, possibly conditioned on the degree of augmentation observed in already repaired lesions. Moreover, we propose a new approach for the parametrization of arbitrarily shaped vascular repairs through iterative corrections of a zero-dimensional approximant. We demonstrate this pipeline for a diseased model of the pulmonary artery tree available through the Vascular Model Repository.

Submitted 1 December, 2023; originally announced December 2023.

arXiv:2308.00359 [pdf, ps, other]

The asymptotic stability of solitons in the focusing Hirota equation on the line

Authors: Ruihong Ma, Engui Fan

Abstract: In this paper, the $\overline\partial$-steepest descent method and the Bäcklund transformation are used to study the asymptotic stability of solitons for the Cauchy problem of the focusing Hirota equation. The solution of the RH problem is further decomposed into a pure radiation solution and a soliton solution, obtained by using $\overline\partial$-techniques and the Bäcklund transformation respectively. As a direct consequence, the asymptotic stability of solitons for the Hirota equation is obtained.

Submitted 5 September, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

Comments: 43 pages. arXiv admin note: text overlap with arXiv:1302.1215 by other authors

arXiv:2307.09853 [pdf, ps, other]

A proof of a conjecture of Mao on Beck's partition statistics modulo 8

Authors: Renrong Mao, Ernest X. W. Xia

Abstract: Beck introduced two partition statistics $NT(r,m,n)$ and $M_ω(r,m,n)$, which denote the total number of parts in the partitions of $n$ with rank congruent to $r$ modulo $m$ and the total number of ones in the partitions of $n$ with crank congruent to $r$ modulo $m$, respectively. In recent years, a number of congruences and identities on $NT(r,m,n)$ and $M_ω(r,m,n)$ for some small $m$ have been established. In this paper, we prove an identity on $NT(r,8,n)$ and $M_ω(r,4,n)$ which confirms a conjecture given by Mao.

Submitted 19 July, 2023; originally announced July 2023.

arXiv:2305.07869 [pdf, ps, other]

Some new curious congruences involving multiple harmonic sums

Authors: Rong Ma, Ni Li

Abstract: It is significant to study congruences involving multiple harmonic sums. Let $p$ be an odd prime. In recent years, the following curious congruence $$\sum_{\substack{i+j+k=p \\ i, j, k>0}} \frac{1}{i j k} \equiv-2 B_{p-3}\pmod p$$ has been generalized along different directions, where $B_n$ denotes the $n$th Bernoulli number. In this paper, we obtain several new generalizations of the above congruence by applying congruences involving multiple harmonic sums. For example, we have $$\sum_{\substack{k_1+k_2+\cdots+k_n=p \\ k_i> 0, 1 \le i \le n}} \dfrac{(-1)^{k_1}\left(\dfrac{k_1}{3}\right)}{k_1 \cdots k_n} \equiv \dfrac{(n-1)!}{n}\dfrac{2^{n-1}+1}{3\cdot6^{n-1}}B_{p-n}\left(\dfrac{1}{3}\right)\pmod p,$$ where $n$ is even and $B_n(x)$ denotes the Bernoulli polynomials.

Submitted 13 May, 2023; originally announced May 2023.

Comments: 12 pages

MSC Class: 11A07; 11B68
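
The base congruence quoted in the abstract above, $\sum_{i+j+k=p} \frac{1}{ijk} \equiv -2B_{p-3} \pmod p$, can be spot-checked numerically for small primes. The following is only an illustrative sketch, not code from the paper: the function names are hypothetical, Bernoulli numbers are computed with the standard recurrence (convention $B_1 = -1/2$), and the check is run only for primes $p \ge 5$.

```python
from fractions import Fraction
from math import comb


def bernoulli(n: int) -> Fraction:
    """Return the Bernoulli number B_n (with B_1 = -1/2) via the usual recurrence."""
    b = [Fraction(1)]
    for m in range(1, n + 1):
        b.append(-sum(comb(m + 1, k) * b[k] for k in range(m)) / (m + 1))
    return b[n]


def residue(x: Fraction, p: int) -> int:
    """Reduce a rational whose denominator is coprime to p to its residue mod p."""
    return x.numerator * pow(x.denominator, -1, p) % p


def triple_sum(p: int) -> Fraction:
    """Exact sum of 1/(i*j*k) over positive integers i, j, k with i + j + k = p."""
    return sum(
        Fraction(1, i * j * (p - i - j))
        for i in range(1, p - 1)
        for j in range(1, p - i)
    )


def congruence_holds(p: int) -> bool:
    """Compare both sides of the congruence as residues mod p."""
    return residue(triple_sum(p), p) == residue(-2 * bernoulli(p - 3), p)


if __name__ == "__main__":
    for p in (5, 7, 11):
        print(p, congruence_holds(p))
```

Exact rational arithmetic is used so that the reduction modulo $p$ happens only once, at the end; this keeps the sketch simple at the cost of speed for larger primes.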

arXiv:2303.05805 [pdf, ps, other]

A new mixed finite element for the linear elasticity problem in 3D

Authors: Jun Hu, Rui Ma, Yuanxun Sun

Abstract: This paper constructs the first mixed finite element for the linear elasticity problem in 3D using $P_3$ polynomials for the stress and discontinuous $P_2$ polynomials for the displacement on tetrahedral meshes under some mild mesh conditions. The degrees of freedom of the stress space as well as the corresponding nodal basis are established by characterizing a space of some piecewise constant symmetric matrices on a patch around each edge. Macro-element techniques are used to define a stable interpolation to prove the discrete inf-sup condition. Optimal convergence is obtained theoretically.

Submitted 20 August, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

arXiv:2303.05268 [pdf, ps, other]

Andrews-Beck type congruences modulo powers of 5

Authors: Nankun Hong, Renrong Mao

Abstract: Let $NT(m, k, n)$ denote the total number of parts in the partitions of $n$ with rank congruent to $m$ modulo $k$. Andrews proved Beck's conjecture on congruences for $NT(m, k, n)$ modulo 5 and 7. Generalizing Andrews' results, Chern obtained congruences for $NT(m, k, n)$ modulo 11 and 13. More recently, the second author used the theory of Hecke operators to establish congruences for such partition statistics modulo powers of primes $\ell \ge 7$. In this paper, we obtain Andrews-Beck type congruences modulo powers of 5.

Submitted 9 March, 2023; originally announced March 2023.

arXiv:2211.15874 [pdf, ps, other]

Some congruences involving generalized Bernoulli numbers and Bernoulli polynomials

Authors: Ni Li, Rong Ma

Abstract: Let $[x]$ be the integral part of $x$, $n>1$ be a positive integer and $χ_n$ denote the trivial Dirichlet character modulo $n$. In this paper, we use an identity established by Z. H. Sun to get congruences of $T_{m,k}(n)=\sum_{x=1}^{[n/m]}\frac{χ_n(x)}{x^k}\left(\bmod n^{r+1}\right)$ for $r\in \{1,2\}$ and any positive integer $m$ with $n \equiv \pm 1 \left(\bmod m \right)$ in terms of Bernoulli polynomials. As an application, we also obtain some new congruences involving binomial coefficients modulo $n^4$ in terms of generalized Bernoulli numbers.

Submitted 28 November, 2022; originally announced November 2022.

Comments: 21 pages

MSC Class: 11B68; 11A07 ACM Class: B.2

arXiv:2211.09298 [pdf, other]

Asymptotic behaviour of a conservative reaction-diffusion system associated with a Markov process algebra model

Authors: Jie Ding, Ruiming Ma, Zhigui Lin, Zhi Ling

Abstract: This paper demonstrates a lower and upper solution method to investigate the asymptotic behaviour of the conservative reaction-diffusion systems associated with Markovian process algebra models. In particular, we have proved the uniform convergence of the solution to its constant equilibrium for a case study as time tends to infinity, together with illustrative experimental results.

Submitted 16 November, 2022; originally announced November 2022.

arXiv:2205.07415 [pdf, ps, other]

Explosion of continuous-state branching processes with competition in Lévy environment

Authors: Rugang Ma, Xiaowen Zhou

Abstract: Using the Lyapunov criteria arguments, we find sufficient conditions on explosion/nonexplosion for continuous-state branching processes with competition in Lévy random environment. In particular, we identify the necessary and sufficient conditions on explosion/nonexplosion when the competition function is a power function and the Lévy measure of the associated branching mechanism is stable.

Submitted 23 August, 2022; v1 submitted 15 May, 2022; originally announced May 2022.

arXiv:2204.07895 [pdf, ps, other]

New conforming finite element divdiv complexes in three dimensions

Authors: Jun Hu, Yizhou Liang, Rui Ma, Min Zhang

Abstract: In this paper, the first family of conforming finite element divdiv complexes on cuboid grids in three dimensions is constructed. Besides, a new family of conforming finite element divdiv complexes with enhanced smoothness on tetrahedral grids is presented. These complexes are exact in the sense that the range of each discrete map is the kernel space of the succeeding one.

Submitted 16 April, 2022; originally announced April 2022.

arXiv:2202.10007 [pdf, other]

Statistical Inference for Genetic Relatedness Based on High-Dimensional Logistic Regression

Authors: Rong Ma, Zijian Guo, T. Tony Cai, Hongzhe Li

Abstract: This paper studies the problem of statistical inference for genetic relatedness between binary traits based on individual-level genome-wide association data. Specifically, under the high-dimensional logistic regression models, we define parameters characterizing the cross-trait genetic correlation, the genetic covariance and the trait-specific genetic variance. A novel weighted debiasing method is developed for the logistic Lasso estimator and computationally efficient debiased estimators are proposed. The rates of convergence for these estimators are studied and their asymptotic normality is established under mild conditions. Moreover, we construct confidence intervals and statistical tests for these parameters, and provide theoretical justifications for the methods, including the coverage probability and expected length of the confidence intervals, as well as the size and power of the proposed tests. Numerical studies are conducted under both model generated data and simulated genetic data to show the superiority of the proposed methods. By analyzing a real data set on autoimmune diseases, we demonstrate its ability to obtain novel insights about the shared genetic architecture between ten pediatric autoimmune diseases.

Submitted 5 October, 2022; v1 submitted 21 February, 2022; originally announced February 2022.

arXiv:2201.06438 [pdf, other]

Matrix Reordering for Noisy Disordered Matrices: Optimality and Computationally Efficient Algorithms

Authors: T. Tony Cai, Rong Ma

Abstract: Motivated by applications in single-cell biology and metagenomics, we investigate the problem of matrix reordering based on a noisy disordered monotone Toeplitz matrix model. We establish the fundamental statistical limit for this problem in a decision-theoretic framework and demonstrate that a constrained least squares estimator achieves the optimal rate. However, due to its computational complexity, we analyze a popular polynomial-time algorithm, spectral seriation, and show that it is suboptimal. To address this, we propose a novel polynomial-time adaptive sorting algorithm with guaranteed performance improvement. Simulations and analyses of two real single-cell RNA sequencing datasets demonstrate the superiority of our algorithm over existing methods.

Submitted 13 August, 2023; v1 submitted 17 January, 2022; originally announced January 2022.

Comments: accepted by IEEE Transactions on Information Theory

arXiv:2112.12426 [pdf, ps, other]

On the primes in floor function sets

Authors: Rong Ma, Jie Wu

Abstract: Let $[t]$ be the integral part of the real number $t$ and let $\mathbb{1}_{\mathbb{P}}$ be the characteristic function of the primes. Denote by $π_{\mathcal{G}}(x)$ the number of primes in the floor function set $\mathcal{G}(x) := \{[x/n] : 1 \le n \le x\}$ and by $S_{\mathbb{1}_{\mathbb{P}}}(x)$ the number of primes in the sequence $\{[x/n]\}_{n \ge 1}$. Very recently, Heyman proves

Submitted 29 December, 2021; v1 submitted 23 December, 2021; originally announced December 2021.
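
The two counting functions named in the abstract above can be evaluated directly for small arguments. This is a brute-force sketch for illustration only, assuming an integer argument `x`; the function names are hypothetical and nothing here reflects the methods of the paper. The first function counts distinct prime values of $[x/n]$ (the floor function set), the second counts indices $n$ for which $[x/n]$ is prime (the sequence, with multiplicity).

```python
def is_prime(n: int) -> bool:
    """Trial-division primality test, adequate for small inputs."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True


def floor_set_prime_count(x: int) -> int:
    """Number of primes among the distinct values floor(x / n), 1 <= n <= x."""
    values = {x // n for n in range(1, x + 1)}
    return sum(1 for v in values if is_prime(v))


def floor_sequence_prime_count(x: int) -> int:
    """Number of n >= 1 with floor(x / n) prime (terms with n > x are 0, never prime)."""
    return sum(1 for n in range(1, x + 1) if is_prime(x // n))


if __name__ == "__main__":
    # For x = 10 the values floor(10 / n) are 10, 5, 3, 2, 2, 1, 1, 1, 1, 1.
    print(floor_set_prime_count(10), floor_sequence_prime_count(10))
```

For $x = 10$ the distinct prime values are 5, 3 and 2, while the prime 2 occurs twice in the sequence, which is why the two counts differ.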

arXiv:2108.01540 [pdf, ps, other]

On the new identities of Dirichlet $L$-functions

Authors: Rong Ma, Jinglei Zhang, Yulong Zhang

Abstract: Let $q\ge3$ be an integer, $χ$ be a Dirichlet character modulo $q$, and $L(s,χ)$ denote the Dirichlet $L$-functions corresponding to $χ$. In this paper, we show some special function series, and give some new identities for the Dirichlet $L$-functions involving Gauss sums. In particular, we give specific identities for $L(2,χ)$.

Submitted 3 August, 2021; originally announced August 2021.

Comments: 10 pages

MSC Class: 11M20 ACM Class: F.2.0

arXiv:2107.00109 [pdf, other]

Adaptive Capped Least Squares

Authors: Qiang Sun, Rui Mao, Wen-Xin Zhou

Abstract: This paper proposes the capped least squares regression with an adaptive resistance parameter, hence the name, adaptive capped least squares regression. The key observation is, by taking the resistant parameter to be data dependent, the proposed estimator achieves full asymptotic efficiency without losing the resistance property: it achieves the maximum breakdown point asymptotically. Computationally, we formulate the proposed regression problem as a quadratic mixed integer programming problem, which becomes computationally expensive when the sample size gets large. The data-dependent resistant parameter, however, makes the loss function more convex-like for larger-scale problems. This makes a fast randomly initialized gradient descent algorithm possible for global optimization. Numerical examples indicate the superiority of the proposed estimator compared with classical methods. Three data applications to cancer cell lines, stationary background recovery in video surveillance, and blind image inpainting showcase its broad applicability.

Submitted 30 June, 2021; originally announced July 2021.

arXiv:2106.03344 [pdf, other]

Statistical Inference for High-Dimensional Linear Regression with Blockwise Missing Data

Authors: Fei Xue, Rong Ma, Hongzhe Li

Abstract: Blockwise missing data occurs frequently when we integrate multisource or multimodality data where different sources or modalities contain complementary information. In this paper, we consider a high-dimensional linear regression model with blockwise missing covariates and a partially observed response variable. Under this framework, we propose a computationally efficient estimator for the regression coefficient vector based on carefully constructed unbiased estimating equations and a blockwise imputation procedure, and obtain its rate of convergence. Furthermore, building upon an innovative projected estimating equation technique that intrinsically achieves bias-correction of the initial estimator, we propose a nearly unbiased estimator for each individual regression coefficient, which is asymptotically normally distributed under mild conditions. Based on these debiased estimators, asymptotically valid confidence intervals and statistical tests about each regression coefficient are constructed. Numerical studies and application analysis of the Alzheimer's Disease Neuroimaging Initiative data show that the proposed method performs better and benefits more from unsupervised samples than existing methods.

Submitted 28 June, 2023; v1 submitted 7 June, 2021; originally announced June 2021.

Comments: V2: 40 pages, 2 figures. Accepted at Statistica Sinica

arXiv:2105.07536 [pdf, other]

Theoretical Foundations of t-SNE for Visualizing High-Dimensional Clustered Data

Authors: T. Tony Cai, Rong Ma

Abstract: This paper investigates the theoretical foundations of the t-distributed stochastic neighbor embedding (t-SNE) algorithm, a popular nonlinear dimension reduction and data visualization method. A novel theoretical framework for the analysis of t-SNE based on the gradient descent approach is presented. For the early exaggeration stage of t-SNE, we show its asymptotic equivalence to power iterations based on the underlying graph Laplacian, characterize its limiting behavior, and uncover its deep connection to Laplacian spectral clustering, and fundamental principles including early stopping as implicit regularization. The results explain the intrinsic mechanism and the empirical benefits of such a computational strategy. For the embedding stage of t-SNE, we characterize the kinematics of the low-dimensional map throughout the iterations, and identify an amplification phase, featuring the intercluster repulsion and the expansive behavior of the low-dimensional map, and a stabilization phase. The general theory explains the fast convergence rate and the exceptional empirical performance of t-SNE for visualizing clustered data, brings forth interpretations of the t-SNE visualizations, and provides theoretical guidance for applying t-SNE and selecting its tuning parameters in various applications.

Submitted 31 October, 2022; v1 submitted 16 May, 2021; originally announced May 2021.

Comments: Accepted by Journal of Machine Learning Research

arXiv:2104.00216 [pdf, ps, other]

On the difference between a D. H. Lehmer number and its inverse over short interval

Authors: Yana Niu, Rong Ma, Haodong Wang

Abstract: Let $q>2$ be an odd integer. For each integer $x$ with $0<x<q$ and $(q,x)= 1$, we know that there exists one and only one $\bar{x}$ with $0<\bar{x}<q$ such that $x\bar{x}\equiv1(\bmod q)$. A Lehmer number is defined to be any integer $a$ with $2\nmid(a+\bar{a})$. For any nonnegative integer $k$, let $$ M(x,q,k)=\displaystyle\mathop {\displaystyle\mathop{\sum{'}}_{a=1}^{q} \displaystyle\mathop{\sum{'}}_{b\leq xq}}_{\mbox{$\tiny\begin{array}{c} 2|a+b+1\\ ab\equiv1(\bmod q)\end{array}$}}(a-b)^{2k}.$$ The main purpose of this paper is to study the properties of $M(x,q,k)$, and give a sharp asymptotic formula, by using estimates of Kloosterman's sums and properties of trigonometric sums.
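The definitions in this abstract can be checked by brute force for small $q$. The following is a minimal sketch, not taken from the paper. The function names are hypothetical, and two things are assumed: that $0<x\le 1$, and that the primed sums only restrict to residues coprime to $q$ (which the condition $ab\equiv 1 \pmod q$ already forces).

```python
from math import floor, gcd


def lehmer_numbers(q):
    """Return every a with 0 < a < q, gcd(a, q) = 1 and a + a_inv odd."""
    if q <= 2 or q % 2 == 0:
        raise ValueError("q must be an odd integer greater than 2")
    found = []
    for a in range(1, q):
        if gcd(a, q) != 1:
            continue
        a_inv = pow(a, -1, q)  # modular inverse (Python 3.8+)
        if (a + a_inv) % 2 == 1:
            found.append(a)
    return found


def short_interval_moment(x, q, k):
    """Brute-force M(x, q, k): sum of (a - b)^(2k) over b <= x*q.

    Assumption: 0 < x <= 1, so each admissible b has exactly one
    partner a in [1, q], namely its inverse modulo q.
    """
    total = 0
    for b in range(1, floor(x * q) + 1):
        if gcd(b, q) != 1:
            continue
        a = pow(b, -1, q)
        if (a + b + 1) % 2 == 0:  # the condition 2 | a + b + 1
            total += (a - b) ** (2 * k)
    return total


if __name__ == "__main__":
    print(lehmer_numbers(9))
    print(short_interval_moment(0.5, 9, 1))
```

With $k=0$ and $x=1$ the moment simply counts the Lehmer numbers modulo $q$, which gives a quick consistency check between the two functions.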

Submitted 31 March, 2021; originally announced April 2021.

Comments: 13 pages, no figures

MSC Class: 11L05 ACM Class: B.2

arXiv:2103.00088 [pdf, ps, other]

Conforming finite element DIVDIV complexes and the application for the linearized Einstein-Bianchi system

Authors: Jun Hu, Yizhou Liang, Rui Ma

Abstract: This paper presents the first family of conforming finite element divdiv complexes on tetrahedral grids in three dimensions. In these complexes, finite element spaces of $H(\text{divdiv},Ω;\mathbb{S})$ are from a current preprint [Chen and Huang, arXiv: 2007.12399, 2020] while finite element spaces of both $H(\text{symcurl},Ω;\mathbb{T})$ and $H^1(Ω;\mathbb{R}^3)$ are newly constructed here. It is proved that these finite element complexes are exact. As a result, they can be used to discretize the linearized Einstein-Bianchi system within the dual formulation.

Submitted 26 February, 2021; originally announced March 2021.

arXiv:2102.05839 [pdf, ps, other]

Distribution of Eigenvalues of Matrix Ensembles arising from Wigner and Palindromic Toeplitz Blocks

Authors: Keller Blackwell, Neelima Borade, Arup Bose, Charles Devlin VI, Noah Luntzlara, Renyuan Ma, Steven J. Miller, Soumendu Sundar Mukherjee, Mengxi Wang, Wanqiao Xu

Abstract: Random Matrix Theory (RMT) has successfully modeled diverse systems, from energy levels of heavy nuclei to zeros of $L$-functions; this correspondence has allowed RMT to successfully predict many number theoretic behaviors. However, there are some operations which to date have no RMT analogue. Our motivation is to find an RMT analogue of Rankin-Selberg convolution, which constructs a new $L$-function from an input pair. We report one such attempt; while it does not appear to model convolution, it does create new ensembles with properties hybridizing those of its constituents. For definiteness we concentrate on the ensemble of palindromic real symmetric Toeplitz (PST) matrices and the ensemble of real symmetric matrices, whose limiting spectral measures are the Gaussian and semi-circular distributions, respectively; these were chosen as they are the two extreme cases in terms of moment calculations. For a PST matrix $A$ and a real symmetric matrix $B$, we construct an ensemble of random real symmetric block matrices whose first row is $\lbrace A, B \rbrace$ and whose second row is $\lbrace B, A \rbrace$. By Markov's Method of Moments and the use of free probability, we show this ensemble converges weakly and almost surely to a new, universal distribution with a hybrid of Gaussian and semi-circular behaviors. We extend this construction by considering an iterated concatenation of matrices from an arbitrary pair of random real symmetric sub-ensembles with different limiting spectral measures.
We prove that finite iterations converge to new, universal distributions with hybrid behavior, and that infinite iterations converge to the limiting spectral measure of the dominant component matrix.
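The block construction described in this abstract can be sampled numerically. The sketch below is an illustration only, not the authors' code. The function names, the standard normal entries, and the $\sqrt{N}$ scaling in the demo are assumptions, since the abstract does not state the entry distribution or the normalization.

```python
import numpy as np


def palindromic_symmetric_toeplitz(n, rng):
    """Symmetric Toeplitz matrix whose first row reads the same both ways."""
    half = rng.standard_normal((n + 1) // 2)
    # Mirror the first half; drop the repeated middle entry when n is odd.
    first_row = np.concatenate([half, half[::-1][n % 2:]])
    lags = np.abs(np.subtract.outer(np.arange(n), np.arange(n)))
    return first_row[lags]  # entry (i, j) depends only on |i - j|


def real_symmetric(n, rng):
    """Real symmetric matrix with Gaussian entries (assumed distribution)."""
    g = rng.standard_normal((n, n))
    return (g + g.T) / np.sqrt(2.0)


def block_ensemble_sample(n, rng):
    """One 2n x 2n sample with block rows {A, B} and {B, A}."""
    a = palindromic_symmetric_toeplitz(n, rng)
    b = real_symmetric(n, rng)
    return np.block([[a, b], [b, a]])


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    m = block_ensemble_sample(200, rng)
    # Scaling by sqrt(N) is an assumed normalization for plotting histograms.
    eigenvalues = np.linalg.eigvalsh(m) / np.sqrt(m.shape[0])
    print(eigenvalues.min(), eigenvalues.max())
```

A useful sanity check on any sample is that the spectrum of a block matrix with rows $\lbrace A, B \rbrace$ and $\lbrace B, A \rbrace$ is the union of the spectra of $A+B$ and $A-B$.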

Submitted 10 February, 2021; originally announced February 2021.

Comments: 14 pages, 5 figures. arXiv admin note: text overlap with arXiv:1908.03834

MSC Class: 15A52 (primary); 60F99; 62H10 (secondary)

arXiv:2101.00621 [pdf, other]

Verifying Global Optimality of Candidate Solutions to Polynomial Optimization Problems using a Determinant Relaxation Hierarchy

Authors: Sikun Xu, Ruoyi Ma, Daniel K. Molzahn, Hassan Hijazi, Cédric Josz

Abstract: We propose a method for verifying that a given feasible point for a polynomial optimization problem is globally optimal. The approach relies on the Lasserre hierarchy and the result of Lasserre regarding the importance of the convexity of the feasible set as opposed to that of the individual constraints. By focusing solely on certifying global optimality and relaxing the Lasserre hierarchy using necessary conditions for positive semidefiniteness based on matrix determinants, the proposed method is implementable as a computationally tractable linear program. We demonstrate this method via application to several instances of polynomial optimization, including the optimal power flow problem used to operate electric power systems.

Submitted 3 January, 2021; originally announced January 2021.

Comments: 6 pages, 4 figures

arXiv:2010.15267 [pdf, other]

A Parameter-free and Projection-free Restarting Level Set Method for Adaptive Constrained Convex Optimization Under the Error Bound Condition

Authors: Qihang Lin, Runchao Ma, Selvaprabu Nadarajah, Negar Soheili

Abstract: Recent efforts to accelerate first-order methods have focused on convex optimization problems that satisfy a geometric property known as the error-bound condition, which covers a broad class of problems, including piece-wise linear programs and strongly convex programs. Parameter-free first-order methods that employ projection-free updates have the potential to broaden the benefit of acceleration. Such a method has been developed for unconstrained convex optimization but is lacking for general constrained convex optimization. We propose a parameter-free level-set method for the latter constrained case based on projection-free subgradient descent that exhibits accelerated convergence for problems that satisfy an error-bound condition. Our method maintains a separate copy of the level-set sub-problem for each level parameter value and restarts the computation of these copies based on objective function progress. Applying such a restarting scheme in a level-set context is novel and results in an algorithm that dynamically adapts the precision of each copy. This property is key to extending prior restarting methods based on static precision that have been proposed for unconstrained convex optimization to handle constraints. We report promising numerical performance relative to benchmark methods.

Submitted 29 September, 2022; v1 submitted 28 October, 2020; originally announced October 2020.

arXiv:2010.02638 [pdf, ps, other]

A family of mixed finite elements for the biharmonic equations on triangular and tetrahedral grids

Authors: Jun Hu, Rui Ma, Min Zhang

Abstract: This paper introduces a new family of mixed finite elements for solving a mixed formulation of the biharmonic equations in two and three dimensions. The symmetric stress $\bm{σ}=-\nabla^{2}u$ is sought in the Sobolev space $H({\rm{div}}\bm{div},Ω;\mathbb{S})$ simultaneously with the displacement $u$ in $L^{2}(Ω)$. Stemming from the structure of $H(\bm{div},Ω;\mathbb{S})$ conforming elements for the linear elasticity problems proposed by J. Hu and S. Zhang, the $H({\rm{div}}\bm{div},Ω;\mathbb{S})$ conforming finite element spaces are constructed by imposing the normal continuity of $\bm{div}\bm{σ}$ on the $H(\bm{div},Ω;\mathbb{S})$ conforming spaces of $P_{k}$ symmetric tensors. The inheritance makes the basis functions easy to compute. The discrete spaces for $u$ are composed of the piecewise $P_{k-2}$ polynomials without requiring any continuity. Such mixed finite elements are inf-sup stable on both triangular and tetrahedral grids for $k\geq 3$, and the optimal order of convergence is achieved. Besides, the superconvergence and the postprocessing results are displayed. Some numerical experiments are provided to demonstrate the theoretical analysis.

Submitted 24 June, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

Comments: 27 pages, 3 figures. Accepted by SCI China Math

arXiv:2005.01272 [pdf, ps, other]

doi 10.1016/j.jmaa.2020.124771

Variations of Andrews-Beck type congruences

Authors: Song Heng Chan, Renrong Mao, Robert Osburn

Abstract: We prove three variations of recent results due to Andrews on congruences for $NT(m,k,n)$, the total number of parts in the partitions of $n$ with rank congruent to $m$ modulo $k$. We also conjecture new congruences and relations for $NT(m,k,n)$ and for a related crank-type function.

Submitted 5 November, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

Comments: 15 pages, typos corrected, to appear in the Journal of Mathematical Analysis and Applications

MSC Class: 11P81; 05A17

Journal ref: Journal of Mathematical Analysis and Applications 495 (2021), no. 2, 124771

arXiv:2004.04856 [pdf, other]

doi 10.1093/biomet/asaa059

The Asymptotic Distribution of Modularity in Weighted Signed Networks

Authors: Rong Ma, Ian Barnett

Abstract: Modularity is a popular metric for quantifying the degree of community structure within a network. The distribution of the largest eigenvalue of a network's edge weight or adjacency matrix is well studied and is frequently used as a substitute for modularity when performing statistical inference. However, we show that the largest eigenvalue and modularity are asymptotically uncorrelated, which suggests the need for inference directly on modularity itself when the network size is large. To this end, we derive the asymptotic distributions of modularity in the case where the network's edge weight matrix belongs to the Gaussian Orthogonal Ensemble, and study the statistical power of the corresponding test for community structure under some alternative model. We empirically explore universality extensions of the limiting distribution and demonstrate the accuracy of these asymptotic distributions through type I error simulations. We also compare the empirical powers of the modularity based tests with some existing methods. Our method is then used to test for the presence of community structure in two real data applications.

Submitted 9 April, 2020; originally announced April 2020.

Journal ref: Biometrika (2020)

arXiv:2003.08062 [pdf, ps, other]

An adaptive finite element scheme for the Hellinger--Reissner elasticity mixed eigenvalue problem

Authors: Fleurianne Bertrand, Daniele Boffi, Rui Ma

Abstract: In this paper we study the approximation of eigenvalues arising from the mixed Hellinger--Reissner elasticity problem by using the simple finite element using partial relaxation of $C^0$ vertex continuity of stresses introduced recently by Jun Hu and Rui Ma. We prove that the method converges when a residual type error estimator is considered and that the estimator decays optimally with respect to the number of degrees of freedom.

Submitted 18 March, 2020; originally announced March 2020.

arXiv:2002.07624 [pdf, other]

Optimal Structured Principal Subspace Estimation: Metric Entropy and Minimax Rates

Authors: T. Tony Cai, Hongzhe Li, Rong Ma

Abstract: Driven by a wide range of applications, many principal subspace estimation problems have been studied individually under different structural constraints. This paper presents a unified framework for the statistical analysis of a general structured principal subspace estimation problem which includes as special cases non-negative PCA/SVD, sparse PCA/SVD, subspace constrained PCA/SVD, and spectral clustering. General minimax lower and upper bounds are established to characterize the interplay between the information-geometric complexity of the structural set for the principal subspaces, the signal-to-noise ratio (SNR), and the dimensionality. The results yield interesting phase transition phenomena concerning the rates of convergence as a function of the SNRs and the fundamental limit for consistent estimation. Applying the general results to the specific settings yields the minimax rates of convergence for those problems, including the previously unknown optimal rates for non-negative PCA/SVD, sparse SVD and subspace constrained PCA/SVD.

Submitted 16 November, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

arXiv:1912.13153 [pdf, ps, other]

On the mean value of the generalized Dirichlet L-functions with the weight of the Gauss Sums

Authors: Rong Ma, Yana Niu

Abstract: Let $q\ge3$ be an integer, $χ$ denote a Dirichlet character modulo $q$, for any real number $a\ge 0$, we define the generalized Dirichlet $L$-functions $$ L(s,χ,a)=\sum_{n=1}^{\infty}\frac{χ(n)}{(n+a)^s}, $$ where $s=σ+it$ with $σ>1$ and $t$ both real. It can be extended to all $s$ by analytic continuation. For any integer $m$, the famous Gauss sum $G(m,χ)$ is defined as follows: $$G(m,χ)=\sum_{a=1}^{q}χ(a)e\left(\frac{am}{q}\right), $$ where $e(y)=e^{2πiy}$. The main purpose of this paper is to use the analytic method to study the mean value properties of the generalized Dirichlet $L$-functions with the weight of the Gauss Sums, and obtain a sharp asymptotic formula.
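The two definitions in this abstract translate directly into a few lines of code. This is a minimal sketch, not from the paper. It assumes the character is supplied as a Python callable and uses the quadratic (Legendre) character modulo an odd prime as a hypothetical example; the series is truncated, so it only approximates $L(s,χ,a)$ for $σ>1$.

```python
import cmath


def legendre_character(p):
    """Quadratic character modulo an odd prime p, via Euler's criterion."""
    def chi(a):
        a %= p
        if a == 0:
            return 0
        return 1 if pow(a, (p - 1) // 2, p) == 1 else -1
    return chi


def gauss_sum(m, chi, q):
    """G(m, chi) = sum over a = 1..q of chi(a) * e(a*m/q), e(y) = exp(2*pi*i*y)."""
    return sum(chi(a) * cmath.exp(2j * cmath.pi * a * m / q)
               for a in range(1, q + 1))


def partial_generalized_L(s, chi, a, terms):
    """First `terms` terms of sum chi(n) / (n + a)^s (a truncation, not the limit)."""
    return sum(chi(n) / (n + a) ** s for n in range(1, terms + 1))


if __name__ == "__main__":
    chi5 = legendre_character(5)
    print(gauss_sum(1, chi5, 5))
    print(partial_generalized_L(2, chi5, 0.5, 10_000))
```

For the quadratic character modulo a prime $p$, the classical evaluation $|G(1,χ)|^2=p$ gives a convenient check on the implementation.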

Submitted 30 December, 2019; originally announced December 2019.

Comments: 14 pages

MSC Class: 11M20 ACM Class: F.2.0

arXiv:1911.12516 [pdf, other]

doi 10.1093/biomet/asaa082

Optimal Estimation of Bacterial Growth Rates Based on Permuted Monotone Matrix

Authors: Rong Ma, T. Tony Cai, Hongzhe Li

Abstract: Motivated by the problem of estimating the bacterial growth rates for genome assemblies from shotgun metagenomic data, we consider the permuted monotone matrix model $Y=ΘΠ+Z$, where $Y\in \mathbb{R}^{n\times p}$ is observed, $Θ\in \mathbb{R}^{n\times p}$ is an unknown approximately rank-one signal matrix with monotone rows, $Π\in \mathbb{R}^{p\times p}$ is an unknown permutation matrix, and $Z\in \mathbb{R}^{n\times p}$ is the noise matrix. This paper studies the estimation of the extreme values associated to the signal matrix $Θ$, including its first and last columns, as well as their difference. Treating these estimation problems as compound decision problems, minimax rate-optimal estimators are constructed using the spectral column sorting method. Numerical experiments through simulated and synthetic microbiome metagenomic data are presented, showing the superiority of the proposed methods over the alternatives. The methods are illustrated by comparing the growth rates of gut bacteria between inflammatory bowel disease patients and normal controls.

Submitted 26 August, 2020; v1 submitted 27 November, 2019; originally announced November 2019.

Journal ref: Biometrika (2020)

arXiv:1911.10604 [pdf, other]

doi 10.1080/01621459.2020.1713794

Optimal Permutation Recovery in Permuted Monotone Matrix Model

Authors: Rong Ma, T. Tony Cai, Hongzhe Li

Abstract: Motivated by recent research on quantifying bacterial growth dynamics based on genome assemblies, we consider a permuted monotone matrix model $Y=ΘΠ+Z$, where the rows represent different samples, the columns represent contigs in genome assemblies and the elements represent log-read counts after preprocessing steps and Guanine-Cytosine (GC) adjustment. In this model, $Θ$ is an unknown mean matrix with monotone entries for each row, $Π$ is a permutation matrix that permutes the columns of $Θ$, and $Z$ is a noise matrix. This paper studies the problem of estimation/recovery of $Π$ given the observed noisy matrix $Y$. We propose an estimator based on the best linear projection, which is shown to be minimax rate-optimal for both exact recovery, as measured by the 0-1 loss, and partial recovery, as quantified by the normalized Kendall's tau distance. Simulation studies demonstrate the superior empirical performance of the proposed estimator over alternative methods. We demonstrate the methods using a synthetic metagenomics dataset of 45 closely related bacterial species and a real metagenomic dataset to compare the bacterial growth dynamics between the responders and the non-responders of the IBD patients after 8 weeks of treatment.

Submitted 13 July, 2020; v1 submitted 24 November, 2019; originally announced November 2019.

Journal ref: Journal of the American Statistical Association, 2020

arXiv:1908.11518 [pdf, other]

Inexact Proximal-Point Penalty Methods for Constrained Non-Convex Optimization

Authors: Qihang Lin, Runchao Ma, Yangyang Xu

Abstract: In this paper, an inexact proximal-point penalty method is studied for constrained optimization problems, where the objective function is non-convex, and the constraint functions can also be non-convex. The proposed method approximately solves a sequence of subproblems, each of which is formed by adding to the original objective function a proximal term and quadratic penalty terms associated to the constraint functions. Under a weak-convexity assumption, each subproblem is made strongly convex and can be solved effectively to a required accuracy by an optimal gradient-based method. The computational complexity of the proposed method is analyzed separately for the cases of convex constraint and non-convex constraint. For both cases, the complexity results are established in terms of the number of proximal gradient steps needed to find an $\varepsilon$-stationary point. When the constraint functions are convex, we show a complexity result of $\tilde O(\varepsilon^{-5/2})$ to produce an $\varepsilon$-stationary point under Slater's condition. When the constraint functions are non-convex, the complexity becomes $\tilde O(\varepsilon^{-3})$ if a non-singularity condition holds on constraints and otherwise $\tilde O(\varepsilon^{-4})$ if a feasible initial solution is available.

Submitted 1 December, 2020; v1 submitted 29 August, 2019; originally announced August 2019.

Comments: submitted to journal; corrected a few ? in references

arXiv:1908.03834 [pdf, other]

Distribution of Eigenvalues of Random Real Symmetric Block Matrices

Authors: Keller Blackwell, Neelima Borade, Charles Devlin VI, Noah Luntzlara, Renyuan Ma, Steven J. Miller, Mengxi Wang, Wanqiao Xu

Abstract: Random Matrix Theory (RMT) has successfully modeled diverse systems, from energy levels of heavy nuclei to zeros of $L$-functions. Many statistics in one can be interpreted in terms of quantities of the other; for example, zeros of $L$-functions correspond to eigenvalues of matrices, and values of $L$-functions to values of the characteristic polynomials. This correspondence has allowed RMT to successfully predict many number theory behaviors; however, there are some operations which to date have no RMT analogue. The motivation of this paper is to try and find an RMT equivalent to Rankin-Selberg convolution, which builds a new $L$-function from an input pair. For definiteness we concentrate on two specific families, the ensemble of palindromic real symmetric Toeplitz (PST) matrices and the ensemble of real symmetric (RS) matrices, whose limiting spectral measures are the Gaussian and semicircle distributions, respectively; these were chosen as they are the two extreme cases in terms of moment calculations. For a PST matrix $A$ and an RS matrix $B$, we construct an ensemble of random real symmetric block matrices whose first row is $\lbrace A, B \rbrace$ and whose second row is $\lbrace B, A \rbrace$. By Markov's Method of Moments, we show this ensemble converges weakly and almost surely to a new, universal distribution with a hybrid of Gaussian and semicircle behaviors. We extend this construction by considering an iterated concatenation of matrices from an arbitrary pair of random real symmetric sub-ensembles with different limiting spectral measures.
We prove that finite iterations converge to new, universal distributions with hybrid behavior, and that infinite iterations converge to the limiting spectral measures of the component matrices.

Submitted 10 August, 2019; originally announced August 2019.

Comments: 49 pages, 5 figures, 3 tables

MSC Class: 15A52 (primary); 60F99; 62H10 (secondary)

arXiv:1908.01871 [pdf, ps, other]

Quadratically Regularized Subgradient Methods for Weakly Convex Optimization with Weakly Convex Constraints

Authors: Runchao Ma, Qihang Lin, Tianbao Yang

Abstract: Optimization models with non-convex constraints arise in many tasks in machine learning, e.g., learning with fairness constraints or Neyman-Pearson classification with non-convex loss. Although many efficient methods have been developed with theoretical convergence guarantees for non-convex unconstrained problems, it remains a challenge to design provably efficient algorithms for problems with non-convex functional constraints. This paper proposes a class of subgradient methods for constrained optimization where the objective function and the constraint functions are weakly convex and nonsmooth. Our methods solve a sequence of strongly convex subproblems, where a quadratic regularization term is added to both the objective function and each constraint function. Each subproblem can be solved by various algorithms for strongly convex optimization. Under a uniform Slater's condition, we establish the computation complexities of our methods for finding a nearly stationary point.

Submitted 23 March, 2023; v1 submitted 5 August, 2019; originally announced August 2019.

Comments: This article has been published in International Conference on Machine Learning (ICML), 2020. We didn't post the final version to arxiv soon after publication, which leads to the paper being cited under the old title and causes other confusion. We therefore update it in arxiv to avoid the issues of multiple versions

arXiv:1904.08401 [pdf, ps, other]

doi 10.30757/ALEA.v19-37

Complete convergence theorem for a two-level contact process

Authors: Ruibo Ma

Abstract: We study a two-level contact process. We think of fleas living on a species of animals. The animals are a supercritical contact process in $\mathbb{Z}^d$. The contact process acts as the random environment for the fleas. The fleas do not affect the animals, give birth at rate $μ$ when they are living on a host animal, and die at rate $δ$ when they do not have a host animal. The main result is that if the contact process is supercritical and the fleas survive then the complete convergence theorem holds. This is done using a block construction so as a corollary we conclude that the fleas die out at their critical value.

Submitted 14 May, 2022; v1 submitted 17 April, 2019; originally announced April 2019.

Journal ref: ALEA, Lat. Am. J. Probab. Math. Stat. 19, 943-955 (2022)

arXiv:1902.03846 [pdf, ps, other]

On asymptotic properties of the generalized Dirichlet $L$-functions

Authors: Rong Ma, Yana Niu, Yulong Zhang

Abstract: Let $q\ge3$ be an integer, $χ$ denote a Dirichlet character modulo $q$, for any real number $a\ge 0$, we define the generalized Dirichlet $L$-functions $$ L(s,χ,a)=\sum_{n=1}^{\infty}\frac{χ(n)}{(n+a)^s}, $$ where $s=σ+it$ with $σ>1$ and $t$ both real. It can be extended to all $s$ by analytic continuation. In this paper, we study the mean value properties of the generalized Dirichlet $L$-functions, and obtain several sharp asymptotic formulae by using an analytic method.

Submitted 11 February, 2019; originally announced February 2019.

Comments: 15 pages, accepted by IJNT

MSC Class: 11M20 ACM Class: F.2.2

Journal ref: completed in 2018

arXiv:1811.08671 [pdf, other]

A Structure-Preserving One-Sided Jacobi Method for Computing the SVD of a Quaternion Matrix

Authors: Ru-Ru Ma, Zheng-Jian Bai

Abstract: In this paper, we provide a structure-preserving one-sided cyclic Jacobi method for computing the singular value decomposition of a quaternion matrix. In this method, the columns of the quaternion matrix are orthogonalized in pairs by using a sequence of orthogonal JRS-symplectic Jacobi matrices to its real counterpart. The quadratic convergence is also established under some mild conditions. Numerical tests are reported to illustrate the efficiency of the proposed method.

Submitted 21 November, 2018; originally announced November 2018.
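
The pairwise column orthogonalization mentioned in the abstract above can be illustrated, for an ordinary real matrix, by the classical one-sided (Hestenes) Jacobi iteration. The sketch below is only that real-valued analogue, not the structure-preserving quaternion method of the paper; the function name `one_sided_jacobi`, the tolerance, and the sweep limit are assumptions made for illustration.

```python
import math

def one_sided_jacobi(A, tol=1e-12, max_sweeps=60):
    """Real one-sided cyclic Jacobi sketch (hypothetical helper).

    Returns (cols, sigma, V): the columns of A*V, their norms (the singular
    values, unsorted), and the accumulated rotations V stored column-wise.
    """
    m, n = len(A), len(A[0])
    cols = [[float(A[i][j]) for i in range(m)] for j in range(n)]
    V = [[1.0 if i == j else 0.0 for i in range(n)] for j in range(n)]
    for _ in range(max_sweeps):
        rotated = False
        for p in range(n - 1):
            for q in range(p + 1, n):
                alpha = sum(x * x for x in cols[p])
                beta = sum(y * y for y in cols[q])
                gamma = sum(x * y for x, y in zip(cols[p], cols[q]))
                # Skip pairs that are already numerically orthogonal.
                if abs(gamma) <= tol * math.sqrt(alpha * beta):
                    continue
                rotated = True
                # Rotation angle that makes columns p and q orthogonal.
                zeta = (beta - alpha) / (2.0 * gamma)
                t = math.copysign(1.0, zeta) / (abs(zeta) + math.hypot(1.0, zeta))
                c = 1.0 / math.hypot(1.0, t)
                s = c * t
                # Apply the same plane rotation to the data columns and to V.
                for M in (cols, V):
                    M[p], M[q] = (
                        [c * x - s * y for x, y in zip(M[p], M[q])],
                        [s * x + c * y for x, y in zip(M[p], M[q])],
                    )
        if not rotated:
            break
    sigma = [math.sqrt(sum(x * x for x in col)) for col in cols]
    return cols, sigma, V

A = [[3.0, 0.0], [4.0, 5.0]]
cols, sigma, V = one_sided_jacobi(A)
```

Once the columns are mutually orthogonal, their norms are the singular values and the normalized columns give the left singular vectors.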

arXiv:1808.09810 [pdf, ps, other]

Optimal Superconvergence Analysis for the Crouzeix-Raviart and the Morley elements

Authors: Jun Hu, Limin Ma, Rui Ma

Abstract: In this paper, an improved superconvergence analysis is presented for both the Crouzeix-Raviart element and the Morley element. The main idea of the analysis is to employ a discrete Helmholtz decomposition of the difference between the canonical interpolation and the finite element solution for the first order mixed Raviart--Thomas element and the mixed Hellan--Herrmann--Johnson element, respectively. This, in particular, allows for proving a full one order superconvergence result for these two mixed finite elements. Finally, a full one order superconvergence result of both the Crouzeix-Raviart element and the Morley element follows from their special relations with the first order mixed Raviart--Thomas element and the mixed Hellan--Herrmann--Johnson element respectively. Those superconvergence results are also extended to mildly-structured meshes.

Submitted 21 October, 2019; v1 submitted 27 August, 2018; originally announced August 2018.

Comments: 20 pages, 3 figures, 3 tables. arXiv admin note: text overlap with arXiv:1802.01896

arXiv:1808.08159 [pdf, other]

A heterogeneous spatial model in which savanna and forest coexist in a stable equilibrium

Authors: Rick Durrett, Ruibo Ma

Abstract: In work with a variety of co-authors, Staver and Levin have argued that savannah and forest coexist as alternative stable states with discontinuous changes in density of trees at the boundary. Here we formulate a nonhomogeneous spatial model of the competition between forest and savannah. We prove that coexistence occurs for a time that is exponential in the size of the system, and that after an initial transient, boundaries between the alternative equilibria remain stable.

Submitted 26 January, 2019; v1 submitted 24 August, 2018; originally announced August 2018.

Comments: 19 pages, 1 figure

MSC Class: 60K35; 92D40

arXiv:1808.02613 [pdf, ps, other]

Power domination in regular claw-free graphs

Authors: Changhong Lu, Rui Mao, Bing Wang

Abstract: In this paper, we first show that the power domination number of a connected $4$-regular claw-free graph on $n$ vertices is at most $\frac{n+1}{5}$, and the bound is sharp. The statement partly disproves the conjecture presented by Dorbec et al. in SIAM J. Discrete Math., 27:1559-1574, 2013. Then we present a dynamic programming style linear-time algorithm for the weighted power domination problem in trees.

Submitted 7 August, 2018; originally announced August 2018.

arXiv:1807.08090 [pdf, ps, other]

Partial relaxation of C^0 vertex continuity of stresses of conforming mixed finite elements for the elasticity problem

Authors: Jun Hu, Rui Ma

Abstract: A conforming triangular mixed element recently proposed by Hu and Zhang for linear elasticity is extended by rearranging the global degrees of freedom. More precisely, adaptive meshes $\mathcal{T}_1$, $\cdots$, $\mathcal{T}_N$ which are successively refined from an initial mesh $\mathcal{T}_0$ through a newest vertex bisection strategy, admit a crucial hierarchical structure, namely, a newly added vertex $\boldsymbol{x}$ of the mesh $\mathcal{T}_\ell$ is the midpoint of an edge $e$ of the coarse mesh $\mathcal{T}_{\ell-1}$. Such a hierarchical structure is explored to partially relax the $C^0$ vertex continuity of symmetric matrix-valued functions in the discrete stress space of the original element on $\mathcal{T}_\ell$ and results in an extended discrete stress space. A feature of this extended discrete stress space is its nestedness in the sense that a space on a coarse mesh $\mathcal{T}$ is a subspace of a space on any refinement $\hat{\mathcal{T}}$ of $\mathcal{T}$, which allows a proof of convergence of a standard adaptive algorithm. The idea is extended to impose a general traction boundary condition on the discrete level. Numerical experiments are provided to illustrate performance on both uniform and adaptive meshes.

Submitted 11 September, 2019; v1 submitted 21 July, 2018; originally announced July 2018.

arXiv:1805.06970 [pdf, other]

doi 10.1080/01621459.2019.1699421

Global and Simultaneous Hypothesis Testing for High-Dimensional Logistic Regression Models

Authors: Rong Ma, T. Tony Cai, Hongzhe Li

Abstract: High-dimensional logistic regression is widely used in analyzing data with binary outcomes. In this paper, global testing and large-scale multiple testing for the regression coefficients are considered in both single- and two-regression settings. A test statistic for testing the global null hypothesis is constructed using a generalized low-dimensional projection for bias correction and its asymptotic null distribution is derived. A lower bound for the global testing is established, which shows that the proposed test is asymptotically minimax optimal over some sparsity range. For testing the individual coefficients simultaneously, multiple testing procedures are proposed and shown to control the false discovery rate (FDR) and falsely discovered variables (FDV) asymptotically. Simulation studies are carried out to examine the numerical performance of the proposed tests and their superiority over existing methods. The testing procedures are also illustrated by analyzing a data set of a metabolomics study that investigates the association between fecal metabolites and pediatric Crohn's disease and the effects of treatment on such associations.

Submitted 19 November, 2020; v1 submitted 17 May, 2018; originally announced May 2018.

Comments: Typos corrected

Journal ref: Journal of the American Statistical Association (2019)

arXiv:1801.08120 [pdf, other]

doi 10.5705/ss.202019.0445

Optimal Estimation of Simultaneous Signals Using Absolute Inner Product with Applications to Integrative Genomics

Authors: Rong Ma, T. Tony Cai, Hongzhe Li

Abstract: Integrating the summary statistics from genome-wide association study (GWAS) and expression quantitative trait loci (eQTL) data provides a powerful way of identifying the genes whose expression levels are potentially associated with complex diseases. A parameter called $T$-score that quantifies the genetic overlap between a gene and the disease phenotype based on the summary statistics is introduced based on the mean values of two Gaussian sequences. Specifically, given two independent samples $\mathbf{x}_n\sim N(θ, Σ_1)$ and $\mathbf{y}_n\sim N(μ, Σ_2)$, the $T$-score is defined as $\sum_{i=1}^n |θ_iμ_i|$, a non-smooth functional, which characterizes the amount of shared signals between two absolute normal mean vectors $|θ|$ and $|μ|$. Using approximation theory, estimators are constructed and shown to be minimax rate-optimal and adaptive over various parameter spaces. Simulation studies demonstrate the superiority of the proposed estimators over existing methods. The method is applied to an integrative analysis of heart failure genomics datasets and we identify several genes and biological pathways that are potentially causal to human heart failure.

Submitted 4 October, 2020; v1 submitted 24 January, 2018; originally announced January 2018.

Journal ref: Statistica Sinica (2020)
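
As a small illustration of the $T$-score definition quoted in the abstract above, the following sketch evaluates $\sum_{i=1}^n |θ_iμ_i|$ for two known mean vectors. It computes the target functional only; it is not one of the paper's estimators, which work from noisy samples. The function name `t_score` and the example vectors are assumptions made for illustration.

```python
def t_score(theta, mu):
    """Evaluate sum_i |theta_i * mu_i| for two mean vectors of equal length."""
    if len(theta) != len(mu):
        raise ValueError("mean vectors must have the same length")
    return sum(abs(t * m) for t, m in zip(theta, mu))

# Hypothetical mean vectors: only coordinates 0 and 2 carry signal in both.
theta = [2.0, 0.0, -1.5, 0.0]
mu = [-1.0, 3.0, 2.0, 0.0]
score = t_score(theta, mu)  # |2*(-1)| + |0*3| + |(-1.5)*2| + |0*0|
```

A coordinate contributes only when both means are nonzero there, which is why the functional measures shared signal regardless of sign.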

arXiv:1701.02674 [pdf, ps, other]

Some new formulas for Appell series over finite fields

Authors: Long Li, Xin Li, Rui Mao

Abstract: In 1987 Greene introduced the notion of the finite field analogue of hypergeometric series. In this paper we give a finite field analogue of Appell series and obtain some transformation and reduction formulas. We also establish the generating functions for Appell series over finite fields.

Submitted 3 January, 2017; originally announced January 2017.

arXiv:1611.04293 [pdf, ps, other]

A Mathematic Expression of the Genes of Chinese Traditional Philosophy

Authors: Kegong Chen, Ruyun Ma

Abstract: We provide a mathematical model for the traditional Yin-and-Yang Double Fish Diagram, which comes from Chinese Traditional Philosophy.

Submitted 15 November, 2016; v1 submitted 14 November, 2016; originally announced November 2016.

Comments: 10 pages, 6 figures

arXiv:1604.07903 [pdf, ps, other]

Conforming mixed triangular prism and nonconforming mixed tetrahedral elements for the linear elasticity problem

Authors: Jun Hu, Rui Ma

Abstract: We propose two families of mixed finite elements for solving the classical Hellinger-Reissner mixed problem of the linear elasticity equations in three dimensions. First, a family of conforming mixed triangular prism elements is constructed by product of elements on triangular meshes and elements in one dimension. The well-posedness is established for all elements with $k\geq1$, which are of $k+1$ order convergence for both the stress and displacement. Besides, a family of reduced stress spaces is proposed by dropping the degrees of polynomial functions associated with faces. As a result, the lowest order conforming mixed triangular prism element has 93 plus 33 degrees of freedom on each element. Second, we construct a new family of nonconforming mixed tetrahedral elements. The shape function spaces of our stress spaces are different from those of the elements in the literature.

Submitted 26 April, 2016; originally announced April 2016.

MSC Class: 65N30; 73C02

Showing 1–50 of 83 results for author: Ma, R