Showing 1–50 of 131 results for author: Ma, C

Searching in archive math.
  1. arXiv:2406.13989  [pdf, other]

    stat.ML cs.IT cs.LG math.ST

    Random pairing MLE for estimation of item parameters in Rasch model

    Authors: Yuepeng Yang, Cong Ma

    Abstract: The Rasch model, a classical model in the item response theory, is widely used in psychometrics to model the relationship between individuals' latent traits and their binary responses on assessments or questionnaires. In this paper, we introduce a new likelihood-based estimator -- random pairing maximum likelihood estimator ($\mathsf{RP\text{-}MLE}$) and its bootstrapped variant multiple random pa…

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2405.20618  [pdf, other]

    math.NA cs.CG

    CPAFT: A Consistent Parallel Advancing Front Technique for Unstructured Triangular/Tetrahedral Mesh Generation

    Authors: Chengdi Ma, Jizu Huang, Hao Luo, Chao Yang

    Abstract: Compared with the remarkable progress made in parallel numerical solvers of partial differential equations, the development of algorithms for generating unstructured triangular/tetrahedral meshes has been relatively sluggish. In this paper, we propose a novel, consistent parallel advancing front technique (CPAFT) by combining the advancing front technique, the domain decomposition method based on s…

    Submitted 31 May, 2024; originally announced May 2024.

    MSC Class: 65M50; 65M55; 68W10

  3. arXiv:2404.15624  [pdf, other]

    math.NA

    A new framework of high-order unfitted finite element methods using ALE maps for moving-domain problems

    Authors: Wenhao Lu, Chuwen Ma, Weiying Zheng

    Abstract: As a sequel to our previous work [C. Ma, Q. Zhang and W. Zheng, SIAM J. Numer. Anal., 60 (2022)], [C. Ma and W. Zheng, J. Comput. Phys. 469 (2022)], this paper presents a generic framework of arbitrary Lagrangian-Eulerian unfitted finite element (ALE-UFE) methods for partial differential equations (PDEs) on time-varying domains. The ALE-UFE method has a great potential in developing high-order unf…

    Submitted 23 April, 2024; originally announced April 2024.

  4. arXiv:2403.19123  [pdf, ps, other]

    math.NA

    Schrödingerisation based computationally stable algorithms for ill-posed problems in partial differential equations

    Authors: Shi Jin, Nana Liu, Chuwen Ma

    Abstract: We introduce a simple and stable computational method for ill-posed partial differential equation (PDE) problems. The method is based on Schrödingerization, introduced in [S. Jin, N. Liu and Y. Yu, Phys. Rev. A, 108 (2023), 032603], which maps all linear PDEs into Schrödinger-type equations in one higher dimension, for quantum simulations of these PDEs. Although the original problem is ill-posed,…

    Submitted 8 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

  5. arXiv:2403.16714  [pdf, other]

    math.NA

    A Mixed Multiscale Spectral Generalized Finite Element Method

    Authors: Christian Alber, Chupeng Ma, Robert Scheichl

    Abstract: We present a multiscale mixed finite element method for solving second order elliptic equations with general $L^{\infty}$-coefficients arising from flow in highly heterogeneous porous media. Our approach is based on a multiscale spectral generalized finite element method (MS-GFEM) and exploits the superior local mass conservation properties of mixed finite elements. Following the MS-GFEM framework…

    Submitted 4 April, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  6. arXiv:2402.17732  [pdf, other]

    math.ST cs.LG stat.ML

    Batched Nonparametric Contextual Bandits

    Authors: Rong Jiang, Cong Ma

    Abstract: We study nonparametric contextual bandits under batch constraints, where the expected reward for each action is modeled as a smooth function of covariates, and the policy updates are made at the end of each batch of observations. We establish a minimax regret lower bound for this setting and propose a novel batch learning algorithm that achieves the optimal regret (up to logarithmic factors). In e…

    Submitted 10 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Add lower bound when grid is adaptively chosen; add results on adaptivity to margin parameter

  7. arXiv:2402.14696  [pdf, ps, other]

    math.NA

    On Schrödingerization based quantum algorithms for linear dynamical systems with inhomogeneous terms

    Authors: Shi Jin, Nana Liu, Chuwen Ma

    Abstract: We analyze the Schrödingerisation method for quantum simulation of a general class of non-unitary dynamics with inhomogeneous source terms. The Schrödingerisation technique, introduced in \cite{JLY22a,JLY23}, transforms any linear ordinary and partial differential equations with non-unitary dynamics into a system under unitary dynamics via a warped phase transition that maps the equations into a h…

    Submitted 27 February, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  8. arXiv:2402.07445  [pdf, other]

    stat.ML cs.IT cs.LG math.ST

    Top-$K$ ranking with a monotone adversary

    Authors: Yuepeng Yang, Antares Chen, Lorenzo Orecchia, Cong Ma

    Abstract: In this paper, we address the top-$K$ ranking problem with a monotone adversary. We consider the scenario where a comparison graph is randomly generated and the adversary is allowed to add arbitrary edges. The statistician's goal is then to accurately identify the top-$K$ preferred items based on pairwise comparisons derived from this semi-random comparison graph. The main contribution of this pap…

    Submitted 20 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted to Conference of Learning Theory, 2024

  9. arXiv:2402.00382  [pdf, other]

    math.ST stat.ML

    On the design-dependent suboptimality of the Lasso

    Authors: Reese Pathak, Cong Ma

    Abstract: This paper investigates the effect of the design matrix on the ability (or inability) to estimate a sparse parameter in linear regression. More specifically, we characterize the optimal rate of estimation when the smallest singular value of the design matrix is bounded away from zero. In addition to this information-theoretic result, we provide and analyze a procedure which is simultaneously stati…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 19 pages, 1 figure

  10. arXiv:2402.00305  [pdf, ps, other]

    math.ST cs.IT cs.SI stat.ML

    Information-Theoretic Thresholds for Planted Dense Cycles

    Authors: Cheng Mao, Alexander S. Wein, Shenduo Zhang

    Abstract: We study a random graph model for small-world networks which are ubiquitous in social and biological sciences. In this model, a dense cycle of expected bandwidth $n τ$, representing the hidden one-dimensional geometry of vertices, is planted in an ambient random graph on $n$ vertices. For both detection and recovery of the planted dense cycle, we characterize the information-theoretic thresholds i…

    Submitted 31 January, 2024; originally announced February 2024.

    Comments: 31 pages, 1 figure

    MSC Class: 94A15; 62B10; 68Q87; 05C80; 05C60

  11. arXiv:2311.17385  [pdf, ps, other]

    math.RA

    Invariants of Quantizations of Unimodular Quadratic Polynomial Poisson Algebras of Dimension 3

    Authors: Chengyuan Ma

    Abstract: Let $P = \Bbbk[x_1, x_2, x_3]$ be a unimodular quadratic Poisson algebra, with its Poisson bracket written as $\{x_i, x_j\} = \displaystyle{\sum_{k,l}c_{i,j}^{k,l}x_kx_l}$, $1 \leq i < j \leq 3$. Let $P_{\hbar}$ be the deformation quantization of $P$ constructed as follows:…

    Submitted 24 January, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

  12. arXiv:2311.15961  [pdf, ps, other]

    stat.ML cs.LG math.ST

    Maximum Likelihood Estimation is All You Need for Well-Specified Covariate Shift

    Authors: Jiawei Ge, Shange Tang, Jianqing Fan, Cong Ma, Chi Jin

    Abstract: A key challenge of modern machine learning systems is to achieve Out-of-Distribution (OOD) generalization -- generalizing to target data whose distribution differs from that of source data. Despite its significant importance, the fundamental question of ``what are the most effective algorithms for OOD generalization'' remains open even under the standard setting of covariate shift. This paper addr…

    Submitted 27 November, 2023; originally announced November 2023.

  13. arXiv:2311.08761  [pdf, other]

    math.NA

    A unified framework for multiscale spectral generalized FEMs and low-rank approximations to multiscale PDEs

    Authors: Chupeng Ma

    Abstract: This work presents an abstract framework for the design, implementation, and analysis of the multiscale spectral generalized finite element method (MS-GFEM), a particular numerical multiscale method originally proposed in [I. Babuska and R. Lipton, Multiscale Model.\;\,Simul., 9 (2011), pp.~373--406]. MS-GFEM is a partition of unity method employing optimal local approximation spaces constructed f…

    Submitted 15 March, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  14. arXiv:2310.06159  [pdf, other]

    cs.LG math.OC stat.ML

    Provably Accelerating Ill-Conditioned Low-rank Estimation via Scaled Gradient Descent, Even with Overparameterization

    Authors: Cong Ma, Xingyu Xu, Tian Tong, Yuejie Chi

    Abstract: Many problems encountered in science and engineering can be formulated as estimating a low-rank object (e.g., matrices and tensors) from incomplete, and possibly corrupted, linear measurements. Through the lens of matrix and tensor factorization, one of the most popular approaches is to employ simple iterative algorithms such as gradient descent (GD) to recover the low-rank factors directly, which…

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: Book chapter for "Explorations in the Mathematics of Data Science - The Inaugural Volume of the Center for Approximation and Mathematical Data Analytics". arXiv admin note: text overlap with arXiv:2104.14526

  15. arXiv:2309.00388  [pdf, ps, other]

    math.DG

    On conformally flat cubic metrics with weakly isotropic scalar curvature

    Authors: Cuiling Ma, Xiaoling Zhang

    Abstract: The conformal properties of metrics are meaningful in Riemannian and Finsler geometry, and cubic metrics are useful in physics and biology. In this paper, we study the conformally flat cubic metrics with weakly isotropic scalar curvature. We also prove that such metrics must be Minkowski metrics.

    Submitted 1 September, 2023; originally announced September 2023.

  16. arXiv:2308.08408  [pdf, ps, other]

    quant-ph math.QA

    Quantum simulation of Maxwell's equations via Schrödingersation

    Authors: Shi Jin, Nana Liu, Chuwen Ma

    Abstract: We present quantum algorithms for electromagnetic fields governed by Maxwell's equations. The algorithms are based on the Schrödingersation approach, which transforms any linear PDEs and ODEs with non-unitary dynamics into a system evolving under unitary dynamics, via a warped phase transformation that maps the equation into one higher dimension. In this paper, our quantum algorithms are based on…

    Submitted 16 August, 2023; originally announced August 2023.

  17. arXiv:2306.04162  [pdf, ps, other]

    math.AP

    On the Defocusing Cubic Nonlinear Wave Equation on $\mathbb{H}^3$ with Radial Initial Data in $H^{\frac{1}{2}+δ} \times H^{-\frac{1}{2}+δ}$

    Authors: Chutian Ma

    Abstract: In this paper we prove global well-posedness and scattering for the defocusing cubic nonlinear wave equation in the hyperbolic space $\mathbb{H}^3$, under the assumption that the initial data is radial and lies in $H^{\frac{1}{2}+δ}(\mathbb{H}^3)\times H^{-\frac{1}{2}+δ}(\mathbb{H}^3)$.

    Submitted 7 June, 2023; originally announced June 2023.

  18. arXiv:2306.03335  [pdf, other]

    stat.ML cs.LG math.ST

    Unraveling Projection Heads in Contrastive Learning: Insights from Expansion and Shrinkage

    Authors: Yu Gui, Cong Ma, Yiqiao Zhong

    Abstract: We investigate the role of projection heads, also known as projectors, within the encoder-projector framework (e.g., SimCLR) used in contrastive learning. We aim to demystify the observed phenomenon where representations learned before projectors outperform those learned after -- measured using the downstream linear classification accuracy, even when the projectors themselves are linear. In this…

    Submitted 5 June, 2023; originally announced June 2023.

  19. arXiv:2305.19001  [pdf, other]

    stat.ML cs.IT cs.LG math.OC math.ST

    High-probability sample complexities for policy evaluation with linear function approximation

    Authors: Gen Li, Weichen Wu, Yuejie Chi, Cong Ma, Alessandro Rinaldo, Yuting Wei

    Abstract: This paper is concerned with the problem of policy evaluation with linear function approximation in discounted infinite horizon Markov decision processes. We investigate the sample complexities required to guarantee a predefined estimation error of the best linear coefficients for two widely-used policy evaluation algorithms: the temporal difference (TD) learning algorithm and the two-timescale li…

    Submitted 2 May, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: The first two authors contributed equally; paper accepted to IEEE Transactions on Information Theory

  20. arXiv:2305.12467  [pdf, other]

    cs.LG math.OC

    Understanding Multi-phase Optimization Dynamics and Rich Nonlinear Behaviors of ReLU Networks

    Authors: Mingze Wang, Chao Ma

    Abstract: The training process of ReLU neural networks often exhibits complicated nonlinear phenomena. The nonlinearity of models and non-convexity of loss pose significant challenges for theoretical analysis. Therefore, most previous theoretical works on the optimization dynamics of neural networks focus either on local analysis (like the end of training) or approximate linear models (like Neural Tangent K…

    Submitted 27 December, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: 94 pages, NeurIPS 2023 Spotlight

  21. arXiv:2304.08135  [pdf, ps, other]

    cs.DS cs.CC math.ST stat.ML

    Detection of Dense Subhypergraphs by Low-Degree Polynomials

    Authors: Abhishek Dhawan, Cheng Mao, Alexander S. Wein

    Abstract: Detection of a planted dense subgraph in a random graph is a fundamental statistical and computational problem that has been extensively studied in recent years. We study a hypergraph version of the problem. Let $G^r(n,p)$ denote the $r$-uniform Erdős-Rényi hypergraph model with $n$ vertices and edge density $p$. We consider detecting the presence of a planted $G^r(n^γ, n^{-α})$ subhypergraph in a…

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 31 pages

  22. arXiv:2303.00852  [pdf, ps, other]

    math.AP

    On the Fourier Truncation Method for the Rough Data Cubic Defocusing NLW on $\mathbb{H}^3$

    Authors: Chutian Ma

    Abstract: In this paper, we study the cubic defocusing nonlinear wave equation on the three dimensional hyperbolic space. We use the Fourier truncation method to show that the equation is globally well-posed and scatters if the initial data lies in $H^s(\mathbb{H}^3)$, $s>\frac{182}{201}\approx 0.905$.

    Submitted 1 March, 2023; originally announced March 2023.

  23. arXiv:2302.13588  [pdf, ps, other]

    math.RA

    Invariants of Unimodular Quadratic Polynomial Poisson Algebras of Dimension 3

    Authors: Chengyuan Ma

    Abstract: Let $P = \Bbbk[x_1,x_2,x_3]$ be a unimodular quadratic Poisson algebra and let $G$ be a finite subgroup of the graded Poisson automorphism group of $P$. In this paper, we prove a variant of the Shephard-Todd-Chevalley theorem for $P$ and variants of the Shephard-Todd-Chevalley theorem and the Watanabe theorem for its Poisson enveloping algebra $U(P)$ under the induced group $\widetilde{G}$.

    Submitted 3 April, 2024; v1 submitted 27 February, 2023; originally announced February 2023.

  24. arXiv:2302.10066  [pdf, other]

    math.ST cs.LG stat.ML

    Sharp analysis of EM for learning mixtures of pairwise differences

    Authors: Abhishek Dhawan, Cheng Mao, Ashwin Pananjady

    Abstract: We consider a symmetric mixture of linear regressions with random samples from the pairwise comparison design, which can be seen as a noisy version of a type of Euclidean distance geometry problem. We analyze the expectation-maximization (EM) algorithm locally around the ground truth and establish that the sequence converges linearly, providing an $\ell_\infty$-norm guarantee on the estimation err…

    Submitted 22 June, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: 45 pages, 2 figures

  25. arXiv:2302.09940  [pdf]

    math.AT cs.CC

    Computing persistent homology by spanning trees and critical simplices

    Authors: Dinghua Shi, Zhifeng Chen, Chuang Ma, Guanrong Chen

    Abstract: Topological data analysis can extract effective information from higher-dimensional data. Its mathematical basis is persistent homology. The persistent homology can calculate topological features at different spatiotemporal scales of the dataset; that is, establishing the integrated taxonomic relation among points, lines and simplices. Here, the simplicial network composed of all-order simplices i…

    Submitted 27 September, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: 12 pages, 6 figures, 5 tables

    ACM Class: E.m; G.2.2; J.2

  26. arXiv:2302.06737  [pdf, ps, other]

    math.ST cs.DS stat.ML

    Detection-Recovery Gap for Planted Dense Cycles

    Authors: Cheng Mao, Alexander S. Wein, Shenduo Zhang

    Abstract: Planted dense cycles are a type of latent structure that appears in many applications, such as small-world networks in social sciences and sequence assembly in computational biology. We consider a model where a dense cycle with expected bandwidth $n τ$ and edge density $p$ is planted in an Erdős-Rényi graph $G(n,q)$. We characterize the computational thresholds for the associated detection and rec…

    Submitted 20 June, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: 41 pages, 1 figure

  27. arXiv:2302.01186  [pdf, other]

    cs.LG eess.SP math.OC stat.ML

    The Power of Preconditioning in Overparameterized Low-Rank Matrix Sensing

    Authors: Xingyu Xu, Yandi Shen, Yuejie Chi, Cong Ma

    Abstract: We propose $\textsf{ScaledGD($λ$)}$, a preconditioned gradient descent method to tackle the low-rank matrix sensing problem when the true rank is unknown, and when the matrix is possibly ill-conditioned. Using overparametrized factor representations, $\textsf{ScaledGD($λ$)}$ starts from a small random initialization, and proceeds by gradient descent with a specific form of damped preconditioning t…

    Submitted 6 November, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: New analysis in the noisy and the approximately low-rank settings

  28. arXiv:2211.16739  [pdf, other]

    cs.CV math.NA math.OC

    Quasi Non-Negative Quaternion Matrix Factorization with Application to Color Face Recognition

    Authors: Yifen Ke, Changfeng Ma, Zhigang Jia, Yajun Xie, Riwei Liao

    Abstract: To address the non-negativity dropout problem of quaternion models, a novel quasi non-negative quaternion matrix factorization (QNQMF) model is presented for color image processing. To implement QNQMF, the quaternion projected gradient algorithm and the quaternion alternating direction method of multipliers are proposed via formulating QNQMF as the non-convex constraint quaternion optimization pro…

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: 35 pages, 8 figures

  29. arXiv:2211.13893  [pdf, other]

    math.NA

    Scalable multiscale-spectral GFEM with an application to composite aero-structures

    Authors: Jean Bénézech, Linus Seelinger, Peter Bastian, Richard Butler, Timothy Dodwell, Chupeng Ma, Robert Scheichl

    Abstract: In this paper, the first large-scale application of multiscale-spectral generalized finite element methods (MS-GFEM) to composite aero-structures is presented. The crucial novelty lies in the introduction of A-harmonicity in the local approximation spaces, which in contrast to [Babuska, Lipton, Multiscale Model. Simul. 9, 2011] is enforced more efficiently via a constraint in the local eigenproble…

    Submitted 1 March, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

  30. arXiv:2211.06515  [pdf, other]

    cs.LG math.NA

    Multilevel-in-Layer Training for Deep Neural Network Regression

    Authors: Colin Ponce, Ruipeng Li, Christina Mao, Panayot Vassilevski

    Abstract: A common challenge in regression is that for many problems, the degrees of freedom required for a high-quality solution also allows for overfitting. Regularization is a class of strategies that seek to restrict the range of possible solutions so as to discourage overfitting while still enabling good solutions, and different regularization strategies impose different types of restrictions. In this…

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: 24 pages, 9 figures, submitted to Numerical Linear Algebra with Applications

  31. arXiv:2210.13760  [pdf, ps, other]

    math.AP

    A Scattering Result of the Radial Cubic Defocusing Schrödinger Equation on the 3d Hyperbolic Space

    Authors: Chutian Ma

    Abstract: In this paper, we study the defocusing cubic Schrödinger equation on three dimensional hyperbolic space $\mathbb{H}^3$ with radial initial data in the Sobolev Space $H^s(0<s<1)$. Our main result is that the initial value problem is globally wellposed and scatters for $\frac{15}{16}<s<1$. This is an extension of the work of Staffilani and Yu to the three dimensional hyperbolic space.

    Submitted 26 October, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

  32. arXiv:2210.02612  [pdf]

    eess.SY cs.AI cs.LG math.OC

    Lyapunov Function Consistent Adaptive Network Signal Control with Back Pressure and Reinforcement Learning

    Authors: Chaolun Ma, Bruce Wang, Zihao Li, Ahmadreza Mahmoudzadeh, Yunlong Zhang

    Abstract: In traffic signal control, flow-based (optimizing the overall flow) and pressure-based methods (equalizing and alleviating congestion) are commonly used but often considered separately. This study introduces a unified framework using Lyapunov control theory, defining specific Lyapunov functions respectively for these methods. We have found interesting results. For example, the well-recognized back…

    Submitted 16 January, 2024; v1 submitted 5 October, 2022; originally announced October 2022.

  33. arXiv:2209.12313  [pdf, other]

    cs.DS math.ST stat.ML

    Random graph matching at Otter's threshold via counting chandeliers

    Authors: Cheng Mao, Yihong Wu, Jiaming Xu, Sophie H. Yu

    Abstract: We propose an efficient algorithm for graph matching based on similarity scores constructed from counting a certain family of weighted trees rooted at each vertex. For two Erdős-Rényi graphs $\mathcal{G}(n,q)$ whose edges are correlated through a latent vertex correspondence, we show that this algorithm correctly matches all but a vanishing fraction of the vertices with high probability, provided…

    Submitted 13 February, 2023; v1 submitted 25 September, 2022; originally announced September 2022.

  34. arXiv:2209.01957  [pdf, other]

    math.NA

    Exponential convergence of a generalized FEM for heterogeneous reaction-diffusion equations

    Authors: Chupeng Ma, Jens Markus Melenk

    Abstract: A generalized finite element method is proposed for solving a heterogeneous reaction-diffusion equation with a singular perturbation parameter $\varepsilon$, based on locally approximating the solution on each subdomain by solution of a local reaction-diffusion equation and eigenfunctions of a local eigenproblem. These local problems are posed on some domains slightly larger than the subdomains wi…

    Submitted 8 September, 2022; v1 submitted 5 September, 2022; originally announced September 2022.

  35. arXiv:2209.00228  [pdf, ps, other]

    math.DS math.CA

    Dimensions of projected sets and measures on typical self-affine sets

    Authors: De-Jun Feng, Chiu-Hong Lo, Cai-Yun Ma

    Abstract: Let $T_1,\ldots, T_m$ be a family of $d\times d$ invertible real matrices with $\|T_i\|<1/2$ for $1\leq i\leq m$. For ${\bf a}=(a_1,\ldots, a_m)\in \Bbb R^{md}$, let $π^{\bf a}:\; Σ=\{1,\ldots, m\}^{\Bbb N}\to \Bbb R^d$ denote the coding map associated with the affine IFS $\{T_ix+a_i\}_{i=1}^m$. We show that for every Borel probability measure $μ$ on $Σ$, each of the following dimensions (lower an…

    Submitted 20 July, 2023; v1 submitted 1 September, 2022; originally announced September 2022.

    Comments: Minor changes. To appear in Adv. Math

    MSC Class: 28A80; 37C45; 31A15; 49Q15; 60B05

  36. arXiv:2208.07996  [pdf, other]

    stat.ME math.ST

    Correcting Convexity Bias in Function and Functional Estimate

    Authors: Chao Ma, Lexing Ying

    Abstract: A general framework with a series of different methods is proposed to improve the estimate of convex function (or functional) values when only noisy observations of the true input are available. Technically, our methods catch the bias introduced by the convexity and remove this bias from a baseline estimate. Theoretical analyses are conducted to show that the proposed methods can strictly reduce t…

    Submitted 14 September, 2022; v1 submitted 16 August, 2022; originally announced August 2022.

    MSC Class: 62G05; 65K99

  37. arXiv:2208.05308  [pdf, other]

    math.DS math.OC

    A dynamical system based on projection operator for solving absolute value equations associated with second-order cone

    Authors: Cairong Chen, Dongmei Yu, Deren Han, Changfeng Ma

    Abstract: A new equivalent reformulation of the absolute value equations associated with second-order cone (SOCAVEs) is emphasised, from which a dynamical system based on projection operator for solving SOCAVEs is constructed. Under proper assumptions, the equilibrium points of the dynamical system exist and could be (globally) asymptotically stable. Some numerical simulations are given to show the effectiv…

    Submitted 10 August, 2022; originally announced August 2022.

    Comments: 5 figures

  38. arXiv:2207.11719  [pdf, other]

    cs.LG math.OC

    Gradient-based Bi-level Optimization for Deep Learning: A Survey

    Authors: Can Chen, Xi Chen, Chen Ma, Zixuan Liu, Xue Liu

    Abstract: Bi-level optimization, especially the gradient-based category, has been widely used in the deep learning community including hyperparameter optimization and meta-knowledge extraction. Bi-level optimization embeds one problem within another and the gradient-based category solves the outer-level task by computing the hypergradient, which is much more efficient than classical methods such as the evol…

    Submitted 9 July, 2023; v1 submitted 24 July, 2022; originally announced July 2022.

    Comments: AI4Science; Bi-level Optimization; Hyperparameter Optimization; Meta Learning; Implicit Function

  39. arXiv:2207.06559  [pdf, other]

    cs.LG cs.AI cs.MA math.OC stat.ML

    Scalable Model-based Policy Optimization for Decentralized Networked Systems

    Authors: Yali Du, Chengdong Ma, Yuchen Liu, Runji Lin, Hao Dong, Jun Wang, Yaodong Yang

    Abstract: Reinforcement learning algorithms require a large amount of samples; this often limits their real-world applications on even simple tasks. Such a challenge is more outstanding in multi-agent tasks, as each step of operation is more costly requiring communications or shifting or resources. This work aims to improve data efficiency of multi-agent control by model-based learning. We consider networke…

    Submitted 1 September, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: 8 pages, 7 figures, accepted by The 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022)

  40. Optimal tuning-free convex relaxation for noisy matrix completion

    Authors: Yuepeng Yang, Cong Ma

    Abstract: This paper is concerned with noisy matrix completion--the problem of recovering a low-rank matrix from partial and noisy entries. Under uniform sampling and incoherence assumptions, we prove that a tuning-free square-root matrix completion estimator (square-root MC) achieves optimal statistical performance for solving the noisy matrix completion problem. Similar to the square-root Lasso estimator…

    Submitted 6 June, 2023; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: Accepted to IEEE Transactions on Information Theory

    Journal ref: IEEE Transactions on Information Theory, vol. 69, no. 10, pp. 6571-6585, Oct. 2023

  41. arXiv:2206.09109  [pdf, other]

    stat.ML cs.LG eess.SP math.OC

    Fast and Provable Tensor Robust Principal Component Analysis via Scaled Gradient Descent

    Authors: Harry Dong, Tian Tong, Cong Ma, Yuejie Chi

    Abstract: An increasing number of data science and machine learning problems rely on computation with tensors, which better capture the multi-way relationships and interactions of data than matrices. When tapping into this critical advantage, a key challenge is to develop computationally efficient and provably correct algorithms for extracting useful information from tensor data that are simultaneously robu…

    Submitted 22 February, 2023; v1 submitted 18 June, 2022; originally announced June 2022.

  42. arXiv:2206.06834  [pdf, other]

    eess.SY math.OC

    Distributed Coordination of Charging Stations Considering Aggregate EV Power Flexibility

    Authors: Dongxiang Yan, Chengbin Ma, Yue Chen

    Abstract: In recent years, electric vehicle (EV) charging stations have witnessed a rapid growth. However, effective management of charging stations is challenging due to individual EV owners' privacy concerns, competing interests of different stations, and the coupling distribution network constraints. To cope with this challenge, this paper proposes a two-stage scheme. In the first stage, the aggregate EV…

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: 11 pages, 14 figures

  43. arXiv:2206.02139  [pdf, other]

    cs.LG math.OC

    Early Stage Convergence and Global Convergence of Training Mildly Parameterized Neural Networks

    Authors: Mingze Wang, Chao Ma

    Abstract: The convergence of GD and SGD when training mildly parameterized neural networks starting from random initialization is studied. For a broad range of models and loss functions, including the most commonly used square loss and cross entropy loss, we prove an ``early stage convergence'' result. We show that the loss is decreased by a significant amount in the early stage of the training, and this de…

    Submitted 29 May, 2023; v1 submitted 5 June, 2022; originally announced June 2022.

    Comments: 73 pages

  44. arXiv:2205.02986  [pdf, other]

    math.ST cs.LG stat.ML

    Optimally tackling covariate shift in RKHS-based nonparametric regression

    Authors: Cong Ma, Reese Pathak, Martin J. Wainwright

    Abstract: We study the covariate shift problem in the context of nonparametric regression over a reproducing kernel Hilbert space (RKHS). We focus on two natural families of covariate shift problems defined using the likelihood ratios between the source and target distributions. When the likelihood ratios are uniformly bounded, we prove that the kernel ridge regression (KRR) estimator with a carefully chose…

    Submitted 6 June, 2023; v1 submitted 5 May, 2022; originally announced May 2022.

    Comments: to appear in the Annals of Statistics

  45. arXiv:2204.01057

    math.CO cs.DM cs.LG

    A Survey on Machine Learning Solutions for Graph Pattern Extraction

    Authors: Kai Siong Yow, Ningyi Liao, Siqiang Luo, Reynold Cheng, Chenhao Ma, Xiaolin Han

    Abstract: A subgraph is constructed by using a subset of vertices and edges of a given graph. There exist many graph properties that are hereditary for subgraphs. Hence, researchers from different communities have paid a great deal of attention in studying numerous subgraph problems, on top of the ordinary graph problems. Many algorithms are proposed in studying subgraph problems, where one common approach…

    Submitted 2 June, 2023; v1 submitted 3 April, 2022; originally announced April 2022.

    Comments: v1: 41 pages; v2: 40 pages ; v3: This version focuses on just subgraph problems (discussions on other classic graph problems can be found in the earlier versions)

    MSC Class: 05C90; 68M07; 68R10

  46. arXiv:2202.02837

    math.ST cs.LG stat.ML

    A new similarity measure for covariate shift with applications to nonparametric regression

    Authors: Reese Pathak, Cong Ma, Martin J. Wainwright

    Abstract: We study covariate shift in the context of nonparametric regression. We introduce a new measure of distribution mismatch between the source and target distributions that is based on the integrated ratio of probabilities of balls at a given radius. We use the scaling of this measure with respect to the radius to characterize the minimax rate of estimation over a family of Hölder continuous function…

    Submitted 6 February, 2022; originally announced February 2022.

    Comments: 22 pages, 2 figures, 1 table

  47. arXiv:2201.10219

    math.OA math.FA

    John-Nirenberg inequalities for noncommutative column BMO and Lipschitz martingales

    Authors: Guixiang Hong, Congbian Ma, Yu Wang

    Abstract: In this paper, we continue the study of John-Nirenberg theorems for BMO/Lipschitz spaces in the noncommutative martingale setting. As conjectured from the classical case, a desired noncommutative ``stopping time'' argument was discovered to obtain the distribution function inequality form of John-Nirenberg theorem. This not only provides another approach without using duality and interpolation to t…

    Submitted 20 May, 2023; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: There is something wrong in my paper

  48. A note on Hausdorff measures of self-similar sets in $\mathbb{R}^d$

    Authors: Cai-Yun Ma, Yu-Feng Wu

    Abstract: We prove that for all $s\in(0,d)$ and $c\in (0,1)$ there exists a self-similar set $E\subset \mathbb{R}^d$ with Hausdorff dimension $s$ such that $\mathcal{H}^s(E)=c|E|^s$. This answers a question raised by Zhiying Wen[16].

    Submitted 5 January, 2022; originally announced January 2022.

    MSC Class: 28A78; 28A80

    Journal ref: Ann. Fenn. Math. 46(2), 957--963, 2021

  49. arXiv:2112.14874

    math.PR math.ST

    Strong Local Nondeterminism and Exact Modulus of Continuity for Isotropic Gaussian Random Fields on Compact Two-Point Homogeneous Spaces

    Authors: Tianshi Lu, Chunsheng Ma, Yimin Xiao

    Abstract: This paper is concerned with sample path properties of isotropic Gaussian fields on compact two-point homogeneous spaces. In particular, we establish the property of strong local nondeterminism of an isotropic Gaussian field based on the high-frequency behavior of its angular power spectrum, and then exploit this result to establish an exact uniform modulus of continuity for its sample paths.

    Submitted 29 December, 2021; originally announced December 2021.

    MSC Class: 60G6; 60G17; 60G15; 42C40

  50. arXiv:2112.14864

    math.NA

    A high-order unfitted finite element method for moving interface problems

    Authors: Chuwen Ma, Weiying Zheng

    Abstract: We propose a $k^{\rm th}$-order unfitted finite element method ($2\le k\le 4$) to solve the moving interface problem of the Oseen equations. Thorough error estimates for the discrete solutions are presented by considering errors from interface-tracking, time integration, and spatial discretization. In literatures on time-dependent Stokes interface problems, error estimates for the discrete pressur…

    Submitted 29 December, 2021; originally announced December 2021.