Search | arXiv e-print repository

Volume preserving nonhomogeneous Gauss curvature flow in hyperbolic space

Authors: Yong Wei, Bo Yang, Tailong Zhou

Abstract: We consider the volume preserving flow of smooth, closed and convex hypersurfaces in the hyperbolic space $\mathbb{H}^{n+1}$ with speed given by a general nonhomogeneous function of the Gauss curvature. For a large class of speed functions, we prove that the solution of the flow remains convex, exists for all positive time $t\in [0,\infty)$ and converges to a geodesic sphere exponentially as… ▽ More We consider the volume preserving flow of smooth, closed and convex hypersurfaces in the hyperbolic space $\mathbb{H}^{n+1}$ with speed given by a general nonhomogeneous function of the Gauss curvature. For a large class of speed functions, we prove that the solution of the flow remains convex, exists for all positive time $t\in [0,\infty)$ and converges to a geodesic sphere exponentially as $t\to\infty$ in the smooth topology. A key step is to show the $L^1$ oscillation decay of the Gauss curvature to its average along a subsequence of times going to the infinity, which combined with an argument using the hyperbolic curvature measure theory implies the Hausdorff convergence. △ Less

Submitted 6 June, 2024; originally announced June 2024.

Comments: 27 pages. All comments are welcome. arXiv admin note: substantial text overlap with arXiv:2210.06035

MSC Class: 53E10; 53C42

arXiv:2405.08639 [pdf, ps, other]

Upwards homogeneity in iterated symmetric extensions

Authors: Calliope Ryan-Smith, Jonathan Schilhan, Yujun Wei

Abstract: It is sometimes desirable in choiceless constructions of set theory that one iteratively extends some ground model without adding new sets of ordinals after the first extension. Pushing this further, one may wish to have models $V \subseteq M \subseteq N$ of $\mathsf{ZF}$ such that $N$ contains no subsets of $V$ that do not already appear in $M$. We isolate, in the case that $M$ and $N$ are symmet… ▽ More It is sometimes desirable in choiceless constructions of set theory that one iteratively extends some ground model without adding new sets of ordinals after the first extension. Pushing this further, one may wish to have models $V \subseteq M \subseteq N$ of $\mathsf{ZF}$ such that $N$ contains no subsets of $V$ that do not already appear in $M$. We isolate, in the case that $M$ and $N$ are symmetric extensions (particular inner models of a generic extension of $V$), the exact conditions that cause this behaviour and show how it can broadly be applied to many known constructions. We call this behaviour upwards homogeneity. △ Less

Submitted 14 May, 2024; originally announced May 2024.

Comments: 16 pages, 1 figure

MSC Class: 03E25 (Primary) 03E35; 03E40 (Secondary)

arXiv:2405.07147 [pdf, ps, other]

Randomized algorithms for computing the tensor train approximation and their applications

Authors: Maolin Che, Yimin Wei, Hong Yan

Abstract: In this paper, we focus on the fixed TT-rank and precision problems of finding an approximation of the tensor train (TT) decomposition of a tensor. Note that the TT-SVD and TT-cross are two well-known algorithms for these two problems. Firstly, by combining the random projection technique with the power scheme, we obtain two types of randomized algorithms for the fixed TT-rank problem. Secondly, b… ▽ More In this paper, we focus on the fixed TT-rank and precision problems of finding an approximation of the tensor train (TT) decomposition of a tensor. Note that the TT-SVD and TT-cross are two well-known algorithms for these two problems. Firstly, by combining the random projection technique with the power scheme, we obtain two types of randomized algorithms for the fixed TT-rank problem. Secondly, by using the non-asymptotic theory of sub-random Gaussian matrices, we derive the upper bounds of the proposed randomized algorithms. Thirdly, we deduce a new deterministic strategy to estimate the desired TT-rank with a given tolerance and another adaptive randomized algorithm that finds a low TT-rank representation satisfying a given tolerance, and is beneficial when the target TT-rank is not known in advance. We finally illustrate the accuracy of the proposed algorithms via some test tensors from synthetic and real databases. In particular, for the fixed TT-rank problem, the proposed algorithms can be several times faster than the TT-SVD, and the accuracy of the proposed algorithms and the TT-SVD are comparable for several test tensors. △ Less

Submitted 11 May, 2024; originally announced May 2024.

Comments: 43 pages, 9 figures and 4 tables

MSC Class: 15A18; 15A69; 65F55; 68W20

arXiv:2404.13525 [pdf, other]

QR Decomposition of Dual Matrices and its Application to Traveling Wave Identification in the Brain

Authors: Renjie Xu, Tong Wei, Yimin Wei, Pengpeng Xie

Abstract: Matrix decompositions in dual number representations have played an important role in fields such as kinematics and computer graphics in recent years. In this paper, we present a QR decomposition algorithm for dual number matrices, specifically geared towards its application in traveling wave identification, utilizing the concept of proper orthogonal decomposition. When dealing with large-scale pr… ▽ More Matrix decompositions in dual number representations have played an important role in fields such as kinematics and computer graphics in recent years. In this paper, we present a QR decomposition algorithm for dual number matrices, specifically geared towards its application in traveling wave identification, utilizing the concept of proper orthogonal decomposition. When dealing with large-scale problems, we present explicit solutions for the QR, thin QR, and randomized QR decompositions of dual number matrices, along with their respective algorithms with column pivoting. The QR decomposition of dual matrices is an accurate first-order perturbation, with the Q-factor satisfying rigorous perturbation bounds, leading to enhanced orthogonality. In numerical experiments, we discuss the suitability of different QR algorithms when confronted with various large-scale dual matrices, providing their respective domains of applicability. Subsequently, we employed the QR decomposition of dual matrices to compute the DMPGI, thereby attaining results of higher precision. Moreover, we apply the QR decomposition in the context of traveling wave identification, employing the notion of proper orthogonal decomposition to perform a validation analysis of large-scale functional magnetic resonance imaging (fMRI) data for brain functional circuits. Our approach significantly improves the identification of two types of wave signals compared to previous research, providing empirical evidence for cognitive neuroscience theories. △ Less

Submitted 21 April, 2024; originally announced April 2024.

arXiv:2404.11991 [pdf, ps, other]

Pohozaev identities and Kelvin transformation of semilinear Grushin equation

Authors: Yawei Wei, Xiaodong Zhou

Abstract: In this paper, we study Pohozaev identities, Kelvin transformation and their applications of semilinear Grushin equation. First, we establish two Pohozaev identities generated from translations and determine the location of the concentration point for solution of a kind of Grushin equation by such identities. Next, we establish Pohozaev identity generated from scaling and prove the nonexistence of… ▽ More In this paper, we study Pohozaev identities, Kelvin transformation and their applications of semilinear Grushin equation. First, we establish two Pohozaev identities generated from translations and determine the location of the concentration point for solution of a kind of Grushin equation by such identities. Next, we establish Pohozaev identity generated from scaling and prove the nonexistence of nontrivial solutions of another kind of Grushin equation by such identity. Finally, we provide the change of Grushin operator by Kelvin transformation and obtain the decay rate of solution at infinity for a critical Grushin equation by Kelvin transformation. △ Less

Submitted 18 April, 2024; originally announced April 2024.

MSC Class: 35J70; 35A22; 35B40

arXiv:2404.10298 [pdf, ps, other]

Anisotropic Gauss curvature flow of complete non-compact graphs

Authors: Shu**g Pan, Yong Wei

Abstract: In this paper, we consider the anisotropic $α$-Gauss curvature flow for complete noncompact convex hypersurfaces in the Euclidean space with the anisotropy determined by a smooth closed uniformly convex Wulff shape. We show that for all positive power $α>0$, if the initial hypersurface is complete noncompact and locally uniformly convex, then the solution of the flow exists for all positive time. In this paper, we consider the anisotropic $α$-Gauss curvature flow for complete noncompact convex hypersurfaces in the Euclidean space with the anisotropy determined by a smooth closed uniformly convex Wulff shape. We show that for all positive power $α>0$, if the initial hypersurface is complete noncompact and locally uniformly convex, then the solution of the flow exists for all positive time. △ Less

Submitted 16 April, 2024; originally announced April 2024.

Comments: 23 pages. All comments are welcome

MSC Class: 53C44; 53C42

arXiv:2404.08196 [pdf, ps, other]

Properties of fractional p-Laplace equations with sign-changing potential

Authors: Yubo Duan, Yawei Wei

Abstract: In this paper, we consider the nonlinear equation involving the fractional p-Laplacian with sign-changing potential. This model draws inspiration from De Giorgi Conjecture. There are two main results in this paper. Firstly, we obtain that the solution is radially symmetric within the bounded domain, by applying the moving plane method. Secondly, by exploiting the idea of the sliding method, we con… ▽ More In this paper, we consider the nonlinear equation involving the fractional p-Laplacian with sign-changing potential. This model draws inspiration from De Giorgi Conjecture. There are two main results in this paper. Firstly, we obtain that the solution is radially symmetric within the bounded domain, by applying the moving plane method. Secondly, by exploiting the idea of the sliding method, we construct the appropriate auxiliary functions to prove that the solution is monotone increasing in some direction in the unbounded domain. The different properties of the solution in bounded and unbounded domains are mainly attributed to the inherent non-locality of the fractional p-Laplacian. △ Less

Submitted 11 April, 2024; originally announced April 2024.

MSC Class: 35R11; 35J60; 35B40

arXiv:2404.08192 [pdf, ps, other]

Wellposedness of the Master Equation for Mean Field Games with Grushin Type Diffusion

Authors: Yiming Jiang, Yawei Wei, Yiyun Yang

Abstract: We study the wellposedness of the master equation for a second-order mean field games with the Grushin type diffusion. In order to do this, we obtain the properties of its solution by investigating a degenerate mean field games system for which there exists an equivalent characterization with the master equation. The crucial points of this paper are to explore some regularities of solutions to two… ▽ More We study the wellposedness of the master equation for a second-order mean field games with the Grushin type diffusion. In order to do this, we obtain the properties of its solution by investigating a degenerate mean field games system for which there exists an equivalent characterization with the master equation. The crucial points of this paper are to explore some regularities of solutions to two types of linear degenerate partial differential equations and a kind of degenerate linear coupled system so as to derive the existence of solutions to the master equation. △ Less

Submitted 11 April, 2024; originally announced April 2024.

arXiv:2404.06372 [pdf, ps, other]

ABP estimate and comparison principle for cone degenerate quasilinear elliptic equations

Authors: Hua Chen, Jiangtao Hu, Xiaochun Liu, Yawei Wei, Mengnan Zhang

Abstract: In this paper, we study the cone degenerate quasilinear elliptic equations. We provide the existence of the viscosity solutions by proving Alexandrov-Bakelman-Pucci and Hölder estimates. Further more, we give the comparison principle by an equivalent transformation. In this paper, we study the cone degenerate quasilinear elliptic equations. We provide the existence of the viscosity solutions by proving Alexandrov-Bakelman-Pucci and Hölder estimates. Further more, we give the comparison principle by an equivalent transformation. △ Less

Submitted 9 April, 2024; originally announced April 2024.

arXiv:2404.05251 [pdf, ps, other]

Association schemes arising from non-weakly regular bent functions

Authors: Yadi Wei, Jiaxin Wang, Fang-Wei Fu

Abstract: Association schemes play an important role in algebraic combinatorics and have important applications in coding theory, graph theory and design theory. The methods to construct association schemes by using bent functions have been extensively studied. Recently, in [13], {Ö}zbudak and Pelen constructed infinite families of symmetric association schemes of classes $5$ and $6$ by using ternary non-we… ▽ More Association schemes play an important role in algebraic combinatorics and have important applications in coding theory, graph theory and design theory. The methods to construct association schemes by using bent functions have been extensively studied. Recently, in [13], {Ö}zbudak and Pelen constructed infinite families of symmetric association schemes of classes $5$ and $6$ by using ternary non-weakly regular bent functions.They also stated that constructing $2p$-class association schemes from $p$-ary non-weakly regular bent functions is an interesting problem, where $p>3$ is an odd prime. In this paper, using non-weakly regular bent functions, we construct infinite families of symmetric association schemes of classes $2p$, $(2p+1)$ and $\frac{3p+1}{2}$ for any odd prime $p$. Fusing those association schemes, we also obtain $t$-class symmetric association schemes, where $t=4,5,6,7$. In addition, we give the sufficient and necessary conditions for the partitions $P$, $D$, $T$, $U$ and $V$ (defined in this paper) to induce symmetric association schemes. △ Less

Submitted 13 June, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

arXiv:2404.04062 [pdf, other]

Derivative-free tree optimization for complex systems

Authors: Ye Wei, Bo Peng, Ruiwen Xie, Yangtao Chen, Yu Qin, Peng Wen, Stefan Bauer, Po-Yen Tung

Abstract: A tremendous range of design tasks in materials, physics, and biology can be formulated as finding the optimum of an objective function depending on many parameters without knowing its closed-form expression or the derivative. Traditional derivative-free optimization techniques often rely on strong assumptions about objective functions, thereby failing at optimizing non-convex systems beyond 100 d… ▽ More A tremendous range of design tasks in materials, physics, and biology can be formulated as finding the optimum of an objective function depending on many parameters without knowing its closed-form expression or the derivative. Traditional derivative-free optimization techniques often rely on strong assumptions about objective functions, thereby failing at optimizing non-convex systems beyond 100 dimensions. Here, we present a tree search method for derivative-free optimization that enables accelerated optimal design of high-dimensional complex systems. Specifically, we introduce stochastic tree expansion, dynamic upper confidence bound, and short-range backpropagation mechanism to evade local optimum, iteratively approximating the global optimum using machine learning models. This development effectively confronts the dimensionally challenging problems, achieving convergence to global optima across various benchmark functions up to 2,000 dimensions, surpassing the existing methods by 10- to 20-fold. Our method demonstrates wide applicability to a wide range of real-world complex systems spanning materials, physics, and biology, considerably outperforming state-of-the-art algorithms. This enables efficient autonomous knowledge discovery and facilitates self-driving virtual laboratories. Although we focus on problems within the realm of natural science, the advancements in optimization techniques achieved herein are applicable to a broader spectrum of challenges across all quantitative disciplines. △ Less

Submitted 5 April, 2024; originally announced April 2024.

Comments: 39 pages, 3 figures

arXiv:2403.04892 [pdf, other]

Generalized Choi-Davis-Jensen's Operator Inequalities and Their Applications

Authors: Shih Yu Chang, Yimin Wei

Abstract: The original Choi-Davis-Jensen's inequality, with its wide-ranging applications in diverse scientific and engineering fields, has motivated researchers to explore generalizations. In this study, we extend Davis-Choi-Jensen's inequality by considering a nonlinear map instead of a normalized linear map and generalize operator convex function to any continuous function defined in a compact region. Th… ▽ More The original Choi-Davis-Jensen's inequality, with its wide-ranging applications in diverse scientific and engineering fields, has motivated researchers to explore generalizations. In this study, we extend Davis-Choi-Jensen's inequality by considering a nonlinear map instead of a normalized linear map and generalize operator convex function to any continuous function defined in a compact region. The Stone-Weierstrass theorem and Kantorovich function are instrumental in formulating and proving generalized Choi-Davis-Jensen's inequalities. Additionally, we present an application of this generalized inequality in the context of statistical physics. △ Less

Submitted 7 March, 2024; originally announced March 2024.

arXiv:2403.03852 [pdf, other]

Accelerating Convergence of Score-Based Diffusion Models, Provably

Authors: Gen Li, Yu Huang, Timofey Efimov, Yuting Wei, Yuejie Chi, Yuxin Chen

Abstract: Score-based diffusion models, while achieving remarkable empirical performance, often suffer from low sampling speed, due to extensive function evaluations needed during the sampling phase. Despite a flurry of recent activities towards speeding up diffusion generative modeling in practice, theoretical underpinnings for acceleration techniques remain severely limited. In this paper, we design novel… ▽ More Score-based diffusion models, while achieving remarkable empirical performance, often suffer from low sampling speed, due to extensive function evaluations needed during the sampling phase. Despite a flurry of recent activities towards speeding up diffusion generative modeling in practice, theoretical underpinnings for acceleration techniques remain severely limited. In this paper, we design novel training-free algorithms to accelerate popular deterministic (i.e., DDIM) and stochastic (i.e., DDPM) samplers. Our accelerated deterministic sampler converges at a rate $O(1/{T}^2)$ with $T$ the number of steps, improving upon the $O(1/T)$ rate for the DDIM sampler; and our accelerated stochastic sampler converges at a rate $O(1/T)$, outperforming the rate $O(1/\sqrt{T})$ for the DDPM sampler. The design of our algorithms leverages insights from higher-order approximation, and shares similar intuitions as popular high-order ODE solvers like the DPM-Solver-2. Our theory accommodates $\ell_2$-accurate score estimates, and does not require log-concavity or smoothness on the target distribution. △ Less

Submitted 6 March, 2024; originally announced March 2024.

Comments: The first two authors contributed equally

arXiv:2402.16213 [pdf, ps, other]

Sparse gradient bounds for divergence form elliptic equations

Authors: Olli Saari, Hua-Yang Wang, Yuanhong Wei

Abstract: We provide sparse estimates for gradients of solutions to divergence form elliptic partial differential equations in terms of the source data. We give a general result of Meyers (or Gehring) type, a result for linear equations with VMO coefficients and a result for linear equations with Dini continuous coefficients. In addition, we provide an abstract theorem conditional on PDE estimates available… ▽ More We provide sparse estimates for gradients of solutions to divergence form elliptic partial differential equations in terms of the source data. We give a general result of Meyers (or Gehring) type, a result for linear equations with VMO coefficients and a result for linear equations with Dini continuous coefficients. In addition, we provide an abstract theorem conditional on PDE estimates available. The linear results have the full range of weighted estimates with Muckenhoupt weights as a consequence. △ Less

Submitted 23 May, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

Comments: v2: writing improved all over and more details added

arXiv:2402.13491 [pdf, other]

Algebraic Riccati Tensor Equations with Applications in Multilinear Control Systems

Authors: Yuchao Wang, Yimin Wei, Guofeng Zhang, Shih Yu Chang

Abstract: In a recent interesting paper [8], Chen et al. initialized the control-theoretic study of a class of discrete-time multilinear time-invariant (MLTI) control systems, where system states, inputs and outputs are all tensors endowed with the Einstein product. Criteria for fundamental system-theoretic notions such as stability, reachability and observability are established by means of tensor decompos… ▽ More In a recent interesting paper [8], Chen et al. initialized the control-theoretic study of a class of discrete-time multilinear time-invariant (MLTI) control systems, where system states, inputs and outputs are all tensors endowed with the Einstein product. Criteria for fundamental system-theoretic notions such as stability, reachability and observability are established by means of tensor decomposition. The purpose of this paper is to continue this novel research direction. Specifically, we focus on continuous-time MLTI control systems. We define Hamiltonian tensors and symplectic tensors and establish the Schur-Hamiltonian tensor decomposition and symplectic tensor singular value decomposition (SVD). Based on these we propose the algebraic Riccati tensor equation (ARTE) and show that it has a unique positive semidefinite solution if the system is stablizable and detectable. A tensor-based Newton method is proposed to find numerical solutions of the ARTE. The tensor version of the bounded real lemma is also established. A first-order robustness analysis of the ARTE is conducted. Finally, a numerical example is used to demonstrate the proposed theory and algorithms. △ Less

Submitted 20 February, 2024; originally announced February 2024.

Comments: 25 pages, 6 figures

MSC Class: 15A69; 93B35; 93C05; 93D15

arXiv:2402.07802 [pdf, ps, other]

Towards a mathematical theory for consistency training in diffusion models

Authors: Gen Li, Zhihan Huang, Yuting Wei

Abstract: Consistency models, which were proposed to mitigate the high computational overhead during the sampling phase of diffusion models, facilitate single-step sampling while attaining state-of-the-art empirical performance. When integrated into the training phase, consistency models attempt to train a sequence of consistency functions capable of map** any point at any time step of the diffusion proce… ▽ More Consistency models, which were proposed to mitigate the high computational overhead during the sampling phase of diffusion models, facilitate single-step sampling while attaining state-of-the-art empirical performance. When integrated into the training phase, consistency models attempt to train a sequence of consistency functions capable of map** any point at any time step of the diffusion process to its starting point. Despite the empirical success, a comprehensive theoretical understanding of consistency training remains elusive. This paper takes a first step towards establishing theoretical underpinnings for consistency models. We demonstrate that, in order to generate samples within $\varepsilon$ proximity to the target in distribution (measured by some Wasserstein metric), it suffices for the number of steps in consistency learning to exceed the order of $d^{5/2}/\varepsilon$, with $d$ the data dimension. Our theory offers rigorous insights into the validity and efficacy of consistency models, illuminating their utility in downstream inference tasks. △ Less

Submitted 12 February, 2024; originally announced February 2024.

Comments: The first two authors contributed equally

arXiv:2401.16836 [pdf, other]

Coseparable Nonnegative Tensor Factorization With T-CUR Decomposition

Authors: Juefei Chen, Longxiu Huang, Yimin Wei

Abstract: Nonnegative Matrix Factorization (NMF) is an important unsupervised learning method to extract meaningful features from data. To address the NMF problem within a polynomial time framework, researchers have introduced a separability assumption, which has recently evolved into the concept of coseparability. This advancement offers a more efficient core representation for the original data. However,… ▽ More Nonnegative Matrix Factorization (NMF) is an important unsupervised learning method to extract meaningful features from data. To address the NMF problem within a polynomial time framework, researchers have introduced a separability assumption, which has recently evolved into the concept of coseparability. This advancement offers a more efficient core representation for the original data. However, in the real world, the data is more natural to be represented as a multi-dimensional array, such as images or videos. The NMF's application to high-dimensional data involves vectorization, which risks losing essential multi-dimensional correlations. To retain these inherent correlations in the data, we turn to tensors (multidimensional arrays) and leverage the tensor t-product. This approach extends the coseparable NMF to the tensor setting, creating what we term coseparable Nonnegative Tensor Factorization (NTF). In this work, we provide an alternating index selection method to select the coseparable core. Furthermore, we validate the t-CUR sampling theory and integrate it with the tensor Discrete Empirical Interpolation Method (t-DEIM) to introduce an alternative, randomized index selection process. These methods have been tested on both synthetic and facial analysis datasets. The results demonstrate the efficiency of coseparable NTF when compared to coseparable NMF. △ Less

Submitted 7 May, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

arXiv:2401.13933 [pdf, ps, other]

Solutions to the First Order Difference Equations in the Multivariate Difference Field

Authors: Lixin Du, Yarong Wei

Abstract: The bivariate difference field provides an algebraic framework for a sequence satisfying a recurrence of order two. Based on this, we focus on sequences satisfying a recurrence of higher order, and consider the multivariate difference field, in which the summation problem could be transformed into solving the first order difference equations. We then show a criterion for deciding whether the diffe… ▽ More The bivariate difference field provides an algebraic framework for a sequence satisfying a recurrence of order two. Based on this, we focus on sequences satisfying a recurrence of higher order, and consider the multivariate difference field, in which the summation problem could be transformed into solving the first order difference equations. We then show a criterion for deciding whether the difference equation has a rational solution and present an algorithm for computing one rational solution of such a difference equation, if it exists. Moreover we get the rational solution set of such an equation. △ Less

Submitted 24 January, 2024; originally announced January 2024.

arXiv:2401.11388 [pdf, ps, other]

Polynomial Solutions to the First Order Difference Equations in the Bivariate Difference Field

Authors: Yarong Wei

Abstract: The bivariate difference filed $(\mathbb{F}(α, β), σ)$ provides an algebraic framework for a sequence satisfying a recurrence of order two and it could transform the summation involving a sequence satisfying a recurrence of order two into the first order difference equations in the bivariate difference field. Based on it, we present an algorithm for finding all the polynomial solutions of such equ… ▽ More The bivariate difference filed $(\mathbb{F}(α, β), σ)$ provides an algebraic framework for a sequence satisfying a recurrence of order two and it could transform the summation involving a sequence satisfying a recurrence of order two into the first order difference equations in the bivariate difference field. Based on it, we present an algorithm for finding all the polynomial solutions of such equations in the bivariate difference field, and show an upper bound on the degree for polynomial solutions which is sufficient to compute polynomial solution by using the undetermined method. △ Less

Submitted 20 January, 2024; originally announced January 2024.

arXiv:2401.11387 [pdf, ps, other]

Rational Solutions to the First Order Difference Equations in the Bivariate Difference Field

Authors: Qing-Hu Hou, Yarong Wei

Abstract: Inspired by Karr's algorithm, we consider the summations involving a sequence satisfying a recurrence of order two. The structure of such summations provides an algebraic framework for solving the difference equations of form $aσ(g)+bg=f$ in the bivariate difference field $(\mathbb{F}(α, β), σ)$, where $a, b,f\in\mathbb{F}(α,β)\setminus\{0\}$ are known binary functions of $α$, $β$, and $α$, $β$ ar… ▽ More Inspired by Karr's algorithm, we consider the summations involving a sequence satisfying a recurrence of order two. The structure of such summations provides an algebraic framework for solving the difference equations of form $aσ(g)+bg=f$ in the bivariate difference field $(\mathbb{F}(α, β), σ)$, where $a, b,f\in\mathbb{F}(α,β)\setminus\{0\}$ are known binary functions of $α$, $β$, and $α$, $β$ are two algebraically independent transcendental elements, $σ$ is a transformation that satisfies $σ(α)=β$, $σ(β)=uα+vβ$, where $u,v\neq 0\in\mathbb{F}$. Based on it, we then describe algorithms for finding the universal denominator for those equations in the bivariate difference field under certain assumptions. This reduces the general problem of finding the rational solutions of such equations to the problem of finding the polynomial solutions of such equations. △ Less

Submitted 20 January, 2024; originally announced January 2024.

arXiv:2401.03923 [pdf, other]

A non-asymptotic distributional theory of approximate message passing for sparse and robust regression

Authors: Gen Li, Yuting Wei

Abstract: Characterizing the distribution of high-dimensional statistical estimators is a challenging task, due to the breakdown of classical asymptotic theory in high dimension. This paper makes progress towards this by develo** non-asymptotic distributional characterizations for approximate message passing (AMP) -- a family of iterative algorithms that prove effective as both fast estimators and powerfu… ▽ More Characterizing the distribution of high-dimensional statistical estimators is a challenging task, due to the breakdown of classical asymptotic theory in high dimension. This paper makes progress towards this by develo** non-asymptotic distributional characterizations for approximate message passing (AMP) -- a family of iterative algorithms that prove effective as both fast estimators and powerful theoretical machinery -- for both sparse and robust regression. Prior AMP theory, which focused on high-dimensional asymptotics for the most part, failed to describe the behavior of AMP when the number of iterations exceeds $o\big({\log n}/{\log \log n}\big)$ (with $n$ the sample size). We establish the first finite-sample non-asymptotic distributional theory of AMP for both sparse and robust regression that accommodates a polynomial number of iterations. Our results derive approximate accuracy of Gaussian approximation of the AMP iterates, which improves upon all prior results and implies enhanced distributional characterizations for both optimally tuned Lasso and robust M-estimator. △ Less

Submitted 8 January, 2024; originally announced January 2024.

arXiv:2401.01423 [pdf, other]

Hadamard integrators for wave equations in time and frequency domain: Eulerian formulations via butterfly algorithms

Authors: Yuxiao Wei, ** Cheng, Shingyu Leung, Robert Burridge, Jianliang Qian

Abstract: Starting from the Kirchhoff-Huygens representation and Duhamel's principle of time-domain wave equations, we propose novel butterfly-compressed Hadamard integrators for self-adjoint wave equations in both time and frequency domain in an inhomogeneous medium. First, we incorporate the leading term of Hadamard's ansatz into the Kirchhoff-Huygens representation to develop a short-time valid propagato… ▽ More Starting from the Kirchhoff-Huygens representation and Duhamel's principle of time-domain wave equations, we propose novel butterfly-compressed Hadamard integrators for self-adjoint wave equations in both time and frequency domain in an inhomogeneous medium. First, we incorporate the leading term of Hadamard's ansatz into the Kirchhoff-Huygens representation to develop a short-time valid propagator. Second, using the Fourier transform in time, we derive the corresponding Eulerian short-time propagator in frequency domain; on top of this propagator, we further develop a time-frequency-time (TFT) method for the Cauchy problem of time-domain wave equations. Third, we further propose the time-frequency-time-frequency (TFTF) method for the corresponding point-source Helmholtz equation, which provides Green's functions of the Helmholtz equation for all angular frequencies within a given frequency band. Fourth, to implement TFT and TFTF methods efficiently, we introduce butterfly algorithms to compress oscillatory integral kernels at different frequencies. As a result, the proposed methods can construct wave field beyond caustics implicitly and advance spatially overturning waves in time naturally with quasi-optimal computational complexity and memory usage. Furthermore, once constructed the Hadamard integrators can be employed to solve both time-domain wave equations with various initial conditions and frequency-domain wave equations with different point sources. Numerical examples for two-dimensional wave equations illustrate the accuracy and efficiency of the proposed methods. △ Less

Submitted 4 June, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

Comments: 34 pages, 16 figures, 4 tables

MSC Class: 65M80; 65Y20

arXiv:2312.16961 [pdf, ps, other]

On connectedness in the parametric geometry of numbers

Authors: Yuming Wei, Han Zhang

Abstract: Via multilinear algebra, we formulate a criterion for connectedness in the parametric geometry of numbers in terms of pencils, which are certain algebraic varieties in the space of matrices. As a consequence, we obtain a connectedness result for generic lattices arising from Diophantine approximation on analytic submanifolds, and sharpen Schmidt and Summerer's results of connectedness on simultane… ▽ More Via multilinear algebra, we formulate a criterion for connectedness in the parametric geometry of numbers in terms of pencils, which are certain algebraic varieties in the space of matrices. As a consequence, we obtain a connectedness result for generic lattices arising from Diophantine approximation on analytic submanifolds, and sharpen Schmidt and Summerer's results of connectedness on simultaneous Diophantine approximation and approximation by linear forms. △ Less

Submitted 9 April, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

Comments: Fix typos and mistakes, fill in more details and correct the error in Lemma 2.2. Main results unchanged

MSC Class: 11H06; 11J13; 37B05

arXiv:2311.17507 [pdf, ps, other]

Computation of outer inverse of tensors based on $t$-product

Authors: Ratikanta Behera, Jajati Keshari Sahoo, Yimin Wei

Abstract: Tensor operations play an essential role in various fields of science and engineering, including multiway data analysis. In this study, we establish a few basic properties of the range and null space of a tensor using block circulant matrices and the discrete Fourier matrix. We then discuss the outer inverse of tensors based on $t$-product with a prescribed range and kernel of third-order tensors.… ▽ More Tensor operations play an essential role in various fields of science and engineering, including multiway data analysis. In this study, we establish a few basic properties of the range and null space of a tensor using block circulant matrices and the discrete Fourier matrix. We then discuss the outer inverse of tensors based on $t$-product with a prescribed range and kernel of third-order tensors. We address the relation of this outer inverse with other generalized inverses, such as the Moore-Penrose inverse, group inverse, and Drazin inverse. In addition, we present a few algorithms for computing the outer inverses of the tensors. In particular, a $t$-QR decomposition based algorithm is developed for computing the outer inverses. △ Less

Submitted 29 November, 2023; originally announced November 2023.

Comments: 21

arXiv:2311.14766 [pdf, other]

Reinforcement Learning from Statistical Feedback: the Journey from AB Testing to ANT Testing

Authors: Feiyang Han, Yimin Wei, Zhaofeng Liu, Yanxing Qi

Abstract: Reinforcement Learning from Human Feedback (RLHF) has played a crucial role in the success of large models such as ChatGPT. RLHF is a reinforcement learning framework which combines human feedback to improve learning effectiveness and performance. However, obtaining preferences feedback manually is quite expensive in commercial applications. Some statistical commercial indicators are usually more… ▽ More Reinforcement Learning from Human Feedback (RLHF) has played a crucial role in the success of large models such as ChatGPT. RLHF is a reinforcement learning framework which combines human feedback to improve learning effectiveness and performance. However, obtaining preferences feedback manually is quite expensive in commercial applications. Some statistical commercial indicators are usually more valuable and always ignored in RLHF. There exists a gap between commercial target and model training. In our research, we will attempt to fill this gap with statistical business feedback instead of human feedback, using AB testing which is a well-established statistical method. Reinforcement Learning from Statistical Feedback (RLSF) based on AB testing is proposed. Statistical inference methods are used to obtain preferences for training the reward network, which fine-tunes the pre-trained model in reinforcement learning framework, achieving greater business value. Furthermore, we extend AB testing with double selections at a single time-point to ANT testing with multiple selections at different feedback time points. Moreover, we design numerical experiences to validate the effectiveness of our algorithm framework. △ Less

Submitted 24 November, 2023; originally announced November 2023.

arXiv:2311.10312 [pdf, ps, other]

Mean Field Games with infinitely degenerate diffusion and non-coercive Hamiltonian

Authors: Yiming Jiang, **gchuang Ren, Yawei Wei, Jie Xue

Abstract: In this paper, we consider a class of infinitely degenerate partial differential systems to obtain the Nash equilibria in the mean field games. The degeneracy in the diffusion and the Hamiltonian may be different. This feature brings difficulties to the uniform boundness of the solutions, which is central to the existence and regularity results. First, from the perspective of the value function in… ▽ More In this paper, we consider a class of infinitely degenerate partial differential systems to obtain the Nash equilibria in the mean field games. The degeneracy in the diffusion and the Hamiltonian may be different. This feature brings difficulties to the uniform boundness of the solutions, which is central to the existence and regularity results. First, from the perspective of the value function in the stochastic optimal control problems, we prove the Lipschitz continuity and the semiconcavity for the solutions of the Hamilton-Jacobi equations (HJE). Then the existence of the weak solutions for the degenerate systems is obtained via a vanishing viscosity method. Furthermore, by constructing an auxiliary function, we conclude the regularity of the viscosity solution for the HJE in the almost everywhere sense. △ Less

Submitted 16 November, 2023; originally announced November 2023.

MSC Class: 35Q89; 35K65; 35A01

arXiv:2310.18915 [pdf, other]

Multi/Single-stage structured zero-gradient-sum approach for prescribed-time optimization

Authors: Shuaiyu Zhou, Yiheng Wei, **de Cao, Yang Liu

Abstract: Prescribed-time convergence mechanism has become a prominent research focus in the current field of optimization and control due to its ability to precisely control the target completion time. The recently arisen prescribed-time algorithms for distributed optimization, currently necessitate multi-stage structures to achieve global convergence. This paper introduces two modified zero-gradient-sum a… ▽ More Prescribed-time convergence mechanism has become a prominent research focus in the current field of optimization and control due to its ability to precisely control the target completion time. The recently arisen prescribed-time algorithms for distributed optimization, currently necessitate multi-stage structures to achieve global convergence. This paper introduces two modified zero-gradient-sum algorithms, each based on a multi-stage and a single-stage structural frameworks established in this work. These algorithms are designed to achieve prescribed-time convergence and relax two common yet stringent conditions. This work also bridges the gap in current research on single-stage structured PTDO algorithm. The excellent convergence performance of the proposed algorithms is validated through a case study. △ Less

Submitted 29 October, 2023; originally announced October 2023.

arXiv:2310.12368 [pdf, ps, other]

doi 10.1016/j.laa.2023.06.006

Isomorphism Classes of Idempotent Evolution Algebras

Authors: Yangjiang Wei, Yi Ming Zou

Abstract: We showed that isomorphism classes of idempotent evolution algebras are in bijection with the orbits of the semidirect product group of the symmetric group and the torus, considered the combinatoric problem of enumeration of isomorphism classes for these algebras over arbitrary finite fields, derived a general counting formula, and obtained explicit formulas for the numbers of isomorphism classes… ▽ More We showed that isomorphism classes of idempotent evolution algebras are in bijection with the orbits of the semidirect product group of the symmetric group and the torus, considered the combinatoric problem of enumeration of isomorphism classes for these algebras over arbitrary finite fields, derived a general counting formula, and obtained explicit formulas for the numbers of isomorphism classes in dimensions 2, 3, and 4 over any finite field. △ Less

Submitted 18 October, 2023; originally announced October 2023.

MSC Class: 17D92; 05A05

Journal ref: LAA 675 (2023)

arXiv:2310.04682 [pdf, other]

Hypergraph Analysis Based on a Compatible Tensor Product Structure

Authors: Jiaqi Gu, Shenghao Feng, Yimin Wei

Abstract: We propose a tensor product structure that is compatible with the hypergraph structure. We define the algebraic connectivity of the $(m+1)$-uniform hypergraph in this product, and prove the relationship with the vertex connectivity. We introduce some connectivity optimization problem into the hypergraph, and solve them with the algebraic connectivity. We introduce the Laplacian eigenmap algorithm… ▽ More We propose a tensor product structure that is compatible with the hypergraph structure. We define the algebraic connectivity of the $(m+1)$-uniform hypergraph in this product, and prove the relationship with the vertex connectivity. We introduce some connectivity optimization problem into the hypergraph, and solve them with the algebraic connectivity. We introduce the Laplacian eigenmap algorithm to the hypergraph under our tensor product. △ Less

Submitted 6 October, 2023; originally announced October 2023.

arXiv:2309.02830 [pdf, ps, other]

The characteristic polynomials of uniform hypercycles with length four

Authors: Cunxiang Duan, Ligong Wang, Yulong Wei

Abstract: Let $C_{m}$ be a cycle with length $m.$ The $k$-uniform hypercycle with length $m$ obtained by adding $k-2$ new vertices in every edge of $C_{m},$ denoted by $C_{m,k}.$ In this paper, we obtain some trace formulas of uniform hypercycles with length four. Moreover, we give the characteristic polynomials of uniform hypercycles with length four. Let $C_{m}$ be a cycle with length $m.$ The $k$-uniform hypercycle with length $m$ obtained by adding $k-2$ new vertices in every edge of $C_{m},$ denoted by $C_{m,k}.$ In this paper, we obtain some trace formulas of uniform hypercycles with length four. Moreover, we give the characteristic polynomials of uniform hypercycles with length four. △ Less

Submitted 6 September, 2023; originally announced September 2023.

arXiv:2308.10434 [pdf, ps, other]

Degenerate Mean Field Games with Hörmander diffusion

Authors: Yiming Jiang, **gchuang Ren, Yawei Wei, Jie Xue

Abstract: In this paper, we study a class of degenerate mean field game systems arising from the mean field games with Hörmander diffusion, where the generic player may have a ``forbidden'' direction at some point. Here we prove the existence and uniqueness of the classical solutions in weighted Hölder spaces for the PDE systems, which describe the Nash equilibria in the games. The degeneracy causes the lac… ▽ More In this paper, we study a class of degenerate mean field game systems arising from the mean field games with Hörmander diffusion, where the generic player may have a ``forbidden'' direction at some point. Here we prove the existence and uniqueness of the classical solutions in weighted Hölder spaces for the PDE systems, which describe the Nash equilibria in the games. The degeneracy causes the lack of commutation of vector fields and the fundamental solution which are the main difficulties in the proof of the global Schauder estimate and the weak maximum principle. Based on the idea of the localizing technique and the local homogeneity of degenerate operators, we extend the maximum regularity result and obtain the global Schauder estimates. For the weak maximum principle, we construct a subsolution instead of the fundamental solution of the degenerate operators. △ Less

Submitted 20 August, 2023; originally announced August 2023.

Comments: We deeply appreciate your consideration of the manuscript. Thank you very much

MSC Class: 35Q89; 35K65; 35A01

arXiv:2308.09232 [pdf, other]

Hadamard integrator for time-dependent wave equations: Lagrangian formulation via ray tracing

Authors: Yuxiao Wei, ** Cheng, Robert Burridge, Jianliang Qian

Abstract: We propose a novel Hadamard integrator for the self-adjoint time-dependent wave equation in an inhomogeneous medium. First, we create a new asymptotic series based on the Gelfand-Shilov function, dubbed Hadamard's ansatz, to approximate the Green's function of the time-dependent wave equation. Second, incorporating the leading term of Hadamard's ansatz into the Kirchhoff-Huygens representation, we… ▽ More We propose a novel Hadamard integrator for the self-adjoint time-dependent wave equation in an inhomogeneous medium. First, we create a new asymptotic series based on the Gelfand-Shilov function, dubbed Hadamard's ansatz, to approximate the Green's function of the time-dependent wave equation. Second, incorporating the leading term of Hadamard's ansatz into the Kirchhoff-Huygens representation, we develop an original Hadamard integrator for the Cauchy problem of the time-dependent wave equation and derive the corresponding Lagrangian formulation in geodesic polar coordinates. Third, to construct the Hadamard integrator in the Lagrangian formulation efficiently, we use a short-time ray tracing method to obtain wavefront locations accurately, and we further develop fast algorithms to compute Chebyshev-polynomial based low-rank representations of both wavefront locations and variants of Hadamard coefficients. Fourth, equipped with these low-rank representations, we apply the Hadamard integrator to efficiently solve time-dependent wave equations with highly oscillatory initial conditions, where the time step size is independent of the initial conditions. By judiciously choosing the medium-dependent time step, our new Hadamard integrator can propagate wave field beyond caustics implicitly and advance spatially overturning waves in time naturally. Moreover, since the integrator is independent of initial conditions, the Hadamard integrator can be applied to many different initial conditions once it is constructed. Both two-dimensional and three-dimensional numerical examples illustrate the accuracy and performance of the proposed method. △ Less

Submitted 25 August, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

Comments: 38 pages, 14 figures, 1 table

arXiv:2306.14591 [pdf, ps, other]

A Heintze-Karcher type inequality in hyperbolic space

Authors: Yingxiang Hu, Yong Wei, Tailong Zhou

Abstract: In this paper, we prove a new Heintze-Karcher type inequality for shifted mean convex hypersurfaces in hyperbolic space. As applications, we prove an Alexandrov type theorem for closed embedded hypersurfaces with constant shifted $k$th mean curvature in hyperbolic space. Furthermore, a uniqueness result for $h$-convex hypersurfaces satisfying certain curvature equations is obtained. In this paper, we prove a new Heintze-Karcher type inequality for shifted mean convex hypersurfaces in hyperbolic space. As applications, we prove an Alexandrov type theorem for closed embedded hypersurfaces with constant shifted $k$th mean curvature in hyperbolic space. Furthermore, a uniqueness result for $h$-convex hypersurfaces satisfying certain curvature equations is obtained. △ Less

Submitted 26 June, 2023; originally announced June 2023.

Comments: 15 pages. All comments are welcome

arXiv:2306.09251 [pdf, ps, other]

Towards Faster Non-Asymptotic Convergence for Diffusion-Based Generative Models

Authors: Gen Li, Yuting Wei, Yuxin Chen, Yuejie Chi

Abstract: Diffusion models, which convert noise into new data instances by learning to reverse a Markov diffusion process, have become a cornerstone in contemporary generative modeling. While their practical power has now been widely recognized, the theoretical underpinnings remain far from mature. In this work, we develop a suite of non-asymptotic theory towards understanding the data generation process of… ▽ More Diffusion models, which convert noise into new data instances by learning to reverse a Markov diffusion process, have become a cornerstone in contemporary generative modeling. While their practical power has now been widely recognized, the theoretical underpinnings remain far from mature. In this work, we develop a suite of non-asymptotic theory towards understanding the data generation process of diffusion models in discrete time, assuming access to $\ell_2$-accurate estimates of the (Stein) score functions. For a popular deterministic sampler (based on the probability flow ODE), we establish a convergence rate proportional to $1/T$ (with $T$ the total number of steps), improving upon past results; for another mainstream stochastic sampler (i.e., a type of the denoising diffusion probabilistic model), we derive a convergence rate proportional to $1/\sqrt{T}$, matching the state-of-the-art theory. Imposing only minimal assumptions on the target data distribution (e.g., no smoothness assumption is imposed), our results characterize how $\ell_2$ score estimation errors affect the quality of the data generation processes. In contrast to prior works, our theory is developed based on an elementary yet versatile non-asymptotic approach without resorting to toolboxes for SDEs and ODEs. Further, we design two accelerated variants, improving the convergence to $1/T^2$ for the ODE-based sampler and $1/T$ for the DDPM-type sampler, which might be of independent theoretical and empirical interest. △ Less

Submitted 6 March, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

Comments: accepted in part to ICLR 2024

arXiv:2306.04240 [pdf, other]

T-ADAF: Adaptive Data Augmentation Framework for Image Classification Network based on Tensor T-product Operator

Authors: Feiyang Han, Yun Miao, Zhaoyi Sun, Yimin Wei

Abstract: Image classification is one of the most fundamental tasks in Computer Vision. In practical applications, the datasets are usually not as abundant as those in the laboratory and simulation, which is always called as Data Hungry. How to extract the information of data more completely and effectively is very important. Therefore, an Adaptive Data Augmentation Framework based on the tensor T-product O… ▽ More Image classification is one of the most fundamental tasks in Computer Vision. In practical applications, the datasets are usually not as abundant as those in the laboratory and simulation, which is always called as Data Hungry. How to extract the information of data more completely and effectively is very important. Therefore, an Adaptive Data Augmentation Framework based on the tensor T-product Operator is proposed in this paper, to triple one image data to be trained and gain the result from all these three images together with only less than 0.1% increase in the number of parameters. At the same time, this framework serves the functions of column image embedding and global feature intersection, enabling the model to obtain information in not only spatial but frequency domain, and thus improving the prediction accuracy of the model. Numerical experiments have been designed for several models, and the results demonstrate the effectiveness of this adaptive framework. Numerical experiments show that our data augmentation framework can improve the performance of original neural network model by 2%, which provides competitive results to state-of-the-art methods. △ Less

Submitted 7 June, 2023; originally announced June 2023.

arXiv:2305.19001 [pdf, other]

High-probability sample complexities for policy evaluation with linear function approximation

Authors: Gen Li, Weichen Wu, Yuejie Chi, Cong Ma, Alessandro Rinaldo, Yuting Wei

Abstract: This paper is concerned with the problem of policy evaluation with linear function approximation in discounted infinite horizon Markov decision processes. We investigate the sample complexities required to guarantee a predefined estimation error of the best linear coefficients for two widely-used policy evaluation algorithms: the temporal difference (TD) learning algorithm and the two-timescale li… ▽ More This paper is concerned with the problem of policy evaluation with linear function approximation in discounted infinite horizon Markov decision processes. We investigate the sample complexities required to guarantee a predefined estimation error of the best linear coefficients for two widely-used policy evaluation algorithms: the temporal difference (TD) learning algorithm and the two-timescale linear TD with gradient correction (TDC) algorithm. In both the on-policy setting, where observations are generated from the target policy, and the off-policy setting, where samples are drawn from a behavior policy potentially different from the target policy, we establish the first sample complexity bound with high-probability convergence guarantee that attains the optimal dependence on the tolerance level. We also exhihit an explicit dependence on problem-related quantities, and show in the on-policy setting that our upper bound matches the minimax lower bound on crucial problem parameters, including the choice of the feature maps and the problem dimension. △ Less

Submitted 2 May, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

Comments: The first two authors contributed equally; paper accepted to IEEE Transactions on Information Theory

arXiv:2305.16589 [pdf, other]

The Curious Price of Distributional Robustness in Reinforcement Learning with a Generative Model

Authors: Laixi Shi, Gen Li, Yuting Wei, Yuxin Chen, Matthieu Geist, Yuejie Chi

Abstract: This paper investigates model robustness in reinforcement learning (RL) to reduce the sim-to-real gap in practice. We adopt the framework of distributionally robust Markov decision processes (RMDPs), aimed at learning a policy that optimizes the worst-case performance when the deployed environment falls within a prescribed uncertainty set around the nominal MDP. Despite recent efforts, the sample… ▽ More This paper investigates model robustness in reinforcement learning (RL) to reduce the sim-to-real gap in practice. We adopt the framework of distributionally robust Markov decision processes (RMDPs), aimed at learning a policy that optimizes the worst-case performance when the deployed environment falls within a prescribed uncertainty set around the nominal MDP. Despite recent efforts, the sample complexity of RMDPs remained mostly unsettled regardless of the uncertainty set in use. It was unclear if distributional robustness bears any statistical consequences when benchmarked against standard RL. Assuming access to a generative model that draws samples based on the nominal MDP, we characterize the sample complexity of RMDPs when the uncertainty set is specified via either the total variation (TV) distance or $χ^2$ divergence. The algorithm studied here is a model-based method called {\em distributionally robust value iteration}, which is shown to be near-optimal for the full range of uncertainty levels. Somewhat surprisingly, our results uncover that RMDPs are not necessarily easier or harder to learn than standard MDPs. The statistical consequence incurred by the robustness requirement depends heavily on the size and shape of the uncertainty set: in the case w.r.t.~the TV distance, the minimax sample complexity of RMDPs is always smaller than that of standard MDPs; in the case w.r.t.~the $χ^2$ divergence, the sample complexity of RMDPs can often far exceed the standard MDP counterpart. △ Less

Submitted 12 April, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

Comments: Neural Information Processing Systems (2023)

arXiv:2304.10052 [pdf, ps, other]

Minimum $Φ$-distance estimators for finite mixing measures

Authors: Yun Wei, Sayan Mukherjee, XuanLong Nguyen

Abstract: Finite mixture models have long been used across a variety of fields in engineering and sciences. Recently there has been a great deal of interest in quantifying the convergence behavior of the mixing measure, a fundamental object that encapsulates all unknown parameters in a mixture distribution. In this paper we propose a general framework for estimating the mixing measure arising in finite mixt… ▽ More Finite mixture models have long been used across a variety of fields in engineering and sciences. Recently there has been a great deal of interest in quantifying the convergence behavior of the mixing measure, a fundamental object that encapsulates all unknown parameters in a mixture distribution. In this paper we propose a general framework for estimating the mixing measure arising in finite mixture models, which we term minimum $Φ$-distance estimators. We establish a general theory for the minimum $Φ$-distance estimator, where sharp probability bounds are obtained on the estimation error for the mixing measures in terms of the suprema of the associated empirical processes for a suitably chosen function class $Φ$. Our framework includes several existing and seemingly distinct estimation methods as special cases but also motivates new estimators. For instance, it extends the minimum Kolmogorov-Smirnov distance estimator to the multivariate setting, and it extends the method of moments to cover a broader family of probability kernels beyond the Gaussian. Moreover, it also includes methods that are applicable to complex (e.g., non-Euclidean) observation domains, using tools from reproducing kernel Hilbert spaces. It will be shown that under general conditions the methods achieve optimal rates of estimation under Wasserstein metrics in either minimax or pointwise sense of convergence; the latter case can be achieved when no upper bound on the finite number of components is given. △ Less

Submitted 19 April, 2023; originally announced April 2023.

Comments: 57 pages

MSC Class: Primary 62H30; 62C20; Secondary 62F12

arXiv:2303.11612 [pdf, other]

Efficient algorithms for Tucker decomposition via approximate matrix multiplication

Authors: Maolin Che, Yimin Wei, Hong Yan

Abstract: This paper develops fast and efficient algorithms for computing Tucker decomposition with a given multilinear rank. By combining random projection and the power scheme, we propose two efficient randomized versions for the truncated high-order singular value decomposition (T-HOSVD) and the sequentially T-HOSVD (ST-HOSVD), which are two common algorithms for approximating Tucker decomposition. To re… ▽ More This paper develops fast and efficient algorithms for computing Tucker decomposition with a given multilinear rank. By combining random projection and the power scheme, we propose two efficient randomized versions for the truncated high-order singular value decomposition (T-HOSVD) and the sequentially T-HOSVD (ST-HOSVD), which are two common algorithms for approximating Tucker decomposition. To reduce the complexities of these two algorithms, fast and efficient algorithms are designed by combining two algorithms and approximate matrix multiplication. The theoretical results are also achieved based on the bounds of singular values of standard Gaussian matrices and the theoretical results for approximate matrix multiplication. Finally, the efficiency of these algorithms are illustrated via some test tensors from synthetic and real datasets. △ Less

Submitted 21 March, 2023; originally announced March 2023.

Comments: 52 pages and 25 figures

arXiv:2303.06958 [pdf, ps, other]

CPQR-based randomized algorithms for generalized CUR decompositions

Authors: Guihua Zhang, Hanyu Li, Yimin Wei

Abstract: Based on the column pivoted QR decomposition, we propose some randomized algorithms including pass-efficient ones for the generalized CUR decompositions of matrix pair and matrix triplet. Detailed error analyses of these algorithms are provided. Numerical experiments are given to test the proposed randomized algorithms. Based on the column pivoted QR decomposition, we propose some randomized algorithms including pass-efficient ones for the generalized CUR decompositions of matrix pair and matrix triplet. Detailed error analyses of these algorithms are provided. Numerical experiments are given to test the proposed randomized algorithms. △ Less

Submitted 13 March, 2023; originally announced March 2023.

arXiv:2303.01383 [pdf, other]

Singular Value Decomposition of Dual Matrices and its Application to Traveling Wave Identification in the Brain

Authors: Tong Wei, Weiyang Ding, Yimin Wei

Abstract: Matrix factorizations in dual number algebra, a hypercomplex system, have been applied to kinematics, mechanisms, and other fields recently. We develop an approach to identify spatiotemporal patterns in the brain such as traveling waves using the singular value decomposition of dual matrices in this paper. Theoretically, we propose the compact dual singular value decomposition (CDSVD) of dual comp… ▽ More Matrix factorizations in dual number algebra, a hypercomplex system, have been applied to kinematics, mechanisms, and other fields recently. We develop an approach to identify spatiotemporal patterns in the brain such as traveling waves using the singular value decomposition of dual matrices in this paper. Theoretically, we propose the compact dual singular value decomposition (CDSVD) of dual complex matrices with explicit expressions as well as a necessary and sufficient condition for its existence. Furthermore, based on the CDSVD, we report on the optimal solution to the best rank-$k$ approximation under a newly defined quasi-metric in the dual complex number system. The CDSVD is also related to the dual Moore-Penrose generalized inverse. Numerically, comparisons with other available algorithms are conducted, which indicate less computational costs of our proposed CDSVD. In addition, the infinitesimal part of the CDSVD can identify the true rank of the original matrix from the noise-added matrix, but the classical SVD cannot. Next, we employ experiments on simulated time-series data and a road monitoring video to demonstrate the beneficial effect of the infinitesimal parts of dual matrices in spatiotemporal pattern identification. Finally, we apply this approach to the large-scale brain fMRI data, identify three kinds of traveling waves, and further validate the consistency between our analytical results and the current knowledge of cerebral cortex function. △ Less

Submitted 17 August, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

arXiv:2303.01327 [pdf, ps, other]

Noda Iteration for Computing Generalized Tensor Eigenpairs

Authors: Wanli Ma, Weiyang Ding, Yimin Wei

Abstract: In this paper, we propose the tensor Noda iteration (NI) and its inexact version for solving the eigenvalue problem of a particular class of tensor pairs called generalized $\mathcal{M}$-tensor pairs. A generalized $\mathcal{M}$-tensor pair consists of a weakly irreducible nonnegative tensor and a nonsingular $\mathcal{M}$-tensor within a linear combination. It is shown that any generalized… ▽ More In this paper, we propose the tensor Noda iteration (NI) and its inexact version for solving the eigenvalue problem of a particular class of tensor pairs called generalized $\mathcal{M}$-tensor pairs. A generalized $\mathcal{M}$-tensor pair consists of a weakly irreducible nonnegative tensor and a nonsingular $\mathcal{M}$-tensor within a linear combination. It is shown that any generalized $\mathcal{M}$-tensor pair admits a unique positive generalized eigenvalue with a positive eigenvector. A modified tensor Noda iteration(MTNI) is developed for extending the Noda iteration for nonnegative matrix eigenproblems. In addition, the inexact generalized tensor Noda iteration method (IGTNI) and the generalized Newton-Noda iteration method (GNNI) are also introduced for more efficient implementations and faster convergence. Under a mild assumption on the initial values, the convergence of these algorithms is guaranteed. The efficiency of these algorithms is illustrated by numerical experiments. △ Less

Submitted 2 March, 2023; originally announced March 2023.

Comments: 45 pages, 6 figures

MSC Class: 65Fxx ACM Class: G.1.3

arXiv:2303.00930 [pdf, ps, other]

doi 10.1016/j.aim.2023.109213

Geometric inequalities involving three quantities in warped product manifolds

Authors: Kwok-Kun Kwong, Yong Wei

Abstract: In this paper, we establish two families of sharp geometric inequalities for closed hypersurfaces in space forms or other warped product manifolds. Both families of inequalities compare three distinct geometric quantities. The first family concerns the $k$-th boundary momentum, area, and weighted volume, and has applications to Weinstock-type inequalities for Steklov or Wentzell eigenvalues on sta… ▽ More In this paper, we establish two families of sharp geometric inequalities for closed hypersurfaces in space forms or other warped product manifolds. Both families of inequalities compare three distinct geometric quantities. The first family concerns the $k$-th boundary momentum, area, and weighted volume, and has applications to Weinstock-type inequalities for Steklov or Wentzell eigenvalues on star-shaped mean convex domains. This generalizes the main results of [12]. The second family involves a weighted $k$-th mean curvature integral and two distinct quermassintegrals and extends the authors' recent work [33] with G. Wheeler and V.-M. Wheeler. △ Less

Submitted 1 March, 2023; originally announced March 2023.

Comments: 21 pages

MSC Class: 53E10; 53C42

Journal ref: Advances in Mathematics, Vol. 430, 1 Oct 2023, article no. 109213

arXiv:2302.06855 [pdf, other]

Splitting Method for Support Vector Machine in Reproducing Kernel Banach Space with Lower Semi-continuous Loss Function

Authors: Mingyu Mo, Yimin Wei, Qi Ye

Abstract: In this paper, we use the splitting method to solve support vector machine in reproducing kernel Banach space with lower semi-continuous loss function. We equivalently transfer support vector machines in reproducing kernel Banach space with lower semi-continuous loss function to a finite-dimensional tensor Optimization and propose the splitting method based on alternating direction method of multi… ▽ More In this paper, we use the splitting method to solve support vector machine in reproducing kernel Banach space with lower semi-continuous loss function. We equivalently transfer support vector machines in reproducing kernel Banach space with lower semi-continuous loss function to a finite-dimensional tensor Optimization and propose the splitting method based on alternating direction method of multipliers. By Kurdyka-Lojasiewicz inequality, the iterative sequence obtained by this splitting method is globally convergent to a stationary point if the loss function is lower semi-continuous and subanalytic. Finally, several numerical performances demonstrate the effectiveness. △ Less

Submitted 14 February, 2023; originally announced February 2023.

Comments: arXiv admin note: text overlap with arXiv:2208.12522

arXiv:2302.06080 [pdf, ps, other]

On the g$π$-Hirano invertibility in Banach algebras

Authors: Honglin Zou, Tingting Li, Yujie Wei

Abstract: In a Banach algebra, we introduce a new type of generalized inverse called g$π$-Hirano inverse. Firstly, several existence criteria and the equivalent definition of this inverse are investigated. Then, we discuss the relationship between the g$π$-Hirano invertibility of $a$, $b$ and that of the sum $a+b$ under some weaker conditions. Finally, as applications to the previous additive results, some… ▽ More In a Banach algebra, we introduce a new type of generalized inverse called g$π$-Hirano inverse. Firstly, several existence criteria and the equivalent definition of this inverse are investigated. Then, we discuss the relationship between the g$π$-Hirano invertibility of $a$, $b$ and that of the sum $a+b$ under some weaker conditions. Finally, as applications to the previous additive results, some equivalent characterizations for the g$π$-Hirano invertibility of the anti-triangular matrix over Banach algebras are obtained.In particular, some results in this paper are different from the corresponding ones of classical generalized inverses, such as Drazin inverse and generalized Drazin inverse. △ Less

Submitted 12 February, 2023; originally announced February 2023.

MSC Class: 15A09; 32A65; 47A10

arXiv:2302.04674 [pdf, other]

On some analytic properties of nabla tempered fractional calculus

Authors: Yiheng Wei, Linlin Zhao, Xuan Zhao, **de Cao

Abstract: Despite many applications regarding fractional calculus have been reported in literature, it is still unknown how to model some practical process. One major challenge in solving such a problem is that, the nonlocal property is needed while the infinite memory is undesired. Under this context, a new kind nabla fractional calculus accompanied by a tempered function is formulated. However, many prope… ▽ More Despite many applications regarding fractional calculus have been reported in literature, it is still unknown how to model some practical process. One major challenge in solving such a problem is that, the nonlocal property is needed while the infinite memory is undesired. Under this context, a new kind nabla fractional calculus accompanied by a tempered function is formulated. However, many properties of such fractional calculus needed to be discovered. From this, this paper gives particular emphasis to the topic. Some remarkable properties like the equivalence relation, the nabla Taylor formula, and the nabla Laplace transform for such nabla fractional calculus are developed and analyzed. It is believed that this work greatly enriches the mathematical theory of nabla tempered fractional calculus and provides high value and huge potential for further applications. △ Less

Submitted 7 February, 2023; originally announced February 2023.

Comments: 22 pages, 4 figures

arXiv:2302.03682 [pdf, other]

Approximate message passing from random initialization with applications to $\mathbb{Z}_{2}$ synchronization

Authors: Gen Li, Wei Fan, Yuting Wei

Abstract: This paper is concerned with the problem of reconstructing an unknown rank-one matrix with prior structural information from noisy observations. While computing the Bayes-optimal estimator seems intractable in general due to its nonconvex nature, Approximate Message Passing (AMP) emerges as an efficient first-order method to approximate the Bayes-optimal estimator. However, the theoretical underpi… ▽ More This paper is concerned with the problem of reconstructing an unknown rank-one matrix with prior structural information from noisy observations. While computing the Bayes-optimal estimator seems intractable in general due to its nonconvex nature, Approximate Message Passing (AMP) emerges as an efficient first-order method to approximate the Bayes-optimal estimator. However, the theoretical underpinnings of AMP remain largely unavailable when it starts from random initialization, a scheme of critical practical utility. Focusing on a prototypical model called $\mathbb{Z}_{2}$ synchronization, we characterize the finite-sample dynamics of AMP from random initialization, uncovering its rapid global convergence. Our theory provides the first non-asymptotic characterization of AMP in this model without requiring either an informative initialization (e.g., spectral initialization) or sample splitting. △ Less

Submitted 7 February, 2023; originally announced February 2023.

arXiv:2301.13163 [pdf, other]

Randomized GCUR decompositions

Authors: Zhengbang Cao, Yimin Wei, Pengpeng Xie

Abstract: By exploiting the random sampling techniques, this paper derives an efficient randomized algorithm for computing a generalized CUR decomposition, which provides low-rank approximations of both matrices simultaneously in terms of some of their rows and columns. For large-scale data sets that are expensive to store and manipulate, a new variant of the discrete empirical interpolation method known as… ▽ More By exploiting the random sampling techniques, this paper derives an efficient randomized algorithm for computing a generalized CUR decomposition, which provides low-rank approximations of both matrices simultaneously in terms of some of their rows and columns. For large-scale data sets that are expensive to store and manipulate, a new variant of the discrete empirical interpolation method known as L-DEIM, which needs much lower cost and provides a significant acceleration in practice, is also combined with the random sampling approach to further improve the efficiency of our algorithm. Moreover, adopting the randomized algorithm to implement the truncation process of restricted singular value decomposition (RSVD), combined with the L-DEIM procedure, we propose a fast algorithm for computing an RSVD based CUR decomposition, which provides a coordinated low-rank approximation of the three matrices in a CUR-type format simultaneously and provides advantages over the standard CUR approximation for some applications. We establish detailed probabilistic error analysis for the algorithms and provide numerical results that show the promise of our approaches. △ Less

Submitted 5 April, 2023; v1 submitted 16 January, 2023; originally announced January 2023.

MSC Class: 65F55; 15A23

arXiv:2301.06500 [pdf, other]

Factorization of certain Macdonald Littlewood-Richardson coefficients

Authors: Konstantin Matveev, Yuchen Wei

Abstract: We find and prove a factorization formula for certain Macdonald Littlewood-Richardson coefficients $c_{λμ}^ν(q,t)$. Namely, we consider the case that the Kostka number $K_{μ, ν-λ}$ is $1$. This settles a particular case of a more general conjecture of Richard Stanely. This conjecture proposes that a factorization formula exists whenever the corresponding regular Littlewood-Richardson coefficient… ▽ More We find and prove a factorization formula for certain Macdonald Littlewood-Richardson coefficients $c_{λμ}^ν(q,t)$. Namely, we consider the case that the Kostka number $K_{μ, ν-λ}$ is $1$. This settles a particular case of a more general conjecture of Richard Stanely. This conjecture proposes that a factorization formula exists whenever the corresponding regular Littlewood-Richardson coefficient $c^ν_{λμ}$ is $1$. △ Less

Submitted 16 January, 2023; originally announced January 2023.

arXiv:2301.00581 [pdf, ps, other]

Bent Partitions, Vectorial Dual-Bent Functions and Partial Difference Sets

Authors: Jiaxin Wang, Fang-Wei Fu, Yadi Wei

Abstract: It is known that partial spreads is a class of bent partitions. In \cite{AM2022Be,MP2021Be}, two classes of bent partitions whose forms are similar to partial spreads were presented. In \cite{AKM2022Ge}, more bent partitions $Γ_{1}, Γ_{2}, Γ_{1}^{\bullet}, Γ_{2}^{\bullet}, Θ_{1}, Θ_{2}$ were presented from (pre)semifields, including the bent partitions given in \cite{AM2022Be,MP2021Be}. In this pa… ▽ More It is known that partial spreads is a class of bent partitions. In \cite{AM2022Be,MP2021Be}, two classes of bent partitions whose forms are similar to partial spreads were presented. In \cite{AKM2022Ge}, more bent partitions $Γ_{1}, Γ_{2}, Γ_{1}^{\bullet}, Γ_{2}^{\bullet}, Θ_{1}, Θ_{2}$ were presented from (pre)semifields, including the bent partitions given in \cite{AM2022Be,MP2021Be}. In this paper, we investigate the relations between bent partitions and vectorial dual-bent functions. For any prime $p$, we show that one can generate certain bent partitions (called bent partitions satisfying Condition $\mathcal{C}$) from certain vectorial dual-bent functions (called vectorial dual-bent functions satisfying Condition A). In particular, when $p$ is an odd prime, we show that bent partitions satisfying Condition $\mathcal{C}$ one-to-one correspond to vectorial dual-bent functions satisfying Condition A. We give an alternative proof that $Γ_{1}, Γ_{2}, Γ_{1}^{\bullet}, Γ_{2}^{\bullet}, Θ_{1}, Θ_{2}$ are bent partitions. We present a secondary construction of vectorial dual-bent functions, which can be used to generate more bent partitions. We show that any ternary weakly regular bent function $f: V_{n}^{(3)}\rightarrow \mathbb{F}_{3}$ ($n$ even) of $2$-form can generate a bent partition. When such $f$ is weakly regular but not regular, the generated bent partition by $f$ is not coming from a normal bent partition, which answers an open problem proposed in \cite{AM2022Be}. We give a sufficient condition on constructing partial difference sets from bent partitions, and when $p$ is an odd prime, we provide a characterization of bent partitions satisfying Condition $\mathcal{C}$ in terms of partial difference sets. △ Less

Submitted 2 January, 2023; originally announced January 2023.

Showing 1–50 of 186 results for author: Wei, Y