A Random Integration Algorithm for High-dimensional Function Spaces

Authors: Liang Chen, Minqiang Xu, Haizhang Zhang

Abstract: We introduce a novel random integration algorithm that boasts both high convergence order and polynomial tractability for functions characterized by sparse frequencies or rapidly decaying Fourier coefficients. Specifically, for integration in the periodic isotropic Sobolev space and the isotropic Sobolev space with compact support, our approach attains a nearly optimal root mean square error (RMSE) bound. In contrast to previous nearly optimal algorithms, our method exhibits polynomial tractability, ensuring that the number of samples does not scale exponentially with increasing dimensions. Our integration algorithm also enjoys a nearly optimal bound for the weighted Korobov space. Furthermore, the algorithm can be applied without the need for prior knowledge of weights, distinguishing it from the component-by-component algorithm. For integration in the Wiener algebra, the sample complexity of our algorithm is independent of the decay rate of Fourier coefficients. The effectiveness of the integration is confirmed through numerical experiments.

Submitted 24 June, 2024; originally announced June 2024.

arXiv:2406.13798 [pdf, ps, other]

Aubin Property and Strong Regularity Are Equivalent for Nonlinear Second-Order Cone Programming

Authors: Liang Chen, Ruoning Chen, Defeng Sun, Junyuan Zhu

Abstract: This paper solves a fundamental open problem in variational analysis on the equivalence between the Aubin property and the strong regularity for nonlinear second-order cone programming (SOCP) at a locally optimal solution. We achieve this by introducing a reduction approach to the Aubin property characterized by the Mordukhovich criterion and a lemma of alternative choices on cones to replace the S-lemma used in Outrata and Ramírez [SIAM J. Optim. 21 (2011) 789-823] and Opazo, Outrata, and Ramírez [SIAM J. Optim. 27 (2017) 2141-2151], where the same SOCP was considered under the strict complementarity condition except for possibly only one block of constraints. As a byproduct, we also offer a new approach to the well-known result of Dontchev and Rockafellar [SIAM J. Optim. 6 (1996) 1087-1105] on the equivalence of the two concepts in conventional nonlinear programming.

Submitted 19 June, 2024; originally announced June 2024.

MSC Class: 90C; 90C31; 90C46

arXiv:2406.09772 [pdf, other]

Accelerated Over-Relaxation Heavy-Ball Methods with Provable Acceleration and Global Convergence

Authors: Jingrong Wei, Long Chen

Abstract: The heavy-ball momentum method has gained widespread popularity for accelerating gradient descent by incorporating a momentum term. Recent studies have conclusively shown that the heavy-ball method cannot achieve an accelerated convergence rate for general smooth strongly convex optimization problems. This work introduces the Accelerated Over-Relaxation Heavy-Ball (AOR-HB) method, a novel approach that represents the first heavy-ball method to demonstrate provable global and accelerated convergence for smooth strongly convex optimization. The key innovation of the AOR-HB method lies in the application of an over-relaxation technique to the gradient term. This novel approach enables the method to be applied to min-max problems and meet optimal lower complexity bounds. This breakthrough addresses a long-standing theoretical gap in heavy-ball momentum methods and paves the way for developing accelerated methods that transcend the boundaries of convex optimization to non-convex optimization. Numerical experiments validate the effectiveness of the proposed algorithms, with their performance matching that of other leading first-order optimization methods.

Submitted 14 June, 2024; originally announced June 2024.
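
For orientation, the classical (Polyak) heavy-ball iteration that the abstract above takes as its starting point can be sketched as follows. This is not the AOR-HB method, whose over-relaxed gradient term is not specified in the abstract. The quadratic test function, the constants `MU` and `L_SMOOTH`, and the function names are assumptions chosen for illustration; the step size and momentum are the textbook choices for quadratics.

```python
def heavy_ball(grad, x0, step, momentum, iters):
    """Classical heavy-ball: x_{k+1} = x_k - step * grad(x_k) + momentum * (x_k - x_{k-1})."""
    x_prev = list(x0)
    x = list(x0)
    for _ in range(iters):
        g = grad(x)
        x_next = [xi - step * gi + momentum * (xi - xpi)
                  for xi, gi, xpi in zip(x, g, x_prev)]
        x_prev, x = x, x_next
    return x


# Hypothetical test problem: f(x) = 0.5 * (MU * x[0]**2 + L_SMOOTH * x[1]**2).
MU, L_SMOOTH = 1.0, 100.0


def quad_grad(x):
    return [MU * x[0], L_SMOOTH * x[1]]


# Textbook heavy-ball parameters for a quadratic with curvature in [MU, L_SMOOTH].
sqrt_l, sqrt_mu = L_SMOOTH ** 0.5, MU ** 0.5
step = 4.0 / (sqrt_l + sqrt_mu) ** 2
momentum = ((sqrt_l - sqrt_mu) / (sqrt_l + sqrt_mu)) ** 2

x_final = heavy_ball(quad_grad, [1.0, 1.0], step, momentum, 200)
```

On this quadratic the iterates contract toward the minimizer at the origin; the negative result cited in the abstract concerns general smooth strongly convex functions, where such a parameter choice carries no comparable guarantee.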

arXiv:2406.09449 [pdf, ps, other]

Smooth solutions to the Christoffel problem in $\mathbb{H}^{n+1}$

Authors: Li Chen

Abstract: The famous Christoffel problem is possibly the oldest problem of prescribed curvatures for convex hypersurfaces in Euclidean space. Recently, this problem has been naturally formulated in the context of uniformly $h$-convex hypersurfaces in hyperbolic space by Espinar-Gálvez-Mira. Surprisingly, Espinar-Gálvez-Mira find that the Christoffel problem in hyperbolic space is essentially equivalent to the Nirenberg-Kazdan-Warner problem on prescribing scalar curvature on $\mathbb{S}^n$. This equivalence opens a new door to study the Nirenberg-Kazdan-Warner problem. In this paper, we establish the existence of solutions to the Christoffel problem in hyperbolic space by proving a full rank theorem. As a corollary, the existence of solutions to the Nirenberg-Kazdan-Warner problem follows.

Submitted 12 June, 2024; originally announced June 2024.

Comments: 22 pages. arXiv admin note: substantial text overlap with arXiv:2302.01604

arXiv:2406.08306 [pdf, ps, other]

2-dimensional Ricci limit spaces

Authors: Lina Chen

Abstract: In this note, we show that if a measured Gromov-Hausdorff limit space of a sequence of Riemannian manifolds with a lower Ricci curvature bound has a dense 2-regular set, then it is homeomorphic to a 2-dimensional manifold in an open set of full measure. This result gives a positive answer to an open problem in [Naber, Open problem 3.4] in dimension 2; for dimensions larger than 2 there are counterexamples by [HNW, Zhou].

Submitted 12 June, 2024; originally announced June 2024.

arXiv:2406.07655 [pdf, ps, other]

Analogues of Alder-Type Partition Inequalities for Fixed Perimeter Partitions

Authors: Ling Chen, Isabelle Hernandez, Zain Shields, Holly Swisher

Abstract: In a 2016 paper, Straub proved an analogue to Euler's partition identity for partitions with fixed perimeter. Later, Fu and Tang provided a refinement and generalization of Straub's analogue to $d$-distinct partitions as well as a result related to the first Rogers-Ramanujan identity. Motivated by Alder-type partition identities and their generalizations, we build on work of Fu and Tang to establish generalized Alder-type partition inequalities in a fixed perimeter setting, and notably, a reverse Alder-type inequality.

Submitted 11 June, 2024; originally announced June 2024.

Comments: 13 pages

arXiv:2406.07552 [pdf, ps, other]

Cohomology of a restricted Lie algebra with a restricted derivation in characteristic 2

Authors: Dan Mao, Liangyun Chen

Abstract: This paper mainly studies the ResLieDer pair in characteristic 2, that is, a restricted Lie algebra with a restricted derivation. We define the restricted representation of a ResLieDer pair and the corresponding cohomology complex. We show that a ResLieDer pair is rigid if the second cohomology group is trivial and a deformation of order $n$ is extensible if and only if its obstruction class is trivial. Moreover, we prove that the central extensions of a ResLieDer pair are classified by the second cohomology group. Finally, we show that a pair of restricted derivations is extensible if and only if its obstruction class is trivial.

Submitted 12 February, 2024; originally announced June 2024.

Comments: 26 pages

arXiv:2406.00743 [pdf, ps, other]

Quantization property of n-Laplacian mean field equation and sharp Moser-Onofri inequality

Authors: Lu Chen, Guozhen Lu, Bohan Wang

Abstract: In this paper, we are concerned with the following $n$-Laplacian mean field equation \[ \begin{cases} -\Delta_n u = \lambda e^u & \text{in } \Omega, \\ u = 0 & \text{on } \partial\Omega, \end{cases} \] where $\Omega$ is a smooth bounded domain of $\mathbb{R}^n$ $(n\geq 2)$ and $-\Delta_n u = -{\rm div}(|\nabla u|^{n-2}\nabla u)$. We first establish the quantization property of solutions to the above $n$-Laplacian mean field equation. As an application, combining the Pohozaev identity and the capacity estimate, we obtain the sharp constant $C(n)$ of the Moser-Onofri inequality in the $n$-dimensional unit ball $B^n:=B^n(0,1)$, $$\inf_{u \in W_0^{1,n}(B^n)} \frac{1}{n C_n}\int_{B^n} |\nabla u|^n \, dx - \ln \int_{B^n} e^u \, dx \geq C(n),$$ which extends the result of Caglioti-Lions-Marchioro-Pulvirenti in \cite{Caglioti} to the case of the $n$-dimensional ball. Here $C_n=(\frac{n^2}{n-1})^{n-1} \omega_{n-1}$ and $\omega_{n-1}$ is the surface measure of $B^n$. For the Moser-Onofri inequality in a general bounded domain of $\mathbb{R}^n$, we apply the technique of $n$-harmonic transplantation to give the optimal concentration level of the Moser-Onofri inequality and obtain the criterion for the existence and non-existence of extremals for the Moser-Onofri inequality.

Submitted 2 June, 2024; originally announced June 2024.

arXiv:2405.19637 [pdf, other]

Inference in semiparametric formation models for directed networks

Authors: Lianqiang Qu, Lu Chen, Ting Yan, Yuguo Chen

Abstract: We propose a semiparametric model for dyadic link formations in directed networks. The model contains a set of degree parameters that measure different effects of popularity or outgoingness across nodes, a regression parameter vector that reflects the homophily effect resulting from the nodal attributes or pairwise covariates associated with edges, and a set of latent random noises with unknown distributions. Our interest lies in inferring the unknown degree parameters and homophily parameters. The dimension of the degree parameters increases with the number of nodes. Under the high-dimensional regime, we develop a kernel-based least squares approach to estimate the unknown parameters. The major advantage of our estimator is that it does not encounter the incidental parameter problem for the homophily parameters. We prove consistency of all the resulting estimators of the degree parameters and homophily parameters. We establish high-dimensional central limit theorems for the proposed estimators and provide several applications of our general theory, including testing the existence of degree heterogeneity, testing sparse signals and recovering the support. Simulation studies and a real data application are conducted to illustrate the finite sample performance of the proposed methods.

Submitted 29 May, 2024; originally announced May 2024.

Comments: 28 pages, 3 figures

arXiv:2405.17727 [pdf, ps, other]

Optimal stability of Hardy-Littlewood-Sobolev and Sobolev inequalities of arbitrary orders with dimension-dependent constants

Authors: Lu Chen, Guozhen Lu, Hanli Tang

Abstract: Dolbeault-Esteban-Figalli-Frank-Loss [19] and Chen-Lu-Tang [17] established the optimal asymptotic lower bound for stability of the first-order Sobolev inequality and the fractional Sobolev inequality of order $s$ for $0<s<1$, respectively. However, this left unsolved the problem of the optimal lower bound for stability of the high-order Sobolev inequality and the high-order fractional Sobolev inequality. The purpose of this paper is to solve this problem. The main difficulty lies in establishing the optimal asymptotic behavior for the local stability of the Sobolev inequality for all $0<s<n/2$. The proof of the local stability when $0<s\leq 1$ relies on "cuttings" at various heights, and this helps to split the $L^2$ integral of the first-order derivative or the fractional derivative of order $0<s<1$. However, this approach does not seem to work for $1<s<n/2$. In order to overcome this difficulty, we directly establish the local stability for the HLS inequality with the optimal asymptotic lower bounds. To achieve our goal, we develop a new strategy based on the $H^{-s}$-decomposition instead of the $L^{\frac{2n}{n+2s}}$-decomposition to obtain the local stability of the HLS inequality with $L^{\frac{2n}{n+2s}}$-distance. This kind of "new local stability" also brings more difficulties to using the rearrangement flow to deduce the global stability from local stability because of the non-uniqueness of $\|r\|_{\frac{2n}{n+2s}}$ and non-continuity of the $\|r\|_{\frac{2n}{n+2s}}$ norm for the rearrangement flow. We establish the norm comparison theorem for $\|r\|_{\frac{2n}{n+2s}}$ and a "new continuity" theorem for the rearrangement flow to overcome this difficulty (see Lemma 3.1, Lemma 3.3 and Lemma 3.5).

Submitted 27 May, 2024; originally announced May 2024.

Comments: 38 pages

arXiv:2405.15128 [pdf, ps, other]

Fluctuations around the mean-field limit for attractive Riesz potentials in the moderate regime

Authors: Li Chen, Alexandra Holzinger, Ansgar Jüngel

Abstract: A central limit theorem is shown for moderately interacting particles in the whole space. The interaction potential approximates singular attractive or repulsive potentials of sub-Coulomb type. It is proved that the fluctuations become asymptotically Gaussian in the limit of infinitely many particles. The methodology is inspired by the classical work of Oelschläger on fluctuations for the porous-medium equation. The novelty in this work is that we can allow for attractive potentials in the moderate regime and still obtain asymptotic Gaussian fluctuations. The key element of the proof is the mean-square convergence in expectation for smoothed empirical measures associated to moderately interacting $N$-particle systems with rate $N^{-1/2-\varepsilon}$ for some $\varepsilon>0$. To allow for attractive potentials, the proof uses a quantitative mean-field convergence in probability with any algebraic rate and a law-of-large-numbers estimate as well as a systematic separation of the terms to be estimated in a mean-field part and a law-of-large-numbers part.

Submitted 23 May, 2024; originally announced May 2024.

MSC Class: 35Q70; 35Q92; 60J70; 82C22

arXiv:2405.12134 [pdf, other]

Two-dimensional signal-dependent parabolic-elliptic Keller-Segel system and its mean field derivation

Authors: Lukas Bol, Li Chen, Yue Li

Abstract: In this paper, the well-posedness of the two-dimensional signal-dependent Keller-Segel system and its mean field derivation from an interacting particle system on the whole space are investigated. The signal dependence effect is reflected by the fact that the diffusion coefficient in the particle system depends nonlinearly on the interactions between the individuals. Therefore, the mathematical challenge in studying the well-posedness of this system lies in the possible degeneracy and the aggregation effect when the concentration of signal becomes unbounded. The well-established method on bounded domains for obtaining the appropriate estimates for the signal concentration is invalid for the whole space case. Motivated by the entropy minimization method and Onofri's inequality, which have been successfully applied to the parabolic-parabolic Keller-Segel system, we establish a complete entropy estimate that benefits from the linear diffusion term and plays an important role in obtaining the $L^p$ estimates for the solution. Furthermore, the upper bound for the concentration of signal is obtained. Based on the estimates we obtain for the density of cells, the rigorous mean-field derivation is proved by introducing an intermediate particle system with a mollified interaction potential with logarithmic scaling. By using this mollification, we obtain the convergence of the particle trajectories in expectation, which implies the weak propagation of chaos. Additionally, under a regularity assumption on the cell density, we derive the strong $L^1$ convergence for the propagation of chaos by using the relative entropy method.

Submitted 20 May, 2024; originally announced May 2024.

arXiv:2405.07537 [pdf, other]

Probabilistic Rounding Error Analysis From A Statistical Perspective

Authors: Yiming Fang, Li Chen

Abstract: The conventional probabilistic rounding error analysis in numerical linear algebra provides worst-case bounds with an associated failure probability, which can still be pessimistic. In this paper, we develop a new probabilistic rounding error analysis from a statistical perspective. By assuming both the data and the relative error are independent random variables, we derive the approximate closed-form expressions for the expectation and variance of the rounding errors in various key computational kernels. Our analytical expressions have three notable characteristics: they are statistical and do not involve a failure probability; they are sharper than other deterministic and probabilistic bounds, using mean square error as the metric; they are correct to all orders of unit roundoff and valid for any dimension. Furthermore, numerical experiments validate the accuracy of our derivations and demonstrate that our analytical expressions are generally at least two orders of magnitude tighter than alternative worst-case bounds, exemplified through the inner products. We also discuss a scenario involving inner products where the underlying assumptions are invalid, i.e., input data are dependent, rendering the analytical expressions inapplicable.

Submitted 13 May, 2024; originally announced May 2024.

Comments: 24 pages, 7 figures. Submitted to SIAM for possible publication

arXiv:2405.03648 [pdf, ps, other]

Proof of the geometric Langlands conjecture II: Kac-Moody localization and the FLE

Authors: D. Arinkin, D. Beraldo, J. Campbell, L. Chen, J. Faergeman, D. Gaitsgory, K. Lin, S. Raskin, N. Rozenblyum

Abstract: This paper is the second in a series of five that together prove the geometric Langlands conjecture. Our goals are two-fold: (1) Formulate and prove the Fundamental Local Equivalence (FLE) at the critical level; (2) Study the interaction between Kac-Moody localization and the global geometric Langlands functor of ref. [GLC1]. This paper contains an extensive Appendix, whose primary goals are: (a) Development of the theory of ind-coherent sheaves in infinite type; (b) Development of the formalism of factorization categories.

Submitted 23 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

arXiv:2405.00545 [pdf, other]

A Double Maximization Approach for Optimizing the LM Rate of Mismatched Decoding

Authors: Lingyi Chen, Shitong Wu, Xinwei Li, Huihui Wu, Hao Wu, Wenyi Zhang

Abstract: An approach is established for maximizing the Lower bound on the Mismatch capacity (hereafter abbreviated as LM rate), a key performance bound in mismatched decoding, by optimizing the channel input probability distribution. Under a fixed channel input probability distribution, the computation of the corresponding LM rate is a convex optimization problem. When optimizing the channel input probability distribution, however, the corresponding optimization problem adopts a max-min formulation, which is generally non-convex and is intractable with standard approaches. To solve this problem, a novel dual form of the LM rate is proposed, thereby transforming the max-min formulation into an equivalent double maximization formulation. This new formulation leads to a maximization problem setup wherein each individual optimization direction is convex. Consequently, an alternating maximization algorithm is established to solve the resultant maximization problem setup. Each step of the algorithm only involves a closed-form iteration, which is efficiently implemented with standard optimization procedures. Numerical experiments show the proposed approach for optimizing the LM rate leads to noticeable rate gains.

Submitted 1 May, 2024; originally announced May 2024.

arXiv:2404.19308 [pdf, other]

doi 10.1002/andp.202200289

A characterization of entangled two-qubit states via partial-transpose-moments

Authors: Lin Zhang, Ming-Jing Zhao, Lin Chen, Hua Xiang, Yi Shen

Abstract: Although quantum entanglement is an important resource, its characterization is quite challenging. The partial transposition is a common method to detect bipartite entanglement. In this paper, the authors study the partial-transpose (PT) moments of two-qubit states, and completely describe the whole region, composed of the second and third PT-moments, for all two-qubit states. Furthermore, they determine the accurate region corresponding to all entangled two-qubit states. The states corresponding to those boundary points of the whole region, and to the border lines between separable and entangled states are analyzed. As an application, they characterize the entangled region of PT-moments for the two families of Werner states and Bell-diagonal states. The relations between entanglement and the pairs of PT-moments are revealed from these typical examples. They also numerically plot the whole region of possible PT-moments for all two-qubit X-states, and find that this region is almost the same as the whole region of PT-moments for all two-qubit states. Moreover, they extend their results to detect the entanglement of multiqubit states. By utilizing the PT-moment-based method to characterize the entanglement of the multiqubit states mixed by the GHZ and W states, they propose an operational way of verifying the genuine entanglement in such states.

Submitted 30 April, 2024; originally announced April 2024.

Comments: 31 pages, LaTeX, 9 figures

Journal ref: Annalen der Physik 534, 2200289 (2022)
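
To make the quantities in the abstract above concrete, here is a minimal sketch that computes the second and third PT-moments of a two-qubit state. It assumes the standard definition $p_k = \mathrm{Tr}[(\rho^{T_B})^k]$, which the abstract does not spell out; the function names, the basis ordering, and the restriction to real matrices are illustrative choices, not taken from the paper.

```python
def partial_transpose_b(rho):
    """Partial transpose on the second qubit of a 4x4 two-qubit density matrix.

    Assumed index convention: row = 2*i + j, column = 2*k + l,
    where (i, k) index the first qubit and (j, l) the second.
    """
    out = [[0.0] * 4 for _ in range(4)]
    for i in range(2):
        for j in range(2):
            for k in range(2):
                for l in range(2):
                    out[2 * i + l][2 * k + j] = rho[2 * i + j][2 * k + l]
    return out


def matmul(a, b):
    n = len(a)
    return [[sum(a[r][m] * b[m][c] for m in range(n)) for c in range(n)]
            for r in range(n)]


def pt_moment(rho, k):
    """k-th PT-moment, assumed to be Tr[(rho^{T_B})^k]."""
    pt = partial_transpose_b(rho)
    power = pt
    for _ in range(k - 1):
        power = matmul(power, pt)
    return sum(power[d][d] for d in range(4))


# Bell state (|00> + |11>)/sqrt(2) and the maximally mixed state I/4.
bell = [[0.5, 0.0, 0.0, 0.5],
        [0.0, 0.0, 0.0, 0.0],
        [0.0, 0.0, 0.0, 0.0],
        [0.5, 0.0, 0.0, 0.5]]
mixed = [[0.25 if r == c else 0.0 for c in range(4)] for r in range(4)]

bell_p2, bell_p3 = pt_moment(bell, 2), pt_moment(bell, 3)
mixed_p2, mixed_p3 = pt_moment(mixed, 2), pt_moment(mixed, 3)
```

The pair `(p2, p3)` is the point each state contributes to the region described in the abstract; the Bell state and the maximally mixed state give two easily checked reference points.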

arXiv:2404.12795 [pdf, other]

Stability for a class of three-tori with small negative scalar curvature

Authors: Edward Bryden, Lizhi Chen

Abstract: We define a flexible class of Riemannian metrics on the three-torus. Then, using Stern's inequality relating scalar curvature to harmonic one-forms, we show that any sequence of metrics in this family whose negative part of the scalar curvature tends to zero in $L^2$ norm has a subsequence which converges to some flat metric on the three-torus in the sense of Dong-Song.

Submitted 14 May, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

Comments: In the second version, the abstract is updated; some typos are corrected

arXiv:2404.11628 [pdf, ps, other]

Classification of positive solutions of critical anisotropic Sobolev equation without the finite volume constraint

Authors: Lu Chen, Yabo Yang

Abstract: In this paper, we classify all positive solutions of the critical anisotropic Sobolev equation \begin{equation*} -\Delta^{H}_{p}u = u^{p^{*}-1}, \quad x\in \mathbb{R}^n \end{equation*} without the finite volume constraint for $n \geq 2$ and $\frac{n+1}{3} \leq p < n$, where $p^{*} = \frac{np}{n-p}$ denotes the critical Sobolev exponent and $-\Delta^{H}_{p}=-{\rm div}(H^{p-1}(\cdot)\nabla H(\cdot))$ denotes the anisotropic $p$-Laplace operator. This result removes the finite volume assumption on the classification of the critical anisotropic $p$-Laplace equation which was obtained by Ciraolo-Figalli-Roncoroni in the literature \cite{CFR}. The method is based on constructing suitable vector fields integral inequality and using Newton's type inequality.

Submitted 10 May, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

arXiv:2404.08541 [pdf, ps, other]

Existence of monotone Morse flow lines of the expander functional

Authors: Jacob Bernstein, Letian Chen, Lu Wang

Abstract: Given a smooth asymptotically conical self-expander that is strictly unstable we construct a (singular) Morse flow line of the expander functional that connects it to a stable self-expander. This flow is monotone in a suitable sense and has a small singular set.

Submitted 12 April, 2024; originally announced April 2024.

Comments: 46 pages

MSC Class: 53E10; 49Q20

arXiv:2404.00438 [pdf, other]

Communication Efficient Distributed Training with Distributed Lion

Authors: Bo Liu, Lemeng Wu, Lizhang Chen, Kaizhao Liang, Jiaxu Zhu, Chen Liang, Raghuraman Krishnamoorthi, Qiang Liu

Abstract: The Lion optimizer has been a promising competitor to AdamW for training large AI models, with advantages in memory, computation, and sample efficiency. In this paper, we introduce Distributed Lion, an innovative adaptation of Lion for distributed training environments. Leveraging the sign operator in Lion, our Distributed Lion only requires communicating binary or lower-precision vectors between the workers and the center server, significantly reducing the communication cost. Our theoretical analysis confirms Distributed Lion's convergence properties. Empirical results demonstrate its robustness across a range of tasks, worker counts, and batch sizes, on both vision and language problems. Notably, Distributed Lion attains comparable performance to standard Lion or AdamW optimizers applied on aggregated gradients, but with significantly reduced communication bandwidth. This feature is particularly advantageous for training large models. In addition, we demonstrate that Distributed Lion presents a more favorable performance-bandwidth balance compared to existing efficient distributed methods such as deep gradient compression and ternary gradients.

Submitted 30 March, 2024; originally announced April 2024.

Comments: 22 pages
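
The communication pattern described in the Distributed Lion abstract can be illustrated with a small sketch. It assumes the standard Lion rule (update direction `sign(beta1*m + (1-beta1)*g)`, momentum `beta2*m + (1-beta2)*g`) applied locally on each worker, and it assumes the server combines the workers' sign vectors by majority vote; the abstract does not specify the aggregation rule, so `majority_vote`, `server_step`, and all hyperparameter values here are hypothetical names and choices for illustration only.

```python
def sign(x):
    """Return -1, 0, or +1 according to the sign of x."""
    return (x > 0) - (x < 0)


def lion_worker_step(m, g, beta1=0.9, beta2=0.99):
    """One local step on a worker.

    m: local momentum (list of floats), g: local gradient (list of floats).
    Returns the sign vector to communicate and the updated local momentum.
    """
    delta = [sign(beta1 * mi + (1 - beta1) * gi) for mi, gi in zip(m, g)]
    m_new = [beta2 * mi + (1 - beta2) * gi for mi, gi in zip(m, g)]
    return delta, m_new


def majority_vote(deltas):
    """Assumed aggregation: coordinate-wise sign of the summed worker votes."""
    return [sign(sum(col)) for col in zip(*deltas)]


def server_step(x, deltas, lr=0.1, weight_decay=0.0):
    """Apply the aggregated sign update (with decoupled weight decay) to x."""
    agg = majority_vote(deltas)
    return [xi - lr * (ai + weight_decay * xi) for xi, ai in zip(x, agg)]


# Three workers, two parameters, zero initial momentum.
grads = [[1.0, -2.0], [3.0, -1.0], [-1.0, -4.0]]
deltas = [lion_worker_step([0.0, 0.0], g)[0] for g in grads]
x_new = server_step([0.0, 0.0], deltas)
```

Each worker sends only one sign per coordinate, which is where the bandwidth saving over sending full-precision gradients would come from in this sketch.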

arXiv:2404.00078 [pdf, other]

Irreversible and dissipative systems

Authors: J. Beck, W. W. L. Chen, Y. Yang

Abstract: We study some new dynamical systems where the corresponding piecewise linear flow is neither time reversible nor measure preserving. We create a dissipative system by starting with a finite polysquare translation surface, and then modifying it by including a one-sided barrier on a common vertical edge of two adjacent atomic squares, in the form of a union of finitely many intervals. The line flow in this system partitions the system into a transient set and a recurrent set. We are interested in the geometry of these two sets.

Submitted 28 May, 2024; v1 submitted 28 March, 2024; originally announced April 2024.

Comments: 44 pages, 39 figures

MSC Class: 37E35; 11K38

arXiv:2404.00077 [pdf, other]

A note on the Kronecker--Weyl equidistribution theorem

Authors: J. Beck, W. W. L. Chen, Y. Yang

Abstract: We study the relationship between the discrete and the continuous versions of the Kronecker--Weyl equidistribution theorem, as well as their possible extension to manifolds in higher dimensions. We also investigate a way to deduce in some limited way uniformity results in higher dimension from results in lower dimension.

Submitted 29 May, 2024; v1 submitted 28 March, 2024; originally announced April 2024.

Comments: 12 pages, 1 figure

MSC Class: 37E35; 11K38

arXiv:2403.19960 [pdf, other]

A note on density of geodesics

Authors: J. Beck, W. W. L. Chen, Y. Yang

Abstract: We extend the famous result of Katok and Zemlyakov on the density of half-infinite geodesics on finite flat rational surfaces to half-infinite geodesics on a finite polycube translation $3$-manifold. We also extend this original result to establish a weak uniformity statement.

Submitted 28 March, 2024; originally announced March 2024.

Comments: 10 pages, 3 figures

MSC Class: 37E35; 11K38

arXiv:2403.19958 [pdf, other]

Uniformity of geodesic flow in non-integrable 3-manifolds

Authors: J. Beck, W. W. L. Chen, Y. Yang

Abstract: Almost nothing is known concerning the extension of the $3$-dimensional Kronecker--Weyl equidistribution theorem on geodesic flow from the unit torus $[0,1)^3$ to non-integrable finite polycube translation $3$-manifolds. In the special case when a finite polycube translation $3$-manifold is the cartesian product of a finite polysquare translation surface with the unit torus $[0,1)$, we have developed a splitting method with which we can make some progress. This is a somewhat restricted system, in the sense that one of the directions is integrable. We then combine this with a split-covering argument to extend our results to some other finite polycube translation $3$-manifolds which satisfy a rather special condition and where none of the $3$ directions is integrable.

Submitted 28 March, 2024; originally announced March 2024.

Comments: 44 pages, 32 figures

MSC Class: 37E35; 11K38

arXiv:2403.19954 [pdf, other]

Billiards in polyhedra: a method to convert 2-dimensional uniformity to 3-dimensional uniformity

Authors: J. Beck, W. W. L. Chen, Y. Yang

Abstract: The class of 2-dimensional non-integrable flat dynamical systems has a rather extensive literature with many deep results, but the methods developed for this type of problem, both the traditional approach via Teichmüller geometry and our recent shortline-ancestor method, appear to be exclusively plane-specific. Thus we know very little of any real significance concerning 3-dimensional systems. Our purpose here is to describe some very limited extensions of uniformity in 2 dimensions to uniformity in 3 dimensions. We consider a 3-manifold which is the cartesian product of the regular octagonal surface with the unit torus. This is a restricted system, in the sense that one of the directions is integrable. However, this restriction also allows us to make use of a transference theorem for arithmetic progressions established earlier by Beck, Donders and Yang.

Submitted 29 May, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

Comments: 13 pages, 2 figures

MSC Class: 37E35; 11K38

arXiv:2403.12460 [pdf, ps, other]

Stochastic variance reduced gradient method for linear ill-posed inverse problems

Authors: Qinian Jin, Liuhong Chen

Abstract: In this paper we apply the stochastic variance reduced gradient (SVRG) method, which is a popular variance reduction method in optimization for accelerating the stochastic gradient method, to solve large scale linear ill-posed systems in Hilbert spaces. Under {\it a priori} choices of stopping indices, we derive a convergence rate result when the sought solution satisfies a benchmark source condition and establish a convergence result without using any source condition. To terminate the method in an {\it a posteriori} manner, we consider the discrepancy principle and show that it terminates the method in finitely many iteration steps almost surely. Various numerical results are reported to test the performance of the method.

Submitted 19 March, 2024; originally announced March 2024.

arXiv:2403.07656 [pdf, other]

Optimal control of stochastic cylinder flow using data-driven compressive sensing method

Authors: Liuhong Chen, Ju Ming, Max D. Gunzburger

Abstract: A stochastic optimal control problem for incompressible Newtonian channel flow past a circular cylinder is used as a prototype optimal control problem for the stochastic Navier-Stokes equations. The inlet flow and the rotation speed of the cylinder are allowed to have stochastic perturbations. The control acts on the cylinder via adjustment of the rotation speed. Possible objectives of the control include, among others, tracking a desired (given) velocity field or minimizing the kinetic energy, enstrophy, or the drag of the flow over a given body. Owing to the high computational requirements, the direct application of the classical Monte Carlo methods to our problem is limited. To overcome the difficulty, we use a multi-fidelity data-driven compressive sensing based polynomial chaos expansion (MDCS-PCE). An effective gradient-based optimization for the discrete optimality systems resulting from the MDCS-PCE discretization is developed. The strategy can be applied broadly to many stochastic flow control problems. Numerical tests are performed to validate our methodology.

Submitted 12 March, 2024; originally announced March 2024.

arXiv:2403.01884 [pdf, ps, other]

Local existence of classical solution to the chemotaxis-shallow water system with vacuum in $\mathbb{R}^2$

Authors: Li Chen, Zhen Luo, Yucheng Wang

Abstract: In this paper, we consider the chemotaxis-shallow water system in $\mathbb{R}^2$. We establish the local existence of classical solutions without assuming the initial height is small or has a small perturbation near a constant. The far field behavior of the height is a constant which could be either vacuum or non-vacuum. The initial data is allowed to contain vacuum and the spatial measure of the set of vacuum can be arbitrarily large.

Submitted 4 March, 2024; originally announced March 2024.

arXiv:2403.00651 [pdf, ps, other]

Regularities for solutions to the $L_p$ dual Minkowski problem for unbounded closed sets

Authors: Li Chen, Qiang Tu

Abstract: Recently, the $L_p$ dual Minkowski problem for unbounded closed convex sets in a pointed closed convex cone was proposed and a weak solution to this problem was provided. In the smooth setting, this problem is equivalent to solving the Dirichlet problem for a class of Monge-Ampère type equations. In this paper, we show the existence, regularity and uniqueness of solutions to this Monge-Ampère type equation in the case $p\geq 1$ by studying variational properties for a family of Monge-Ampère functionals. Moreover, the existence and optimal global Hölder regularity in the case $p<1$ and $q\geq n$ are also discussed.

Submitted 29 April, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

Comments: 39 pages

arXiv:2403.00097 [pdf, other]

Irrational rotations and 2-filling rays

Authors: Lvzhou Chen, Alexander J. Rasmussen

Abstract: We study a skew product transformation associated to an irrational rotation of the circle [0,1]/~. This skew product keeps track of the number of times an orbit of the rotation lands in the two complementary intervals of {0,1/2} in the circle. We show that under certain conditions on the continued fraction expansion of the irrational number defining the rotation, the skew product transformation has certain dense orbits. This is in spite of the presence of numerous non-dense orbits. We use this to construct laminations on infinite type surfaces with exotic properties. In particular, we show that for every infinite type surface with an isolated planar end, there is an infinite clique of 2-filling rays based at that end. These 2-filling rays are relevant to Bavard--Walker's loop graphs.

Submitted 24 May, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

Comments: v2: Added funding information. 21 pages, 9 figures
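
To make the skew product in the abstract above concrete, here is a minimal sketch. It assumes one common form of such a map, $(x, n) \mapsto (x + \alpha \bmod 1,\ n + f(x))$ with $f = +1$ on $[0, 1/2)$ and $f = -1$ on $[1/2, 1)$; the exact cocycle used in the paper may differ, and the function name `skew_orbit` is hypothetical.

```python
import math


def skew_orbit(alpha, x0, steps):
    """Iterate the assumed skew product over the rotation by alpha.

    Returns the list of integer coordinates n after each step, where n
    goes up by 1 when the current point lies in [0, 1/2) and down by 1
    when it lies in [1/2, 1).
    """
    x, n = x0, 0
    heights = []
    for _ in range(steps):
        n += 1 if x < 0.5 else -1   # record which interval x is in
        x = (x + alpha) % 1.0       # rotate the circle [0,1]/~ by alpha
        heights.append(n)
    return heights


# A rational angle gives a periodic, easily checked orbit.
periodic = skew_orbit(0.25, 0.0, 8)

# An irrational angle (here the golden-ratio rotation) is the case of interest.
golden = skew_orbit((math.sqrt(5) - 1) / 2, 0.0, 1000)
```

Floating-point rotation only approximates an irrational angle, so a sketch like this is useful for exploration rather than as evidence about density of orbits.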

arXiv:2402.18745 [pdf, other]

Degree-heterogeneous Latent Class Analysis for High-dimensional Discrete Data

Authors: Zhongyuan Lyu, Ling Chen, Yuqi Gu

Abstract: The latent class model is a widely used mixture model for multivariate discrete data. Besides the existence of qualitatively heterogeneous latent classes, real data often exhibit additional quantitative heterogeneity nested within each latent class. The modern latent class analysis also faces extra challenges, including the high-dimensionality, sparsity, and heteroskedastic noise inherent in discrete data. Motivated by these phenomena, we introduce the Degree-heterogeneous Latent Class Model and propose a spectral approach to clustering and statistical inference in the challenging high-dimensional sparse data regime. We propose an easy-to-implement HeteroClustering algorithm. It uses heteroskedastic PCA with L2 normalization to remove degree effects and perform clustering in the top singular subspace of the data matrix. We establish an exponential error rate for HeteroClustering, leading to exact clustering under minimal signal-to-noise conditions. We further investigate the estimation and inference of the high-dimensional continuous item parameters in the model, which are crucial to interpreting and finding useful markers for latent classes. We provide comprehensive procedures for global testing and multiple testing of these parameters with valid error controls. The superior performance of our methods is demonstrated through extensive simulations and applications to three diverse real-world datasets from political voting records, genetic variations, and single-cell sequencing.

Submitted 1 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

arXiv:2402.13865 [pdf, other]

Variable Projection Algorithms: Theoretical Insights and A Novel Approach for Problems with Large Residual

Authors: Guangyong Chen, Peng Xue, Min Gan, Jing Chen, Wenzhong Guo, C. L. Philip Chen

Abstract: This paper delves into an in-depth exploration of the Variable Projection (VP) algorithm, a powerful tool for solving separable nonlinear optimization problems across multiple domains, including system identification, image processing, and machine learning. We first establish a theoretical framework to examine the effect of the approximate treatment of the coupling relationship among parameters on the local convergence of the VP algorithm and theoretically prove that Kaufman's VP algorithm can achieve a similar convergence rate to Golub \& Pereyra's form. These studies fill the gap in the existing convergence theory analysis, and provide a solid foundation for understanding the mechanism of the VP algorithm and broadening its application horizons. Furthermore, drawing inspiration from these theoretical revelations, we design a refined VP algorithm for handling separable nonlinear optimization problems characterized by large residual, called VPLR, which boosts the convergence performance by addressing the interdependence of parameters within the separable model and by continually correcting the approximated Hessian matrix to counteract the influence of large residual during the iterative process. The effectiveness of this refined algorithm is corroborated through numerical experimentation.

Submitted 21 February, 2024; originally announced February 2024.

Comments: 17 pages, 6 figures

arXiv:2402.13347 [pdf, other]

Edge-averaged virtual element methods for convection-diffusion and convection-dominated problems

Authors: Shuhao Cao, Long Chen, Seulip Lee

Abstract: This manuscript develops edge-averaged virtual element (EAVE) methodologies to address convection-diffusion problems effectively in the convection-dominated regime. It introduces a variant of EAVE that ensures monotonicity (producing an $M$-matrix) on Voronoi polygonal meshes, provided their duals are Delaunay triangulations with acute angles. Furthermore, the study outlines a comprehensive framework for EAVE methodologies, introducing another variant that integrates with the stiffness matrix derived from the lowest-order virtual element method for the Poisson equation. Numerical experiments confirm the theoretical advantages of the monotonicity property and demonstrate an optimal convergence rate across various mesh configurations.

Submitted 20 February, 2024; originally announced February 2024.

MSC Class: 65N30; 65N12

arXiv:2402.12652 [pdf, other]

PDEformer: Towards a Foundation Model for One-Dimensional Partial Differential Equations

Authors: Zhanhong Ye, Xiang Huang, Leheng Chen, Hongsheng Liu, Zidong Wang, Bin Dong

Abstract: This paper introduces PDEformer, a neural solver for partial differential equations (PDEs) capable of simultaneously addressing various types of PDEs. We propose to represent the PDE in the form of a computational graph, facilitating the seamless integration of both symbolic and numerical information inherent in a PDE. A graph Transformer and an implicit neural representation (INR) are employed to generate mesh-free predicted solutions. Following pretraining on data exhibiting a certain level of diversity, our model achieves zero-shot accuracies on benchmark datasets that are comparable to those of specifically trained expert models. Additionally, PDEformer demonstrates promising results in the inverse problem of PDE coefficient recovery.

Submitted 30 April, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

arXiv:2402.05612 [pdf, other]

Parking on supercritical geometric Bienaymé--Galton--Watson trees

Authors: Linxiao Chen, Alice Contat

Abstract: Consider a supercritical Bienaymé--Galton--Watson tree $\mathcal{T}$ with geometric offspring distribution. Each vertex of this tree represents a parking spot which can accommodate at most one car. On top of this tree, we add $(A_u : u \in \mathcal{T})$ i.i.d.\ nonnegative integers sampled according to a given law $μ$, which are the car arrivals on $\mathcal{T}$. Each car tries to park on its arriving vertex and if the spot is already occupied, it drives towards the root and takes the first available spot. If no spot is found, then it exits the tree without parking. In this paper, we provide a criterion to determine the phase of the parking process (subcritical, critical, or supercritical) depending on the generating function of $μ$.

Submitted 8 February, 2024; originally announced February 2024.

Comments: 15 pages, 3 figures; comments are welcome!
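
The parking rule stated in the abstract above (park at the arrival vertex if free, otherwise drive towards the root and take the first free spot, otherwise exit) can be sketched on a small fixed tree. This is only an illustration on a finite deterministic tree, not the random Bienaymé--Galton--Watson setting of the paper; the `parent` dictionary encoding, the function name `park`, and the order in which cars are processed are assumptions made for the example.

```python
def park(parent, arrivals):
    """Run the parking rule on a finite rooted tree.

    parent:   dict mapping each vertex to its parent (root maps to None).
    arrivals: list of (vertex, number_of_cars) pairs, processed in order.
    Returns (set of occupied vertices, number of cars that exit unparked).
    """
    occupied = set()
    exited = 0
    for vertex, count in arrivals:
        for _ in range(count):
            u = vertex
            # Drive towards the root until a free spot is found.
            while u is not None and u in occupied:
                u = parent[u]
            if u is None:
                exited += 1          # passed the root without parking
            else:
                occupied.add(u)      # each spot holds at most one car
    return occupied, exited


# Root 0 with children 1 and 2; vertex 3 is a child of 1.
parent = {0: None, 1: 0, 2: 0, 3: 1}
occupied, exited = park(parent, [(3, 2), (2, 1), (1, 2)])
```

In this example five cars arrive on a tree with four spots, so at least one car must leave through the root.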

arXiv:2401.16570 [pdf, ps, other]

Stochastic Kimura Equations

Authors: Roland Riachi, Linan Chen

Abstract: In this work we study the one-dimensional stochastic Kimura equation $\partial_{t}u\left(z,t\right)=z\partial_{z}^{2}u\left(z,t\right)+u\left(z,t\right)\dot{W}\left(z,t\right)$ for $z,t>0$ equipped with a Dirichlet boundary condition at $0$, with $\dot{W}$ being a Gaussian space-time noise. This equation can be seen as a degenerate analog of the parabolic Anderson model. We combine the Wiener chaos theory from Malliavin calculus, the Duhamel perturbation technique from PDEs, and the kernel analysis of (deterministic) degenerate diffusion equations to develop a solution theory for the stochastic Kimura equation. We establish results on existence, uniqueness, moments, and continuity for the solution $u\left(z,t\right)$. In particular, we investigate how the stochastic potential and the degeneracy in the diffusion operator jointly affect the properties of $u\left(z,t\right)$ near the boundary. We also derive explicit estimates on the comparison under the $L^{2}$-norm between $u\left(z,t\right)$ and its deterministic counterpart for $\left(z,t\right)$ within a proper range.

Submitted 5 February, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

Comments: 45 pages

MSC Class: 60H15; 60H30; 35K65

arXiv:2401.11081 [pdf, other]

Learning from Aggregate responses: Instance Level versus Bag Level Loss Functions

Authors: Adel Javanmard, Lin Chen, Vahab Mirrokni, Ashwinkumar Badanidiyuru, Gang Fu

Abstract: Due to the rise of privacy concerns, in many practical applications the training data is aggregated before being shared with the learner, in order to protect privacy of users' sensitive responses. In an aggregate learning framework, the dataset is grouped into bags of samples, where each bag is available only with an aggregate response, providing a summary of individuals' responses in that bag. In this paper, we study two natural loss functions for learning from aggregate responses: the bag-level loss and the instance-level loss. In the former, the model is learnt by minimizing a loss between aggregate responses and aggregate model predictions, while in the latter the model aims to fit individual predictions to the aggregate responses. In this work, we show that the instance-level loss can be perceived as a regularized form of the bag-level loss. This observation lets us compare the two approaches with respect to bias and variance of the resulting estimators, and introduce a novel interpolating estimator which combines the two approaches. For linear regression tasks, we provide a precise characterization of the risk of the interpolating estimator in an asymptotic regime where the size of the training set grows in proportion to the feature dimension. Our analysis allows us to theoretically understand the effect of different factors, such as bag size, on the model prediction risk. In addition, we propose a mechanism for differentially private learning from aggregate responses and derive the optimal bag size in terms of prediction risk-privacy trade-off. We also carry out thorough experiments to corroborate our theory and show the efficacy of the interpolating estimator.

Submitted 19 January, 2024; originally announced January 2024.

Comments: To appear in the Twelfth International Conference on Learning Representations (ICLR 2024)
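
The relationship between the two losses in the abstract above can be checked numerically in a simple special case. The sketch below assumes squared loss and mean aggregation within a single bag (the paper's setting may be more general); under those assumptions the instance-level loss equals the bag-level loss plus the within-bag variance of the predictions, which is one way to read "a regularized form of the bag-level loss". All function names are hypothetical.

```python
def mean(values):
    """Arithmetic mean of a non-empty list of floats."""
    return sum(values) / len(values)


def bag_level_loss(preds, bag_response):
    """Squared error between the aggregate response and the mean prediction."""
    return (bag_response - mean(preds)) ** 2


def instance_level_loss(preds, bag_response):
    """Mean squared error of each individual prediction against the aggregate."""
    return mean([(bag_response - p) ** 2 for p in preds])


def within_bag_variance(preds):
    """Population variance of the predictions inside the bag."""
    centre = mean(preds)
    return mean([(p - centre) ** 2 for p in preds])


# One bag of three model predictions with aggregate (mean) response 2.0.
preds = [1.0, 2.0, 6.0]
bag_response = 2.0
lhs = instance_level_loss(preds, bag_response)
rhs = bag_level_loss(preds, bag_response) + within_bag_variance(preds)
```

The extra variance term penalizes predictions that differ within a bag, so in this toy setting the instance-level loss pulls the predictions in a bag towards each other while the bag-level loss does not.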

arXiv:2401.09747 [pdf, ps, other]

Stochastic theta methods for random periodic solution of stochastic differential equations under non-globally Lipschitz conditions

Authors: Ziheng Chen, Liangmin Cao, Lin Chen

Abstract: This work focuses on the numerical approximations of random periodic solutions of stochastic differential equations (SDEs). Under non-globally Lipschitz conditions, we prove the existence and uniqueness of random periodic solutions for the considered equations and their numerical approximations generated by the stochastic theta (ST) methods with theta within (1/2,1]. It is shown that the random periodic solution of each ST method converges strongly in the mean square sense to that of SDEs for all step sizes. More precisely, the mean square convergence order is 1/2 for SDEs with multiplicative noise and 1 for SDEs with additive noise. Numerical results are finally reported to confirm these theoretical findings.

Submitted 20 June, 2024; v1 submitted 18 January, 2024; originally announced January 2024.
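
As an illustration of the stochastic theta method named in the abstract above, the sketch below applies it to a toy scalar SDE $dX = -\lambda X\,dt + \sigma\,dW$ with additive noise. The linear drift is an assumption chosen so that the implicit equation $X_{n+1} = X_n + h\,[\theta f(X_{n+1}) + (1-\theta) f(X_n)] + \sigma\,\Delta W_n$ can be solved in closed form; the paper treats more general, non-globally Lipschitz coefficients, and the names `theta_step` and `simulate` and all parameter values are hypothetical.

```python
import math
import random


def theta_step(x, h, dW, lam=1.0, sigma=0.5, theta=1.0):
    """One stochastic theta step for dX = -lam*X dt + sigma dW.

    Solves x_new = x + h*(theta*(-lam*x_new) + (1-theta)*(-lam*x)) + sigma*dW
    for x_new, which is possible in closed form because the drift is linear.
    """
    explicit_part = x * (1.0 - (1.0 - theta) * lam * h) + sigma * dW
    return explicit_part / (1.0 + theta * lam * h)


def simulate(x0, h, n_steps, theta=1.0, lam=1.0, sigma=0.5, seed=0):
    """Simulate one path; Brownian increments are N(0, h) draws."""
    rng = random.Random(seed)
    x = x0
    path = [x]
    for _ in range(n_steps):
        dW = rng.gauss(0.0, math.sqrt(h))
        x = theta_step(x, h, dW, lam, sigma, theta)
        path.append(x)
    return path


path = simulate(x0=1.0, h=0.1, n_steps=10, theta=0.75)
```

With `sigma=0.0` and `theta=1.0` the step reduces to the backward Euler update `x / (1 + lam*h)`, which gives a quick sanity check of the formula.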

arXiv:2401.08592 [pdf, ps, other]

Double Extensions of Multiplicative Restricted Hom-Lie Algebras

Authors: Dan Mao, Zeyu Hao, Liangyun Chen

Abstract: In this paper, we study the double extension of a restricted quadratic Hom-Lie algebra $(V,[\cdot,\cdot]_{V},α_{V},B_{V})$, which is an enlargement of $V$ by means of a central extension and a restricted derivation $\mathscr{D}$. In particular, we prove that the double extension of a restricted quadratic Hom-Lie algebra $V$ with a $\mathscr{D}$-invariant bilinear form $B_{V}$ is restricted. Conversely, any irreducible restricted quadratic Hom-Lie algebra with nonzero center is proved to be the double extension of another restricted quadratic Hom-Lie algebra.

Submitted 17 November, 2023; originally announced January 2024.

Comments: 39 pages

arXiv:2401.07672 [pdf, ps, other]

Accelerated Gradient Methods with Gradient Restart: Global Linear Convergence

Authors: Chenglong Bao, Liang Chen, Jiahong Li, Zuowei Shen

Abstract: Gradient restarting has been shown to improve the numerical performance of accelerated gradient methods. This paper provides a mathematical analysis to understand these advantages. First, we establish global linear convergence guarantees for the gradient restarted accelerated proximal gradient method when solving strongly convex composite optimization problems. Second, through analysis of the corresponding ordinary differential equation model, we prove that the continuous trajectory of the gradient restarted Nesterov's accelerated gradient method exhibits global linear convergence for quadratic strongly convex objectives, while the non-restarted version provably lacks this property by [Su, Boyd, and Candès, J. Mach. Learn. Res., 2016, 17(153), 1-43].

Submitted 15 January, 2024; originally announced January 2024.

MSC Class: 90C25; 65K05; 65B05; 90C06; 90C30

arXiv:2401.06925 [pdf, ps, other]

Modeling Latent Selection with Structural Causal Models

Authors: Leihao Chen, Onno Zoeter, Joris M. Mooij

Abstract: Selection bias is ubiquitous in real-world data, and can lead to misleading results if not dealt with properly. We introduce a conditioning operation on Structural Causal Models (SCMs) to model latent selection from a causal perspective. We show that the conditioning operation transforms an SCM with the presence of an explicit latent selection mechanism into an SCM without such selection mechanism, which partially encodes the causal semantics of the selected subpopulation according to the original SCM. Furthermore, we show that this conditioning operation preserves the simplicity, acyclicity, and linearity of SCMs, and commutes with marginalization. Thanks to these properties, combined with marginalization and intervention, the conditioning operation offers a valuable tool for conducting causal reasoning tasks within causal models where latent details have been abstracted away. We demonstrate by example how classical results of causal inference can be generalized to include selection bias and how the conditioning operation helps with modeling of real-world problems.

Submitted 12 January, 2024; originally announced January 2024.

arXiv:2401.01797 [pdf, other]

Parabolic Anderson model in bounded domains of recurrent metric measure spaces

Authors: Fabrice Baudoin, Li Chen, Che-Hung Huang, Cheng Ouyang, Samy Tindel, Jing Wang

Abstract: A metric measure space equipped with a Dirichlet form is called recurrent if its Hausdorff dimension is less than its walk dimension. In bounded domains of such spaces we study the parabolic Anderson models \[ \partial_{t} u(t,x) = Δu(t,x) + βu(t,x) \, \dot{W}_α(t,x) \] where the noise $W_α$ is white in time and colored in space when $α>0$ while for $α=0$ it is also white in space. Both Dirichlet and Neumann boundary conditions are considered. Besides proving existence and uniqueness in the Itô sense we also get precise $L^p$ estimates for the moments and intermittency properties of the solution as a consequence. Our study reveals new exponents which are intrinsically associated to the geometry of the underlying space and the results for instance apply in metric graphs or fractals like the Sierpiński gasket for which we prove scaling invariance properties of the models.

Submitted 3 January, 2024; originally announced January 2024.

Comments: 50 pages, 3 figures

arXiv:2312.16729 [pdf, ps, other]

Behavioural pseudometrics for continuous-time diffusions

Authors: Linan Chen, Florence Clerc, Prakash Panangaden

Abstract: Bisimulation is a concept that captures behavioural equivalence of states in a variety of types of transition systems. It has been widely studied in a discrete-time setting where the notion of a step is fundamental. In our setting we are considering "flow"-processes emphasizing that they evolve in continuous time. In such continuous-time settings, the concepts are not straightforward adaptations of their discrete-time analogues and we restrict our study to diffusions that do not lose mass over time and with additional regularity constraints. In previous work we proposed different definitions of behavioural equivalences for continuous-time stochastic processes where the evolution is a flow through time. That work only addressed equivalences. In this work, we aim at quantifying how differently processes behave. We present two pseudometrics for diffusion-like processes. These pseudometrics are fixpoints of two different functionals on the space of 1-bounded pseudometrics on the state space. We also characterize these pseudometrics in terms of real-valued modal logics; this is a quantitative analogue of the notion of logical characterization of bisimulation. These real-valued modal logics indicate that the two pseudometrics are different and thus yield different notions of behavioural equivalence.

Submitted 30 April, 2024; v1 submitted 27 December, 2023; originally announced December 2023.

arXiv:2312.16017 [pdf, ps, other]

Classification of positive solutions of Hardy-Sobolev equation without the finite volume constraints

Authors: Lu Chen, Yabo Yang

Abstract: In this paper, we are concerned with the critical Hardy-Sobolev equation \begin{equation*} -Δ_{p}u = \frac{u^{p^{*}_s-1}}{|x|^{s}}, \ \ x\in \mathbb{R}^n \end{equation*} where $p^{*}_s = \frac{(n-s)p}{n-p}$ denotes the critical Hardy-Sobolev exponent. We classify the positive solutions of this equation for $0 < s < \frac{p-1}{p}$ and $\frac{(2s+n+1)+\sqrt{(2s+n+1)^2-12s}}{6} \leq p < n$ without finite volume constraints, which extends Ou's result in \cite{9} in the literature. The method is based on constructing suitable vector fields integral inequality and using Newton's type inequality.

Submitted 26 December, 2023; originally announced December 2023.

arXiv:2312.12355 [pdf, other]

Transformed Primal-Dual Methods with Variable-Preconditioners

Authors: Long Chen, Ruchi Guo, Jingrong Wei

Abstract: This paper introduces a novel Transformed Primal-Dual with variable-metric/preconditioner (TPDv) algorithm, designed to efficiently solve affine constrained optimization problems common in nonlinear partial differential equations (PDEs). Diverging from traditional methods, TPDv iteratively updates time-evolving preconditioning operators, enhancing adaptability. The algorithm is derived and analyzed, demonstrating global linear convergence rates under mild assumptions. Numerical experiments on challenging nonlinear PDEs, including the Darcy-Forchheimer model and a nonlinear electromagnetic problem, showcase the algorithm's superiority over existing methods in terms of iteration numbers and computational efficiency. The paper concludes with a comprehensive convergence analysis.

Submitted 19 December, 2023; originally announced December 2023.

MSC Class: 37N30; 47J25; 65K05; 65N12; 90C30

arXiv:2312.11787 [pdf, ps, other]

Optimal asymptotic lower bound for stability of fractional Sobolev inequality and the global stability of Log-Sobolev inequality on the sphere

Authors: Lu Chen, Guozhen Lu, Hanli Tang

Abstract: In this paper, we are concerned with the optimal asymptotic lower bound for the stability of fractional Sobolev inequality: \begin{equation}\label{Sob sta ine} \left\|(-Δ)^{s/2} U \right\|_2^2 - \mathcal S_{s,n} \| U\|_{\frac{2n}{n-2s}}^2\geq C_{n,s} d^{2}(U, \mathcal{M}_s), \end{equation} where $\mathcal{M}_s$ is the set of maximizers of the fractional Sobolev inequality of order $s$, $s\in (0,\min\{1,n/6\})$ and $C_{n,s}$ denotes the optimal lower bound of stability. We prove that the optimal lower bound $C_{n,s}$ is equal to $O(\frac{1}{n})$ when $n\rightarrow +\infty$ for any $s\in (0,1)$, which extends the work by Dolbeault-Esteban-Figalli-Frank-Loss [18] when $s=1$ and quantify the asymptotic behavior for lower bound of stability of fractional Sobolev inequality established by the author's previous work in [15] in the case of $s\in (0,\min\{1,n/6\})$. Moreover, $C_{n,s}$ is equal to $O(s)$ when $s\rightarrow 0$ for any dimension $n$. (See Theorem 1.1 for these asymptotic estimates.) As an application, we derive the global stability for the log-Sobolev inequality with the optimal asymptotic lower bound on the sphere through the stability of fractional Sobolev inequalities with optimal asymptotic lower bound and the end-point differentiation method (see Theorem 1.3). This sharpens the earlier work by the authors [14] on the local stability for the log-Sobolev inequality on the sphere. We also obtain the asymptotically optimal lower bound for the Hardy-Littlewood-Sobolev inequality when $s\to 0$ and $n\to \infty$ (See Theorem 1.4 and the subsequent Remark 1.5).

Submitted 22 February, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

Comments: 24 pages. Some typos have been corrected. We also added an explanation about the asymptotical optimality of the lower bound of the stability of the Hardy-Littlewood-Sobolev inequality (See Theorem 1.4 and Remark 1.5)

arXiv:2311.15508 [pdf, other]

Surface mapping class group actions on 3-manifolds

Authors: Alina Al Beaini, Lei Chen, Bena Tshishiku

Abstract: For each circle bundle $S^1\to X\toΣ_g$ over a surface with genus $g\ge2$, there is a natural surjection $π:Homeo^+(X)\to Mod(Σ_g)$. When $X$ is the unit tangent bundle $UΣ_g$, it is well-known that $π$ splits. On the other hand $π$ does not split when the Euler number $e(X)$ is not divisible by the Euler characteristic $χ(Σ_g)$ by work of the second two authors. In this paper we show that this homomorphism does not split in many cases where $χ(Σ_g)$ divides $e(X)$.

Submitted 26 November, 2023; originally announced November 2023.

Comments: 15 pages

arXiv:2311.09051 [pdf, other]

Distributional Finite Element curl div Complexes and Application to Quad Curl Problems

Authors: Long Chen, Xuehai Huang, Chao Zhang

Abstract: The paper addresses the challenge of constructing conforming finite element spaces for high-order differential operators in high dimensions, with a focus on the curl div operator in three dimensions. Tangential-normal continuity is introduced in order to develop distributional finite element curl div complexes. The spaces constructed are applied to discretize a quad curl problem, demonstrating optimal order of convergence. Furthermore, a hybridization technique is proposed, demonstrating its equivalence to nonconforming finite elements and weak Galerkin methods.

Submitted 15 November, 2023; v1 submitted 15 November, 2023; originally announced November 2023.

Comments: 25 pages, 3 figures

arXiv:2311.08406 [pdf, ps, other]

Modular structure theory on Hom-Lie algebras

Authors: Dan Mao, Baoling Guan, Liangyun Chen

Abstract: The aim of this paper is to transfer the restrictedness theory to Hom-Lie algebras. The concept of restricted Hom-Lie algebras which is introduced in \cite{BM2} will be used in this paper. First, the existence of $p$-structures on a Hom-Lie algebra is studied and the direct sum of restricted Hom-Lie algebras is analyzed. Then, the definition of a restrictable Hom-Lie algebra is given and the equivalence relation between restrictable Hom-Lie algebras and restricted Hom-Lie algebras is constructed. Finally, the $p$-envelopes of a Hom-Lie algebra are defined and studied.

Submitted 30 November, 2023; v1 submitted 7 August, 2023; originally announced November 2023.

Comments: 18 pages

arXiv:2311.07935 [pdf, ps, other]

Counting function and lower bound for Dirichlet eigenvalues of the m-order logarithmic Laplacian

Authors: Huyuan Chen, Long Chen

Abstract: Our aim in this article is to obtain the limit of counting function for the Dirichlet eigenvalues involving the m-order logarithmic Laplacian in a bounded Lipschitz domain and to derive also the lower bound.

Submitted 14 November, 2023; originally announced November 2023.

Comments: 19

Showing 1–50 of 706 results for author: Chen, L