-
Large and Small Deviations for Statistical Sequence Matching
Authors:
Lin Zhou,
Qianyun Wang,
**g**g Wang,
Lin Bai,
Alfred O. Hero
Abstract:
We revisit the problem of statistical sequence matching between two databases of sequences initiated by Unnikrishnan (TIT 2015) and derive theoretical performance guarantees for the generalized likelihood ratio test (GLRT). We first consider the case where the number of matched pairs of sequences between the databases is known. In this case, the task is to accurately find the matched pairs of sequ…
▽ More
We revisit the problem of statistical sequence matching between two databases of sequences initiated by Unnikrishnan (TIT 2015) and derive theoretical performance guarantees for the generalized likelihood ratio test (GLRT). We first consider the case where the number of matched pairs of sequences between the databases is known. In this case, the task is to accurately find the matched pairs of sequences among all possible matches between the sequences in the two databases. We analyze the performance of the GLRT by Unnikrishnan and explicitly characterize the tradeoff between the mismatch and false reject probabilities under each hypothesis in both large and small deviations regimes. Furthermore, we demonstrate the optimality of Unnikrishnan's GLRT test under the generalized Neyman-Person criterion for both regimes and illustrate our theoretical results via numerical examples. Subsequently, we generalize our achievability analyses to the case where the number of matched pairs is unknown, and an additional error probability needs to be considered. When one of the two databases contains a single sequence, the problem of statistical sequence matching specializes to the problem of multiple classification introduced by Gutman (TIT 1989). For this special case, our result for the small deviations regime strengthens previous result of Zhou, Tan and Motani (Information and Inference 2020) by removing unnecessary conditions on the generating distributions.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
On $\{1,2\}$-distance-balancedness of generalized Petersen graphs
Authors:
Gang Ma,
Jianfeng Wang,
Sandi Klavžar
Abstract:
A connected graph $G$ of diameter ${\rm diam}(G) \ge \ell$ is $\ell$-distance-balanced if $|W_{xy}|=|W_{yx}|$ for every $x,y\in V(G)$ with $d_{G}(x,y)=\ell$, where $W_{xy}$ is the set of vertices of $G$ that are closer to $x$ than to $y$. It is proved that if $k\ge 3$ and $n>k(k+2)$, then the generalized Petersen graph $GP(n,k)$ is not distance-balanced and that $GP(k(k+2),k)$ is distance-balanced…
▽ More
A connected graph $G$ of diameter ${\rm diam}(G) \ge \ell$ is $\ell$-distance-balanced if $|W_{xy}|=|W_{yx}|$ for every $x,y\in V(G)$ with $d_{G}(x,y)=\ell$, where $W_{xy}$ is the set of vertices of $G$ that are closer to $x$ than to $y$. It is proved that if $k\ge 3$ and $n>k(k+2)$, then the generalized Petersen graph $GP(n,k)$ is not distance-balanced and that $GP(k(k+2),k)$ is distance-balanced. This significantly improves the main result of Yang et al.\ [Electron.\ J.\ Combin.\ 16 (2009) \#N33]. It is also proved that if $k\ge 6$, where $k$ is even, and $n>\frac{5}{4}k^2+2k$, or if $k\ge 5$, where $k$ is odd, and $n>\frac{7}{4}k^2+\frac{3}{4}k$, then $GP(n,k)$ is not $2$-distance-balanced. These results partially resolve a conjecture of Miklavič and Šparl [Discrete Appl.\ Math.\ 244 (2018) 143--154].
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
Global well-posedness, scattering and blow-up for the energy-critical, Schrödinger equation with indefinite potential in the radial case
Authors:
Jun Wang,
Zhaoyang Yin
Abstract:
In this paper, we study the well-posedness theory and the scattering asymptotics for the energy-critical, Schrödinger equation with indefinite potential \begin{equation*}
\left\{\begin{array}{l} i \partial_t u+Δu-V(x)u +|u|^{\frac{4}{N-2}}u=0,\ (x, t) \in \mathbb{R}^N \times \mathbb{R}, \\ \left.u\right|_{t=0}=u_0 \in H ^1(\mathbb{R}^N), \end{array}\right. \end{equation*} where…
▽ More
In this paper, we study the well-posedness theory and the scattering asymptotics for the energy-critical, Schrödinger equation with indefinite potential \begin{equation*}
\left\{\begin{array}{l} i \partial_t u+Δu-V(x)u +|u|^{\frac{4}{N-2}}u=0,\ (x, t) \in \mathbb{R}^N \times \mathbb{R}, \\ \left.u\right|_{t=0}=u_0 \in H ^1(\mathbb{R}^N), \end{array}\right. \end{equation*} where $V(x):\mathbb{R}^N\rightarrow \mathbb{R}$ is indefinite and satisfies appropriate conditions. Using contraction map** method and concentration compactness argument, we obtain the well-posedness theory in proper function spaces and scattering asymptotics. Moreover, we get a positive ground state solution which is radially symmetric by using variational methods. This paper extends the results of \cite{KCEMF2006}(Invent. Math) to the potential equation and develops the recent conclusions.
△ Less
Submitted 22 June, 2024;
originally announced July 2024.
-
On well (edge) dominated and equimatchable strong product graphs
Authors:
Yixin Cao,
Guiqiang Mou,
Jianxin Wang
Abstract:
A graph is well-(edge-)dominated if every minimal (edge) dominating set is minimum. A graph is equimatchable if every maximal matching is maximum. We study these concepts on strong product graphs. We fully characterize well-edge-dominated and equimatchable strong product graphs of nontrivial graphs, and identify a large family of graphs whose strong products with any well-dominated graph are well-…
▽ More
A graph is well-(edge-)dominated if every minimal (edge) dominating set is minimum. A graph is equimatchable if every maximal matching is maximum. We study these concepts on strong product graphs. We fully characterize well-edge-dominated and equimatchable strong product graphs of nontrivial graphs, and identify a large family of graphs whose strong products with any well-dominated graph are well-dominated.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
A Parallel iterative Algorithm for primal-dual weak Galerkin Schemes
Authors:
Chunmei Wang,
Jun** Wang
Abstract:
This paper presents and analyzes a parallelizable iterative procedure based on domain decomposition for primal-dual weak Galerkin (PDWG) finite element methods applied to the Poisson equation. The existence and uniqueness of the PDWG solution are established. Optimal order of error estimates are derived in both a discrete norm and the $L^2$ norm. The convergence analysis is conducted for domain de…
▽ More
This paper presents and analyzes a parallelizable iterative procedure based on domain decomposition for primal-dual weak Galerkin (PDWG) finite element methods applied to the Poisson equation. The existence and uniqueness of the PDWG solution are established. Optimal order of error estimates are derived in both a discrete norm and the $L^2$ norm. The convergence analysis is conducted for domain decompositions into individual elements associated with the PDWG methods, which can be extended to larger subdomains without any difficulty.
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
LLT Polynomials and Hecke Algebra Traces
Authors:
Alejandro H. Morales,
Mark A. Skandera,
Jiayuan Wang
Abstract:
We show that coefficients in unicellular LLT polynomials are evaluations of Hecke algebra traces at Kazhdan-Lusztig basis elements. We express these in terms of traditional trace bases, induction, and Kazhdan-Lusztig R-polynomials.
We show that coefficients in unicellular LLT polynomials are evaluations of Hecke algebra traces at Kazhdan-Lusztig basis elements. We express these in terms of traditional trace bases, induction, and Kazhdan-Lusztig R-polynomials.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
A multi-mesh approach for accurate computation of multi-target functionals in aerodynamics design
Authors:
Guanghui Hu,
Ruo Li,
**gfeng Wang
Abstract:
Aerodynamic optimal design is crucial for enhancing performance of aircrafts, while calculating multi-target functionals through solving dual equations with arbitrary right-hand sides remains challenging. In this paper, a novel multi-target framework of DWR-based mesh refinement is proposed and analyzed. Theoretically, an extrapolation method is generalized to expand multi-variable functionals, wh…
▽ More
Aerodynamic optimal design is crucial for enhancing performance of aircrafts, while calculating multi-target functionals through solving dual equations with arbitrary right-hand sides remains challenging. In this paper, a novel multi-target framework of DWR-based mesh refinement is proposed and analyzed. Theoretically, an extrapolation method is generalized to expand multi-variable functionals, which guarantees the dual equations of different objective functionals can be calculated separately. Numerically, an algorithm of calculating multi-target functionals is designed based on the multi-mesh approach, which can help to obtain different dual solutions simultaneously. One feature of our framework is the algorithm is easy to implement with the help of the hierarchical geometry tree structure and the calculation avoids the Galerkin orthogonality naturally. The framework takes a balance between different targets even when they are not the same orders of magnitude. While existing approach uses a linear combination of different components in multi-target functionals for adaptation, it introduces additional coefficients for adjusting. With each component calculated under a dual-consistent scheme, this multi-mesh framework addresses challenges such as the lift-drag ratio and other kinds of multi-target functionals, ensuring smooth convergence and precise calculations of dual solutions.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
Effective Generation of Feasible Solutions for Integer Programming via Guided Diffusion
Authors:
Hao Zeng,
Jiaqi Wang,
Avirup Das,
Junying He,
Kunpeng Han,
Haoyuan Hu,
Mingfei Sun
Abstract:
Feasible solutions are crucial for Integer Programming (IP) since they can substantially speed up the solving process. In many applications, similar IP instances often exhibit similar structures and shared solution distributions, which can be potentially modeled by deep learning methods. Unfortunately, existing deep-learning-based algorithms, such as Neural Diving and Predict-and-search framework,…
▽ More
Feasible solutions are crucial for Integer Programming (IP) since they can substantially speed up the solving process. In many applications, similar IP instances often exhibit similar structures and shared solution distributions, which can be potentially modeled by deep learning methods. Unfortunately, existing deep-learning-based algorithms, such as Neural Diving and Predict-and-search framework, are limited to generating only partial feasible solutions, and they must rely on solvers like SCIP and Gurobi to complete the solutions for a given IP problem. In this paper, we propose a novel framework that generates complete feasible solutions end-to-end. Our framework leverages contrastive learning to characterize the relationship between IP instances and solutions, and learns latent embeddings for both IP instances and their solutions. Further, the framework employs diffusion models to learn the distribution of solution embeddings conditioned on IP representations, with a dedicated guided sampling strategy that accounts for both constraints and objectives. We empirically evaluate our framework on four typical datasets of IP problems, and show that it effectively generates complete feasible solutions with a high probability (> 89.7 \%) without the reliance of Solvers and the quality of solutions is comparable to the best heuristic solutions from Gurobi. Furthermore, by integrating our method's sampled partial solutions with the CompleteSol heuristic from SCIP, the resulting feasible solutions outperform those from state-of-the-art methods across all datasets, exhibiting a 3.7 to 33.7\% improvement in the gap to optimal values, and maintaining a feasible ratio of over 99.7\% for all datasets.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
Global well-posedness, scattering and blow-up for the energy-critical, Schrödinger equation with general nonlinearity in the radial case
Authors:
Jun Wang,
Zhaoyang Yin
Abstract:
In this paper, we study the well-posedness theory and the scattering asymptotics for the energy-critical, Schrödinger equation with general nonlinearity \begin{equation*}
\left\{\begin{array}{l} i \partial_t u+Δu + f(u)=0,\ (x, t) \in \mathbb{R}^N \times \mathbb{R}, \\ \left.u\right|_{t=0}=u_0 \in H ^1(\mathbb{R}^N), \end{array}\right. \end{equation*} where $f:\mathbb{C}\rightarrow \mathbb{C}$ s…
▽ More
In this paper, we study the well-posedness theory and the scattering asymptotics for the energy-critical, Schrödinger equation with general nonlinearity \begin{equation*}
\left\{\begin{array}{l} i \partial_t u+Δu + f(u)=0,\ (x, t) \in \mathbb{R}^N \times \mathbb{R}, \\ \left.u\right|_{t=0}=u_0 \in H ^1(\mathbb{R}^N), \end{array}\right. \end{equation*} where $f:\mathbb{C}\rightarrow \mathbb{C}$ satisfies Sobolev critical growth condition. Using contraction map** method and concentration compactness argument, we obtain the well-posedness theory in proper function spaces and scattering asymptotics. This paper generalizes the conclusions in \cite{KCEMF2006}(Invent. Math).
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
Extending Structures for Dendriform Algebras
Authors:
Yuanyuan Zhang,
Junwen Wang
Abstract:
In this paper, we devote to extending structures for dendriform algebras. First, we define extending datums and unified products of dendriform algebras, and theoretically solve the extending structure problem. As an application, we consider flag datums as a special case of extending structures, and give an example of the extending structure problem. Second, we introduce matched pairs and bicrossed…
▽ More
In this paper, we devote to extending structures for dendriform algebras. First, we define extending datums and unified products of dendriform algebras, and theoretically solve the extending structure problem. As an application, we consider flag datums as a special case of extending structures, and give an example of the extending structure problem. Second, we introduce matched pairs and bicrossed products of dendriform algebras and theoretically solve the factorization problem for dendriform algebras. Moreover, we also introduce cocycle semidirect products and nonabelian semidirect products as special cases of unified products. Finally, we define the deformation map on a dendriform extending structure (more general case), not necessary a matched pair, which is more practical in the classifying complements problem.
△ Less
Submitted 25 June, 2024; v1 submitted 16 June, 2024;
originally announced June 2024.
-
A Fine-grained Analysis of Fitted Q-evaluation: Beyond Parametric Models
Authors:
Jiayi Wang,
Zhengling Qi,
Raymond K. W. Wong
Abstract:
In this paper, we delve into the statistical analysis of the fitted Q-evaluation (FQE) method, which focuses on estimating the value of a target policy using offline data generated by some behavior policy. We provide a comprehensive theoretical understanding of FQE estimators under both parameteric and nonparametric models on the $Q$-function. Specifically, we address three key questions related t…
▽ More
In this paper, we delve into the statistical analysis of the fitted Q-evaluation (FQE) method, which focuses on estimating the value of a target policy using offline data generated by some behavior policy. We provide a comprehensive theoretical understanding of FQE estimators under both parameteric and nonparametric models on the $Q$-function. Specifically, we address three key questions related to FQE that remain largely unexplored in the current literature: (1) Is the optimal convergence rate for estimating the policy value regarding the sample size $n$ ($n^{-1/2}$) achievable for FQE under a non-parametric model with a fixed horizon ($T$)? (2) How does the error bound depend on the horizon $T$? (3) What is the role of the probability ratio function in improving the convergence of FQE estimators? Specifically, we show that under the completeness assumption of $Q$-functions, which is mild in the non-parametric setting, the estimation errors for policy value using both parametric and non-parametric FQE estimators can achieve an optimal rate in terms of $n$. The corresponding error bounds in terms of both $n$ and $T$ are also established. With an additional realizability assumption on ratio functions, the rate of estimation errors can be improved from $T^{1.5}/\sqrt{n}$ to $T/\sqrt{n}$, which matches the sharpest known bound in the current literature under the tabular setting.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Measure This, Not That: Optimizing the Cost and Model-Based Information Content of Measurements
Authors:
Jialu Wang,
Zedong Peng,
Ryan Hughes,
Debangsu Bhattacharyya,
David E. Bernal Neira,
Alexander W. Dowling
Abstract:
Model-based design of experiments (MBDoE) is a powerful framework for selecting and calibrating science-based mathematical models from data. This work extends popular MBDoE workflows by proposing a convex mixed integer (non)linear programming (MINLP) problem to optimize the selection of measurements. The solver MindtPy is modified to support calculating the D-optimality objective and its gradient…
▽ More
Model-based design of experiments (MBDoE) is a powerful framework for selecting and calibrating science-based mathematical models from data. This work extends popular MBDoE workflows by proposing a convex mixed integer (non)linear programming (MINLP) problem to optimize the selection of measurements. The solver MindtPy is modified to support calculating the D-optimality objective and its gradient via an external package, \texttt{SciPy}, using the grey-box module in Pyomo. The new approach is demonstrated in two case studies: estimating highly correlated kinetics from a batch reactor and estimating transport parameters in a large-scale rotary packed bed for CO$_2$ capture. Both case studies show how examining the Pareto-optimal trade-offs between information content measured by A- and D-optimality versus measurement budget offers practical guidance for selecting measurements for scientific experiments.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
Convergence of bi-spatial pullback random attractors and stochastic Liouville type equations for nonautonomous stochastic p-Laplacian lattice system
Authors:
**tao Wang,
Qinghai Peng,
Chunqiu Li
Abstract:
We consider convergence properties of the long-term behaviors with respect to the coefficient of the stochastic term for a nonautonomous stochastic $p$-Laplacian lattice equation with multiplicative noise. First, the upper semi-continuity of pullback random $(\ell^2,\ell^q)$-attractor is proved for each $q\in[1,+\infty)$. Then, a convergence result of the time-dependent invariant sample Borel prob…
▽ More
We consider convergence properties of the long-term behaviors with respect to the coefficient of the stochastic term for a nonautonomous stochastic $p$-Laplacian lattice equation with multiplicative noise. First, the upper semi-continuity of pullback random $(\ell^2,\ell^q)$-attractor is proved for each $q\in[1,+\infty)$. Then, a convergence result of the time-dependent invariant sample Borel probability measures is obtained in $\ell^2$. Next, we show that the invariant sample measures satisfy a stochastic Liouville type equation and a termwise convergence of the stochastic Liouville type equations is verified. Furthermore, each family of the invariant sample measures is turned out to be a sample statistical solution, which hence also fulfills a convergence consequence.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
Bottom spectrum of three-dimensional manifolds with scalar curvature lower bound
Authors:
Ovidiu Munteanu,
Jia** Wang
Abstract:
A classical result of Cheng states that the bottom spectrum of complete manifolds of fixed dimension and Ricci curvature lower bound achieves its maximal value on the corresponding hyperbolic space. The paper establishes an analogous result for three-dimensional complete manifolds with scalar curvature lower bound subject to some necessary topological assumptions. The rigidity issue is also addres…
▽ More
A classical result of Cheng states that the bottom spectrum of complete manifolds of fixed dimension and Ricci curvature lower bound achieves its maximal value on the corresponding hyperbolic space. The paper establishes an analogous result for three-dimensional complete manifolds with scalar curvature lower bound subject to some necessary topological assumptions. The rigidity issue is also addressed and a splitting theorem is obtained for such manifolds with the maximal bottom spectrum.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
Nonlinear Eigen-approach ADMM for Sparse Optimization on Stiefel Manifold
Authors:
Jiawei Wang,
Rencang Li,
Richard Yi Da Xu
Abstract:
With the growing interest and applications in machine learning and data science, finding an efficient method to sparse analysis the high-dimensional data and optimizing a dimension reduction model to extract lower dimensional features has becoming more and more important. Orthogonal constraints (Stiefel manifold) is a commonly met constraint in these applications, and the sparsity is usually enfor…
▽ More
With the growing interest and applications in machine learning and data science, finding an efficient method to sparse analysis the high-dimensional data and optimizing a dimension reduction model to extract lower dimensional features has becoming more and more important. Orthogonal constraints (Stiefel manifold) is a commonly met constraint in these applications, and the sparsity is usually enforced through the element-wise L1 norm. Many applications can be found on optimization over Stiefel manifold within the area of physics and machine learning. In this paper, we propose a novel idea by tackling the Stiefel manifold through an nonlinear eigen-approach by first using ADMM to split the problem into smooth optimization over manifold and convex non-smooth optimization, and then transforming the former into the form of nonlinear eigenvalue problem with eigenvector dependency (NEPv) which is solved by self-consistent field (SCF) iteration, and the latter can be found to have an closed-form solution through proximal gradient. Compared with existing methods, our proposed algorithm takes the advantage of specific structure of the objective function, and has efficient convergence results under mild assumptions.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Hardness of Learning Neural Networks under the Manifold Hypothesis
Authors:
Bobak T. Kiani,
Jason Wang,
Melanie Weber
Abstract:
The manifold hypothesis presumes that high-dimensional data lies on or near a low-dimensional manifold. While the utility of encoding geometric structure has been demonstrated empirically, rigorous analysis of its impact on the learnability of neural networks is largely missing. Several recent results have established hardness results for learning feedforward and equivariant neural networks under…
▽ More
The manifold hypothesis presumes that high-dimensional data lies on or near a low-dimensional manifold. While the utility of encoding geometric structure has been demonstrated empirically, rigorous analysis of its impact on the learnability of neural networks is largely missing. Several recent results have established hardness results for learning feedforward and equivariant neural networks under i.i.d. Gaussian or uniform Boolean data distributions. In this paper, we investigate the hardness of learning under the manifold hypothesis. We ask which minimal assumptions on the curvature and regularity of the manifold, if any, render the learning problem efficiently learnable. We prove that learning is hard under input manifolds of bounded curvature by extending proofs of hardness in the SQ and cryptographic settings for Boolean data inputs to the geometric setting. On the other hand, we show that additional assumptions on the volume of the data manifold alleviate these fundamental limitations and guarantee learnability via a simple interpolation argument. Notable instances of this regime are manifolds which can be reliably reconstructed via manifold learning. Looking forward, we comment on and empirically explore intermediate regimes of manifolds, which have heterogeneous features commonly found in real world data.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Non-asymptotic Properties of Generalized Mondrian Forests in Statistical Learning
Authors:
Haoran Zhan,
**gli Wang,
Yingcun Xia
Abstract:
Since the publication of Breiman (2001), Random Forests (RF) have been widely used in both regression and classification. Later on, other forests are also proposed and studied in literature and Mondrian Forests are notable examples built on the Mondrian process; see Lakshminarayanan et al. (2014). In this paper, we propose an ensemble estimator in general statistical learning based on Mondrian For…
▽ More
Since the publication of Breiman (2001), Random Forests (RF) have been widely used in both regression and classification. Later on, other forests are also proposed and studied in literature and Mondrian Forests are notable examples built on the Mondrian process; see Lakshminarayanan et al. (2014). In this paper, we propose an ensemble estimator in general statistical learning based on Mondrian Forests, which can be regarded as an extension of RF. This general framework includes many common learning problems, such as least squared regression, least $\ell_1$ regression, quantile regression and classification. Under mild conditions of loss functions, we give the upper bound of the regret function of our estimator and show that such estimator is statistically consistent.
△ Less
Submitted 2 June, 2024;
originally announced June 2024.
-
Intersecting families with large shadow degree
Authors:
Peter Frankl,
Jian Wang
Abstract:
A $k$-uniform family $\mathcal{F}$ is called intersecting if $F\cap F'\neq \emptyset$ for all $F,F'\in \mathcal{F}$. The shadow family $\partial \mathcal{F}$ is the family of $(k-1)$-element sets that are contained in some members of $\mathcal{F}$. The shadow degree (or minimum positive co-degree) of $\mathcal{F}$ is defined as the maximum integer $r$ such that every $E\in \partial \mathcal{F}$ is…
▽ More
A $k$-uniform family $\mathcal{F}$ is called intersecting if $F\cap F'\neq \emptyset$ for all $F,F'\in \mathcal{F}$. The shadow family $\partial \mathcal{F}$ is the family of $(k-1)$-element sets that are contained in some members of $\mathcal{F}$. The shadow degree (or minimum positive co-degree) of $\mathcal{F}$ is defined as the maximum integer $r$ such that every $E\in \partial \mathcal{F}$ is contained in at least $r$ members of $\mathcal{F}$. In 2021, Balogh, Lemons and Palmer determined the maximum size of an intersecting $k$-uniform family with shadow degree at least $r$ for $n\geq n_0(k,r)$, where $n_0(k,r)$ is doubly exponential in $k$ for $4\leq r\leq k$. In the present paper, we present a short proof of this result for $n\geq 2(r+1)^rk \frac{\binom{2k-1}{k}}{\binom{2r-1}{r}}$ and $4\leq r\leq k$.
△ Less
Submitted 4 June, 2024; v1 submitted 1 June, 2024;
originally announced June 2024.
-
Improving Generalization and Convergence by Enhancing Implicit Regularization
Authors:
Mingze Wang,
Haotian He,
**bo Wang,
Zilin Wang,
Guanhua Huang,
Feiyu Xiong,
Zhiyu Li,
Weinan E,
Lei Wu
Abstract:
In this work, we propose an Implicit Regularization Enhancement (IRE) framework to accelerate the discovery of flat solutions in deep learning, thereby improving generalization and convergence. Specifically, IRE decouples the dynamics of flat and sharp directions, which boosts the sharpness reduction along flat directions while maintaining the training stability in sharp directions. We show that I…
▽ More
In this work, we propose an Implicit Regularization Enhancement (IRE) framework to accelerate the discovery of flat solutions in deep learning, thereby improving generalization and convergence. Specifically, IRE decouples the dynamics of flat and sharp directions, which boosts the sharpness reduction along flat directions while maintaining the training stability in sharp directions. We show that IRE can be practically incorporated with {\em generic base optimizers} without introducing significant computational overload. Experiments show that IRE consistently improves the generalization performance for image classification tasks across a variety of benchmark datasets (CIFAR-10/100, ImageNet) and models (ResNets and ViTs). Surprisingly, IRE also achieves a $2\times$ {\em speed-up} compared to AdamW in the pre-training of Llama models (of sizes ranging from 60M to 229M) on datasets including Wikitext-103, Minipile, and Openwebtext. Moreover, we provide theoretical guarantees, showing that IRE can substantially accelerate the convergence towards flat minima in Sharpness-aware Minimization (SAM).
△ Less
Submitted 31 May, 2024;
originally announced May 2024.
-
Entry-Wise Eigenvector Analysis and Improved Rates for Topic Modeling on Short Documents
Authors:
Zheng Tracy Ke,
**gming Wang
Abstract:
Topic modeling is a widely utilized tool in text analysis. We investigate the optimal rate for estimating a topic model. Specifically, we consider a scenario with $n$ documents, a vocabulary of size $p$, and document lengths at the order $N$. When $N\geq c\cdot p$, referred to as the long-document case, the optimal rate is established in the literature at $\sqrt{p/(Nn)}$. However, when $N=o(p)$, r…
▽ More
Topic modeling is a widely utilized tool in text analysis. We investigate the optimal rate for estimating a topic model. Specifically, we consider a scenario with $n$ documents, a vocabulary of size $p$, and document lengths at the order $N$. When $N\geq c\cdot p$, referred to as the long-document case, the optimal rate is established in the literature at $\sqrt{p/(Nn)}$. However, when $N=o(p)$, referred to as the short-document case, the optimal rate remains unknown. In this paper, we first provide new entry-wise large-deviation bounds for the empirical singular vectors of a topic model. We then apply these bounds to improve the error rate of a spectral algorithm, Topic-SCORE. Finally, by comparing the improved error rate with the minimax lower bound, we conclude that the optimal rate is still $\sqrt{p/(Nn)}$ in the short-document case.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
Polytopes with low excess degree
Authors:
Guillermo Pineda-Villavicencio,
Jie Wang,
David Yost
Abstract:
We study the existence and structure of $d$-polytopes for which the number $f_1$ of edges is small compared to the number $f_0$ of vertices. Our results are more elegantly expressed in terms of the excess degree of the polytope, defined as $2f_1-df_0$. We show that the excess degree of a $d$-polytope cannot lie in the range $[d+3,2d-7]$, complementing the known result that values in the range…
▽ More
We study the existence and structure of $d$-polytopes for which the number $f_1$ of edges is small compared to the number $f_0$ of vertices. Our results are more elegantly expressed in terms of the excess degree of the polytope, defined as $2f_1-df_0$. We show that the excess degree of a $d$-polytope cannot lie in the range $[d+3,2d-7]$, complementing the known result that values in the range $[1,d-3]$ are impossible. In particular, many pairs $(f_0,f_1)$ are not realised by any polytope. For $d$-polytopes with excess degree $d-2$, strong structural results are known; we establish comparable results for excess degrees $d$, $d+2$, and $2d-6$. Frequently, in polytopes with low excess degree, say at most $2d-6$, the nonsimple vertices all have the same degree and they form either a face or a missing face. We show that excess degree $d+1$ is possible only for $d=3,5$, or $7$, complementing the known result that an excess degree $d-1$ is possible only for $d=3$ or $5$.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Structure-preserving finite element methods for computing dynamics of rotating Bose-Einstein condensate
Authors:
Meng Li,
Junjun Wang,
Zhen Guan,
Zhijie Du
Abstract:
This work is concerned with the construction and analysis of structure-preserving Galerkin methods for computing the dynamics of rotating Bose-Einstein condensate (BEC) based on the Gross-Pitaevskii equation with angular momentum rotation. Due to the presence of the rotation term, constructing finite element methods (FEMs) that preserve both mass and energy remains an unresolved issue, particularl…
▽ More
This work is concerned with the construction and analysis of structure-preserving Galerkin methods for computing the dynamics of rotating Bose-Einstein condensate (BEC) based on the Gross-Pitaevskii equation with angular momentum rotation. Due to the presence of the rotation term, constructing finite element methods (FEMs) that preserve both mass and energy remains an unresolved issue, particularly in the context of nonconforming FEMs. Furthermore, in comparison to existing works, we provide a comprehensive convergence analysis, offering a thorough demonstration of the methods' optimal and high-order convergence properties. Finally, extensive numerical results are presented to check the theoretical analysis of the structure-preserving numerical method for rotating BEC, and the quantized vortex lattice's behavior is scrutinized through a series of numerical tests.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
The generalized 4-connectivity of godan graphs
Authors:
**g Wang,
Jiang Wu,
Yuanqiu Huang,
Zhangdong Ouyang
Abstract:
The generalized $k$-connectivity of a graph $G$, denoted by $κ_k(G)$, is the minimum number of internally edge disjoint $S$-trees for any $S\subseteq V(G)$ and $|S|=k$. The generalized $k$-connectivity is a natural extension of the classical connectivity and plays a key role in applications related to the modern interconnection networks. The godan graph $EA_n$ is a kind of Cayley graphs which poss…
▽ More
The generalized $k$-connectivity of a graph $G$, denoted by $κ_k(G)$, is the minimum number of internally edge disjoint $S$-trees for any $S\subseteq V(G)$ and $|S|=k$. The generalized $k$-connectivity is a natural extension of the classical connectivity and plays a key role in applications related to the modern interconnection networks. The godan graph $EA_n$ is a kind of Cayley graphs which posses many desirable properties. In this paper, we shall study the generalized 4-connectivity of $EA_n$ and show that $κ_4(EA_n)=n-1$ for $n\ge 3$.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
Truncated Variance Reduced Value Iteration
Authors:
Yujia **,
Ishani Karmarkar,
Aaron Sidford,
Jiayi Wang
Abstract:
We provide faster randomized algorithms for computing an $ε$-optimal policy in a discounted Markov decision process with $A_{\text{tot}}$-state-action pairs, bounded rewards, and discount factor $γ$. We provide an $\tilde{O}(A_{\text{tot}}[(1 - γ)^{-3}ε^{-2} + (1 - γ)^{-2}])$-time algorithm in the sampling setting, where the probability transition matrix is unknown but accessible through a generat…
▽ More
We provide faster randomized algorithms for computing an $ε$-optimal policy in a discounted Markov decision process with $A_{\text{tot}}$-state-action pairs, bounded rewards, and discount factor $γ$. We provide an $\tilde{O}(A_{\text{tot}}[(1 - γ)^{-3}ε^{-2} + (1 - γ)^{-2}])$-time algorithm in the sampling setting, where the probability transition matrix is unknown but accessible through a generative model which can be queried in $\tilde{O}(1)$-time, and an $\tilde{O}(s + (1-γ)^{-2})$-time algorithm in the offline setting where the probability transition matrix is known and $s$-sparse. These results improve upon the prior state-of-the-art which either ran in $\tilde{O}(A_{\text{tot}}[(1 - γ)^{-3}ε^{-2} + (1 - γ)^{-3}])$ time [Sidford, Wang, Wu, Ye 2018] in the sampling setting, $\tilde{O}(s + A_{\text{tot}} (1-γ)^{-3})$ time [Sidford, Wang, Wu, Yang, Ye 2018] in the offline setting, or time at least quadratic in the number of states using interior point methods for linear programming. We achieve our results by building upon prior stochastic variance-reduced value iteration methods [Sidford, Wang, Wu, Yang, Ye 2018]. We provide a variant that carefully truncates the progress of its iterates to improve the variance of new variance-reduced sampling procedures that we introduce to implement the steps. Our method is essentially model-free and can be implemented in $\tilde{O}(A_{\text{tot}})$-space when given generative model access. Consequently, our results take a step in closing the sample-complexity gap between model-free and model-based methods.
△ Less
Submitted 21 May, 2024;
originally announced May 2024.
-
Spatial asymptotic behaviors of fractional stochastic heat equations driven by additive Lévy white noise
Authors:
Yuichi Shiozawa,
Jian Wang
Abstract:
We establish explicit integral tests for spatial asymptotic behaviors of fractional stochastic heat equations driven by additive Lévy white noise. Our results indicate that fractional stochastic heat equations enjoy the so-called additive physical intermittent property in all dimensions when the driven Lévy white noise is sufficiently light-tailed. The proofs are based on heat kernel estimates for…
▽ More
We establish explicit integral tests for spatial asymptotic behaviors of fractional stochastic heat equations driven by additive Lévy white noise. Our results indicate that fractional stochastic heat equations enjoy the so-called additive physical intermittent property in all dimensions when the driven Lévy white noise is sufficiently light-tailed. The proofs are based on heat kernel estimates for the fractional Laplacian and exact tail behaviors for Poissonian functionals associated with the driven Lévy white noise.
△ Less
Submitted 18 May, 2024;
originally announced May 2024.
-
Equivariant Twisted Bismut Laplacian with Torsion and KKW type theorems
Authors:
Jian Wang,
Yong Wang
Abstract:
This paper aims to provide an explicit computation of the noncommutative residue density associated with equivariant twisted Bismut Laplacian with torsion on compact manifolds with (or without) boundary. We prove the equivariant twisted Kastler-Kalau-Walze type theorems with torsion on compact manifolds with boundary.
This paper aims to provide an explicit computation of the noncommutative residue density associated with equivariant twisted Bismut Laplacian with torsion on compact manifolds with (or without) boundary. We prove the equivariant twisted Kastler-Kalau-Walze type theorems with torsion on compact manifolds with boundary.
△ Less
Submitted 18 May, 2024;
originally announced May 2024.
-
A robust solver for H(curl) convection-diffusion and its local Fourier analysis
Authors:
**dong Wang,
Shuonan Wu
Abstract:
In this paper, we present a robust and efficient multigrid solver based on an exponential-fitting discretization for 2D H(curl) convection-diffusion problems. By leveraging an exponential identity, we characterize the kernel of H(curl) convection-diffusion problems and design a suitable hybrid smoother. This smoother incorporates a lexicographic Gauss-Seidel smoother within a downwind type and smo…
▽ More
In this paper, we present a robust and efficient multigrid solver based on an exponential-fitting discretization for 2D H(curl) convection-diffusion problems. By leveraging an exponential identity, we characterize the kernel of H(curl) convection-diffusion problems and design a suitable hybrid smoother. This smoother incorporates a lexicographic Gauss-Seidel smoother within a downwind type and smoothing over an auxiliary problem, corresponding to H(grad) convection-diffusion problems for kernel correction. We analyze the convergence properties of the smoothers and the two-level method using local Fourier analysis (LFA). The performance of the algorithms demonstrates robustness in both convection-dominated and diffusion-dominated cases.
△ Less
Submitted 18 May, 2024;
originally announced May 2024.
-
A Nonabelian Hodge Correspondence for Principal Bundles in Positive Characteristic
Authors:
Mao Sheng,
Hao Sun,
Jian** Wang
Abstract:
In this paper, we prove a nonabelian Hodge correspondence for principal bundles on a smooth variety $X$ in positive characteristic, which generalizes the Ogus-Vologodsky correspondence for vector bundles. Then we extend the correspondence to logahoric torsors over a log pair $(X,D)$, where $D$ a reduced normal crossing divisor in $X$. As an intermediate step, we prove a correspondence between prin…
▽ More
In this paper, we prove a nonabelian Hodge correspondence for principal bundles on a smooth variety $X$ in positive characteristic, which generalizes the Ogus-Vologodsky correspondence for vector bundles. Then we extend the correspondence to logahoric torsors over a log pair $(X,D)$, where $D$ a reduced normal crossing divisor in $X$. As an intermediate step, we prove a correspondence between principal bundles on root stacks $\mathscr{X}$ and parahoric torsors on $(X,D)$, which generalizes the correspondence on curves given by Balaji--Seshadri to higher dimensional case.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Exploiting Sign Symmetries in Minimizing Sums of Rational Functions
Authors:
Feng Guo,
Jie Wang,
Jianhao Zheng
Abstract:
This paper is devoted to the problem of minimizing a sum of rational functions over a basic semialgebraic set. We provide a hierarchy of sum of squares (SOS) relaxations that is dual to the generalized moment problem approach due to Bugarin, Henrion, and Lasserre. The investigation of the dual SOS aspect offers two benefits: 1) it allows us to conduct a convergence rate analysis for the hierarchy;…
▽ More
This paper is devoted to the problem of minimizing a sum of rational functions over a basic semialgebraic set. We provide a hierarchy of sum of squares (SOS) relaxations that is dual to the generalized moment problem approach due to Bugarin, Henrion, and Lasserre. The investigation of the dual SOS aspect offers two benefits: 1) it allows us to conduct a convergence rate analysis for the hierarchy; 2) it leads to a sign symmetry adapted hierarchy consisting of block-diagonal semidefinite relaxations. When the problem possesses correlative sparsity as well as sign symmetries, we propose sparse semidefinite relaxations by exploiting both structures. Various numerical experiments are performed to demonstrate the efficiency of our approach. Finally, an application to maximizing sums of generalized Rayleigh quotients is presented.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
The positive fundamental group of ${\rm Sp}(2n)$
Authors:
Jian Wang,
Qinglong Zhou
Abstract:
In this paper, we examine the homotopy classes of positive loops in ${\rm Sp}(2n)$. We demonstrate that two positive loops are homotopic if and only if they are homotopic through positive loops. As consequences, we can extend several results of McDuff \cite{McD} and Chance \cite{Cha} to higher dimensional symplectic manifolds without dimensional restrictions.
In this paper, we examine the homotopy classes of positive loops in ${\rm Sp}(2n)$. We demonstrate that two positive loops are homotopic if and only if they are homotopic through positive loops. As consequences, we can extend several results of McDuff \cite{McD} and Chance \cite{Cha} to higher dimensional symplectic manifolds without dimensional restrictions.
△ Less
Submitted 2 July, 2024; v1 submitted 12 May, 2024;
originally announced May 2024.
-
The existence for the classical solution of the Navier-Stokes equations
Authors:
Jianfeng Wang
Abstract:
In this paper we will discuss the existence for the classical solution of the Navier-Stokes equations. First, we transform it into generalized integral equations. Next, we discuss the existence of the classical solution by Leray-Schauder degree and Sobolev space\ $H^{-m_{1}}(Ω_{1})$.
In this paper we will discuss the existence for the classical solution of the Navier-Stokes equations. First, we transform it into generalized integral equations. Next, we discuss the existence of the classical solution by Leray-Schauder degree and Sobolev space\ $H^{-m_{1}}(Ω_{1})$.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Design optimization in unilateral contact using pressure constraints and Bayesian optimization
Authors:
**gyi Wang,
Jerome Solberg,
Mike A. Puso,
Eric B. Chin,
Cosmin G. Petra
Abstract:
Design optimization problems, e.g., shape optimization, that involve deformable bodies in unilateral contact are challenging as they require robust contact solvers, complex optimization methods that are typically gradient-based, and sensitivity derivations. Notably, the problems are nonsmooth, adding significant difficulty to the optimization process. We study design optimization problems in frict…
▽ More
Design optimization problems, e.g., shape optimization, that involve deformable bodies in unilateral contact are challenging as they require robust contact solvers, complex optimization methods that are typically gradient-based, and sensitivity derivations. Notably, the problems are nonsmooth, adding significant difficulty to the optimization process. We study design optimization problems in frictionless unilateral contact subject to pressure constraints, using both gradient-based and gradient-free optimization methods, namely Bayesian optimization. The contact simulation problem is solved via the mortar contact and finite element methods. For the gradient-based method, we use the direct differentiation method to compute the sensitivities of the cost and constraint function with respect to the design variables. Then, we use Ipopt to solve the optimization problems. For the gradient-free approach, we use a constrained Bayesian optimization algorithm based on the standard Gaussian Process surrogate model. We present numerical examples that control the contact pressure, inspired by real-life engineering applications, to demonstrate the effectiveness, strengths and shortcomings of both methods. Our results suggest that both optimization methods perform reasonably well for these nonsmooth problems.
△ Less
Submitted 5 May, 2024;
originally announced May 2024.
-
Moment-SOS relaxations for moment and tensor recovery problems
Authors:
Lei Huang,
Jiawang Nie,
Jiajia Wang
Abstract:
This paper studies moment and tensor recovery problems whose decomposing vectors are contained in some given semialgebraic sets. We propose Moment-SOS relaxations with generic objectives for recovering moments and tensors, whose decomposition lengths are expected to be low. This kind of problems have broad applications in various tensor decomposition questions. Numerical experiments are provided t…
▽ More
This paper studies moment and tensor recovery problems whose decomposing vectors are contained in some given semialgebraic sets. We propose Moment-SOS relaxations with generic objectives for recovering moments and tensors, whose decomposition lengths are expected to be low. This kind of problems have broad applications in various tensor decomposition questions. Numerical experiments are provided to demonstrate the efficiency of this approach.
△ Less
Submitted 28 April, 2024;
originally announced April 2024.
-
Invariant sample measures and sample statistical solutions for nonautonomous stochastic lattice Cahn-Hilliard equation with nonlinear noise
Authors:
**tao Wang,
Dongdong Zhu,
Chunqiu Li
Abstract:
We consider a stochastic lattice Cahn-Hilliard equation with nonautonomous nonlinear noise. First, we prove the existence of pullback random attractors in $\ell^2$ for the generated nonautonomous random dynamical system. Then, we construct the time-dependent invariant sample Borel probability measures based on the pullback random attractor. Moreover, we develop a general stochastic Liouville type…
▽ More
We consider a stochastic lattice Cahn-Hilliard equation with nonautonomous nonlinear noise. First, we prove the existence of pullback random attractors in $\ell^2$ for the generated nonautonomous random dynamical system. Then, we construct the time-dependent invariant sample Borel probability measures based on the pullback random attractor. Moreover, we develop a general stochastic Liouville type equation for nonautonomous random dynamical systems and show that the invariant sample measures obtained satisfy the stochastic Liouville type equation. At last, we define a new kind of statistical solution -- sample statistical solution corresponding to the invariant sample measures and show that each family of invariant sample measures is a sample statistical solution.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
Error Estimation in the Mean-Field Limit of Kinetic Flocking Models with Local Alignments
Authors:
**huan Wang,
Keyu Li,
Hui Huang
Abstract:
In this paper, we present an innovative particle system characterized by moderate interactions, designed to accurately approximate kinetic flocking models that incorporate singular interaction forces and local alignment mechanisms. We establish the existence of weak solutions to the corresponding flocking equations and provide an error estimate for the mean-field limit. This is achieved through th…
▽ More
In this paper, we present an innovative particle system characterized by moderate interactions, designed to accurately approximate kinetic flocking models that incorporate singular interaction forces and local alignment mechanisms. We establish the existence of weak solutions to the corresponding flocking equations and provide an error estimate for the mean-field limit. This is achieved through the regularization of singular forces and a nonlocal approximation strategy for local alignments. We show that, by selecting the regularization and localization parameters logarithmically with respect to the number of particles, the particle system effectively approximates the mean-field equation.
△ Less
Submitted 21 April, 2024;
originally announced April 2024.
-
Extremal triangle-free graphs with chromatic number at least four
Authors:
Sijie Ren,
Jian Wang,
Shipeng Wang,
Weihua Yang
Abstract:
Let $G$ be an $n$-vertex triangle-free graph. The celebrated Mantel's theorem showed that $e(G)\leq \lfloor\frac{n^2}{4}\rfloor$. In 1962, Erdős (together with Gallai), and independently Andrásfai, proved that if $G$ is non-bipartite then $e(G)\leq \lfloor\frac{(n-1)^2}{4}\rfloor+1$. In this paper, we extend this result and show that if $G$ has chromatic number at least four and $n\geq 150$, then…
▽ More
Let $G$ be an $n$-vertex triangle-free graph. The celebrated Mantel's theorem showed that $e(G)\leq \lfloor\frac{n^2}{4}\rfloor$. In 1962, Erdős (together with Gallai), and independently Andrásfai, proved that if $G$ is non-bipartite then $e(G)\leq \lfloor\frac{(n-1)^2}{4}\rfloor+1$. In this paper, we extend this result and show that if $G$ has chromatic number at least four and $n\geq 150$, then $e(G)\leq \lfloor\frac{(n-3)^2}{4}\rfloor+5$. The blow-up of Grötzsch graph shows that this bound is best possible.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
Strengthening Lasserre's Hierarchy in Real and Complex Polynomial Optimization
Authors:
Jie Wang
Abstract:
This paper studies shift operators which arises from extractions of solutions for Lasserre's hierarchy. First, we establish a connection between multiplication operators and shift operators. More importantly, we derive new positive semidefinite conditions of rank-one moment sequences via shift operators, and utilize these conditions to strengthen Lasserre's hierarchy for real and complex polynomia…
▽ More
This paper studies shift operators which arises from extractions of solutions for Lasserre's hierarchy. First, we establish a connection between multiplication operators and shift operators. More importantly, we derive new positive semidefinite conditions of rank-one moment sequences via shift operators, and utilize these conditions to strengthen Lasserre's hierarchy for real and complex polynomial optimization. Furthermore, we integrate the strengthening technique with correlative sparsity and sign symmetries present in polynomial optimization problems. Extensive numerical experiments show that our strengthening technique can significantly improve the bound (especially for complex polynomial optimization) and allows to achieve global optimality at lower relaxation orders, thus providing substantial computational savings and considerable speedup.
△ Less
Submitted 28 May, 2024; v1 submitted 10 April, 2024;
originally announced April 2024.
-
Association schemes arising from non-weakly regular bent functions
Authors:
Yadi Wei,
Jiaxin Wang,
Fang-Wei Fu
Abstract:
Association schemes play an important role in algebraic combinatorics and have important applications in coding theory, graph theory and design theory. The methods to construct association schemes by using bent functions have been extensively studied. Recently, in [13], {Ö}zbudak and Pelen constructed infinite families of symmetric association schemes of classes $5$ and $6$ by using ternary non-we…
▽ More
Association schemes play an important role in algebraic combinatorics and have important applications in coding theory, graph theory and design theory. The methods to construct association schemes by using bent functions have been extensively studied. Recently, in [13], {Ö}zbudak and Pelen constructed infinite families of symmetric association schemes of classes $5$ and $6$ by using ternary non-weakly regular bent functions.They also stated that constructing $2p$-class association schemes from $p$-ary non-weakly regular bent functions is an interesting problem, where $p>3$ is an odd prime. In this paper, using non-weakly regular bent functions, we construct infinite families of symmetric association schemes of classes $2p$, $(2p+1)$ and $\frac{3p+1}{2}$ for any odd prime $p$. Fusing those association schemes, we also obtain $t$-class symmetric association schemes, where $t=4,5,6,7$. In addition, we give the sufficient and necessary conditions for the partitions $P$, $D$, $T$, $U$ and $V$ (defined in this paper) to induce symmetric association schemes.
△ Less
Submitted 13 June, 2024; v1 submitted 8 April, 2024;
originally announced April 2024.
-
On open manifolds admitting no complete metric with positive scalar curvature
Authors:
Yuguang Shi,
Jian Wang,
Runzhang Wu,
**tian Zhu
Abstract:
In this paper, we investigate the topological obstruction problem for positive scalar curvature and uniformly positive scalar curvature on open manifolds. We present a definition for open Schoen-Yau-Schick manifolds and prove that there is no complete metric with positive scalar curvature on these manifolds. Similarly, we define weak Schoen-Yau-Shick manifolds by analogy, which are expected to adm…
▽ More
In this paper, we investigate the topological obstruction problem for positive scalar curvature and uniformly positive scalar curvature on open manifolds. We present a definition for open Schoen-Yau-Schick manifolds and prove that there is no complete metric with positive scalar curvature on these manifolds. Similarly, we define weak Schoen-Yau-Shick manifolds by analogy, which are expected to admit no complete metrics with uniformly positive scalar curvature.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
Stochastic Approximation Proximal Subgradient Method for Stochastic Convex-Concave Minimax Optimization
Authors:
Yu-Hong Dai,
Jiani Wang,
Liwei Zhang
Abstract:
This paper presents a stochastic approximation proximal subgradient (SAPS) method for stochastic convex-concave minimax optimization. By accessing unbiased and variance bounded approximate subgradients, we show that this algorithm exhibits ${\rm O}(N^{-1/2})$ expected convergence rate of the minimax optimality measure if the parameters in the algorithm are properly chosen, where $N$ denotes the nu…
▽ More
This paper presents a stochastic approximation proximal subgradient (SAPS) method for stochastic convex-concave minimax optimization. By accessing unbiased and variance bounded approximate subgradients, we show that this algorithm exhibits ${\rm O}(N^{-1/2})$ expected convergence rate of the minimax optimality measure if the parameters in the algorithm are properly chosen, where $N$ denotes the number of iterations. Moreover, we show that the algorithm has ${\rm O}(\log(N)N^{-1/2})$ minimax optimality measure bound with high probability. Further we study a specific stochastic convex-concave minimax optimization problems arising from stochastic convex conic optimization problems, which the the bounded subgradient condition is fail. To overcome the lack of the bounded subgradient conditions in convex-concave minimax problems, we propose a linearized stochastic approximation augmented Lagrange (LSAAL) method and prove that this algorithm exhibits ${\rm O}(N^{-1/2})$ expected convergence rate for the minimax optimality measure and ${\rm O}(\log^2(N)N^{-1/2})$ minimax optimality measure bound with high probability as well. Preliminary numerical results demonstrate the effect of the SAPS and LSAAL methods.
△ Less
Submitted 29 March, 2024;
originally announced March 2024.
-
Non-Convex Robust Hypothesis Testing using Sinkhorn Uncertainty Sets
Authors:
Jie Wang,
Rui Gao,
Yao Xie
Abstract:
We present a new framework to address the non-convex robust hypothesis testing problem, wherein the goal is to seek the optimal detector that minimizes the maximum of worst-case type-I and type-II risk functions. The distributional uncertainty sets are constructed to center around the empirical distribution derived from samples based on Sinkhorn discrepancy. Given that the objective involves non-c…
▽ More
We present a new framework to address the non-convex robust hypothesis testing problem, wherein the goal is to seek the optimal detector that minimizes the maximum of worst-case type-I and type-II risk functions. The distributional uncertainty sets are constructed to center around the empirical distribution derived from samples based on Sinkhorn discrepancy. Given that the objective involves non-convex, non-smooth probabilistic functions that are often intractable to optimize, existing methods resort to approximations rather than exact solutions. To tackle the challenge, we introduce an exact mixed-integer exponential conic reformulation of the problem, which can be solved into a global optimum with a moderate amount of input data. Subsequently, we propose a convex approximation, demonstrating its superiority over current state-of-the-art methodologies in literature. Furthermore, we establish connections between robust hypothesis testing and regularized formulations of non-robust risk functions, offering insightful interpretations. Our numerical study highlights the satisfactory testing performance and computational efficiency of the proposed framework.
△ Less
Submitted 21 March, 2024;
originally announced March 2024.
-
Constrained Bayesian optimization with merit functions
Authors:
J. Wang,
C. G. Petra,
J. L. Peterson
Abstract:
Bayesian optimization is a powerful optimization tool for problems where native first-order derivatives are unavailable. Recently, constrained Bayesian optimization (CBO) has been applied to many engineering applications where constraints are essential. However, several obstacles remain with current CBO algorithms that could prevent a wider adoption. We propose CBO algorithms using merit functions…
▽ More
Bayesian optimization is a powerful optimization tool for problems where native first-order derivatives are unavailable. Recently, constrained Bayesian optimization (CBO) has been applied to many engineering applications where constraints are essential. However, several obstacles remain with current CBO algorithms that could prevent a wider adoption. We propose CBO algorithms using merit functions, such as the penalty merit function, in acquisition functions, inspired by nonlinear optimization methods, e.g., sequential quadratic programming. Merit functions measure the potential progress of both the objective and constraint functions, thus increasing algorithmic efficiency and allowing infeasible initial samples. The acquisition functions with merit functions are relaxed to have closed forms, making its implementation readily available wherever Bayesian optimization is. We further propose a unified CBO algorithm that can be seen as extension to the popular expected constrained improvement (ECI) approach. We demonstrate the effectiveness and efficiency of the proposed algorithms through numerical experiments on synthetic problems and a practical data-driven engineering design problem in the field of plasma physics.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
Research on Personal Credit Risk Assessment Methods Based on Causal Inference
Authors:
Jiaxin Wang,
YiLong Ma
Abstract:
The discussion on causality in human history dates back to ancient Greece, yet to this day, there is still no consensus. Fundamentally, this stems from the nature of human cognition, as understanding causality requires abstract tools to transcend the limitations of human cognition. In recent decades, the rapid development of mathematical and computational tools has provided new theoretical and tec…
▽ More
The discussion on causality in human history dates back to ancient Greece, yet to this day, there is still no consensus. Fundamentally, this stems from the nature of human cognition, as understanding causality requires abstract tools to transcend the limitations of human cognition. In recent decades, the rapid development of mathematical and computational tools has provided new theoretical and technical means for exploring causality, creating more avenues for investigation.
Based on this, this paper introduces a new definition of causality using category theory, proposed by Samuel Eilenberg and Saunders Mac Lane in 1945 to avoid the self-referential contradictions in set theory, notably the Russell paradox. Within this framework, the feasibility of indicator synthesis in causal inference is demonstrated. Due to the limitations in the development of category theory-related technical tools, this paper adopts the widely-used probabilistic causal graph tool proposed by Judea Pearl in 1995 to study the application of causal inference in personal credit risk management. The specific work includes: research on the construction method of causal inference index system, definition of causality and feasibility proof of indicator synthesis causal inference within this framework, application methods of causal graph model and intervention alternative criteria in personal credit risk management, and so on.
△ Less
Submitted 17 March, 2024;
originally announced March 2024.
-
Improved Algorithm and Bounds for Successive Projection
Authors:
Jiashun **,
Zheng Tracy Ke,
Gabriel Moryoussef,
Jiajun Tang,
**gming Wang
Abstract:
Given a $K$-vertex simplex in a $d$-dimensional space, suppose we measure $n$ points on the simplex with noise (hence, some of the observed points fall outside the simplex). Vertex hunting is the problem of estimating the $K$ vertices of the simplex. A popular vertex hunting algorithm is successive projection algorithm (SPA). However, SPA is observed to perform unsatisfactorily under strong noise…
▽ More
Given a $K$-vertex simplex in a $d$-dimensional space, suppose we measure $n$ points on the simplex with noise (hence, some of the observed points fall outside the simplex). Vertex hunting is the problem of estimating the $K$ vertices of the simplex. A popular vertex hunting algorithm is successive projection algorithm (SPA). However, SPA is observed to perform unsatisfactorily under strong noise or outliers. We propose pseudo-point SPA (pp-SPA). It uses a projection step and a denoise step to generate pseudo-points and feed them into SPA for vertex hunting. We derive error bounds for pp-SPA, leveraging on extreme value theory of (possibly) high-dimensional random vectors. The results suggest that pp-SPA has faster rates and better numerical performances than SPA. Our analysis includes an improved non-asymptotic bound for the original SPA, which is of independent interest.
△ Less
Submitted 16 March, 2024;
originally announced March 2024.
-
Dirichlet heat kernel estimates of subordinate diffusion processes with diffusive components in $C^{1, α}$ open sets
Authors:
Jie-Ming Wang
Abstract:
In this paper, we derive explicit sharp two-sided estimates of the Dirichlet heat kernels for a class of symmetric subordinate diffusion processes with diffusive components in $C^{1, α}(α\in (0, 1])$ open sets in $\mathbb R^d$ when the scaling order of the Laplace exponent of purely discontinuous part of the subordinator is between $0$ and $1$ including $1.$ The main result of this paper shows the…
▽ More
In this paper, we derive explicit sharp two-sided estimates of the Dirichlet heat kernels for a class of symmetric subordinate diffusion processes with diffusive components in $C^{1, α}(α\in (0, 1])$ open sets in $\mathbb R^d$ when the scaling order of the Laplace exponent of purely discontinuous part of the subordinator is between $0$ and $1$ including $1.$ The main result of this paper shows the stability of Dirichlet heat kernel estimates for such processes in $C^{1, α}$ open sets in the sense that the estimates depend on the divergence elliptic operator only via its uniform ellipticity constant and the Dini continuity modulus of the diffusion coefficients. As a corollary, we obtain the sharp two-sided estimates for Green functions of those processes in bounded $C^{1, α}$ open sets.
△ Less
Submitted 29 April, 2024; v1 submitted 11 March, 2024;
originally announced March 2024.
-
A mechanism-driven reinforcement learning framework for shape optimization of airfoils
Authors:
**gfeng Wang,
Guanghui Hu
Abstract:
In this paper, a novel mechanism-driven reinforcement learning framework is proposed for airfoil shape optimization. To validate the framework, a reward function is designed and analyzed, from which the equivalence between the maximizing the cumulative reward and achieving the optimization objectives is guaranteed theoretically. To establish a quality exploration, and to obtain an accurate reward…
▽ More
In this paper, a novel mechanism-driven reinforcement learning framework is proposed for airfoil shape optimization. To validate the framework, a reward function is designed and analyzed, from which the equivalence between the maximizing the cumulative reward and achieving the optimization objectives is guaranteed theoretically. To establish a quality exploration, and to obtain an accurate reward from the environment, an efficient solver for steady Euler equations is employed in the reinforcement learning method. The solver utilizes the Bézier curve to describe the shape of the airfoil, and a Newton-geometric multigrid method for the solution. In particular, a dual-weighted residual-based h-adaptive method is used for efficient calculation of target functional. To effectively streamline the airfoil shape during the deformation process, we introduce the Laplacian smoothing, and propose a Bézier fitting strategy, which not only remits mesh tangling but also guarantees a precise manipulation of the geometry. In addition, a neural network architecture is designed based on an attention mechanism to make the learning process more sensitive to the minor change of the airfoil geometry. Numerical experiments demonstrate that our framework can handle the optimization problem with hundreds of design variables. It is worth mentioning that, prior to this work, there are limited works combining such high-fidelity partial differential equatons framework with advanced reinforcement learning algorithms for design problems with such high dimensionality.
△ Less
Submitted 26 May, 2024; v1 submitted 7 March, 2024;
originally announced March 2024.
-
Uniform large deviations and metastability of random dynamical systems
Authors:
Jifa Jiang,
Jian Wang,
Jianliang Zhai,
Tusheng Zhang
Abstract:
In this paper, we first provide a criterion on uniform large deviation principles (ULDP) of stochastic differential equations under Lyapunov conditions on the coefficients, which can be applied to stochastic systems with coefficients of polynomial growth and possible degenerate driving noises. In the second part, using the ULDP criterion we preclude the concentration of limiting measures of invari…
▽ More
In this paper, we first provide a criterion on uniform large deviation principles (ULDP) of stochastic differential equations under Lyapunov conditions on the coefficients, which can be applied to stochastic systems with coefficients of polynomial growth and possible degenerate driving noises. In the second part, using the ULDP criterion we preclude the concentration of limiting measures of invariant measures of stochastic dynamical systems on repellers and acyclic saddle chains and extend Freidlin and Wentzell's asymptotics theorem to stochastic systems with unbounded coefficients. Of particular interest, we determine the limiting measures of the invariant measures of the famous stochastic van der Pol equation and van der Pol Duffing equation whose noises are naturally degenerate. We also construct two examples to match the global phase portraits of Freidlin and Wentzell's unperturbed systems and to explicitly compute their transition difficulty matrices. Other applications include stochastic May-Leonard system and random systems with infinitely many equivalent classes.
△ Less
Submitted 26 February, 2024;
originally announced February 2024.
-
High-order accurate positivity-preserving and well-balanced discontinuous Galerkin schemes for ten-moment Gaussian closure equations with source terms
Authors:
Jiangfu Wang,
Huazhong Tang,
Kailiang Wu
Abstract:
This paper proposes novel high-order accurate discontinuous Galerkin (DG) schemes for the one- and two-dimensional ten-moment Gaussian closure equations with source terms defined by a known potential function. Our DG schemes exhibit the desirable capability of being well-balanced (WB) for a known hydrostatic equilibrium state while simultaneously preserving positive density and positive-definite a…
▽ More
This paper proposes novel high-order accurate discontinuous Galerkin (DG) schemes for the one- and two-dimensional ten-moment Gaussian closure equations with source terms defined by a known potential function. Our DG schemes exhibit the desirable capability of being well-balanced (WB) for a known hydrostatic equilibrium state while simultaneously preserving positive density and positive-definite anisotropic pressure tensor. The well-balancedness is built on carefully modifying the solution states in the Harten-Lax-van Leer-contact (HLLC) flux, and appropriate reformulation and discretization of the source terms. Our novel modification technique overcomes the difficulties posed by the anisotropic effects, maintains the high-order accuracy, and ensures that the modified solution state remains within the physically admissible state set. Positivity-preserving analyses of our WB DG schemes are conducted by using several key properties of the admissible state set, the HLLC flux and the HLLC solver, as well as the geometric quasilinearization (GQL) approach in [Wu & Shu, SIAM Review, 65: 1031-1073, 2023], which was originally applied to analyze the admissible state set and physical-constraints-preserving schemes for the relativistic magnetohydrodynamics in [Wu & Tang, M3AS, 27: 1871-1928, 2017], to address the difficulties arising from the nonlinear constraints on pressure tensor. Moreover, the proposed WB DG schemes satisfy the weak positivity for the cell averages, implying the use of a scaling limiter to enforce the physical admissibility of the DG solution polynomials at certain points of interest. Extensive numerical experiments are conducted to validate the preservation of equilibrium states, accuracy in capturing small perturbations to such states, robustness in solving problems involving low density or low pressure, and high resolution for both smooth and discontinuous solutions.
△ Less
Submitted 23 February, 2024;
originally announced February 2024.
-
Hamiltonian Descent and Coordinate Hamiltonian Descent
Authors:
Jun-Kun Wang
Abstract:
We propose an optimization algorithm called Hamiltonian Descent, which is a direct counterpart of classical Hamiltonian Monte Carlo in sampling. We find that Hamiltonian Descent for solving strongly convex quadratic problems exhibits a novel update scheme that involves matrix-power-vector products. We also propose Coordinate Hamiltonian Descent and its parallelizable variant, which turns out to en…
▽ More
We propose an optimization algorithm called Hamiltonian Descent, which is a direct counterpart of classical Hamiltonian Monte Carlo in sampling. We find that Hamiltonian Descent for solving strongly convex quadratic problems exhibits a novel update scheme that involves matrix-power-vector products. We also propose Coordinate Hamiltonian Descent and its parallelizable variant, which turns out to encapsulate the classical Gauss-Seidel method, Successive Over-relaxation, Jacobi method, and more, for solving a linear system of equations. The result not only offers a new perspective on these existing algorithms but also leads to a broader class of update schemes that guarantee the convergence.
△ Less
Submitted 29 May, 2024; v1 submitted 21 February, 2024;
originally announced February 2024.
-
Scalar curvature rigidity of the four-dimensional sphere
Authors:
Simone Cecchini,
**min Wang,
Zhizhang Xie,
Bo Zhu
Abstract:
Let $(M,g)$ be a closed connected oriented (possibly non-spin) smooth four-dimensional manifold with scalar curvature bounded below by $n(n-1)$. In this paper, we prove that if $f$ is a smooth map of non-zero degree from $(M, g)$ to the unit four-sphere, then $f$ is an isometry. Following ideas of Gromov, we use $μ$-bubbles and a version with coefficients of the rigidity of the three-sphere to rul…
▽ More
Let $(M,g)$ be a closed connected oriented (possibly non-spin) smooth four-dimensional manifold with scalar curvature bounded below by $n(n-1)$. In this paper, we prove that if $f$ is a smooth map of non-zero degree from $(M, g)$ to the unit four-sphere, then $f$ is an isometry. Following ideas of Gromov, we use $μ$-bubbles and a version with coefficients of the rigidity of the three-sphere to rule out the case of strict inequality. Our proof of rigidity is based on the harmonic map heat flow coupled with the Ricci flow.
△ Less
Submitted 20 March, 2024; v1 submitted 19 February, 2024;
originally announced February 2024.