-
A Customized Augmented Lagrangian Method for Block-Structured Integer Programming
Authors:
Rui Wang,
Chuwen Zhang,
Shanwen Pu,
Jianjun Gao,
Zaiwen Wen
Abstract:
Integer programming with block structures has received considerable attention recently and is widely used in many practical applications such as train timetabling and vehicle routing problems. It is known to be NP-hard due to the presence of integer variables. We define a novel augmented Lagrangian function by directly penalizing the inequality constraints and establish the strong duality between…
▽ More
Integer programming with block structures has received considerable attention recently and is widely used in many practical applications such as train timetabling and vehicle routing problems. It is known to be NP-hard due to the presence of integer variables. We define a novel augmented Lagrangian function by directly penalizing the inequality constraints and establish the strong duality between the primal problem and the augmented Lagrangian dual problem. Then, a customized augmented Lagrangian method is proposed to address the block-structures. In particular, the minimization of the augmented Lagrangian function is decomposed into multiple subproblems by decoupling the linking constraints and these subproblems can be efficiently solved using the block coordinate descent method. We also establish the convergence property of the proposed method. To make the algorithm more practical, we further introduce several refinement techniques to identify high-quality feasible solutions. Numerical experiments on a few interesting scenarios show that our proposed algorithm often achieves a satisfactory solution and is quite effective.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Existence and uniqueness of weak solutions to a parabolic nonlocal 1-Laplacian equation
Authors:
Dingding Li,
Chao Zhang
Abstract:
We consider a class of parabolic nonlocal $1$-Laplacian equation \begin{align*} u_t+(-Δ)^s_1u=f \quad \text{ in }Ω\times(0,T]. \end{align*} By employing the Rothe time-discretization method, we establish the existence and uniqueness of weak solutions to the equation above. In particular, different from the previous results on the local case, we infer that the weak solution maintains $\frac{1}{2}$-…
▽ More
We consider a class of parabolic nonlocal $1$-Laplacian equation \begin{align*} u_t+(-Δ)^s_1u=f \quad \text{ in }Ω\times(0,T]. \end{align*} By employing the Rothe time-discretization method, we establish the existence and uniqueness of weak solutions to the equation above. In particular, different from the previous results on the local case, we infer that the weak solution maintains $\frac{1}{2}$-Hölder continuity in time.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
A polynomial time algorithm for Sylvester waves when entries are bounded
Authors:
Guoce Xin,
Chen Zhang
Abstract:
The Sylvester's denumerant \( d(t; \boldsymbol{a}) \) is a quantity that counts the number of nonnegative integer solutions to the equation \( \sum_{i=1}^{N} a_i x_i = t \), where \( \boldsymbol{a} = (a_1, \dots, a_N) \) is a sequence of distinct positive integers with \( \gcd(\boldsymbol{a}) = 1 \). We present a polynomial time algorithm in $N$ for computing \( d(t; \boldsymbol{a}) \) when \( \bo…
▽ More
The Sylvester's denumerant \( d(t; \boldsymbol{a}) \) is a quantity that counts the number of nonnegative integer solutions to the equation \( \sum_{i=1}^{N} a_i x_i = t \), where \( \boldsymbol{a} = (a_1, \dots, a_N) \) is a sequence of distinct positive integers with \( \gcd(\boldsymbol{a}) = 1 \). We present a polynomial time algorithm in $N$ for computing \( d(t; \boldsymbol{a}) \) when \( \boldsymbol{a} \) is bounded and \( t \) is a parameter. The proposed algorithm is rooted in the use of cyclotomic polynomials and builds upon recent results by Xin-Zhang-Zhang on the efficient computation of generalized Todd polynomials. The algorithm has been implemented in \texttt{Maple} under the name \texttt{Cyc-Denum} and demonstrates superior performance when \( a_i \leq 500 \) compared to Sills-Zeilberger's \texttt{Maple} package \texttt{PARTITIONS}.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
On $p$-adic Transference Theorem
Authors:
Chi Zhang
Abstract:
Dual lattice is an important concept of Euclidean lattices. In 2024, Deng gave the definition to the concept of the dual lattice of a $p$-adic lattice from the duality theory of locally compact abelian groups. He also proved some important properties of the dual lattice of $p$-adic lattices, which can be viewed as $p$-adic analogues of the famous Minkowski's first, second theorems and transference…
▽ More
Dual lattice is an important concept of Euclidean lattices. In 2024, Deng gave the definition to the concept of the dual lattice of a $p$-adic lattice from the duality theory of locally compact abelian groups. He also proved some important properties of the dual lattice of $p$-adic lattices, which can be viewed as $p$-adic analogues of the famous Minkowski's first, second theorems and transference theorems for Euclidean lattices. However, he only proved the lower bound of the transference theorems for $p$-adic lattices. The upper bound is left as an open question. In this paper, we prove the upper bound of the transference theorems for $p$-adic lattices. We then prove that the dual basis of an orthogonal basis is also an orthogonal basis with respect to the maximum norm.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
Quantitative analysis and its applications for Keller-Segel type systems
Authors:
Mengyao Ding,
Yuzhou Fang,
Chao Zhang
Abstract:
In this paper, we utilize the De Giorgi iteration to quantitatively analyze the upper bound of solutions for Keller-Segel type systems. The refined upper bound estimate presented here has broad applications in determining large time behaviours of weak solutions and improving the regularity for models involving the $p$-Laplace operator. To demonstrate the applicability of our findings, we investiga…
▽ More
In this paper, we utilize the De Giorgi iteration to quantitatively analyze the upper bound of solutions for Keller-Segel type systems. The refined upper bound estimate presented here has broad applications in determining large time behaviours of weak solutions and improving the regularity for models involving the $p$-Laplace operator. To demonstrate the applicability of our findings, we investigate the asymptotic stability of a chemotaxis model with nonlinear signal production and a chemotaxis-Navier-Stokes model with a logistic source. Additionally, within the context of $p$-Laplacian diffusion, we establish Hölder continuity for a chemotaxis-haptotaxis model and a chemotaxis-Stokes model.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
Harnack inequality for doubly nonlinear mixed local and nonlocal parabolic equations
Authors:
Vicentiu Radulescu,
Bin Shang,
Chao Zhang
Abstract:
In this paper, we establish the Harnack inequality of nonnegative weak solutions to the doubly nonlinear mixed local and nonlocal parabolic equations. This result is obtained by combining a related comparison principle, a local boundedness estimate, and an integral Harnack-type inequality. Our proof is based on the expansion of positivity together with a comparison argument.
In this paper, we establish the Harnack inequality of nonnegative weak solutions to the doubly nonlinear mixed local and nonlocal parabolic equations. This result is obtained by combining a related comparison principle, a local boundedness estimate, and an integral Harnack-type inequality. Our proof is based on the expansion of positivity together with a comparison argument.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Quantum Algorithms and Lower Bounds for Finite-Sum Optimization
Authors:
Yexin Zhang,
Chenyi Zhang,
Cong Fang,
Liwei Wang,
Tongyang Li
Abstract:
Finite-sum optimization has wide applications in machine learning, covering important problems such as support vector machines, regression, etc. In this paper, we initiate the study of solving finite-sum optimization problems by quantum computing. Specifically, let $f_1,\ldots,f_n\colon\mathbb{R}^d\to\mathbb{R}$ be $\ell$-smooth convex functions and $ψ\colon\mathbb{R}^d\to\mathbb{R}$ be a $μ$-stro…
▽ More
Finite-sum optimization has wide applications in machine learning, covering important problems such as support vector machines, regression, etc. In this paper, we initiate the study of solving finite-sum optimization problems by quantum computing. Specifically, let $f_1,\ldots,f_n\colon\mathbb{R}^d\to\mathbb{R}$ be $\ell$-smooth convex functions and $ψ\colon\mathbb{R}^d\to\mathbb{R}$ be a $μ$-strongly convex proximal function. The goal is to find an $ε$-optimal point for $F(\mathbf{x})=\frac{1}{n}\sum_{i=1}^n f_i(\mathbf{x})+ψ(\mathbf{x})$. We give a quantum algorithm with complexity $\tilde{O}\big(n+\sqrt{d}+\sqrt{\ell/μ}\big(n^{1/3}d^{1/3}+n^{-2/3}d^{5/6}\big)\big)$, improving the classical tight bound $\tildeΘ\big(n+\sqrt{n\ell/μ}\big)$. We also prove a quantum lower bound $\tildeΩ(n+n^{3/4}(\ell/μ)^{1/4})$ when $d$ is large enough. Both our quantum upper and lower bounds can extend to the cases where $ψ$ is not necessarily strongly convex, or each $f_i$ is Lipschitz but not necessarily smooth. In addition, when $F$ is nonconvex, our quantum algorithm can find an $ε$-critial point using $\tilde{O}(n+\ell(d^{1/3}n^{1/3}+\sqrt{d})/ε^2)$ queries.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
Boundedness of variation, oscillation and maximal differential transform on BMO space
Authors:
Wenting Hu,
Kai Wu,
Dongyong Yang,
Chao Zhang
Abstract:
In this paper, we prove that the oscillation operator, variation operator and maximal differential transform associated with the approximate identities are bounded from ${\rm BMO}({\mathbb R}^n)$ to its subspace ${\rm BLO}({\mathbb R}^n)$.
In this paper, we prove that the oscillation operator, variation operator and maximal differential transform associated with the approximate identities are bounded from ${\rm BMO}({\mathbb R}^n)$ to its subspace ${\rm BLO}({\mathbb R}^n)$.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
Generalized Young Measure Solutions for a Class of Quasilinear Parabolic Equations with Linear Growth
Authors:
**gfeng Shao,
Zhichang Guo,
Chao Zhang
Abstract:
Using the generalized Young measure theory, we extend the theory of Young measure solutions to a class of quasilinear parabolic equations with linear growth, and introduce the concept of generalized Young measure solutions. We prove the existence and uniqueness of the generalized Young measure solutions. In addition, for the gradient flow of convex parabolic variational integral, we show that the…
▽ More
Using the generalized Young measure theory, we extend the theory of Young measure solutions to a class of quasilinear parabolic equations with linear growth, and introduce the concept of generalized Young measure solutions. We prove the existence and uniqueness of the generalized Young measure solutions. In addition, for the gradient flow of convex parabolic variational integral, we show that the generalized Young measure solutions are equivalent to the strong solutions.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
On the weak Harnack inequalities for nonlocal double phase problems
Authors:
Yuzhou Fang,
Chao Zhang
Abstract:
This paper is devoted to studying the weak Harnack inequalities for nonlocal double phase functionals by using expansion of positivity, whose prototype is $$ \iint_{\mathbb{R}^n\times\mathbb{R}^n} \left(\frac{|u(x)-u(y)|^p}{|x-y|^{n+sp}}+a(x,y)\frac{|u(x)-u(y)|^q}{|x-y|^{n+tq}}\right) \,dxdy $$ with $a\ge0$ and $0<s\le t<1<p\le q$. The core of our approach is to establish several measure theoretic…
▽ More
This paper is devoted to studying the weak Harnack inequalities for nonlocal double phase functionals by using expansion of positivity, whose prototype is $$ \iint_{\mathbb{R}^n\times\mathbb{R}^n} \left(\frac{|u(x)-u(y)|^p}{|x-y|^{n+sp}}+a(x,y)\frac{|u(x)-u(y)|^q}{|x-y|^{n+tq}}\right) \,dxdy $$ with $a\ge0$ and $0<s\le t<1<p\le q$. The core of our approach is to establish several measure theoretical estimates based on the nonlocal Caccioppoli-type inequality, where the challenges consist in controlling subtle interaction between the pointwise behaviour of modulating coefficient and the growth exponents. Meanwhile, a quantitative boundedness result on the minimizer of such functionals is also discussed.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Sweedler duality for Hom-algebras and Hom-modules
Authors:
Jiacheng Sun,
Shuanhong Wang,
Chi Zhang,
Haoran Zhu
Abstract:
The construction of Sweedler duality is an important tool in the theory of Hopf algebras over a field, which is a right adjoint to the dual algebra functor. In this paper, we study the Sweedler duality of Hom-algebras and their Hom-modules. We delve into the structure of Hom-coalgebras and derive the linear morphisms associated with them. Additionally, as an application, we present the (right) Hom…
▽ More
The construction of Sweedler duality is an important tool in the theory of Hopf algebras over a field, which is a right adjoint to the dual algebra functor. In this paper, we study the Sweedler duality of Hom-algebras and their Hom-modules. We delve into the structure of Hom-coalgebras and derive the linear morphisms associated with them. Additionally, as an application, we present the (right) Hom-(co)module morphisms under the Sweedler duality.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
Comparisons Are All You Need for Optimizing Smooth Functions
Authors:
Chenyi Zhang,
Tongyang Li
Abstract:
When optimizing machine learning models, there are various scenarios where gradient computations are challenging or even infeasible. Furthermore, in reinforcement learning (RL), preference-based RL that only compares between options has wide applications, including reinforcement learning with human feedback in large language models. In this paper, we systematically study optimization of a smooth f…
▽ More
When optimizing machine learning models, there are various scenarios where gradient computations are challenging or even infeasible. Furthermore, in reinforcement learning (RL), preference-based RL that only compares between options has wide applications, including reinforcement learning with human feedback in large language models. In this paper, we systematically study optimization of a smooth function $f\colon\mathbb{R}^n\to\mathbb{R}$ only assuming an oracle that compares function values at two points and tells which is larger. When $f$ is convex, we give two algorithms using $\tilde{O}(n/ε)$ and $\tilde{O}(n^{2})$ comparison queries to find an $ε$-optimal solution, respectively. When $f$ is nonconvex, our algorithm uses $\tilde{O}(n/ε^2)$ comparison queries to find an $ε$-approximate stationary point. All these results match the best-known zeroth-order algorithms with function evaluation queries in $n$ dependence, thus suggest that \emph{comparisons are all you need for optimizing smooth functions using derivative-free methods}. In addition, we also give an algorithm for esca** saddle points and reaching an $ε$-second order stationary point of a nonconvex $f$, using $\tilde{O}(n^{1.5}/ε^{2.5})$ comparison queries.
△ Less
Submitted 19 May, 2024;
originally announced May 2024.
-
Graphon Mean Field Games with a Representative Player: Analysis and Learning Algorithm
Authors:
Fuzhong Zhou,
Chenyu Zhang,
Xu Chen,
Xuan Di
Abstract:
We propose a discrete time graphon game formulation on continuous state and action spaces using a representative player to study stochastic games with heterogeneous interaction among agents. This formulation admits both philosophical and mathematical advantages, compared to a widely adopted formulation using a continuum of players. We prove the existence and uniqueness of the graphon equilibrium w…
▽ More
We propose a discrete time graphon game formulation on continuous state and action spaces using a representative player to study stochastic games with heterogeneous interaction among agents. This formulation admits both philosophical and mathematical advantages, compared to a widely adopted formulation using a continuum of players. We prove the existence and uniqueness of the graphon equilibrium with mild assumptions, and show that this equilibrium can be used to construct an approximate solution for finite player game on networks, which is challenging to analyze and solve due to curse of dimensionality. An online oracle-free learning algorithm is developed to solve the equilibrium numerically, and sample complexity analysis is provided for its convergence.
△ Less
Submitted 4 June, 2024; v1 submitted 8 May, 2024;
originally announced May 2024.
-
Constructive reachability for linear control problems under conic constraints
Authors:
Camille Pouchol,
Emmanuel Trélat,
Christophe Zhang
Abstract:
Motivated by applications requiring sparse or nonnegative controls, we investigate reachability properties of linear infinite-dimensional control problems under conic constraints. Relaxing the problem to convex constraints if the initial cone is not already convex, we provide a constructive approach based on minimising a properly defined dual functional, which covers both the approximate and exact…
▽ More
Motivated by applications requiring sparse or nonnegative controls, we investigate reachability properties of linear infinite-dimensional control problems under conic constraints. Relaxing the problem to convex constraints if the initial cone is not already convex, we provide a constructive approach based on minimising a properly defined dual functional, which covers both the approximate and exact reachability problems. Our main results heavily rely on convex analysis, Fenchel duality and the Fenchel-Rockafellar theorem. As a byproduct, we uncover new sufficient conditions for approximate and exact reachability under convex conic constraints. We also prove that these conditions are in fact necessary. When the constraints are nonconvex, our method leads to sufficient conditions ensuring that the constructed controls fulfill the original constraints, which is in the flavour of bang-bang type properties. We show that our approach encompasses and generalises several works, and we obtain new results for different types of conic constraints and control systems.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
A characterization of wavelet sets on Vilenkin groups with its application to construction of MRA wavelets
Authors:
Jun Liu,
Chi Zhang
Abstract:
Let $G$ be a Vilenkin group. In 2008, Y. A. Farkov constructed wavelets on $G$ via the multiresolution analysis method. In this article, a characterization of wavelet sets on $G$ is established, which provides another method for the construction of wavelets. As an application, the relation between multiresolution analyses and wavelets determined from wavelet sets is also presented. To some extent,…
▽ More
Let $G$ be a Vilenkin group. In 2008, Y. A. Farkov constructed wavelets on $G$ via the multiresolution analysis method. In this article, a characterization of wavelet sets on $G$ is established, which provides another method for the construction of wavelets. As an application, the relation between multiresolution analyses and wavelets determined from wavelet sets is also presented. To some extent, these results positively answer a question mentioned by P. Mahapatra and D. Singh in [Bull. Sci. Math. 167 (2021), Paper No. 102945, 20 pp].
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Optimal Pricing for Linear-Quadratic Games with Nonlinear Interaction Between Agents
Authors:
Jiamin Cai,
Chenyue Zhang,
Hoi-To Wai
Abstract:
This paper studies a class of network games with linear-quadratic payoffs and externalities exerted through a strictly concave interaction function. This class of game is motivated by the diminishing marginal effects with peer influences. We analyze the optimal pricing strategy for this class of network game. First, we prove the existence of a unique Nash Equilibrium (NE). Second, we study the opt…
▽ More
This paper studies a class of network games with linear-quadratic payoffs and externalities exerted through a strictly concave interaction function. This class of game is motivated by the diminishing marginal effects with peer influences. We analyze the optimal pricing strategy for this class of network game. First, we prove the existence of a unique Nash Equilibrium (NE). Second, we study the optimal pricing strategy of a monopolist selling a divisible good to agents. We show that the optimal pricing strategy, found by solving a bilevel optimization problem, is strictly better when the monopolist knows the network structure as opposed to the best strategy agnostic to network structure. Numerical experiments demonstrate that in most cases, the maximum revenue is achieved with an asymmetric network. These results contrast with the previously studied case of linear interaction function, where a network-independent price is proven optimal with symmetric networks. Lastly, we describe an efficient algorithm to find the optimal pricing strategy.
△ Less
Submitted 3 June, 2024; v1 submitted 2 May, 2024;
originally announced May 2024.
-
On monomial algebras with representation-finite envelo** algebras
Authors:
Jianguo Zhou,
Yu-Zhe Liu,
Chao Zhang
Abstract:
The present paper mainly considers the representation type of the envelo** algebra of monomial algebra. Let $A$ be a monomial algebra and $A^e= A\otimes_{\mathrm{l}\!\mathrm{k}} A^{\mathrm{op}}$ its envelo** algebra. It is shown that $A^e$ is representation-finite if and only if $A \cong \pmb{A}_n/\mathrm{rad}^2 \pmb{A}_n$, where $\pmb{A}_n$ is the path algebra…
▽ More
The present paper mainly considers the representation type of the envelo** algebra of monomial algebra. Let $A$ be a monomial algebra and $A^e= A\otimes_{\mathrm{l}\!\mathrm{k}} A^{\mathrm{op}}$ its envelo** algebra. It is shown that $A^e$ is representation-finite if and only if $A \cong \pmb{A}_n/\mathrm{rad}^2 \pmb{A}_n$, where $\pmb{A}_n$ is the path algebra $\mathrm{l}\!\mathrm{k}\mathcal{Q}$ with $\mathcal{Q} = 1 \longrightarrow 2 \longrightarrow \cdots \longrightarrow n$. Moreover, we show that the number of all isoclasses of indecomposable $(\pmb{A}_n/ \mathrm{rad}^2\pmb{A}_n)^e$-modules is $\frac{4}{3}n^3 + n^2-\frac{7}{3}n+1$, and classify all indecomposable modules over $(\pmb{A}_n/ \mathrm{rad}^2\pmb{A}_n)^e$. Finally, the Clebsch-Gordon problem over $(\pmb{A}_n/ \mathrm{rad}^2\pmb{A}_n)^e$ is studied.
△ Less
Submitted 26 April, 2024; v1 submitted 25 April, 2024;
originally announced April 2024.
-
Three Simple Reduction Formulas for the Denumerant Functions
Authors:
Feihu Liu,
Guoce Xin,
Chen Zhang
Abstract:
Let $A$ be a nonempty set of positive integers. The restricted partition function $p_A(n)$ denotes the number of partitions of $n$ with parts in $A$. When the elements in $A$ are pairwise relatively prime positive integers, Ehrhart, Sertöz-Özlük, and Brown-Chou-Shiue derived three reduction formulas for $p_A(n)$ for $A$ with three parameters. We extend their findings for general $A$ using the Bern…
▽ More
Let $A$ be a nonempty set of positive integers. The restricted partition function $p_A(n)$ denotes the number of partitions of $n$ with parts in $A$. When the elements in $A$ are pairwise relatively prime positive integers, Ehrhart, Sertöz-Özlük, and Brown-Chou-Shiue derived three reduction formulas for $p_A(n)$ for $A$ with three parameters. We extend their findings for general $A$ using the Bernoulli-Barnes polynomials.
△ Less
Submitted 24 April, 2024; v1 submitted 22 April, 2024;
originally announced April 2024.
-
Lowest-degree robust finite element schemes for inhomogeneous bi-Laplace problems
Authors:
Bin Dai,
Huilan Zeng,
Chensong Zhang,
Shuo Zhang
Abstract:
In this paper, we study the numerical method for the bi-Laplace problems with inhomogeneous coefficients; particularly, we propose finite element schemes on rectangular grids respectively for an inhomogeneous fourth-order elliptic singular perturbation problem and for the Helmholtz transmission eigenvalue problem. The new methods use the reduced rectangle Morley (RRM for short) element space with…
▽ More
In this paper, we study the numerical method for the bi-Laplace problems with inhomogeneous coefficients; particularly, we propose finite element schemes on rectangular grids respectively for an inhomogeneous fourth-order elliptic singular perturbation problem and for the Helmholtz transmission eigenvalue problem. The new methods use the reduced rectangle Morley (RRM for short) element space with piecewise quadratic polynomials, which are of the lowest degree possible. For the finite element space, a discrete analogue of an equality by Grisvard is proved for the stability issue and a locally-averaged interpolation operator is constructed for the approximation issue. Optimal convergence rates of the schemes are proved, and numerical experiments are given to verify the theoretical analysis.
△ Less
Submitted 21 April, 2024;
originally announced April 2024.
-
A New Approach to Solving Singularly Perturbed NLS at Local Potential Maxima
Authors:
Chengxiang Zhang
Abstract:
This paper presents a new approach for addressing the singularly perturbed nonlinear Schrödinger (NLS) equation:
\begin{equation}
-\varepsilon^2Δv + V(x) v =f(v),\ v>0,\ \lim_{|x|\to \infty} v(x)=0,
\end{equation}
where $V$ possesses a local maximum point and $f$ satisfies the Berestycki-Lions conditions.The key to our approach is the derivation of a refined lower bound on the gradient norm.
This paper presents a new approach for addressing the singularly perturbed nonlinear Schrödinger (NLS) equation:
\begin{equation}
-\varepsilon^2Δv + V(x) v =f(v),\ v>0,\ \lim_{|x|\to \infty} v(x)=0,
\end{equation}
where $V$ possesses a local maximum point and $f$ satisfies the Berestycki-Lions conditions.The key to our approach is the derivation of a refined lower bound on the gradient norm.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
A Fast Observability for Diffusion Equations in $\mathbb R^N$
Authors:
Yueliang Duan,
Can Zhang
Abstract:
Given an equidistributed set in the whole Euclidean space, we have established in [1] that there exists a constant positive $C$ such that the observability inequality of diffusion equations holds for all $T\in]0,1[$, with an observability cost being of the form $Ce^{C/T}$. In this paper, for any small constant $\varepsilon>0$, we prove that there exists a nontrivial equidistributed set (in the sen…
▽ More
Given an equidistributed set in the whole Euclidean space, we have established in [1] that there exists a constant positive $C$ such that the observability inequality of diffusion equations holds for all $T\in]0,1[$, with an observability cost being of the form $Ce^{C/T}$. In this paper, for any small constant $\varepsilon>0$, we prove that there exists a nontrivial equidistributed set (in the sense that whose complementary set is unbounded), so that the above observability cost can be improved to a fast form of $Ce^{\varepsilon/T}$ for certain constant $C>0$. The proof is based on the strategy used in [1], as well as an interpolation inequality for gradients of solutions to elliptic equations obtained recently in [2].
△ Less
Submitted 7 April, 2024;
originally announced April 2024.
-
An Augmented Lagrangian Method for Training Recurrent Neural Networks
Authors:
Yue Wang,
Chao Zhang,
Xiaojun Chen
Abstract:
Recurrent Neural Networks (RNNs) are widely used to model sequential data in a wide range of areas, such as natural language processing, speech recognition, machine translation, and time series analysis. In this paper, we model the training process of RNNs with the ReLU activation function as a constrained optimization problem with a smooth nonconvex objective function and piecewise smooth nonconv…
▽ More
Recurrent Neural Networks (RNNs) are widely used to model sequential data in a wide range of areas, such as natural language processing, speech recognition, machine translation, and time series analysis. In this paper, we model the training process of RNNs with the ReLU activation function as a constrained optimization problem with a smooth nonconvex objective function and piecewise smooth nonconvex constraints. We prove that any feasible point of the optimization problem satisfies the no nonzero abnormal multiplier constraint qualification (NNAMCQ), and any local minimizer is a Karush-Kuhn-Tucker (KKT) point of the problem. Moreover, we propose an augmented Lagrangian method (ALM) and design an efficient block coordinate descent (BCD) method to solve the subproblems of the ALM. The update of each block of the BCD method has a closed-form solution. The stop criterion for the inner loop is easy to check and can be stopped in finite steps. Moreover, we show that the BCD method can generate a directional stationary point of the subproblem. Furthermore, we establish the global convergence of the ALM to a KKT point of the constrained optimization problem. Compared with the state-of-the-art algorithms, numerical results demonstrate the efficiency and effectiveness of the ALM for training RNNs.
△ Less
Submitted 21 February, 2024;
originally announced February 2024.
-
An 8-flow theorem for signed graphs
Authors:
Rong Luo,
Edita Máčajová,
Martin Škoviera,
Cun-Quan Zhang
Abstract:
We prove that a signed graph admits a nowhere-zero $8$-flow provided that it is flow-admissible and the underlying graph admits a nowhere-zero $4$-flow. When combined with the 4-color theorem, this implies that every flow-admissible bridgeless planar signed graph admits a nowhere-zero $8$-flow. Our result improves and generalizes previous results of Li et al. (European J. Combin. 108 (2023), 10362…
▽ More
We prove that a signed graph admits a nowhere-zero $8$-flow provided that it is flow-admissible and the underlying graph admits a nowhere-zero $4$-flow. When combined with the 4-color theorem, this implies that every flow-admissible bridgeless planar signed graph admits a nowhere-zero $8$-flow. Our result improves and generalizes previous results of Li et al. (European J. Combin. 108 (2023), 103627), which state that every flow-admissible signed $3$-edge-colorable cubic graph admits a nowhere-zero $10$-flow, and that every flow-admissible signed hamiltonian graph admits a nowhere-zero $8$-flow.
△ Less
Submitted 20 February, 2024;
originally announced February 2024.
-
Near-Optimal Quantum Algorithm for Minimizing the Maximal Loss
Authors:
Hao Wang,
Chenyi Zhang,
Tongyang Li
Abstract:
The problem of minimizing the maximum of $N$ convex, Lipschitz functions plays significant roles in optimization and machine learning. It has a series of results, with the most recent one requiring $O(Nε^{-2/3} + ε^{-8/3})$ queries to a first-order oracle to compute an $ε$-suboptimal point. On the other hand, quantum algorithms for optimization are rapidly advancing with speedups shown on many imp…
▽ More
The problem of minimizing the maximum of $N$ convex, Lipschitz functions plays significant roles in optimization and machine learning. It has a series of results, with the most recent one requiring $O(Nε^{-2/3} + ε^{-8/3})$ queries to a first-order oracle to compute an $ε$-suboptimal point. On the other hand, quantum algorithms for optimization are rapidly advancing with speedups shown on many important optimization problems. In this paper, we conduct a systematic study for quantum algorithms and lower bounds for minimizing the maximum of $N$ convex, Lipschitz functions. On one hand, we develop quantum algorithms with an improved complexity bound of $\tilde{O}(\sqrt{N}ε^{-5/3} + ε^{-8/3})$. On the other hand, we prove that quantum algorithms must take $\tildeΩ(\sqrt{N}ε^{-2/3})$ queries to a first order quantum oracle, showing that our dependence on $N$ is optimal up to poly-logarithmic factors.
△ Less
Submitted 20 February, 2024;
originally announced February 2024.
-
Quantitative uniqueness estimates for stochastic parabolic equations on the whole Euclidean space
Authors:
Yuanhang Liu,
Donghui Yang,
Xingwu Zeng,
Can Zhang
Abstract:
In this paper, a quantitative estimate of unique continuation for the stochastic heat equation with bounded potentials on the whole Euclidean space is established. This paper generalizes the earlier results in [29] and [17] from a bounded domain to an unbounded one. The proof is based on the locally parabolic-type frequency function method. An observability estimate from measurable sets in time fo…
▽ More
In this paper, a quantitative estimate of unique continuation for the stochastic heat equation with bounded potentials on the whole Euclidean space is established. This paper generalizes the earlier results in [29] and [17] from a bounded domain to an unbounded one. The proof is based on the locally parabolic-type frequency function method. An observability estimate from measurable sets in time for the same equation is also derived.
△ Less
Submitted 19 February, 2024;
originally announced February 2024.
-
A Multilayer Eigen-Sensitivity Method Using Loop Gain Model for Oscillation Diagnosis of Converter-Based System
Authors:
Haoxiang Zong,
Chen Zhang,
Xu Cai,
Marta Molinas
Abstract:
Loop gain-based eigen-sensitivity (LGES) is a useful frequency-domain tool for oscillation diagnosis of converter-based system. However, the existing theory is still scant in two aspects: participation factor (PF) is bound up with the frequency-domain modal characteristic that does not necessarily point to the stability as that of the time-domain eigen-sensitivity (i.e., PF of oscillation mode); a…
▽ More
Loop gain-based eigen-sensitivity (LGES) is a useful frequency-domain tool for oscillation diagnosis of converter-based system. However, the existing theory is still scant in two aspects: participation factor (PF) is bound up with the frequency-domain modal characteristic that does not necessarily point to the stability as that of the time-domain eigen-sensitivity (i.e., PF of oscillation mode); a systematic LGES analysis framework containing both component- and parameter- level sensitivity is missing. These two factors hinder the application of LGES method on the proper evaluation of stability effects, which are closely related with the time-domain oscillation mode. To address these issues, this paper proposes a multilayer LGES method directed to the oscillation mode, and a full set of indices like PF, component and parameter sensitivity are established. The link from the eigen-sensitivity of frequency domain to that of time domain is revealed, through which it is shown how the proposed LGES method can facilitate the control parameter tuning-guided oscillation suppression. The effectiveness of the proposed LGES method is validated via case studies conducted on a generic AC/DC converter-based system.
△ Less
Submitted 19 February, 2024;
originally announced February 2024.
-
Supercloseness of the DDG method for a singularly perturbed convection diffusion problem on Shishkin mesh
Authors:
Xiaoqi Ma,
** Zhang,
Xinyi Feng,
Chunxiao Zhang
Abstract:
This paper investigates the supercloseness of a singularly perturbed convection diffusion problem using the direct discontinuous Galerkin (DDG) method on a Shishkin mesh. The main technical difficulties lie in controlling the diffusion term inside the layer, the convection term outside the layer, and the inter element jump term caused by the discontinuity of the numerical solution. The main idea i…
▽ More
This paper investigates the supercloseness of a singularly perturbed convection diffusion problem using the direct discontinuous Galerkin (DDG) method on a Shishkin mesh. The main technical difficulties lie in controlling the diffusion term inside the layer, the convection term outside the layer, and the inter element jump term caused by the discontinuity of the numerical solution. The main idea is to design a new composite interpolation, in which a global projection is used outside the layer to satisfy the interface conditions determined by the selection of numerical flux, thereby eliminating or controlling the troublesome terms on the unit interface; and inside the layer, Gauß Lobatto projection is used to improve the convergence order of the diffusion term. On the basis of that, by selecting appropriate parameters in the numerical flux, we obtain the supercloseness result of almost $k+1$ order under an energy norm. Numerical experiments support our main theoretical conclusion.
△ Less
Submitted 13 February, 2024;
originally announced February 2024.
-
The super approximation property of $\mathrm{SL}_2(\mathbb{Z}/q\mathbb{Z}) \times \mathrm{SL}_2(\mathbb{Z}/q\mathbb{Z}) \times \mathrm{SL}_2(\mathbb{Z}/q\mathbb{Z})$
Authors:
Chong Zhang
Abstract:
Take $S \subset \mathrm{SL}_2(\mathbb{Z}) \times \mathrm{SL}_2(\mathbb{Z})\times \mathrm{SL}_2(\mathbb{Z})$ be finite symmetric and assume $S$ generates a group $G$ which is Zariski-dense in $\mathrm{SL}_2 \times \mathrm{SL}_2\times \mathrm{SL}_2(\mathbb{Z})$. This paper proves that the Cayley graphs $$ \{\mathcal{C} a y(G(\bmod q), S(\bmod q))\}_{q \in \mathbb{Z}_{+}} $$ form a family of expander…
▽ More
Take $S \subset \mathrm{SL}_2(\mathbb{Z}) \times \mathrm{SL}_2(\mathbb{Z})\times \mathrm{SL}_2(\mathbb{Z})$ be finite symmetric and assume $S$ generates a group $G$ which is Zariski-dense in $\mathrm{SL}_2 \times \mathrm{SL}_2\times \mathrm{SL}_2(\mathbb{Z})$. This paper proves that the Cayley graphs $$ \{\mathcal{C} a y(G(\bmod q), S(\bmod q))\}_{q \in \mathbb{Z}_{+}} $$ form a family of expanders.
△ Less
Submitted 18 December, 2023;
originally announced February 2024.
-
Interference Among First-Price Pacing Equilibria: A Bias and Variance Analysis
Authors:
Luofeng Liao,
Christian Kroer,
Sergei Leonenkov,
Okke Schrijvers,
Liang Shi,
Nicolas Stier-Moses,
Congshan Zhang
Abstract:
Online A/B testing is widely used in the internet industry to inform decisions on new feature roll-outs. For online marketplaces (such as advertising markets), standard approaches to A/B testing may lead to biased results when buyers operate under a budget constraint, as budget consumption in one arm of the experiment impacts performance of the other arm. To counteract this interference, one can u…
▽ More
Online A/B testing is widely used in the internet industry to inform decisions on new feature roll-outs. For online marketplaces (such as advertising markets), standard approaches to A/B testing may lead to biased results when buyers operate under a budget constraint, as budget consumption in one arm of the experiment impacts performance of the other arm. To counteract this interference, one can use a budget-split design where the budget constraint operates on a per-arm basis and each arm receives an equal fraction of the budget, leading to ``budget-controlled A/B testing.'' Despite clear advantages of budget-controlled A/B testing, performance degrades when budget are split too small, limiting the overall throughput of such systems. In this paper, we propose a parallel budget-controlled A/B testing design where we use market segmentation to identify submarkets in the larger market, and we run parallel experiments on each submarket.
Our contributions are as follows: First, we introduce and demonstrate the effectiveness of the parallel budget-controlled A/B test design with submarkets in a large online marketplace environment. Second, we formally define market interference in first-price auction markets using the first price pacing equilibrium (FPPE) framework. Third, we propose a debiased surrogate that eliminates the first-order bias of FPPE, drawing upon the principles of sensitivity analysis in mathematical programs. Fourth, we derive a plug-in estimator for the surrogate and establish its asymptotic normality. Fifth, we provide an estimation procedure for submarket parallel budget-controlled A/B tests. Finally, we present numerical examples on semi-synthetic data, confirming that the debiasing technique achieves the desired coverage properties.
△ Less
Submitted 11 February, 2024;
originally announced February 2024.
-
An efficient unconditional energy stable scheme for the simulation of droplet formation
Authors:
**peng Zhang,
Changjuan Zhang,
** Wang
Abstract:
We have developed an efficient and unconditionally energy-stable method for simulating droplet formation dynamics. Our approach involves a novel time-marching scheme based on the scalar auxiliary variable technique, specifically designed for solving the Cahn-Hilliard-Navier-Stokes phase field model with variable density and viscosity. We have successfully applied this method to simulate droplet fo…
▽ More
We have developed an efficient and unconditionally energy-stable method for simulating droplet formation dynamics. Our approach involves a novel time-marching scheme based on the scalar auxiliary variable technique, specifically designed for solving the Cahn-Hilliard-Navier-Stokes phase field model with variable density and viscosity. We have successfully applied this method to simulate droplet formation in scenarios where a Newtonian fluid is injected through a vertical tube into another immiscible Newtonian fluid. To tackle the challenges posed by nonhomogeneous Dirichlet boundary conditions at the tube entrance, we have introduced additional nonlocal auxiliary variables and associated ordinary differential equations. These additions effectively eliminate the influence of boundary terms. Moreover, we have incorporated stabilization terms into the scheme to enhance its numerical effectiveness. Notably, our resulting scheme is fully decoupled, requiring the solution of only linear systems at each time step. We have also demonstrated the energy decaying property of the scheme, with suitable modifications. To assess the accuracy and stability of our algorithm, we have conducted extensive numerical simulations. Additionally, we have examined the dynamics of droplet formation and explored the impact of dimensionless parameters on the process. Overall, our work presents a refined method for simulating droplet formation dynamics, offering improved efficiency, energy stability, and accuracy.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
The exponential turnpike property for periodic linear quadratic optimal control problems in infinite dimension
Authors:
Emmanuel Trélat,
Xingwu Zeng,
Can Zhang
Abstract:
In this paper, we establish an exponential periodic turnpike property for linear quadratic optimal control problems governed by periodic systems in infinite dimension. We show that the optimal trajectory converges exponentially to a periodic orbit when the time horizon tends to infinity. Similar results are obtained for the optimal control and adjoint state. Our proof is based on the large time be…
▽ More
In this paper, we establish an exponential periodic turnpike property for linear quadratic optimal control problems governed by periodic systems in infinite dimension. We show that the optimal trajectory converges exponentially to a periodic orbit when the time horizon tends to infinity. Similar results are obtained for the optimal control and adjoint state. Our proof is based on the large time behavior of solutions of operator differential Riccati equations with periodic coefficients.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
On melting for the 3D radial Stefan problem
Authors:
Chencheng Zhang
Abstract:
We consider the three-dimensional radial Stefan problem which describes the evolution of a radial symmetric ice ball with free boundary
\begin{equation*}
\left\{\begin{aligned}
&\partial_{t}u-\partial_{rr}u-\frac{2}{r}\partial_{r}u=0 \quad in\ r\geqλ(t),\\
&\partial_{r}u(t,λ(t))=-\dotλ(t),\\
&u(t,λ(t))=0,\\
&u(0,\cdot)=u_{0},\quad λ(0)=λ_{0}.
\end{aligned}\right. \end{equation*}
We…
▽ More
We consider the three-dimensional radial Stefan problem which describes the evolution of a radial symmetric ice ball with free boundary
\begin{equation*}
\left\{\begin{aligned}
&\partial_{t}u-\partial_{rr}u-\frac{2}{r}\partial_{r}u=0 \quad in\ r\geqλ(t),\\
&\partial_{r}u(t,λ(t))=-\dotλ(t),\\
&u(t,λ(t))=0,\\
&u(0,\cdot)=u_{0},\quad λ(0)=λ_{0}.
\end{aligned}\right. \end{equation*}
We prove the existence in the radial class of finite time melting with rates \begin{equation*}
λ(t)=\left\{\begin{aligned}
&4\sqrtπ\frac{\sqrt{T-t}}{|\log (T-t)|}(1+o_{t\rightarrow T}(1)),\\
&c(u_{0},k)(1+o_{t\rightarrow T}(1))(T-t)^{\frac{k+1}{2}},\quad k\in{\mathbb{N}}^{*},
\end{aligned}\right. \end{equation*}
which respectively correspond to the fundamental stable melting rate and a sequence of codimension $k$ unstable rates. Our analysis mainly depend on the methods developed in [17] which deals with the similar problems in two dimensions and also the construction of both stable and unstable finite time blow-up solutions for the harmonic heat flow in [49],[50].
△ Less
Submitted 31 January, 2024;
originally announced January 2024.
-
A Plücker coordinate mirror for partial flag varieties and quantum Schubert calculus
Authors:
Changzheng Li,
Konstanze Rietsch,
Mingzhi Yang,
Chi Zhang
Abstract:
We construct a Plücker coordinate superpotential $\mathcal{F}_-$ that is mirror to a partial flag variety $\mathbb{ F}\ell(n_\bullet)$. Its Jacobi ring recovers the small quantum cohomology of $\mathbb{ F}\ell(n_\bullet)$ and we prove a folklore conjecture in mirror symmetry. Namely, we show that the eigenvalues for the action of the first Chern class $c_1(\mathbb{ F}\ell(n_\bullet))$ on quantum c…
▽ More
We construct a Plücker coordinate superpotential $\mathcal{F}_-$ that is mirror to a partial flag variety $\mathbb{ F}\ell(n_\bullet)$. Its Jacobi ring recovers the small quantum cohomology of $\mathbb{ F}\ell(n_\bullet)$ and we prove a folklore conjecture in mirror symmetry. Namely, we show that the eigenvalues for the action of the first Chern class $c_1(\mathbb{ F}\ell(n_\bullet))$ on quantum cohomology are equal to the critical values of $\mathcal{F}_-$. We achieve this by proving new identities in quantum Schubert calculus that are inspired by our formula for $\mathcal{F}_-$ and the mirror symmetry conjecture.
△ Less
Submitted 9 February, 2024; v1 submitted 28 January, 2024;
originally announced January 2024.
-
Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement Learning
Authors:
Chenyu Zhang,
Han Wang,
Aritra Mitra,
James Anderson
Abstract:
Federated reinforcement learning (FRL) has emerged as a promising paradigm for reducing the sample complexity of reinforcement learning tasks by exploiting information from different agents. However, when each agent interacts with a potentially different environment, little to nothing is known theoretically about the non-asymptotic performance of FRL algorithms. The lack of such results can be att…
▽ More
Federated reinforcement learning (FRL) has emerged as a promising paradigm for reducing the sample complexity of reinforcement learning tasks by exploiting information from different agents. However, when each agent interacts with a potentially different environment, little to nothing is known theoretically about the non-asymptotic performance of FRL algorithms. The lack of such results can be attributed to various technical challenges and their intricate interplay: Markovian sampling, linear function approximation, multiple local updates to save communication, heterogeneity in the reward functions and transition kernels of the agents' MDPs, and continuous state-action spaces. Moreover, in the on-policy setting, the behavior policies vary with time, further complicating the analysis. In response, we introduce FedSARSA, a novel federated on-policy reinforcement learning scheme, equipped with linear function approximation, to address these challenges and provide a comprehensive finite-time error analysis. Notably, we establish that FedSARSA converges to a policy that is near-optimal for all agents, with the extent of near-optimality proportional to the level of heterogeneity. Furthermore, we prove that FedSARSA leverages agent collaboration to enable linear speedups as the number of agents increases, which holds for both fixed and adaptive step-size configurations.
△ Less
Submitted 14 April, 2024; v1 submitted 26 January, 2024;
originally announced January 2024.
-
Newton polytopes of dual $k$-Schur polynomials
Authors:
Bo Wang,
Candice X. T. Zhang,
Zhong-Xue Zhang
Abstract:
Rado's theorem about permutahedra and dominance order on partitions reveals that each Schur polynomial is M-convex, or equivalently, it has a saturated Newton polytope and this polytope is a generalized permutahedron as well. In this paper we show that the support of each dual $k$-Schur polynomial indexed by a $k$-bounded partition coincides with that of the Schur polynomial indexed by the same pa…
▽ More
Rado's theorem about permutahedra and dominance order on partitions reveals that each Schur polynomial is M-convex, or equivalently, it has a saturated Newton polytope and this polytope is a generalized permutahedron as well. In this paper we show that the support of each dual $k$-Schur polynomial indexed by a $k$-bounded partition coincides with that of the Schur polynomial indexed by the same partition, and hence the two polynomials share the same saturated Newton polytope. The main result is based on our recursive algorithm to generate a semistandard $k$-tableau for a given shape and $k$-weight. As consequences, we obtain the M-convexity of dual $k$-Schur polynomials, affine Stanley symmetric polynomials and cylindric skew Schur polynomials.
△ Less
Submitted 25 January, 2024;
originally announced January 2024.
-
Spatiotemporally adaptive compression for scientific dataset with feature preservation -- a case study on simulation data with extreme climate events analysis
Authors:
Qian Gong,
Chengzhu Zhang,
Xin Liang,
Viktor Reshniak,
Jieyang Chen,
Anand Rangarajan,
Sanjay Ranka,
Nicolas Vidal,
Lipeng Wan,
Paul Ullrich,
Norbert Podhorszki,
Robert Jacob,
Scott Klasky
Abstract:
Scientific discoveries are increasingly constrained by limited storage space and I/O capacities. For time-series simulations and experiments, their data often need to be decimated over timesteps to accommodate storage and I/O limitations. In this paper, we propose a technique that addresses storage costs while improving post-analysis accuracy through spatiotemporal adaptive, error-controlled lossy…
▽ More
Scientific discoveries are increasingly constrained by limited storage space and I/O capacities. For time-series simulations and experiments, their data often need to be decimated over timesteps to accommodate storage and I/O limitations. In this paper, we propose a technique that addresses storage costs while improving post-analysis accuracy through spatiotemporal adaptive, error-controlled lossy compression. We investigate the trade-off between data precision and temporal output rates, revealing that reducing data precision and increasing timestep frequency lead to more accurate analysis outcomes. Additionally, we integrate spatiotemporal feature detection with data compression and demonstrate that performing adaptive error-bounded compression in higher dimensional space enables greater compression ratios, leveraging the error propagation theory of a transformation-based compressor.
To evaluate our approach, we conduct experiments using the well-known E3SM climate simulation code and apply our method to compress variables used for cyclone tracking. Our results show a significant reduction in storage size while enhancing the quality of cyclone tracking analysis, both quantitatively and qualitatively, in comparison to the prevalent timestep decimation approach. Compared to three state-of-the-art lossy compressors lacking feature preservation capabilities, our adaptive compression framework improves perfectly matched cases in TC tracking by 26.4-51.3% at medium compression ratios and by 77.3-571.1% at large compression ratios, with a merely 5-11% computational overhead.
△ Less
Submitted 6 January, 2024;
originally announced January 2024.
-
cuPDLP-C: A Strengthened Implementation of cuPDLP for Linear Programming by C language
Authors:
Haihao Lu,
**wen Yang,
Haodong Hu,
Qi Huangfu,
**song Liu,
Tianhao Liu,
Yinyu Ye,
Chuwen Zhang,
Dongdong Ge
Abstract:
A recent GPU implementation of the Restarted Primal-Dual Hybrid Gradient Method for Linear Programming was proposed in Lu and Yang (2023). Its computational results demonstrate the significant computational advantages of the GPU-based first-order algorithm on certain large-scale problems. The average performance also achieves a level close to commercial solvers for the first time in history. Howev…
▽ More
A recent GPU implementation of the Restarted Primal-Dual Hybrid Gradient Method for Linear Programming was proposed in Lu and Yang (2023). Its computational results demonstrate the significant computational advantages of the GPU-based first-order algorithm on certain large-scale problems. The average performance also achieves a level close to commercial solvers for the first time in history. However, due to limitations in experimental hardware and the disadvantage of implementing the algorithm in Julia compared to C language, neither the commercial solver nor cuPDLP reached their maximum efficiency. Therefore, in this report, we have re-implemented and optimized cuPDLP in C language. Utilizing state-of-the-art CPU and GPU hardware, we extensively compare cuPDLP with the best commercial solvers. The experiments further highlight its substantial computational advantages and potential for solving large-scale linear programming problems. We also discuss the profound impact this breakthrough may have on mathematical programming research and the entire operations research community.
△ Less
Submitted 7 January, 2024; v1 submitted 22 December, 2023;
originally announced December 2023.
-
MGCNN: a learnable multigrid solver for sparse linear systems from PDEs on structured grids
Authors:
Yan Xie,
Minrui Lv,
Chensong Zhang
Abstract:
This paper presents a learnable solver tailored to iteratively solve sparse linear systems from discretized partial differential equations (PDEs). Unlike traditional approaches relying on specialized expertise, our solver streamlines the algorithm design process for a class of PDEs through training, which requires only training data of coefficient distributions. The proposed method is anchored by…
▽ More
This paper presents a learnable solver tailored to iteratively solve sparse linear systems from discretized partial differential equations (PDEs). Unlike traditional approaches relying on specialized expertise, our solver streamlines the algorithm design process for a class of PDEs through training, which requires only training data of coefficient distributions. The proposed method is anchored by three core principles: (1) a multilevel hierarchy to promote rapid convergence, (2) adherence to linearity concerning the right-hand-side of equations, and (3) weights sharing across different levels to facilitate adaptability to various problem sizes. Built on these foundational principles and considering the similar computation pattern of the convolutional neural network (CNN) as multigrid components, we introduce a network adept at solving linear systems from PDEs with heterogeneous coefficients, discretized on structured grids. Notably, our proposed solver possesses the ability to generalize over right-hand-side terms, PDE coefficients, and grid sizes, thereby ensuring its training is purely offline. To evaluate its effectiveness, we train the solver on convection-diffusion equations featuring heterogeneous diffusion coefficients. The solver exhibits swift convergence to high accuracy over a range of grid sizes, extending from $31 \times 31$ to $4095 \times 4095$. Remarkably, our method outperforms the classical Geometric Multigrid (GMG) solver, demonstrating a speedup of approximately 3 to 8 times. Furthermore, our numerical investigation into the solver's capacity to generalize to untrained coefficient distributions reveals promising outcomes.
△ Less
Submitted 9 May, 2024; v1 submitted 18 December, 2023;
originally announced December 2023.
-
Two-sample inference for sparse functional data
Authors:
Chi Zhang,
Peijun Sang,
Yingli Qin
Abstract:
We propose a novel test procedure for comparing mean functions across two groups within the reproducing kernel Hilbert space (RKHS) framework. Our proposed method is adept at handling sparsely and irregularly sampled functional data when observation times are random for each subject. Conventional approaches, which are built upon functional principal components analysis, usually assume a homogeneou…
▽ More
We propose a novel test procedure for comparing mean functions across two groups within the reproducing kernel Hilbert space (RKHS) framework. Our proposed method is adept at handling sparsely and irregularly sampled functional data when observation times are random for each subject. Conventional approaches, which are built upon functional principal components analysis, usually assume a homogeneous covariance structure across groups. Nonetheless, justifying this assumption in real-world scenarios can be challenging. To eliminate the need for a homogeneous covariance structure, we first develop the functional Bahadur representation for the mean estimator under the RKHS framework; this representation naturally leads to the desirable pointwise limiting distributions. Moreover, we establish weak convergence for the mean estimator, allowing us to construct a test statistic for the mean difference. Our method is easily implementable and outperforms some conventional tests in controlling type I errors across various settings. We demonstrate the finite sample performance of our approach through extensive simulations and two real-world applications.
△ Less
Submitted 29 December, 2023; v1 submitted 12 December, 2023;
originally announced December 2023.
-
An algebraic combinatorial approach to Sylvester's denumerant
Authors:
Guoce Xin,
Chen Zhang
Abstract:
For a positive integer sequence $\boldsymbol{a}=(a_1, \dots, a_{N+1})$, Sylvester's denumerant $E(\boldsymbol{a}; t)$ counts the number of nonnegative integer solutions to $\sum_{i=1}^{N+1} a_i x_i = t$ for a nonnegative integer $t$. It has been extensively studied and a well-known result asserts that $E(\boldsymbol{a}; t)$ is a quasi-polynomial in $t$ of degree $N$. A milestone is Baldoni et al.'…
▽ More
For a positive integer sequence $\boldsymbol{a}=(a_1, \dots, a_{N+1})$, Sylvester's denumerant $E(\boldsymbol{a}; t)$ counts the number of nonnegative integer solutions to $\sum_{i=1}^{N+1} a_i x_i = t$ for a nonnegative integer $t$. It has been extensively studied and a well-known result asserts that $E(\boldsymbol{a}; t)$ is a quasi-polynomial in $t$ of degree $N$. A milestone is Baldoni et al.'s polynomial algorithm in 2015 for computing the top $k$ coefficients when $k$ is fixed. Their development uses heavily lattice point counting theory in computational geometry. In this paper, we explain their work in the context of algebraic combinatorics and simplify their computation. Our work is based on constant term method, Barvinok's unimodular cone decomposition, and recent results on fast computation of generalized Todd polynomials. We develop the algorithm \texttt{CT-Knapsack}, together with an implementation in \texttt{Maple}. Our algorithm avoids plenty of repeated computations and is hence faster.
△ Less
Submitted 3 December, 2023;
originally announced December 2023.
-
Norm Orthogonal Bases and Invariants of $p$-adic Lattices
Authors:
Chi Zhang,
Yingpu Deng,
Zhaonan Wang
Abstract:
In 2018, the longest vector problem (LVP) and the closest vector problem (CVP) in $p$-adic lattices were introduced. These problems are closely linked to the orthogonalization process. In this paper, we first prove that every $p$-adic lattice has an orthogonal basis and give definition to the successive maxima and the escape distance, as the $p$-adic analogues of the successive minima and the cove…
▽ More
In 2018, the longest vector problem (LVP) and the closest vector problem (CVP) in $p$-adic lattices were introduced. These problems are closely linked to the orthogonalization process. In this paper, we first prove that every $p$-adic lattice has an orthogonal basis and give definition to the successive maxima and the escape distance, as the $p$-adic analogues of the successive minima and the covering radius in Euclidean lattices. Then, we present deterministic polynomial time algorithms to perform the orthogonalization process, solve the LVP and solve the CVP with an orthogonal basis of the whole vector space. Finally, we conclude that orthogonalization and the CVP are polynomially equivalent.
△ Less
Submitted 24 January, 2024; v1 submitted 29 November, 2023;
originally announced November 2023.
-
Dynamical State Feedback Control for Linear Input Delay Systems, Part I: Dissipative Stabilization via Semidefinite Programming
Authors:
Qian Feng,
Cong Zhang,
Bo Wei
Abstract:
It is well known that predictor controllers can completely eliminate the destabilizing effects of input delays. However, their design is typically based on direct constructions that leave little room for incorporating closed-loop performance objectives. To address this issue, we introduce the concept of parameterized linear dynamical state feedbacks (LDSFs) that can achieve both input delay compen…
▽ More
It is well known that predictor controllers can completely eliminate the destabilizing effects of input delays. However, their design is typically based on direct constructions that leave little room for incorporating closed-loop performance objectives. To address this issue, we introduce the concept of parameterized linear dynamical state feedbacks (LDSFs) that can achieve both input delay compensation and stabilization for linear input delay systems with dissipative constraints. This control construct draws inspiration from recent developments in the mathematical treatment of distributed delays, and generalizes conventional predictor controllers, where the degree of parameterization can be increased by adjusting the integral term. A sufficient condition for the existence of the LDSF is formulated as matrix inequalities by constructing a complete type Krasovskii functional. To solve the bilinear matrix inequality in the synthesis condition, we employ an inner convex approximation algorithm that can be initialized using the gains of a predictor controller obtained via explicit construction. Unlike traditional predictor controllers, the parameters of our LTDS can be directly tuned via the proposed optimization framework. Numerical examples and simulation have been experimented to demonstrate the validity and effectiveness of our methodology.
△ Less
Submitted 25 November, 2023;
originally announced November 2023.
-
A Universal Trust-Region Method for Convex and Nonconvex Optimization
Authors:
Yuntian Jiang,
Chang He,
Chuwen Zhang,
Dongdong Ge,
Bo Jiang,
Yinyu Ye
Abstract:
This paper presents a universal trust-region method simultaneously incorporating quadratic regularization and the ball constraint. We introduce a novel mechanism to set the parameters in the proposed method that unifies the analysis for convex and nonconvex optimization. Our method exhibits an iteration complexity of $\tilde O(ε^{-3/2})$ to find an approximate second-order stationary point for non…
▽ More
This paper presents a universal trust-region method simultaneously incorporating quadratic regularization and the ball constraint. We introduce a novel mechanism to set the parameters in the proposed method that unifies the analysis for convex and nonconvex optimization. Our method exhibits an iteration complexity of $\tilde O(ε^{-3/2})$ to find an approximate second-order stationary point for nonconvex optimization. Meanwhile, the analysis reveals that the universal method attains an $O(ε^{-1/2})$ complexity bound for convex optimization and can be accelerated. These results are complementary to the existing literature as the trust-region method was historically conceived for nonconvex optimization. Finally, we develop an adaptive universal method to address practical implementations. The numerical results show the effectiveness of our method in both nonconvex and convex problems.
△ Less
Submitted 12 March, 2024; v1 submitted 19 November, 2023;
originally announced November 2023.
-
Optimal H{ö}lder convergence of a class of singular steady states to the Bahouri-Chemin patch
Authors:
Yupei Huang,
Chilin Zhang
Abstract:
In \cite{elgindi2022regular}, a family of singular steady states near the Bahouri-Chemin patch was introduced. In this paper, we obtain the optimal regularity and convergence of the singular steady states contruced in \cite{elgindi2022regular} to the Bahouri-Chemin patch. We first derive a boundary Harnack principle, and then obtain the optimal convergence results using the singular integral repre…
▽ More
In \cite{elgindi2022regular}, a family of singular steady states near the Bahouri-Chemin patch was introduced. In this paper, we obtain the optimal regularity and convergence of the singular steady states contruced in \cite{elgindi2022regular} to the Bahouri-Chemin patch. We first derive a boundary Harnack principle, and then obtain the optimal convergence results using the singular integral representation based on Green's function.
△ Less
Submitted 15 June, 2024; v1 submitted 18 November, 2023;
originally announced November 2023.
-
Distributional Finite Element curl div Complexes and Application to Quad Curl Problems
Authors:
Long Chen,
Xuehai Huang,
Chao Zhang
Abstract:
The paper addresses the challenge of constructing conforming finite element spaces for high-order differential operators in high dimensions, with a focus on the curl div operator in three dimensions. Tangential-normal continuity is introduced in order to develop distributional finite element curl div complexes. The spaces constructed are applied to discretize a quad curl problem, demonstrating opt…
▽ More
The paper addresses the challenge of constructing conforming finite element spaces for high-order differential operators in high dimensions, with a focus on the curl div operator in three dimensions. Tangential-normal continuity is introduced in order to develop distributional finite element curl div complexes. The spaces constructed are applied to discretize a quad curl problem, demonstrating optimal order of convergence. Furthermore, a hybridization technique is proposed, demonstrating its equivalence to nonconforming finite elements and weak Galerkin methods.
△ Less
Submitted 15 November, 2023; v1 submitted 15 November, 2023;
originally announced November 2023.
-
D invariants obstruction to sliceness of a class of algebraically slice knots
Authors:
Chen Zhang
Abstract:
We use d invariants of the 2-fold branched cover to show nonsliceness of a set of algebraically slice knots.
We use d invariants of the 2-fold branched cover to show nonsliceness of a set of algebraically slice knots.
△ Less
Submitted 12 November, 2023;
originally announced November 2023.
-
The edge-girth-regularity of Wenger graphs
Authors:
Fuyuan Yang,
Qiang Sun,
Chao Zhang
Abstract:
Let $n\ge 1$ be an integer and $\mathbb{F}_q$ be a finite field of characteristic $p$ with $q$ elements. In this paper, it is proved that the Wenger graph $W_n(q)$ and linearized Wenger graph $L_m(q)$ are edge-girth-regular $(v,k,g,λ)$-graphs, and the parameter $λ$ of graphs $W_n(q)$ and $L_m(q)$ is completely determined. Here, an edge-girth-regular graph $egr(v,k,g,λ)$ means a $k$-regular graph o…
▽ More
Let $n\ge 1$ be an integer and $\mathbb{F}_q$ be a finite field of characteristic $p$ with $q$ elements. In this paper, it is proved that the Wenger graph $W_n(q)$ and linearized Wenger graph $L_m(q)$ are edge-girth-regular $(v,k,g,λ)$-graphs, and the parameter $λ$ of graphs $W_n(q)$ and $L_m(q)$ is completely determined. Here, an edge-girth-regular graph $egr(v,k,g,λ)$ means a $k$-regular graph of order $v$ and girth $g$ satisfying that any edge is contained in $λ$ distinct $g$-cycles. As a direct corollary, we obtain the number of girth cycles of graph $W_n(q)$, and the lower bounds on the generalized Turán numbers $ex(n, C_{6}, \mathscr{C}_{5})$ and $ex(n, C_{8}, \mathscr{C}_{7})$, where $C_k$ is the cycle of length $k$ and $\mathscr{C}_k = \{C_3, C_4, \dots , C_k\}$.Moreover, there exist a family of $egr(2q^3,q,8,(q-1)^3(q-2))$-graphs for $q$ odd, and the order of graph $W_2(q)$ and extremal $egr(v,q,8,(q-1)^3(q-2))$-graph have same asymptotic order for $q$ odd.
△ Less
Submitted 7 November, 2023;
originally announced November 2023.
-
Entropy solutions to the fully non-local diffusion equations
Authors:
Ying Li,
Chao Zhang
Abstract:
We consider the fully non-local diffusion equations with non-negative $L^1$-data. Based on the approximation and energy methods, we prove the existence and uniqueness of non-negative entropy solutions for such problems. In particular, our results are valid for the time-space fractional Laplacian equations.
We consider the fully non-local diffusion equations with non-negative $L^1$-data. Based on the approximation and energy methods, we prove the existence and uniqueness of non-negative entropy solutions for such problems. In particular, our results are valid for the time-space fractional Laplacian equations.
△ Less
Submitted 1 November, 2023;
originally announced November 2023.
-
On the solutions of nonlocal 1-Laplacian equation with $L^1$-data
Authors:
Dingding Li,
Chao Zhang
Abstract:
We study the solutions to a nonlocal 1-Laplacian equation given by $$ 2\text{P.V.}\int_{\mathbb{R}^N}\frac{u(x)-u(y)}{|u(x)-u(y)|} \frac{dy}{|x-y|^{N+s}}=f(x) \quad \textmd{for } x\in Ω, $$ with Dirichlet boundary condition $u(x)=0$ in $\mathbb R^N\backslash Ω$ and nonnegative $L^1$-data. By investigating the asymptotic behaviour of renormalized solutions $u_p$ to the nonlocal $p$-Laplacian equati…
▽ More
We study the solutions to a nonlocal 1-Laplacian equation given by $$ 2\text{P.V.}\int_{\mathbb{R}^N}\frac{u(x)-u(y)}{|u(x)-u(y)|} \frac{dy}{|x-y|^{N+s}}=f(x) \quad \textmd{for } x\in Ω, $$ with Dirichlet boundary condition $u(x)=0$ in $\mathbb R^N\backslash Ω$ and nonnegative $L^1$-data. By investigating the asymptotic behaviour of renormalized solutions $u_p$ to the nonlocal $p$-Laplacian equations as $p$ goes to $1^+$, we introduce a suitable definition of solutions and prove that the limit function $u$ of $\{u_p\}$ is a solution of the nonlocal $1$-Laplacian equation above.
△ Less
Submitted 31 October, 2023;
originally announced November 2023.
-
On nilponent hypergroups
Authors:
Chi Zhang,
Wenbin Guo
Abstract:
In this paper, we establish the theory of nilpotent hypergroups and study some properties of nilpotent hypergroups and provided some structural characterizations of nilpotent hypergroups.
In this paper, we establish the theory of nilpotent hypergroups and study some properties of nilpotent hypergroups and provided some structural characterizations of nilpotent hypergroups.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.