Search | arXiv e-print repository

Learning-rate-free Momentum SGD with Reshuffling Converges in Nonsmooth Nonconvex Optimization

Authors: Xiaoyin Hu, Nachuan Xiao, Xin Liu, Kim-Chuan Toh

Abstract: In this paper, we propose a generalized framework for developing learning-rate-free momentum stochastic gradient descent (SGD) methods in the minimization of nonsmooth nonconvex functions, especially in training nonsmooth neural networks. Our framework adaptively generates learning rates based on the historical data of stochastic subgradients and iterates. Under mild conditions, we prove that our proposed framework enjoys global convergence to the stationary points of the objective function in the sense of the conservative field, hence providing convergence guarantees for training nonsmooth neural networks. Based on our proposed framework, we propose a novel learning-rate-free momentum SGD method (LFM). Preliminary numerical experiments reveal that LFM performs comparably to the state-of-the-art learning-rate-free methods (which have not been shown theoretically to be convergent) across well-known neural network training benchmarks.

Submitted 26 June, 2024; originally announced June 2024.

Comments: 26 pages

arXiv:2406.07789 [pdf, ps, other]

A posteriori error estimates for the exponential midpoint method for linear and semilinear parabolic equations

Authors: Xianfa Hu, Wansheng Wang, Mengli Mao, Jiliang Cao

Abstract: In this paper, the a posteriori error estimates of the exponential midpoint method for time discretization are studied for linear and semilinear parabolic equations. Using the exponential midpoint approximation defined by a continuous and piecewise linear interpolation of nodal values yields the suboptimal order estimates. Based on the property of the entire function, we introduce a continuous and piecewise quadratic time reconstruction of the exponential midpoint method to derive the optimal order estimates, and the error bounds are solely dependent on the discretization parameters, the data of the problem and the approximation of the entire function. Several numerical examples are implemented to illustrate the theoretical results.

Submitted 11 June, 2024; originally announced June 2024.

arXiv:2405.20554 [pdf, ps, other]

Three approaches to a categorical Torelli theorem for cubic threefolds of non-Eckardt type via the equivariant Kuznetsov components

Authors: Sebastian Casalaina-Martin, Xianyu Hu, Xun Lin, Shizhuo Zhang, Zheng Zhang

Abstract: Let $Y$ be a cubic threefold with a non-Eckardt type involution $τ$. Our first main result is that the $τ$-equivariant category of the Kuznetsov component $\mathcal{K}u_{\mathbb{Z}_2}(Y)$ determines the isomorphism class of $Y$ for general $(Y,τ)$. We shall prove this categorical Torelli theorem via three approaches: a noncommutative Hodge theoretical one (using a generalization of the intermediate Jacobian construction in [perry2020integral]), a Bridgeland moduli theoretical one (using equivariant stability conditions), and a Chow theoretical one (using some techniques in [kuznetsovnonclodedfield2021]). The remaining part of the paper is devoted to proving an equivariant infinitesimal categorical Torelli for non-Eckardt cubic threefolds $(Y,τ)$. To accomplish it, we prove a compatibility theorem on the algebra structures of the Hochschild cohomology of the bounded derived category $D^b(X)$ of a smooth projective variety $X$ and on the Hochschild cohomology of a semi-orthogonal component of $D^b(X)$. Another key ingredient is a generalization of a result in [macri2009infinitesimal] which shows that the twisted Hochschild-Kostant-Rosenberg isomorphism is compatible with the actions on the Hochschild cohomology and on the singular cohomology induced by an automorphism of $X$.

Submitted 30 May, 2024; originally announced May 2024.

Comments: 37 pages, comments are welcome

MSC Class: 14F05; 14J45; 14D20; 14D23

arXiv:2405.06248 [pdf, other]

Adversarial neural network methods for topology optimization of eigenvalue problems

Authors: Xindi Hu, Jiaming Weng, Shengfeng Zhu

Abstract: This research presents a novel method using an adversarial neural network to solve the eigenvalue topology optimization problems. The study focuses on optimizing the first eigenvalues of second-order elliptic and fourth-order biharmonic operators subject to geometry constraints. These models are usually solved with topology optimization algorithms based on sensitivity analysis, in which it is expensive to repeatedly solve the nonlinear constrained eigenvalue problem with traditional numerical methods such as finite elements or finite differences. In contrast, our method leverages automatic differentiation within the deep learning framework. Furthermore, the adversarial neural networks enable different neural networks to train independently, which improves the training efficiency and achieves satisfactory optimization results. Numerical results are presented to verify the effectiveness of the algorithms for maximizing and minimizing the first eigenvalues.

Submitted 10 May, 2024; originally announced May 2024.

arXiv:2404.10398 [pdf, other]

Problem of eigenvalues of stochastic Hamiltonian systems with boundary conditions and Markov chain

Authors: Tian Chen, Xijun Hu, Zhen Wu

Abstract: In this paper, we study the eigenvalue problem of a stochastic Hamiltonian system driven by Brownian motion and a Markov chain with boundary conditions and time-dependent coefficients. For any dimensional case, the existence of the first eigenvalue is proven and the corresponding eigenfunctions are constructed by virtue of dual transformation and a generalized Riccati equation system. Furthermore, we have more finely characterized the existence of all eigenvalues and constructed the related eigenfunctions for the one-dimensional Hamiltonian system. Moreover, the increasing order of these eigenvalues has also been given.

Submitted 16 April, 2024; originally announced April 2024.

Comments: 22 pages

MSC Class: 60J10; 34B99; 34L15

arXiv:2404.09438 [pdf, other]

Developing Lagrangian-based Methods for Nonsmooth Nonconvex Optimization

Authors: Nachuan Xiao, Kuangyu Ding, Xiaoyin Hu, Kim-Chuan Toh

Abstract: In this paper, we consider the minimization of a nonsmooth nonconvex objective function $f(x)$ over a closed convex subset $\mathcal{X}$ of $\mathbb{R}^n$, with additional nonsmooth nonconvex constraints $c(x) = 0$. We develop a unified framework for developing Lagrangian-based methods, which takes a single-step update to the primal variables by some subgradient methods in each iteration. These subgradient methods are ``embedded'' into our framework, in the sense that they are incorporated as black-box updates to the primal variables. We prove that our proposed framework inherits the global convergence guarantees from these embedded subgradient methods under mild conditions. In addition, we show that our framework can be extended to solve constrained optimization problems with expectation constraints. Based on the proposed framework, we show that a wide range of existing stochastic subgradient methods, including the proximal SGD, proximal momentum SGD, and proximal ADAM, can be embedded into Lagrangian-based methods. Preliminary numerical experiments on deep learning tasks illustrate that our proposed framework yields efficient variants of Lagrangian-based methods with convergence guarantees for nonconvex nonsmooth constrained optimization problems.

Submitted 14 April, 2024; originally announced April 2024.

Comments: 30 pages, 4 figures

arXiv:2404.02753 [pdf, other]

The irreducibility of some families of linear series with imposed ramifications

Authors: Xiaoyu Hu

Abstract: Supposing the generalized Brill-Noether number is zero, we prove that there exists a family of twice-marked smooth projective curves such that the family of linear series with two imposed ramification conditions is irreducible. Moreover, under certain conditions, we show that the monodromy group contains the alternating group. In the case $r=1$, the monodromy group is the full symmetric group.

Submitted 8 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

arXiv:2403.20192 [pdf, ps, other]

Small Ball Probabilities for Simple Random Tensors

Authors: Xuehan Hu, Grigoris Paouris

Abstract: We study the small ball probability of an order-$\ell$ simple random tensor $X=X^{(1)}\otimes\cdots\otimes X^{(\ell)}$ where $X^{(i)}, 1\leq i\leq\ell$ are independent random vectors in $\mathbb{R}^n$ that are log-concave or have independent coordinates with bounded densities. We show that the probability that the projection of $X$ onto an $m$-dimensional subspace $F$ falls within a Euclidean ball of length $\varepsilon$ is upper bounded by $\frac{\varepsilon}{(\ell-1)!}\left(C\log\left(\frac{e}{\varepsilon}\right)\right)^{\ell}$ and also this upper bound is sharp when $m$ is small. We also establish that a much better estimate holds true for a random subspace.

Submitted 29 March, 2024; originally announced March 2024.

arXiv:2403.00623 [pdf, other]

Analysis of the particle relaxation method for generating uniform particle distributions in smoothed particle hydrodynamics

Authors: Yu Fan, Xiaoliang Li, Shuoguo Zhang, Xiangyu Hu, Nikolaus A. Adams

Abstract: We establish a theoretical framework of the particle relaxation method for uniform particle generation of Smoothed Particle Hydrodynamics. We achieve this by reformulating the particle relaxation as an optimization problem. The objective function is an integral difference between discrete particle-based and smoothed-analytical volume fractions. The analysis demonstrates that the particle relaxation method in the domain interior is essentially equivalent to employing a gradient descent approach to solve this optimization problem, and we can extend such an equivalence to the bounded domain by introducing a proper boundary term. Additionally, each periodic particle distribution has a spatially uniform particle volume, denoted as characteristic volume. The relaxed particle distribution has the largest characteristic volume, and the kernel cut-off radius determines this volume. This insight enables us to control the relaxed particle distribution by selecting the target kernel cut-off radius for a given kernel function.

Submitted 1 March, 2024; originally announced March 2024.

MSC Class: 65N50; 70F10; 74S30

arXiv:2402.04827 [pdf, other]

The scaling limit of the volume of loop O(n) quadrangulations

Authors: Élie Aïdékon, William Da Silva, XingJian Hu

Abstract: We study the volume of rigid loop-$O(n)$ quadrangulations with a boundary of length $2p$ in the critical non-generic regime. We prove that, as the half-perimeter $p$ goes to infinity, the volume scales in distribution to an explicit random variable. This limiting random variable is described in terms of the multiplicative cascades of Chen, Curien and Maillard arXiv:1702.06916, or alternatively (in the dilute case) as the law of the area of a unit-boundary $γ$-quantum disc, as determined by Ang and Gwynne arXiv:1903.09120, for suitable $γ$. Our arguments go through a classification of the map into several regions, where we rule out the contribution of bad regions to be left with a tractable portion of the map. One key observable for this classification is a Markov chain which explores the nested loops around a size-biased vertex picked in the map, making explicit the spinal structure of the discrete multiplicative cascade.

Submitted 7 February, 2024; originally announced February 2024.

Comments: 45 pages, 6 figures, comments welcome!

MSC Class: 05C80; 60K35; 60J80

arXiv:2401.12359 [pdf, ps, other]

Positivstellensätze and Moment problems with Universal Quantifiers

Authors: Xiaomeng Hu, Igor Klep, Jiawang Nie

Abstract: This paper studies Positivstellensätze and moment problems for sets that are given by universal quantifiers. Let $Q$ be a closed set and let $g = (g_1,...,g_s)$ be a tuple of polynomials in two vector variables $x$ and $y$. Then $K$ is described as the set of all points $x$ such that each $g_j(x, y) \ge 0$ for all $y \in Q$. Fix a measure $ν$ with $supp(ν) = Q$, and assume it satisfies the Carleman condition. The first main result of the paper is a Positivstellensatz with universal quantifiers: if a polynomial $f(x)$ is positive on $K$, then it belongs to the quadratic module $QM(g,ν)$ associated to $(g,ν)$, under the archimedeanness assumption on $QM(g,ν)$. Here, $QM(g,ν)$ denotes the quadratic module of polynomials in $x$ that can be represented as \[τ_0(x) + \int τ_1(x,y)g_1(x, y)\, dν(y) + \cdots + \int τ_s(x,y) g_s(x, y)\, dν(y), \] where each $τ_j$ is a sum of squares polynomial. Second, necessary and sufficient conditions for a full (or truncated) multisequence to admit a representing measure supported in $K$ are given. In particular, the classical flat extension theorem of Curto and Fialkow is generalized to truncated moment problems on such a set $K$. Finally, applications of these results for solving semi-infinite optimization problems are presented.

Submitted 22 January, 2024; originally announced January 2024.

MSC Class: 13J30; 44A60; 90C23; 47A57; 90C34

arXiv:2312.17392 [pdf, ps, other]

Equivariant Kuznetsov Components of Certain Cubic Fourfolds

Authors: Xianyu Hu

Abstract: Let $M$ denote a specific cubic fourfold that accommodates a group action by $\mathbb{Z}/3\mathbb{Z}$. Through utilization of the derived McKay correspondence, we present a new proof establishing the identification of the equivariant Kuznetsov component in the equivariant derived category of $M$ with the derived category of a certain abelian surface. This surface naturally emerges from the defining equation of the cubic fourfold $M$.

Submitted 28 December, 2023; originally announced December 2023.

MSC Class: 14E16; 14F08; 14J35

arXiv:2311.08634 [pdf, ps, other]

On the minimum degree of minimally $ t $-tough, claw-free graphs

Authors: Hui Ma, Xiaomin Hu, Weihua Yang

Abstract: A graph $ G $ is minimally $ t $-tough if the toughness of $ G $ is $ t $ and deletion of any edge from $ G $ decreases its toughness. Katona et al. conjectured that the minimum degree of any minimally $ t $-tough graph is $ \lceil 2t\rceil $ and proved that the minimum degree of minimally $ \frac{1}2 $-tough and $ 1 $-tough, claw-free graphs is 1 and 2, respectively. We have shown that every minimally $ 3/2 $-tough, claw-free graph has a vertex of degree $ 3 $. In this paper, we give an upper bound on the minimum degree of minimally $t$-tough, claw-free graphs for $ t\geq 2 $.

Submitted 14 November, 2023; originally announced November 2023.

arXiv:2311.06793 [pdf, ps, other]

Rationality of $\mbox{dlog}$ $\mathbb{A}^1$-zeta functions

Authors: Xiaowen Hu

Abstract: For every smooth proper scheme over a finite field $\mathbb{F}_q$, Bilu, Ho, Srinivasan, Vogt, and Wickelgren introduced the dlog zeta function with coefficients in the Grothendieck-Witt ring $\mathrm{GW}(\mathbb{F}_q)$, enriching the dlog of the classical Weil zeta function with coefficients in $\mathbb{Z}$. They defined a notion of dlog rationality of such dlog zeta functions, which enriches the rationality of the Weil zeta function, and showed the dlog rationality for simple cellular schemes. In this paper, we show that for any smooth proper scheme over $\mathbb{F}_q$, the dlog zeta function is rational, but not necessarily dlog rational.

Submitted 12 November, 2023; originally announced November 2023.

Comments: Comments are welcome!

MSC Class: 14G10; 11R04 (Primary) 14F42; 11G25 (Secondary)

arXiv:2310.17938 [pdf, other]

Autoequivalences of Blow-Ups of Minimal Surfaces

Authors: Xianyu Hu, Johannes Krah

Abstract: Let X be the blow-up of the projective plane in a finite set of points in very general position. We show that X has only standard autoequivalences, no nontrivial Fourier-Mukai partners, and admits no spherical objects. Further, we show that the same result holds if X is a blow-up of finitely many points in a minimal surface of nonnegative Kodaira dimension which contains no (-2)-curves. Independently, we characterize spherical objects on blow-ups of minimal surfaces of positive Kodaira dimension.

Submitted 27 October, 2023; originally announced October 2023.

Comments: 9 pages

MSC Class: 14F08; 14J26

arXiv:2310.09381 [pdf, ps, other]

A Local Fourier Analysis for Additive Schwarz Smoothers

Authors: Álvaro Pé de la Riva, Carmen Rodrigo, Francisco J. Gaspar, James H. Adler, Xiaozhe Hu, Ludmil Zikatanov

Abstract: In this work, a local Fourier analysis is presented to study the convergence of multigrid methods based on additive Schwarz smoothers. This analysis is presented as a general framework which allows us to study these smoothers for any type of discretization and problem. The presented framework is crucial in practice since it allows one to know a priori the answer to questions such as what is the size of the patch to use within these relaxations, the size of the overlapping, or even the optimal values for the weights involved in the smoother. Results are shown for a class of additive and restricted additive Schwarz relaxations used within a multigrid framework applied to high-order finite-element discretizations and saddle point problems, which are two of the contexts in which these types of relaxations are widely used.

Submitted 13 October, 2023; originally announced October 2023.

arXiv:2309.11397 [pdf, other]

Explicit compactifications of moduli spaces of secondary Burniat surfaces

Authors: Valery Alexeev, Xiaoyan Hu

Abstract: We describe explicitly the geometric compactifications, obtained by adding slc surfaces $X$ with ample canonical class, of moduli spaces of Burniat surfaces of degrees $K^2=5$, $4$ and $3$.

Submitted 20 September, 2023; originally announced September 2023.

MSC Class: 14D22; 14J29

arXiv:2309.10343 [pdf, ps, other]

Endpoint theory for the compactness of commutators

Authors: Dinghuai Wang, Xi Hu, Shuai Qi

Abstract: In this paper, we establish a Minkowski-type inequality for weak Lebesgue spaces, which allows us to obtain a characterization of relative compactness in these spaces. Furthermore, we are the first to investigate the compactness results of commutators at the endpoint. The paper provides a comprehensive study of the compactness properties of commutators of Calderón-Zygmund operators in Hardy and $L^{1}(\mathbb{R}^n)$ type spaces. Additionally, we provide factorization theorems for Hardy spaces in terms of singular integral operators in the $L^1(\mathbb{R}^n)$ space.

Submitted 19 September, 2023; originally announced September 2023.

Comments: 36 pages

MSC Class: Primary 46B50; 46E30; Secondary 42B20

arXiv:2309.02838 [pdf, other]

An SPH formulation for general plate and shell structures with finite deformation and large rotation

Authors: Dong Wu, Chi Zhang, Xiangyu Hu

Abstract: In this paper, we propose a reduced-dimensional smoothed particle hydrodynamics (SPH) formulation for quasi-static and dynamic analyses of plate and shell structures undergoing finite deformation and large rotation. By exploiting Uflyand-Mindlin plate theory, the present surface-particle formulation is able to resolve the thin structures by using only one layer of particles at the mid-surface. To resolve the geometric non-linearity and capture finite deformation and large rotation, two reduced-dimensional linear-reproducing correction matrices are introduced, and weighted non-singularity conversions between the rotation angle and pseudo normal are formulated. A new non-isotropic Kelvin-Voigt damping is proposed especially for both thin and moderately thick plate and shell structures to increase the numerical stability. In addition, a shear-scaled momentum-conserving hourglass control algorithm with an adaptive limiter is introduced to suppress the mismatches between the particle position and pseudo normal and those estimated with the deformation gradient. A comprehensive set of test problems, for which the analytical or numerical results from literature or those of the volume-particle SPH model are available for quantitative and qualitative comparison, are examined to demonstrate the accuracy and stability of the present method.

Submitted 6 September, 2023; originally announced September 2023.

Comments: 62 pages and 25 figures

arXiv:2308.13123 [pdf]

Multiscale modeling of thermal properties in Polyurethane incorporated with phase change materials composites: A case study

Authors: Bokai Liu, Weizhuo Lu, Xiaoyue Hu, Chao Zhang, Cuixia Wang, Yilin Qu, Thomas Olofsson

Abstract: Polyurethane (PU) is an ideal thermal insulation material due to its excellent thermal properties. The incorporation of Phase Change Materials (PCMs) capsules into Polyurethane (PU) has been shown to be effective in building envelopes. This design can significantly increase the stability of the indoor thermal environment and reduce the fluctuation of indoor air temperature. We develop a multiscale model of a PU-PCM foam composite and study the thermal conductivity of this material. Later, the design of materials can be optimized by obtaining thermal conductivity. We conduct a case study based on the performance of this optimized material to fully consider the thermal comfort of the occupants of a building envelope with the application of PU-PCMs composites in a single room. At the same time, we also predict the energy consumption of this case. All the outcomes show that this design is promising, enabling the passive design of building energy and significantly improving occupants' comfort.

Submitted 24 August, 2023; originally announced August 2023.

arXiv:2308.03764 [pdf]

Deployment of Leader-Follower Automated Vehicle Systems for Smart Work Zone Applications with a Queuing-based Traffic Assignment Approach

Authors: Qing Tang, Xianbiao Hu

Abstract: The emerging technology of the Autonomous Truck Mounted Attenuator (ATMA), a leader-follower style vehicle system, utilizes connected and automated vehicle capabilities to enhance safety during transportation infrastructure maintenance in work zones. However, the speed difference between ATMA vehicles and general vehicles creates a moving bottleneck that reduces capacity and increases queue length, resulting in additional delays. The different routes taken by ATMA cause diverse patterns of time-varying capacity drops, which may affect the user equilibrium traffic assignment and lead to different system costs. This manuscript focuses on optimizing the routing for ATMA vehicles in a network to minimize the system cost associated with the slow-moving operation. To achieve this, a queuing-based traffic assignment approach is proposed to identify the system cost caused by the ATMA system. A queuing-based time-dependent (QBTD) travel time function, considering capacity drop, is introduced and applied in the static user equilibrium traffic assignment problem, with a result of adding dynamic characteristics. Subsequently, we formulate the queuing-based traffic assignment problem and solve it using a modified path-based algorithm. The methodology is validated using a small-size and a large-size network and compared with two benchmark models to analyze the benefit of capacity drop modeling and QBTD travel time function. Furthermore, the approach is applied to quantify the impact of different routes on the traffic system and identify an optimal route for ATMA vehicles performing maintenance work. Finally, sensitivity analysis is conducted to explore how the impact changes with variations in traffic demand and capacity reduction.

Submitted 23 July, 2023; originally announced August 2023.

arXiv:2308.00363 [pdf, ps, other]

Anomalous smoothing effect on the incompressible Navier-Stokes-Fourier limit from Boltzmann with periodic velocity

Authors: Zhongyang Gu, Xin Hu, Tsuyoshi Yoneda

Abstract: Adding some nontrivial terms composed from a microstructure, we prove the existence of a global-in-time weak solution, whose enstrophy is bounded for all time, to an incompressible 3D Navier-Stokes-Fourier system for arbitrary initial data. The energy inequality cannot be expected to follow directly for this new system of equations. The main idea is to employ the hydrodynamic limit from the Boltzmann equation with periodic velocity and a specially designed collision operator.

Submitted 22 May, 2024; v1 submitted 1 August, 2023; originally announced August 2023.

Comments: 57 pages

MSC Class: 35Q20; 76P05; 35Q30; 76D05

arXiv:2308.00338 [pdf, other]

A symplectic dynamics approach to the spatial isosceles three-body problem

Authors: Xijun Hu, Lei Liu, Yuwei Ou, Pedro A. S. Salomão, Guowei Yu

Abstract: We study the spatial isosceles three-body problem from the perspective of Symplectic Dynamics. For certain choices of mass ratio, angular momentum, and energy, the dynamics on the energy surface is equivalent to a Reeb flow on the tight three-sphere. We find a Hopf link formed by the Euler orbit and a symmetric brake orbit, which spans an open book decomposition whose pages are annulus-like global surfaces of section. In the case of large mass ratios, the Hopf link is non-resonant, forcing the existence of infinitely many periodic orbits. The rotation number of the Euler orbit plays a fundamental role in the existence of periodic orbits and their symmetries. We explore such symmetries in the Hill region and show that the Euler orbit is negative hyperbolic for an open set of parameters while it can never be positive hyperbolic. Finally, we address convexity and determine for each parameter whether the energy surface is strictly convex, convex, or non-convex. Dynamical consequences of this fact are then discussed.

Submitted 1 August, 2023; originally announced August 2023.

Comments: 66 pages, 15 figures

arXiv:2307.10053 [pdf, other]

SGD-type Methods with Guaranteed Global Stability in Nonsmooth Nonconvex Optimization

Authors: Nachuan Xiao, Xiaoyin Hu, Kim-Chuan Toh

Abstract: In this paper, we focus on providing convergence guarantees for variants of the stochastic subgradient descent (SGD) method in minimizing nonsmooth nonconvex functions. We first develop a general framework to establish global stability for general stochastic subgradient methods, where the corresponding differential inclusion admits a coercive Lyapunov function. We prove that, with sufficiently small stepsizes and controlled noises, the iterates asymptotically stabilize around the stable set of the corresponding differential inclusion. Then we introduce a scheme for developing SGD-type methods with regularized update directions for the primal variables. Based on our developed framework, we prove the global stability of our proposed scheme under mild conditions. We further illustrate that our scheme yields variants of SGD-type methods, which enjoy guaranteed convergence in training nonsmooth neural networks. In particular, by employing the sign map to regularize the update directions, we propose a novel subgradient method named the Sign-map Regularized SGD method (SRSGD). Preliminary numerical experiments exhibit the high efficiency of SRSGD in training deep neural networks.

Submitted 13 May, 2024; v1 submitted 19 July, 2023; originally announced July 2023.

Comments: 36 pages

arXiv:2306.13570 [pdf, other]

Synchronous dynamic game on system observability considering one or two steps optimality

Authors: Yueyue Xu, Xiaoming Hu, Lin Wang

Abstract: This paper studies a system security problem in the context of observability based on a two-party non-cooperative asynchronous dynamic game. A system is assumed to be secure if it is not observable. Both the defender and the attacker have means to modify the dimension of the unobservable subspace, which is set as the value function. Utilizing tools from geometric control, we construct the best response set under one-step or two-step optimality to minimize or maximize the value function. We find that the best response sets under one-step optimality are not single-valued maps, resulting in a variety of game outcomes. In the dynamic game considering two-step optimality, the definitions and existence conditions of the lock and oscillation game modes are given. Finally, the best response under two-step optimality and the Stackelberg game equilibrium are compared.

Submitted 23 June, 2023; originally announced June 2023.

arXiv:2306.06614 [pdf, ps, other]

Cost-reduction implicit exponential Runge-Kutta methods for highly oscillatory systems

Authors: Xianfa Hu, Wansheng Wang, Bin Wang, Yonglei Fang

Abstract: In this paper, two novel classes of implicit exponential Runge-Kutta (ERK) methods are studied for solving highly oscillatory systems. First of all, we analyze the symplectic conditions of two kinds of exponential integrators, and present a first-order symplectic method. In order to solve highly oscillatory problems, highly accurate implicit ERK integrators (up to order four) are formulated by comparing the Taylor expansions of the numerical and exact solutions; it is shown that the order conditions of the two new kinds of exponential methods are identical to the order conditions of classical Runge-Kutta (RK) methods. Moreover, we investigate the linear stability properties of these exponential methods. Finally, numerical results not only present the long time energy preservation of the first-order symplectic method, but also illustrate the accuracy and efficiency of these formulated methods in comparison with standard ERK methods.

Submitted 4 December, 2023; v1 submitted 11 June, 2023; originally announced June 2023.

arXiv:2305.06073 [pdf, other]

Algebraic multigrid methods for metric-perturbed coupled problems

Authors: Ana Budisa, Xiaozhe Hu, Miroslav Kuchta, Kent-Andre Mardal, Ludmil Tomov Zikatanov

Abstract: We develop multilevel methods for interface-driven multiphysics problems that can be coupled across dimensions and where the complexity and strength of the interface coupling deteriorate the performance of standard methods. We focus on solvers based on aggregation-based algebraic multigrid methods with custom smoothers that preserve the coupling information on each coarse level. We prove that with the proper choice of subspace splitting we obtain uniform convergence in discretization and physical parameters in the two-level setting. Additionally, we show parameter robustness and scalability with regard to the number of degrees of freedom of the system on several numerical examples related to the biophysical processes in the brain, namely the electric signalling in excitable tissue modeled by bidomain, EMI and reduced EMI equations.

Submitted 10 May, 2023; originally announced May 2023.

MSC Class: 65F08; 65N55; 65S05 ACM Class: G.1.8

arXiv:2305.03938 [pdf, other]

Adam-family Methods for Nonsmooth Optimization with Convergence Guarantees

Authors: Nachuan Xiao, Xiaoyin Hu, Xin Liu, Kim-Chuan Toh

Abstract: In this paper, we present a comprehensive study on the convergence properties of Adam-family methods for nonsmooth optimization, especially in the training of nonsmooth neural networks. We introduce a novel two-timescale framework that adopts a two-timescale updating scheme, and prove its convergence properties under mild assumptions. Our proposed framework encompasses various popular Adam-family methods, providing convergence guarantees for these methods in training nonsmooth neural networks. Furthermore, we develop stochastic subgradient methods that incorporate gradient clipping techniques for training nonsmooth neural networks with heavy-tailed noise. Through our framework, we show that our proposed methods converge even when the evaluation noises are only assumed to be integrable. Extensive numerical experiments demonstrate the high efficiency and robustness of our proposed methods.
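As a rough illustration of the gradient clipping mentioned in this abstract, the sketch below shows generic clipping by Euclidean norm inside a plain subgradient step. It is not the authors' method: the function names `clip_by_norm` and `clipped_sgd_step`, and the parameters `threshold` and `step_size`, are assumptions made for this example only.

```python
import math


def clip_by_norm(grad, threshold):
    """Rescale grad so that its Euclidean norm is at most threshold.

    Generic norm clipping; hypothetical helper, not taken from the paper.
    """
    norm = math.sqrt(sum(g * g for g in grad))
    if norm <= threshold:
        # Already inside the ball of radius threshold: leave unchanged.
        return list(grad)
    scale = threshold / norm
    return [g * scale for g in grad]


def clipped_sgd_step(x, grad, step_size, threshold):
    """One plain (sub)gradient step using the clipped direction."""
    clipped = clip_by_norm(grad, threshold)
    return [xi - step_size * gi for xi, gi in zip(x, clipped)]
```

Clipping bounds the length of every update by `step_size * threshold`, which is the usual motivation for using it when the stochastic subgradients can have heavy tails.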

Submitted 19 February, 2024; v1 submitted 6 May, 2023; originally announced May 2023.

Comments: 53 pages

arXiv:2304.03470 [pdf, ps, other]

Stochastic Verification Theorems for Stochastic Control Problems of Reflected FBSDEs

Authors: Lu Liu, Xinlei Hu, Qingmeng Wei

Abstract: In this paper, the stochastic verification theorems for stochastic control problems of reflected forward-backward stochastic differential equations are studied. We carry out the work within the frameworks of classical and viscosity solutions. The sufficient conditions of verifying the controls to be optimal are given. We also construct the feedback optimal control laws from the classical and viscosity solutions of the associated Hamilton-Jacobi-Bellman equations with obstacles. Finally, we apply the theoretical results in two concrete examples. One is for the case of the classical solution, and the other is for the case of the viscosity solution.

Submitted 5 June, 2023; v1 submitted 7 April, 2023; originally announced April 2023.

MSC Class: 93E20; 35D40; 49K45

arXiv:2304.00103 [pdf, ps, other]

Parameter-free preconditioning for nearly-incompressible linear elasticity

Authors: James H Adler, Xiaozhe Hu, Yuwen Li, Ludmil T. Zikatanov

Abstract: It is well known that via the augmented Lagrangian method, one can solve Stokes' system by solving the nearly incompressible linear elasticity equation. In this paper, we show that the converse holds, and approximate the inverse of the linear elasticity operator with a convex linear combination of parameter-free operators. In such a way, we construct a uniform preconditioner for linear elasticity for all values of the Lamé parameter $λ\in [0,\infty)$. Numerical results confirm that by using inf-sup stable finite-element spaces for the solution of Stokes' equations, the proposed preconditioner is robust in $λ$.

Submitted 31 March, 2023; originally announced April 2023.

MSC Class: 65F08; 65N12; 65N22

arXiv:2303.14308 [pdf, ps, other]

Polynomial Optimization Relaxations for Generalized Semi-Infinite Programs

Authors: Xiaomeng Hu, Jiawang Nie

Abstract: This paper studies generalized semi-infinite programs (GSIPs) given by polynomials. We propose a hierarchy of polynomial optimization relaxations to solve them. They are based on Lagrange multiplier expressions and polynomial extensions. Moment-SOS relaxations are applied to solve the polynomial optimization. The convergence of this hierarchy is shown under certain conditions. In particular, the classical semi-infinite programs (SIPs) can be solved as a special case of GSIPs. We also study GSIPs that have convex infinity constraints and show that they can be solved exactly by a single polynomial optimization relaxation. The computational efficiency is demonstrated by extensive numerical results.

Submitted 24 March, 2023; originally announced March 2023.

Comments: 30 pages

arXiv:2303.08350 [pdf, ps, other]

Wiener's criterion for degenerate parabolic equations

Authors: Xi Hu, Lin Tang

Abstract: In this paper, we prove Wiener's criterion for parabolic equations with singular and degenerate coefficients. To be precise, we study the problem of the regularity of boundary points for the Dirichlet problem for degenerate parabolic equations, and give a geometric characterization of those boundary points that are regular.

Submitted 15 March, 2023; originally announced March 2023.

Comments: 65 pages

MSC Class: 35A08; 35B05; 35K65

arXiv:2303.06698 [pdf, ps, other]

Branch & Learn with Post-hoc Correction for Predict+Optimize with Unknown Parameters in Constraints

Authors: Xinyi Hu, Jasper C. H. Lee, Jimmy H. M. Lee

Abstract: Combining machine learning and constrained optimization, Predict+Optimize tackles optimization problems containing parameters that are unknown at the time of solving. Prior works focus on cases with unknowns only in the objectives. A new framework was recently proposed to cater for unknowns also in constraints by introducing a loss function, called Post-hoc Regret, that takes into account the cost of correcting an unsatisfiable prediction. Since Post-hoc Regret is non-differentiable, the previous work computes only its approximation. While the notion of Post-hoc Regret is general, its specific implementation is applicable to only packing and covering linear programming problems. In this paper, we first show how to compute Post-hoc Regret exactly for any optimization problem solvable by a recursive algorithm satisfying simple conditions. Experimentation demonstrates substantial improvement in the quality of solutions as compared to the earlier approximation approach. Furthermore, we show experimentally the empirical behavior of different combinations of correction and penalty functions used in the Post-hoc Regret of the same benchmarks. Results provide insights for defining the appropriate Post-hoc Regret in different application scenarios.

Submitted 12 March, 2023; originally announced March 2023.

arXiv:2212.12491 [pdf, ps, other]

Critical Fujita exponent for a semilinear heat equation with degenerate coefficients

Authors: Xi Hu, Lin Tang

Abstract: We prove the existence of a critical Fujita exponent for a non-homogeneous semilinear heat equation which involves degenerate coefficients. More precisely, in order to give a rather complete theory, we focus on two types of weights $w(x)=|x_1|^a$ or $w(x)=|x|^b$ where $a, b>0$ in a suitable range. The coefficients under consideration admit either a singularity at the origin or a line of singularities. In the latter case, the problem is related to the fractional Laplacian.

Submitted 9 November, 2022; originally announced December 2022.

Comments: 24 pages. arXiv admin note: substantial text overlap with arXiv:1711.11187 by other authors

MSC Class: 35B33; 35K58; 35K65

arXiv:2212.07234 [pdf, ps, other]

Two Ramsey-Turán numbers involving triangles

Authors: Xinyu Hu, Qizhong Lin

Abstract: Given integers $p, q\ge2$, we say that a graph $G$ is $(K_p,K_q)$-free if there exists a red/blue edge coloring of $G$ such that it contains neither a red $K_p$ nor a blue $K_q$. For a fixed function $f(n)$, the Ramsey-Turán number $RT(n,p,q,f(n))$ is the maximum number of edges in an $n$-vertex $(K_p,K_q)$-free graph with independence number at most $f(n)$. For any $δ>0$, let $ρ(p,q,δ) = \lim_{n \to \infty} \frac{RT(n,p,q,δn)}{n^2}$. We call $ρ(p,q) := \lim_{δ\to 0} ρ(p,q,δ)$ the Ramsey-Turán density of $K_p$ and $K_q$. In 1993, Erdős, Hajnal, Simonovits, Sós and Szemerédi proposed to determine the value of $ρ(3,q)$ for $q\ge3$, and they conjectured that for $q \ge 2$, $ρ(3,2q-1) = \frac{1}{2}\left(1 - \frac{1}{r(3,q) - 1}\right)$. Recently, Kim, Kim and Liu (2019) conjectured that for $q \ge 2$, $ρ(3,2q) = \frac{1}{2}\left(1 - \frac{1}{r(3,q)}\right)$. Erdős et al. (1993) determined $ρ(3,q)$ for $q=3,4,5$ and $ρ(4,4)$. There has been no progress on the Ramsey-Turán density $ρ(p,q)$ in the past thirty years. In this paper, we obtain $ρ(3,6)=\frac{5}{12}$ and $ρ(3,7)=\frac{7}{16}$. Moreover, we show that the corresponding asymptotically extremal structures are weakly stable, which answers a problem of Erdős et al. (1993) for the two cases.
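A quick arithmetic check (added here for orientation, not part of the abstract) shows that the two values reported above agree with the two conjectured formulas. It uses the classical Ramsey numbers $r(3,3)=6$ and $r(3,4)=9$, which are standard facts not stated in the abstract.

$$ρ(3,6) = ρ(3,2\cdot 3) = \frac{1}{2}\left(1 - \frac{1}{r(3,3)}\right) = \frac{1}{2}\left(1 - \frac{1}{6}\right) = \frac{5}{12},$$

$$ρ(3,7) = ρ(3,2\cdot 4 - 1) = \frac{1}{2}\left(1 - \frac{1}{r(3,4) - 1}\right) = \frac{1}{2}\left(1 - \frac{1}{8}\right) = \frac{7}{16}.$$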

Submitted 14 June, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

Comments: 30 pages. The proofs have been slightly revised, especially the second part

arXiv:2212.05164 [pdf, other]

Convolution theorems associated with quaternion linear canonical transform and applications

Authors: Xiaoxiao Hu, Dong Cheng, Kit Ian Kou

Abstract: Novel types of convolution operators for quaternion linear canonical transform (QLCT) are proposed. Type one and two are defined in the spatial and QLCT spectral domains, respectively. They are distinct in the quaternion space and are consistent once in complex or real space. Various types of convolution formulas are discussed. Consequently, the QLCT of the convolution of two quaternionic functions can be implemented by the product of their QLCTs, or the summation of the products of their QLCTs. As applications, correlation operators and theorems of the QLCT are derived. The proposed convolution formulas are used to solve Fredholm integral equations with special kernels. Some systems of second-order partial differential equations, which can be transformed into the second-order quaternion partial differential equations, can be solved by the convolution formulas as well. As a final point, we demonstrate that the convolution theorem facilitates the design of multiplicative filters.

Submitted 9 December, 2022; originally announced December 2022.

arXiv:2212.02698 [pdf, other]

CDOpt: A Python Package for a Class of Riemannian Optimization

Authors: Nachuan Xiao, Xiaoyin Hu, Xin Liu, Kim-Chuan Toh

Abstract: Optimization over the embedded submanifold defined by constraints $c(x) = 0$ has attracted much interest over the past few decades due to its wide applications in various areas. Plenty of related optimization packages have been developed based on Riemannian optimization approaches, which rely on some basic geometrical materials of Riemannian manifolds, including retractions, vector transports, etc. These geometrical materials can be challenging to determine in general. Existing packages only accommodate a few well-known manifolds whose geometrical materials are easily accessible. For other manifolds which are not contained in these packages, the users have to develop the geometrical materials by themselves. In addition, it is not always tractable to adopt advanced features from various state-of-the-art unconstrained optimization solvers to Riemannian optimization approaches. We introduce CDOpt (available at https://cdopt.github.io/), a user-friendly Python package for a class of Riemannian optimization problems. Based on constraint dissolving approaches, Riemannian optimization problems are transformed into their equivalent unconstrained counterparts in CDOpt. Therefore, solving Riemannian optimization problems through CDOpt directly benefits from various existing solvers and the rich expertise gained over decades for unconstrained optimization. Moreover, all the computations in CDOpt related to any manifold in question are conducted on its constraints expression, hence users can easily define new manifolds in CDOpt without any background on differential geometry. Furthermore, CDOpt extends the neural layers from PyTorch and Flax, thus allowing users to train manifold constrained neural networks directly by the solvers for unconstrained optimization. Extensive numerical experiments demonstrate that CDOpt is highly efficient and robust in solving various classes of Riemannian optimization problems.

Submitted 28 March, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

Comments: 31 pages

arXiv:2210.13274 [pdf, other]

HAZniCS -- Software Components for Multiphysics Problems

Authors: Ana Budisa, Xiaozhe Hu, Miroslav Kuchta, Kent-Andre Mardal, Ludmil Zikatanov

Abstract: We introduce the software toolbox HAZniCS for solving interface-coupled multiphysics problems. HAZniCS is a suite of modules that combines the well-known FEniCS framework for finite element discretization with the solver and graph library HAZmath. The focus of the paper is on the design and implementation of a pool of robust and efficient solver algorithms which tackle issues related to the complex interfacial coupling of the physical problems often encountered in applications in brain biomechanics. The robustness and efficiency of the numerical algorithms and methods are shown in several numerical examples, namely the Darcy-Stokes equations that model flow of cerebrospinal fluid in the human brain and the mixed-dimensional model of electrodiffusion in the brain tissue.

Submitted 6 November, 2022; v1 submitted 24 October, 2022; originally announced October 2022.

MSC Class: 65-04; 65N55; 65F08; 65H10 ACM Class: G.1.8; G.4

arXiv:2210.12407 [pdf, ps, other]

Two new families of fourth-order explicit exponential Runge--Kutta methods with four stages for first-order differential systems

Authors: Xianfa Hu, Yonglei Fang, Bin Wang

Abstract: In this paper, two new families of fourth-order explicit exponential Runge--Kutta (ERK) methods with four stages are studied for solving first-order differential systems $y'(t)+My(t)=f(y(t))$. By comparing the Taylor series of the exact solution, the order conditions of these ERK methods are derived, which are exactly identical to the order conditions of explicit Runge--Kutta methods, and these ERK methods reduce to classical Runge--Kutta methods once $M\rightarrow \mathbf{0}$. Moreover, we analyze the stability properties and the convergence of the new methods. Several numerical examples are implemented to illustrate the accuracy and efficiency of these ERK methods by comparison with standard exponential integrators.
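To make the remark about the limit $M\rightarrow \mathbf{0}$ concrete, the sketch below uses the simplest standard exponential integrator, the first-order exponential Euler method, on the scalar problem $y'(t)+My(t)=f(y(t))$. It is only an illustration of the general idea and is not one of the fourth-order methods studied in the paper; the function names `phi1`, `exponential_euler_step` and `explicit_euler_step` are assumptions made for this example.

```python
import math


def phi1(z):
    """phi_1(z) = (exp(z) - 1) / z, extended by its limit 1 at z = 0."""
    if abs(z) < 1e-8:
        # Two-term Taylor expansion avoids dividing by a tiny number.
        return 1.0 + z / 2.0
    return math.expm1(z) / z


def exponential_euler_step(y, h, M, f):
    """One exponential Euler step for the scalar ODE y' + M*y = f(y).

    The linear part is integrated exactly through exp(-h*M); only the
    nonlinear term f is frozen at the current value y.
    """
    z = -h * M
    return math.exp(z) * y + h * phi1(z) * f(y)


def explicit_euler_step(y, h, M, f):
    """Classical explicit Euler step for the same ODE, for comparison."""
    return y + h * (-M * y + f(y))
```

With `M = 0.0` the two step functions return the same value, which mirrors, at first order, the reduction to a classical Runge--Kutta method described in the abstract. When `f` is constant, the exponential step reproduces the exact solution of the linear problem.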

Submitted 18 June, 2024; v1 submitted 22 October, 2022; originally announced October 2022.

arXiv:2210.11175 [pdf, other]

Structure-Preserving Discretization of Fractional Vector Calculus using Discrete Exterior Calculus

Authors: Alon Jacobson, Xiaozhe Hu

Abstract: Fractional vector calculus is the building block of the fractional partial differential equations that model non-local or long-range phenomena, e.g., anomalous diffusion, fractional electromagnetism, and fractional advection-dispersion. In this work, we reformulate a type of fractional vector calculus that uses Caputo fractional partial derivatives and discretize this reformulation using discrete exterior calculus on a cubical complex in a structure-preserving way, meaning that the continuous-level properties $\operatorname{curl}^α\operatorname{grad}^α= \mathbf{0}$ and $\operatorname{div}^α\operatorname{curl}^α= 0$ hold exactly on the discrete level. We discuss important properties of our fractional discrete exterior derivatives and verify their second-order convergence in the root mean square error numerically. Our proposed discretization has the potential to provide accurate and stable numerical solutions to fractional partial differential equations and exactly preserve fundamental physics laws on the discrete level regardless of the mesh size.

Submitted 26 January, 2024; v1 submitted 20 October, 2022; originally announced October 2022.

Comments: 25 pages, 4 figures

MSC Class: 65M99 65N99 26A33 35R11

Journal ref: Computers & Mathematics with Applications, Volume 153, 1 January 2024, Pages 186-196

arXiv:2210.00685 [pdf, ps, other]

Two new classes of exponential Runge-Kutta integrators for efficiently solving stiff systems or highly oscillatory problems

Authors: Bin Wang, Xianfa Hu, Xinyuan Wu

Abstract: We note that stiff systems or differential equations that have highly oscillatory solutions cannot be solved efficiently using conventional methods. In this paper, we study two new classes of exponential Runge-Kutta (ERK) integrators for efficiently solving stiff systems or highly oscillatory problems. We first present a novel class of explicit modified version of exponential Runge-Kutta (MVERK) methods based on the order conditions. Furthermore, we consider a class of explicit simplified version of exponential Runge-Kutta (SVERK) methods. Numerical results demonstrate the high efficiency of the explicit MVERK integrators and SVERK methods derived in this paper compared with the well-known explicit ERK integrators for stiff systems or highly oscillatory problems in the literature.

Submitted 5 December, 2023; v1 submitted 2 October, 2022; originally announced October 2022.

arXiv:2209.11659 [pdf, other]

Rational approximation preconditioners for multiphysics problems

Authors: Ana Budisa, Xiaozhe Hu, Miroslav Kuchta, Kent-Andre Mardal, Ludmil Zikatanov

Abstract: We consider a class of mathematical models describing multiphysics phenomena interacting through interfaces. On such interfaces, the traces of the fields lie (approximately) in the range of a weighted sum of two fractional differential operators. We use a rational function approximation to precondition such operators. We first demonstrate the robustness of the approximation for ordinary functions given by weighted sums of fractional exponents. Additionally, we present more realistic examples utilizing the proposed preconditioning techniques in interface coupling between Darcy and Stokes equations.

Submitted 6 November, 2022; v1 submitted 23 September, 2022; originally announced September 2022.

MSC Class: 65F08; 65F60

arXiv:2209.03668 [pdf, other]

Predict+Optimize for Packing and Covering LPs with Unknown Parameters in Constraints

Authors: Xinyi Hu, Jasper C. H. Lee, Jimmy H. M. Lee

Abstract: Predict+Optimize is a recently proposed framework which combines machine learning and constrained optimization, tackling optimization problems that contain parameters that are unknown at solving time. The goal is to predict the unknown parameters and use the estimates to solve for an estimated optimal solution to the optimization problem. However, all prior works have focused on the case where unknown parameters appear only in the optimization objective and not the constraints, for the simple reason that if the constraints were not known exactly, the estimated optimal solution might not even be feasible under the true parameters. The contributions of this paper are two-fold. First, we propose a novel and practically relevant framework for the Predict+Optimize setting, but with unknown parameters in both the objective and the constraints. We introduce the notion of a correction function, and an additional penalty term in the loss function, modelling practical scenarios where an estimated optimal solution can be modified into a feasible solution after the true parameters are revealed, but at an additional cost. Second, we propose a corresponding algorithmic approach for our framework, which handles all packing and covering linear programs. Our approach is inspired by the prior work of Mandi and Guns, though with crucial modifications and re-derivations for our very different setting. Experimentation demonstrates the superior empirical performance of our method over classical approaches.

Submitted 8 September, 2022; originally announced September 2022.

arXiv:2208.13076 [pdf, other]

doi 10.1016/j.cam.2023.115449

Pressure-robust enriched Galerkin methods for the Stokes equations

Authors: Xiaozhe Hu, Seulip Lee, Lin Mu, Son-Young Yi

Abstract: In this paper, we present a pressure-robust enriched Galerkin (EG) scheme for solving the Stokes equations, which is an enhanced version of the EG scheme for the Stokes problem proposed in [Son-Young Yi, Xiaozhe Hu, Sanghyun Lee, James H. Adler, An enriched Galerkin method for the Stokes equations, Computers and Mathematics with Applications, accepted, 2022]. The pressure-robustness is achieved by employing a velocity reconstruction operator on the load vector on the right-hand side of the discrete system. An a priori error analysis proves that the velocity error is independent of the pressure and viscosity. We also propose and analyze a perturbed version of our pressure-robust EG method that allows for the elimination of the degrees of freedom corresponding to the discontinuous component of the velocity vector via static condensation. The resulting method can be viewed as a stabilized $H^1$-conforming $\mathbb{P}_1$-$\mathbb{P}_0$ method. Further, we consider efficient block preconditioners whose performances are independent of the viscosity. The theoretical results are confirmed through various numerical experiments in two and three dimensions.

Submitted 9 July, 2023; v1 submitted 27 August, 2022; originally announced August 2022.

MSC Class: 65N15; 65N30; 65F08

arXiv:2208.04169 [pdf, other]

A Stable Mimetic Finite-Difference Method for Convection-Dominated Diffusion Equations

Authors: James H. Adler, Casey Cavanaugh, Xiaozhe Hu, Andy Huang, Nathaniel Trask

Abstract: Convection-diffusion equations arise in a variety of applications such as particle transport, electromagnetics, and magnetohydrodynamics. Simulation of the convection-dominated regime for these problems, even with high-fidelity techniques, is particularly challenging due to the presence of sharp boundary layers and shocks causing jumps and discontinuities in the solution, and numerical issues such as loss of the maximum principle in the discretization. These complications cause instabilities, admitting large oscillations in the numerical solution when using traditional methods. Drawing connections to the simplex-averaged finite-element method (S. Wu and J. Xu, 2020), this paper develops a mimetic finite-difference (MFD) discretization using exponentially-averaged coefficients to overcome instability of the numerical solution as the diffusion coefficient approaches zero. The finite-element framework allows for transparent analysis of the MFD, such as proving well-posedness and deriving error estimates. Numerical tests are presented confirming the stability of the method and verifying the error estimates.

Submitted 23 May, 2023; v1 submitted 8 August, 2022; originally announced August 2022.

MSC Class: 35M12; 65N06; 65N30

arXiv:2208.00732 [pdf, ps, other]

An Improved Unconstrained Approach for Bilevel Optimization

Authors: Xiaoyin Hu, Nachuan Xiao, Xin Liu, Kim-Chuan Toh

Abstract: In this paper, we focus on the nonconvex-strongly-convex bilevel optimization problem (BLO). In this BLO, the objective function of the upper-level problem is nonconvex and possibly nonsmooth, and the lower-level problem is smooth and strongly convex with respect to the underlying variable $y$. We show that the feasible region of BLO is a Riemannian manifold. Then we transform BLO to its corresponding unconstrained constraint dissolving problem (CDB), whose objective function is explicitly formulated from the objective functions in BLO. We prove that BLO is equivalent to the unconstrained optimization problem CDB. Therefore, various efficient unconstrained approaches, together with their theoretical results, can be directly applied to BLO through CDB. We propose a unified framework for developing subgradient-based methods for CDB. Remarkably, we show that several existing efficient algorithms can fit the unified framework and be interpreted as descent algorithms for CDB. These examples further demonstrate the great potential of our proposed approach.

Submitted 23 December, 2022; v1 submitted 1 August, 2022; originally announced August 2022.

Comments: 27 pages, revised version

MSC Class: 15A18; 65F15; 65K05; 90C06

arXiv:2208.00418 [pdf, ps, other]

On the general Sombor index of unicyclic graphs with a given diameter

Authors: Xipeng Hu, Lingping Zhong

Abstract: The general Sombor index of $G$ is defined as $SO_\alpha(G)= \sum_{uv\in G}\left(d^2_{G}(u)+d^2_{G}(v)\right)^\alpha$. For $0<\alpha<1$, we have the upper bound of $SO_\alpha(G)$ on unicyclic graphs with a fixed diameter, and the extremal graph is also characterized.

Submitted 31 July, 2022; originally announced August 2022.

Comments: 12 pages, 5 figures
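
The definition in the abstract above can be illustrated with a minimal Python sketch. It is not taken from the paper: the function name `general_sombor_index`, the edge-list representation, and the sample graph are all assumptions made here for illustration, and the graph is assumed to be simple (no loops or repeated edges).

```python
def general_sombor_index(edges, alpha):
    """Sum (d(u)^2 + d(v)^2)^alpha over all edges uv of a simple graph.

    `edges` is a list of (u, v) pairs; this representation is an
    assumption of the sketch, not something specified in the abstract.
    """
    # Count vertex degrees from the edge list.
    degree = {}
    for u, v in edges:
        degree[u] = degree.get(u, 0) + 1
        degree[v] = degree.get(v, 0) + 1
    # Apply the defining formula edge by edge.
    return sum((degree[u] ** 2 + degree[v] ** 2) ** alpha for u, v in edges)


# Hypothetical unicyclic graph: a triangle 0-1-2 with a pendant vertex 3 on 0.
edges = [(0, 1), (1, 2), (2, 0), (0, 3)]
value_half = general_sombor_index(edges, 0.5)  # alpha in (0, 1), as in the abstract
value_one = general_sombor_index(edges, 1)     # alpha = 1, shown only for comparison
```

Here the degrees are 3, 2, 2, 1, so the four edge terms before exponentiation are 13, 8, 13, and 10.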

arXiv:2207.13025 [pdf, other]

The minimum degree of minimally $t$-tough graphs

Authors: Xiaomin Hu, Hui Ma, Weihua Yang

Abstract: A graph $G$ is minimally $t$-tough if the toughness of $G$ is $t$ and deletion of any edge from $G$ decreases its toughness. Katona et al. conjectured that the minimum degree of any minimally $t$-tough graph is $\lceil 2t\rceil$ and gave some upper bounds on the minimum degree of the minimally $t$-tough graphs in \cite{Katona, Gyula}. In this paper, we show that a minimally $1$-tough graph $G$ with girth $g\geq 5$ has minimum degree at most $\lfloor\frac{n}{g+1}\rfloor+g-1$, and a minimally $1$-tough graph with girth $4$ has minimum degree at most $\frac{n+6}{4}$. We also prove that the minimum degree of minimally $\frac{3}{2}$-tough claw-free graphs is $3$.

Submitted 17 June, 2022; originally announced July 2022.

Comments: 15 pages

arXiv:2207.10725 [pdf, other]

A Deep Neural Network/Meshfree Method for Solving Dynamic Two-phase Interface Problems

Authors: Xingwen Zhu, Xiaozhe Hu, Pengtao Sun

Abstract: In this paper, a meshfree method using the deep neural network (DNN) approach is developed for solving two kinds of dynamic two-phase interface problems governed by different dynamic partial differential equations on either side of the stationary interface with the jump and high-contrast coefficients. The first type of two-phase interface problem to be studied is the fluid-fluid (two-phase flow) interface problem modeled by Navier-Stokes equations with high-contrast physical parameters across the interface. The second one belongs to fluid-structure interaction (FSI) problems modeled by Navier-Stokes equations on one side of the interface and the structural equation on the other side of the interface; the fluid and the structure interact with each other via the kinematic and dynamic interface conditions across the interface. The DNN/meshfree method is respectively developed for the above two-phase interface problems by representing solutions of PDEs using the DNNs' structure and reformulating the dynamic interface problem as a least-squares minimization problem based upon a space-time sampling point set. Approximation error analyses are also carried out for each kind of interface problem, which reveal an intrinsic strategy for efficiently building a sampling-point training dataset to obtain a more accurate DNN approximation. In addition, compared with traditional discretization approaches, the proposed DNN/meshfree method and its error analysis technique can be smoothly extended to many other dynamic interface problems with fixed interfaces. Numerical experiments are conducted to illustrate the accuracies of the proposed DNN/meshfree method for the presented two-phase interface problems. Theoretical results are validated to some extent through three numerical examples.

Submitted 21 July, 2022; originally announced July 2022.

arXiv:2207.07847 [pdf, other]

Solving Graph Laplacians via Multilevel Sparsifiers

Authors: Xiaozhe Hu, Junyuan Lin

Abstract: We consider effective preconditioners for solving Laplacians of general weighted graphs. Theoretically, spectral sparsifiers (SSs) provide preconditioners of optimal computational complexity. However, they are not easy to use for real-world applications due to the implementation complications. Multigrid (MG) methods, on the contrary, are computationally efficient but lack theoretical justification. To bridge the gap between theory and practice, we adopt ideas of MG and SS methods and propose preconditioners that can be used in practice with theoretical guarantees. We expand the original graph based on a multilevel structure to obtain an equivalent expanded graph. Although the expanded graph has a low diameter, a favorable property for constructing SSs, it has negatively weighted edges, which is an unfavorable property for the SSs. We design an algorithm to properly eliminate the negatively weighted edges and prove that the resulting expanded graph with positively weighted edges is spectrally equivalent to the expanded graph and thus to the original graph. Due to the low-diameter property of the positively-weighted expanded graph preconditioner (PEGP), existing algorithms for finding SSs can be easily applied. To demonstrate the advantage of working with the PEGP, we propose a type of SS, multilevel sparsifier preconditioner (MSP), that can be constructed in an easy and deterministic manner. We provide some preliminary numerical experiments to verify our theoretical findings and illustrate the practical effectiveness of PEGP and MSP in real-world applications.

Submitted 29 August, 2022; v1 submitted 16 July, 2022; originally announced July 2022.

MSC Class: 05C50; 05C85; 65F10; 68R10

Showing 1–50 of 252 results for author: Hu, X