-
Orthogonal Constrained Neural Networks for Solving Structured Inverse Eigenvalue Problems
Authors:
Shuai Zhang,
Xuelian Jiang,
Hao Qian,
Yingxiang Xu
Abstract:
This paper introduces a novel neural network for efficiently solving Structured Inverse Eigenvalue Problems (SIEPs). The main contributions lie in two aspects: firstly, a unified framework is proposed that can handle various SIEPs instances. Particularly, an innovative method for handling nonnegativity constraints is devised using the ReLU function. Secondly, a novel neural network based on multil…
▽ More
This paper introduces a novel neural network for efficiently solving Structured Inverse Eigenvalue Problems (SIEPs). The main contributions lie in two aspects: firstly, a unified framework is proposed that can handle various SIEPs instances. Particularly, an innovative method for handling nonnegativity constraints is devised using the ReLU function. Secondly, a novel neural network based on multilayer perceptrons, utilizing the Stiefel layer, is designed to efficiently solve SIEP. By incorporating the Stiefel layer through matrix orthogonal decomposition, the orthogonality of similarity transformations is ensured, leading to accurate solutions for SIEPs. Hence, we name this new network Stiefel Multilayer Perceptron (SMLP). Furthermore, SMLP is an unsupervised learning approach with a lightweight structure that is easy to train. Several numerical tests from literature and engineering domains demonstrate the efficiency of SMLP.
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
City-LEO: Toward Transparent City Management Using LLM with End-to-End Optimization
Authors:
Zihao Jiao,
Mengyi Sha,
Haoyu Zhang,
Xinyu Jiang,
Wei Qi
Abstract:
Existing operations research (OR) models and tools play indispensable roles in smart-city operations, yet their practical implementation is limited by the complexity of modeling and deficiencies in optimization proficiency. To generate more relevant and accurate solutions to users' requirements, we propose a large language model (LLM)-based agent ("City-LEO") that enhances the efficiency and trans…
▽ More
Existing operations research (OR) models and tools play indispensable roles in smart-city operations, yet their practical implementation is limited by the complexity of modeling and deficiencies in optimization proficiency. To generate more relevant and accurate solutions to users' requirements, we propose a large language model (LLM)-based agent ("City-LEO") that enhances the efficiency and transparency of city management through conversational interactions. Specifically, to accommodate diverse users' requirements and enhance computational tractability, City-LEO leverages LLM's logical reasoning capabilities on prior knowledge to scope down large-scale optimization problems efficiently. In the human-like decision process, City-LEO also incorporates End-to-end (E2E) model to synergize the prediction and optimization. The E2E framework be conducive to co** with environmental uncertainties and involving more query-relevant features, and then facilitates transparent and interpretable decision-making process. In case study, we employ City-LEO in the operations management of e-bike sharing (EBS) system. The numerical results demonstrate that City-LEO has superior performance when benchmarks against the full-scale optimization problem. With less computational time, City-LEO generates more satisfactory and relevant solutions to the users' requirements, and achieves lower global suboptimality without significantly compromising accuracy. In a broader sense, our proposed agent offers promise to develop LLM-embedded OR tools for smart-city operations management.
△ Less
Submitted 17 June, 2024; v1 submitted 16 June, 2024;
originally announced June 2024.
-
Lie Symmetry Net: Preserving Conservation Laws in Modelling Financial Market Dynamics via Differential Equations
Authors:
Xuelian Jiang,
Tongtian Zhu,
Can Wang,
Yingxiang Xu,
Fengxiang He
Abstract:
This paper employs a novel Lie symmetry-based framework to model the intrinsic symmetries within financial market. Specifically, we introduce {\it Lie symmetry net} (LSN), which characterises the Lie symmetry of the differential equations (DE) estimating financial market dynamics, such as the Black-Scholes equation and the Vašiček equation. To simulate these differential equations in a symmetry-aw…
▽ More
This paper employs a novel Lie symmetry-based framework to model the intrinsic symmetries within financial market. Specifically, we introduce {\it Lie symmetry net} (LSN), which characterises the Lie symmetry of the differential equations (DE) estimating financial market dynamics, such as the Black-Scholes equation and the Vašiček equation. To simulate these differential equations in a symmetry-aware manner, LSN incorporates a Lie symmetry risk derived from the conservation laws associated with the Lie symmetry operators of the target differential equations. This risk measures how well the Lie symmetry is realised and guides the training of LSN under the structural risk minimisation framework. Extensive numerical experiments demonstrate that LSN effectively realises the Lie symmetry and achieves an error reduction of more than {\it one order of magnitude} compared to state-of-the-art methods. The code is available at \href{https://github.com/Jxl163/LSN_code}{https://github.com/Jxl163/LSN$\_$code}.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
Universality of AdaGrad Stepsizes for Stochastic Optimization: Inexact Oracle, Acceleration and Variance Reduction
Authors:
Anton Rodomanov,
Xiaowen Jiang,
Sebastian Stich
Abstract:
We present adaptive gradient methods (both basic and accelerated) for solving convex composite optimization problems in which the main part is approximately smooth (a.k.a. $(δ, L)$-smooth) and can be accessed only via a (potentially biased) stochastic gradient oracle. This setting covers many interesting examples including Hölder smooth problems and various inexact computations of the stochastic g…
▽ More
We present adaptive gradient methods (both basic and accelerated) for solving convex composite optimization problems in which the main part is approximately smooth (a.k.a. $(δ, L)$-smooth) and can be accessed only via a (potentially biased) stochastic gradient oracle. This setting covers many interesting examples including Hölder smooth problems and various inexact computations of the stochastic gradient. Our methods use AdaGrad stepsizes and are adaptive in the sense that they do not require knowing any problem-dependent constants except an estimate of the diameter of the feasible set but nevertheless achieve the best possible convergence rates as if they knew the corresponding constants. We demonstrate that AdaGrad stepsizes work in a variety of situations by proving, in a unified manner, three types of new results. First, we establish efficiency guarantees for our methods in the classical setting where the oracle's variance is uniformly bounded. We then show that, under more refined assumptions on the variance, the same methods without any modifications enjoy implicit variance reduction properties allowing us to express their complexity estimates in terms of the variance only at the minimizer. Finally, we show how to incorporate explicit SVRG-type variance reduction into our methods and obtain even faster algorithms. In all three cases, we present both basic and accelerated algorithms achieving state-of-the-art complexity bounds. As a direct corollary of our results, we obtain universal stochastic gradient methods for Hölder smooth problems which can be used in all situations.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
A Knowledge-driven Memetic Algorithm for the Energy-efficient Distributed Homogeneous Flow Shop Scheduling Problem
Authors:
Yunbao Xu,
Xuemei Jiang,
Jun Li,
Lining Xing,
Yanjie Song
Abstract:
The reduction of carbon emissions in the manufacturing industry holds significant importance in achieving the national "double carbon" target. Ensuring energy efficiency is a crucial factor to be incorporated into future generation manufacturing systems. In this study, energy consumption is considered in the distributed homogeneous flow shop scheduling problem (DHFSSP). A knowledge-driven memetic…
▽ More
The reduction of carbon emissions in the manufacturing industry holds significant importance in achieving the national "double carbon" target. Ensuring energy efficiency is a crucial factor to be incorporated into future generation manufacturing systems. In this study, energy consumption is considered in the distributed homogeneous flow shop scheduling problem (DHFSSP). A knowledge-driven memetic algorithm (KDMA) is proposed to address the energy-efficient DHFSSP (EEDHFSSP). KDMA incorporates a collaborative initialization strategy to generate high-quality initial populations. Furthermore, several algorithmic improvements including update strategy, local search strategy, and carbon reduction strategy are employed to improve the search performance of the algorithm. The effectiveness of KDMA in solving EEDHFSSP is verified through extensive simulation experiments. It is evident that KDMA outperforms many state-of-the-art algorithms across various evaluation aspects.
△ Less
Submitted 27 April, 2024;
originally announced April 2024.
-
Federated Optimization with Doubly Regularized Drift Correction
Authors:
Xiaowen Jiang,
Anton Rodomanov,
Sebastian U. Stich
Abstract:
Federated learning is a distributed optimization paradigm that allows training machine learning models across decentralized devices while kee** the data localized. The standard method, FedAvg, suffers from client drift which can hamper performance and increase communication costs over centralized methods. Previous works proposed various strategies to mitigate drift, yet none have shown uniformly…
▽ More
Federated learning is a distributed optimization paradigm that allows training machine learning models across decentralized devices while kee** the data localized. The standard method, FedAvg, suffers from client drift which can hamper performance and increase communication costs over centralized methods. Previous works proposed various strategies to mitigate drift, yet none have shown uniformly improved communication-computation trade-offs over vanilla gradient descent.
In this work, we revisit DANE, an established method in distributed optimization. We show that (i) DANE can achieve the desired communication reduction under Hessian similarity constraints. Furthermore, (ii) we present an extension, DANE+, which supports arbitrary inexact local solvers and has more freedom to choose how to aggregate the local updates. We propose (iii) a novel method, FedRed, which has improved local computational complexity and retains the same communication complexity compared to DANE/DANE+. This is achieved by using doubly regularized drift correction.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Accelerating Gradient Tracking with Periodic Global Averaging
Authors:
Shu**g Feng,
Xin Jiang
Abstract:
Decentralized optimization algorithms have recently attracted increasing attention due to its wide applications in all areas of science and engineering. In these algorithms, a collection of agents collaborate to minimize the average of a set of heterogeneous cost functions in a decentralized manner. State-of-the-art decentralized algorithms like Gradient Tracking (GT) and Exact Diffusion (ED) invo…
▽ More
Decentralized optimization algorithms have recently attracted increasing attention due to its wide applications in all areas of science and engineering. In these algorithms, a collection of agents collaborate to minimize the average of a set of heterogeneous cost functions in a decentralized manner. State-of-the-art decentralized algorithms like Gradient Tracking (GT) and Exact Diffusion (ED) involve communication at each iteration. Yet, communication between agents is often expensive, resource intensive, and can be very slow. To this end, several strategies have been developed to balance between communication overhead and convergence rate of decentralized methods. In this paper, we introduce GT-PGA, which incorporates~GT with periodic global averaging. With the additional PGA, the influence of poor network connectivity in the GT algorithm can be compensated or controlled by a careful selection of the global averaging period. Under the stochastic, nonconvex setup, our analysis quantifies the crucial trade-off between the connectivity of network topology and the PGA period. Thus, with a suitable design of the PGA period, GT-PGA improves the convergence rate of vanilla GT. Numerical experiments are conducted to support our theory, and simulation results reveal that the proposed GT-PGA accelerates practical convergence, especially when the network is sparse.
△ Less
Submitted 17 March, 2024;
originally announced March 2024.
-
Model-free $H_{\infty}$ control of Itô stochastic system via off-policy reinforcement learning
Authors:
**g Guo **g Guo,
Xiushan Jiang,
Weihai Zhang
Abstract:
The stochastic $H_{\infty}$ control is studied for a linear stochastic Itô system with an unknown system model. The linear stochastic $H_{\infty}$ control issue is known to be transformable into the problem of solving a so-called generalized algebraic Riccati equation (GARE), which is a nonlinear equation that is typically difficult to solve analytically. Worse, model-based techniques cannot be ut…
▽ More
The stochastic $H_{\infty}$ control is studied for a linear stochastic Itô system with an unknown system model. The linear stochastic $H_{\infty}$ control issue is known to be transformable into the problem of solving a so-called generalized algebraic Riccati equation (GARE), which is a nonlinear equation that is typically difficult to solve analytically. Worse, model-based techniques cannot be utilized to approximately solve a GARE when an accurate system model is unavailable or prohibitively expensive to construct in reality. To address these issues, an off-policy reinforcement learning (RL) approach is presented to learn the solution of a GARE from real system data rather than a system model; its convergence is demonstrated, and the robustness of RL to errors in the learning process is investigated. In the off-policy RL approach, the system data may be created with behavior policies rather than the target policies, which is highly significant and promising for use in actual systems. Finally, the proposed off-policy RL approach is validated on a stochastic linear F-16 aircraft system.
△ Less
Submitted 7 March, 2024;
originally announced March 2024.
-
An eternal hypersurface flow arising in centro-affine geometry
Authors:
Xinjie Jiang,
Changzheng Qu,
Yun Yang
Abstract:
In this paper, the existence and uniqueness for a specific centro-affine invariant hypersurface flow in $R^{n+1}$ are studied, and the corresponding evolutionary processes in both centro-affine and Euclidean settings are explored. It turns out that the flow exhibits similar properties as the standard heat flow. In addition, the long time existence of the flow is investigated, which asserts that th…
▽ More
In this paper, the existence and uniqueness for a specific centro-affine invariant hypersurface flow in $R^{n+1}$ are studied, and the corresponding evolutionary processes in both centro-affine and Euclidean settings are explored. It turns out that the flow exhibits similar properties as the standard heat flow. In addition, the long time existence of the flow is investigated, which asserts that the hypersurface governed by the flow converges asymptotically toward an ellipsoid via systematically investigating evolutions of the centro-affine invariants. Furthermore, the classification of the eternal solutions for the flow is provided.
△ Less
Submitted 2 March, 2024;
originally announced March 2024.
-
Improved uniform error bounds for long-time dynamics of the high-dimensional nonlinear space fractional sine-Gordon equation with weak nonlinearity
Authors:
Junqing Jia,
Xiaoqing Chi,
Xiaoyun Jiang
Abstract:
In this paper, we derive the improved uniform error bounds for the long-time dynamics of the $d$-dimensional $(d=2,3)$ nonlinear space fractional sine-Gordon equation (NSFSGE). The nonlinearity strength of the NSFSGE is characterized by $\varepsilon^2$ where $0<\varepsilon \le 1$ is a dimensionless parameter. The second-order time-splitting method is applied to the temporal discretization and the…
▽ More
In this paper, we derive the improved uniform error bounds for the long-time dynamics of the $d$-dimensional $(d=2,3)$ nonlinear space fractional sine-Gordon equation (NSFSGE). The nonlinearity strength of the NSFSGE is characterized by $\varepsilon^2$ where $0<\varepsilon \le 1$ is a dimensionless parameter. The second-order time-splitting method is applied to the temporal discretization and the Fourier pseudo-spectral method is used for the spatial discretization. To obtain the explicit relation between the numerical errors and the parameter $\varepsilon$, we introduce the regularity compensation oscillation technique to the convergence analysis of fractional models. Then we establish the improved uniform error bounds $O\left(\varepsilon^2 τ^2\right)$ for the semi-discretization scheme and $O\left(h^m+\varepsilon^2 τ^2\right)$ for the full-discretization scheme up to the long time at $O(1/\varepsilon^2)$. Further, we extend the time-splitting Fourier pseudo-spectral method to the complex NSFSGE as well as the oscillatory complex NSFSGE, and the improved uniform error bounds for them are also given. Finally, extensive numerical examples in two-dimension or three-dimension are provided to support the theoretical analysis. The differences in dynamic behaviors between the fractional sine-Gordon equation and classical sine-Gordon equation are also discussed.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
Sparse factorization of the square all-ones matrix of arbitrary order
Authors:
Xin Jiang,
Edward Duc Hien Nguyen,
César A. Uribe,
Bicheng Ying
Abstract:
In this paper, we study sparse factorization of the (scaled) square all-ones matrix $J$ of arbitrary order. We introduce the concept of hierarchically banded matrices and propose two types of hierarchically banded factorization of $J$: the reduced hierarchically banded (RHB) factorization and the doubly stochastic hierarchically banded (DSHB) factorization. Based on the DSHB factorization, we prop…
▽ More
In this paper, we study sparse factorization of the (scaled) square all-ones matrix $J$ of arbitrary order. We introduce the concept of hierarchically banded matrices and propose two types of hierarchically banded factorization of $J$: the reduced hierarchically banded (RHB) factorization and the doubly stochastic hierarchically banded (DSHB) factorization. Based on the DSHB factorization, we propose the sequential doubly stochastic (SDS) factorization, in which~$J$ is decomposed as a product of sparse, doubly stochastic matrices. Finally, we discuss the application of the proposed sparse factorizations to the decentralized average consensus problem and decentralized optimization.
△ Less
Submitted 25 January, 2024;
originally announced January 2024.
-
A continuous cusp closing process for negative Kähler-Einstein metrics
Authors:
Xin Fu,
Hans-Joachim Hein,
Xumin Jiang
Abstract:
We give an example of a family of smooth complex algebraic surfaces of degree $6$ in $\mathbb{CP}^3$ develo** an isolated elliptic singularity. We show via a gluing construction that the unique Kähler-Einstein metrics of Ricci curvature $-1$ on these sextics develop a complex hyperbolic cusp in the limit, and that near the tip of the forming cusp a Tian-Yau gravitational instanton bubbles off.
We give an example of a family of smooth complex algebraic surfaces of degree $6$ in $\mathbb{CP}^3$ develo** an isolated elliptic singularity. We show via a gluing construction that the unique Kähler-Einstein metrics of Ricci curvature $-1$ on these sextics develop a complex hyperbolic cusp in the limit, and that near the tip of the forming cusp a Tian-Yau gravitational instanton bubbles off.
△ Less
Submitted 21 January, 2024;
originally announced January 2024.
-
Boundedness and moduli of traditional stable minimal models
Authors:
Xiaowei Jiang
Abstract:
For good minimal models with semi-log canonical (slc) singularities, polarized by effective divisors that are relatively ample over the bases of Iitaka fibration, Birkar proves that they belong to a bounded family after fixing appropriate numerical invariants recently. Subsequently, he constructs their projective coarse moduli spaces. In this paper, we consider good minimal models with only Kawama…
▽ More
For good minimal models with semi-log canonical (slc) singularities, polarized by effective divisors that are relatively ample over the bases of Iitaka fibration, Birkar proves that they belong to a bounded family after fixing appropriate numerical invariants recently. Subsequently, he constructs their projective coarse moduli spaces. In this paper, we consider good minimal models with only Kawamata log terminal (klt) singularities but polarized by possibly non-effective divisors. We prove that they still belong to a bounded family after fixing the same invariants. As an application, we construct separated coarse moduli spaces for klt good minimal models polarized by line bundles.
△ Less
Submitted 17 January, 2024; v1 submitted 6 December, 2023;
originally announced December 2023.
-
Model-free Reinforcement Learning for ${H_{2}/H_{\infty}}$ Control of Stochastic Discrete-time Systems
Authors:
Xiushan Jiang,
Li Wang,
Dongya Zhao,
Ling Shi
Abstract:
This paper proposes a reinforcement learning (RL) algorithm for infinite horizon $\rm {H_{2}/H_{\infty}}$ problem in a class of stochastic discrete-time systems, rather than using a set of coupled generalized algebraic Riccati equations (GAREs). The algorithm is able to learn the optimal control policy for the system even when its parameters are unknown. Additionally, the paper explores the effect…
▽ More
This paper proposes a reinforcement learning (RL) algorithm for infinite horizon $\rm {H_{2}/H_{\infty}}$ problem in a class of stochastic discrete-time systems, rather than using a set of coupled generalized algebraic Riccati equations (GAREs). The algorithm is able to learn the optimal control policy for the system even when its parameters are unknown. Additionally, the paper explores the effect of detection noise as well as the convergence of the algorithm, and shows that the control policy is admissible after a finite number of iterations. The algorithm is also able to handle multi-objective control problems within stochastic fields. Finally, the algorithm is applied to the F-16 aircraft autopilot with multiplicative noise.
△ Less
Submitted 25 November, 2023;
originally announced November 2023.
-
On Graphs with Finite-Time Consensus and Their Use in Gradient Tracking
Authors:
Edward Duc Hien Nguyen,
Xin Jiang,
Bicheng Ying,
César A. Uribe
Abstract:
This paper studies sequences of graphs satisfying the finite-time consensus property (i.e., iterating through such a finite sequence is equivalent to performing global or exact averaging) and their use in Gradient Tracking. We provide an explicit weight matrix representation of the studied sequences and prove their finite-time consensus property. Moreover, we incorporate the studied finite-time co…
▽ More
This paper studies sequences of graphs satisfying the finite-time consensus property (i.e., iterating through such a finite sequence is equivalent to performing global or exact averaging) and their use in Gradient Tracking. We provide an explicit weight matrix representation of the studied sequences and prove their finite-time consensus property. Moreover, we incorporate the studied finite-time consensus topologies into Gradient Tracking and present a new algorithmic scheme called Gradient Tracking for Finite-Time Consensus Topologies (GT-FT). We analyze the new scheme for nonconvex problems with stochastic gradient estimates. Our analysis shows that the convergence rate of GT-FT does not depend on the heterogeneity of the agents' functions or the connectivity of any individual graph in the topology sequence. Furthermore, owing to the sparsity of the graphs, GT-FT requires lower communication costs than Gradient Tracking using the static counterpart of the topology sequence.
△ Less
Submitted 14 November, 2023; v1 submitted 2 November, 2023;
originally announced November 2023.
-
A nonmonotone proximal quasi-Newton method for multiobjective optimization
Authors:
Xiaoxue Jiang
Abstract:
This paper proposes a nonmonotone proximal quasi-Newton algorithm for unconstrained convex multiobjective composite optimization problems. To design the search direction, we minimize the max-scalarization of the variations of the Hessian approximations and nonsmooth terms. Subsequently, a nonmonotone line search is used to determine the step size, we allow for the decrease of a convex combination…
▽ More
This paper proposes a nonmonotone proximal quasi-Newton algorithm for unconstrained convex multiobjective composite optimization problems. To design the search direction, we minimize the max-scalarization of the variations of the Hessian approximations and nonsmooth terms. Subsequently, a nonmonotone line search is used to determine the step size, we allow for the decrease of a convex combination of recent function values. Under the assumption of strong convexity of the objective function, we prove that the sequence generated by this method converges to a Pareto optimal. Furthermore, based on the strong convexity, Hessian continuity and Dennis-Moré criterion, we use a basic inequality to derive the local superlinear convergence rate of the proposed algorithm. Numerical experiments results demonstrate the feasibility and effectiveness of the proposed algorithm on a set of test problems.
△ Less
Submitted 2 October, 2023;
originally announced October 2023.
-
The singular sets of degenerate and nonlocal elliptic equations on Poincaré-Einstein manifolds
Authors:
Xumin Jiang,
Yannick Sire,
Ruobing Zhang
Abstract:
The main objects of this paper include some degenerate and nonlocal elliptic operators which naturally arise in the conformal invariant theory of Poincaré-Einstein manifolds. These operators generally reflect the correspondence between the Riemannian geometry of a complete Poincaré-Einstein manifold and the conformal geometry of its associated conformal infinity. In this setting, we develop the qu…
▽ More
The main objects of this paper include some degenerate and nonlocal elliptic operators which naturally arise in the conformal invariant theory of Poincaré-Einstein manifolds. These operators generally reflect the correspondence between the Riemannian geometry of a complete Poincaré-Einstein manifold and the conformal geometry of its associated conformal infinity. In this setting, we develop the quantitative differentiation theory that includes quantitative stratification for the singular set and Minkowski type estimates for the (quantitatively) stratified singular sets. All these, together with a new $ε$-regularity result for degenerate/singular elliptic operators on Poincaré-Einstein manifolds, lead to uniform Hausdorff measure estimates for the singular sets. Furthermore, the main results in this paper provide a delicate synergy between the geometry of Poincaré-Einstein manifolds and the elliptic theory of associated degenerate elliptic operators.
△ Less
Submitted 18 September, 2023;
originally announced September 2023.
-
Fast time-step** discontinuous Galerkin method for the subdiffusion equation
Authors:
Hui Zhang,
Fanhai Zeng,
Xiaoyun Jiang,
Zhimin Zhang
Abstract:
The nonlocality of the fractional operator causes numerical difficulties for long time computation of the time-fractional evolution equations. This paper develops a high-order fast time-step** discontinuous Galerkin finite element method for the time-fractional diffusion equations, which saves storage and computational time. The optimal error estimate…
▽ More
The nonlocality of the fractional operator causes numerical difficulties for long time computation of the time-fractional evolution equations. This paper develops a high-order fast time-step** discontinuous Galerkin finite element method for the time-fractional diffusion equations, which saves storage and computational time. The optimal error estimate $O(N^{-p-1} + h^{m+1} + \varepsilon N^{rα})$ of the current time-step** discontinuous Galerkin method is rigorous proved, where $N$ denotes the number of time intervals, $p$ is the degree of polynomial approximation on each time subinterval, $h$ is the maximum space step, $r\ge1$, $m$ is the order of finite element space, and $\varepsilon>0$ can be arbitrarily small. Numerical simulations verify the theoretical analysis.
△ Less
Submitted 6 September, 2023;
originally announced September 2023.
-
Resolution-independent generative models based on operator learning for physics-constrained Bayesian inverse problems
Authors:
Xinchao Jiang,
Xin Wang,
Ziming Wen,
Hu Wang
Abstract:
The Bayesian inference approach is widely used to tackle inverse problems due to its versatile and natural ability to handle ill-posedness. However, it often faces challenges when dealing with situations involving continuous fields or large-resolution discrete representations (high-dimensional). Moreover, the prior distribution of unknown parameters is commonly difficult to be determined. In this…
▽ More
The Bayesian inference approach is widely used to tackle inverse problems due to its versatile and natural ability to handle ill-posedness. However, it often faces challenges when dealing with situations involving continuous fields or large-resolution discrete representations (high-dimensional). Moreover, the prior distribution of unknown parameters is commonly difficult to be determined. In this study, an Operator Learning-based Generative Adversarial Network (OL-GAN) is proposed and integrated into the Bayesian inference framework to handle these issues. Unlike most Bayesian approaches, the distinctive characteristic of the proposed method is to learn the joint distribution of parameters and responses. By leveraging the trained generative model, the posteriors of the unknown parameters can theoretically be approximated by any sampling algorithm (e.g., Markov Chain Monte Carlo, MCMC) in a low-dimensional latent space shared by the components of the joint distribution. The latent space is typically a simple and easy-to-sample distribution (e.g., Gaussian, uniform), which significantly reduces the computational cost associated with the Bayesian inference while avoiding prior selection concerns. Furthermore, incorporating operator learning enables resolution-independent in the generator. Predictions can be obtained at desired coordinates, and inversions can be performed even if the observation data are misaligned with the training data. Finally, the effectiveness of the proposed method is validated through several numerical experiments.
△ Less
Submitted 25 August, 2023;
originally announced August 2023.
-
On the Convergence of Newton-type Proximal Gradient Method for Multiobjective Optimization Problems
Authors:
Jian Chen,
Xiaoxue Jiang,
Li** Tang,
Xinmin Yang
Abstract:
In a recent study, Ansary (Optim Methods Softw 38(3):570-590,2023) proposed a Newton-type proximal gradient method for nonlinear multiobjective optimization problems (NPGMO). However, the favorable convergence properties typically associated with Newton-type methods were not established for NPGMO in Ansary's work. In response to this gap, we develop a straightforward framework for analyzing the co…
▽ More
In a recent study, Ansary (Optim Methods Softw 38(3):570-590,2023) proposed a Newton-type proximal gradient method for nonlinear multiobjective optimization problems (NPGMO). However, the favorable convergence properties typically associated with Newton-type methods were not established for NPGMO in Ansary's work. In response to this gap, we develop a straightforward framework for analyzing the convergence behavior of the NPGMO. Specifically, under the assumption of strong convexity, we demonstrate that the NPGMO enjoys quadratic termination, superlinear convergence, and quadratic convergence for problems that are quadratic, twice continuously differentiable and twice Lipschitz continuously differentiable, respectively.
△ Less
Submitted 19 August, 2023;
originally announced August 2023.
-
A novel reduced basis method for adjoint sensitivity analysis of dynamic topology optimization
Authors:
Shuhao Li,
Hu Wang,
Jichao Yin,
Xinchao Jiang,
Yaya Zhang
Abstract:
In gradient-based time domain topology optimization, design sensitivity analysis (DSA) of the dynamic response is essential, and requires high computational cost to directly differentiate, especially for high-order dynamic system. To address this issue, this study develops an efficient reduced basis method (RBM)-based discrete adjoint sensitivity analysis method, which on the one hand significantl…
▽ More
In gradient-based time domain topology optimization, design sensitivity analysis (DSA) of the dynamic response is essential, and requires high computational cost to directly differentiate, especially for high-order dynamic system. To address this issue, this study develops an efficient reduced basis method (RBM)-based discrete adjoint sensitivity analysis method, which on the one hand significantly improves the efficiency of sensitivity analysis and on the other hand avoids the consistency errors caused by the continuum method. In this algorithm, the basis functions of the adjoint problem are constructed in the offline phase based on the greedy-POD method, and a novel model-based estimation is developed to facilitate the acceleration of this process. Based on these basis functions, a fast and reasonably accurate model is then built by Galerkin projection for sensitivity analysis in each dynamic topology optimization iteration. Finally, the effectiveness of the error measures, the efficiency and the accuracy of the presented reduced-order method are verified by 2D and 3D dynamic structure studies.
△ Less
Submitted 18 August, 2023;
originally announced August 2023.
-
Adaptive SGD with Polyak stepsize and Line-search: Robust Convergence and Variance Reduction
Authors:
Xiaowen Jiang,
Sebastian U. Stich
Abstract:
The recently proposed stochastic Polyak stepsize (SPS) and stochastic line-search (SLS) for SGD have shown remarkable effectiveness when training over-parameterized models. However, in non-interpolation settings, both algorithms only guarantee convergence to a neighborhood of a solution which may result in a worse output than the initial guess. While artificially decreasing the adaptive stepsize h…
▽ More
The recently proposed stochastic Polyak stepsize (SPS) and stochastic line-search (SLS) for SGD have shown remarkable effectiveness when training over-parameterized models. However, in non-interpolation settings, both algorithms only guarantee convergence to a neighborhood of a solution which may result in a worse output than the initial guess. While artificially decreasing the adaptive stepsize has been proposed to address this issue (Orvieto et al. [2022]), this approach results in slower convergence rates for convex and over-parameterized models. In this work, we make two contributions: Firstly, we propose two new variants of SPS and SLS, called AdaSPS and AdaSLS, which guarantee convergence in non-interpolation settings and maintain sub-linear and linear convergence rates for convex and strongly convex functions when training over-parameterized models. AdaSLS requires no knowledge of problem-dependent parameters, and AdaSPS requires only a lower bound of the optimal function value as input. Secondly, we equip AdaSPS and AdaSLS with a novel variance reduction technique and obtain algorithms that require $\smash{\widetilde{\mathcal{O}}}(n+1/ε)$ gradient evaluations to achieve an $\mathcal{O}(ε)$-suboptimality for convex functions, which improves upon the slower $\mathcal{O}(1/ε^2)$ rates of AdaSPS and AdaSLS without variance reduction in the non-interpolation regimes. Moreover, our result matches the fast rates of AdaSVRG but removes the inner-outer-loop structure, which is easier to implement and analyze. Finally, numerical experiments on synthetic and real datasets validate our theory and demonstrate the effectiveness and robustness of our algorithms.
△ Less
Submitted 21 August, 2023; v1 submitted 11 August, 2023;
originally announced August 2023.
-
Almost-sure convergence of iterates and multipliers in stochastic sequential quadratic optimization
Authors:
Frank E. Curtis,
Xin Jiang,
Qi Wang
Abstract:
Stochastic sequential quadratic optimization (SQP) methods for solving continuous optimization problems with nonlinear equality constraints have attracted attention recently, such as for solving large-scale data-fitting problems subject to nonconvex constraints. However, for a recently proposed subclass of such methods that is built on the popular stochastic-gradient methodology from the unconstra…
▽ More
Stochastic sequential quadratic optimization (SQP) methods for solving continuous optimization problems with nonlinear equality constraints have attracted attention recently, such as for solving large-scale data-fitting problems subject to nonconvex constraints. However, for a recently proposed subclass of such methods that is built on the popular stochastic-gradient methodology from the unconstrained setting, convergence guarantees have been limited to the asymptotic convergence of the expected value of a stationarity measure to zero. This is in contrast to the unconstrained setting in which almost-sure convergence guarantees (of the gradient of the objective to zero) can be proved for stochastic-gradient-based methods. In this paper, new almost-sure convergence guarantees for the primal iterates, Lagrange multipliers, and stationarity measures generated by a stochastic SQP algorithm in this subclass of methods are proved. It is shown that the error in the Lagrange multipliers can be bounded by the distance of the primal iterate to a primal stationary point plus the error in the latest stochastic gradient estimate. It is further shown that, subject to certain assumptions, this latter error can be made to vanish by employing a running average of the Lagrange multipliers that are computed during the run of the algorithm. The results of numerical experiments are provided to demonstrate the proved theoretical guarantees.
△ Less
Submitted 7 August, 2023;
originally announced August 2023.
-
A globally convergent difference-of-convex algorithmic framework and application to log-determinant optimization problems
Authors:
Chaorui Yao,
Xin Jiang
Abstract:
The difference-of-convex algorithm (DCA) is a conceptually simple method for the minimization of (possibly) nonconvex functions that are expressed as the difference of two convex functions. At each iteration, DCA constructs a global overestimator of the objective and solves the resulting convex subproblem. Despite its conceptual simplicity, the theoretical understanding and algorithmic framework o…
▽ More
The difference-of-convex algorithm (DCA) is a conceptually simple method for the minimization of (possibly) nonconvex functions that are expressed as the difference of two convex functions. At each iteration, DCA constructs a global overestimator of the objective and solves the resulting convex subproblem. Despite its conceptual simplicity, the theoretical understanding and algorithmic framework of DCA needs further investigation. In this paper, global convergence of DCA at a linear rate is established under an extended Polyak--Łojasiewicz condition. The proposed condition holds for a class of DC programs with a bounded, closed, and convex constraint set, for which global convergence of DCA cannot be covered by existing analyses. Moreover, the DCProx computational framework is proposed, in which the DCA subproblems are solved by a primal--dual proximal algorithm with Bregman distances. With a suitable choice of Bregman distances, DCProx has simple update rules with cheap per-iteration complexity. As an application, DCA is applied to several fundamental problems in network information theory, for which no existing numerical methods are able to compute the global optimum. For these problems, our analysis proves the global convergence of DCA, and more importantly, DCProx solves the DCA subproblems efficiently. Numerical experiments are conducted to verify the efficiency of DCProx.
△ Less
Submitted 3 June, 2023;
originally announced June 2023.
-
Improved uniform error bounds of exponential wave integrator method for long-time dynamics of the space fractional Klein-Gordon equation with weak nonlinearity
Authors:
Junqing Jia,
Xiaoyun Jiang
Abstract:
An improved uniform error bound at $O\left(h^m+\varepsilon^2 τ^2\right)$ is established in $H^{α/2}$-norm for the long-time dynamics of the nonlinear space fractional Klein-Gordon equation (NSFKGE). A second-order exponential wave integrator (EWI) method is used to semi-discretize NSFKGE in time and the Fourier spectral method in space is applied to derive the full-discretization scheme. Regularit…
▽ More
An improved uniform error bound at $O\left(h^m+\varepsilon^2 τ^2\right)$ is established in $H^{α/2}$-norm for the long-time dynamics of the nonlinear space fractional Klein-Gordon equation (NSFKGE). A second-order exponential wave integrator (EWI) method is used to semi-discretize NSFKGE in time and the Fourier spectral method in space is applied to derive the full-discretization scheme. Regularity compensation oscillation (RCO) technique is employed to prove the improved uniform error bounds at $O\left(\varepsilon^2 τ^2\right)$ in temporal semi-discretization and $O\left(h^m+\varepsilon^2 τ^2\right)$ in full-discretization up to the long-time $T_{\varepsilon}=T / \varepsilon^2$ ($T>0$ fixed), respectively. Complex NSFKGE and oscillatory complex NSFKGE with nonlinear terms of general power exponents are also discussed. Finally, the correctness of the theoretical analysis and the effectiveness of the method are verified by numerical examples.
△ Less
Submitted 7 March, 2023;
originally announced March 2023.
-
A survey of path planning and feedrate interpolation in computer numerical control
Authors:
Hong-yu Ma,
Li-yong Shen,
Xin Jiang,
Qiang Zou,
Chun-ming Yuan
Abstract:
This paper presents a brief survey (in Chinese) on path planning and feedrate interpolation. Numerical control technology is widely employed in the modern manufacturing industry, and related research has been emphasized by academia and industry. The traditional process of numerical control technology is mainly composed of tool path planning and feedrate interpolation. To attain the machining of hi…
▽ More
This paper presents a brief survey (in Chinese) on path planning and feedrate interpolation. Numerical control technology is widely employed in the modern manufacturing industry, and related research has been emphasized by academia and industry. The traditional process of numerical control technology is mainly composed of tool path planning and feedrate interpolation. To attain the machining of high speed and precision, several problems in tool path planning and feedrate interpolation are usually transformed into mathematical optimization models. To better undertake the research on the integrated design and optimization idea of tool path planning and feedrate interpolation, it is necessary to systematically review and drawn on the existing representative works. We will introduce the relevant methods and technical progress of tool path planning and feedrate interpolation in CNC machining successively, including tool path planning based on end milling, tool orientation optimization, G-code processing and corner transition, feedrate planning of parameter curves, and some new machining optimization methods proposed recently.
△ Less
Submitted 28 February, 2023;
originally announced March 2023.
-
A PML method for signal-propagation problems in axon
Authors:
Xue Jiang,
Maohui Lyu,
Tao Yin,
Weiying Zheng
Abstract:
This work is focused on the modelling of signal propagations in myelinated axons to characterize the functions of the myelin sheath in the neural structure. Based on reasonable assumptions on the medium properties, we derive a two-dimensional neural-signaling model in cylindrical coordinates from the time-harmonic Maxwell's equations. The well-posedness of model is established upon Dirichlet bound…
▽ More
This work is focused on the modelling of signal propagations in myelinated axons to characterize the functions of the myelin sheath in the neural structure. Based on reasonable assumptions on the medium properties, we derive a two-dimensional neural-signaling model in cylindrical coordinates from the time-harmonic Maxwell's equations. The well-posedness of model is established upon Dirichlet boundary conditions at the two ends of the neural structure and the radiative condition in the radial direction of the structure. Using the perfectly matched layer (PML) method, we truncate the unbounded background medium and propose an approximate problem on the truncated domain. The well-posedness of the PML problem and the exponential convergence of the approximate solution to the exact solution are established. Numerical experiments based on finite element discretization are presented to demonstrate the theoretical results and the efficiency of our methods to simulate the signal propagation in axons.
△ Less
Submitted 28 December, 2022;
originally announced December 2022.
-
Fast method and convergence analysis of fractional magnetohydrodynamic coupled flow and heat transfer model for generalized second-grade fluid
Authors:
Xiaoqing Chi,
Hui Zhang,
Xiaoyun Jiang
Abstract:
In this paper, we first establish a new fractional magnetohydrodynamic (MHD) coupled flow and heat transfer model for a generalized second-grade fluid. This coupled model consists of a fractional momentum equation and a heat conduction equation with a generalized form of Fourier law. The second-order fractional backward difference formula is applied to the temporal discretization and the Legendre…
▽ More
In this paper, we first establish a new fractional magnetohydrodynamic (MHD) coupled flow and heat transfer model for a generalized second-grade fluid. This coupled model consists of a fractional momentum equation and a heat conduction equation with a generalized form of Fourier law. The second-order fractional backward difference formula is applied to the temporal discretization and the Legendre spectral method is used for the spatial discretization. The fully discrete scheme is proved to be stable and convergent with an accuracy of $O(τ^2+N^{-r})$, where $τ$ is the time step size and $N$ is the polynomial degree. To reduce the memory requirements and computational cost, a fast method is developed, which is based on a globally uniform approximation of the trapezoidal rule for integrals on the real line. And the strict convergence of the numerical scheme with this fast method is proved. We present the results of several numerical experiments to verify the effectiveness of the proposed method. Finally, we simulate the unsteady fractional MHD flow and heat transfer of the generalized second-grade fluid through a porous medium. The effects of the relevant parameters on the velocity and temperature are presented and analyzed in detail.
△ Less
Submitted 26 November, 2022;
originally announced November 2022.
-
An E-PINN assisted practical uncertainty quantification for inverse problems
Authors:
Xinchao Jiang,
Xin Wanga,
Ziming Wena,
Enying Li,
Hu Wang
Abstract:
How to solve inverse problems is the challenge of many engineering and industrial applications. Recently, physics-informed neural networks (PINNs) have emerged as a powerful approach to solve inverse problems efficiently. However, it is difficult for PINNs to quantify the uncertainty of results. Therefore, this study proposed ensemble PINNs (E-PINNs) to handle this issue. The E-PINN uses ensemble…
▽ More
How to solve inverse problems is the challenge of many engineering and industrial applications. Recently, physics-informed neural networks (PINNs) have emerged as a powerful approach to solve inverse problems efficiently. However, it is difficult for PINNs to quantify the uncertainty of results. Therefore, this study proposed ensemble PINNs (E-PINNs) to handle this issue. The E-PINN uses ensemble statistics of several basic models to provide uncertainty quantifications for the inverse solution based on the PINN framework, and it is employed to solve the inverse problems in which the unknown quantity is propagated through partial differential equations (PDEs), especially the identification of the unknown field (e.g., space function) of a given physical system. Compared with other data-driven approaches, the suggested method is more than straightforward to implement, and also obtains high-quality uncertainty estimates of the quantity of interest (QoI) without significantly increasing the complexity of the algorithm. This work discusses the good properties of ensemble learning in field inversion and uncertainty quantification. The effectiveness of the proposed method is demonstrated through several numerical experiments. To enhance the robustness of models, adversarial training (AT) is applied. Furthermore, an adaptive active sampling (AS) strategy based on the uncertainty estimates from E-PINNs is also proposed to improve the accuracy of material field inversion problems.
△ Less
Submitted 21 September, 2022;
originally announced September 2022.
-
A stochastic agent-based model to evaluate COVID-19 transmission influenced by human mobility
Authors:
Kejie Chen,
Yanqing Li,
Rongxin Zhou,
Xiaomo Jiang
Abstract:
The COVID-19 pandemic has created an urgent need for mathematical models that can project epidemic trends and evaluate the effectiveness of mitigation strategies. To forecast the transmission of COVID-19, a major challenge is the accurate assessment of the multi-scale human mobility and how they impact the infection through close contacts. By combining the stochastic agent-based modeling strategy…
▽ More
The COVID-19 pandemic has created an urgent need for mathematical models that can project epidemic trends and evaluate the effectiveness of mitigation strategies. To forecast the transmission of COVID-19, a major challenge is the accurate assessment of the multi-scale human mobility and how they impact the infection through close contacts. By combining the stochastic agent-based modeling strategy and hierarchical structures of spatial containers corresponding to the notion of places in geography, this study proposes a novel model, Mob-Cov, to study the impact of human traveling behaviour and individual health conditions on the disease outbreak and the probability of zero COVID in the population. Specifically, individuals perform power-law type of local movements within a container and global transport between different-level containers. Frequent short movements inside a small-level container (e.g. a road or a county) and a large population size influence the local crowdedness of people, which accelerates the infection and regional transmission. Travels between large-level containers (e.g. cities and nations) facilitate global spread and outbreak. Moreover, dynamic infection and recovery in the population are able to drive the bifurcation of the system to a "zero-COVID" state or a "live with COVID" state, depending on the mobility patterns, population number and health conditions. Reducing total population and local people accumulation as well as restricting global travels help achieve zero-COVID. In summary, the Mob-Cov model considers more realistic human mobility in a wide range of spatial scales, and has been designed with equal emphasis on performance, low simulation cost, accuracy, ease of use and flexibility. It is a useful tool for researchers and politicians to investigate the pandemic dynamics and plan actions against the disease.
△ Less
Submitted 17 November, 2022; v1 submitted 6 September, 2022;
originally announced September 2022.
-
A nonlocal curve flow in centro-affine geometry
Authors:
Xinjie Jiang,
Yun Yang,
Yanhua Yu
Abstract:
In this paper, the isoperimetric inequality in centro-affine plane geometry is obtained. We also investigate the long-term behavior of an invariant plane curve flow, whose evolution process can be expressed as a second-order nonlinear parabolic equation with respect to centro-affine curvature. The forward and backward limits in time are discussed, which shows that a closed convex embedded curve ma…
▽ More
In this paper, the isoperimetric inequality in centro-affine plane geometry is obtained. We also investigate the long-term behavior of an invariant plane curve flow, whose evolution process can be expressed as a second-order nonlinear parabolic equation with respect to centro-affine curvature. The forward and backward limits in time are discussed, which shows that a closed convex embedded curve may converge to an ellipse when evolving according to this flow.
△ Less
Submitted 13 July, 2022;
originally announced July 2022.
-
Distributed stochastic projection-free solver for constrained optimization
Authors:
Xia Jiang,
Xianlin Zeng,
Lihua Xie,
Jian Sun,
Jie Chen
Abstract:
This paper proposes a distributed stochastic projection-free algorithm for large-scale constrained finite-sum optimization whose constraint set is complicated such that the projection onto the constraint set can be expensive. The global cost function is allocated to multiple agents, each of which computes its local stochastic gradients and communicates with its neighbors to solve the global proble…
▽ More
This paper proposes a distributed stochastic projection-free algorithm for large-scale constrained finite-sum optimization whose constraint set is complicated such that the projection onto the constraint set can be expensive. The global cost function is allocated to multiple agents, each of which computes its local stochastic gradients and communicates with its neighbors to solve the global problem. Stochastic gradient methods enable low computational cost, while they are hard and slow to converge due to the variance caused by random sampling. To construct a convergent distributed stochastic projection-free algorithm, this paper incorporates a variance reduction technique and gradient tracking technique in the Frank-Wolfe update. We develop a sampling rule for the variance reduction technique to reduce the variance introduced by stochastic gradients. Complete and rigorous proofs show that the proposed distributed projection-free algorithm converges with a sublinear convergence rate and enjoys superior complexity guarantees for both convex and non-convex objective functions. By comparative simulations, we demonstrate the convergence and computational efficiency of the proposed algorithm.
△ Less
Submitted 22 April, 2022;
originally announced April 2022.
-
Bregman three-operator splitting methods
Authors:
Xin Jiang,
Lieven Vandenberghe
Abstract:
The paper presents primal-dual proximal splitting methods for convex optimization, in which generalized Bregman distances are used to define the primal and dual proximal update steps. The methods extend the primal and dual Condat-Vu algorithms and the primal-dual three-operator (PD3O) algorithm. The Bregman extensions of the Condat-Vu algorithms are derived from the Bregman proximal point method a…
▽ More
The paper presents primal-dual proximal splitting methods for convex optimization, in which generalized Bregman distances are used to define the primal and dual proximal update steps. The methods extend the primal and dual Condat-Vu algorithms and the primal-dual three-operator (PD3O) algorithm. The Bregman extensions of the Condat-Vu algorithms are derived from the Bregman proximal point method applied to a monotone inclusion problem. Based on this interpretation, a unified framework for the convergence analysis of the two methods is presented. We also introduce a line search procedure for stepsize selection in the Bregman dual Condat-Vu algorithm applied to equality-constrained problems. Finally, we propose a Bregman extension of PD3O and analyze its convergence.
△ Less
Submitted 3 October, 2022; v1 submitted 1 March, 2022;
originally announced March 2022.
-
An Adaptive Finite Element DtN Method for Maxwell's Equations
Authors:
Gang Bao,
Mingming Zhang,
Xue Jiang,
Peijun Li,
Xiaokai Yuan
Abstract:
This paper is concerned with a numerical solution to the scattering of a time-harmonic electromagnetic wave by a bounded and impenetrable obstacle in three dimensions. The electromagnetic wave propagation is modeled by a boundary value problem of Maxwell's equations in the exterior domain of the obstacle. Based on the Dirichlet-to-Neumann (DtN) operator, which is defined by an infinite series, an…
▽ More
This paper is concerned with a numerical solution to the scattering of a time-harmonic electromagnetic wave by a bounded and impenetrable obstacle in three dimensions. The electromagnetic wave propagation is modeled by a boundary value problem of Maxwell's equations in the exterior domain of the obstacle. Based on the Dirichlet-to-Neumann (DtN) operator, which is defined by an infinite series, an exact transparent boundary condition is introduced and the scattering problem is reduced equivalently into a bounded domain. An a posteriori error estimate based adaptive finite element DtN method is developed to solve the discrete variational problem, where the DtN operator is truncated into a sum of finitely many terms. The a posteriori error estimate takes into account both the finite element approximation error and the truncation error of the DtN operator. The latter is shown to decay exponentially with respect to the truncation parameter. Numerical experiments are presented to illustrate the effectiveness of the proposed method.
△ Less
Submitted 18 February, 2022;
originally announced February 2022.
-
Topology optimization on complex surfaces based on the moving morphable component (MMC) method and computational conformal map** (CCM)
Authors:
Wendong Huo,
Chang Liu,
Zongliang Du,
Xudong Jiang,
Zhengyu Liu,
Xu Guo
Abstract:
In the present paper, an integrated paradigm for topology optimization on complex surfaces with arbitrary genus is proposed. The approach is constructed based on the two-dimensional (2D) Moving Morphable Component (MMC) framework, where a set of structural components are used as the basic units of optimization, and computational conformal map** (CCM) technique, with which a complex surface repre…
▽ More
In the present paper, an integrated paradigm for topology optimization on complex surfaces with arbitrary genus is proposed. The approach is constructed based on the two-dimensional (2D) Moving Morphable Component (MMC) framework, where a set of structural components are used as the basic units of optimization, and computational conformal map** (CCM) technique, with which a complex surface represented by an unstructured triangular mesh can be mapped into a set of regular 2D parameter domains numerically. A multi-patch stitching scheme is also developed to achieve an MMC-friendly global parameterization through a number of local parameterizations. Numerical examples including a saddle-shaped shell, a torus-shape shell and a tee-branch pipe are solved to demonstrate the validity and efficiency of the proposed approach. It is found that compared with traditional approaches for topology optimization on 2D surfaces, optimized designs with clear load transmission paths can be obtained with much fewer numbers of design variables and degrees of freedom for finite element analysis (FEA) via the proposed approach.
△ Less
Submitted 2 February, 2022; v1 submitted 20 January, 2022;
originally announced January 2022.
-
Computing Groebner bases of ideal interpolation
Authors:
Xue Jiang,
Yihe Gong
Abstract:
We present algorithms for computing the reduced Gröbner basis of the vanishing ideal of a finite set of points in a frame of ideal interpolation. Ideal interpolation is defined by a linear projector whose kernel is a polynomial ideal. In this paper, we translate interpolation condition functionals into formal power series via Taylor expansion, then the reduced Gröbner basis is read from formal pow…
▽ More
We present algorithms for computing the reduced Gröbner basis of the vanishing ideal of a finite set of points in a frame of ideal interpolation. Ideal interpolation is defined by a linear projector whose kernel is a polynomial ideal. In this paper, we translate interpolation condition functionals into formal power series via Taylor expansion, then the reduced Gröbner basis is read from formal power series by Gaussian elimination. Our algorithm has a polynomial time complexity. It compares favorably with MMM algorithm in single point ideal interpolation and some several points ideal interpolation.
△ Less
Submitted 14 January, 2024; v1 submitted 14 November, 2021;
originally announced November 2021.
-
Distributed stochastic proximal algorithm with random reshuffling for non-smooth finite-sum optimization
Authors:
Xia Jiang,
Xianlin Zeng,
Jian Sun,
Jie Chen,
Lihua Xie
Abstract:
The non-smooth finite-sum minimization is a fundamental problem in machine learning. This paper develops a distributed stochastic proximal-gradient algorithm with random reshuffling to solve the finite-sum minimization over time-varying multi-agent networks. The objective function is a sum of differentiable convex functions and non-smooth regularization. Each agent in the network updates local var…
▽ More
The non-smooth finite-sum minimization is a fundamental problem in machine learning. This paper develops a distributed stochastic proximal-gradient algorithm with random reshuffling to solve the finite-sum minimization over time-varying multi-agent networks. The objective function is a sum of differentiable convex functions and non-smooth regularization. Each agent in the network updates local variables with a constant step-size by local information and cooperates to seek an optimal solution. We prove that local variable estimates generated by the proposed algorithm achieve consensus and are attracted to a neighborhood of the optimal solution in expectation with an $\mathcal{O}(\frac{1}{T}+\frac{1}{\sqrt{T}})$ convergence rate, where $T$ is the total number of iterations. Finally, some comparative simulations are provided to verify the convergence performance of the proposed algorithm.
△ Less
Submitted 10 October, 2022; v1 submitted 6 November, 2021;
originally announced November 2021.
-
On the divisibility of sums of even powers of $q$-binomial coefficients
Authors:
Ji-Cai Liu,
Xue-Ting Jiang
Abstract:
We prove the divisibility conjecture on sums of even powers of $q$-binomial coefficients, which was recently proposed by Guo, Schlosser and Zudilin. Our proof relies on two $q$-harmonic series congruences due to Shi and Pan.
We prove the divisibility conjecture on sums of even powers of $q$-binomial coefficients, which was recently proposed by Guo, Schlosser and Zudilin. Our proof relies on two $q$-harmonic series congruences due to Shi and Pan.
△ Less
Submitted 19 October, 2021;
originally announced October 2021.
-
Riesz representation theorems for positive algebra homomorphisms
Authors:
Marcel de Jeu,
Xingni Jiang
Abstract:
Let $X$ be a locally compact Hausdorff space, let $\mathrm A$ be a partially ordered algebra, and let $T:\mathrm{C}_{\mathrm c}(X)\to \mathrm A$ be a positive algebra homomorphism. Under conditions on $\mathrm A$ that are satisfied in a good number of cases of practical interest, we show that $T$ is represented by a (unique regular) measure $μ$ on the Borel $σ$-algebra of $X$ that takes it values…
▽ More
Let $X$ be a locally compact Hausdorff space, let $\mathrm A$ be a partially ordered algebra, and let $T:\mathrm{C}_{\mathrm c}(X)\to \mathrm A$ be a positive algebra homomorphism. Under conditions on $\mathrm A$ that are satisfied in a good number of cases of practical interest, we show that $T$ is represented by a (unique regular) measure $μ$ on the Borel $σ$-algebra of $X$ that takes it values in the positive cone of $\mathrm A$, and with the property that $μ(A_1\cap A_2)=μ(A_1)μ(A_2)$ for Borel subsets $A_1,A_2$ of $X$.
The positive algebra homomorphism $T$ can be extended from ${\mathrm C}_{\mathrm c}(X)$ to the accompanying $\mathcal L^1$-space of $μ$. We show that, quite often, this $\mathcal L^1$-space is closed under multiplication, so that it is a Riesz algebra, and that the extended map $T:\mathcal L^1\to\mathrm A$ is not only an algebra homomorphism, but, even when $\mathrm A$ is not a Riesz space, also a vector lattice homomorphism in a sense that is explained in the paper. The latter property enables one to describe images of the extended map in terms of sequential up-downs and down-ups of the image of (the positive cone of ) ${\mathrm C}_{\mathrm c}(X)$ when $\mathrm A$ has the countable sup property.
We apply the main results, which are obtained by purely order-theoretical methods, to positive algebra homomorphisms from ${\mathrm C}_0(X)$ into the order continuous operators on a Banach lattice, and to representations of ${\mathrm C}_0(X,\mathbb C)$ on Hilbert spaces.
It is thus seen that, for representations on Banach lattices and on Hilbert spaces, although situated in rather different contexts, spectral theorems can be established that are both rooted in the same order-theoretical Riesz representation theorem for positive algebra homomorphisms from ${\mathrm C_\mathrm c}(X)$ into partially ordered algebras.
△ Less
Submitted 21 September, 2021;
originally announced September 2021.
-
A Fast PC Algorithm with Reversed-order Pruning and A Parallelization Strategy
Authors:
Kai Zhang,
Chao Tian,
Kun Zhang,
Todd Johnson,
Xiaoqian Jiang
Abstract:
The PC algorithm is the state-of-the-art algorithm for causal structure discovery on observational data. It can be computationally expensive in the worst case due to the conditional independence tests are performed in an exhaustive-searching manner. This makes the algorithm computationally intractable when the task contains several hundred or thousand nodes, particularly when the true underlying c…
▽ More
The PC algorithm is the state-of-the-art algorithm for causal structure discovery on observational data. It can be computationally expensive in the worst case due to the conditional independence tests are performed in an exhaustive-searching manner. This makes the algorithm computationally intractable when the task contains several hundred or thousand nodes, particularly when the true underlying causal graph is dense. We propose a critical observation that the conditional set rendering two nodes independent is non-unique, and including certain redundant nodes do not sacrifice result accuracy. Based on this finding, the innovations of our work are two-folds. First, we innovate on a reserve order linkage pruning PC algorithm which significantly increases the algorithm's efficiency. Second, we propose a parallel computing strategy for statistical independence tests by leveraging tensor computation, which brings further speedup. We also prove the proposed algorithm does not induce statistical power loss under mild graph and data dimensionality assumptions. Experimental results show that the single-threaded version of the proposed algorithm can achieve a 6-fold speedup compared to the PC algorithm on a dense 95-node graph, and the parallel version can make a 825-fold speed-up. We also provide proof that the proposed algorithm is consistent under the same set of conditions with conventional PC algorithm.
△ Less
Submitted 9 September, 2021;
originally announced September 2021.
-
A Physics-Data-Driven Bayesian Method for Heat Conduction Problems
Authors:
Xinchao Jiang,
Hu Wang,
Yu li
Abstract:
In this study, a novel physics-data-driven Bayesian method named Heat Conduction Equation assisted Bayesian Neural Network (HCE-BNN) is proposed. The HCE-BNN is constructed based on the Bayesian neural network, it is a physics-informed machine learning strategy. Compared with the existed pure data driven method, to acquire physical consistency and better performance of the data-driven model, the h…
▽ More
In this study, a novel physics-data-driven Bayesian method named Heat Conduction Equation assisted Bayesian Neural Network (HCE-BNN) is proposed. The HCE-BNN is constructed based on the Bayesian neural network, it is a physics-informed machine learning strategy. Compared with the existed pure data driven method, to acquire physical consistency and better performance of the data-driven model, the heat conduction equation is embedded into the loss function of the HCE-BNN as a regularization term. Hence, the proposed method can build a more reliable model by physical constraints with less data. The HCE-BNN can handle the forward and inverse problems consistently, that is, to infer unknown responses from known partial responses, or to identify boundary conditions or material parameters from known responses. Compared with the exact results, the test results demonstrate that the proposed method can be applied to both heat conduction forward and inverse problems successfully. In addition, the proposed method can be implemented with the noisy data and gives the corresponding uncertainty quantification for the solutions.
△ Less
Submitted 2 September, 2021;
originally announced September 2021.
-
Asymptotics of Kähler-Einstein metrics on complex hyperbolic cusps
Authors:
Xin Fu,
Hans-Joachim Hein,
Xumin Jiang
Abstract:
Let $L$ be a negative holomorphic line bundle over an $(n-1)$-dimensional complex torus $D$. Let $h$ be a Hermitian metric on $L$ such that the curvature form of the dual Hermitian metric defines a flat Kähler metric on $D$. Then $h$ is unique up to scaling, and, for some closed tubular neighborhood $V$ of the zero section $D \subset L$, the form…
▽ More
Let $L$ be a negative holomorphic line bundle over an $(n-1)$-dimensional complex torus $D$. Let $h$ be a Hermitian metric on $L$ such that the curvature form of the dual Hermitian metric defines a flat Kähler metric on $D$. Then $h$ is unique up to scaling, and, for some closed tubular neighborhood $V$ of the zero section $D \subset L$, the form $ω_h = -(n+1)i\partial\overline\partial\log(-{\log h})$ defines a complete Kähler-Einstein metric on $V \setminus D$ with ${\rm Ric}(ω_h) = -ω_h$. In fact, $ω_h$ is complex hyperbolic, i.e., the holomorphic sectional curvature of $ω_h$ is constant, and $ω_h$ has the usual doubly-warped cusp structure familiar from complex hyperbolic geometry. In this paper, we prove that if $U$ is another closed tubular neighborhood of the zero section and if $ω$ is a complete Kähler-Einstein metric with ${\rm Ric}(ω) = -ω$ on $U \setminus D$, then there exist a Hermitian metric $h$ as above and a $δ\in \mathbb{R}^+$ such that $ω- ω_{h} = O(e^{-δ\sqrt{-{\log h}}})$ to all orders with respect to $ω_h$ as $h \to 0$. This rate is doubly exponential in the distance from a fixed point, and is sharp.
△ Less
Submitted 9 November, 2021; v1 submitted 30 August, 2021;
originally announced August 2021.
-
Special MMP for log canonical generalised pairs
Authors:
Vladimir Lazić,
Nikolaos Tsakanikas,
with an appendix joint with Xiaowei Jiang
Abstract:
We show that minimal models of $\mathbb{Q}$-factorial NQC log canonical generalised pairs exist, assuming the existence of minimal models of smooth varieties. More generally, we prove that on a $\mathbb{Q}$-factorial NQC log canonical generalised pair $ (X,B+M) $ we can run an MMP with scaling of an ample divisor which terminates, assuming that it admits an NQC weak Zariski decomposition or that…
▽ More
We show that minimal models of $\mathbb{Q}$-factorial NQC log canonical generalised pairs exist, assuming the existence of minimal models of smooth varieties. More generally, we prove that on a $\mathbb{Q}$-factorial NQC log canonical generalised pair $ (X,B+M) $ we can run an MMP with scaling of an ample divisor which terminates, assuming that it admits an NQC weak Zariski decomposition or that $K_X+B+M$ is not pseudoeffective. As a consequence, we establish several existence results for minimal models and Mori fibre spaces.
△ Less
Submitted 10 August, 2022; v1 submitted 2 August, 2021;
originally announced August 2021.
-
Distributed stochastic gradient tracking algorithm with variance reduction for non-convex optimization
Authors:
Xia Jiang,
Xianlin Zeng,
Jian Sun,
Jie Chen
Abstract:
This paper proposes a distributed stochastic algorithm with variance reduction for general smooth non-convex finite-sum optimization, which has wide applications in signal processing and machine learning communities. In distributed setting, large number of samples are allocated to multiple agents in the network. Each agent computes local stochastic gradient and communicates with its neighbors to s…
▽ More
This paper proposes a distributed stochastic algorithm with variance reduction for general smooth non-convex finite-sum optimization, which has wide applications in signal processing and machine learning communities. In distributed setting, large number of samples are allocated to multiple agents in the network. Each agent computes local stochastic gradient and communicates with its neighbors to seek for the global optimum. In this paper, we develop a modified variance reduction technique to deal with the variance introduced by stochastic gradients. Combining gradient tracking and variance reduction techniques, this paper proposes a distributed stochastic algorithm, GT-VR, to solve large-scale non-convex finite-sum optimization over multi-agent networks. A complete and rigorous proof shows that the GT-VR algorithm converges to first-order stationary points with $O(\frac{1}{k})$ convergence rate. In addition, we provide the complexity analysis of the proposed algorithm. Compared with some existing first-order methods, the proposed algorithm has a lower $\mathcal{O}(PMε^{-1})$ gradient complexity under some mild condition. By comparing state-of-the-art algorithms and GT-VR in experimental simulations, we verify the efficiency of the proposed algorithm.
△ Less
Submitted 21 July, 2021; v1 submitted 28 June, 2021;
originally announced June 2021.
-
On the finiteness of the Morse index of self-shrinkers
Authors:
Xu-Yong Jiang,
He-Jun Sun,
Peibiao Zhao
Abstract:
In this paper, we present a sufficient condition for finite Morse index of complete properly self-shrinkers. We prove that a complete properly embedded self-shrinker in $\mathbb{R}^{n+1}$ with finite asymptotically conical ends or asymptotically cylindrical ends must have finite Morse index. Moreover, as an application of this result, we show that a complete properly embedded self-shrinker in…
▽ More
In this paper, we present a sufficient condition for finite Morse index of complete properly self-shrinkers. We prove that a complete properly embedded self-shrinker in $\mathbb{R}^{n+1}$ with finite asymptotically conical ends or asymptotically cylindrical ends must have finite Morse index. Moreover, as an application of this result, we show that a complete properly embedded self-shrinker in $\mathbb{R}^3$ with finite genus has finite Morse index.
△ Less
Submitted 23 June, 2021;
originally announced June 2021.
-
Riesz representation theorems for positive linear operators
Authors:
Marcel de Jeu,
Xingni Jiang
Abstract:
We generalise the Riesz representation theorems for positive linear functionals on $\mathrm{C}_{\mathrm c}(X)$ and $\mathrm{C}_{\mathrm 0}(X)$, where $X$ is a locally compact Hausdorff space, to positive linear operators from these spaces into a partially ordered vector space $E$. The representing measures are defined on the Borel $σ$-algebra of $X$ and take their values in the extended positive c…
▽ More
We generalise the Riesz representation theorems for positive linear functionals on $\mathrm{C}_{\mathrm c}(X)$ and $\mathrm{C}_{\mathrm 0}(X)$, where $X$ is a locally compact Hausdorff space, to positive linear operators from these spaces into a partially ordered vector space $E$. The representing measures are defined on the Borel $σ$-algebra of $X$ and take their values in the extended positive cone of $E$; the corresponding integrals are order integrals. We give explicit formulas for the values of the representing measures at open and at compact subsets of $X$.
Results are included where the space $E$ need not be a vector lattice, nor a normed space. Representing measures exist for positive linear operators into Banach lattices with order continuous norms, into the regular operators on a KB-space, into the self-adjoint linear operators in a weakly closed complex linear subspace of the bounded linear operators on a complex Hilbert space, and into JBW-algebras.
△ Less
Submitted 4 January, 2022; v1 submitted 25 April, 2021;
originally announced April 2021.
-
Order Integrals
Authors:
Marcel de Jeu,
Xingni Jiang
Abstract:
We define an integral of real-valued functions with respect to a measure that takes its values in the extended positive cone of a partially ordered vector space $E$. The monotone convergence theorem, Fatou's lemma, and the dominated convergence theorem are established; the analogues of the classical ${\mathcal L}^1$- and ${\mathrm L}^1$-spaces are investigated. The results extend earlier work by W…
▽ More
We define an integral of real-valued functions with respect to a measure that takes its values in the extended positive cone of a partially ordered vector space $E$. The monotone convergence theorem, Fatou's lemma, and the dominated convergence theorem are established; the analogues of the classical ${\mathcal L}^1$- and ${\mathrm L}^1$-spaces are investigated. The results extend earlier work by Wright and specialise to those for the Lebesgue integral when $E$ equals the real numbers.
The hypothesis on $E$ that is needed for the definition of the integral and for the monotone convergence theorem to hold ($σ$-monotone completeness) is a rather mild one. It is satisfied, for example, by the space of regular operators between a directed partially ordered vector space and a $σ$-monotone complete partially ordered vector space, and by every JBW-algebra. Fatou's lemma and the dominated convergence theorem hold for every $σ$-Dedekind complete space.
When $E$ consists of the regular operators on a Banach lattice with an order continuous norm, or when it consists of the self-adjoint elements of a strongly closed complex linear subspace of the bounded operators on a complex Hilbert space, then the finite measures as in the current paper are precisely the strongly $σ$-additive positive operator-valued measures. When $E$ is a partially ordered Banach space with a closed positive cone, then every positive vector measure is a measure in our sense, but not conversely. Even when a measure falls into both categories, the domain of the integral as defined in this paper can properly contain that of any reasonably defined integral with respect to the vector measure using Banach space methods.
△ Less
Submitted 9 November, 2021; v1 submitted 18 April, 2021;
originally announced April 2021.
-
Distributed synchronous and asynchronous algorithms for semi-definite programming with diagonal constraints
Authors:
Xia Jiang,
Xianlin Zeng,
Jian Sun,
Jie Chen
Abstract:
This paper develops distributed synchronous and asynchronous algorithms for the large-scale semi-definite programming with diagonal constraints, which has wide applications in combination optimization, image processing and community detection. The information of the semi-definite programming is allocated to multiple interconnected agents such that each agent aims to find a solution by communicatin…
▽ More
This paper develops distributed synchronous and asynchronous algorithms for the large-scale semi-definite programming with diagonal constraints, which has wide applications in combination optimization, image processing and community detection. The information of the semi-definite programming is allocated to multiple interconnected agents such that each agent aims to find a solution by communicating to its neighbors. Based on low-rank property of solutions and the Burer-Monteiro factorization, we transform the original problem into a distributed optimization problem over unit spheres to reduce variable dimensions and ensure positive semi-definiteness without involving semi-definite projections, which are computationally expensive. For the distributed optimization problem, we propose distributed synchronous and asynchronous algorithms, both of which reduce computational burden and storage space compared with existing centralized algorithms. Specifically, the distributed synchronous algorithm almost surely escapes strict saddle points and converges to the set of optimal solutions to the optimization problem. In addition, the proposed distributed asynchronous algorithm allows communication delays and converges to the set of critical points to the optimization problem under mild conditions. By applying proposed algorithms to image segmentation applications, we illustrate the efficiency and convergence performance of the two proposed algorithms.
△ Less
Submitted 1 April, 2021;
originally announced April 2021.
-
Distributed proximal gradient algorithm for non-smooth non-convex optimization over time-varying networks
Authors:
Xia Jiang,
Xianlin Zeng,
Jian Sun,
Jie Chen
Abstract:
This note studies the distributed non-convex optimization problem with non-smooth regularization, which has wide applications in decentralized learning, estimation and control. The objective function is the sum of different local objective functions, which consist of differentiable (possibly non-convex) cost functions and non-smooth convex functions. This paper presents a distributed proximal grad…
▽ More
This note studies the distributed non-convex optimization problem with non-smooth regularization, which has wide applications in decentralized learning, estimation and control. The objective function is the sum of different local objective functions, which consist of differentiable (possibly non-convex) cost functions and non-smooth convex functions. This paper presents a distributed proximal gradient algorithm for the non-smooth non-convex optimization problem over time-varying multi-agent networks. Each agent updates local variable estimate by the multi-step consensus operator and the proximal operator. We prove that the generated local variables achieve consensus and converge to the set of critical points with convergence rate $O(1/T)$. Finally, we verify the efficacy of proposed algorithm by numerical simulations.
△ Less
Submitted 3 March, 2021;
originally announced March 2021.
-
Analytics and Machine Learning in Vehicle Routing Research
Authors:
Ruibin Bai,
Xinan Chen,
Zhi-Long Chen,
Tianxiang Cui,
Shuhui Gong,
Wentao He,
** Jiang,
Huan **,
Jiahuan **,
Graham Kendall,
Jiawei Li,
Zheng Lu,
Jianfeng Ren,
Paul Weng,
Ning Xue,
Huayan Zhang
Abstract:
The Vehicle Routing Problem (VRP) is one of the most intensively studied combinatorial optimisation problems for which numerous models and algorithms have been proposed. To tackle the complexities, uncertainties and dynamics involved in real-world VRP applications, Machine Learning (ML) methods have been used in combination with analytical approaches to enhance problem formulations and algorithmic…
▽ More
The Vehicle Routing Problem (VRP) is one of the most intensively studied combinatorial optimisation problems for which numerous models and algorithms have been proposed. To tackle the complexities, uncertainties and dynamics involved in real-world VRP applications, Machine Learning (ML) methods have been used in combination with analytical approaches to enhance problem formulations and algorithmic performance across different problem solving scenarios. However, the relevant papers are scattered in several traditional research fields with very different, sometimes confusing, terminologies. This paper presents a first, comprehensive review of hybrid methods that combine analytical techniques with ML tools in addressing VRP problems. Specifically, we review the emerging research streams on ML-assisted VRP modelling and ML-assisted VRP optimisation. We conclude that ML can be beneficial in enhancing VRP modelling, and improving the performance of algorithms for both online and offline VRP optimisations. Finally, challenges and future opportunities of VRP research are discussed.
△ Less
Submitted 19 February, 2021;
originally announced February 2021.