Search | arXiv e-print repository

arXiv:2406.19981 [pdf, other]

Orthogonal Constrained Neural Networks for Solving Structured Inverse Eigenvalue Problems

Authors: Shuai Zhang, Xuelian Jiang, Hao Qian, Yingxiang Xu

Abstract: This paper introduces a novel neural network for efficiently solving Structured Inverse Eigenvalue Problems (SIEPs). The main contributions lie in two aspects: firstly, a unified framework is proposed that can handle various SIEPs instances. Particularly, an innovative method for handling nonnegativity constraints is devised using the ReLU function. Secondly, a novel neural network based on multil… ▽ More This paper introduces a novel neural network for efficiently solving Structured Inverse Eigenvalue Problems (SIEPs). The main contributions lie in two aspects: firstly, a unified framework is proposed that can handle various SIEPs instances. Particularly, an innovative method for handling nonnegativity constraints is devised using the ReLU function. Secondly, a novel neural network based on multilayer perceptrons, utilizing the Stiefel layer, is designed to efficiently solve SIEP. By incorporating the Stiefel layer through matrix orthogonal decomposition, the orthogonality of similarity transformations is ensured, leading to accurate solutions for SIEPs. Hence, we name this new network Stiefel Multilayer Perceptron (SMLP). Furthermore, SMLP is an unsupervised learning approach with a lightweight structure that is easy to train. Several numerical tests from literature and engineering domains demonstrate the efficiency of SMLP. △ Less

Submitted 28 June, 2024; originally announced June 2024.

arXiv:2406.10958 [pdf, other]

City-LEO: Toward Transparent City Management Using LLM with End-to-End Optimization

Authors: Zihao Jiao, Mengyi Sha, Haoyu Zhang, Xinyu Jiang, Wei Qi

Abstract: Existing operations research (OR) models and tools play indispensable roles in smart-city operations, yet their practical implementation is limited by the complexity of modeling and deficiencies in optimization proficiency. To generate more relevant and accurate solutions to users' requirements, we propose a large language model (LLM)-based agent ("City-LEO") that enhances the efficiency and trans… ▽ More Existing operations research (OR) models and tools play indispensable roles in smart-city operations, yet their practical implementation is limited by the complexity of modeling and deficiencies in optimization proficiency. To generate more relevant and accurate solutions to users' requirements, we propose a large language model (LLM)-based agent ("City-LEO") that enhances the efficiency and transparency of city management through conversational interactions. Specifically, to accommodate diverse users' requirements and enhance computational tractability, City-LEO leverages LLM's logical reasoning capabilities on prior knowledge to scope down large-scale optimization problems efficiently. In the human-like decision process, City-LEO also incorporates End-to-end (E2E) model to synergize the prediction and optimization. The E2E framework be conducive to co** with environmental uncertainties and involving more query-relevant features, and then facilitates transparent and interpretable decision-making process. In case study, we employ City-LEO in the operations management of e-bike sharing (EBS) system. The numerical results demonstrate that City-LEO has superior performance when benchmarks against the full-scale optimization problem. With less computational time, City-LEO generates more satisfactory and relevant solutions to the users' requirements, and achieves lower global suboptimality without significantly compromising accuracy. In a broader sense, our proposed agent offers promise to develop LLM-embedded OR tools for smart-city operations management. △ Less

Submitted 17 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

Comments: 26 pages, 8 figures, 5 tables

arXiv:2406.09189 [pdf, other]

Lie Symmetry Net: Preserving Conservation Laws in Modelling Financial Market Dynamics via Differential Equations

Authors: Xuelian Jiang, Tongtian Zhu, Can Wang, Yingxiang Xu, Fengxiang He

Abstract: This paper employs a novel Lie symmetry-based framework to model the intrinsic symmetries within financial market. Specifically, we introduce {\it Lie symmetry net} (LSN), which characterises the Lie symmetry of the differential equations (DE) estimating financial market dynamics, such as the Black-Scholes equation and the Vašiček equation. To simulate these differential equations in a symmetry-aw… ▽ More This paper employs a novel Lie symmetry-based framework to model the intrinsic symmetries within financial market. Specifically, we introduce {\it Lie symmetry net} (LSN), which characterises the Lie symmetry of the differential equations (DE) estimating financial market dynamics, such as the Black-Scholes equation and the Vašiček equation. To simulate these differential equations in a symmetry-aware manner, LSN incorporates a Lie symmetry risk derived from the conservation laws associated with the Lie symmetry operators of the target differential equations. This risk measures how well the Lie symmetry is realised and guides the training of LSN under the structural risk minimisation framework. Extensive numerical experiments demonstrate that LSN effectively realises the Lie symmetry and achieves an error reduction of more than {\it one order of magnitude} compared to state-of-the-art methods. The code is available at \href{https://github.com/Jxl163/LSN_code}{https://github.com/Jxl163/LSN$\_$code}. △ Less

Submitted 13 June, 2024; originally announced June 2024.

arXiv:2406.06398 [pdf, other]

Universality of AdaGrad Stepsizes for Stochastic Optimization: Inexact Oracle, Acceleration and Variance Reduction

Authors: Anton Rodomanov, Xiaowen Jiang, Sebastian Stich

Abstract: We present adaptive gradient methods (both basic and accelerated) for solving convex composite optimization problems in which the main part is approximately smooth (a.k.a. $(δ, L)$-smooth) and can be accessed only via a (potentially biased) stochastic gradient oracle. This setting covers many interesting examples including Hölder smooth problems and various inexact computations of the stochastic g… ▽ More We present adaptive gradient methods (both basic and accelerated) for solving convex composite optimization problems in which the main part is approximately smooth (a.k.a. $(δ, L)$-smooth) and can be accessed only via a (potentially biased) stochastic gradient oracle. This setting covers many interesting examples including Hölder smooth problems and various inexact computations of the stochastic gradient. Our methods use AdaGrad stepsizes and are adaptive in the sense that they do not require knowing any problem-dependent constants except an estimate of the diameter of the feasible set but nevertheless achieve the best possible convergence rates as if they knew the corresponding constants. We demonstrate that AdaGrad stepsizes work in a variety of situations by proving, in a unified manner, three types of new results. First, we establish efficiency guarantees for our methods in the classical setting where the oracle's variance is uniformly bounded. We then show that, under more refined assumptions on the variance, the same methods without any modifications enjoy implicit variance reduction properties allowing us to express their complexity estimates in terms of the variance only at the minimizer. Finally, we show how to incorporate explicit SVRG-type variance reduction into our methods and obtain even faster algorithms. In all three cases, we present both basic and accelerated algorithms achieving state-of-the-art complexity bounds. As a direct corollary of our results, we obtain universal stochastic gradient methods for Hölder smooth problems which can be used in all situations. △ Less

Submitted 10 June, 2024; originally announced June 2024.

arXiv:2404.18953 [pdf, other]

A Knowledge-driven Memetic Algorithm for the Energy-efficient Distributed Homogeneous Flow Shop Scheduling Problem

Authors: Yunbao Xu, Xuemei Jiang, Jun Li, Lining Xing, Yanjie Song

Abstract: The reduction of carbon emissions in the manufacturing industry holds significant importance in achieving the national "double carbon" target. Ensuring energy efficiency is a crucial factor to be incorporated into future generation manufacturing systems. In this study, energy consumption is considered in the distributed homogeneous flow shop scheduling problem (DHFSSP). A knowledge-driven memetic… ▽ More The reduction of carbon emissions in the manufacturing industry holds significant importance in achieving the national "double carbon" target. Ensuring energy efficiency is a crucial factor to be incorporated into future generation manufacturing systems. In this study, energy consumption is considered in the distributed homogeneous flow shop scheduling problem (DHFSSP). A knowledge-driven memetic algorithm (KDMA) is proposed to address the energy-efficient DHFSSP (EEDHFSSP). KDMA incorporates a collaborative initialization strategy to generate high-quality initial populations. Furthermore, several algorithmic improvements including update strategy, local search strategy, and carbon reduction strategy are employed to improve the search performance of the algorithm. The effectiveness of KDMA in solving EEDHFSSP is verified through extensive simulation experiments. It is evident that KDMA outperforms many state-of-the-art algorithms across various evaluation aspects. △ Less

Submitted 27 April, 2024; originally announced April 2024.

Comments: 14 pages

arXiv:2404.08447 [pdf, other]

Federated Optimization with Doubly Regularized Drift Correction

Authors: Xiaowen Jiang, Anton Rodomanov, Sebastian U. Stich

Abstract: Federated learning is a distributed optimization paradigm that allows training machine learning models across decentralized devices while kee** the data localized. The standard method, FedAvg, suffers from client drift which can hamper performance and increase communication costs over centralized methods. Previous works proposed various strategies to mitigate drift, yet none have shown uniformly… ▽ More Federated learning is a distributed optimization paradigm that allows training machine learning models across decentralized devices while kee** the data localized. The standard method, FedAvg, suffers from client drift which can hamper performance and increase communication costs over centralized methods. Previous works proposed various strategies to mitigate drift, yet none have shown uniformly improved communication-computation trade-offs over vanilla gradient descent. In this work, we revisit DANE, an established method in distributed optimization. We show that (i) DANE can achieve the desired communication reduction under Hessian similarity constraints. Furthermore, (ii) we present an extension, DANE+, which supports arbitrary inexact local solvers and has more freedom to choose how to aggregate the local updates. We propose (iii) a novel method, FedRed, which has improved local computational complexity and retains the same communication complexity compared to DANE/DANE+. This is achieved by using doubly regularized drift correction. △ Less

Submitted 12 April, 2024; originally announced April 2024.

arXiv:2403.11293 [pdf, other]

Accelerating Gradient Tracking with Periodic Global Averaging

Authors: Shu**g Feng, Xin Jiang

Abstract: Decentralized optimization algorithms have recently attracted increasing attention due to its wide applications in all areas of science and engineering. In these algorithms, a collection of agents collaborate to minimize the average of a set of heterogeneous cost functions in a decentralized manner. State-of-the-art decentralized algorithms like Gradient Tracking (GT) and Exact Diffusion (ED) invo… ▽ More Decentralized optimization algorithms have recently attracted increasing attention due to its wide applications in all areas of science and engineering. In these algorithms, a collection of agents collaborate to minimize the average of a set of heterogeneous cost functions in a decentralized manner. State-of-the-art decentralized algorithms like Gradient Tracking (GT) and Exact Diffusion (ED) involve communication at each iteration. Yet, communication between agents is often expensive, resource intensive, and can be very slow. To this end, several strategies have been developed to balance between communication overhead and convergence rate of decentralized methods. In this paper, we introduce GT-PGA, which incorporates~GT with periodic global averaging. With the additional PGA, the influence of poor network connectivity in the GT algorithm can be compensated or controlled by a careful selection of the global averaging period. Under the stochastic, nonconvex setup, our analysis quantifies the crucial trade-off between the connectivity of network topology and the PGA period. Thus, with a suitable design of the PGA period, GT-PGA improves the convergence rate of vanilla GT. Numerical experiments are conducted to support our theory, and simulation results reveal that the proposed GT-PGA accelerates practical convergence, especially when the network is sparse. △ Less

Submitted 17 March, 2024; originally announced March 2024.

arXiv:2403.04412 [pdf, ps, other]

Model-free $H_{\infty}$ control of Itô stochastic system via off-policy reinforcement learning

Authors: **g Guo **g Guo, Xiushan Jiang, Weihai Zhang

Abstract: The stochastic $H_{\infty}$ control is studied for a linear stochastic Itô system with an unknown system model. The linear stochastic $H_{\infty}$ control issue is known to be transformable into the problem of solving a so-called generalized algebraic Riccati equation (GARE), which is a nonlinear equation that is typically difficult to solve analytically. Worse, model-based techniques cannot be ut… ▽ More The stochastic $H_{\infty}$ control is studied for a linear stochastic Itô system with an unknown system model. The linear stochastic $H_{\infty}$ control issue is known to be transformable into the problem of solving a so-called generalized algebraic Riccati equation (GARE), which is a nonlinear equation that is typically difficult to solve analytically. Worse, model-based techniques cannot be utilized to approximately solve a GARE when an accurate system model is unavailable or prohibitively expensive to construct in reality. To address these issues, an off-policy reinforcement learning (RL) approach is presented to learn the solution of a GARE from real system data rather than a system model; its convergence is demonstrated, and the robustness of RL to errors in the learning process is investigated. In the off-policy RL approach, the system data may be created with behavior policies rather than the target policies, which is highly significant and promising for use in actual systems. Finally, the proposed off-policy RL approach is validated on a stochastic linear F-16 aircraft system. △ Less

Submitted 7 March, 2024; originally announced March 2024.

arXiv:2403.01340 [pdf, ps, other]

An eternal hypersurface flow arising in centro-affine geometry

Authors: Xinjie Jiang, Changzheng Qu, Yun Yang

Abstract: In this paper, the existence and uniqueness for a specific centro-affine invariant hypersurface flow in $R^{n+1}$ are studied, and the corresponding evolutionary processes in both centro-affine and Euclidean settings are explored. It turns out that the flow exhibits similar properties as the standard heat flow. In addition, the long time existence of the flow is investigated, which asserts that th… ▽ More In this paper, the existence and uniqueness for a specific centro-affine invariant hypersurface flow in $R^{n+1}$ are studied, and the corresponding evolutionary processes in both centro-affine and Euclidean settings are explored. It turns out that the flow exhibits similar properties as the standard heat flow. In addition, the long time existence of the flow is investigated, which asserts that the hypersurface governed by the flow converges asymptotically toward an ellipsoid via systematically investigating evolutions of the centro-affine invariants. Furthermore, the classification of the eternal solutions for the flow is provided. △ Less

Submitted 2 March, 2024; originally announced March 2024.

arXiv:2402.18071 [pdf, ps, other]

Improved uniform error bounds for long-time dynamics of the high-dimensional nonlinear space fractional sine-Gordon equation with weak nonlinearity

Authors: Junqing Jia, Xiaoqing Chi, Xiaoyun Jiang

Abstract: In this paper, we derive the improved uniform error bounds for the long-time dynamics of the $d$-dimensional $(d=2,3)$ nonlinear space fractional sine-Gordon equation (NSFSGE). The nonlinearity strength of the NSFSGE is characterized by $\varepsilon^2$ where $0<\varepsilon \le 1$ is a dimensionless parameter. The second-order time-splitting method is applied to the temporal discretization and the… ▽ More In this paper, we derive the improved uniform error bounds for the long-time dynamics of the $d$-dimensional $(d=2,3)$ nonlinear space fractional sine-Gordon equation (NSFSGE). The nonlinearity strength of the NSFSGE is characterized by $\varepsilon^2$ where $0<\varepsilon \le 1$ is a dimensionless parameter. The second-order time-splitting method is applied to the temporal discretization and the Fourier pseudo-spectral method is used for the spatial discretization. To obtain the explicit relation between the numerical errors and the parameter $\varepsilon$, we introduce the regularity compensation oscillation technique to the convergence analysis of fractional models. Then we establish the improved uniform error bounds $O\left(\varepsilon^2 τ^2\right)$ for the semi-discretization scheme and $O\left(h^m+\varepsilon^2 τ^2\right)$ for the full-discretization scheme up to the long time at $O(1/\varepsilon^2)$. Further, we extend the time-splitting Fourier pseudo-spectral method to the complex NSFSGE as well as the oscillatory complex NSFSGE, and the improved uniform error bounds for them are also given. Finally, extensive numerical examples in two-dimension or three-dimension are provided to support the theoretical analysis. The differences in dynamic behaviors between the fractional sine-Gordon equation and classical sine-Gordon equation are also discussed. △ Less

Submitted 28 February, 2024; originally announced February 2024.

MSC Class: 35R11; 35Q55; 65M12; 65M15 ACM Class: G.1

arXiv:2401.14596 [pdf, ps, other]

Sparse factorization of the square all-ones matrix of arbitrary order

Authors: Xin Jiang, Edward Duc Hien Nguyen, César A. Uribe, Bicheng Ying

Abstract: In this paper, we study sparse factorization of the (scaled) square all-ones matrix $J$ of arbitrary order. We introduce the concept of hierarchically banded matrices and propose two types of hierarchically banded factorization of $J$: the reduced hierarchically banded (RHB) factorization and the doubly stochastic hierarchically banded (DSHB) factorization. Based on the DSHB factorization, we prop… ▽ More In this paper, we study sparse factorization of the (scaled) square all-ones matrix $J$ of arbitrary order. We introduce the concept of hierarchically banded matrices and propose two types of hierarchically banded factorization of $J$: the reduced hierarchically banded (RHB) factorization and the doubly stochastic hierarchically banded (DSHB) factorization. Based on the DSHB factorization, we propose the sequential doubly stochastic (SDS) factorization, in which~$J$ is decomposed as a product of sparse, doubly stochastic matrices. Finally, we discuss the application of the proposed sparse factorizations to the decentralized average consensus problem and decentralized optimization. △ Less

Submitted 25 January, 2024; originally announced January 2024.

arXiv:2401.11468 [pdf, other]

A continuous cusp closing process for negative Kähler-Einstein metrics

Authors: Xin Fu, Hans-Joachim Hein, Xumin Jiang

Abstract: We give an example of a family of smooth complex algebraic surfaces of degree $6$ in $\mathbb{CP}^3$ develo** an isolated elliptic singularity. We show via a gluing construction that the unique Kähler-Einstein metrics of Ricci curvature $-1$ on these sextics develop a complex hyperbolic cusp in the limit, and that near the tip of the forming cusp a Tian-Yau gravitational instanton bubbles off. We give an example of a family of smooth complex algebraic surfaces of degree $6$ in $\mathbb{CP}^3$ develo** an isolated elliptic singularity. We show via a gluing construction that the unique Kähler-Einstein metrics of Ricci curvature $-1$ on these sextics develop a complex hyperbolic cusp in the limit, and that near the tip of the forming cusp a Tian-Yau gravitational instanton bubbles off. △ Less

Submitted 21 January, 2024; originally announced January 2024.

Comments: 65 pages, 3 figures

MSC Class: 53C25; 32Q20

arXiv:2312.03313 [pdf, ps, other]

Boundedness and moduli of traditional stable minimal models

Authors: Xiaowei Jiang

Abstract: For good minimal models with semi-log canonical (slc) singularities, polarized by effective divisors that are relatively ample over the bases of Iitaka fibration, Birkar proves that they belong to a bounded family after fixing appropriate numerical invariants recently. Subsequently, he constructs their projective coarse moduli spaces. In this paper, we consider good minimal models with only Kawama… ▽ More For good minimal models with semi-log canonical (slc) singularities, polarized by effective divisors that are relatively ample over the bases of Iitaka fibration, Birkar proves that they belong to a bounded family after fixing appropriate numerical invariants recently. Subsequently, he constructs their projective coarse moduli spaces. In this paper, we consider good minimal models with only Kawamata log terminal (klt) singularities but polarized by possibly non-effective divisors. We prove that they still belong to a bounded family after fixing the same invariants. As an application, we construct separated coarse moduli spaces for klt good minimal models polarized by line bundles. △ Less

Submitted 17 January, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

Comments: 27 pages, 1 figure, comments are welcome! v2: exposition improved throughout the paper. arXiv admin note: text overlap with arXiv:2211.11237 by other authors

arXiv:2311.14992 [pdf, ps, other]

Model-free Reinforcement Learning for ${H_{2}/H_{\infty}}$ Control of Stochastic Discrete-time Systems

Authors: Xiushan Jiang, Li Wang, Dongya Zhao, Ling Shi

Abstract: This paper proposes a reinforcement learning (RL) algorithm for infinite horizon $\rm {H_{2}/H_{\infty}}$ problem in a class of stochastic discrete-time systems, rather than using a set of coupled generalized algebraic Riccati equations (GAREs). The algorithm is able to learn the optimal control policy for the system even when its parameters are unknown. Additionally, the paper explores the effect… ▽ More This paper proposes a reinforcement learning (RL) algorithm for infinite horizon $\rm {H_{2}/H_{\infty}}$ problem in a class of stochastic discrete-time systems, rather than using a set of coupled generalized algebraic Riccati equations (GAREs). The algorithm is able to learn the optimal control policy for the system even when its parameters are unknown. Additionally, the paper explores the effect of detection noise as well as the convergence of the algorithm, and shows that the control policy is admissible after a finite number of iterations. The algorithm is also able to handle multi-objective control problems within stochastic fields. Finally, the algorithm is applied to the F-16 aircraft autopilot with multiplicative noise. △ Less

Submitted 25 November, 2023; originally announced November 2023.

arXiv:2311.01317 [pdf, other]

On Graphs with Finite-Time Consensus and Their Use in Gradient Tracking

Authors: Edward Duc Hien Nguyen, Xin Jiang, Bicheng Ying, César A. Uribe

Abstract: This paper studies sequences of graphs satisfying the finite-time consensus property (i.e., iterating through such a finite sequence is equivalent to performing global or exact averaging) and their use in Gradient Tracking. We provide an explicit weight matrix representation of the studied sequences and prove their finite-time consensus property. Moreover, we incorporate the studied finite-time co… ▽ More This paper studies sequences of graphs satisfying the finite-time consensus property (i.e., iterating through such a finite sequence is equivalent to performing global or exact averaging) and their use in Gradient Tracking. We provide an explicit weight matrix representation of the studied sequences and prove their finite-time consensus property. Moreover, we incorporate the studied finite-time consensus topologies into Gradient Tracking and present a new algorithmic scheme called Gradient Tracking for Finite-Time Consensus Topologies (GT-FT). We analyze the new scheme for nonconvex problems with stochastic gradient estimates. Our analysis shows that the convergence rate of GT-FT does not depend on the heterogeneity of the agents' functions or the connectivity of any individual graph in the topology sequence. Furthermore, owing to the sparsity of the graphs, GT-FT requires lower communication costs than Gradient Tracking using the static counterpart of the topology sequence. △ Less

Submitted 14 November, 2023; v1 submitted 2 November, 2023; originally announced November 2023.

arXiv:2310.01751 [pdf, other]

A nonmonotone proximal quasi-Newton method for multiobjective optimization

Authors: Xiaoxue Jiang

Abstract: This paper proposes a nonmonotone proximal quasi-Newton algorithm for unconstrained convex multiobjective composite optimization problems. To design the search direction, we minimize the max-scalarization of the variations of the Hessian approximations and nonsmooth terms. Subsequently, a nonmonotone line search is used to determine the step size, we allow for the decrease of a convex combination… ▽ More This paper proposes a nonmonotone proximal quasi-Newton algorithm for unconstrained convex multiobjective composite optimization problems. To design the search direction, we minimize the max-scalarization of the variations of the Hessian approximations and nonsmooth terms. Subsequently, a nonmonotone line search is used to determine the step size, we allow for the decrease of a convex combination of recent function values. Under the assumption of strong convexity of the objective function, we prove that the sequence generated by this method converges to a Pareto optimal. Furthermore, based on the strong convexity, Hessian continuity and Dennis-Moré criterion, we use a basic inequality to derive the local superlinear convergence rate of the proposed algorithm. Numerical experiments results demonstrate the feasibility and effectiveness of the proposed algorithm on a set of test problems. △ Less

Submitted 2 October, 2023; originally announced October 2023.

arXiv:2309.09948 [pdf, ps, other]

The singular sets of degenerate and nonlocal elliptic equations on Poincaré-Einstein manifolds

Authors: Xumin Jiang, Yannick Sire, Ruobing Zhang

Abstract: The main objects of this paper include some degenerate and nonlocal elliptic operators which naturally arise in the conformal invariant theory of Poincaré-Einstein manifolds. These operators generally reflect the correspondence between the Riemannian geometry of a complete Poincaré-Einstein manifold and the conformal geometry of its associated conformal infinity. In this setting, we develop the qu… ▽ More The main objects of this paper include some degenerate and nonlocal elliptic operators which naturally arise in the conformal invariant theory of Poincaré-Einstein manifolds. These operators generally reflect the correspondence between the Riemannian geometry of a complete Poincaré-Einstein manifold and the conformal geometry of its associated conformal infinity. In this setting, we develop the quantitative differentiation theory that includes quantitative stratification for the singular set and Minkowski type estimates for the (quantitatively) stratified singular sets. All these, together with a new $ε$-regularity result for degenerate/singular elliptic operators on Poincaré-Einstein manifolds, lead to uniform Hausdorff measure estimates for the singular sets. Furthermore, the main results in this paper provide a delicate synergy between the geometry of Poincaré-Einstein manifolds and the elliptic theory of associated degenerate elliptic operators. △ Less

Submitted 18 September, 2023; originally announced September 2023.

arXiv:2309.02988 [pdf, ps, other]

Fast time-step** discontinuous Galerkin method for the subdiffusion equation

Authors: Hui Zhang, Fanhai Zeng, Xiaoyun Jiang, Zhimin Zhang

Abstract: The nonlocality of the fractional operator causes numerical difficulties for long time computation of the time-fractional evolution equations. This paper develops a high-order fast time-step** discontinuous Galerkin finite element method for the time-fractional diffusion equations, which saves storage and computational time. The optimal error estimate… ▽ More The nonlocality of the fractional operator causes numerical difficulties for long time computation of the time-fractional evolution equations. This paper develops a high-order fast time-step** discontinuous Galerkin finite element method for the time-fractional diffusion equations, which saves storage and computational time. The optimal error estimate $O(N^{-p-1} + h^{m+1} + \varepsilon N^{rα})$ of the current time-step** discontinuous Galerkin method is rigorous proved, where $N$ denotes the number of time intervals, $p$ is the degree of polynomial approximation on each time subinterval, $h$ is the maximum space step, $r\ge1$, $m$ is the order of finite element space, and $\varepsilon>0$ can be arbitrarily small. Numerical simulations verify the theoretical analysis. △ Less

Submitted 6 September, 2023; originally announced September 2023.

Comments: 21 pages, 1 figure,4 tables

MSC Class: 26A33; 65M06; 65M12; 65M15; 35R11

arXiv:2308.13295 [pdf]

Resolution-independent generative models based on operator learning for physics-constrained Bayesian inverse problems

Authors: Xinchao Jiang, Xin Wang, Ziming Wen, Hu Wang

Abstract: The Bayesian inference approach is widely used to tackle inverse problems due to its versatile and natural ability to handle ill-posedness. However, it often faces challenges when dealing with situations involving continuous fields or large-resolution discrete representations (high-dimensional). Moreover, the prior distribution of unknown parameters is commonly difficult to be determined. In this… ▽ More The Bayesian inference approach is widely used to tackle inverse problems due to its versatile and natural ability to handle ill-posedness. However, it often faces challenges when dealing with situations involving continuous fields or large-resolution discrete representations (high-dimensional). Moreover, the prior distribution of unknown parameters is commonly difficult to be determined. In this study, an Operator Learning-based Generative Adversarial Network (OL-GAN) is proposed and integrated into the Bayesian inference framework to handle these issues. Unlike most Bayesian approaches, the distinctive characteristic of the proposed method is to learn the joint distribution of parameters and responses. By leveraging the trained generative model, the posteriors of the unknown parameters can theoretically be approximated by any sampling algorithm (e.g., Markov Chain Monte Carlo, MCMC) in a low-dimensional latent space shared by the components of the joint distribution. The latent space is typically a simple and easy-to-sample distribution (e.g., Gaussian, uniform), which significantly reduces the computational cost associated with the Bayesian inference while avoiding prior selection concerns. Furthermore, incorporating operator learning enables resolution-independent in the generator. Predictions can be obtained at desired coordinates, and inversions can be performed even if the observation data are misaligned with the training data. Finally, the effectiveness of the proposed method is validated through several numerical experiments. △ Less

Submitted 25 August, 2023; originally announced August 2023.

arXiv:2308.10140 [pdf, ps, other]

On the Convergence of Newton-type Proximal Gradient Method for Multiobjective Optimization Problems

Authors: Jian Chen, Xiaoxue Jiang, Li** Tang, Xinmin Yang

Abstract: In a recent study, Ansary (Optim Methods Softw 38(3):570-590,2023) proposed a Newton-type proximal gradient method for nonlinear multiobjective optimization problems (NPGMO). However, the favorable convergence properties typically associated with Newton-type methods were not established for NPGMO in Ansary's work. In response to this gap, we develop a straightforward framework for analyzing the co… ▽ More In a recent study, Ansary (Optim Methods Softw 38(3):570-590,2023) proposed a Newton-type proximal gradient method for nonlinear multiobjective optimization problems (NPGMO). However, the favorable convergence properties typically associated with Newton-type methods were not established for NPGMO in Ansary's work. In response to this gap, we develop a straightforward framework for analyzing the convergence behavior of the NPGMO. Specifically, under the assumption of strong convexity, we demonstrate that the NPGMO enjoys quadratic termination, superlinear convergence, and quadratic convergence for problems that are quadratic, twice continuously differentiable and twice Lipschitz continuously differentiable, respectively. △ Less

Submitted 19 August, 2023; originally announced August 2023.

Comments: arXiv admin note: text overlap with arXiv:2306.09797

MSC Class: 90C29 and 90C30

arXiv:2308.09864 [pdf]

A novel reduced basis method for adjoint sensitivity analysis of dynamic topology optimization

Authors: Shuhao Li, Hu Wang, Jichao Yin, Xinchao Jiang, Yaya Zhang

Abstract: In gradient-based time domain topology optimization, design sensitivity analysis (DSA) of the dynamic response is essential, and requires high computational cost to directly differentiate, especially for high-order dynamic system. To address this issue, this study develops an efficient reduced basis method (RBM)-based discrete adjoint sensitivity analysis method, which on the one hand significantl… ▽ More In gradient-based time domain topology optimization, design sensitivity analysis (DSA) of the dynamic response is essential, and requires high computational cost to directly differentiate, especially for high-order dynamic system. To address this issue, this study develops an efficient reduced basis method (RBM)-based discrete adjoint sensitivity analysis method, which on the one hand significantly improves the efficiency of sensitivity analysis and on the other hand avoids the consistency errors caused by the continuum method. In this algorithm, the basis functions of the adjoint problem are constructed in the offline phase based on the greedy-POD method, and a novel model-based estimation is developed to facilitate the acceleration of this process. Based on these basis functions, a fast and reasonably accurate model is then built by Galerkin projection for sensitivity analysis in each dynamic topology optimization iteration. Finally, the effectiveness of the error measures, the efficiency and the accuracy of the presented reduced-order method are verified by 2D and 3D dynamic structure studies. △ Less

Submitted 18 August, 2023; originally announced August 2023.

arXiv:2308.06058 [pdf, other]

Adaptive SGD with Polyak stepsize and Line-search: Robust Convergence and Variance Reduction

Authors: Xiaowen Jiang, Sebastian U. Stich

Abstract: The recently proposed stochastic Polyak stepsize (SPS) and stochastic line-search (SLS) for SGD have shown remarkable effectiveness when training over-parameterized models. However, in non-interpolation settings, both algorithms only guarantee convergence to a neighborhood of a solution which may result in a worse output than the initial guess. While artificially decreasing the adaptive stepsize h… ▽ More The recently proposed stochastic Polyak stepsize (SPS) and stochastic line-search (SLS) for SGD have shown remarkable effectiveness when training over-parameterized models. However, in non-interpolation settings, both algorithms only guarantee convergence to a neighborhood of a solution which may result in a worse output than the initial guess. While artificially decreasing the adaptive stepsize has been proposed to address this issue (Orvieto et al. [2022]), this approach results in slower convergence rates for convex and over-parameterized models. In this work, we make two contributions: Firstly, we propose two new variants of SPS and SLS, called AdaSPS and AdaSLS, which guarantee convergence in non-interpolation settings and maintain sub-linear and linear convergence rates for convex and strongly convex functions when training over-parameterized models. AdaSLS requires no knowledge of problem-dependent parameters, and AdaSPS requires only a lower bound of the optimal function value as input. Secondly, we equip AdaSPS and AdaSLS with a novel variance reduction technique and obtain algorithms that require $\smash{\widetilde{\mathcal{O}}}(n+1/ε)$ gradient evaluations to achieve an $\mathcal{O}(ε)$-suboptimality for convex functions, which improves upon the slower $\mathcal{O}(1/ε^2)$ rates of AdaSPS and AdaSLS without variance reduction in the non-interpolation regimes. Moreover, our result matches the fast rates of AdaSVRG but removes the inner-outer-loop structure, which is easier to implement and analyze. Finally, numerical experiments on synthetic and real datasets validate our theory and demonstrate the effectiveness and robustness of our algorithms. △ Less

Submitted 21 August, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

arXiv:2308.03687 [pdf, other]

Almost-sure convergence of iterates and multipliers in stochastic sequential quadratic optimization

Authors: Frank E. Curtis, Xin Jiang, Qi Wang

Abstract: Stochastic sequential quadratic optimization (SQP) methods for solving continuous optimization problems with nonlinear equality constraints have attracted attention recently, such as for solving large-scale data-fitting problems subject to nonconvex constraints. However, for a recently proposed subclass of such methods that is built on the popular stochastic-gradient methodology from the unconstra… ▽ More Stochastic sequential quadratic optimization (SQP) methods for solving continuous optimization problems with nonlinear equality constraints have attracted attention recently, such as for solving large-scale data-fitting problems subject to nonconvex constraints. However, for a recently proposed subclass of such methods that is built on the popular stochastic-gradient methodology from the unconstrained setting, convergence guarantees have been limited to the asymptotic convergence of the expected value of a stationarity measure to zero. This is in contrast to the unconstrained setting in which almost-sure convergence guarantees (of the gradient of the objective to zero) can be proved for stochastic-gradient-based methods. In this paper, new almost-sure convergence guarantees for the primal iterates, Lagrange multipliers, and stationarity measures generated by a stochastic SQP algorithm in this subclass of methods are proved. It is shown that the error in the Lagrange multipliers can be bounded by the distance of the primal iterate to a primal stationary point plus the error in the latest stochastic gradient estimate. It is further shown that, subject to certain assumptions, this latter error can be made to vanish by employing a running average of the Lagrange multipliers that are computed during the run of the algorithm. The results of numerical experiments are provided to demonstrate the proved theoretical guarantees. △ Less

Submitted 7 August, 2023; originally announced August 2023.

Report number: Lehigh ISE Technical Report 23T-019 MSC Class: 49M05; 49M37; 62L20; 65K05; 68W20; 90C26; 90C30; 90C55

arXiv:2306.02001 [pdf, ps, other]

A globally convergent difference-of-convex algorithmic framework and application to log-determinant optimization problems

Authors: Chaorui Yao, Xin Jiang

Abstract: The difference-of-convex algorithm (DCA) is a conceptually simple method for the minimization of (possibly) nonconvex functions that are expressed as the difference of two convex functions. At each iteration, DCA constructs a global overestimator of the objective and solves the resulting convex subproblem. Despite its conceptual simplicity, the theoretical understanding and algorithmic framework o… ▽ More The difference-of-convex algorithm (DCA) is a conceptually simple method for the minimization of (possibly) nonconvex functions that are expressed as the difference of two convex functions. At each iteration, DCA constructs a global overestimator of the objective and solves the resulting convex subproblem. Despite its conceptual simplicity, the theoretical understanding and algorithmic framework of DCA needs further investigation. In this paper, global convergence of DCA at a linear rate is established under an extended Polyak--Łojasiewicz condition. The proposed condition holds for a class of DC programs with a bounded, closed, and convex constraint set, for which global convergence of DCA cannot be covered by existing analyses. Moreover, the DCProx computational framework is proposed, in which the DCA subproblems are solved by a primal--dual proximal algorithm with Bregman distances. With a suitable choice of Bregman distances, DCProx has simple update rules with cheap per-iteration complexity. As an application, DCA is applied to several fundamental problems in network information theory, for which no existing numerical methods are able to compute the global optimum. For these problems, our analysis proves the global convergence of DCA, and more importantly, DCProx solves the DCA subproblems efficiently. Numerical experiments are conducted to verify the efficiency of DCProx. △ Less

Submitted 3 June, 2023; originally announced June 2023.

arXiv:2303.03754 [pdf, other]

Improved uniform error bounds of exponential wave integrator method for long-time dynamics of the space fractional Klein-Gordon equation with weak nonlinearity

Authors: Junqing Jia, Xiaoyun Jiang

Abstract: An improved uniform error bound at $O\left(h^m+\varepsilon^2 τ^2\right)$ is established in $H^{α/2}$-norm for the long-time dynamics of the nonlinear space fractional Klein-Gordon equation (NSFKGE). A second-order exponential wave integrator (EWI) method is used to semi-discretize NSFKGE in time and the Fourier spectral method in space is applied to derive the full-discretization scheme. Regularit… ▽ More An improved uniform error bound at $O\left(h^m+\varepsilon^2 τ^2\right)$ is established in $H^{α/2}$-norm for the long-time dynamics of the nonlinear space fractional Klein-Gordon equation (NSFKGE). A second-order exponential wave integrator (EWI) method is used to semi-discretize NSFKGE in time and the Fourier spectral method in space is applied to derive the full-discretization scheme. Regularity compensation oscillation (RCO) technique is employed to prove the improved uniform error bounds at $O\left(\varepsilon^2 τ^2\right)$ in temporal semi-discretization and $O\left(h^m+\varepsilon^2 τ^2\right)$ in full-discretization up to the long-time $T_{\varepsilon}=T / \varepsilon^2$ ($T>0$ fixed), respectively. Complex NSFKGE and oscillatory complex NSFKGE with nonlinear terms of general power exponents are also discussed. Finally, the correctness of the theoretical analysis and the effectiveness of the method are verified by numerical examples. △ Less

Submitted 7 March, 2023; originally announced March 2023.

arXiv:2303.01368 [pdf]

doi 10.11996/JG.j.2095-302X.2022060967

A survey of path planning and feedrate interpolation in computer numerical control

Authors: Hong-yu Ma, Li-yong Shen, Xin Jiang, Qiang Zou, Chun-ming Yuan

Abstract: This paper presents a brief survey (in Chinese) on path planning and feedrate interpolation. Numerical control technology is widely employed in the modern manufacturing industry, and related research has been emphasized by academia and industry. The traditional process of numerical control technology is mainly composed of tool path planning and feedrate interpolation. To attain the machining of hi… ▽ More This paper presents a brief survey (in Chinese) on path planning and feedrate interpolation. Numerical control technology is widely employed in the modern manufacturing industry, and related research has been emphasized by academia and industry. The traditional process of numerical control technology is mainly composed of tool path planning and feedrate interpolation. To attain the machining of high speed and precision, several problems in tool path planning and feedrate interpolation are usually transformed into mathematical optimization models. To better undertake the research on the integrated design and optimization idea of tool path planning and feedrate interpolation, it is necessary to systematically review and drawn on the existing representative works. We will introduce the relevant methods and technical progress of tool path planning and feedrate interpolation in CNC machining successively, including tool path planning based on end milling, tool orientation optimization, G-code processing and corner transition, feedrate planning of parameter curves, and some new machining optimization methods proposed recently. △ Less

Submitted 28 February, 2023; originally announced March 2023.

Comments: in Chinese language, a prevision of the published paper: Journal of Graphics, 2022, 43(6): 967-986

ACM Class: I.3.5

Journal ref: [J]. Journal of Graphics, 2022, 43(6): 967-986

arXiv:2212.13833 [pdf, other]

A PML method for signal-propagation problems in axon

Authors: Xue Jiang, Maohui Lyu, Tao Yin, Weiying Zheng

Abstract: This work is focused on the modelling of signal propagations in myelinated axons to characterize the functions of the myelin sheath in the neural structure. Based on reasonable assumptions on the medium properties, we derive a two-dimensional neural-signaling model in cylindrical coordinates from the time-harmonic Maxwell's equations. The well-posedness of model is established upon Dirichlet bound… ▽ More This work is focused on the modelling of signal propagations in myelinated axons to characterize the functions of the myelin sheath in the neural structure. Based on reasonable assumptions on the medium properties, we derive a two-dimensional neural-signaling model in cylindrical coordinates from the time-harmonic Maxwell's equations. The well-posedness of model is established upon Dirichlet boundary conditions at the two ends of the neural structure and the radiative condition in the radial direction of the structure. Using the perfectly matched layer (PML) method, we truncate the unbounded background medium and propose an approximate problem on the truncated domain. The well-posedness of the PML problem and the exponential convergence of the approximate solution to the exact solution are established. Numerical experiments based on finite element discretization are presented to demonstrate the theoretical results and the efficiency of our methods to simulate the signal propagation in axons. △ Less

Submitted 28 December, 2022; originally announced December 2022.

arXiv:2211.14593 [pdf, other]

Fast method and convergence analysis of fractional magnetohydrodynamic coupled flow and heat transfer model for generalized second-grade fluid

Authors: Xiaoqing Chi, Hui Zhang, Xiaoyun Jiang

Abstract: In this paper, we first establish a new fractional magnetohydrodynamic (MHD) coupled flow and heat transfer model for a generalized second-grade fluid. This coupled model consists of a fractional momentum equation and a heat conduction equation with a generalized form of Fourier law. The second-order fractional backward difference formula is applied to the temporal discretization and the Legendre… ▽ More In this paper, we first establish a new fractional magnetohydrodynamic (MHD) coupled flow and heat transfer model for a generalized second-grade fluid. This coupled model consists of a fractional momentum equation and a heat conduction equation with a generalized form of Fourier law. The second-order fractional backward difference formula is applied to the temporal discretization and the Legendre spectral method is used for the spatial discretization. The fully discrete scheme is proved to be stable and convergent with an accuracy of $O(τ^2+N^{-r})$, where $τ$ is the time step size and $N$ is the polynomial degree. To reduce the memory requirements and computational cost, a fast method is developed, which is based on a globally uniform approximation of the trapezoidal rule for integrals on the real line. And the strict convergence of the numerical scheme with this fast method is proved. We present the results of several numerical experiments to verify the effectiveness of the proposed method. Finally, we simulate the unsteady fractional MHD flow and heat transfer of the generalized second-grade fluid through a porous medium. The effects of the relevant parameters on the velocity and temperature are presented and analyzed in detail. △ Less

Submitted 26 November, 2022; originally announced November 2022.

Comments: This paper has been accepted for publication in SCIENCE CHINA Mathematics

MSC Class: 76W05; 35R11; 65M12; 65M70

arXiv:2209.10195 [pdf]

An E-PINN assisted practical uncertainty quantification for inverse problems

Authors: Xinchao Jiang, Xin Wanga, Ziming Wena, Enying Li, Hu Wang

Abstract: How to solve inverse problems is the challenge of many engineering and industrial applications. Recently, physics-informed neural networks (PINNs) have emerged as a powerful approach to solve inverse problems efficiently. However, it is difficult for PINNs to quantify the uncertainty of results. Therefore, this study proposed ensemble PINNs (E-PINNs) to handle this issue. The E-PINN uses ensemble… ▽ More How to solve inverse problems is the challenge of many engineering and industrial applications. Recently, physics-informed neural networks (PINNs) have emerged as a powerful approach to solve inverse problems efficiently. However, it is difficult for PINNs to quantify the uncertainty of results. Therefore, this study proposed ensemble PINNs (E-PINNs) to handle this issue. The E-PINN uses ensemble statistics of several basic models to provide uncertainty quantifications for the inverse solution based on the PINN framework, and it is employed to solve the inverse problems in which the unknown quantity is propagated through partial differential equations (PDEs), especially the identification of the unknown field (e.g., space function) of a given physical system. Compared with other data-driven approaches, the suggested method is more than straightforward to implement, and also obtains high-quality uncertainty estimates of the quantity of interest (QoI) without significantly increasing the complexity of the algorithm. This work discusses the good properties of ensemble learning in field inversion and uncertainty quantification. The effectiveness of the proposed method is demonstrated through several numerical experiments. To enhance the robustness of models, adversarial training (AT) is applied. Furthermore, an adaptive active sampling (AS) strategy based on the uncertainty estimates from E-PINNs is also proposed to improve the accuracy of material field inversion problems. △ Less

Submitted 21 September, 2022; originally announced September 2022.

Comments: 31 pages, 12figures, 3 tables

arXiv:2209.02913 [pdf, other]

A stochastic agent-based model to evaluate COVID-19 transmission influenced by human mobility

Authors: Kejie Chen, Yanqing Li, Rongxin Zhou, Xiaomo Jiang

Abstract: The COVID-19 pandemic has created an urgent need for mathematical models that can project epidemic trends and evaluate the effectiveness of mitigation strategies. To forecast the transmission of COVID-19, a major challenge is the accurate assessment of the multi-scale human mobility and how they impact the infection through close contacts. By combining the stochastic agent-based modeling strategy… ▽ More The COVID-19 pandemic has created an urgent need for mathematical models that can project epidemic trends and evaluate the effectiveness of mitigation strategies. To forecast the transmission of COVID-19, a major challenge is the accurate assessment of the multi-scale human mobility and how they impact the infection through close contacts. By combining the stochastic agent-based modeling strategy and hierarchical structures of spatial containers corresponding to the notion of places in geography, this study proposes a novel model, Mob-Cov, to study the impact of human traveling behaviour and individual health conditions on the disease outbreak and the probability of zero COVID in the population. Specifically, individuals perform power-law type of local movements within a container and global transport between different-level containers. Frequent short movements inside a small-level container (e.g. a road or a county) and a large population size influence the local crowdedness of people, which accelerates the infection and regional transmission. Travels between large-level containers (e.g. cities and nations) facilitate global spread and outbreak. Moreover, dynamic infection and recovery in the population are able to drive the bifurcation of the system to a "zero-COVID" state or a "live with COVID" state, depending on the mobility patterns, population number and health conditions. Reducing total population and local people accumulation as well as restricting global travels help achieve zero-COVID. In summary, the Mob-Cov model considers more realistic human mobility in a wide range of spatial scales, and has been designed with equal emphasis on performance, low simulation cost, accuracy, ease of use and flexibility. It is a useful tool for researchers and politicians to investigate the pandemic dynamics and plan actions against the disease. △ Less

Submitted 17 November, 2022; v1 submitted 6 September, 2022; originally announced September 2022.

arXiv:2207.06627 [pdf, ps, other]

doi 10.1016/j.jfa.2023.109904

A nonlocal curve flow in centro-affine geometry

Authors: Xinjie Jiang, Yun Yang, Yanhua Yu

Abstract: In this paper, the isoperimetric inequality in centro-affine plane geometry is obtained. We also investigate the long-term behavior of an invariant plane curve flow, whose evolution process can be expressed as a second-order nonlinear parabolic equation with respect to centro-affine curvature. The forward and backward limits in time are discussed, which shows that a closed convex embedded curve ma… ▽ More In this paper, the isoperimetric inequality in centro-affine plane geometry is obtained. We also investigate the long-term behavior of an invariant plane curve flow, whose evolution process can be expressed as a second-order nonlinear parabolic equation with respect to centro-affine curvature. The forward and backward limits in time are discussed, which shows that a closed convex embedded curve may converge to an ellipse when evolving according to this flow. △ Less

Submitted 13 July, 2022; originally announced July 2022.

arXiv:2204.10605 [pdf, other]

Distributed stochastic projection-free solver for constrained optimization

Authors: Xia Jiang, Xianlin Zeng, Lihua Xie, Jian Sun, Jie Chen

Abstract: This paper proposes a distributed stochastic projection-free algorithm for large-scale constrained finite-sum optimization whose constraint set is complicated such that the projection onto the constraint set can be expensive. The global cost function is allocated to multiple agents, each of which computes its local stochastic gradients and communicates with its neighbors to solve the global proble… ▽ More This paper proposes a distributed stochastic projection-free algorithm for large-scale constrained finite-sum optimization whose constraint set is complicated such that the projection onto the constraint set can be expensive. The global cost function is allocated to multiple agents, each of which computes its local stochastic gradients and communicates with its neighbors to solve the global problem. Stochastic gradient methods enable low computational cost, while they are hard and slow to converge due to the variance caused by random sampling. To construct a convergent distributed stochastic projection-free algorithm, this paper incorporates a variance reduction technique and gradient tracking technique in the Frank-Wolfe update. We develop a sampling rule for the variance reduction technique to reduce the variance introduced by stochastic gradients. Complete and rigorous proofs show that the proposed distributed projection-free algorithm converges with a sublinear convergence rate and enjoys superior complexity guarantees for both convex and non-convex objective functions. By comparative simulations, we demonstrate the convergence and computational efficiency of the proposed algorithm. △ Less

Submitted 22 April, 2022; originally announced April 2022.

Comments: 14 pages

arXiv:2203.00252 [pdf, ps, other]

doi 10.1007/s10957-022-02125-9

Bregman three-operator splitting methods

Authors: Xin Jiang, Lieven Vandenberghe

Abstract: The paper presents primal-dual proximal splitting methods for convex optimization, in which generalized Bregman distances are used to define the primal and dual proximal update steps. The methods extend the primal and dual Condat-Vu algorithms and the primal-dual three-operator (PD3O) algorithm. The Bregman extensions of the Condat-Vu algorithms are derived from the Bregman proximal point method a… ▽ More The paper presents primal-dual proximal splitting methods for convex optimization, in which generalized Bregman distances are used to define the primal and dual proximal update steps. The methods extend the primal and dual Condat-Vu algorithms and the primal-dual three-operator (PD3O) algorithm. The Bregman extensions of the Condat-Vu algorithms are derived from the Bregman proximal point method applied to a monotone inclusion problem. Based on this interpretation, a unified framework for the convergence analysis of the two methods is presented. We also introduce a line search procedure for stepsize selection in the Bregman dual Condat-Vu algorithm applied to equality-constrained problems. Finally, we propose a Bregman extension of PD3O and analyze its convergence. △ Less

Submitted 3 October, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

Journal ref: Journal of Optimization Theory and Applications 196, 936-972 (2023)

arXiv:2202.09203 [pdf, other]

An Adaptive Finite Element DtN Method for Maxwell's Equations

Authors: Gang Bao, Mingming Zhang, Xue Jiang, Peijun Li, Xiaokai Yuan

Abstract: This paper is concerned with a numerical solution to the scattering of a time-harmonic electromagnetic wave by a bounded and impenetrable obstacle in three dimensions. The electromagnetic wave propagation is modeled by a boundary value problem of Maxwell's equations in the exterior domain of the obstacle. Based on the Dirichlet-to-Neumann (DtN) operator, which is defined by an infinite series, an… ▽ More This paper is concerned with a numerical solution to the scattering of a time-harmonic electromagnetic wave by a bounded and impenetrable obstacle in three dimensions. The electromagnetic wave propagation is modeled by a boundary value problem of Maxwell's equations in the exterior domain of the obstacle. Based on the Dirichlet-to-Neumann (DtN) operator, which is defined by an infinite series, an exact transparent boundary condition is introduced and the scattering problem is reduced equivalently into a bounded domain. An a posteriori error estimate based adaptive finite element DtN method is developed to solve the discrete variational problem, where the DtN operator is truncated into a sum of finitely many terms. The a posteriori error estimate takes into account both the finite element approximation error and the truncation error of the DtN operator. The latter is shown to decay exponentially with respect to the truncation parameter. Numerical experiments are presented to illustrate the effectiveness of the proposed method. △ Less

Submitted 18 February, 2022; originally announced February 2022.

arXiv:2201.08190 [pdf]

doi 10.1115/1.4053727

Topology optimization on complex surfaces based on the moving morphable component (MMC) method and computational conformal map** (CCM)

Authors: Wendong Huo, Chang Liu, Zongliang Du, Xudong Jiang, Zhengyu Liu, Xu Guo

Abstract: In the present paper, an integrated paradigm for topology optimization on complex surfaces with arbitrary genus is proposed. The approach is constructed based on the two-dimensional (2D) Moving Morphable Component (MMC) framework, where a set of structural components are used as the basic units of optimization, and computational conformal map** (CCM) technique, with which a complex surface repre… ▽ More In the present paper, an integrated paradigm for topology optimization on complex surfaces with arbitrary genus is proposed. The approach is constructed based on the two-dimensional (2D) Moving Morphable Component (MMC) framework, where a set of structural components are used as the basic units of optimization, and computational conformal map** (CCM) technique, with which a complex surface represented by an unstructured triangular mesh can be mapped into a set of regular 2D parameter domains numerically. A multi-patch stitching scheme is also developed to achieve an MMC-friendly global parameterization through a number of local parameterizations. Numerical examples including a saddle-shaped shell, a torus-shape shell and a tee-branch pipe are solved to demonstrate the validity and efficiency of the proposed approach. It is found that compared with traditional approaches for topology optimization on 2D surfaces, optimized designs with clear load transmission paths can be obtained with much fewer numbers of design variables and degrees of freedom for finite element analysis (FEA) via the proposed approach. △ Less

Submitted 2 February, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

Journal ref: ASME. J. Appl. Mech (2022)

arXiv:2111.07340 [pdf, ps, other]

Computing Groebner bases of ideal interpolation

Authors: Xue Jiang, Yihe Gong

Abstract: We present algorithms for computing the reduced Gröbner basis of the vanishing ideal of a finite set of points in a frame of ideal interpolation. Ideal interpolation is defined by a linear projector whose kernel is a polynomial ideal. In this paper, we translate interpolation condition functionals into formal power series via Taylor expansion, then the reduced Gröbner basis is read from formal pow… ▽ More We present algorithms for computing the reduced Gröbner basis of the vanishing ideal of a finite set of points in a frame of ideal interpolation. Ideal interpolation is defined by a linear projector whose kernel is a polynomial ideal. In this paper, we translate interpolation condition functionals into formal power series via Taylor expansion, then the reduced Gröbner basis is read from formal power series by Gaussian elimination. Our algorithm has a polynomial time complexity. It compares favorably with MMM algorithm in single point ideal interpolation and some several points ideal interpolation. △ Less

Submitted 14 January, 2024; v1 submitted 14 November, 2021; originally announced November 2021.

Comments: arXiv admin note: text overlap with arXiv:2007.11830

arXiv:2111.03820 [pdf, other]

Distributed stochastic proximal algorithm with random reshuffling for non-smooth finite-sum optimization

Authors: Xia Jiang, Xianlin Zeng, Jian Sun, Jie Chen, Lihua Xie

Abstract: The non-smooth finite-sum minimization is a fundamental problem in machine learning. This paper develops a distributed stochastic proximal-gradient algorithm with random reshuffling to solve the finite-sum minimization over time-varying multi-agent networks. The objective function is a sum of differentiable convex functions and non-smooth regularization. Each agent in the network updates local var… ▽ More The non-smooth finite-sum minimization is a fundamental problem in machine learning. This paper develops a distributed stochastic proximal-gradient algorithm with random reshuffling to solve the finite-sum minimization over time-varying multi-agent networks. The objective function is a sum of differentiable convex functions and non-smooth regularization. Each agent in the network updates local variables with a constant step-size by local information and cooperates to seek an optimal solution. We prove that local variable estimates generated by the proposed algorithm achieve consensus and are attracted to a neighborhood of the optimal solution in expectation with an $\mathcal{O}(\frac{1}{T}+\frac{1}{\sqrt{T}})$ convergence rate, where $T$ is the total number of iterations. Finally, some comparative simulations are provided to verify the convergence performance of the proposed algorithm. △ Less

Submitted 10 October, 2022; v1 submitted 6 November, 2021; originally announced November 2021.

Comments: 15 pages, 7 figures

arXiv:2110.09906 [pdf, ps, other]

On the divisibility of sums of even powers of $q$-binomial coefficients

Authors: Ji-Cai Liu, Xue-Ting Jiang

Abstract: We prove the divisibility conjecture on sums of even powers of $q$-binomial coefficients, which was recently proposed by Guo, Schlosser and Zudilin. Our proof relies on two $q$-harmonic series congruences due to Shi and Pan. We prove the divisibility conjecture on sums of even powers of $q$-binomial coefficients, which was recently proposed by Guo, Schlosser and Zudilin. Our proof relies on two $q$-harmonic series congruences due to Shi and Pan. △ Less

Submitted 19 October, 2021; originally announced October 2021.

Comments: 6 pages

MSC Class: 11A07; 11B65; 05A10

arXiv:2109.10690 [pdf, ps, other]

Riesz representation theorems for positive algebra homomorphisms

Authors: Marcel de Jeu, Xingni Jiang

Abstract: Let $X$ be a locally compact Hausdorff space, let $\mathrm A$ be a partially ordered algebra, and let $T:\mathrm{C}_{\mathrm c}(X)\to \mathrm A$ be a positive algebra homomorphism. Under conditions on $\mathrm A$ that are satisfied in a good number of cases of practical interest, we show that $T$ is represented by a (unique regular) measure $μ$ on the Borel $σ$-algebra of $X$ that takes it values… ▽ More Let $X$ be a locally compact Hausdorff space, let $\mathrm A$ be a partially ordered algebra, and let $T:\mathrm{C}_{\mathrm c}(X)\to \mathrm A$ be a positive algebra homomorphism. Under conditions on $\mathrm A$ that are satisfied in a good number of cases of practical interest, we show that $T$ is represented by a (unique regular) measure $μ$ on the Borel $σ$-algebra of $X$ that takes it values in the positive cone of $\mathrm A$, and with the property that $μ(A_1\cap A_2)=μ(A_1)μ(A_2)$ for Borel subsets $A_1,A_2$ of $X$. The positive algebra homomorphism $T$ can be extended from ${\mathrm C}_{\mathrm c}(X)$ to the accompanying $\mathcal L^1$-space of $μ$. We show that, quite often, this $\mathcal L^1$-space is closed under multiplication, so that it is a Riesz algebra, and that the extended map $T:\mathcal L^1\to\mathrm A$ is not only an algebra homomorphism, but, even when $\mathrm A$ is not a Riesz space, also a vector lattice homomorphism in a sense that is explained in the paper. The latter property enables one to describe images of the extended map in terms of sequential up-downs and down-ups of the image of (the positive cone of ) ${\mathrm C}_{\mathrm c}(X)$ when $\mathrm A$ has the countable sup property. We apply the main results, which are obtained by purely order-theoretical methods, to positive algebra homomorphisms from ${\mathrm C}_0(X)$ into the order continuous operators on a Banach lattice, and to representations of ${\mathrm C}_0(X,\mathbb C)$ on Hilbert spaces. It is thus seen that, for representations on Banach lattices and on Hilbert spaces, although situated in rather different contexts, spectral theorems can be established that are both rooted in the same order-theoretical Riesz representation theorem for positive algebra homomorphisms from ${\mathrm C_\mathrm c}(X)$ into partially ordered algebras. △ Less

Submitted 21 September, 2021; originally announced September 2021.

Comments: 51 pages

MSC Class: Primary 47B99; Secondary 06F25; 28B15

arXiv:2109.04626 [pdf, other]

A Fast PC Algorithm with Reversed-order Pruning and A Parallelization Strategy

Authors: Kai Zhang, Chao Tian, Kun Zhang, Todd Johnson, Xiaoqian Jiang

Abstract: The PC algorithm is the state-of-the-art algorithm for causal structure discovery on observational data. It can be computationally expensive in the worst case due to the conditional independence tests are performed in an exhaustive-searching manner. This makes the algorithm computationally intractable when the task contains several hundred or thousand nodes, particularly when the true underlying c… ▽ More The PC algorithm is the state-of-the-art algorithm for causal structure discovery on observational data. It can be computationally expensive in the worst case due to the conditional independence tests are performed in an exhaustive-searching manner. This makes the algorithm computationally intractable when the task contains several hundred or thousand nodes, particularly when the true underlying causal graph is dense. We propose a critical observation that the conditional set rendering two nodes independent is non-unique, and including certain redundant nodes do not sacrifice result accuracy. Based on this finding, the innovations of our work are two-folds. First, we innovate on a reserve order linkage pruning PC algorithm which significantly increases the algorithm's efficiency. Second, we propose a parallel computing strategy for statistical independence tests by leveraging tensor computation, which brings further speedup. We also prove the proposed algorithm does not induce statistical power loss under mild graph and data dimensionality assumptions. Experimental results show that the single-threaded version of the proposed algorithm can achieve a 6-fold speedup compared to the PC algorithm on a dense 95-node graph, and the parallel version can make a 825-fold speed-up. We also provide proof that the proposed algorithm is consistent under the same set of conditions with conventional PC algorithm. △ Less

Submitted 9 September, 2021; originally announced September 2021.

Comments: 37 pages

arXiv:2109.00996 [pdf]

A Physics-Data-Driven Bayesian Method for Heat Conduction Problems

Authors: Xinchao Jiang, Hu Wang, Yu li

Abstract: In this study, a novel physics-data-driven Bayesian method named Heat Conduction Equation assisted Bayesian Neural Network (HCE-BNN) is proposed. The HCE-BNN is constructed based on the Bayesian neural network, it is a physics-informed machine learning strategy. Compared with the existed pure data driven method, to acquire physical consistency and better performance of the data-driven model, the h… ▽ More In this study, a novel physics-data-driven Bayesian method named Heat Conduction Equation assisted Bayesian Neural Network (HCE-BNN) is proposed. The HCE-BNN is constructed based on the Bayesian neural network, it is a physics-informed machine learning strategy. Compared with the existed pure data driven method, to acquire physical consistency and better performance of the data-driven model, the heat conduction equation is embedded into the loss function of the HCE-BNN as a regularization term. Hence, the proposed method can build a more reliable model by physical constraints with less data. The HCE-BNN can handle the forward and inverse problems consistently, that is, to infer unknown responses from known partial responses, or to identify boundary conditions or material parameters from known responses. Compared with the exact results, the test results demonstrate that the proposed method can be applied to both heat conduction forward and inverse problems successfully. In addition, the proposed method can be implemented with the noisy data and gives the corresponding uncertainty quantification for the solutions. △ Less

Submitted 2 September, 2021; originally announced September 2021.

Comments: 27 pages, 10 figures, 6 tables

arXiv:2108.13390 [pdf, ps, other]

Asymptotics of Kähler-Einstein metrics on complex hyperbolic cusps

Authors: Xin Fu, Hans-Joachim Hein, Xumin Jiang

Abstract: Let $L$ be a negative holomorphic line bundle over an $(n-1)$-dimensional complex torus $D$. Let $h$ be a Hermitian metric on $L$ such that the curvature form of the dual Hermitian metric defines a flat Kähler metric on $D$. Then $h$ is unique up to scaling, and, for some closed tubular neighborhood $V$ of the zero section $D \subset L$, the form… ▽ More Let $L$ be a negative holomorphic line bundle over an $(n-1)$-dimensional complex torus $D$. Let $h$ be a Hermitian metric on $L$ such that the curvature form of the dual Hermitian metric defines a flat Kähler metric on $D$. Then $h$ is unique up to scaling, and, for some closed tubular neighborhood $V$ of the zero section $D \subset L$, the form $ω_h = -(n+1)i\partial\overline\partial\log(-{\log h})$ defines a complete Kähler-Einstein metric on $V \setminus D$ with ${\rm Ric}(ω_h) = -ω_h$. In fact, $ω_h$ is complex hyperbolic, i.e., the holomorphic sectional curvature of $ω_h$ is constant, and $ω_h$ has the usual doubly-warped cusp structure familiar from complex hyperbolic geometry. In this paper, we prove that if $U$ is another closed tubular neighborhood of the zero section and if $ω$ is a complete Kähler-Einstein metric with ${\rm Ric}(ω) = -ω$ on $U \setminus D$, then there exist a Hermitian metric $h$ as above and a $δ\in \mathbb{R}^+$ such that $ω- ω_{h} = O(e^{-δ\sqrt{-{\log h}}})$ to all orders with respect to $ω_h$ as $h \to 0$. This rate is doubly exponential in the distance from a fixed point, and is sharp. △ Less

Submitted 9 November, 2021; v1 submitted 30 August, 2021; originally announced August 2021.

arXiv:2108.00993 [pdf, ps, other]

doi 10.1007/s00029-022-00798-8

Special MMP for log canonical generalised pairs

Authors: Vladimir Lazić, Nikolaos Tsakanikas, with an appendix joint with Xiaowei Jiang

Abstract: We show that minimal models of $\mathbb{Q}$-factorial NQC log canonical generalised pairs exist, assuming the existence of minimal models of smooth varieties. More generally, we prove that on a $\mathbb{Q}$-factorial NQC log canonical generalised pair $ (X,B+M) $ we can run an MMP with scaling of an ample divisor which terminates, assuming that it admits an NQC weak Zariski decomposition or that… ▽ More We show that minimal models of $\mathbb{Q}$-factorial NQC log canonical generalised pairs exist, assuming the existence of minimal models of smooth varieties. More generally, we prove that on a $\mathbb{Q}$-factorial NQC log canonical generalised pair $ (X,B+M) $ we can run an MMP with scaling of an ample divisor which terminates, assuming that it admits an NQC weak Zariski decomposition or that $K_X+B+M$ is not pseudoeffective. As a consequence, we establish several existence results for minimal models and Mori fibre spaces. △ Less

Submitted 10 August, 2022; v1 submitted 2 August, 2021; originally announced August 2021.

Comments: v5: minor changes in the appendix; to appear in Sel. Math. New Ser

MSC Class: 14E30

Journal ref: Selecta Math. New Ser. 28, 89 (2022)

arXiv:2106.14479 [pdf, other]

Distributed stochastic gradient tracking algorithm with variance reduction for non-convex optimization

Authors: Xia Jiang, Xianlin Zeng, Jian Sun, Jie Chen

Abstract: This paper proposes a distributed stochastic algorithm with variance reduction for general smooth non-convex finite-sum optimization, which has wide applications in signal processing and machine learning communities. In distributed setting, large number of samples are allocated to multiple agents in the network. Each agent computes local stochastic gradient and communicates with its neighbors to s… ▽ More This paper proposes a distributed stochastic algorithm with variance reduction for general smooth non-convex finite-sum optimization, which has wide applications in signal processing and machine learning communities. In distributed setting, large number of samples are allocated to multiple agents in the network. Each agent computes local stochastic gradient and communicates with its neighbors to seek for the global optimum. In this paper, we develop a modified variance reduction technique to deal with the variance introduced by stochastic gradients. Combining gradient tracking and variance reduction techniques, this paper proposes a distributed stochastic algorithm, GT-VR, to solve large-scale non-convex finite-sum optimization over multi-agent networks. A complete and rigorous proof shows that the GT-VR algorithm converges to first-order stationary points with $O(\frac{1}{k})$ convergence rate. In addition, we provide the complexity analysis of the proposed algorithm. Compared with some existing first-order methods, the proposed algorithm has a lower $\mathcal{O}(PMε^{-1})$ gradient complexity under some mild condition. By comparing state-of-the-art algorithms and GT-VR in experimental simulations, we verify the efficiency of the proposed algorithm. △ Less

Submitted 21 July, 2021; v1 submitted 28 June, 2021; originally announced June 2021.

Comments: 11pages

arXiv:2106.12415 [pdf, ps, other]

On the finiteness of the Morse index of self-shrinkers

Authors: Xu-Yong Jiang, He-Jun Sun, Peibiao Zhao

Abstract: In this paper, we present a sufficient condition for finite Morse index of complete properly self-shrinkers. We prove that a complete properly embedded self-shrinker in $\mathbb{R}^{n+1}$ with finite asymptotically conical ends or asymptotically cylindrical ends must have finite Morse index. Moreover, as an application of this result, we show that a complete properly embedded self-shrinker in… ▽ More In this paper, we present a sufficient condition for finite Morse index of complete properly self-shrinkers. We prove that a complete properly embedded self-shrinker in $\mathbb{R}^{n+1}$ with finite asymptotically conical ends or asymptotically cylindrical ends must have finite Morse index. Moreover, as an application of this result, we show that a complete properly embedded self-shrinker in $\mathbb{R}^3$ with finite genus has finite Morse index. △ Less

Submitted 23 June, 2021; originally announced June 2021.

Comments: 13 pages

MSC Class: 53C24; 53C42; 53C21

arXiv:2104.12153 [pdf, ps, other]

doi 10.1007/s11117-022-00880-7

Riesz representation theorems for positive linear operators

Authors: Marcel de Jeu, Xingni Jiang

Abstract: We generalise the Riesz representation theorems for positive linear functionals on $\mathrm{C}_{\mathrm c}(X)$ and $\mathrm{C}_{\mathrm 0}(X)$, where $X$ is a locally compact Hausdorff space, to positive linear operators from these spaces into a partially ordered vector space $E$. The representing measures are defined on the Borel $σ$-algebra of $X$ and take their values in the extended positive c… ▽ More We generalise the Riesz representation theorems for positive linear functionals on $\mathrm{C}_{\mathrm c}(X)$ and $\mathrm{C}_{\mathrm 0}(X)$, where $X$ is a locally compact Hausdorff space, to positive linear operators from these spaces into a partially ordered vector space $E$. The representing measures are defined on the Borel $σ$-algebra of $X$ and take their values in the extended positive cone of $E$; the corresponding integrals are order integrals. We give explicit formulas for the values of the representing measures at open and at compact subsets of $X$. Results are included where the space $E$ need not be a vector lattice, nor a normed space. Representing measures exist for positive linear operators into Banach lattices with order continuous norms, into the regular operators on a KB-space, into the self-adjoint linear operators in a weakly closed complex linear subspace of the bounded linear operators on a complex Hilbert space, and into JBW-algebras. △ Less

Submitted 4 January, 2022; v1 submitted 25 April, 2021; originally announced April 2021.

Comments: This version has 39 pages. Some minor improvements in presentation and notation have been made. It is the final version which will appear in Banach J. Math. Anal

Journal ref: Banach J. Math. Anal. 16 (2022), no.3, Paper No. 44, 40pp

arXiv:2104.08745 [pdf, ps, other]

doi 10.1007/s11117-022-00880-7

Order Integrals

Authors: Marcel de Jeu, Xingni Jiang

Abstract: We define an integral of real-valued functions with respect to a measure that takes its values in the extended positive cone of a partially ordered vector space $E$. The monotone convergence theorem, Fatou's lemma, and the dominated convergence theorem are established; the analogues of the classical ${\mathcal L}^1$- and ${\mathrm L}^1$-spaces are investigated. The results extend earlier work by W… ▽ More We define an integral of real-valued functions with respect to a measure that takes its values in the extended positive cone of a partially ordered vector space $E$. The monotone convergence theorem, Fatou's lemma, and the dominated convergence theorem are established; the analogues of the classical ${\mathcal L}^1$- and ${\mathrm L}^1$-spaces are investigated. The results extend earlier work by Wright and specialise to those for the Lebesgue integral when $E$ equals the real numbers. The hypothesis on $E$ that is needed for the definition of the integral and for the monotone convergence theorem to hold ($σ$-monotone completeness) is a rather mild one. It is satisfied, for example, by the space of regular operators between a directed partially ordered vector space and a $σ$-monotone complete partially ordered vector space, and by every JBW-algebra. Fatou's lemma and the dominated convergence theorem hold for every $σ$-Dedekind complete space. When $E$ consists of the regular operators on a Banach lattice with an order continuous norm, or when it consists of the self-adjoint elements of a strongly closed complex linear subspace of the bounded operators on a complex Hilbert space, then the finite measures as in the current paper are precisely the strongly $σ$-additive positive operator-valued measures. When $E$ is a partially ordered Banach space with a closed positive cone, then every positive vector measure is a measure in our sense, but not conversely. Even when a measure falls into both categories, the domain of the integral as defined in this paper can properly contain that of any reasonably defined integral with respect to the vector measure using Banach space methods. △ Less

Submitted 9 November, 2021; v1 submitted 18 April, 2021; originally announced April 2021.

Comments: Current version contains 39 pages. Several minor improvements in presentation have been made. Final version, to appear in Positivity

Journal ref: Positivity 26 (2022), no. 2, Paper No. 32, 39 pp

arXiv:2104.00360 [pdf, other]

Distributed synchronous and asynchronous algorithms for semi-definite programming with diagonal constraints

Authors: Xia Jiang, Xianlin Zeng, Jian Sun, Jie Chen

Abstract: This paper develops distributed synchronous and asynchronous algorithms for the large-scale semi-definite programming with diagonal constraints, which has wide applications in combination optimization, image processing and community detection. The information of the semi-definite programming is allocated to multiple interconnected agents such that each agent aims to find a solution by communicatin… ▽ More This paper develops distributed synchronous and asynchronous algorithms for the large-scale semi-definite programming with diagonal constraints, which has wide applications in combination optimization, image processing and community detection. The information of the semi-definite programming is allocated to multiple interconnected agents such that each agent aims to find a solution by communicating to its neighbors. Based on low-rank property of solutions and the Burer-Monteiro factorization, we transform the original problem into a distributed optimization problem over unit spheres to reduce variable dimensions and ensure positive semi-definiteness without involving semi-definite projections, which are computationally expensive. For the distributed optimization problem, we propose distributed synchronous and asynchronous algorithms, both of which reduce computational burden and storage space compared with existing centralized algorithms. Specifically, the distributed synchronous algorithm almost surely escapes strict saddle points and converges to the set of optimal solutions to the optimization problem. In addition, the proposed distributed asynchronous algorithm allows communication delays and converges to the set of critical points to the optimization problem under mild conditions. By applying proposed algorithms to image segmentation applications, we illustrate the efficiency and convergence performance of the two proposed algorithms. △ Less

Submitted 1 April, 2021; originally announced April 2021.

Comments: 15 pages

arXiv:2103.02271 [pdf, other]

Distributed proximal gradient algorithm for non-smooth non-convex optimization over time-varying networks

Authors: Xia Jiang, Xianlin Zeng, Jian Sun, Jie Chen

Abstract: This note studies the distributed non-convex optimization problem with non-smooth regularization, which has wide applications in decentralized learning, estimation and control. The objective function is the sum of different local objective functions, which consist of differentiable (possibly non-convex) cost functions and non-smooth convex functions. This paper presents a distributed proximal grad… ▽ More This note studies the distributed non-convex optimization problem with non-smooth regularization, which has wide applications in decentralized learning, estimation and control. The objective function is the sum of different local objective functions, which consist of differentiable (possibly non-convex) cost functions and non-smooth convex functions. This paper presents a distributed proximal gradient algorithm for the non-smooth non-convex optimization problem over time-varying multi-agent networks. Each agent updates local variable estimate by the multi-step consensus operator and the proximal operator. We prove that the generated local variables achieve consensus and converge to the set of critical points with convergence rate $O(1/T)$. Finally, we verify the efficacy of proposed algorithm by numerical simulations. △ Less

Submitted 3 March, 2021; originally announced March 2021.

arXiv:2102.10012 [pdf, other]

Analytics and Machine Learning in Vehicle Routing Research

Authors: Ruibin Bai, Xinan Chen, Zhi-Long Chen, Tianxiang Cui, Shuhui Gong, Wentao He, ** Jiang, Huan **, Jiahuan **, Graham Kendall, Jiawei Li, Zheng Lu, Jianfeng Ren, Paul Weng, Ning Xue, Huayan Zhang

Abstract: The Vehicle Routing Problem (VRP) is one of the most intensively studied combinatorial optimisation problems for which numerous models and algorithms have been proposed. To tackle the complexities, uncertainties and dynamics involved in real-world VRP applications, Machine Learning (ML) methods have been used in combination with analytical approaches to enhance problem formulations and algorithmic… ▽ More The Vehicle Routing Problem (VRP) is one of the most intensively studied combinatorial optimisation problems for which numerous models and algorithms have been proposed. To tackle the complexities, uncertainties and dynamics involved in real-world VRP applications, Machine Learning (ML) methods have been used in combination with analytical approaches to enhance problem formulations and algorithmic performance across different problem solving scenarios. However, the relevant papers are scattered in several traditional research fields with very different, sometimes confusing, terminologies. This paper presents a first, comprehensive review of hybrid methods that combine analytical techniques with ML tools in addressing VRP problems. Specifically, we review the emerging research streams on ML-assisted VRP modelling and ML-assisted VRP optimisation. We conclude that ML can be beneficial in enhancing VRP modelling, and improving the performance of algorithms for both online and offline VRP optimisations. Finally, challenges and future opportunities of VRP research are discussed. △ Less

Submitted 19 February, 2021; originally announced February 2021.

Comments: Submitted to International Journal of Production Research

Showing 1–50 of 93 results for author: Jiang, X