-
Computational Graph Representation of Equations System Constructors in Hierarchical Circuit Simulation
Authors:
Zichao Long,
Lin Li,
Lei Han,
Xianglong Meng,
Chongjun Ding,
Ruiyan Li,
Wu Jiang,
Fuchen Ding,
Jiaqing Yue,
Zhichao Li,
Yisheng Hu,
Ding Li,
Heng Liao
Abstract:
Equations system constructors of hierarchical circuits play a central role in device modeling, nonlinear equations solving, and circuit design automation. However, existing constructors present limitations in applications to different extents. For example, the costs of develo** and reusing device models -- especially coarse-grained equivalent models of circuit modules -- remain high while parame…
▽ More
Equations system constructors of hierarchical circuits play a central role in device modeling, nonlinear equations solving, and circuit design automation. However, existing constructors present limitations in applications to different extents. For example, the costs of develo** and reusing device models -- especially coarse-grained equivalent models of circuit modules -- remain high while parameter sensitivity analysis is complex and inefficient. Inspired by differentiable programming and leveraging the ecosystem benefits of open-source software, we propose an equations system constructor using the computational graph representation, along with its JSON format netlist, to address these limitations. This representation allows for runtime dependencies between signals and subcircuit/device parameters. The proposed method streamlines the model development process and facilitates end-to-end computation of gradients of equations remainders with respect to parameters. This paper discusses in detail the overarching concept of hierarchical subcircuit/device decomposition and nested invocation by drawing parallels to functions in programming languages, and introduces rules for parameters passing and gradient propagation across hierarchical circuit modules. The presented numerical examples, including (1) an uncoupled CMOS model representation using "equivalent circuit decomposition+dynamic parameters" and (2) operational amplifier (OpAmp) auto device sizing, have demonstrated that the proposed method supports circuit simulation and design and particularly subcircuit modeling with improved efficiency, simplicity, and decoupling compared to existing techniques.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
Generalized Moving Least-Squares for Solving Vector-valued PDEs on Unknown Manifolds
Authors:
Rongji Li,
Qile Yan,
Shixiao W. Jiang
Abstract:
In this paper, we extend the Generalized Moving Least-Squares (GMLS) method in two different ways to solve the vector-valued PDEs on unknown smooth 2D manifolds without boundaries embedded in $\mathbb{R}^{3}$, identified with randomly sampled point cloud data. The two approaches are referred to as the intrinsic method and the extrinsic method. For the intrinsic method which relies on local approxi…
▽ More
In this paper, we extend the Generalized Moving Least-Squares (GMLS) method in two different ways to solve the vector-valued PDEs on unknown smooth 2D manifolds without boundaries embedded in $\mathbb{R}^{3}$, identified with randomly sampled point cloud data. The two approaches are referred to as the intrinsic method and the extrinsic method. For the intrinsic method which relies on local approximations of metric tensors, we simplify the formula of Laplacians and covariant derivatives acting on vector fields at the base point by calculating them in a local Monge coordinate system. On the other hand, the extrinsic method formulates tangential derivatives on a submanifold as the projection of the directional derivative in the ambient Euclidean space onto the tangent space of the submanifold. One challenge of this method is that the discretization of vector Laplacians yields a matrix whose size relies on the ambient dimension. To overcome this issue, we reduce the dimension of vector Laplacian matrices by employing an appropriate projection. The complexity of both methods scales well with the dimension of manifolds rather than the ambient dimension. We also present supporting numerical examples, including eigenvalue problems, linear Poisson equations, and nonlinear Burgers' equations, to examine the numerical accuracy of proposed methods on various smooth manifolds.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
The Nodal Sets of Solutions to Parabolic Equations
Authors:
Yiqi Huang,
Wenshuai Jiang
Abstract:
In this paper, we study the parabolic equations $\partial_t u=\partial_j\left(a^{ij}(x,t)\partial_iu\right)+b^j(x,t)\partial_ju+c(x,t)u$ in a domain of $\mathbb{R}^n$ under the condition that $a^{ij}$ are Lipschitz continuous. Consider the nodal set $Z_t=\{x: u(x,t)=0\}$ at a time $t$-slice. Simple examples show that the singular set $\mathcal{S}_t=\{x: u(x,t)=|\nabla_x u|(x,t)=0\}$ may coincide w…
▽ More
In this paper, we study the parabolic equations $\partial_t u=\partial_j\left(a^{ij}(x,t)\partial_iu\right)+b^j(x,t)\partial_ju+c(x,t)u$ in a domain of $\mathbb{R}^n$ under the condition that $a^{ij}$ are Lipschitz continuous. Consider the nodal set $Z_t=\{x: u(x,t)=0\}$ at a time $t$-slice. Simple examples show that the singular set $\mathcal{S}_t=\{x: u(x,t)=|\nabla_x u|(x,t)=0\}$ may coincide with nodal set. This makes the methods used in the study of nodal sets for elliptic equations fail, rendering the parabolic case much more complicated.
The current strongest results in the literature establish the finiteness of the $(n-1)$-dimensional Hausdorff measure of $Z_t$, assuming either $n=1$ by Angenent or that the coefficients are time-independent and analytic by Lin. With general coefficients, the codimension-one estimate was obtained under some doubling assumption by Han-Lin but only for space-time nodal sets. In the first part, we prove that $\mathcal{H}^{n-1}(Z_t) < \infty$ in full generality, i.e. for any dimension, with time-dependent coefficients and with merely Lipschitz regular leading coefficients $a^{ij}$.
In the second part, we study the evolutionary behavior of nodal sets. When $n=1$, it is proved by Angenent that the number of nodal points is non-increasing in time. For the $n$-dimensional case, we construct examples showing that measure monotonicity fails. In contrast, we prove dimension monotonicity, i.e., the Hausdorff dimension of the nodal set is non-increasing in time. This is the first monotonicity property for nodal sets in general dimensions. All the assumptions here are sharp.
△ Less
Submitted 9 June, 2024;
originally announced June 2024.
-
Projection-Free Variance Reduction Methods for Stochastic Constrained Multi-Level Compositional Optimization
Authors:
Wei Jiang,
Sifan Yang,
Wenhao Yang,
Yibo Wang,
Yuanyu Wan,
Lijun Zhang
Abstract:
This paper investigates projection-free algorithms for stochastic constrained multi-level optimization. In this context, the objective function is a nested composition of several smooth functions, and the decision set is closed and convex. Existing projection-free algorithms for solving this problem suffer from two limitations: 1) they solely focus on the gradient map** criterion and fail to mat…
▽ More
This paper investigates projection-free algorithms for stochastic constrained multi-level optimization. In this context, the objective function is a nested composition of several smooth functions, and the decision set is closed and convex. Existing projection-free algorithms for solving this problem suffer from two limitations: 1) they solely focus on the gradient map** criterion and fail to match the optimal sample complexities in unconstrained settings; 2) their analysis is exclusively applicable to non-convex functions, without considering convex and strongly convex objectives. To address these issues, we introduce novel projection-free variance reduction algorithms and analyze their complexities under different criteria. For gradient map**, our complexities improve existing results and match the optimal rates for unconstrained problems. For the widely-used Frank-Wolfe gap criterion, we provide theoretical guarantees that align with those for single-level problems. Additionally, by using a stage-wise adaptation, we further obtain complexities for convex and strongly convex functions. Finally, numerical experiments on different tasks demonstrate the effectiveness of our methods.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Adaptive Variance Reduction for Stochastic Optimization under Weaker Assumptions
Authors:
Wei Jiang,
Sifan Yang,
Yibo Wang,
Lijun Zhang
Abstract:
This paper explores adaptive variance reduction methods for stochastic optimization based on the STORM technique. Existing adaptive extensions of STORM rely on strong assumptions like bounded gradients and bounded function values, or suffer an additional $\mathcal{O}(\log T)$ term in the convergence rate. To address these limitations, we introduce a novel adaptive STORM method that achieves an opt…
▽ More
This paper explores adaptive variance reduction methods for stochastic optimization based on the STORM technique. Existing adaptive extensions of STORM rely on strong assumptions like bounded gradients and bounded function values, or suffer an additional $\mathcal{O}(\log T)$ term in the convergence rate. To address these limitations, we introduce a novel adaptive STORM method that achieves an optimal convergence rate of $\mathcal{O}(T^{-1/3})$ for non-convex functions with our newly designed learning rate strategy. Compared with existing approaches, our method requires weaker assumptions and attains the optimal convergence rate without the additional $\mathcal{O}(\log T)$ term. We also extend the proposed technique to stochastic compositional optimization, obtaining the same optimal rate of $\mathcal{O}(T^{-1/3})$. Furthermore, we investigate the non-convex finite-sum problem and develop another innovative adaptive variance reduction method that achieves an optimal convergence rate of $\mathcal{O}(n^{1/4} T^{-1/2} )$, where $n$ represents the number of component functions. Numerical experiments across various tasks validate the effectiveness of our method.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
Efficient Sign-Based Optimization: Accelerating Convergence via Variance Reduction
Authors:
Wei Jiang,
Sifan Yang,
Wenhao Yang,
Lijun Zhang
Abstract:
Sign stochastic gradient descent (signSGD) is a communication-efficient method that transmits only the sign of stochastic gradients for parameter updating. Existing literature has demonstrated that signSGD can achieve a convergence rate of $\mathcal{O}(d^{1/2}T^{-1/4})$, where $d$ represents the dimension and $T$ is the iteration number. In this paper, we improve this convergence rate to…
▽ More
Sign stochastic gradient descent (signSGD) is a communication-efficient method that transmits only the sign of stochastic gradients for parameter updating. Existing literature has demonstrated that signSGD can achieve a convergence rate of $\mathcal{O}(d^{1/2}T^{-1/4})$, where $d$ represents the dimension and $T$ is the iteration number. In this paper, we improve this convergence rate to $\mathcal{O}(d^{1/2}T^{-1/3})$ by introducing the Sign-based Stochastic Variance Reduction (SSVR) method, which employs variance reduction estimators to track gradients and leverages their signs to update. For finite-sum problems, our method can be further enhanced to achieve a convergence rate of $\mathcal{O}(m^{1/4}d^{1/2}T^{-1/2})$, where $m$ denotes the number of component functions. Furthermore, we investigate the heterogeneous majority vote in distributed settings and introduce two novel algorithms that attain improved convergence rates of $\mathcal{O}(d^{1/2}T^{-1/2} + dn^{-1/2})$ and $\mathcal{O}(d^{1/4}T^{-1/4})$ respectively, outperforming the previous results of $\mathcal{O}(dT^{-1/4} + dn^{-1/2})$ and $\mathcal{O}(d^{3/8}T^{-1/8})$, where $n$ represents the number of nodes. Numerical experiments across different tasks validate the effectiveness of our proposed methods.
△ Less
Submitted 1 June, 2024;
originally announced June 2024.
-
A bridge connecting convex analysis and complex analysis and $L^2$-estimate of $d$ and $\bar\partial$
Authors:
Fusheng Deng,
**** Hu,
Weiwen Jiang,
Xiangsen Qin
Abstract:
We propose a way to connect complex analysis and convex analysis. As applications, we derive some results about $L^2$-estimate for $d$-equation and prove some curvature positivity related to convex analysis from well known $L^2$-estimate for $\bar\partial$-equation or the results we prove in complex analysis.
We propose a way to connect complex analysis and convex analysis. As applications, we derive some results about $L^2$-estimate for $d$-equation and prove some curvature positivity related to convex analysis from well known $L^2$-estimate for $\bar\partial$-equation or the results we prove in complex analysis.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
Finite Diffeomorphism Theorem for manifolds with lower Ricci curvature and bounded energy
Authors:
Wenshuai Jiang,
Guofang Wei
Abstract:
In this paper we prove that the space $\cM(n,\rv,D,Λ):=\{(M^n,g) \text{ closed }: ~~\Ric\ge -(n-1),~\Vol(M)\ge \rv>0, \diam(M)\le D \text{ and } \int_{M}|\Rm|^{n/2}\le Λ\}$ has at most $C(n,\rv,D,Λ)$ many diffeomorphism types. This removes the upper Ricci curvature bound of Anderson-Cheeger's finite diffeomorphism theorem in \cite{AnCh}. Furthermore, if $M$ is Kähler surface, the Riemann curvature…
▽ More
In this paper we prove that the space $\cM(n,\rv,D,Λ):=\{(M^n,g) \text{ closed }: ~~\Ric\ge -(n-1),~\Vol(M)\ge \rv>0, \diam(M)\le D \text{ and } \int_{M}|\Rm|^{n/2}\le Λ\}$ has at most $C(n,\rv,D,Λ)$ many diffeomorphism types. This removes the upper Ricci curvature bound of Anderson-Cheeger's finite diffeomorphism theorem in \cite{AnCh}. Furthermore, if $M$ is Kähler surface, the Riemann curvature $L^2$ bound could be replaced by the scalar curvature $L^2$ bound.
△ Less
Submitted 12 May, 2024;
originally announced May 2024.
-
Stable BDF time discretization of BGN-based parametric finite element methods for geometric flows
Authors:
Wei Jiang,
Chunmei Su,
Ganghui Zhang
Abstract:
We propose a novel class of temporal high-order parametric finite element methods for solving a wide range of geometric flows of curves and surfaces. By incorporating the backward differentiation formulae (BDF) for time discretization into the BGN formulation, originally proposed by Barrett, Garcke, and Nürnberg (J. Comput. Phys., 222 (2007), pp.~441--467), we successfully develop high-order BGN/B…
▽ More
We propose a novel class of temporal high-order parametric finite element methods for solving a wide range of geometric flows of curves and surfaces. By incorporating the backward differentiation formulae (BDF) for time discretization into the BGN formulation, originally proposed by Barrett, Garcke, and Nürnberg (J. Comput. Phys., 222 (2007), pp.~441--467), we successfully develop high-order BGN/BDF$k$ schemes. The proposed BGN/BDF$k$ schemes not only retain almost all the advantages of the classical first-order BGN scheme such as computational efficiency and good mesh quality, but also exhibit the desired $k$th-order temporal accuracy in terms of shape metrics, ranging from second-order to fourth-order accuracy. Furthermore, we validate the performance of our proposed BGN/BDF$k$ schemes through extensive numerical examples, demonstrating their high-order temporal accuracy for various types of geometric flows while maintaining good mesh quality throughout the evolution.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
$\bar\partial$ Poincaré inequality and an improved $L^2$-estimate of $\bar\partial$ on bounded strictly pseudoconvex domains
Authors:
Fusheng Deng,
Weiwen Jiang,
Xiangsen Qin
Abstract:
We prove several inequalities related to the $\bar\partial$-operator on bounded domains in $\mathbb{C}^n$, which can be viewed as a $\bar\partial$-version of the classical Poincaré inequality and its various generalizations, and apply them to derive a generalization of Sobolev Inequality with Trace in $\mathbb{R}^n$. As applications to complex analysis, we get an integral form of Maximum Modulus P…
▽ More
We prove several inequalities related to the $\bar\partial$-operator on bounded domains in $\mathbb{C}^n$, which can be viewed as a $\bar\partial$-version of the classical Poincaré inequality and its various generalizations, and apply them to derive a generalization of Sobolev Inequality with Trace in $\mathbb{R}^n$. As applications to complex analysis, we get an integral form of Maximum Modulus Principle for holomorphic functions, and an improvement of Hörmander's $L^2$-estimate for $\bar\partial$ on bounded strictly pseudoconvex domains.
△ Less
Submitted 6 March, 2024; v1 submitted 28 January, 2024;
originally announced January 2024.
-
Fast Updating Truncated SVD for Representation Learning with Sparse Matrices
Authors:
Haoran Deng,
Yang Yang,
Jiahe Li,
Cheng Chen,
Weihao Jiang,
Shiliang Pu
Abstract:
Updating a truncated Singular Value Decomposition (SVD) is crucial in representation learning, especially when dealing with large-scale data matrices that continuously evolve in practical scenarios. Aligning SVD-based models with fast-paced updates becomes increasingly important. Existing methods for updating truncated SVDs employ Rayleigh-Ritz projection procedures, where projection matrices are…
▽ More
Updating a truncated Singular Value Decomposition (SVD) is crucial in representation learning, especially when dealing with large-scale data matrices that continuously evolve in practical scenarios. Aligning SVD-based models with fast-paced updates becomes increasingly important. Existing methods for updating truncated SVDs employ Rayleigh-Ritz projection procedures, where projection matrices are augmented based on original singular vectors. However, these methods suffer from inefficiency due to the densification of the update matrix and the application of the projection to all singular vectors. To address these limitations, we introduce a novel method for dynamically approximating the truncated SVD of a sparse and temporally evolving matrix. Our approach leverages sparsity in the orthogonalization process of augmented matrices and utilizes an extended decomposition to independently store projections in the column space of singular vectors. Numerical experiments demonstrate a remarkable efficiency improvement of an order of magnitude compared to previous methods. Remarkably, this improvement is achieved while maintaining a comparable precision to existing approaches.
△ Less
Submitted 17 January, 2024;
originally announced January 2024.
-
Discretized Distributed Optimization over Dynamic Digraphs
Authors:
Mohammadreza Doostmohammadian,
Wei Jiang,
Muwahida Liaquat,
Alireza Aghasi,
Houman Zarrabi
Abstract:
We consider a discrete-time model of continuous-time distributed optimization over dynamic directed-graphs (digraphs) with applications to distributed learning. Our optimization algorithm works over general strongly connected dynamic networks under switching topologies, e.g., in mobile multi-agent systems and volatile networks due to link failures. Compared to many existing lines of work, there is…
▽ More
We consider a discrete-time model of continuous-time distributed optimization over dynamic directed-graphs (digraphs) with applications to distributed learning. Our optimization algorithm works over general strongly connected dynamic networks under switching topologies, e.g., in mobile multi-agent systems and volatile networks due to link failures. Compared to many existing lines of work, there is no need for bi-stochastic weight designs on the links. The existing literature mostly needs the link weights to be stochastic using specific weight-design algorithms needed both at the initialization and at all times when the topology of the network changes. This paper eliminates the need for such algorithms and paves the way for distributed optimization over time-varying digraphs. We derive the bound on the gradient-tracking step-size and discrete time-step for convergence and prove dynamic stability using arguments from consensus algorithms, matrix perturbation theory, and Lyapunov theory. This work, particularly, is an improvement over existing stochastic-weight undirected networks in case of link removal or packet drops. This is because the existing literature may need to rerun time-consuming and computationally complex algorithms for stochastic design, while the proposed strategy works as long as the underlying network is weight-symmetric and balanced. The proposed optimization framework finds applications to distributed classification and learning.
△ Less
Submitted 26 March, 2024; v1 submitted 14 November, 2023;
originally announced November 2023.
-
An operator-splitting optimization approach for phase-field simulation of equilibrium shapes of crystals
Authors:
Zeyu Zhou,
Wen Huang,
Wei Jiang,
Zhen Zhang
Abstract:
Computing equilibrium shapes of crystals (ESC) is a challenging problem in materials science that involves minimizing an orientation-dependent (i.e., anisotropic) surface energy functional subject to a prescribed mass constraint. The highly nonlinear and singular anisotropic terms in the problem make it very challenging from both the analytical and numerical aspects. Especially, when the strength…
▽ More
Computing equilibrium shapes of crystals (ESC) is a challenging problem in materials science that involves minimizing an orientation-dependent (i.e., anisotropic) surface energy functional subject to a prescribed mass constraint. The highly nonlinear and singular anisotropic terms in the problem make it very challenging from both the analytical and numerical aspects. Especially, when the strength of anisotropy is very strong (i.e., strongly anisotropic cases), the ESC will form some singular, sharp corners even if the surface energy function is smooth. Traditional numerical approaches, such as the $H^{-1}$ gradient flow, are unable to produce true sharp corners due to the necessary addition of a high-order regularization term that penalizes sharp corners and rounds them off. In this paper, we propose a new numerical method based on the Davis-Yin splitting (DYS) optimization algorithm to predict the ESC instead of using gradient flow approaches. We discretize the infinite-dimensional phase-field energy functional in the absence of regularization terms and transform it into a finite-dimensional constraint minimization problem. The resulting optimization problem is solved using the DYS method which automatically guarantees the mass-conservation and bound-preserving properties. We also prove the global convergence of the proposed algorithm. These desired properties are numerically observed. In particular, the proposed method can produce real sharp corners with satisfactory accuracy. Finally, we present numerous numerical results to demonstrate that the ESC can be well simulated under different types of anisotropic surface energies, which also confirms the effectiveness and efficiency of the proposed method.
△ Less
Submitted 6 November, 2023;
originally announced November 2023.
-
General mean-field BSDEs with diagonally quadratic generators in multi-dimension
Authors:
Weimin Jiang,
Juan Li,
Qingmeng Wei
Abstract:
The purpose of this paper is to investigate general mean-field backward stochastic differential equations (MFBSDEs) in multi-dimension with diagonally quadratic generators $f(ω,t,y,z,μ)$, that is, the coefficients depend not only on the solution processes $(Y,Z)$, but also on their law $\mathbb{P}_{(Y,Z)}$, as well as have a diagonally quadratic growth in $Z$ and super-linear growth (or even a qua…
▽ More
The purpose of this paper is to investigate general mean-field backward stochastic differential equations (MFBSDEs) in multi-dimension with diagonally quadratic generators $f(ω,t,y,z,μ)$, that is, the coefficients depend not only on the solution processes $(Y,Z)$, but also on their law $\mathbb{P}_{(Y,Z)}$, as well as have a diagonally quadratic growth in $Z$ and super-linear growth (or even a quadratic growth) in the law of $Z$ which is totally new. We start by establishing through a fixed point theorem the existence and the uniqueness of local solutions in the ``Markovian case'' $f(t,Y_{t},Z_{t},\mathbb{P}_{(Y_{t},Z_{t})})$ when the terminal value is bounded. Afterwards, global solutions are constructed by stitching local solutions. Finally, employing the $θ$-method, we explore the existence and the uniqueness of global solutions for diagonally quadratic mean-field BSDEs with convex generators, even in the case of unbounded terminal values that have exponential moments of all orders. These results are extended to a Volterra-type case where the coefficients can even be of quadratic growth with respect to the law of $Z$.
△ Less
Submitted 23 October, 2023;
originally announced October 2023.
-
Integrated Sensing and Communication enabled Sensing Base Station: System Design, Beamforming, Interference Cancellation and Performance Analysis
Authors:
Wangjun Jiang,
Zhiqing Wei,
Zhiyong Feng,
Xu Chen
Abstract:
This paper studies the sensing base station (SBS) that has great potential to improve the safety of vehicles and pedestrians on roads. It can detect the targets on the road with communication signals using the integrated sensing and communication (ISAC) technique. Compared with vehicle-mounted radar, SBS has a better sensing field due to its higher deployment position, which can help solve the pro…
▽ More
This paper studies the sensing base station (SBS) that has great potential to improve the safety of vehicles and pedestrians on roads. It can detect the targets on the road with communication signals using the integrated sensing and communication (ISAC) technique. Compared with vehicle-mounted radar, SBS has a better sensing field due to its higher deployment position, which can help solve the problem of sensing blind areas. In this paper, key technologies of SBS are studied, including the beamforming algorithm, beam scanning scheme, and interference cancellation algorithm. To transmit and receive ISAC signals simultaneously, a double-coupling antenna array is applied. The free detection beam and directional communication beam are proposed for joint communication and sensing to meet the requirements of beamwidth and pointing directions. The joint time-space-frequency domain division multiple access algorithm is proposed to cancel the interference of SBS, including multiuser interference and duplex interference between sensing and communication. Finally, the sensing and communication performance of SBS under the industrial scientific medical power limitation is analyzed and simulated. Simulation results show that the communication rate of SBS can reach over 100 Mbps and the range of sensing and communication can reach about 500 m.
△ Less
Submitted 11 October, 2023;
originally announced October 2023.
-
A second-order in time, BGN-based parametric finite element method for geometric flows of curves
Authors:
Wei Jiang,
Chunmei Su,
Ganghui Zhang
Abstract:
Over the last two decades, the field of geometric curve evolutions has attracted significant attention from scientific computing. One of the most popular numerical methods for solving geometric flows is the so-called BGN scheme, which was proposed by Barrett, Garcke, and Nürnberg (J. Comput. Phys., 222 (2007), pp.~441--467), due to its favorable properties (e.g., its computational efficiency and t…
▽ More
Over the last two decades, the field of geometric curve evolutions has attracted significant attention from scientific computing. One of the most popular numerical methods for solving geometric flows is the so-called BGN scheme, which was proposed by Barrett, Garcke, and Nürnberg (J. Comput. Phys., 222 (2007), pp.~441--467), due to its favorable properties (e.g., its computational efficiency and the good mesh property). However, the BGN scheme is limited to first-order accuracy in time, and how to develop a higher-order numerical scheme is challenging. In this paper, we propose a fully discrete, temporal second-order parametric finite element method, which integrates with two different mesh regularization techniques, for solving geometric flows of curves. The scheme is constructed based on the BGN formulation and a semi-implicit Crank-Nicolson leap-frog time step** discretization as well as a linear finite element approximation in space. More importantly, we point out that the shape metrics, such as manifold distance and Hausdorff distance, instead of function norms, should be employed to measure numerical errors. Extensive numerical experiments demonstrate that the proposed BGN-based scheme is second-order accurate in time in terms of shape metrics. Moreover, by employing the classical BGN scheme as mesh regularization techniques, our proposed second-order schemes exhibit good properties with respect to the mesh distribution. In addition, an unconditional interlaced energy stability property is obtained for one of the mesh regularization techniques.
△ Less
Submitted 20 June, 2024; v1 submitted 22 September, 2023;
originally announced September 2023.
-
Volume Estimates for Singular sets and Critical Sets of Elliptic Equations with Hölder Coefficients
Authors:
Yiqi Huang,
Wenshuai Jiang
Abstract:
Consider the solutions $u$ to the elliptic equation $\mathcal{L}(u) = \partial_i(a^{ij}(x) \partial_j u) + b^i(x) \partial_i u + c(x) u= 0$ with $a^{ij}$ assumed only to be Hölder continuous. In this paper we prove an explicit bound for $(n-2)$-dimensional Minkowski estimates of singular set $\mathcal{S}(u) = \{ x \in B_1 : u(x) = |\nabla u(x)| = 0\}$ and critical set…
▽ More
Consider the solutions $u$ to the elliptic equation $\mathcal{L}(u) = \partial_i(a^{ij}(x) \partial_j u) + b^i(x) \partial_i u + c(x) u= 0$ with $a^{ij}$ assumed only to be Hölder continuous. In this paper we prove an explicit bound for $(n-2)$-dimensional Minkowski estimates of singular set $\mathcal{S}(u) = \{ x \in B_1 : u(x) = |\nabla u(x)| = 0\}$ and critical set $\mathcal{C}(u) \equiv \{ x\in B_{1} : |\nabla u(x)| = 0 \}$ in terms of the bound on doubling index, depending on $c \equiv 0$ or not. Here the Hölder assumption is sharp as it is the weakest condition in order to define the critical set of $u$ according to elliptic estimates. We can also obtain an optimal improvement on Cheeger-Naber-Valtorta's volume estimates on each quantitative stratum $\mathcal{S}^k_{η, r}$. The main difficulty in this situation is the lack of monotonicity formula which is essential to the quantitative stratification. In our proof, one key ingredient is a new almost monotonicity formula for doubling index under the Hölder assumption. Another key ingredient is the quantitative uniqueness of tangent maps. It deserves to note that our almost monotonicity is sufficient to address all the difficulties arising from the absence of monotonicity in the analysis of differential equations. We believe the idea could be applied to other relevant study.
△ Less
Submitted 14 September, 2023;
originally announced September 2023.
-
Distributed Optimization via Gradient Descent with Event-Triggered Zooming over Quantized Communication
Authors:
Apostolos I. Rikos,
Wei Jiang,
Themistoklis Charalambous,
Karl H. Johansson
Abstract:
In this paper, we study unconstrained distributed optimization strongly convex problems, in which the exchange of information in the network is captured by a directed graph topology over digital channels that have limited capacity (and hence information should be quantized). Distributed methods in which nodes use quantized communication yield a solution at the proximity of the optimal solution, he…
▽ More
In this paper, we study unconstrained distributed optimization strongly convex problems, in which the exchange of information in the network is captured by a directed graph topology over digital channels that have limited capacity (and hence information should be quantized). Distributed methods in which nodes use quantized communication yield a solution at the proximity of the optimal solution, hence reaching an error floor that depends on the quantization level used; the finer the quantization the lower the error floor. However, it is not possible to determine in advance the optimal quantization level that ensures specific performance guarantees (such as achieving an error floor below a predefined threshold). Choosing a very small quantization level that would guarantee the desired performance, requires {information} packets of very large size, which is not desirable (could increase the probability of packet losses, increase delays, etc) and often not feasible due to the limited capacity of the channels available. In order to obtain a communication-efficient distributed solution and a sufficiently close proximity to the optimal solution, we propose a quantized distributed optimization algorithm that converges in a finite number of steps and is able to adjust the quantization level accordingly. The proposed solution uses a finite-time distributed optimization protocol to find a solution to the problem for a given quantization level in a finite number of steps and keeps refining the quantization level until the difference in the solution between two successive solutions with different quantization levels is below a certain pre-specified threshold.
△ Less
Submitted 8 September, 2023;
originally announced September 2023.
-
Asynchronous Distributed Optimization via ADMM with Efficient Communication
Authors:
Apostolos I. Rikos,
Wei Jiang,
Themistoklis Charalambous,
Karl H. Johansson
Abstract:
In this paper, we focus on an asynchronous distributed optimization problem. In our problem, each node is endowed with a convex local cost function, and is able to communicate with its neighbors over a directed communication network. Furthermore, we assume that the communication channels between nodes have limited bandwidth, and each node suffers from processing delays. We present a distributed al…
▽ More
In this paper, we focus on an asynchronous distributed optimization problem. In our problem, each node is endowed with a convex local cost function, and is able to communicate with its neighbors over a directed communication network. Furthermore, we assume that the communication channels between nodes have limited bandwidth, and each node suffers from processing delays. We present a distributed algorithm which combines the Alternating Direction Method of Multipliers (ADMM) strategy with a finite time quantized averaging algorithm. In our proposed algorithm, nodes exchange quantized valued messages and operate in an asynchronous fashion. More specifically, during every iteration of our algorithm each node (i) solves a local convex optimization problem (for the one of its primal variables), and (ii) utilizes a finite-time quantized averaging algorithm to obtain the value of the second primal variable (since the cost function for the second primal variable is not decomposable). We show that our algorithm converges to the optimal solution at a rate of $O(1/k)$ (where $k$ is the number of time steps) for the case where the local cost function of every node is convex and not-necessarily differentiable. Finally, we demonstrate the operational advantages of our algorithm against other algorithms from the literature.
△ Less
Submitted 8 September, 2023;
originally announced September 2023.
-
Generalized Finite Difference Method on unknown manifolds
Authors:
Shixiao W. Jiang,
Rongji Li,
Qile Yan,
John Harlim
Abstract:
In this paper, we extend the Generalized Finite Difference Method (GFDM) on unknown compact submanifolds of the Euclidean domain, identified by randomly sampled data that (almost surely) lie on the interior of the manifolds. Theoretically, we formalize GFDM by exploiting a representation of smooth functions on the manifolds with Taylor's expansions of polynomials defined on the tangent bundles. We…
▽ More
In this paper, we extend the Generalized Finite Difference Method (GFDM) on unknown compact submanifolds of the Euclidean domain, identified by randomly sampled data that (almost surely) lie on the interior of the manifolds. Theoretically, we formalize GFDM by exploiting a representation of smooth functions on the manifolds with Taylor's expansions of polynomials defined on the tangent bundles. We illustrate the approach by approximating the Laplace-Beltrami operator, where a stable approximation is achieved by a combination of Generalized Moving Least-Squares algorithm and novel linear programming that relaxes the diagonal-dominant constraint for the estimator to allow for a feasible solution even when higher-order polynomials are employed. We establish the theoretical convergence of GFDM in solving Poisson PDEs and numerically demonstrate the accuracy on simple smooth manifolds of low and moderate high co-dimensions as well as unknown 2D surfaces. For the Dirichlet Poisson problem where no data points on the boundaries are available, we employ GFDM with the volume-constraint approach that imposes the boundary conditions on data points close to the boundary. When the location of the boundary is unknown, we introduce a novel technique to detect points close to the boundary without needing to estimate the distance of the sampled data points to the boundary. We demonstrate the effectiveness of the volume-constraint employed by imposing the boundary conditions on the data points detected by this new technique compared to imposing the boundary conditions on all points within a certain distance from the boundary, where the latter is sensitive to the choice of truncation distance and require the knowledge of the boundary location.
△ Less
Submitted 14 July, 2023;
originally announced July 2023.
-
Learning Unnormalized Statistical Models via Compositional Optimization
Authors:
Wei Jiang,
Jiayu Qin,
Lingyu Wu,
Changyou Chen,
Tianbao Yang,
Lijun Zhang
Abstract:
Learning unnormalized statistical models (e.g., energy-based models) is computationally challenging due to the complexity of handling the partition function. To eschew this complexity, noise-contrastive estimation~(NCE) has been proposed by formulating the objective as the logistic loss of the real data and the artificial noise. However, as found in previous works, NCE may perform poorly in many t…
▽ More
Learning unnormalized statistical models (e.g., energy-based models) is computationally challenging due to the complexity of handling the partition function. To eschew this complexity, noise-contrastive estimation~(NCE) has been proposed by formulating the objective as the logistic loss of the real data and the artificial noise. However, as found in previous works, NCE may perform poorly in many tasks due to its flat loss landscape and slow convergence. In this paper, we study it a direct approach for optimizing the negative log-likelihood of unnormalized models from the perspective of compositional optimization. To tackle the partition function, a noise distribution is introduced such that the log partition function can be written as a compositional function whose inner function can be estimated with stochastic samples. Hence, the objective can be optimized by stochastic compositional optimization algorithms. Despite being a simple method, we demonstrate that it is more favorable than NCE by (1) establishing a fast convergence rate and quantifying its dependence on the noise distribution through the variance of stochastic estimators; (2) develo** better results for one-dimensional Gaussian mean estimation by showing our objective has a much favorable loss landscape and hence our method enjoys faster convergence; (3) demonstrating better performance on multiple applications, including density estimation, out-of-distribution detection, and real image generation.
△ Less
Submitted 12 June, 2023;
originally announced June 2023.
-
A Privacy-Preserving Finite-Time Push-Sum based Gradient Method for Distributed Optimization over Digraphs
Authors:
Xiaomeng Chen,
Wei Jiang,
Themistoklis Charalambous,
Ling Shi
Abstract:
This paper addresses the problem of distributed optimization, where a network of agents represented as a directed graph (digraph) aims to collaboratively minimize the sum of their individual cost functions. Existing approaches for distributed optimization over digraphs, such as Push-Pull, require agents to exchange explicit state values with their neighbors in order to reach an optimal solution. H…
▽ More
This paper addresses the problem of distributed optimization, where a network of agents represented as a directed graph (digraph) aims to collaboratively minimize the sum of their individual cost functions. Existing approaches for distributed optimization over digraphs, such as Push-Pull, require agents to exchange explicit state values with their neighbors in order to reach an optimal solution. However, this can result in the disclosure of sensitive and private information. To overcome this issue, we propose a state-decomposition-based privacy-preserving finite-time push-sum (PrFTPS) algorithm without any global information, such as network size or graph diameter. Then, based on PrFTPS, we design a gradient descent algorithm (PrFTPS-GD) to solve the distributed optimization problem. It is proved that under PrFTPS-GD, the privacy of each agent is preserved and the linear convergence rate related to the optimization iteration number is achieved. Finally, numerical simulations are provided to illustrate the effectiveness of the proposed approach.
△ Less
Submitted 5 July, 2023; v1 submitted 24 May, 2023;
originally announced May 2023.
-
Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management
Authors:
Yuandong Ding,
Mingxiao Feng,
Guozi Liu,
Wei Jiang,
Chuheng Zhang,
Li Zhao,
Lei Song,
Houqiang Li,
Yan **,
Jiang Bian
Abstract:
In this paper, we consider the inventory management (IM) problem where we need to make replenishment decisions for a large number of stock kee** units (SKUs) to balance their supply and demand. In our setting, the constraint on the shared resources (such as the inventory capacity) couples the otherwise independent control for each SKU. We formulate the problem with this structure as Shared-Resou…
▽ More
In this paper, we consider the inventory management (IM) problem where we need to make replenishment decisions for a large number of stock kee** units (SKUs) to balance their supply and demand. In our setting, the constraint on the shared resources (such as the inventory capacity) couples the otherwise independent control for each SKU. We formulate the problem with this structure as Shared-Resource Stochastic Game (SRSG)and propose an efficient algorithm called Context-aware Decentralized PPO (CD-PPO). Through extensive experiments, we demonstrate that CD-PPO can accelerate the learning procedure compared with standard MARL algorithms.
△ Less
Submitted 17 December, 2022; v1 submitted 15 December, 2022;
originally announced December 2022.
-
Open Source Implementations of Numerical Algorithms for Computing the Complete Elliptic Integral of the First Kind
Authors:
Hong-Yan Zhang,
Wen-Juan Jiang
Abstract:
The complete elliptic integral of the first kind (CEI-1) plays in a significant role in mathematics, physics and engineering. There is no simple formula for its computation, thus numerical algorithms are essential for co** with practical problems involved. The commercial implementations for the numerical solutions, such as the functions ellipticK and EllipticK provided by MATLAB and Mathematica…
▽ More
The complete elliptic integral of the first kind (CEI-1) plays in a significant role in mathematics, physics and engineering. There is no simple formula for its computation, thus numerical algorithms are essential for co** with practical problems involved. The commercial implementations for the numerical solutions, such as the functions ellipticK and EllipticK provided by MATLAB and Mathematica respectively, are based on $\mathcal{K}_{\mathrm{cs}}(m)$ instead of the usual form $K(k)$ such that $m = k^2$ and $\mathcal{K}_{\mathrm{cs}}(k^2) = K(k)$. It is necessary to develop open source implementations for the computation of the CEI-1 in order to avoid potential risks of using commercial software and possible limitations due to the unknown factors. In this paper, the infinite series method, arithmetic-geometric mean (AGM) method, Gauss-Chebyshev method and Gauss-Legendre methods are discussed in details with a top-down strategy. The four key algorithms for computing CEI-1 are designed, verified, validated and tested, which can be utilized in R\& D and be reused properly. Numerical results show that our open source implementations based on $K(k)$ are equivalent to the commercial implementation based on $\mathcal{K}_{\mathrm{cs}}(m)$. The general algorithms for computing orthogonal polynomials developed are significant byproducts in the sense of STEM education and scientific computation.
△ Less
Submitted 16 May, 2024; v1 submitted 11 December, 2022;
originally announced December 2022.
-
General multi-fidelity surrogate models: Framework and active learning strategies for efficient rare event simulation
Authors:
Promit Chakroborty,
Somayajulu L. N. Dhulipala,
Yifeng Che,
Wen Jiang,
Benjamin W. Spencer,
Jason D. Hales,
Michael D. Shields
Abstract:
Estimating the probability of failure for complex real-world systems using high-fidelity computational models is often prohibitively expensive, especially when the probability is small. Exploiting low-fidelity models can make this process more feasible, but merging information from multiple low-fidelity and high-fidelity models poses several challenges. This paper presents a robust multi-fidelity…
▽ More
Estimating the probability of failure for complex real-world systems using high-fidelity computational models is often prohibitively expensive, especially when the probability is small. Exploiting low-fidelity models can make this process more feasible, but merging information from multiple low-fidelity and high-fidelity models poses several challenges. This paper presents a robust multi-fidelity surrogate modeling strategy in which the multi-fidelity surrogate is assembled using an active learning strategy using an on-the-fly model adequacy assessment set within a subset simulation framework for efficient reliability analysis. The multi-fidelity surrogate is assembled by first applying a Gaussian process correction to each low-fidelity model and assigning a model probability based on the model's local predictive accuracy and cost. Three strategies are proposed to fuse these individual surrogates into an overall surrogate model based on model averaging and deterministic/stochastic model selection. The strategies also dictate which model evaluations are necessary. No assumptions are made about the relationships between low-fidelity models, while the high-fidelity model is assumed to be the most accurate and most computationally expensive model. Through two analytical and two numerical case studies, including a case study evaluating the failure probability of Tristructural isotropic-coated (TRISO) nuclear fuels, the algorithm is shown to be highly accurate while drastically reducing the number of high-fidelity model calls (and hence computational cost).
△ Less
Submitted 6 December, 2022;
originally announced December 2022.
-
Distributed Optimization with Quantized Gradient Descent
Authors:
Apostolos I. Rikos,
Wei Jiang,
Themistoklis Charalambous,
Karl H. Johansson
Abstract:
In this paper, we consider the unconstrained distributed optimization problem, in which the exchange of information in the network is captured by a directed graph topology, thus, nodes can only communicate with their neighbors. Additionally, in our problem, the communication channels among the nodes have limited bandwidth. In order to alleviate this limitation, quantized messages should be exchang…
▽ More
In this paper, we consider the unconstrained distributed optimization problem, in which the exchange of information in the network is captured by a directed graph topology, thus, nodes can only communicate with their neighbors. Additionally, in our problem, the communication channels among the nodes have limited bandwidth. In order to alleviate this limitation, quantized messages should be exchanged among the nodes. For solving this distributed optimization problem, we combine a gradient descent method with a distributed quantized consensus algorithm (which requires the nodes to exchange quantized messages and converges in a finite number of steps). Specifically, at every optimization step, each node (i) performs a gradient descent step (i.e., subtracts the scaled gradient from its current estimate), and (ii) performs a finite-time calculation of the quantized average of every node's estimate in the network. As a consequence, this algorithm approximately mimics the centralized gradient descent algorithm. We show that our algorithm asymptotically converges to a neighborhood of the optimal solution with linear convergence rate. The performance of the proposed algorithm is demonstrated via simple illustrative examples.
△ Less
Submitted 5 December, 2023; v1 submitted 19 November, 2022;
originally announced November 2022.
-
A sturcture-preserving, upwind-SAV scheme for the degenerate Cahn--Hilliard equation with applications to simulating surface diffusion
Authors:
Qiong-Ao Huang,
Wei Jiang,
Jerry Zhijian Yang,
Cheng Yuan
Abstract:
This paper establishes a structure-preserving numerical scheme for the Cahn--Hilliard equation with degenerate mobility. First, by applying a finite volume method with upwind numerical fluxes to the degenerate Cahn--Hilliard equation rewritten by the scalar auxiliary variable (SAV) approach, we creatively obtain an unconditionally bound-preserving, energy-stable and fully-discrete scheme, which, f…
▽ More
This paper establishes a structure-preserving numerical scheme for the Cahn--Hilliard equation with degenerate mobility. First, by applying a finite volume method with upwind numerical fluxes to the degenerate Cahn--Hilliard equation rewritten by the scalar auxiliary variable (SAV) approach, we creatively obtain an unconditionally bound-preserving, energy-stable and fully-discrete scheme, which, for the first time, addresses the boundedness of the classical SAV approach under $H^{-1}$-gradient flow. Then, a dimensional-splitting technique is introduced in high-dimensional cases, which greatly reduces the computational complexity while preserves original structural properties. Numerical experiments are presented to verify the bound-preserving and energy-stable properties of the proposed scheme. Finally, by applying the proposed structure-preserving scheme, we numerically demonstrate that surface diffusion can be approximated by the Cahn--Hilliard equation with degenerate mobility and Flory--Huggins potential when the absolute temperature is sufficiently low, which agrees well with the theoretical result by using formal asymptotic analysis.wn theoretically by formal matched asymptotics.
△ Less
Submitted 28 February, 2023; v1 submitted 28 October, 2022;
originally announced October 2022.
-
DTAC-ADMM: Delay-Tolerant Augmented Consensus ADMM-based Algorithm for Distributed Resource Allocation
Authors:
Mohammadreza Doostmohammadian,
Wei Jiang,
Themistoklis Charalambous
Abstract:
Latency is inherent in almost all real-world networked applications. In this paper, we propose a distributed allocation strategy over multi-agent networks with delayed communications. The state of each agent (or node) represents its share of assigned resources out of a fixed amount (equal to overall demand). Every node locally updates its state toward optimizing a global allocation cost function v…
▽ More
Latency is inherent in almost all real-world networked applications. In this paper, we propose a distributed allocation strategy over multi-agent networks with delayed communications. The state of each agent (or node) represents its share of assigned resources out of a fixed amount (equal to overall demand). Every node locally updates its state toward optimizing a global allocation cost function via received information of its neighbouring nodes even when the data exchange over the network is heterogeneously delayed at different links. The update is based on the alternating direction method of multipliers (ADMM) formulation subject to both sum-preserving coupling-constraint and local box-constraints. The solution is derivative-free and holds for general (not necessarily differentiable) convex cost models. We use the notion of augmented consensus over undirected networks to model delayed information exchange and for convergence analysis. We simulate our \textit{delay-tolerant} algorithm for
△ Less
Submitted 30 August, 2022;
originally announced August 2022.
-
Radial basis approximation of tensor fields on manifolds: From operator estimation to manifold learning
Authors:
John Harlim,
Shixiao Willing Jiang,
John Wilson Peoples
Abstract:
In this paper, we study the Radial Basis Function (RBF) approximation to differential operators on smooth tensor fields defined on closed Riemannian submanifolds of Euclidean space, identified by randomly sampled point cloud data. {The formulation in this paper leverages a fundamental fact that the covariant derivative on a submanifold is the projection of the directional derivative in the ambient…
▽ More
In this paper, we study the Radial Basis Function (RBF) approximation to differential operators on smooth tensor fields defined on closed Riemannian submanifolds of Euclidean space, identified by randomly sampled point cloud data. {The formulation in this paper leverages a fundamental fact that the covariant derivative on a submanifold is the projection of the directional derivative in the ambient Euclidean space onto the tangent space of the submanifold. To differentiate a test function (or vector field) on the submanifold with respect to the Euclidean metric, the RBF interpolation is applied to extend the function (or vector field) in the ambient Euclidean space. When the manifolds are unknown, we develop an improved second-order local SVD technique for estimating local tangent spaces on the manifold. When the classical pointwise non-symmetric RBF formulation is used to solve Laplacian eigenvalue problems, we found that while accurate estimation of the leading spectra can be obtained with large enough data, such an approximation often produces irrelevant complex-valued spectra (or pollution) as the true spectra are real-valued and positive. To avoid such an issue,} we introduce a symmetric RBF discrete approximation of the Laplacians induced by a weak formulation on appropriate Hilbert spaces. Unlike the non-symmetric approximation, this formulation guarantees non-negative real-valued spectra and the orthogonality of the eigenvectors. Theoretically, we establish the convergence of the eigenpairs of both the Laplace-Beltrami operator and Bochner Laplacian {for the symmetric formulation} in the limit of large data with convergence rates. Numerically, we provide supporting examples for approximations of the Laplace-Beltrami operator and various vector Laplacians, including the Bochner, Hodge, and Lichnerowicz Laplacians.
△ Less
Submitted 22 November, 2023; v1 submitted 17 August, 2022;
originally announced August 2022.
-
A regularized model for wetting/dewetting problems: asymptotic analysis and $Γ$-convergence
Authors:
Wei Jiang,
Zhen Zhang,
Zeyu Zhou
Abstract:
By introducing height dependency in the surface energy density, we propose a novel regularized variational model to simulate wetting/dewetting problems. The regularized model leads to the appearance of a precursor layer which covers the bare substrate, with the precursor height depending on the regularization parameter $\varepsilon$. The new model enjoys lots of advantages in analysis and imulatio…
▽ More
By introducing height dependency in the surface energy density, we propose a novel regularized variational model to simulate wetting/dewetting problems. The regularized model leads to the appearance of a precursor layer which covers the bare substrate, with the precursor height depending on the regularization parameter $\varepsilon$. The new model enjoys lots of advantages in analysis and imulations. With the help of the precursor layer, the regularized model is naturally extended to a larger domain than that of the classical sharp-interface model, and thus can be solved in a fixed domain. There is no need to explicitly track the contact line motion, and difficulties arising from free boundary problems can be avoided. In addition, topological change events can be automatically captured. Under some mild and physically meaningful conditions, we show the positivity-preserving property of the minimizers of the new model. By using asymptotic analysis and $Γ$-convergence, we investigate the convergence relations between the new regularized model and the classical sharp-interface model. Finally, numerical results are provided to validate our theoretical analysis, as well as the accuracy and efficiency of the new regularized model.
△ Less
Submitted 17 August, 2022;
originally announced August 2022.
-
A convexity-preserving and perimeter-decreasing parametric finite element method for the area-preserving curve shortening flow
Authors:
Wei Jiang,
Chunmei Su,
Ganghui Zhang
Abstract:
We propose and analyze a semi-discrete parametric finite element scheme for solving the area-preserving curve shortening flow. The scheme is based on Dziuk's approach (SIAM J. Numer. Anal. 36(6): 1808-1830, 1999) for the anisotropic curve shortening flow. We prove that the scheme preserves two fundamental geometric structures of the flow with an initially convex curve: (i) the convexity-preserving…
▽ More
We propose and analyze a semi-discrete parametric finite element scheme for solving the area-preserving curve shortening flow. The scheme is based on Dziuk's approach (SIAM J. Numer. Anal. 36(6): 1808-1830, 1999) for the anisotropic curve shortening flow. We prove that the scheme preserves two fundamental geometric structures of the flow with an initially convex curve: (i) the convexity-preserving property, and (ii) the perimeter-decreasing property. To the best of our knowledge, the convexity-preserving property of numerical schemes which approximate the flow is rigorously proved for the first time. Furthermore, the error estimate of the semi-discrete scheme is established, and numerical results are provided to demonstrate the structure-preserving properties as well as the accuracy of the scheme.
△ Less
Submitted 11 January, 2023; v1 submitted 2 August, 2022;
originally announced August 2022.
-
Multi-block-Single-probe Variance Reduced Estimator for Coupled Compositional Optimization
Authors:
Wei Jiang,
Gang Li,
Yibo Wang,
Lijun Zhang,
Tianbao Yang
Abstract:
Variance reduction techniques such as SPIDER/SARAH/STORM have been extensively studied to improve the convergence rates of stochastic non-convex optimization, which usually maintain and update a sequence of estimators for a single function across iterations. What if we need to track multiple functional map**s across iterations but only with access to stochastic samples of $\mathcal{O}(1)$ functi…
▽ More
Variance reduction techniques such as SPIDER/SARAH/STORM have been extensively studied to improve the convergence rates of stochastic non-convex optimization, which usually maintain and update a sequence of estimators for a single function across iterations. What if we need to track multiple functional map**s across iterations but only with access to stochastic samples of $\mathcal{O}(1)$ functional map**s at each iteration? There is an important application in solving an emerging family of coupled compositional optimization problems in the form of $\sum_{i=1}^m f_i(g_i(\mathbf{w}))$, where $g_i$ is accessible through a stochastic oracle. The key issue is to track and estimate a sequence of $\mathbf g(\mathbf{w})=(g_1(\mathbf{w}), \ldots, g_m(\mathbf{w}))$ across iterations, where $\mathbf g(\mathbf{w})$ has $m$ blocks and it is only allowed to probe $\mathcal{O}(1)$ blocks to attain their stochastic values and Jacobians. To improve the complexity for solving these problems, we propose a novel stochastic method named Multi-block-Single-probe Variance Reduced (MSVR) estimator to track the sequence of $\mathbf g(\mathbf{w})$. It is inspired by STORM but introduces a customized error correction term to alleviate the noise not only in stochastic samples for the selected blocks but also in those blocks that are not sampled. With the help of the MSVR estimator, we develop several algorithms for solving the aforementioned compositional problems with improved complexities across a spectrum of settings with non-convex/convex/strongly convex/Polyak-Łojasiewicz (PL) objectives. Our results improve upon prior ones in several aspects, including the order of sample complexities and dependence on the strong convexity parameter. Empirical studies on multi-task deep AUC maximization demonstrate the better performance of using the new estimator.
△ Less
Submitted 30 December, 2022; v1 submitted 18 July, 2022;
originally announced July 2022.
-
Smoothed Online Convex Optimization Based on Discounted-Normal-Predictor
Authors:
Lijun Zhang,
Wei Jiang,
**feng Yi,
Tianbao Yang
Abstract:
In this paper, we investigate an online prediction strategy named as Discounted-Normal-Predictor (Kapralov and Panigrahy, 2010) for smoothed online convex optimization (SOCO), in which the learner needs to minimize not only the hitting cost but also the switching cost. In the setting of learning with expert advice, Daniely and Mansour (2019) demonstrate that Discounted-Normal-Predictor can be util…
▽ More
In this paper, we investigate an online prediction strategy named as Discounted-Normal-Predictor (Kapralov and Panigrahy, 2010) for smoothed online convex optimization (SOCO), in which the learner needs to minimize not only the hitting cost but also the switching cost. In the setting of learning with expert advice, Daniely and Mansour (2019) demonstrate that Discounted-Normal-Predictor can be utilized to yield nearly optimal regret bounds over any interval, even in the presence of switching costs. Inspired by their results, we develop a simple algorithm for SOCO: Combining online gradient descent (OGD) with different step sizes sequentially by Discounted-Normal-Predictor. Despite its simplicity, we prove that it is able to minimize the adaptive regret with switching cost, i.e., attaining nearly optimal regret with switching cost on every interval. By exploiting the theoretical guarantee of OGD for dynamic regret, we further show that the proposed algorithm can minimize the dynamic regret with switching cost in every interval.
△ Less
Submitted 2 May, 2022;
originally announced May 2022.
-
Statistically Consistent Inverse Optimal Control for Linear-Quadratic Tracking with Random Time Horizon
Authors:
Han Zhang,
Axel Ringh,
Weihan Jiang,
Shaoyuan Li,
Xiaoming Hu
Abstract:
The goal of Inverse Optimal Control (IOC) is to identify the underlying objective function based on observed optimal trajectories. It provides a powerful framework to model expert's behavior, and a data-driven way to design an objective function so that the induced optimal control is adapted to a contextual environment. In this paper, we design an IOC algorithm for linear-quadratic tracking proble…
▽ More
The goal of Inverse Optimal Control (IOC) is to identify the underlying objective function based on observed optimal trajectories. It provides a powerful framework to model expert's behavior, and a data-driven way to design an objective function so that the induced optimal control is adapted to a contextual environment. In this paper, we design an IOC algorithm for linear-quadratic tracking problems with random time horizon, and prove the statistical consistency of the algorithm. More specifically, the proposed estimator is the solution to a convex optimization problem, which means that the estimator does not suffer from local minima. This enables the proven statistical consistency to actually be achieved in practice. The algorithm is also verified on simulated data as well as data from a real world experiment, both in the setting of identifying the objective function of human tracking locomotion. The statistical consistency is illustrated on the synthetic data set, and the experimental results on the real data shows that we can get a good prediction on human tracking locomotion based on estimating the objective function. It shows that the theory and the model have a good performance in real practice. Moreover, the identified model can be used as a control target in personalized rehabilitation robot controller design, since the identified objective function describes personal habit and preferences.
△ Less
Submitted 27 April, 2022;
originally announced April 2022.
-
On Turing-Turing bifurcation of partial functional differential equations and its induced superposition patterns
Authors:
Xun Cao,
Weihua Jiang
Abstract:
When two Turing modes interact, i.e., Turing-Turing bifurcation occurs, superposition patterns revealing complex dynamical phenomena appear. In this paper, superposition patterns resulting from Turing-Turing bifurcation are investigated in theory. Firstly, the third-order normal form locally topologically equivalent to original partial functional differential equations (PFDEs) is derived. When sel…
▽ More
When two Turing modes interact, i.e., Turing-Turing bifurcation occurs, superposition patterns revealing complex dynamical phenomena appear. In this paper, superposition patterns resulting from Turing-Turing bifurcation are investigated in theory. Firstly, the third-order normal form locally topologically equivalent to original partial functional differential equations (PFDEs) is derived. When selecting 1D domain and Neumann boundary conditions, three normal forms describing different spatial patterns are deduced from original third-order normal form. Also, formulas for computing coefficients of these normal forms are given, which are expressed in explicit form of original system parameters. With the aid of three normal forms, spatial patterns of a diffusive predator-prey system with Crowley-Martin functional response near Turing-Turing singularity are investigated. For one set of parameters, diffusive system supports the coexistence of four stable steady states with different single characteristic wavelengths, which demonstrates our previous conjecture. For another set of parameters, superposition patterns, tri-stable patterns that a pair of stable superposition steady states coexists with the stable coexistence equilibrium or another stable steady state, as well as quad-stable patterns that a pair of stable superposition steady states and another pair of stable steady states coexist, arise. Finally, numerical simulations are shown to support theory analysis.
△ Less
Submitted 9 April, 2022;
originally announced April 2022.
-
Optimal Algorithms for Stochastic Multi-Level Compositional Optimization
Authors:
Wei Jiang,
Bokun Wang,
Yibo Wang,
Lijun Zhang,
Tianbao Yang
Abstract:
In this paper, we investigate the problem of stochastic multi-level compositional optimization, where the objective function is a composition of multiple smooth but possibly non-convex functions. Existing methods for solving this problem either suffer from sub-optimal sample complexities or need a huge batch size. To address these limitations, we propose a Stochastic Multi-level Variance Reduction…
▽ More
In this paper, we investigate the problem of stochastic multi-level compositional optimization, where the objective function is a composition of multiple smooth but possibly non-convex functions. Existing methods for solving this problem either suffer from sub-optimal sample complexities or need a huge batch size. To address these limitations, we propose a Stochastic Multi-level Variance Reduction method (SMVR), which achieves the optimal sample complexity of $\mathcal{O}\left(1 / ε^{3}\right)$ to find an $ε$-stationary point for non-convex objectives. Furthermore, when the objective function satisfies the convexity or Polyak-Łojasiewicz (PL) condition, we propose a stage-wise variant of SMVR and improve the sample complexity to $\mathcal{O}\left(1 / ε^{2}\right)$ for convex functions or $\mathcal{O}\left(1 /\left(με\right)\right)$ for non-convex functions satisfying the $μ$-PL condition. The latter result implies the same complexity for $μ$-strongly convex functions. To make use of adaptive learning rates, we also develop Adaptive SMVR, which achieves the same complexities but converges faster in practice. All our complexities match the lower bounds not only in terms of $ε$ but also in terms of $μ$ (for PL or strongly convex functions), without using a large batch size in each iteration.
△ Less
Submitted 18 October, 2022; v1 submitted 15 February, 2022;
originally announced February 2022.
-
A symmetrized parametric finite element method for anisotropic surface diffusion of closed curves
Authors:
Weizhu Bao,
Wei Jiang,
Yifei Li
Abstract:
We deal with a long-standing problem about how to design an energy-stable numerical scheme for solving the motion of a closed curve under {\sl anisotropic surface diffusion} with a general anisotropic surface energy $γ(\boldsymbol{n})$ in two dimensions, where $\boldsymbol{n}$ is the outward unit normal vector. By introducing a novel symmetric positive definite surface energy matrix…
▽ More
We deal with a long-standing problem about how to design an energy-stable numerical scheme for solving the motion of a closed curve under {\sl anisotropic surface diffusion} with a general anisotropic surface energy $γ(\boldsymbol{n})$ in two dimensions, where $\boldsymbol{n}$ is the outward unit normal vector. By introducing a novel symmetric positive definite surface energy matrix $Z_k(\boldsymbol{n})$ depending on the Cahn-Hoffman $\boldsymbolξ$-vector and a stabilizing function $k(\boldsymbol{n})$, we first reformulate the anisotropic surface diffusion into a conservative form and then derive a new symmetrized variational formulation for the anisotropic surface diffusion with weakly or strongly anisotropic surface energies. A semi-discretization in space for the symmetrized variational formulation is proposed and its area (or mass) conservation and energy dissipation are proved. The semi-discretization is then discretized in time by either an implicit structural-preserving scheme (SP-PFEM) which preserves the area in the discretized level or a semi-implicit energy-stable method (ES-PFEM) which needs only solve a linear system at each time step. Under a relatively simple and mild condition on $γ(\boldsymbol{n})$, we show that both SP-PFEM and ES-PFEM are unconditionally energy-stable for almost all anisotropic surface energies $γ(\boldsymbol{n})$ arising in practical applications. Specifically, for several commonly-used anisotropic surface energies, we construct $Z_k(\boldsymbol{n})$ explicitly. Finally, extensive numerical results are reported to demonstrate the high performance of the proposed numerical schemes.
△ Less
Submitted 26 October, 2022; v1 submitted 1 December, 2021;
originally announced December 2021.
-
Weak scalar curvature lower bounds along Ricci flow
Authors:
Wenshuai Jiang,
Weimin Sheng,
Huaiyu Zhang
Abstract:
In this paper, we study Ricci flow on compact manifolds with a continuous initial metric. It was known from Simon that the Ricci flow exists for a short time. We prove that the scalar curvature lower bound is preserved along the Ricci flow if the initial metric has a scalar curvature lower bound in distributional sense provided that the initial metric is $W^{1,p}$ for some $n<p\le \infty$.
As an…
▽ More
In this paper, we study Ricci flow on compact manifolds with a continuous initial metric. It was known from Simon that the Ricci flow exists for a short time. We prove that the scalar curvature lower bound is preserved along the Ricci flow if the initial metric has a scalar curvature lower bound in distributional sense provided that the initial metric is $W^{1,p}$ for some $n<p\le \infty$.
As an application, we use this result to study the relation between Yamabe invariant and Ricci flat metrics. We prove that if the Yamabe invariant is nonpositive and the scalar curvature is nonnegative in distributional sense, then the manifold is isometric to a Ricci flat manifold.
△ Less
Submitted 27 October, 2021; v1 submitted 23 October, 2021;
originally announced October 2021.
-
Numerical Eigensolver for Solving Eigenmodes of Cavity Resonators Filled With both Electric and Magnetic Lossy, Anisotropic Media
Authors:
Wei Jiang,
Jie Liu,
Shiling Zheng
Abstract:
This article presents the numerical eigensolver to find the resonant frequencies of 3-D closed cavity resonators filled with both electric and magnetic lossy, anisotropic media. By introducing a dummy variable with zero value in the 3-D linear vector Maxwell eigenvalue problem for the electric field, we enforce the divergence-free condition for electric flux density in a weak sense. In addition, b…
▽ More
This article presents the numerical eigensolver to find the resonant frequencies of 3-D closed cavity resonators filled with both electric and magnetic lossy, anisotropic media. By introducing a dummy variable with zero value in the 3-D linear vector Maxwell eigenvalue problem for the electric field, we enforce the divergence-free condition for electric flux density in a weak sense. In addition, by introducing a dummy variable with constant value in the 3-D linear vector Maxwell eigenvalue problem for the magnetic field, we enforce the divergence-free condition for magnetic flux density in a weak sense. Moreover, it is theoretically proved that the novel method of introducing dummy variables can be free of all the spurious modes in solving eigenmodes of the 3-D closed cavity problem. Numerical experiments show that the numerical eigensolver supported by this article can eliminate all the spurious modes, including spurious dc modes.
△ Less
Submitted 30 August, 2021;
originally announced August 2021.
-
Internal and String Stability of an Observer-based Controller for Vehicle Platooning under the MPF Topology
Authors:
Wei Jiang,
Elham Abolfazli,
Themistoklis Charalambous
Abstract:
In this paper, we study the internal stability and string stability of a vehicle platoon under the constant time headway spacing (CTHS) policy and the multiple-predecessor-following (MPF) vehicle-to-vehicle information flow topology. More specifically, we depart from the conventional Proportional-Integral-Derivative (PID) controller design for such systems and we propose the design of an observer-…
▽ More
In this paper, we study the internal stability and string stability of a vehicle platoon under the constant time headway spacing (CTHS) policy and the multiple-predecessor-following (MPF) vehicle-to-vehicle information flow topology. More specifically, we depart from the conventional Proportional-Integral-Derivative (PID) controller design for such systems and we propose the design of an observer-based controller. For designing our observer-based controller, we first design a distributed observer, with which each follower estimates their position, speed and acceleration error with respect to the leader. The observer is designed by means of constructing an observer matrix whose parameters should be determined. Next, we simplify the design of the matrix of the observer in such a way that the design boils down to choosing a single scalar value; this design further simplifies the structure of the controller, whose simplicity enables the derivation of string stability conditions by means of a frequency response method. Subsequently, the string stability conditions for a given time headway, are transformed to conditions for the controller parameters. We obtain controller parameters that satisfy the stability conditions by designing a novel heuristic search algorithm. Furthermore, we extend the search algorithm by incorporating a bisection-like algorithm, which allows to obtain (within some deviation tolerance) the minimum available value of the time headway. Finally, we provide insights about how to finalize the observer-based controller parameters from above algorithms to avoid the peaking phenomenon. The performance of the proposed observer-based controller is demonstrated via illustrative examples. Additionally, a comparison with a widely-used PID controller for MPF topology shows that our proposed observer-based controller has better convergence performance.
△ Less
Submitted 14 May, 2024; v1 submitted 21 August, 2021;
originally announced August 2021.
-
Special MMP for log canonical generalised pairs
Authors:
Vladimir Lazić,
Nikolaos Tsakanikas,
with an appendix joint with Xiaowei Jiang
Abstract:
We show that minimal models of $\mathbb{Q}$-factorial NQC log canonical generalised pairs exist, assuming the existence of minimal models of smooth varieties. More generally, we prove that on a $\mathbb{Q}$-factorial NQC log canonical generalised pair $ (X,B+M) $ we can run an MMP with scaling of an ample divisor which terminates, assuming that it admits an NQC weak Zariski decomposition or that…
▽ More
We show that minimal models of $\mathbb{Q}$-factorial NQC log canonical generalised pairs exist, assuming the existence of minimal models of smooth varieties. More generally, we prove that on a $\mathbb{Q}$-factorial NQC log canonical generalised pair $ (X,B+M) $ we can run an MMP with scaling of an ample divisor which terminates, assuming that it admits an NQC weak Zariski decomposition or that $K_X+B+M$ is not pseudoeffective. As a consequence, we establish several existence results for minimal models and Mori fibre spaces.
△ Less
Submitted 10 August, 2022; v1 submitted 2 August, 2021;
originally announced August 2021.
-
Fully Distributed Alternating Direction Method of Multipliers in Digraphs via Finite-Time Termination Mechanisms
Authors:
Wei Jiang,
Themistoklis Charalambous
Abstract:
In this work, we consider the distributed optimization problem in which each node has its own convex cost function and can communicate directly only with its neighbors, as determined by a directed communication topology (directed graph or digraph). First, we reformulate the optimization problem so that Alternating Direction Method of Multipliers (ADMM) can be utilized. Then, we propose an algorith…
▽ More
In this work, we consider the distributed optimization problem in which each node has its own convex cost function and can communicate directly only with its neighbors, as determined by a directed communication topology (directed graph or digraph). First, we reformulate the optimization problem so that Alternating Direction Method of Multipliers (ADMM) can be utilized. Then, we propose an algorithm, herein called Distributed Alternating Direction Method of Multipliers using Finite-Time Exact Ratio Consensus (D-ADMM-FTERC), to solve the multi-node convex optimization problem, in which every node performs iterative computations and exchanges information with its neighbors. At every iteration of D-ADMM-FTERC, each node solves a local convex optimization problem for the one of the primal variables and utilizes a finite-time exact consensus protocol to obtain the optimal value of the other variable, since the cost function for the second primal variable is not decomposable. Since D-ADMM-FTERC requires to know the upper bound on the number of nodes in the network, we furthermore propose a new algorithm, called Fully D-ADMM Finite-Time Distributed Termination (FD-ADMM-FTDT) algorithm, which does not need any global information. If the individual cost functions are convex and not-necessarily differentiable, the proposed algorithms converge at a rate of O(1/k), where k is the iteration counter. Additionally, if the global objective function is strongly convex and smooth, the proposed algorithms have an "approximate" R-linear convergence rate. The efficacy of FD-ADMM-FTDT is demonstrated via a distributed L1 regularized logistic regression optimization example. Additionally, comparisons with other state-of-the-art algorithms are provided on large-scale networks showing the superior precision and time-efficient performance of FD-ADMM-FTDT.
△ Less
Submitted 6 October, 2021; v1 submitted 5 July, 2021;
originally announced July 2021.
-
Solving PDEs on Unknown Manifolds with Machine Learning
Authors:
Senwei Liang,
Shixiao W. Jiang,
John Harlim,
Haizhao Yang
Abstract:
This paper proposes a mesh-free computational framework and machine learning theory for solving elliptic PDEs on unknown manifolds, identified with point clouds, based on diffusion maps (DM) and deep learning. The PDE solver is formulated as a supervised learning task to solve a least-squares regression problem that imposes an algebraic equation approximating a PDE (and boundary conditions if appl…
▽ More
This paper proposes a mesh-free computational framework and machine learning theory for solving elliptic PDEs on unknown manifolds, identified with point clouds, based on diffusion maps (DM) and deep learning. The PDE solver is formulated as a supervised learning task to solve a least-squares regression problem that imposes an algebraic equation approximating a PDE (and boundary conditions if applicable). This algebraic equation involves a graph-Laplacian type matrix obtained via DM asymptotic expansion, which is a consistent estimator of second-order elliptic differential operators. The resulting numerical method is to solve a highly non-convex empirical risk minimization problem subjected to a solution from a hypothesis space of neural networks (NNs). In a well-posed elliptic PDE setting, when the hypothesis space consists of neural networks with either infinite width or depth, we show that the global minimizer of the empirical loss function is a consistent solution in the limit of large training data. When the hypothesis space is a two-layer neural network, we show that for a sufficiently large width, gradient descent can identify a global minimizer of the empirical loss function. Supporting numerical examples demonstrate the convergence of the solutions, ranging from simple manifolds with low and high co-dimensions, to rough surfaces with and without boundaries. We also show that the proposed NN solver can robustly generalize the PDE solution on new data points with generalization errors that are almost identical to the training errors, superseding a Nystrom-based interpolation method.
△ Less
Submitted 27 February, 2024; v1 submitted 11 June, 2021;
originally announced June 2021.
-
Kernel-based methods for Solving Time-Dependent Advection-Diffusion Equations on Manifolds
Authors:
Qile Yan,
Shixiao Willing Jiang,
John Harlim
Abstract:
In this paper, we extend the class of kernel methods, the so-called diffusion maps (DM) and ghost point diffusion maps (GPDM), to solve the time-dependent advection-diffusion PDE on unknown smooth manifolds without and with boundaries. The core idea is to directly approximate the spatial components of the differential operator on the manifold with a local integral operator and combine it with the…
▽ More
In this paper, we extend the class of kernel methods, the so-called diffusion maps (DM) and ghost point diffusion maps (GPDM), to solve the time-dependent advection-diffusion PDE on unknown smooth manifolds without and with boundaries. The core idea is to directly approximate the spatial components of the differential operator on the manifold with a local integral operator and combine it with the standard implicit time difference scheme. When the manifold has a boundary, a simplified version of the GPDM approach is used to overcome the bias of the integral approximation near the boundary. The Monte-Carlo discretization of the integral operator over the point cloud data gives rise to a mesh-free formulation that is natural for randomly distributed points, even when the manifold is embedded in high-dimensional ambient space. Here, we establish the convergence of the proposed solver on appropriate topologies, depending on the distribution of point cloud data and boundary type. We provide numerical results to validate the convergence results on various examples that involve simple geometry and an unknown manifold. Additionally, we also found positive results in solving the one-dimensional viscous Burger's equation where GPDM is adopted with a pseudo-spectral Galerkin framework to approximate nonlinear advection term.
△ Less
Submitted 28 May, 2021;
originally announced May 2021.
-
An Asynchronous Approximate Distributed Alternating Direction Method of Multipliers in Digraphs
Authors:
Wei Jiang,
Andreas Grammenos,
Evangelia Kalyvianaki,
Themistoklis Charalambous
Abstract:
In this work, we consider the asynchronous distributed optimization problem in which each node has its own convex cost function and can communicate directly only with its neighbors, as determined by a directed communication topology (directed graph or digraph). First, we reformulate the optimization problem so that Alternating Direction Method of Multipliers (ADMM) can be utilized. Then, we propos…
▽ More
In this work, we consider the asynchronous distributed optimization problem in which each node has its own convex cost function and can communicate directly only with its neighbors, as determined by a directed communication topology (directed graph or digraph). First, we reformulate the optimization problem so that Alternating Direction Method of Multipliers (ADMM) can be utilized. Then, we propose an algorithm, herein called Asynchronous Approximate Distributed Alternating Direction Method of Multipliers (AsyAD-ADMM), using finite-time asynchronous approximate ratio consensus, to solve the multi-node convex optimization problem, in which every node performs iterative computations and exchanges information with its neighbors asynchronously. More specifically, at every iteration of AsyAD-ADMM, each node solves a local convex optimization problem for one of the primal variables and utilizes a finite-time asynchronous approximate consensus protocol to obtain the value of the other variable which is close to the optimal value, since the cost function for the second primal variable is not decomposable. If the individual cost functions are convex but not necessarily differentiable, the proposed algorithm converges at a rate of $\mathcal{O}(1/k)$, where $k$ is the iteration counter. The efficacy of AsyAD-ADMM is exemplified via a proof-of-concept distributed least-square optimization problem with different performance-influencing factors investigated.
△ Less
Submitted 23 April, 2021;
originally announced April 2021.
-
An Iteratively Reweighted Method for Sparse Optimization on Nonconvex $\ell_{p}$ Ball
Authors:
Hao Wang,
Xiangyu Yang,
Wei Jiang
Abstract:
This paper is intended to solve the nonconvex $\ell_{p}$-ball constrained nonlinear optimization problems. An iteratively reweighted method is proposed, which solves a sequence of weighted $\ell_{1}$-ball projection subproblems. At each iteration, the next iterate is obtained by moving along the negative gradient with a stepsize and then projecting the resulted point onto the weighted $\ell_{1}$ b…
▽ More
This paper is intended to solve the nonconvex $\ell_{p}$-ball constrained nonlinear optimization problems. An iteratively reweighted method is proposed, which solves a sequence of weighted $\ell_{1}$-ball projection subproblems. At each iteration, the next iterate is obtained by moving along the negative gradient with a stepsize and then projecting the resulted point onto the weighted $\ell_{1}$ ball to approximate the $\ell_{p}$ ball. Specifically, if the current iterate is in the interior of the feasible set, then the weighted $\ell_{1}$ ball is formed by linearizing the $\ell_{p}$ norm at the current iterate. If the current iterate is on the boundary of the feasible set, then the weighted $\ell_{1}$ ball is formed differently by kee** those zero components in the current iterate still zero. In our analysis, we prove that the generated iterates converge to a first-order stationary point. Numerical experiments demonstrate the effectiveness of the proposed method.
△ Less
Submitted 7 April, 2021;
originally announced April 2021.
-
Revisiting Smoothed Online Learning
Authors:
Lijun Zhang,
Wei Jiang,
Shiyin Lu,
Tianbao Yang
Abstract:
In this paper, we revisit the problem of smoothed online learning, in which the online learner suffers both a hitting cost and a switching cost, and target two performance metrics: competitive ratio and dynamic regret with switching cost.
To bound the competitive ratio, we assume the hitting cost is known to the learner in each round, and investigate the simple idea of balancing the two costs by…
▽ More
In this paper, we revisit the problem of smoothed online learning, in which the online learner suffers both a hitting cost and a switching cost, and target two performance metrics: competitive ratio and dynamic regret with switching cost.
To bound the competitive ratio, we assume the hitting cost is known to the learner in each round, and investigate the simple idea of balancing the two costs by an optimization problem. Surprisingly, we find that minimizing the hitting cost alone is $\max(1, \frac{2}α)$-competitive for $α$-polyhedral functions and $1 + \frac{4}λ$-competitive for $λ$-quadratic growth functions, both of which improve state-of-the-art results significantly. Moreover, when the hitting cost is both convex and $λ$-quadratic growth, we reduce the competitive ratio to $1 + \frac{2}{\sqrtλ}$ by minimizing the weighted sum of the hitting cost and the switching cost.
To bound the dynamic regret with switching cost, we follow the standard setting of online convex optimization, in which the hitting cost is convex but hidden from the learner before making predictions. We modify Ader, an existing algorithm designed for dynamic regret, slightly to take into account the switching cost when measuring the performance. The proposed algorithm, named as Smoothed Ader, attains an optimal $O(\sqrt{T(1+P_T)})$ bound for dynamic regret with switching cost, where $P_T$ is the path-length of the comparator sequence. Furthermore, if the hitting cost is accessible in the beginning of each round, we obtain a similar guarantee without the bounded gradient condition, and establish an $Ω(\sqrt{T(1+P_T)})$ lower bound to confirm the optimality.
△ Less
Submitted 18 May, 2021; v1 submitted 13 February, 2021;
originally announced February 2021.
-
A perimeter-decreasing and area-conserving algorithm for surface diffusion flow of curves
Authors:
Wei Jiang,
Buyang Li
Abstract:
A fully discrete finite element method, based on a new weak formulation and a new time-step** scheme, is proposed for the surface diffusion flow of closed curves in the two-dimensional plane. It is proved that the proposed method can preserve two geometric structures simultaneously at the discrete level, i.e., the perimeter of the curve decreases in time while the area enclosed by the curve is c…
▽ More
A fully discrete finite element method, based on a new weak formulation and a new time-step** scheme, is proposed for the surface diffusion flow of closed curves in the two-dimensional plane. It is proved that the proposed method can preserve two geometric structures simultaneously at the discrete level, i.e., the perimeter of the curve decreases in time while the area enclosed by the curve is conserved. Numerical examples are provided to demonstrate the convergence of the proposed method and the effectiveness of the method in preserving the two geometric structures.
△ Less
Submitted 30 January, 2021;
originally announced February 2021.
-
Removable singularity of positive mass theorem with continuous metrics
Authors:
Wenshuai Jiang,
Weimin Sheng,
Huaiyu Zhang
Abstract:
In this paper, we consider asymptotically flat Riemannnian manifolds $(M^n,g)$ with $C^0$ metric $g$ and $g$ is smooth away from a closed bounded subset $Σ$ and the scalar curvature $R_g\ge 0$ on $M\setminus Σ$. For given $n\le p\le \infty$, if $g\in C^0\cap W^{1,p}$ and the Hausdorff measure $\mathcal{H}^{n-\frac{p}{p-1}}(Σ)<\infty$ when $n\le p<\infty$ or $\mathcal{H}^{n-1}(Σ)=0$ when…
▽ More
In this paper, we consider asymptotically flat Riemannnian manifolds $(M^n,g)$ with $C^0$ metric $g$ and $g$ is smooth away from a closed bounded subset $Σ$ and the scalar curvature $R_g\ge 0$ on $M\setminus Σ$. For given $n\le p\le \infty$, if $g\in C^0\cap W^{1,p}$ and the Hausdorff measure $\mathcal{H}^{n-\frac{p}{p-1}}(Σ)<\infty$ when $n\le p<\infty$ or $\mathcal{H}^{n-1}(Σ)=0$ when $p=\infty$, then we prove that the ADM mass of each end is nonnegative. Furthermore, if the ADM mass of some end is zero, then we prove that $(M^n,g)$ is isometric to the Euclidean space by showing the manifold has nonnegative Ricci curvature in RCD sense. This extends the result of [Lee-LeFloch2015] from spin to non-spin, also improves the result of [Shi-Tam2018] and [Lee2013]. Moreover, for $p=\infty$, this confirms a conjecture of Lee [Lee2013].
△ Less
Submitted 27 December, 2020;
originally announced December 2020.
-
Curvature positivity of invariant direct images of Hermitian vector bundles
Authors:
Fusheng Deng,
**** Hu,
Weiwen Jiang
Abstract:
We prove that the invariant part, with respect to a compact group action satisfying certain condition, of the direct image of a Nakano positive Hermitian holomorphic vector bundle over a bounded pseudoconvex domain is Nakano positive. We also consider the action of the noncompact group $\mathbb{R}^m$ and get the same result for a family of tube domains, which leads to a new method to the matrix-va…
▽ More
We prove that the invariant part, with respect to a compact group action satisfying certain condition, of the direct image of a Nakano positive Hermitian holomorphic vector bundle over a bounded pseudoconvex domain is Nakano positive. We also consider the action of the noncompact group $\mathbb{R}^m$ and get the same result for a family of tube domains, which leads to a new method to the matrix-valued Prekopa's theorem originally proved by Raufi. The two main ingredients in our method are Hörmander's $L^2$ theory of $\bar\partial$ and the recent work of Deng-Ning-Zhang-Zhou on characterization of Nakano positivity of Hermitian holomorphic vector bundles.
△ Less
Submitted 31 August, 2020;
originally announced September 2020.