Search | arXiv e-print repository

Extended alternating structure-adapted proximal gradient algorithm for nonconvex nonsmooth problems

Authors: Ying Gao, Chunfeng Cui, Wenxing Zhang, Deren Han

Abstract: Alternating structure-adapted proximal (ASAP) gradient algorithm (M. Nikolova and P. Tan, SIAM J Optim, 29:2053-2078, 2019) has drawn much attention due to its efficiency in solving nonconvex nonsmooth optimization problems. However, the multiblock nonseparable structure confines the performance of ASAP to far-reaching practical problems, e.g., coupled tensor decomposition. In this paper, we propose an extended ASAP (eASAP) algorithm for nonconvex nonsmooth optimization whose objective is the sum of two nonseparable functions and a coupling one. By exploiting the blockwise restricted prox-regularity, eASAP is capable of minimizing the objective whose coupling function is multiblock nonseparable. Moreover, we analyze the global convergence of eASAP by virtue of the Aubin property on partial subdifferential mapping and the Kurdyka-Łojasiewicz property on the objective. Furthermore, the sublinear convergence rate of eASAP is built upon the proximal point algorithmic framework under some mild conditions. Numerical simulations on multimodal data fusion demonstrate the compelling performance of the proposed method.

Submitted 24 June, 2024; originally announced June 2024.

arXiv:2406.08777 [pdf, other]

Finite Time Blowup of Integer- and Fractional-Order Time-Delayed Diffusion Equations

Authors: Christopher N. Angstmann, Stuart-James M. Burney, Daniel S. Han, Bruce I. Henry, Boris Z. Huang, Zhuang Xu

Abstract: In this work, exact solutions are derived for an integer- and fractional-order time-delayed diffusion equation with arbitrary initial conditions. The solutions are obtained using Fourier transform methods in conjunction with the known properties of delay functions. It is observed that the solutions do not exhibit infinite speed of propagation for smooth initial conditions that are bounded and positive. Sufficient conditions on the initial condition are also established such that the finite time blowup of the solutions can be explicitly calculated. Examples are provided that highlight the contrasting behaviours of these exact solutions with the known dynamics of solutions to the standard diffusion equation.

Submitted 12 June, 2024; originally announced June 2024.

Comments: 17 pages, 2 figures

MSC Class: 35R25; 35C10; 34K06; 34K37; 33E20; 42A38

arXiv:2406.00897 [pdf, other]

Exact Solutions of a Time-Delay Advection Equation and a Fractional Time-Delay Advection Equation

Authors: Christopher N. Angstmann, Stuart-James M. Burney, Daniel S. Han, Bruce I. Henry, Boris Z. Huang, Zhuang Xu

Abstract: Exact solutions are derived for a time-delay advection equation and a fractional-order time-delay advection equation with a time-delay in the spatial derivative. Solutions are obtained, for arbitrary separable initial conditions, by incorporating recently introduced delay functions in a separation of variables approach. Examples are provided showing oscillatory and translatory behaviours fundamentally different to standard propagating wave solutions.

Submitted 2 June, 2024; originally announced June 2024.

Comments: Letter

MSC Class: 35C10; 35F10; 34K06; 42A38; 33E20

arXiv:2405.19044 [pdf, ps, other]

On adaptive stochastic extended iterative methods for solving least squares

Authors: Yun Zeng, Deren Han, Yansheng Su, Jiaxin Xie

Abstract: In this paper, we propose a novel adaptive stochastic extended iterative method, which can be viewed as an improved extension of the randomized extended Kaczmarz (REK) method, for finding the unique minimum Euclidean norm least-squares solution of a given linear system. In particular, we introduce three equivalent stochastic reformulations of the linear least-squares problem: stochastic unconstrained and constrained optimization problems, and the stochastic multiobjective optimization problem. We then alternately employ the adaptive variants of the stochastic heavy ball momentum (SHBM) method, which utilize iterative information to update the parameters, to solve the stochastic reformulations. We prove that our method converges linearly in expectation, addressing an open problem in the literature related to designing theoretically supported adaptive SHBM methods. Numerical experiments show that our adaptive stochastic extended iterative method has strong advantages over the non-adaptive one.

Submitted 29 May, 2024; originally announced May 2024.

arXiv:2405.04091 [pdf, ps, other]

Randomized iterative methods for generalized absolute value equations: Solvability and error bounds

Authors: Jiaxin Xie, Houduo Qi, Deren Han

Abstract: Randomized iterative methods, such as the Kaczmarz method and its variants, have gained growing attention due to their simplicity and efficiency in solving large-scale linear systems. Meanwhile, absolute value equations (AVE) have attracted increasing interest due to their connection with the linear complementarity problem. In this paper, we investigate the application of randomized iterative methods to generalized AVE (GAVE). Our approach differs from most existing works in that we tackle GAVE with non-square coefficient matrices. We establish more comprehensive sufficient and necessary conditions for characterizing the solvability of GAVE and propose precise error bound conditions. Furthermore, we introduce a flexible and efficient randomized iterative algorithmic framework for solving GAVE, which employs sampling matrices drawn from user-specified distributions. This framework is capable of encompassing many well-known methods, including the Picard iteration method and the randomized Kaczmarz method. Leveraging our findings on solvability and error bounds, we establish both almost sure convergence and linear convergence rates for this versatile algorithmic framework. Finally, we present numerical examples to illustrate the advantages of the new algorithms.

Submitted 8 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

arXiv:2404.18560 [pdf, other]

Non-convex Pose Graph Optimization in SLAM via Proximal Linearized Riemannian ADMM

Authors: Xin Chen, Chunfeng Cui, Deren Han, Liqun Qi

Abstract: Pose graph optimization (PGO) is a well-known technique for solving the pose-based simultaneous localization and mapping (SLAM) problem. In this paper, we represent the rotation and translation by a unit quaternion and a three-dimensional vector, and propose a new PGO model based on the von Mises-Fisher distribution. The constraints derived from the unit quaternions are spherical manifolds, and the projection onto the constraints can be calculated by normalization. Then a proximal linearized Riemannian alternating direction method of multipliers (PieADMM) is developed to solve the proposed model, which not only has low memory requirements, but also can update the poses in parallel. Furthermore, we establish the iteration complexity of $O(1/ε^{2})$ of PieADMM for finding an $ε$-stationary solution of our model. The efficiency of our proposed algorithm is demonstrated by numerical experiments on two synthetic and four 3D SLAM benchmark datasets.

Submitted 29 April, 2024; originally announced April 2024.
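
The abstract above notes that projecting onto the unit-quaternion constraint reduces to normalization. A minimal sketch of that single step follows; it is not the PieADMM algorithm, and the function name `project_to_unit_quaternion` and the sample vector are illustrative assumptions rather than anything taken from the paper.

```python
import math

def project_to_unit_quaternion(q):
    """Return the closest point to q on the unit sphere in R^4 (q / ||q||).

    Hypothetical helper for illustration only; the zero vector is rejected
    because its projection onto the sphere is not unique.
    """
    norm = math.sqrt(sum(c * c for c in q))
    if norm == 0.0:
        raise ValueError("the zero vector has no unique projection onto the unit sphere")
    return tuple(c / norm for c in q)

# Example: an arbitrary 4-vector with Euclidean norm 5.
q_unit = project_to_unit_quaternion((1.0, 2.0, 2.0, 4.0))
```

Because the feasible set is a sphere, the Euclidean projection of any nonzero vector is just a rescaling, which is why this step is cheap compared with projections onto more general manifolds.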

arXiv:2404.11822 [pdf, ps, other]

A class of maximum-based iteration methods for the generalized absolute value equation

Authors: Shiliang Wu, Deren Han, Cuixia Li

Abstract: In this paper, by using $|x|=2\max\{0,x\}-x$, a class of maximum-based iteration methods is established to solve the generalized absolute value equation $Ax-B|x|=b$. Some convergence conditions of the proposed method are presented. By some numerical experiments, the effectiveness and feasibility of the proposed method are confirmed.

Submitted 17 April, 2024; originally announced April 2024.
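
The identity quoted in the abstract above can be checked by cases, and substituting it componentwise into the equation gives a max-based form. The short derivation below is only a sketch of that algebra; whether the paper's iteration is built on exactly this rearrangement is an assumption.

```latex
% Case check of the identity for a real scalar x:
%   x >= 0:  2\max\{0,x\} - x = 2x - x = x  = |x|
%   x <  0:  2\max\{0,x\} - x = 0 - x = -x = |x|
\[
  |x| = 2\max\{0,x\} - x .
\]
% Applying it componentwise in Ax - B|x| = b:
\[
  Ax - B\bigl(2\max\{0,x\} - x\bigr) = b
  \quad\Longleftrightarrow\quad
  (A+B)\,x - 2B\max\{0,x\} = b .
\]
```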

arXiv:2404.09460 [pdf, other]

Optimal Real-time Bidding Strategy For EV Aggregators in Wholesale Electricity Markets

Authors: Shihan Huang, Dongkun Han, John Zhen Fu Pang, Yue Chen

Abstract: With the rapid growth of electric vehicles (EVs), EV aggregators have been playing an increasingly vital role in power systems by not merely providing charging management but also participating in wholesale electricity markets. This work studies the optimal real-time bidding strategy for an EV aggregator. Since the charging process of EVs is time-coupled, it is necessary for EV aggregators to consider future operational conditions (e.g., future EV arrivals) when deciding the current bidding strategy. However, accurately forecasting future operational conditions is challenging under the inherent uncertainties. Hence, there is a demand for a real-time bidding strategy based solely on the up-to-date information, which is the main goal of this work. We start by developing an online optimal EV charging management algorithm for the EV aggregator via Lyapunov optimization. Based on this, an optimal real-time bidding strategy (bidding cost curve and bounds) for the aggregator is derived. Then, an efficient yet practical algorithm is proposed to obtain the bidding strategy. It shows that with the proposed bidding strategy, the aggregator's profit is nearly offline optimal. Moreover, the wholesale electricity market clearing result aligns with the individual aggregator's optimal charging strategy given the prices. Case studies against several benchmarks are conducted to evaluate the performance of the proposed method.

Submitted 15 April, 2024; originally announced April 2024.

Comments: 13 pages, 6 figures

arXiv:2403.19218 [pdf, other]

A piecewise neural network method for solving large interval solution to initial value problem of ordinary differential equations

Authors: Dongpeng Han, Chaolu Temuer

Abstract: Various traditional numerical methods for solving initial value problems of differential equations often produce local solutions near the initial value point, despite the problems having larger interval solutions. Even current popular neural network algorithms or deep learning methods cannot guarantee yielding large interval solutions for these problems. In this paper, we propose a piecewise neural network approach to obtain a large interval numerical solution for initial value problems of differential equations. In this method, we first divide the solution interval, on which the initial problem is to be solved, into several smaller intervals. Neural networks with a unified structure are then employed on each sub-interval to solve the related sub-problems. By assembling these neural network solutions, a piecewise expression of the large interval solution to the problem is constructed, referred to as the piecewise neural network solution. The continuous differentiability of the solution over the entire interval, except for finite points, is proven through theoretical analysis and employing a parameter transfer technique. Additionally, a parameter transfer and multiple rounds of pre-training technique are utilized to enhance the accuracy of the approximation solution. Compared with existing neural network algorithms, this method does not increase the network size and training data scale for training the network on each sub-domain. Finally, several numerical experiments are presented to demonstrate the efficiency of the proposed algorithm.

Submitted 28 March, 2024; originally announced March 2024.

Comments: 26 pages, 13 figures

arXiv:2403.13977 [pdf, other]

Spectral Analysis of Lattice Schrödinger-Type Operators Associated with the Nonstationary Anderson Model and Intermittency

Authors: Dan Han, Stanislav Molchanov, Boris Vainberg

Abstract: The research explores a high irregularity, commonly referred to as intermittency, of the solution to the non-stationary parabolic Anderson problem: \begin{equation*} \frac{\partial u}{\partial t} = \varkappa \mathcal{L}u(t,x) + ξ_{t}(x)u(t,x) \end{equation*} with the initial condition $u(0,x) \equiv 1$, where $(t,x) \in [0,\infty)\times \mathbb{Z}^d$. Here, $\varkappa \mathcal{L}$ denotes a non-local Laplacian, and $ξ_{t}(x)$ is a correlated white noise potential. The observed irregularity is intricately linked to the upper part of the spectrum of the multiparticle Schrödinger equations for the moment functions $m_p(t,x_1,x_2,\cdots,x_p) = \langle u(t,x_1)u(t,x_2)\cdots u(t,x_p)\rangle$. In the first half of the paper, a weak form of intermittency is expressed through moment functions of order $p\geq 3$ and established for a wide class of operators $\varkappa \mathcal{L}$ with a positive-definite correlator $B=B(x)$ of the white noise. In the second half of the paper, the strong intermittency is studied. It relates to the existence of a positive eigenvalue for the lattice Schrödinger type operator with the potential $B$. This operator is associated with the second moment $m_2$. Now $B$ is not necessarily positive-definite, but $\sum B(x)\geq 0$.

Submitted 20 March, 2024; originally announced March 2024.

MSC Class: 60H25; 60H15; 81Q10; 37H15; 35B40

arXiv:2403.11557 [pdf, other]

doi 10.1109/TAC.2024.3380710

Distributed Adaptive Gradient Algorithm with Gradient Tracking for Stochastic Non-Convex Optimization

Authors: Dongyu Han, Kun Liu, Yeming Lin, Yuanqing Xia

Abstract: This paper considers a distributed stochastic non-convex optimization problem, where the nodes in a network cooperatively minimize a sum of $L$-smooth local cost functions with sparse gradients. By adaptively adjusting the stepsizes according to the historical (possibly sparse) gradients, a distributed adaptive gradient algorithm is proposed, in which a gradient tracking estimator is used to handle the heterogeneity between different local cost functions. We establish an upper bound on the optimality gap, which indicates that our proposed algorithm can reach a first-order stationary solution dependent on the upper bound on the variance of the stochastic gradients. Finally, numerical examples are presented to illustrate the effectiveness of the algorithm.

Submitted 29 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

Comments: 14 pages, 8 figures

Journal ref: IEEE Transactions on Automatic Control (2024)

arXiv:2402.04406 [pdf, other]

Regularized MIP Model for Optimal Power Flow with Energy Storage Systems and its Applications

Authors: Dahye Han, Nan Jiang, Santanu S. Dey, Weijun Xie

Abstract: Incorporating energy storage systems (ESS) into power systems has been studied in many recent works, where binary variables are often introduced to model the complementary nature of battery charging and discharging. A conventional approach for these ESS optimization problems is to relax binary variables and convert the problem into a linear program. However, such linear programming relaxation models can yield unrealistic fractional solutions, such as simultaneous charging and discharging. In this paper, we develop a regularized Mixed-Integer Programming (MIP) model for the ESS optimal power flow (OPF) problem. We prove that under mild conditions, the proposed regularized model admits a zero integrality gap with its linear programming relaxation; hence, it can be solved efficiently. By studying the properties of the regularized MIP model, we show that its optimal solution is also near-optimal to the original ESS OPF problem, thereby providing a valid and tight upper bound for the ESS OPF problem. The use of the regularized MIP model allows us to solve two intractable problems: a two-stage stochastic ESS OPF problem and a trilevel network contingency problem.

Submitted 6 February, 2024; originally announced February 2024.

arXiv:2401.02040 [pdf, other]

A Bregman Proximal Stochastic Gradient Method with Extrapolation for Nonconvex Nonsmooth Problems

Authors: Qingsong Wang, Zehui Liu, Chunfeng Cui, Deren Han

Abstract: In this paper, we explore a specific optimization problem that involves the combination of a differentiable nonconvex function and a nondifferentiable function. The differentiable component lacks a global Lipschitz continuous gradient, posing challenges for optimization. To address this issue and accelerate the convergence, we propose a Bregman proximal stochastic gradient method with extrapolation (BPSGE), which only requires smooth adaptivity of the differentiable part. Under the variance reduction framework, we not only analyze the subsequential and global convergence of the proposed algorithm under certain conditions, but also analyze the sublinear convergence rate of the subsequence, and the complexity of the algorithm, revealing that the BPSGE algorithm requires at most $O(ε^{-2})$ iterations in expectation to attain an $ε$-stationary point. To validate the effectiveness of our proposed algorithm, we conduct numerical experiments on three real-world applications: graph regularized nonnegative matrix factorization (NMF), matrix factorization with weakly-convex regularization, and NMF with nonconvex sparsity constraints. These experiments demonstrate that BPSGE is faster than the baselines without extrapolation.

Submitted 3 January, 2024; originally announced January 2024.

Comments: accepted by AAAI 2024

arXiv:2312.05268 [pdf, other]

A Stochastic Simulation Method for Fractional Order Compartment Models

Authors: Christopher N. Angstmann, Stuart-James M. Burney, Bruce I. Henry, Daniel S. Han, Byron A. Jacobs, Zhuang Xu

Abstract: Our study focuses on fractional order compartment models derived from underlying physical stochastic processes, providing a more physically grounded approach compared to models that use the dynamical system approach by simply replacing integer-order derivatives with fractional order derivatives. In these models, inherent stochasticity becomes important, particularly when dealing with the dynamics of small populations far from the continuum limit of large particle numbers. The necessity for stochastic simulations arises from deviations of the mean states from those obtained from the governing equations in these scenarios. To address this, we introduce an exact stochastic simulation algorithm designed for fractional order compartment models, based on a semi-Markov process. We have considered a fractional order resusceptibility SIS model and a fractional order recovery SIR model as illustrative examples, highlighting significant disparities between deterministic and stochastic dynamics when the total population is small. Beyond its modeling applications, the algorithm presented serves as a versatile tool for solving fractional order differential equations via Monte Carlo simulations.

Submitted 26 June, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

Comments: 26 pages, 7 figures

MSC Class: 34A08; 60G22; 60K40; 92C45; 92D30

arXiv:2310.10965 [pdf, ps, other]

The neural network models with delays for solving absolute value equations

Authors: Dongmei Yu, Gehao Zhang, Cairong Chen, Deren Han

Abstract: An inverse-free neural network model with mixed delays is proposed for solving the absolute value equation (AVE) $Ax -|x| - b =0$, which includes an inverse-free neural network model with discrete delay as a special case. By using the Lyapunov-Krasovskii theory and the linear matrix inequality (LMI) method, the developed neural network models are proved to be exponentially convergent to the solution of the AVE. Compared with the existing neural network models for solving the AVE, the proposed models feature the ability of solving a class of AVE with $\|A^{-1}\|>1$. Numerical simulations are given to show the effectiveness of the two delayed neural network models.

Submitted 16 October, 2023; originally announced October 2023.

arXiv:2309.03510 [pdf, ps, other]

Gradient estimates for $Δ_pu-|\nabla u|^q+b(x)|u|^{r-1}u=0$ on a complete Riemannian manifold and Liouville type theorems

Authors: Dong Han, Jie He, Youde Wang

Abstract: In this paper the Nash-Moser iteration method is used to study the gradient estimates of solutions to the quasilinear elliptic equation $Δ_p u-|\nabla u|^q+b(x)|u|^{r-1}u=0$ defined on a complete Riemannian manifold $(M,g)$. When $b(x)\equiv0$, a unified Cheng-Yau type estimate of the solutions to this equation is derived. Regardless of whether this equation is defined on a manifold or a region of Euclidean space, certain technical and geometric conditions posed in \cite[Theorem E, F]{MR3261111} are weakened and hence some of the estimates due to Bidaut-Véron, Garcia-Huidobro and Véron (see \cite[Theorem E, F]{MR3261111}) are improved. In addition, we extend their results to the case $p>n=\dim(M)$. When $b(x)$ does not vanish, we can also extend some estimates for positive solutions to the above equation defined on a region of the Euclidean space due to Filippucci-Sun-Zheng \cite{filippucci2022priori} to arbitrary solutions to this equation on a complete Riemannian manifold. Even in the case of Euclidean space, the estimates for positive solutions in \cite{filippucci2022priori} and our results can not cover each other.

Submitted 15 September, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

arXiv:2308.01593 [pdf, ps, other]

New constructions of NMDS self-dual codes

Authors: Dongchun Han, Hanbin Zhang

Abstract: Near maximum distance separable (NMDS) codes are important in finite geometry and coding theory. Self-dual codes are closely related to combinatorics and lattice theory, and have important applications in cryptography. In this paper, we construct a class of $q$-ary linear codes and prove that they are either MDS or NMDS, depending on a certain zero-sum condition. In the NMDS case, we provide an effective approach to construct NMDS self-dual codes which largely extend known parameters of such codes. In particular, we prove that for square $q$, almost $q/8$ NMDS self-dual $q$-ary codes can be constructed.

Submitted 3 August, 2023; originally announced August 2023.

Comments: 12 pages

arXiv:2308.00467 [pdf, ps, other]

On greedy multi-step inertial randomized Kaczmarz method for solving linear systems

Authors: Yansheng Su, Deren Han, Yun Zeng, Jiaxin Xie

Abstract: Recently, the multi-step inertial randomized Kaczmarz (MIRK) method for solving large-scale linear systems was proposed in [17]. In this paper, we incorporate the greedy probability criterion into the MIRK method, along with the introduction of a tighter threshold parameter for this criterion. We prove that the proposed greedy MIRK (GMIRK) method enjoys an improved deterministic linear convergence compared to both the MIRK method and the greedy randomized Kaczmarz method. Furthermore, we exhibit that the multi-step inertial extrapolation approach can be seen geometrically as an orthogonal projection method, and establish its relationship with the sketch-and-project method [15] and the oblique projection technique [22]. Numerical experiments are provided to confirm our results.

Submitted 1 August, 2023; originally announced August 2023.

Comments: arXiv admin note: text overlap with arXiv:2307.01988

arXiv:2307.16702 [pdf, ps, other]

Fast stochastic dual coordinate descent algorithms for linearly constrained convex optimization

Authors: Yun Zeng, Deren Han, Yansheng Su, Jiaxin Xie

Abstract: The problem of finding a solution to the linear system $Ax = b$ with certain minimization properties arises in numerous scientific and engineering areas. In the era of big data, the stochastic optimization algorithms become increasingly significant due to their scalability for problems of unprecedented size. This paper focuses on the problem of minimizing a strongly convex function subject to linear constraints. We consider the dual formulation of this problem and adopt the stochastic coordinate descent to solve it. The proposed algorithmic framework, called fast stochastic dual coordinate descent, utilizes sampling matrices sampled from user-defined distributions to extract gradient information. Moreover, it employs Polyak's heavy ball momentum acceleration with adaptive parameters learned through iterations, overcoming the limitation of the heavy ball momentum method that it requires prior knowledge of certain parameters, such as the singular values of a matrix. With these extensions, the framework is able to recover many well-known methods in the context, including the randomized sparse Kaczmarz method, the randomized regularized Kaczmarz method, the linearized Bregman iteration, and a variant of the conjugate gradient (CG) method. We prove that, with strongly admissible objective function, the proposed method converges linearly in expectation. Numerical experiments are provided to confirm our results.

Submitted 15 August, 2023; v1 submitted 31 July, 2023; originally announced July 2023.

Comments: arXiv admin note: text overlap with arXiv:2305.05482

arXiv:2307.01988 [pdf, ps, other]

On the convergence analysis of the greedy randomized Kaczmarz method

Authors: Yansheng Su, Deren Han, Yun Zeng, Jiaxin Xie

Abstract: In this paper, we analyze the greedy randomized Kaczmarz (GRK) method proposed in Bai and Wu (SIAM J. Sci. Comput., 40(1):A592--A606, 2018) for solving linear systems. We develop more precise greedy probability criteria to effectively select the working row from the coefficient matrix. Notably, we prove that the linear convergence of the GRK method is deterministic and demonstrate that using a tighter threshold parameter can lead to a faster convergence rate. Our result revises existing convergence analyses, which are solely based on the expected error by realizing that the iterates of the GRK method are random variables. Consequently, we obtain an improved iteration complexity for the GRK method. Moreover, Polyak's heavy ball momentum technique is incorporated to improve the performance of the GRK method. We propose a refined convergence analysis, compared with the technique used in Loizou and Richtárik (Comput. Optim. Appl., 77(3):653--710, 2020), of momentum variants of randomized iterative methods, which shows that the proposed GRK method with momentum (mGRK) also enjoys a deterministic linear convergence. Numerical experiments show that the mGRK method is more efficient than the GRK method.

Submitted 14 November, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

arXiv:2305.05482 [pdf, ps, other]

On adaptive stochastic heavy ball momentum for solving linear systems

Authors: Yun Zeng, Deren Han, Yansheng Su, Jiaxin Xie

Abstract: The stochastic heavy ball momentum (SHBM) method has gained considerable popularity as a scalable approach for solving large-scale optimization problems. However, one limitation of this method is its reliance on prior knowledge of certain problem parameters, such as singular values of a matrix. In this paper, we propose an adaptive variant of the SHBM method for solving stochastic problems that are reformulated from linear systems using user-defined distributions. Our adaptive SHBM (ASHBM) method utilizes iterative information to update the parameters, addressing an open problem in the literature regarding the adaptive learning of momentum parameters. We prove that our method converges linearly in expectation, with a better convergence bound compared to the basic method. Notably, we demonstrate that the deterministic version of our ASHBM algorithm can be reformulated as a variant of the conjugate gradient (CG) method, inheriting many of its appealing properties, such as finite-time convergence. Consequently, the ASHBM method can be further generalized to develop a brand-new framework of the stochastic CG (SCG) method for solving linear systems. Our theoretical results are supported by numerical experiments.

Submitted 2 April, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

Comments: to appear in SIAM Journal on Matrix Analysis and Applications

arXiv:2303.02839 [pdf, ps, other]

Single-shot phase retrieval: a holography-driven problem in Sobolev space

Authors: Youfa Li, Shengli Fan, Deguang Han

Abstract: The phase-shifting digital holography (PSDH) is a widely used approach for recovering signals by their interference (with reference waves) intensity measurements. Such measurements are traditionally from multiple shots (corresponding to multiple reference waves). However, the imaging of dynamic signals requires a single-shot PSDH approach, namely, such an approach depends only on the intensity measurements from the interference with a single reference wave. In this paper, based on the uniform admissibility of plane (or spherical) reference wave and the interference intensity-based approximation to quasi-interference intensity, the nonnegative refinable function is applied to establish the single-shot PSDH in Sobolev space. Our approach is conducted by the intensity measurements from the interference of the signal with a single reference wave. The main results imply that the approximation version from such a single-shot approach converges exponentially to the signal as the level increases. Moreover, like the transport of intensity equation (TIE), our results can be interpreted from the perspective of intensity difference.

Submitted 14 May, 2023; v1 submitted 5 March, 2023; originally announced March 2023.

Comments: 37 pages

MSC Class: 42C40; 94A12

arXiv:2302.11780 [pdf, other]

Improving the generalization via coupled tensor norm regularization

Authors: Ying Gao, Yunfei Qu, Chunfeng Cui, Deren Han

Abstract: In this paper, we propose a coupled tensor norm regularization that could enable the model output feature and the data input to lie in a low-dimensional manifold, which helps us to reduce overfitting. We show this regularization term is convex, differentiable, and gradient Lipschitz continuous for logistic regression, while nonconvex and nonsmooth for deep neural networks. We further analyze the convergence of the first-order method for solving this model. The numerical experiments demonstrate that our method is efficient.

Submitted 22 February, 2023; originally announced February 2023.

Comments: Operations Research Letters

arXiv:2301.12675 [pdf, ps, other]

Monotone Splitting SQP Algorithms for Two-block Nonconvex Optimization Problems with General Linear Constraints and Applications

Authors: **bao Jian, Guodong Ma, Xiao Xu, Daolan Han

Abstract: In this work, based on the ideas of the alternating direction method of multipliers (ADMM) and sequential quadratic programming (SQP), as well as the Armijo line search technique, monotone splitting SQP algorithms for two-block nonconvex optimization problems with linear equality, inequality and box constraints are discussed. Firstly, the discussed problem is transformed into an optimization problem with only linear equality and box constraints by introducing slack variables. Secondly, we use the idea of ADMM to decompose the quadratic programming (QP) subproblem. In particular, the QP subproblem corresponding to the introduced slack variable is simple, and it has an explicit optimal solution without increasing computational cost. Thirdly, the search direction is generated by the optimal solutions of the subproblems, and the new iteration point is yielded by Armijo line search with the augmented Lagrangian function. The global convergence of the algorithm is analyzed under weaker assumptions. In addition, box constraints are extended to general nonempty closed convex sets, and the global convergence of the corresponding algorithm is also proved. Finally, some preliminary numerical experiments and applications in the mid-to-large-scale economic dispatch problems for power systems are reported, and these show that our proposed algorithm is promising.

Submitted 30 January, 2023; originally announced January 2023.

arXiv:2301.02984 [pdf, ps, other]

Understanding the convergence of the preconditioned PDHG method: a view of indefinite proximal ADMM

Authors: Yumin Ma, Xingju Cai, Bo Jiang, Deren Han

Abstract: The primal-dual hybrid gradient (PDHG) algorithm is popular in solving min-max problems which are being widely used in a variety of areas. To improve the applicability and efficiency of PDHG for different application scenarios, we focus on the preconditioned PDHG (PrePDHG) algorithm, which is a framework covering PDHG, alternating direction method of multipliers (ADMM), and other methods. We give the optimal convergence condition of PrePDHG in the sense that the key parameters in the condition cannot be further improved, which fills the theoretical gap in the state-of-the-art convergence results of PrePDHG, and obtain the ergodic and non-ergodic sublinear convergence rates of PrePDHG. The theoretical analysis is achieved by establishing the equivalence between PrePDHG and indefinite proximal ADMM. Besides, we discuss various choices of the proximal matrices in PrePDHG and derive some interesting results. For example, the convergence condition of diagonal PrePDHG is improved to be tight, the dual stepsize of the balanced augmented Lagrangian method can be enlarged to $4/3$ from $1$, and a balanced augmented Lagrangian method with symmetric Gauss-Seidel iterations is also explored. Numerical results on the matrix game, projection onto the Birkhoff polytope, earth mover's distance, and CT reconstruction verify the effectiveness and superiority of PrePDHG.

Submitted 8 January, 2023; originally announced January 2023.

Comments: accepted for publication in Journal of Scientific Computing

arXiv:2301.01242 [pdf, other]

Non-stationary Lattice Anderson Model with Non-local Laplacian and Correlated White Noise

Authors: Xiaoyun Chen, Dan Han, Stanislav Molchanov

Abstract: We study the non-stationary Anderson parabolic problem on the lattice $Z^d$, i.e., the equation \begin{equation}\label{andersonmodel} \begin{aligned} \frac{\partial u}{\partial t} &=\varkappa \mathcal{A}u(t,x)+ξ_{t}(x)u(t,x), \\ u(0,x) &\equiv 1, \quad (t,x) \in [0,\infty)\times Z^d. \end{aligned} \end{equation} Here $\mathcal{A}$ is the non-local Laplacian, $ξ_t (x), \ t \geq 0, \ x \in Z^d$ is the family of the correlated white noises and $\varkappa >0$ is the diffusion coefficient. The changes of $\varkappa$ (large versus small) are responsible for the qualitative phase transition in the model. At the first step the analysis of the model is reduced to the solution of the stochastic differential equation (SDE) (in the standard Itô form) on the weighted Hilbert space $l^2(Z^d,μ)$ with appropriate measure $μ$. The equations of the first two moments of the solution $u(t,x)$ are derived and studied using the spectral analysis of the corresponding Schrödinger operators with a special class of positive definite potentials. The analysis reveals several bifurcations depending on the properties of the kernel of $\mathcal{A}$ and the correlation function in the potential.

Submitted 9 January, 2023; v1 submitted 3 January, 2023; originally announced January 2023.

MSC Class: 60H25; 82C44; 60K37

arXiv:2301.00176 [pdf, ps, other]

Randomized Kaczmarz method with adaptive stepsizes for inconsistent linear systems

Authors: Yun Zeng, Deren Han, Yansheng Su, Jiaxin Xie

Abstract: We investigate the randomized Kaczmarz method that adaptively updates the stepsize using readily available information for solving inconsistent linear systems. A novel geometric interpretation is provided which shows that the proposed method can be viewed as an orthogonal projection method in some sense. We prove that this method converges linearly in expectation to the unique minimum Euclidean norm least-squares solution of the linear system, and provide a tight upper bound for the convergence of the proposed method. Numerical experiments are also given to illustrate the theoretical results.

Submitted 16 March, 2023; v1 submitted 31 December, 2022; originally announced January 2023.

Comments: to appear in Numerical Algorithms
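
Illustrative note (not part of the listing): for readers unfamiliar with the baseline that the entry above builds on, the following is a minimal sketch of the classical randomized Kaczmarz iteration with unit stepsize on a small consistent system. It is not the adaptive-stepsize method of the paper, whose update rule is not given in the abstract. The function name `randomized_kaczmarz`, the test matrix, and the iteration count are assumptions chosen for illustration.

```python
import random


def randomized_kaczmarz(A, b, iters=2000, seed=0):
    """Classical randomized Kaczmarz (unit stepsize) for a consistent system A x = b.

    Hypothetical helper for illustration; rows are sampled with probability
    proportional to their squared Euclidean norm.
    """
    rng = random.Random(seed)
    n = len(A[0])
    row_norms_sq = [sum(v * v for v in row) for row in A]
    x = [0.0] * n
    for _ in range(iters):
        # Pick a row index i with probability ||a_i||^2 / ||A||_F^2.
        i = rng.choices(range(len(A)), weights=row_norms_sq)[0]
        # Project the current iterate onto the hyperplane a_i^T x = b_i.
        residual = sum(a * xj for a, xj in zip(A[i], x)) - b[i]
        step = residual / row_norms_sq[i]
        x = [xj - step * a for xj, a in zip(x, A[i])]
    return x


# Small consistent example: b is built from a known solution.
A = [[2.0, 1.0], [1.0, 3.0], [1.0, -1.0]]
x_true = [1.0, 2.0]
b = [sum(a * v for a, v in zip(row, x_true)) for row in A]
x_hat = randomized_kaczmarz(A, b)
print(x_hat)
```

For an inconsistent system this unit-stepsize iteration generally does not converge to the least-squares solution, which is the setting the listed paper addresses.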

arXiv:2211.15755 [pdf, other]

Confidence-Aware Graph Neural Networks for Learning Reliability Assessment Commitments

Authors: Seonho Park, Wenbo Chen, Dahye Han, Mathieu Tanneau, Pascal Van Hentenryck

Abstract: Reliability Assessment Commitment (RAC) Optimization is increasingly important in grid operations due to larger shares of renewable generations in the generation mix and increased prediction errors. Independent System Operators (ISOs) also aim at using finer time granularities, longer time horizons, and possibly stochastic formulations for additional economic and reliability benefits. The goal of this paper is to address the computational challenges arising in extending the scope of RAC formulations. It presents RACLearn that (1) uses a Graph Neural Network (GNN) based architecture to predict generator commitments and active line constraints, (2) associates a confidence value to each commitment prediction, (3) selects a subset of the high-confidence predictions, which are (4) repaired for feasibility, and (5) seeds a state-of-the-art optimization algorithm with feasible predictions and active constraints. Experimental results on exact RAC formulations used by the Midcontinent Independent System Operator (MISO) and an actual transmission network (8965 transmission lines, 6708 buses, 1890 generators, and 6262 load units) show that the RACLearn framework can speed up RAC optimization by factors ranging from 2 to 4 with negligible loss in solution quality.

Submitted 10 June, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

Comments: Submitted to IEEE Transactions on Power Systems

arXiv:2211.08693 [pdf, ps, other]

Determination of compactly supported functions in shift-invariant space by single-angle Radon samples

Authors: Youfa Li, Shengli Fan, Deguang Han

Abstract: While traditionally the computerized tomography of a function $f\in L^{2}(\mathbb{R}^{2})$ depends on the samples of its Radon transform at multiple angles, the real-time imaging sometimes requires the reconstruction of $f$ by the samples of its Radon transform $\mathcal{R}_{\emph{\textbf{p}}}f$ at a single angle $θ$, where $\emph{\textbf{p}}=(\cosθ, \sinθ)$ is the direction vector. This naturally leads to the question of identifying those functions that can be determined by their Radon samples at a single angle $θ$. The shift-invariant space $V(\varphi, \mathbb{Z}^2)$ generated by $\varphi$ is a type of function space that has been widely considered in many fields including wavelet analysis and signal processing. In this paper we examine the single-angle reconstruction problem for compactly supported functions $f\in V(\varphi, \mathbb{Z}^2)$. The central issue for the problem is to identify the eligible $\emph{\textbf{p}}$ and sampling set $X_{\emph{\textbf{p}}}\subseteq \mathbb{R}$ such that $f$ can be determined by its single-angle Radon (w.r.t. $\emph{\textbf{p}}$) samples at $X_{\emph{\textbf{p}}}$. For the general generator $\varphi$, we address the eligible $\emph{\textbf{p}}$ for the two cases: (1) $\varphi$ being nonvanishing ($\int_{\mathbb{R}^{2}}\varphi(\emph{\textbf{x}})d\emph{\textbf{x}}\neq0$) and (2) being vanishing ($\int_{\mathbb{R}^2}\varphi(\emph{\textbf{x}})d\emph{\textbf{x}}=0$). We prove that eligible $X_{\emph{\textbf{p}}}$ exists for general $\varphi$. In particular, $X_{\emph{\textbf{p}}}$ can be explicitly constructed if $\varphi\in C^{1}(\mathbb{R}^{2})$. The single-angle problem corresponding to the case of $\varphi$ being positive definite is addressed such that $X_{\emph{\textbf{p}}}$ can be constructed easily.

Submitted 16 November, 2022; originally announced November 2022.

Journal ref: Journal of Functional Analysis, 2023

arXiv:2209.08586 [pdf, ps, other]

Convergence Rate of Sample Mean for $\varphi$-Mixing Random Variables with Heavy-Tailed Distributions

Authors: F. Q. Tang, D. Han

Abstract: This article studies the convergence rate of the sample mean for $\varphi$-mixing dependent random variables with finite means and infinite variances. Dividing the sample mean into the sum of the average of the main parts and the average of the tailed parts, we not only obtain the convergence rate of the sample mean but also prove that the convergence rate of the average of the main parts is faster than that of the average of the tailed parts.

Submitted 18 September, 2022; originally announced September 2022.

arXiv:2209.04772 [pdf, other]

A new method for estimating the tail index using truncated sample sequence

Authors: F. Q. Tang, D. Han

Abstract: This article proposes a new method of truncated estimation to estimate the tail index $α$ of the extremely heavy-tailed distribution with infinite mean or variance. We not only present two truncated estimators $\hatα$ and $\hatα^{\prime}$ for estimating $α$ ($0<α\leq 1$) and $α$ ($1<α\leq 2$) respectively, but also prove their asymptotic statistical properties. The numerical simulation results comparing the six known estimators in estimating error, the Type I Error and the power of estimator show that the performance of the two new truncated estimators is quite good on the whole.

Submitted 10 September, 2022; originally announced September 2022.

arXiv:2209.02853 [pdf, other]

Second order, unconditionally stable, linear ensemble algorithms for the magnetohydrodynamics equations

Authors: John Carter, Daozhi Han, Nan Jiang

Abstract: We propose two unconditionally stable, linear ensemble algorithms with pre-computable shared coefficient matrices across different realizations for the magnetohydrodynamics equations. The viscous terms are treated by a standard perturbative discretization. The nonlinear terms are discretized fully explicitly within the framework of the generalized positive auxiliary variable approach (GPAV). Artificial viscosity stabilization that modifies the kinetic energy is introduced to improve accuracy of the GPAV ensemble methods. Numerical results are presented to demonstrate the accuracy and robustness of the ensemble algorithms.

Submitted 6 September, 2022; originally announced September 2022.

Comments: 24 pages, 30 figures

arXiv:2208.10691 [pdf, other]

doi 10.4208/nmtma.OA-2022-0148

A fixed-time inverse-free dynamical system for solving the system of absolute value equations

Authors: Xuehua Li, Dongmei Yu, Yinong Yang, Deren Han, Cairong Chen

Abstract: In this paper, an inverse-free dynamical system with fixed-time convergence is presented to solve the system of absolute value equations (AVEs). Under a mild condition, it is proved that the solution of the proposed dynamical system converges to the solution of the AVEs. Moreover, in contrast to the existing inverse-free dynamical system \cite{chen2021}, a conservative settling-time of the proposed method is given. Numerical simulations illustrate the effectiveness of the new method.

Submitted 22 August, 2022; originally announced August 2022.

Comments: 3 figures. arXiv admin note: text overlap with arXiv:2208.05308

Journal ref: Numer. Math. Theor. Meth. Appl.-2023

arXiv:2208.05437 [pdf, ps, other]

On pseudoinverse-free randomized methods for linear systems: Unified framework and acceleration

Authors: Deren Han, Jiaxin Xie

Abstract: We present a new framework for the analysis and design of randomized algorithms for solving various types of linear systems, including consistent or inconsistent, full rank or rank-deficient. Our method is formulated with four randomized sampling parameters, which allows the method to cover many existing randomization algorithms within a unified framework, including the doubly stochastic Gauss-Seidel, randomized Kaczmarz method, randomized coordinate descent method, and Gaussian Kaczmarz method. Compared with the projection-based block algorithms where a pseudoinverse for solving a least-squares problem is utilized at each iteration, our design is pseudoinverse-free. Furthermore, the flexibility of the new approach also enables the design of a number of new methods as special cases. Polyak's heavy ball momentum technique is also introduced in our framework for improving the convergence behavior of the method. We prove the global linear convergence rates of our method as well as an accelerated linear rate for the case of the norm of expected iterates. Finally, numerical experiments are provided to confirm our results.

Submitted 24 August, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

Comments: arXiv admin note: text overlap with arXiv:2207.04291; text overlap with arXiv:1909.12176 by other authors

arXiv:2208.05308 [pdf, other]

A dynamical system based on projection operator for solving absolute value equations associated with second-order cone

Authors: Cairong Chen, Dongmei Yu, Deren Han, Changfeng Ma

Abstract: A new equivalent reformulation of the absolute value equations associated with second-order cone (SOCAVEs) is emphasised, from which a dynamical system based on projection operator for solving SOCAVEs is constructed. Under proper assumptions, the equilibrium points of the dynamical system exist and could be (globally) asymptotically stable. Some numerical simulations are given to show the effectiveness of the proposed method.

Submitted 10 August, 2022; originally announced August 2022.

Comments: 5 figures

arXiv:2207.04492 [pdf, ps, other]

doi 10.1007/s40314-023-02318-6

On finite termination of the generalized Newton method for solving absolute value equations

Authors: Jia Tang, Wenli Zheng, Cairong Chen, Dongmei Yu, Deren Han

Abstract: Motivated by the framework constructed by Brugnano and Casulli $[$SIAM J. Sci. Comput. 30: 463--472, 2008$]$, we analyze the finite termination property of the generalized Newton method (GNM) for solving the absolute value equation (AVE). More precisely, for some special matrices, GNM is terminated in at most $2n + 2$ iterations. A new result for the unique solvability and unsolvability of the AVE is obtained. Numerical experiments are given to demonstrate the theoretical analysis.

Submitted 10 July, 2022; originally announced July 2022.

Comments: 11 pages

Journal ref: Computational and Applied Mathematics-2023
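
Illustrative note (not part of the listing): the generalized Newton method mentioned in the entry above is commonly stated, for the AVE $Ax - |x| = b$, as solving $(A - D(x^k))\,x^{k+1} = b$ with $D(x^k) = \mathrm{diag}(\mathrm{sign}(x^k))$. The sketch below follows that standard form on a 2-by-2 example. The helper names, the test matrix, the zero starting point, and the stopping rule are assumptions for illustration, and nothing here reproduces the paper's termination analysis.

```python
def solve_linear(M, rhs):
    """Solve M z = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [list(row) + [r] for row, r in zip(M, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    z = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(aug[r][c] * z[c] for c in range(r + 1, n))
        z[r] = (aug[r][n] - s) / aug[r][r]
    return z


def sign(v):
    """Return -1, 0, or 1 according to the sign of v."""
    return (v > 0) - (v < 0)


def generalized_newton_ave(A, b, max_iter=50, tol=1e-12):
    """Generalized Newton iteration for A x - |x| = b (illustrative sketch)."""
    n = len(b)
    x = [0.0] * n
    for k in range(max_iter):
        # A - diag(sign(x)) is a generalized Jacobian of x -> A x - |x|.
        M = [[A[i][j] - (sign(x[i]) if i == j else 0) for j in range(n)]
             for i in range(n)]
        x_new = solve_linear(M, b)
        if max(abs(u - v) for u, v in zip(x_new, x)) < tol:
            return x_new, k + 1
        x = x_new
    return x, max_iter


# Example built from a known solution x = (1, -2), so b = A x - |x|.
A_ave = [[4.0, 1.0], [1.0, 5.0]]
b_ave = [1.0, -11.0]
x_ave, n_iter = generalized_newton_ave(A_ave, b_ave)
residual = [sum(a * v for a, v in zip(row, x_ave)) - abs(xi) - bi
            for row, xi, bi in zip(A_ave, x_ave, b_ave)]
print(x_ave, n_iter, residual)
```

The example matrix was chosen so that every linear system in the iteration is nonsingular; the sketch does not guard against a singular $A - D(x^k)$.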

arXiv:2207.04291 [pdf, ps, other]

Randomized Douglas-Rachford methods for linear systems: Improved accuracy and efficiency

Authors: Deren Han, Yansheng Su, Jiaxin Xie

Abstract: The Douglas-Rachford (DR) method is a widely used method for finding a point in the intersection of two closed convex sets (feasibility problem). However, the method converges weakly and the associated rate of convergence is hard to analyze in general. In addition, the direct extension of the DR method for solving more-than-two-sets feasibility problems, called the $r$-sets-DR method, is not necessarily convergent. To improve the efficiency of the optimization algorithms, the introduction of randomization and the momentum technique has attracted increasing attention. In this paper, we propose the randomized $r$-sets-DR (RrDR) method for solving the feasibility problem derived from linear systems, showing the benefit of the randomization as it brings linear convergence in expectation to the otherwise divergent $r$-sets-DR method. Furthermore, the convergence rate does not depend on the dimension of the coefficient matrix. We also study RrDR with heavy ball momentum and establish its accelerated rate. Numerical experiments are provided to confirm our results and demonstrate the notable improvements in accuracy and efficiency of the DR method, brought by the randomization and the momentum technique.

Submitted 9 January, 2024; v1 submitted 9 July, 2022; originally announced July 2022.

Comments: to appear in SIAM Journal on Optimization

arXiv:2204.00950 [pdf, other]

Risk-Aware Control and Optimization for High-Renewable Power Grids

Authors: Neil Barry, Minas Chatzos, Wenbo Chen, Dahye Han, Chaofan Huang, Roshan Joseph, Michael Klamkin, Seonho Park, Mathieu Tanneau, Pascal Van Hentenryck, Shangkun Wang, Hanyu Zhang, Haoruo Zhao

Abstract: The transition of the electrical power grid from fossil fuels to renewable sources of energy raises fundamental challenges to the market-clearing algorithms that drive its operations. Indeed, the increased stochasticity in load and the volatility of renewable energy sources have led to significant increases in prediction errors, affecting the reliability and efficiency of existing deterministic optimization models. The RAMC project was initiated to investigate how to move from this deterministic setting into a risk-aware framework where uncertainty is quantified explicitly and incorporated in the market-clearing optimizations. Risk-aware market-clearing raises challenges on its own, primarily from a computational standpoint. This paper reviews how RAMC approaches risk-aware market clearing and presents some of its innovations in uncertainty quantification, optimization, and machine learning. Experimental results on real networks are presented.

Submitted 2 April, 2022; originally announced April 2022.

arXiv:2112.04353 [pdf, ps, other]

A decoupled numerical method for two-phase flows of different densities and viscosities in superposed fluid and porous layers

Authors: Yali Gao, Daozhi Han, Xiaoming He, Ulrich Rüde

Abstract: In this article we consider the numerical modeling and simulation via the phase field approach of two-phase flows of different densities and viscosities in superposed fluid and porous layers. The model consists of the Cahn-Hilliard-Navier-Stokes equations in the free flow region and the Cahn-Hilliard-Darcy equations in porous media that are coupled by seven domain interface boundary conditions. We show that the coupled model satisfies an energy law. Based on the ideas of pressure stabilization and artificial compressibility, we propose an unconditionally stable time stepping method that decouples the computation of the phase field variable, the velocity and pressure of free flow, the velocity and pressure of porous media, hence significantly reduces the computational cost. The energy stability of the scheme effected with the finite element spatial discretization is rigorously established. We verify numerically that our schemes are convergent and energy-law preserving. Ample numerical experiments are performed to illustrate the features of two-phase flows in the coupled free flow and porous media setting.

Submitted 8 December, 2021; originally announced December 2021.

arXiv:2111.13808 [pdf, ps, other]

A non-monotone smoothing Newton algorithm for solving the system of generalized absolute value equations

Authors: Cairong Chen, Dongmei Yu, Deren Han, Changfeng Ma

Abstract: The system of generalized absolute value equations (GAVE) has attracted more and more attention in the optimization community. In this paper, by introducing a smoothing function, we develop a smoothing Newton algorithm with non-monotone line search to solve the GAVE. We show that the non-monotone algorithm is globally and locally quadratically convergent under a weaker assumption than those given in most existing algorithms for solving the GAVE. Numerical results are given to demonstrate the viability and efficiency of the approach.

Submitted 26 November, 2021; originally announced November 2021.

Comments: 22 pages, 2 figures

arXiv:2110.07951 [pdf, other]

doi 10.1016/j.cnsns.2022.106531

Dynamical transition of hydromagnetic convection in a rotating fluid layer

Authors: Liang Li, Yanlong Fan, Daozhi Han, Quan Wang

Abstract: In this article, we aim to study the stability and dynamic transition of an electrically conducting fluid in the presence of an external uniform horizontal magnetic field and rotation based on a Boussinesq approximation model. By analyzing the spectrum of the linear part of the model and verifying the validity of the principle of exchange of stability, we take a hybrid approach combining theoretical analysis with numerical computation to study the transition from a simple real eigenvalue, a pair of complex conjugate eigenvalues and a real eigenvalue of multiplicity two, respectively. The center manifold reduction theory is applied to reduce the infinite dimensional system to the corresponding finite dimensional one together with one or several non-dimensional transition numbers that determine the dynamic transition types. Careful numerical computations are performed to determine these transition numbers as well as the related temporal and flow patterns. Our results indicate that both continuous and jump transitions can occur in certain parameter regions.

Submitted 12 November, 2021; v1 submitted 15 October, 2021; originally announced October 2021.

arXiv:2109.09042 [pdf, ps, other]

Dilations for operator-valued quantum measures

Authors: Deguang Han, Qianfeng Hu, David R. Larson, Rui Liu

Abstract: This paper concerns the dilations of Banach space operator-valued quantum measures. While the recently developed general dilation theory can lead to a projection (idempotent) valued dilation for any quantum measure over the projection lattice for a von Neumann algebra that does not contain a type $I_{2}$ direct summand, such a dilation does not necessarily guarantee the preservation of countable additivity of the quantum measure. So it remains an open question whether every countably additive $B(X)$-valued quantum measure can be dilated to a countably additive projection-valued measure. The main purpose of this paper is to prove that such a dilation can be constructed if one of the following two conditions is satisfied: (i) the underlying Banach space $X = \ell_{p}$ $(1\leq p < 2)$ or it has the Schur property, (ii) the quantum measure has bounded $p$-variation for some $1\leq p < \infty$. All of these were achieved by establishing a non-commutative version of a minimal dilation theory on the so-called elementary dilation space equipped with an appropriate dilation norm. In particular, the newly introduced $p$-variation norm on the elementary dilation space allows us to prove that every operator-valued quantum measure with bounded $p$-variation has a projection-valued quantum measure dilation that preserves the boundedness of the $p$-variation.

Submitted 18 September, 2021; originally announced September 2021.

Comments: 28 pages

arXiv:2106.03260 [pdf, other]

Error estimate of a decoupled numerical scheme for the Cahn-Hilliard-Stokes-Darcy system

Authors: Wenbin Chen, Daozhi Han, Cheng Wang, Shufen Wang, Xiaoming Wang, Yichao Zhang

Abstract: We analyze a fully discrete finite element numerical scheme for the Cahn-Hilliard-Stokes-Darcy system that models two-phase flows in coupled free flow and porous media. To avoid a well-known difficulty associated with the coupling between the Cahn-Hilliard equation and the fluid motion, we make use of the operator-splitting in the numerical scheme, so that these two solvers are decoupled, which in turn would greatly improve the computational efficiency. The unique solvability and the energy stability have been proved in~\cite{CHW2017}. In this work, we carry out a detailed convergence analysis and error estimate for the fully discrete finite element scheme, so that the optimal rate convergence order is established in the energy norm, i.e., in the $\ell^\infty (0, T; H^1) \cap \ell^2 (0, T; H^2)$ norm for the phase variables, as well as in the $\ell^\infty (0, T; H^1) \cap \ell^2 (0, T; H^2)$ norm for the velocity variable. Such an energy norm error estimate leads to a cancellation of a nonlinear error term associated with the convection part, which turns out to be a key step to pass through the analysis. In addition, a discrete $\ell^2 (0, T; H^3)$ bound of the numerical solution for the phase variables plays an important role in the error estimate, which is accomplished via a discrete version of the Gagliardo-Nirenberg inequality in the finite element setting.

Submitted 6 June, 2021; originally announced June 2021.

arXiv:2104.08435 [pdf, ps, other]

On $*$-clean group rings over finite fields

Authors: Dongchun Han, Hanbin Zhang

Abstract: A ring $R$ is called clean if every element of $R$ is the sum of a unit and an idempotent. Motivated by a question proposed by Lam on the cleanness of von Neumann Algebras, Vaš introduced a more natural concept of cleanness for $*$-rings, called the $*$-cleanness. More precisely, a $*$-ring $R$ is called a $*$-clean ring if every element of $R$ is the sum of a unit and a projection ($*$-invariant idempotent). Let $\mathbb F$ be a finite field and $G$ a finite abelian group. In this paper, we introduce two classes of involutions on group rings of the form $\mathbb FG$ and characterize the $*$-cleanness of these group rings in each case. When $*$ is taken as the classical involution, we also characterize the $*$-cleanness of $\mathbb F_qG$ in terms of LCD abelian codes and self-orthogonal abelian codes in $\mathbb F_qG$.

Submitted 16 April, 2021; originally announced April 2021.

Comments: 13 pages, accepted by Finite Fields and their Applications

arXiv:2103.16154 [pdf, ps, other]

Convergence on a symmetric accelerated stochastic ADMM with larger stepsizes

Authors: Jianchao Bai, Deren Han, Hao Sun, Hongchao Zhang

Abstract: In this paper, we develop a symmetric accelerated stochastic Alternating Direction Method of Multipliers (SAS-ADMM) for solving separable convex optimization problems with linear constraints. The objective function is the sum of a possibly nonsmooth convex function and an average function of many smooth convex functions. Our proposed algorithm combines both ideas of ADMM and the techniques of accelerated stochastic gradient methods possibly with variance reduction to solve the smooth subproblem. One main feature of SAS-ADMM is that its dual variable is symmetrically updated after each update of the separated primal variable, which would allow a more flexible and larger convergence region of the dual variable compared with that of standard deterministic or stochastic ADMM. This new stochastic optimization algorithm is shown to have ergodic convergence in expectation with an O(1/T) convergence rate, where T is the number of outer iterations. Our preliminary experiments indicate the proposed algorithm is very effective for solving separable optimization problems from big-data applications. Finally, 3-block extensions of the algorithm and its variant of an accelerated stochastic augmented Lagrangian method are discussed in the appendix.

Submitted 19 December, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

Comments: Accepted by CSIAM-AM

arXiv:2103.10129 [pdf, ps, other]

An inexact framework of the Newton-based matrix splitting iterative method for the generalized absolute value equation

Authors: Dongmei Yu, Cairong Chen, Deren Han

Abstract: An inexact framework of the Newton-based matrix splitting (INMS) iterative method is developed to solve the generalized absolute value equation, whose exact version was proposed by Zhou, Wu and Li [H.-Y. Zhou, S.-L. Wu and C.-X. Li, \textit{J. Comput. Appl. Math.}, 394 (2021), 113578]. Global linear convergence of the INMS iterative method is investigated in detail. Some numerical results are given to show the superiority of the INMS iterative method.

Submitted 14 February, 2022; v1 submitted 18 March, 2021; originally announced March 2021.

Comments: 15 pages, 4 tables

arXiv:2103.09398 [pdf, ps, other]

doi 10.1093/imanum/drab105

An inexact Douglas-Rachford splitting method for solving absolute value equations

Authors: Cairong Chen, Dongmei Yu, Deren Han

Abstract: The last two decades have witnessed increasing interest in the absolute value equations (AVE) of finding $x\in\mathbb{R}^n$ such that $Ax-|x|-b=0$, where $A\in \mathbb{R}^{n\times n}$ and $b\in \mathbb{R}^n$. In this paper, we focus on designing efficient algorithms. To this end, we reformulate AVE as a generalized linear complementarity problem (GLCP), which, among the equivalent forms, is the most economical one in the sense that it does not increase the dimension of the variables. For solving the GLCP, we propose an inexact Douglas-Rachford splitting method which can adopt a relative error tolerance. As a consequence, in the inner iteration processes, we can employ the LSQR method ([C.C. Paige and M.A. Saunders, ACM Trans. Mathe. Softw. (TOMS), 8 (1982), pp. 43--71]) to find a qualified approximate solution for each subproblem, which makes the cost per iteration very low. We prove the convergence of the algorithm and establish its global linear rate of convergence. Comparison results with popular algorithms such as the exact generalized Newton method [O.L. Mangasarian, Optim. Lett., 1 (2007), pp. 3--8], the inexact semi-smooth Newton method [J.Y.B. Cruz, O.P. Ferreira and L.F. Prudente, Comput. Optim. Appl., 65 (2016), pp. 93--108] and the exact SOR-like method [Y.-F. Ke and C.-F. Ma, Appl. Math. Comput., 311 (2017), pp. 195--202] are reported, which indicate that the proposed algorithm is very promising. Moreover, our method also extends the range of numerically solvable AVEs; that is, it can deal with not only the case where $\|A^{-1}\|<1$, the condition commonly used in the existing literature, but also the case where $\|A^{-1}\|=1$.

Submitted 16 March, 2021; originally announced March 2021.

Comments: 25 pages, 3 figures, 3 tables

Journal ref: IMA Journal of Numerical Analysis-2022

arXiv:2103.08660 [pdf, ps, other]

FROG-measurement based phase retrieval for analytic signals

Authors: Youfa Li, Yaoshuai Ma, Deguang Han

Abstract: While frequency-resolved optical gating (FROG) is widely used in characterizing the ultrafast pulse in optics, analytic signals are often considered in time-frequency analysis and signal processing, especially when extracting instantaneous features of events. In this paper we examine the phase retrieval (PR) problem of analytic signals in $\Bbb{C}^N$ by their FROG measurements. After establishing the ambiguity of the FROG-PR of analytic signals, we found that the FROG-PR of analytic signals of even lengths is different from that of analytic signals of odd lengths, and it is also different from the case of $B$-bandlimited signals with $B \leq N/2$. The existing approach to bandlimited signals can be applied to analytic signals of odd lengths, but it does not apply to the even length case. With the help of two relaxed FROG-PR problems and a translation technique, we develop an approach to FROG-PR for the analytic signals of even lengths, and prove that in this case the generic analytic signals can be uniquely (up to the ambiguity) determined by their $(3N/2+1)$ FROG measurements.

Submitted 7 March, 2021; originally announced March 2021.

arXiv:2011.14171 [pdf, ps, other]

One Explicitly Solvable Model For The Galton-Watson Processes In the Random Environment

Authors: Dan Han, Stanislav Molchanov, Yanjmaa Jutmaan

Abstract: In this paper, we study the Galton-Watson process in the random environment for the particular case when the number of offspring in each generation has a fractional linear generating function with random parameters. In this case, the distribution of $N_t$, the number of particles at time $t=0,1,2,\cdots$, can be calculated explicitly. We present the classification of such processes and limit theorems of two types: the quenched type, which is for a fixed realization of the random environment, and the annealed type, which includes the averaging over the environment.

Submitted 28 November, 2020; originally announced November 2020.

MSC Class: 60J80; 60G51

arXiv:2011.06183 [pdf, ps, other]

Gabor single-frame and multi-frame multipliers in any given dimension

Authors: Yuanan Diao, Deguang Han, Zhongyan Li

Abstract: Functional Gabor single-frame or multi-frame generator multipliers are the matrices of function entries that preserve Parseval Gabor single-frame or multi-frame generators. An interesting and natural question is how to characterize all such multipliers. This question has been answered for several special cases including the case of single-frame generators in two dimensions and the case of multi-frame generators in one-dimension. In this paper we completely characterize multipliers for Gabor single-frame and multi-frame generators with respect to separable time-frequency lattices in any given dimension. Our approach is general and applies to the previously known cases as well.

Submitted 11 November, 2020; originally announced November 2020.

Comments: 21 pages, no figures

MSC Class: 42C15; 46C05; 47B10

Showing 1–50 of 98 results for author: Han, D