-
Distributionally Robust Performative Optimization
Authors:
Zhuangzhuang Jia,
Yijie Wang,
Roy Dong,
Grani A. Hanasusanto
Abstract:
In this paper, we propose a general distributionally robust framework for performative optimization, where the selected decision can influence the probabilistic distribution of uncertain parameters. Our framework facilitates safe decision-making in scenarios with incomplete information about the underlying decision-dependent distributions, relying instead on accessible reference distributions. To tackle the challenge of decision-dependent uncertainty, we introduce an algorithm named repeated robust risk minimization. This algorithm decouples the decision variables associated with the ambiguity set from the expected loss, optimizing the latter at each iteration while keeping the former fixed to the previous decision. By leveraging the strong connection between distributionally robust optimization and regularization, we establish a linear convergence rate to a performatively stable point and provide a suboptimality performance guarantee for the proposed algorithm. Finally, we examine the performance of our proposed model through an experimental study in strategic classification.
Submitted 1 July, 2024;
originally announced July 2024.
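The decoupling described in the abstract above can be illustrated on a toy problem. The sketch below is not the authors' algorithm or model: the loss `0.5*theta**2 - theta*z`, the mean-interval ambiguity set, the linear distribution map `mu0 + eps*theta`, and all function names are assumptions chosen so that each robust step has a closed form. Under these assumptions the worst-case expected loss equals the nominal loss plus an `radius*|theta|` penalty, which gives a small instance of the robustness-regularization link the abstract mentions.

```python
def soft_threshold(m, r):
    """Minimizer over theta of 0.5*theta**2 - theta*m + r*abs(theta)."""
    if m > r:
        return m - r
    if m < -r:
        return m + r
    return 0.0


def repeated_robust_risk_minimization(theta0, mu0, eps, radius, iters=60):
    """Iterate theta_{t+1} = argmin_theta sup_{Q in U(theta_t)} E_Q[loss(theta, z)].

    Assumed toy model (hypothetical, not taken from the paper):
      loss(theta, z) = 0.5*theta**2 - theta*z
      U(theta_t)     = all distributions whose mean lies within `radius`
                       of the reference mean mu0 + eps*theta_t
    The worst-case expected loss is 0.5*theta**2 - theta*m_t + radius*|theta|,
    so each robust step is a soft-thresholding of the reference mean m_t.
    """
    theta = theta0
    history = [theta]
    for _ in range(iters):
        # The ambiguity set is frozen at the previous decision.
        reference_mean = mu0 + eps * theta
        # Only the loss argument is optimized in this step.
        theta = soft_threshold(reference_mean, radius)
        history.append(theta)
    return history


history = repeated_robust_risk_minimization(theta0=0.0, mu0=1.0, eps=0.5, radius=0.2)
print(history[:4], history[-1])
```

In this toy setting the iterates approach the fixed point `(mu0 - radius) / (1 - eps)`, and the error shrinks by the factor `eps` per iteration, which is the kind of linear rate to a stable point that the abstract refers to.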
-
City-LEO: Toward Transparent City Management Using LLM with End-to-End Optimization
Authors:
Zihao Jiao,
Mengyi Sha,
Haoyu Zhang,
Xinyu Jiang,
Wei Qi
Abstract:
Existing operations research (OR) models and tools play indispensable roles in smart-city operations, yet their practical implementation is limited by the complexity of modeling and deficiencies in optimization proficiency. To generate more relevant and accurate solutions to users' requirements, we propose a large language model (LLM)-based agent ("City-LEO") that enhances the efficiency and transparency of city management through conversational interactions. Specifically, to accommodate diverse users' requirements and enhance computational tractability, City-LEO leverages the LLM's logical reasoning capabilities on prior knowledge to scope down large-scale optimization problems efficiently. In the human-like decision process, City-LEO also incorporates an end-to-end (E2E) model to synergize prediction and optimization. The E2E framework is conducive to coping with environmental uncertainties and involving more query-relevant features, and thus facilitates a transparent and interpretable decision-making process. In a case study, we employ City-LEO in the operations management of an e-bike sharing (EBS) system. The numerical results demonstrate that City-LEO has superior performance when benchmarked against the full-scale optimization problem. With less computational time, City-LEO generates more satisfactory and relevant solutions to the users' requirements, and achieves lower global suboptimality without significantly compromising accuracy. In a broader sense, our proposed agent offers promise for developing LLM-embedded OR tools for smart-city operations management.
Submitted 17 June, 2024; v1 submitted 16 June, 2024;
originally announced June 2024.
-
A New Cross-Space Total Variation Regularization Model for Color Image Restoration with Quaternion Blur Operator
Authors:
Zhigang Jia,
Yuelian Xiang,
Meixiang Zhao,
Tingting Wu,
Michael K. Ng
Abstract:
The cross-channel deblurring problem in color image processing is difficult to solve due to the complex coupling and structural blurring of color pixels. Until now, there have been few efficient algorithms that can reduce color infection in the deblurring process. To solve this challenging problem, we present a novel cross-space total variation (CSTV) regularization model for color image deblurring by introducing a quaternion blur operator and a cross-color space regularization functional. The existence and uniqueness of the solution are proved, and a new L-curve method is proposed to find a good balance of regularization functionals on different color spaces.
The Euler-Lagrange equation is derived to show that CSTV takes into account the coupling of all color channels and the local smoothing within each color channel. A quaternion operator splitting method is proposed for the first time to enhance the color infection reduction ability of the CSTV regularization model. This strategy also applies to well-known color deblurring models. Numerical experiments on color image databases illustrate the efficiency and manoeuvrability of the new model and algorithms. The color images restored by them successfully maintain the color and spatial information and are of higher quality in terms of PSNR, SSIM, MSE and CIEde2000 than the restorations of state-of-the-art methods.
Submitted 20 May, 2024;
originally announced May 2024.
-
Generalized cluster states from Hopf algebras: non-invertible symmetry and Hopf tensor network representation
Authors:
Zhian Jia
Abstract:
Cluster states are crucial resources for measurement-based quantum computation (MBQC). They exhibit symmetry-protected topological (SPT) order and thus also play a crucial role in studying topological phases. We present the construction of cluster states based on Hopf algebras. By generalizing the finite group valued qudit to a Hopf algebra valued qudit and introducing the generalized Pauli-X operator based on the regular action of the Hopf algebra, as well as the generalized Pauli-Z operator based on the irreducible representation action on the Hopf algebra, we develop a comprehensive theory of Hopf qudits. We demonstrate that non-invertible symmetry naturally emerges for Hopf qudits. Subsequently, for a bipartite graph termed the cluster graph, we assign the identity state and trivial representation state to even and odd vertices, respectively. Introducing the edge entangler as controlled regular action, we provide a general construction of Hopf cluster states. To ensure the commutativity of the edge entangler, we propose a method to construct a cluster lattice for any triangulable manifold. We use the 1d cluster state as an example to illustrate our construction. As this serves as a promising candidate for SPT phases, we construct the gapped Hamiltonian for this scenario and delve into a detailed discussion of its non-invertible symmetries. We also show that the 1d cluster state model is equivalent to the quasi-1d Hopf quantum double model. We also introduce the Hopf tensor network representation of Hopf cluster states by integrating the tensor representation of structure constants with the string diagrams of the Hopf algebra.
Submitted 29 May, 2024; v1 submitted 15 May, 2024;
originally announced May 2024.
-
Local Theory of Yang-Mills-Higgs-Schrödinger Flow
Authors:
Zonglin Jia
Abstract:
In this article, we study two Hamiltonian type flows: Yang-Mills-Higgs-Schrödinger flow and $A$-Schrödinger flow. For the first one, we only obtain local existence. However, the uniqueness follows from classical tricks for the second one.
Submitted 8 May, 2024;
originally announced May 2024.
-
Preconditioning correction equations in Jacobi--Davidson type methods for computing partial singular value decompositions of large matrices
Authors:
Jinzhi Huang,
Zhongxiao Jia
Abstract:
In a Jacobi--Davidson (JD) type method for singular value decomposition (SVD) problems, called JDSVD, a large symmetric and generally indefinite correction equation is approximately solved iteratively at each outer iteration, which constitutes the inner iterations and dominates the overall efficiency of JDSVD. In this paper, a convergence analysis is made on the minimal residual (MINRES) method for the correction equation. Motivated by the results obtained, a preconditioned correction equation is derived that extracts useful information from current searching subspaces to construct effective preconditioners for the correction equation and is proved to retain the same convergence of outer iterations of JDSVD. The resulting method is called inner preconditioned JDSVD (IPJDSVD) method. Convergence results show that MINRES for the preconditioned correction equation can converge much faster when there is a cluster of singular values closest to a given target, so that IPJDSVD is more efficient than JDSVD. A new thick-restart IPJDSVD algorithm with deflation and purgation is proposed that simultaneously accelerates the outer and inner convergence of the standard thick-restart JDSVD and computes several singular triplets of a large matrix. Numerical experiments justify the theory and illustrate the considerable superiority of IPJDSVD to JDSVD.
Submitted 18 April, 2024;
originally announced April 2024.
-
Weak Hopf symmetry and tube algebra of the generalized multifusion string-net model
Authors:
Zhian Jia,
Sheng Tan,
Dagomir Kaszlikowski
Abstract:
We investigate the multifusion generalization of string-net ground states and lattice Hamiltonians, delving into its associated weak Hopf symmetry. For the multifusion string-net, the gauge symmetry manifests as a general weak Hopf algebra, leading to a reducible vacuum string label; the charge symmetry, serving as a quantum double of gauge symmetry, constitutes a connected weak Hopf algebra. This implies that the associated topological phase retains its characterization by a unitary modular tensor category (UMTC). The bulk charge symmetry can also be captured by a weak Hopf tube algebra. We offer an explicit construction of the weak Hopf tube algebra structure and thoroughly discuss its properties. The gapped boundary and domain wall models are extensively discussed, with these $1d$ phases characterized by unitary multifusion categories (UMFCs). We delve into the gauge and charge symmetries of these $1d$ phases, as well as the construction of the boundary and domain wall tube algebras. Additionally, we illustrate that the domain wall tube algebra can be regarded as a cross product of two boundary tube algebras. As an application of our model, we elucidate how to interpret the defective string-net as a restricted multifusion string-net.
Submitted 14 May, 2024; v1 submitted 7 March, 2024;
originally announced March 2024.
-
Neural optimal controller for stochastic systems via pathwise HJB operator
Authors:
Zhe Jiao,
Xiaoyan Luo,
Xinlei Yi
Abstract:
The aim of this work is to develop deep learning-based algorithms for high-dimensional stochastic control problems based on physics-informed learning and dynamic programming. Unlike classical deep learning-based methods relying on a probabilistic representation of the solution to the Hamilton--Jacobi--Bellman (HJB) equation, we introduce a pathwise operator associated with the HJB equation so that we can define a problem of physics-informed learning. According to whether the optimal control has an explicit representation, two numerical methods are proposed to solve the physics-informed learning problem. We provide an error analysis on how the truncation, approximation and optimization errors affect the accuracy of these methods. Numerical results on various applications are presented to illustrate the performance of the proposed algorithms.
Submitted 23 February, 2024;
originally announced February 2024.
-
Energy-Efficient Data Offloading for Earth Observation Satellite Networks
Authors:
Lijun He,
Ziye Jia,
Juncheng Wang,
Feng Wang,
Erick Lansard,
Chau Yuen
Abstract:
In Earth Observation Satellite Networks (EOSNs) with a large number of battery-carrying satellites, proper power allocation and task scheduling are crucial to improving the data offloading efficiency. As such, we jointly optimize power allocation and task scheduling to achieve energy-efficient data offloading in EOSNs, aiming to balance the objectives of reducing the total energy consumption and increasing the sum weights of tasks. First, we derive the optimal power allocation solution to the joint optimization problem when the task scheduling policy is given. Second, leveraging the conflict graph model, we transform the original joint optimization problem into a maximum weight independent set problem when the power allocation strategy is given. Finally, we utilize the genetic framework to combine the above special solutions as a two-layer solution for the joint optimization problem. Simulation results demonstrate that our proposed solution can properly balance the sum weights of tasks and the total energy consumption, achieving superior system performance over the current best alternatives.
Submitted 12 January, 2024;
originally announced January 2024.
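The conflict-graph step in the abstract above reduces scheduling, for a fixed power allocation, to a maximum weight independent set (MWIS) problem. The sketch below only illustrates that problem on a tiny hypothetical instance by brute force; the abstract states that the authors use a genetic framework, so the function name, the example weights, and the enumeration strategy are all assumptions made here for illustration.

```python
from itertools import combinations


def max_weight_independent_set(weights, conflicts):
    """Brute-force MWIS: the heaviest set of tasks with no two in conflict.

    weights   -- list of positive task weights, one per vertex
    conflicts -- iterable of (i, j) pairs of tasks that cannot both be scheduled
    Exponential in the number of tasks; intended only for tiny examples.
    """
    conflict_set = {frozenset(edge) for edge in conflicts}
    n = len(weights)
    best_set, best_weight = (), 0
    for size in range(1, n + 1):
        for subset in combinations(range(n), size):
            # Skip any subset that contains a conflicting pair of tasks.
            if any(frozenset(pair) in conflict_set for pair in combinations(subset, 2)):
                continue
            weight = sum(weights[i] for i in subset)
            if weight > best_weight:
                best_set, best_weight = subset, weight
    return best_set, best_weight


# Hypothetical instance: four tasks on a path-shaped conflict graph 0-1-2-3.
tasks, total = max_weight_independent_set([3, 2, 4, 1], [(0, 1), (1, 2), (2, 3)])
print(tasks, total)
```

Each vertex stands for one candidate task and each edge for a pair of tasks that cannot be served together, so an independent set is a conflict-free schedule and its weight is the sum weight of the scheduled tasks.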
-
Learning Fair Policies for Multi-stage Selection Problems from Observational Data
Authors:
Zhuangzhuang Jia,
Grani A. Hanasusanto,
Phebe Vayanos,
Weijun Xie
Abstract:
We consider the problem of learning fair policies for multi-stage selection problems from observational data. This problem arises in several high-stakes domains such as company hiring, loan approval, or bail decisions where outcomes (e.g., career success, loan repayment, recidivism) are only observed for those selected. We propose a multi-stage framework that can be augmented with various fairness constraints, such as demographic parity or equal opportunity. This problem is a highly intractable infinite chance-constrained program involving the unknown joint distribution of covariates and outcomes. Motivated by the potential impact of selection decisions on people's lives and livelihoods, we propose to focus on interpretable linear selection rules. Leveraging tools from causal inference and sample average approximation, we obtain an asymptotically consistent solution to this selection problem by solving a mixed binary conic optimization problem, which can be solved using standard off-the-shelf solvers. We conduct extensive computational experiments on a variety of datasets adapted from the UCI repository on which we show that our proposed approaches can achieve an 11.6% improvement in precision and a 38% reduction in the measure of unfairness compared to the existing selection policy.
Submitted 20 December, 2023;
originally announced December 2023.
-
A CJ-FEAST GSVDsolver for computing a partial GSVD of a large matrix pair with the generalized singular values in a given interval
Authors:
Zhongxiao Jia,
Kailiang Zhang
Abstract:
We propose a CJ-FEAST GSVDsolver to compute a partial generalized singular value decomposition (GSVD) of a large matrix pair $(A,B)$ with the generalized singular values in a given interval. The solver is a highly nontrivial extension of the FEAST eigensolver for the (generalized) eigenvalue problem and the CJ-FEAST SVDsolver for the SVD problem. For a partial GSVD problem, given three left and right searching subspaces, we propose a general projection method that works on $(A,B)$ {\em directly}, and computes approximations to the desired GSVD components. For the GSVD problem concerned, we exploit the Chebyshev--Jackson (CJ) series to construct an approximate spectral projector of the generalized eigenvalue problem of the matrix pair $(A^TA,B^TB)$ associated with the generalized singular values of interest, and use subspace iteration on it to generate a right subspace. Premultiplying it with $A$ and $B$ constructs two left subspaces. Applying the general projection method to the subspaces constructed leads to the CJ-FEAST GSVDsolver. We derive accuracy estimates for the approximate spectral projector and its eigenvalues, and establish a number of convergence results on the underlying subspaces and the approximate GSVD components obtained by the CJ-FEAST GSVDsolver. We propose general-purpose choice strategies for the series degree and subspace dimension. Numerical experiments illustrate the efficiency of the CJ-FEAST GSVDsolver.
Submitted 16 October, 2023;
originally announced October 2023.
-
When is Agnostic Reinforcement Learning Statistically Tractable?
Authors:
Zeyu Jia,
Gene Li,
Alexander Rakhlin,
Ayush Sekhari,
Nathan Srebro
Abstract:
We study the problem of agnostic PAC reinforcement learning (RL): given a policy class $Π$, how many rounds of interaction with an unknown MDP (with a potentially large state and action space) are required to learn an $ε$-suboptimal policy with respect to $Π$? Towards that end, we introduce a new complexity measure, called the \emph{spanning capacity}, that depends solely on the set $Π$ and is independent of the MDP dynamics. With a generative model, we show that for any policy class $Π$, bounded spanning capacity characterizes PAC learnability. However, for online RL, the situation is more subtle. We show there exists a policy class $Π$ with a bounded spanning capacity that requires a superpolynomial number of samples to learn. This reveals a surprising separation for agnostic learnability between generative access and online access models (as well as between deterministic/stochastic MDPs under online access). On the positive side, we identify an additional \emph{sunflower} structure, which in conjunction with bounded spanning capacity enables statistically efficient online RL via a new algorithm called POPLER, which takes inspiration from classical importance sampling methods as well as techniques for reachable-state identification and policy evaluation in reward-free exploration.
Submitted 9 October, 2023;
originally announced October 2023.
-
Goldstein Stationarity in Lipschitz Constrained Optimization
Authors:
Benjamin Grimmer,
Zhichao Jia
Abstract:
We prove the first convergence guarantees for a subgradient method minimizing a generic Lipschitz function over generic Lipschitz inequality constraints. No smoothness or convexity (or weak convexity) assumptions are made. Instead, we utilize a sequence of recent advances in Lipschitz unconstrained minimization, which showed convergence rates of $O(1/δε^3)$ towards reaching a "Goldstein" stationary point, that is, a point where an average of gradients sampled at most distance $δ$ away has size at most $ε$. We generalize these prior techniques to handle functional constraints, proposing a subgradient-type method with similar $O(1/δε^3)$ guarantees on reaching a Goldstein Fritz-John or Goldstein KKT stationary point, depending on whether a certain Goldstein-style generalization of constraint qualification holds.
Submitted 12 October, 2023; v1 submitted 5 October, 2023;
originally announced October 2023.
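The notion of a Goldstein stationary point in the abstract above (an average of gradients sampled within distance $δ$ has size at most $ε$) can be checked numerically once sample points and weights are supplied. The sketch below is only a certificate checker under that reading of the definition; the function names, the test function `|y_1| + ... + |y_n|`, and the sample points are assumptions for illustration, and the code is not the subgradient method analyzed in the paper.

```python
import math


def goldstein_certificate(grad, x, samples, weights, delta, eps):
    """Return True if the weighted average of grad(y) over `samples`
    certifies that x is (delta, eps)-Goldstein stationary.

    Requires nonnegative weights summing to 1 and every sample within
    Euclidean distance delta of x; otherwise returns False.
    """
    if any(w < 0 for w in weights) or abs(sum(weights) - 1.0) > 1e-12:
        return False  # not a convex combination
    for y in samples:
        distance = math.sqrt(sum((yi - xi) ** 2 for yi, xi in zip(y, x)))
        if distance > delta:
            return False  # sample lies outside the delta-ball around x
    average = [0.0] * len(x)
    for w, y in zip(weights, samples):
        g = grad(y)
        for i in range(len(x)):
            average[i] += w * g[i]
    return math.sqrt(sum(a * a for a in average)) <= eps


def grad_abs(y):
    """Gradient of f(y) = |y_1| + ... + |y_n| at points with no zero entry."""
    return [1.0 if yi > 0 else -1.0 for yi in y]


# Near the kink at 0, gradients of opposite sign average to zero.
near_kink = goldstein_certificate(grad_abs, [0.05], [[0.1], [-0.02]], [0.5, 0.5],
                                  delta=0.1, eps=1e-9)
# Far from the kink, every nearby gradient equals +1.
far_away = goldstein_certificate(grad_abs, [1.0], [[1.05], [0.95]], [0.5, 0.5],
                                 delta=0.1, eps=0.1)
print(near_kink, far_away)
```

A `False` result only means that the supplied samples and weights do not form a certificate; it does not by itself prove that no certificate exists, although in the second call every gradient within the ball is `+1`, so none can.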
-
Refined and refined harmonic Jacobi--Davidson methods for computing several GSVD components of a large regular matrix pair
Authors:
Jinzhi Huang,
Zhongxiao Jia
Abstract:
Three refined and refined harmonic extraction-based Jacobi--Davidson (JD) type methods are proposed, and their thick-restart algorithms with deflation and purgation are developed to compute several generalized singular value decomposition (GSVD) components of a large regular matrix pair. The new methods are called refined cross product-free (RCPF), refined cross product-free harmonic (RCPF-harmonic) and refined inverse-free harmonic (RIF-harmonic) JDGSVD algorithms, abbreviated as RCPF-JDGSVD, RCPF-HJDGSVD and RIF-HJDGSVD, respectively. The new JDGSVD methods are more efficient than the corresponding standard and harmonic extraction-based JDSVD methods proposed previously by the authors, and can overcome the erratic behavior and intrinsic possible non-convergence of the latter ones. Numerical experiments illustrate that RCPF-JDGSVD performs better for the computation of extreme GSVD components, while RCPF-HJDGSVD and RIF-HJDGSVD are better suited to that of interior GSVD components.
Submitted 29 September, 2023;
originally announced September 2023.
-
Entropic characterization of optimal rates for learning Gaussian mixtures
Authors:
Zeyu Jia,
Yury Polyanskiy,
Yihong Wu
Abstract:
We consider the question of estimating multi-dimensional Gaussian mixtures (GM) with compactly supported or subgaussian mixing distributions. Minimax estimation rate for this class (under Hellinger, TV and KL divergences) is a long-standing open question, even in one dimension. In this paper we characterize this rate (for all constant dimensions) in terms of the metric entropy of the class. Such characterizations originate from seminal works of Le Cam (1973); Birge (1983); Haussler and Opper (1997); Yang and Barron (1999). However, for GMs a key ingredient missing from earlier work (and widely sought-after) is a comparison result showing that the KL and the squared Hellinger distance are within a constant multiple of each other uniformly over the class. Our main technical contribution is in showing this fact, from which we derive entropy characterization for estimation rate under Hellinger and KL. Interestingly, the sequential (online learning) estimation rate is characterized by the global entropy, while the single-step (batch) rate corresponds to local entropy, paralleling a similar result for the Gaussian sequence model recently discovered by Neykov (2022) and Mourtada (2023). Additionally, since Hellinger is a proper metric, our comparison shows that GMs under KL satisfy the triangle inequality within multiplicative constants, implying that proper and improper estimation rates coincide.
Submitted 27 June, 2023; v1 submitted 21 June, 2023;
originally announced June 2023.
-
A New Low-Rank Learning Robust Quaternion Tensor Completion Method for Color Video Inpainting Problem and Fast Algorithms
Authors:
Zhigang Jia,
Jingfei Zhu
Abstract:
The color video inpainting problem is one of the most challenging problems in modern imaging science. It aims to recover a color video from a small fraction of its pixels, which may contain noise. However, few robust models can simultaneously preserve the coupling of color channels and the evolution of color video frames. In this paper, we present a new robust quaternion tensor completion (RQTC) model to solve this challenging problem and derive the exact recovery theory. The main idea is to build a quaternion tensor optimization model to recover a low-rank quaternion tensor that represents the targeted color video and a sparse quaternion tensor that represents noise. This new model is very efficient at recovering high-dimensional data that satisfies the prior low-rank assumption. To handle the case without the low-rank property, we introduce a new low-rank learning RQTC model, which rearranges similar patches classified by a quaternion learning method into smaller tensors satisfying the prior low-rank assumption. We also propose fast algorithms with global convergence guarantees. In numerical experiments, the proposed methods successfully recover color videos while eliminating color contamination and keeping the continuity of video scenery, and their solutions are of higher quality in terms of PSNR and SSIM values than those of the state-of-the-art algorithms.
Submitted 16 June, 2023;
originally announced June 2023.
-
Switch Updating in SPSA Algorithm for Stochastic Optimization with Inequality Constraints
Authors:
Zhichao Jia,
Ziyi Wei,
James C. Spall
Abstract:
Simultaneous perturbation stochastic approximation (SPSA) is widely used in stochastic optimization due to its high efficiency, asymptotic stability, and reduced number of required loss function measurements. However, the standard SPSA algorithm needs to be modified to deal with constrained problems. In recent years, sequential quadratic programming (SQP)-based projection ideas and penalty ideas have been analyzed. Both ideas have convergence results and a potentially wide range of applications, but with some limitations in practical consideration, such as computation time, complexity, and feasibility guarantee. We propose an SPSA-based switch updating algorithm, which updates based on the loss function or the inequality constraints, depending on current feasibility in each iteration. We show convergence results for the algorithm, and analyze its properties relative to other methods. We also numerically compare the switch updating algorithm with the penalty function approach for two constrained examples.
Submitted 5 February, 2023;
originally announced February 2023.
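The switch-updating idea in the abstract above (update on the loss when the iterate is feasible, on the inequality constraints otherwise) can be sketched with the standard two-measurement SPSA gradient estimate. The abstract does not give the exact update rule or gain sequences, so the single-constraint rule in `switch_update`, the fixed gains `a` and `c`, and the example functions are assumptions for illustration only.

```python
def spsa_gradient(f, x, c, delta):
    """Two-measurement SPSA gradient estimate of f at x.

    delta is the simultaneous perturbation vector with entries +1 or -1;
    all components share the same pair of function measurements.
    """
    x_plus = [xi + c * di for xi, di in zip(x, delta)]
    x_minus = [xi - c * di for xi, di in zip(x, delta)]
    diff = (f(x_plus) - f(x_minus)) / (2.0 * c)
    return [diff / di for di in delta]


def switch_update(loss, constraint, x, a, c, delta):
    """One illustrative switch-updating step (assumed rule):
    descend on the loss if constraint(x) <= 0, else on the constraint."""
    target = loss if constraint(x) <= 0 else constraint
    g_hat = spsa_gradient(target, x, c, delta)
    return [xi - a * gi for xi, gi in zip(x, g_hat)]


def loss(x):
    return x[0] ** 2 + x[1] ** 2


def constraint(x):
    return x[0] + x[1] - 2.0  # feasible when x[0] + x[1] <= 2


# The perturbation is fixed here so the two steps are reproducible.
feasible_step = switch_update(loss, constraint, [0.5, 0.5], a=0.1, c=0.1, delta=[1, 1])
infeasible_step = switch_update(loss, constraint, [1.0, 2.0], a=0.1, c=0.1, delta=[1, 1])
print(feasible_step, infeasible_step)
```

In an actual run the entries of `delta` would be drawn independently at random (for example with `random.choice([-1, 1])`) at every iteration, and the gains would typically decay over iterations.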
-
Testing mean and variance by e-processes
Authors:
Yixuan Fan,
Zhanyi Jiao,
Ruodu Wang
Abstract:
We address the problem of testing conditional mean and conditional variance for non-stationary data. We build e-values and p-values for four types of non-parametric composite hypotheses with specified mean and variance as well as other conditions on the shape of the data-generating distribution. These shape conditions include symmetry, unimodality, and their combination. Using the obtained e-values and p-values, we construct tests via e-processes, also known as testing by betting, as well as some tests based on combining p-values for comparison. Although we mainly focus on one-sided tests, the two-sided test for the mean is also studied. Simulation and empirical studies are conducted under a few settings, and they illustrate features of the methods based on e-processes.
Submitted 25 June, 2024; v1 submitted 29 January, 2023;
originally announced January 2023.
-
An augmented matrix-based CJ-FEAST SVDsolver for computing a partial singular value decomposition with the singular values in a given interval
Authors:
Zhongxiao Jia,
Kailiang Zhang
Abstract:
The cross-product matrix-based CJ-FEAST SVDsolver proposed previously by the authors is shown to compute the left singular vector possibly much less accurately than the right singular vector and may be numerically backward unstable when a desired singular value is small. In this paper, an alternative augmented matrix-based CJ-FEAST SVDsolver is considered to compute the singular triplets of a large matrix $A$ with the singular values in an interval $[a,b]$ contained in the singular spectrum. The new CJ-FEAST SVDsolver is a subspace iteration applied to an approximate spectral projector of the augmented matrix $[0, A^T; A, 0]$ associated with the eigenvalues in $[a,b]$, and constructs approximate left and right singular subspaces with the desired singular values independently, onto which $A$ is projected to obtain the Ritz approximations to the desired singular triplets. Compact estimates are given for the accuracy of the approximate spectral projector, and a number of convergence results are established. The new solver is proved to be always numerically backward stable. A convergence comparison of the cross-product and augmented matrix-based CJ-FEAST SVDsolvers is made, and a general-purpose choice strategy between the two solvers is proposed for the robustness and overall efficiency. Numerical experiments confirm all the results.
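As a minimal numerical sketch of the augmented-matrix idea (not the authors' CJ-FEAST SVDsolver), the following NumPy snippet checks that the nonzero eigenvalues of $[0, A^T; A, 0]$ are plus and minus the singular values of $A$, so the singular values in $[a,b]$ are exactly the eigenvalues of the augmented matrix in $[a,b]$. The matrix size, the random seed, the interval endpoints, and the helper name `augmented_matrix` are illustrative assumptions, and a dense eigensolver stands in for subspace iteration on an approximate spectral projector.

```python
import numpy as np

def augmented_matrix(A):
    """Return the symmetric matrix [[0, A^T], [A, 0]] for an m-by-n matrix A."""
    m, n = A.shape
    return np.block([[np.zeros((n, n)), A.T],
                     [A, np.zeros((m, m))]])

rng = np.random.default_rng(0)           # illustrative seed (assumption)
A = rng.standard_normal((6, 4))          # small dense test matrix (assumption)

sigma = np.linalg.svd(A, compute_uv=False)       # singular values, descending
eigs = np.linalg.eigvalsh(augmented_matrix(A))   # eigenvalues, ascending

# The positive eigenvalues of the augmented matrix are the singular values of A.
positive = np.sort(eigs[eigs > 1e-10])[::-1]

# Singular values in a hypothetical interval [a, b] match the eigenvalues there.
a, b = 1.0, 3.0
in_interval_svd = sigma[(sigma >= a) & (sigma <= b)]
in_interval_eig = positive[(positive >= a) & (positive <= b)]
```

Working with the augmented matrix avoids forming $A^TA$, whose eigenvalues are the squared singular values; this squaring is a plausible source of the accuracy loss for small singular values that the abstract attributes to the cross-product formulation.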
Submitted 16 January, 2023;
originally announced January 2023.
-
A skew-symmetric Lanczos bidiagonalization method for computing several largest eigenpairs of a large skew-symmetric matrix
Authors:
Jinzhi Huang,
Zhongxiao Jia
Abstract:
The spectral decomposition of a real skew-symmetric matrix $A$ can be mathematically transformed into a specific structured singular value decomposition (SVD) of $A$. Based on such equivalence, a skew-symmetric Lanczos bidiagonalization (SSLBD) method is proposed for the specific SVD problem that computes extreme singular values and the corresponding singular vectors of $A$, from which the eigenpairs of $A$ corresponding to the extreme conjugate eigenvalues in magnitude are recovered pairwise in real arithmetic. A number of convergence results on the method are established, and accuracy estimates for approximate singular triplets are given. In finite precision arithmetic, it is proven that the semi-orthogonality of each set of basis vectors and the semi-biorthogonality of two sets of basis vectors suffice to compute the singular values accurately. A commonly used efficient partial reorthogonalization strategy is adapted to maintaining the needed semi-orthogonality and semi-biorthogonality. For a practical purpose, an implicitly restarted SSLBD algorithm is developed with partial reorthogonalization. Numerical experiments illustrate the effectiveness and overall efficiency of the algorithm.
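The equivalence between the spectral decomposition and the SVD of a real skew-symmetric matrix can be checked numerically. The sketch below is not the SSLBD method itself; it only verifies, for a small random matrix, that the eigenvalues are purely imaginary and that their magnitudes coincide with the singular values, which therefore occur in pairs. The matrix size and the seed are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)      # illustrative seed (assumption)
B = rng.standard_normal((6, 6))
A = B - B.T                         # real skew-symmetric matrix: A^T = -A

eigvals = np.linalg.eigvals(A)      # purely imaginary, in conjugate pairs
sigma = np.linalg.svd(A, compute_uv=False)   # singular values, descending

# The magnitudes of the imaginary parts, sorted in descending order,
# coincide with the singular values of A.
imag_sorted = np.sort(np.abs(eigvals.imag))[::-1]
```

Each conjugate pair $\pm i\sigma$ of eigenvalues corresponds to a singular value $\sigma$ of multiplicity two, which is why the eigenpairs can be recovered pairwise in real arithmetic from singular triplets.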
Submitted 14 December, 2022;
originally announced December 2022.
-
Neumann Boundary Problem of Second Order Parabolic Quasi-Linear System with Variable Coefficient on a Vector Bundle
Authors:
Zonglin Jia
Abstract:
Classical results of second order parabolic quasi-linear equations always require that the nonlinear terms are controlled by a power of the unknown functions and their first derivatives. We improve the previous results. More precisely, in the present article the upper bound of nonlinear term can be extended to a power series.
Submitted 2 December, 2022;
originally announced December 2022.
-
First-Order Methods for Nonsmooth Nonconvex Functional Constrained Optimization with or without Slater Points
Authors:
Zhichao Jia,
Benjamin Grimmer
Abstract:
Constrained optimization problems where both the objective and constraints may be nonsmooth and nonconvex arise across many learning and data science settings. In this paper, we show for any Lipschitz, weakly convex objectives and constraints, a simple first-order method finds a feasible, $ε$-stationary point at a convergence rate of $O(ε^{-4})$ without relying on compactness or Constraint Qualification (CQ). When CQ holds, this convergence is measured by approximately satisfying the Karush-Kuhn-Tucker conditions. When CQ fails, we guarantee the attainment of weaker Fritz-John conditions. As an illustrative example, our method stably converges on piecewise quadratic SCAD regularized problems despite frequent violations of constraint qualification. The considered algorithm is similar to those of "Quadratically regularized subgradient methods for weakly convex optimization with weakly convex constraints" by Ma et al. and "Stochastic first-order methods for convex and nonconvex functional constrained optimization" by Boob et al. (whose guarantees further assume compactness and CQ), iteratively taking inexact proximal steps, computed via an inner loop applying a switching subgradient method to a strongly convex constrained subproblem. Our non-Lipschitz analysis of the switching subgradient method appears to be new and may be of independent interest.
Submitted 14 March, 2024; v1 submitted 1 December, 2022;
originally announced December 2022.
-
An analysis of the Rayleigh-Ritz and refined Rayleigh-Ritz methods for nonlinear eigenvalue problems
Authors:
Zhongxiao Jia,
Qingqing Zheng
Abstract:
We establish a general convergence theory of the Rayleigh--Ritz method and the refined Rayleigh--Ritz method for computing some simple eigenpair ($λ_{*},x_{*}$) of a given analytic nonlinear eigenvalue problem (NEP). In terms of the deviation $\varepsilon$ of $x_{*}$ from a given subspace $\mathcal{W}$, we establish a priori convergence results on the Ritz value, the Ritz vector and the refined Ritz vector, and present sufficient convergence conditions for them. The results show that, as $\varepsilon\rightarrow 0$, there is a Ritz value that unconditionally converges to $λ_*$ and the corresponding refined Ritz vector does so too but the Ritz vector may fail to converge and even may not be unique. We also present an error bound for the approximate eigenvector in terms of the computable residual norm of a given approximate eigenpair, and give lower and upper bounds for the error of the refined Ritz vector and the Ritz vector as well as for that of the corresponding residual norms. These results nontrivially extend some convergence results on these two methods for the linear eigenvalue problem to the NEP. Examples are constructed to illustrate some of the results.
Submitted 31 October, 2023; v1 submitted 1 December, 2022;
originally announced December 2022.
-
Quasi Non-Negative Quaternion Matrix Factorization with Application to Color Face Recognition
Authors:
Yifen Ke,
Changfeng Ma,
Zhigang Jia,
Yajun Xie,
Riwei Liao
Abstract:
To address the non-negativity dropout problem of quaternion models, a novel quasi non-negative quaternion matrix factorization (QNQMF) model is presented for color image processing. To implement QNQMF, the quaternion projected gradient algorithm and the quaternion alternating direction method of multipliers are proposed by formulating QNQMF as non-convex constrained quaternion optimization problems. Some properties of the proposed algorithms are studied. Numerical experiments on color image reconstruction show that these algorithms perform better when encoded on the quaternion than when encoded on the red, green and blue channels. Furthermore, we apply the proposed algorithms to color face recognition. Numerical results indicate that, for the same data, the face recognition accuracy of the quaternion model is better than that obtained on the red, green and blue channels of the color image as well as on a single channel of gray-level images, when large facial expression and shooting angle variations are present.
Submitted 29 November, 2022;
originally announced November 2022.
-
Rate of convergence of the smoothed empirical Wasserstein distance
Authors:
Adam Block,
Zeyu Jia,
Yury Polyanskiy,
Alexander Rakhlin
Abstract:
Consider an empirical measure $\mathbb{P}_n$ induced by $n$ iid samples from a $d$-dimensional $K$-subgaussian distribution $\mathbb{P}$ and let $γ= \mathcal{N}(0,σ^2 I_d)$ be the isotropic Gaussian measure. We study the speed of convergence of the smoothed Wasserstein distance $W_2(\mathbb{P}_n * γ, \mathbb{P}*γ) = n^{-α+ o(1)}$ with $*$ being the convolution of measures. For $K<σ$ and in any dimension $d\ge 1$ we show that $α= {1\over2}$. For $K>σ$ in dimension $d=1$ we show that the rate is slower and is given by $α= {(σ^2 + K^2)^2\over 4 (σ^4 + K^4)} < 1/2$. This resolves several open problems in \cite{goldfeld2020convergence}, and in particular precisely identifies the amount of smoothing $σ$ needed to obtain a parametric rate. In addition, we also establish that $D_{KL}(\mathbb{P}_n * γ\|\mathbb{P}*γ)$ has rate $O(1/n)$ for $K<σ$ but only slows down to $O({(\log n)^{d+1}\over n})$ for $K>σ$. The surprising difference of the behavior of $W_2^2$ and KL implies the failure of $T_{2}$-transportation inequality when $σ< K$. Consequently, the requirement $K<σ$ is necessary for validity of the log-Sobolev inequality (LSI) for the Gaussian mixture $\mathbb{P} * \mathcal{N}(0, σ^{2})$, closing an open problem in \cite{wang2016functional}, who established the LSI under precisely this condition.
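A short check, not taken from the paper, of why the stated exponent lies strictly below $1/2$ whenever $K \neq σ$: it follows from expanding a square.

```latex
\[
2(\sigma^4 + K^4) - (\sigma^2 + K^2)^2
  = \sigma^4 - 2\sigma^2 K^2 + K^4
  = (\sigma^2 - K^2)^2 \ge 0,
\]
\[
\text{so}\quad
\alpha = \frac{(\sigma^2 + K^2)^2}{4(\sigma^4 + K^4)} \le \frac{1}{2},
\quad\text{with equality if and only if } K = \sigma .
\]
```

Since $(σ^2 + K^2)^2 > σ^4 + K^4$ for positive $σ$ and $K$, the same expression also gives $α > 1/4$.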
Submitted 12 July, 2023; v1 submitted 4 May, 2022;
originally announced May 2022.
-
Stability for nonlinear wave motions damped by time-dependent frictions
Authors:
Zhe Jiao,
Yong Xu,
Lijing Zhao
Abstract:
We are concerned with the dynamical behavior of solutions to semilinear wave systems with time-varying damping and nonconvex force potential. Our result shows that the dynamical behavior of the solution is asymptotically stable without any bifurcation or chaos. Moreover, the condition on the damping coefficient for the solution to converge to some equilibrium is sharp. To illustrate our theoretical results, we provide some numerical simulations for the dissipative sine-Gordon equation and the dissipative Klein-Gordon equation.
Submitted 11 March, 2022;
originally announced March 2022.
-
Stability for acoustic wave motion with random force on the locally reacting boundary
Authors:
Zhe Jiao,
Yong Xu
Abstract:
This paper concerns the stability of acoustic wave motion with boundary friction and random forces. We show that there exists a unique invariant measure for the stochastic evolution equation associated with this acoustic wave motion, and that the invariant measure is strongly mixing. This result is new with respect to the literature on two accounts: (i) stochasticity is accounted for in the acoustic wave model; (ii) the controllability of the dynamical system modeling the acoustic wave motion implies the mixing.
Submitted 8 March, 2022;
originally announced March 2022.
-
Two harmonic Jacobi--Davidson methods for computing a partial generalized singular value decomposition of a large matrix pair
Authors:
Jinzhi Huang,
Zhongxiao Jia
Abstract:
Two harmonic extraction based Jacobi--Davidson (JD) type algorithms are proposed to compute a partial generalized singular value decomposition (GSVD) of a large regular matrix pair. They are called cross product-free (CPF) and inverse-free (IF) harmonic JDGSVD algorithms, abbreviated as CPF-HJDGSVD and IF-HJDGSVD, respectively. Compared with the standard extraction based JDGSVD algorithm, the harmonic extraction based algorithms converge more regularly and are better suited to computing GSVD components corresponding to interior generalized singular values. Thick-restart CPF-HJDGSVD and IF-HJDGSVD algorithms with some deflation and purgation techniques are developed to compute more than one GSVD component. Numerical experiments confirm the superiority of CPF-HJDGSVD and IF-HJDGSVD to the standard extraction based JDGSVD algorithm.
Submitted 8 January, 2022;
originally announced January 2022.
-
A FEAST SVDsolver based on Chebyshev--Jackson series for computing partial singular triplets of large matrices
Authors:
Zhongxiao Jia,
Kailiang Zhang
Abstract:
The FEAST eigensolver is extended to the computation of the singular triplets of a large matrix $A$ with the singular values in a given interval. The resulting FEAST SVDsolver is subspace iteration applied to an approximate spectral projector of $A^TA$ corresponding to the desired singular values in a given interval, and constructs approximate left and right singular subspaces corresponding to the desired singular values, onto which $A$ is projected to obtain Ritz approximations. Differently from a commonly used contour integral-based FEAST solver, we propose a robust alternative that constructs approximate spectral projectors by using the Chebyshev--Jackson polynomial series, which are symmetric positive semi-definite with the eigenvalues in $[0,1]$. We prove the pointwise convergence of this series and give compact estimates for pointwise errors of it and the step function that corresponds to the exact spectral projector. We present error bounds for the approximate spectral projector and reliable estimates for the number of desired singular triplets, establish numerous convergence results on the resulting FEAST SVDsolver, and propose practical selection strategies for determining the series degree and for reliably determining the subspace dimension. The solver and results on it are directly applicable or adaptable to the real symmetric and complex Hermitian eigenvalue problem. Numerical experiments illustrate that our FEAST SVDsolver is at least competitive with and is much more efficient than the contour integral-based FEAST SVDsolver when the desired singular values are extreme and interior ones, respectively, and it is also more robust than the latter.
Submitted 20 November, 2022; v1 submitted 8 January, 2022;
originally announced January 2022.
-
Global Existence for The Massive Dirac Equations with small initial datum on Tori
Authors:
Zonglin Jia
Abstract:
In this article we obtain almost global existence for the Dirac equations with high regularity and small initial data on tori. In addition, global existence with low regularity and small initial data is obtained. The main tools are Gagliardo-Nirenberg-Moser estimates and a Bernstein-type lemma.
Submitted 18 March, 2023; v1 submitted 6 January, 2022;
originally announced January 2022.
-
Averaging principle of stochastic Burgers equation driven by Lévy processes
Authors:
Hongge Yue,
Yong Xu,
Ruifang Wang,
Zhe Jiao
Abstract:
We are concerned with the averaging principle for the stochastic Burgers equation with slow-fast time scales. This slow-fast system is driven by Lévy processes. Under some appropriate conditions, we show that the slow component of this system strongly converges to a limit, which is characterized by the solution of a stochastic Burgers equation whose coefficients are averaged with respect to the stationary measure of the fast-varying jump-diffusion. To illustrate our theoretical result, we provide some numerical simulations.
Submitted 10 December, 2021;
originally announced December 2021.
-
A comparison of eigenvalue-based algorithms and the generalized Lanczos trust-region algorithm for Solving the trust-region subproblem
Authors:
Zhongxiao Jia,
Fa Wang
Abstract:
Solving the trust-region subproblem (TRS) plays a key role in numerical optimization and many other applications. Based on a fundamental result that the solution of TRS of size $n$ is mathematically equivalent to finding the rightmost eigenpair of a certain matrix pair of size $2n$, eigenvalue-based methods are promising due to their simplicity. For $n$ large, the implicitly restarted Arnoldi (IRA) and implicitly restarted refined Arnoldi (IRRA) algorithms are well suited for this eigenproblem. For a reasonable comparison of the overall efficiency of the algorithms for solving TRS directly and the eigenvalue-based algorithms, a vital premise is that the two kinds of algorithms must compute the approximate solutions of TRS with (almost) the same accuracy, but this premise has been ignored in the literature. To this end, we establish close relationships between the two kinds of residual norms, so that, given a stopping tolerance for IRA and IRRA, we are able to determine a reliable one that the Generalized Lanczos Trust-Region (GLTR) algorithm should use so as to ensure that GLTR, IRA and IRRA deliver converged approximate solutions with similar accuracy. We also make a convergence analysis of the residual norms of the GLTR algorithm for solving TRS directly, and of the Arnoldi method and the refined Arnoldi method for the equivalent eigenproblem. A number of numerical experiments are reported to illustrate that IRA and IRRA are competitive with GLTR and that IRRA outperforms IRA.
Submitted 18 February, 2021;
originally announced February 2021.
-
A contribution to condition numbers of the multidimensional total least squares problem with linear equality constraint
Authors:
Qiaohua Liu,
Zhigang Jia,
Yimin Wei
Abstract:
This paper is devoted to condition numbers of the multidimensional total least squares problem with linear equality constraint (TLSE). Based on the perturbation theory of invariant subspaces, the TLSE problem is proved to be equivalent to a multidimensional unconstrained weighted total least squares problem in the limit sense. With a limit technique, Kronecker-product-based formulae for normwise, mixed and componentwise condition numbers of the minimum Frobenius norm TLSE solution are given. Compact upper bounds of these condition numbers are provided to reduce the storage and computation cost. All expressions and upper bounds of these condition numbers unify the ones for the single-dimensional TLSE problem and the multidimensional total least squares problem. Some numerical experiments are performed to illustrate our results.
Submitted 17 December, 2020;
originally announced December 2020.
-
Non-Local Robust Quaternion Matrix Completion for Color Images and Videos Inpainting
Authors:
Zhigang Jia,
Qiyu Jin,
Michael K. Ng,
Xile Zhao
Abstract:
The image nonlocal self-similarity (NSS) prior refers to the fact that a local patch often has many nonlocal similar patches across the image, and it has been widely applied in many recently proposed machine learning algorithms for image processing. However, there is no theoretical analysis of its working principle in the literature. In this paper, we discover a potential causality between NSS and the low-rank property of color images, which also applies to grey images. A new patch group based NSS prior scheme is proposed to learn explicit NSS models of natural color images. The numerical low-rank property of patched matrices is also rigorously proved. The NSS-based QMC algorithm computes an optimal low-rank approximation to the high-rank color image, resulting in high PSNR and SSIM measures and, in particular, better visual quality. A new tensor NSS-based QMC method is also presented to solve the color video inpainting problem based on quaternion tensor representation. The numerical experiments on color images and videos indicate the advantages of NSS-based QMC over the state-of-the-art methods.
Submitted 13 May, 2022; v1 submitted 17 November, 2020;
originally announced November 2020.
-
Efficient Robust Watermarking Based on Quaternion Singular Value Decomposition and Coefficient Pair Selection
Authors:
Yong Chen,
Zhi-Gang Jia,
Ya-Xin Peng,
Yan Peng
Abstract:
Quaternion singular value decomposition (QSVD) is a robust technique of digital watermarking which can extract high quality watermarks from watermarked images with low distortion. In this paper, the QSVD technique is further investigated and an efficient robust watermarking scheme is proposed. An improved algebraic structure-preserving method is proposed to handle the problem of "explosion of complexity" that occurs in the conventional QSVD design. Secret information is transmitted blindly by incorporating in QSVD two new strategies, namely, coefficient pair selection and adaptive embedding. Unlike conventional QSVD, which embeds watermarks in a single imaginary unit, we propose to adaptively embed the watermark into the optimal hiding position using the Normalized Cross-Correlation (NC) method. This avoids selecting coefficient pairs with less correlation, and thus reduces the embedding impact by decreasing the maximum modification of coefficient values. In this way, compared with conventional QSVD, the proposed watermarking strategy avoids making more modifications to a single color image layer, and a better visual quality of the watermarked image is observed. Meanwhile, adaptive QSVD resists some common geometric attacks, and it improves the robustness of conventional QSVD. With these improvements, our method outperforms conventional QSVD. Its superiority over other state-of-the-art methods is also demonstrated experimentally.
Submitted 6 November, 2020;
originally announced November 2020.
-
Randomized Quaternion Singular Value Decomposition for Low-Rank Approximation
Authors:
Qiaohua Liu,
Sitao Ling,
Zhigang Jia
Abstract:
This paper presents a randomized quaternion singular value decomposition (QSVD) algorithm for low-rank matrix approximation problems, which are widely used in color face recognition, video compression, and signal processing problems. With quaternion normal distribution based random sampling, the randomized QSVD algorithm projects high-dimensional data to a low-dimensional subspace and then identifies an approximate range subspace of the quaternion matrix. The key statistical properties of the quaternion Wishart distribution are proposed and used to perform the approximation error analysis of the algorithm. Theoretical results show that the randomized QSVD algorithm can trace dominant singular value decomposition triplets of a quaternion matrix with acceptable accuracy. Numerical experiments also indicate the rationality of the proposed theories. Applied to color face recognition problems, the randomized QSVD algorithm obtains higher recognition accuracies and is more efficient than the known Lanczos-based partial QSVD and a quaternion version of the fast frequent directions algorithm.
Submitted 26 December, 2021; v1 submitted 6 November, 2020;
originally announced November 2020.
-
The central limit theorem for slow-fast systems with Lévy noise
Authors:
Xiaoyu Yang,
Yong Xu,
Ruifang Wang,
Zhe Jiao
Abstract:
We consider a slow-fast stochastic differential system with Lévy noise. We will employ the perturbed test function method to study the normal deviation of the slow-fast system. Our main result states that the deviation can be approximated by a Gaussian process and the central limit theorem is obtained for the system.
Submitted 11 March, 2024; v1 submitted 19 August, 2020;
originally announced August 2020.
-
On condition numbers of the total least squares problem with linear equality constraint
Authors:
Qiaohua Liu,
Zhigang Jia
Abstract:
This paper is devoted to condition numbers of the total least squares problem with linear equality constraint (TLSE). With novel limit techniques, closed formulae for normwise, mixed and componentwise condition numbers of the TLSE problem are derived. Computable expressions and upper bounds for these condition numbers are also given to avoid the costly Kronecker product-based operations. The results unify the ones for the TLS problem. For TLSE problems with equilibratory input data, numerical experiments illustrate that normwise condition number-based estimate is sharp to evaluate the forward error of the solution, while for sparse and badly scaled matrices, mixed and componentwise condition numbers-based estimates are much tighter.
Submitted 29 October, 2021; v1 submitted 18 August, 2020;
originally announced August 2020.
-
The Multi-Symplectic Lanczos Algorithm and Its Applications to Color Image Processing
Authors:
Zhigang Jia,
Xuan Liu,
Mei-Xiang Zhao
Abstract:
Low-rank approximations of original samples are playing an increasingly important role in many recently proposed mathematical models from data science. A natural and initial requirement is that these representations inherit original structures or properties. With this aim, we propose a new multi-symplectic method based on the Lanczos bidiagonalization to compute the partial singular triplets of JRS-symmetric matrices. These singular triplets can be used to reconstruct optimal low-rank approximations while preserving the intrinsic multi-symmetry. The augmented Ritz and harmonic Ritz vectors are used to perform implicit restarting to obtain a satisfactory bidiagonal matrix for calculating the $k$ largest or smallest singular triplets, respectively. We also apply the new multi-symplectic Lanczos algorithms to color face recognition and color video compression and reconstruction. Numerical experiments indicate their superiority over the state-of-the-art algorithms.
Submitted 4 May, 2020;
originally announced May 2020.
-
A cross-product free Jacobi-Davidson type method for computing a partial generalized singular value decomposition (GSVD) of a large matrix pair
Authors:
**zhi Huang,
Zhongxiao Jia
Abstract:
A Cross-Product Free (CPF) Jacobi-Davidson (JD) type method is proposed to compute a partial generalized singular value decomposition (GSVD) of a large regular matrix pair $(A,B)$. It implicitly solves the mathematically equivalent generalized eigenvalue problem of $(A^TA,B^TB)$ but does not explicitly form the cross-product matrices and thus avoids the possible accuracy loss of the computed generalized singular values and generalized singular vectors. The method is an inner-outer iteration method, where the expansion of the right searching subspace forms the inner iterations that approximately solve the correction equations involved and the outer iterations extract approximate GSVD components with respect to the subspaces. Some convergence results are established for the inner and outer iterations, based on some of which practical stopping criteria are designed for the inner iterations. A thick-restart CPF-JDGSVD algorithm with deflation is developed to compute several GSVD components. Numerical experiments illustrate the efficiency of the algorithm.
Submitted 9 March, 2021; v1 submitted 29 April, 2020;
originally announced April 2020.
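For orientation, the equivalence mentioned in the abstract above can be written out as a short equation. This is only a sketch in a common normalization: the symbols $x_i$, $\alpha_i$, $\beta_i$ and $\sigma_i$ are assumptions introduced here for illustration and do not appear in the listing.

```latex
% Generalized singular value \sigma_i of (A,B) with right generalized singular vector x_i:
A^T A\, x_i = \sigma_i^2\, B^T B\, x_i,
\qquad \sigma_i = \frac{\alpha_i}{\beta_i},
\qquad \alpha_i^2 + \beta_i^2 = 1 .
```

Explicitly forming $A^TA$ and $B^TB$ squares the condition numbers of $A$ and $B$, which is one standard reason for preferring a method that works with $A$ and $B$ directly.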
-
Theoretical and Computable Optimal Subspace Expansions for Matrix Eigenvalue Problems
Authors:
Zhongxiao Jia
Abstract:
Consider the optimal subspace expansion problem for the matrix eigenvalue problem $Ax=λx$: Which vector $w$ in the current subspace $\mathcal{V}$, after being multiplied by $A$, provides an optimal subspace expansion for approximating a desired eigenvector $x$ in the sense that $x$ has the smallest angle with the expanded subspace $\mathcal{V}_w=\mathcal{V}+{\rm span}\{Aw\}$, i.e., $w_{opt}=\arg\max_{w\in\mathcal{V}}\cos\angle(\mathcal{V}_w,x)$? This problem is important as many iterative methods construct nested subspaces that successively expand $\mathcal{V}$ to $\mathcal{V}_w$. An expression of $w_{opt}$ was given by Ye (Linear Algebra Appl., 428 (2008), pp. 911--918) for general $A$, but it could not be exploited to construct a computable (nearly) optimally expanded subspace. Ye then turned to deriving a maximization characterization of $\cos\angle(\mathcal{V}_w,x)$ for a {\em given} $w\in \mathcal{V}$ when $A$ is Hermitian. We generalize Ye's maximization characterization to the general case and find its maximizer. Our main contributions consist of explicit expressions of $w_{opt}$, $(I-P_V)Aw_{opt}$ and the optimally expanded subspace $\mathcal{V}_{w_{opt}}$ for general $A$, where $P_V$ is the orthogonal projector onto $\mathcal{V}$. These results are fully exploited to obtain computable optimally expanded subspaces within the framework of the standard, harmonic, refined, and refined harmonic Rayleigh--Ritz methods. We show how to efficiently implement the proposed subspace expansion approaches. Numerical experiments demonstrate the effectiveness of our computable optimal expansions.
Submitted 3 January, 2022; v1 submitted 10 April, 2020;
originally announced April 2020.
-
The Krylov Subspaces, Low Rank Approximations and Ritz Values of LSQR for Linear Discrete Ill-Posed Problems: the Multiple Singular Value Case
Authors:
Zhongxiao Jia
Abstract:
For the large-scale linear discrete ill-posed problem $\min\|Ax-b\|$ or $Ax=b$ with $b$ contaminated by white noise, the Golub-Kahan bidiagonalization based LSQR method and its mathematically equivalent CGLS, the Conjugate Gradient (CG) method applied to $A^TAx=A^Tb$, are most commonly used. They have intrinsic regularizing effects, where the iteration number $k$ plays the role of regularization parameter. The long-standing fundamental question is: {\em Can LSQR and CGLS find 2-norm filtering best possible regularized solutions}? The author has given definitive answers to this question for severely and moderately ill-posed problems when the singular values of $A$ are simple. This paper extends the results to the multiple singular value case, and studies the approximation accuracy of Krylov subspaces, the quality of low rank approximations generated by Golub-Kahan bidiagonalization and the convergence properties of Ritz values. For the two kinds of problems, we prove that LSQR finds 2-norm filtering best possible regularized solutions at semi-convergence. Particularly, we consider some important and untouched issues on best, near best and general rank $k$ approximations to $A$ for the ill-posed problems with the singular values $σ_k=\mathcal{O}(k^{-α})$ with $α>0$, and the relationships between them and their nonzero singular values. Numerical experiments confirm our theory. The results on general rank $k$ approximations and the properties of their nonzero singular values apply to several Krylov solvers, including LSQR, CGME, MINRES, MR-II, GMRES and RRGMRES.
Submitted 19 March, 2020;
originally announced March 2020.
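The abstract above treats the iteration number $k$ of CGLS as the regularization parameter. A minimal sketch of that idea, assuming the textbook CGLS recurrence on dense list-of-lists matrices, is given below; the function name `cgls_iterates` and the helpers are hypothetical, and this is not the author's implementation.

```python
def dot(x, y):
    """Euclidean inner product of two equal-length lists."""
    return sum(a * b for a, b in zip(x, y))


def matvec(A, x):
    """Compute A @ x for a dense matrix stored as a list of rows."""
    return [dot(row, x) for row in A]


def rmatvec(A, y):
    """Compute A^T @ y without forming A^T explicitly."""
    n = len(A[0])
    return [sum(A[i][j] * y[i] for i in range(len(A))) for j in range(n)]


def cgls_iterates(A, b, k):
    """Run at most k CGLS steps on min ||Ax - b|| starting from x = 0.

    Returns the list of iterates x_1, ..., x_k. Stopping early (small k)
    acts as regularization; choosing k is left to the caller.
    """
    n = len(A[0])
    x = [0.0] * n
    r = list(b)                 # residual b - A x
    s = rmatvec(A, r)           # normal-equations residual A^T r
    p = list(s)                 # search direction
    gamma = dot(s, s)
    iterates = []
    for _ in range(k):
        if gamma == 0.0:        # already an exact least squares solution
            break
        q = matvec(A, p)
        alpha = gamma / dot(q, q)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * qi for ri, qi in zip(r, q)]
        s = rmatvec(A, r)
        gamma_new = dot(s, s)
        p = [si + (gamma_new / gamma) * pi for si, pi in zip(s, p)]
        gamma = gamma_new
        iterates.append(list(x))
    return iterates


if __name__ == "__main__":
    A = [[1.0, 0.0], [0.0, 2.0], [1.0, 1.0]]
    b = [1.0, 2.0, 3.0]
    for step, x in enumerate(cgls_iterates(A, b, 2), start=1):
        print(step, x)
```

Returning every iterate rather than only the last one makes it easy to inspect how the solution changes with $k$, which is the quantity one would monitor when looking for semi-convergence on noisy data.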
-
The joint bidiagonalization process with partial reorthogonalization
Authors:
Zhongxiao Jia,
Haibo Li
Abstract:
The joint bidiagonalization (JBD) process is a useful algorithm for the computation of the generalized singular value decomposition (GSVD) of a matrix pair. However, it always suffers from rounding errors, which cause the Lanczos vectors to lose their mutual orthogonality. In order to maintain some level of orthogonality, we present a semiorthogonalization strategy. Our rounding error analysis shows that the JBD process with the semiorthogonalization strategy can ensure that the convergence of the computed quantities is not affected by rounding errors and the final accuracy is high enough. Based on the semiorthogonalization strategy, we develop the joint bidiagonalization process with partial reorthogonalization (JBDPRO). In the JBDPRO algorithm, reorthogonalizations occur only when necessary, which saves a large amount of reorthogonalization work compared with the full reorthogonalization strategy. Numerical experiments illustrate our theory and algorithm.
Submitted 7 January, 2021; v1 submitted 10 January, 2020;
originally announced January 2020.
-
Advanced Variations of Two-Dimensional Principal Component Analysis for Face Recognition
Authors:
Meixiang Zhao,
Zhigang Jia,
Yunfeng Cai,
Xiao Chen,
Dunwei Gong
Abstract:
The two-dimensional principal component analysis (2DPCA) has become one of the most powerful tools of artificial intelligence algorithms. In this paper, we review 2DPCA and its variations, and propose a general ridge regression model to extract features from both row and column directions. To enhance the generalization ability of extracted features, a novel relaxed 2DPCA (R2DPCA) is proposed with a new ridge regression model. R2DPCA generates a weighting vector by utilizing the label information, and maximizes a relaxed criterion by applying an optimal algorithm to get the essential features. The R2DPCA-based approaches for face recognition and image reconstruction are also proposed and the selected principal components are weighted to enhance the role of main components. Numerical experiments on well-known standard databases indicate that R2DPCA has high generalization ability and can achieve a higher recognition rate than the state-of-the-art methods, including deep learning methods such as CNNs, DBNs, and DNNs.
Submitted 19 December, 2019;
originally announced December 2019.
-
The joint bidiagonalization method for large GSVD computations in finite precision
Authors:
Zhongxiao Jia,
Haibo Li
Abstract:
The joint bidiagonalization (JBD) method has been used to compute some extreme generalized singular values and vectors of a large regular matrix pair $\{A,L\}$, where we propose three approaches to compute approximate generalized singular values and vectors. We make a numerical analysis of the underlying JBD process and establish relationships between it and two mathematically equivalent Lanczos bidiagonalizations in finite precision. Based on the results of numerical analysis, we investigate the convergence of the approximate generalized singular values and vectors of $\{A,L\}$. The results show that, under some mild conditions, the semiorthogonality of Lanczos type vectors suffices to deliver approximate generalized singular values with the same accuracy as the full orthogonality does, meaning that it is only necessary to seek efficient semiorthogonalization strategies for the JBD process. We also establish a sharp bound for the residual norm of an approximate generalized singular value and corresponding approximate right generalized singular vectors, which can reliably estimate the residual norm without explicitly computing the approximate right generalized singular vectors before the convergence occurs.
Submitted 3 January, 2022; v1 submitted 18 December, 2019;
originally announced December 2019.
-
Adaptive dynamic programming for nonaffine nonlinear optimal control problem with state constraints
Authors:
**gliang Duan,
Zhengyu Liu,
Shengbo Eben Li,
Qi Sun,
Zhenzhong Jia,
Bo Cheng
Abstract:
This paper presents a constrained adaptive dynamic programming (CADP) algorithm to solve general nonlinear nonaffine optimal control problems with known dynamics. Unlike previous ADP algorithms, it can directly deal with problems with state constraints. Firstly, a constrained generalized policy iteration (CGPI) framework is developed to handle state constraints by transforming the traditional policy improvement process into a constrained policy optimization problem. Next, we propose an actor-critic variant of CGPI, called CADP, in which both policy and value functions are approximated by multi-layer neural networks to directly map the system states to control inputs and value function, respectively. CADP linearizes the constrained optimization problem locally into a quadratically constrained linear programming problem, and then obtains the optimal update of the policy network by solving its dual problem. A trust region constraint is added to prevent excessive policy update, thus ensuring linearization accuracy. We determine the feasibility of the policy optimization problem by calculating the minimum trust region boundary and update the policy using two recovery rules when infeasible. The vehicle control problem in the path-tracking task is used to demonstrate the effectiveness of this proposed method.
Submitted 8 April, 2022; v1 submitted 26 November, 2019;
originally announced November 2019.
-
The convergence of the Generalized Lanczos Trust-Region Method for the Trust-Region Subproblem
Authors:
Zhongxiao Jia,
Fa Wang
Abstract:
Solving the trust-region subproblem (TRS) plays a key role in numerical optimization and many other applications. The generalized Lanczos trust-region (GLTR) method is a well-known Lanczos type approach for solving a large-scale TRS. The method projects the original large-scale TRS onto a $k$ dimensional Krylov subspace, whose orthonormal basis is generated by the symmetric Lanczos process, and computes an approximate solution from the underlying subspace. There have been some a-priori error bounds for the optimal solution and the optimal objective value in the literature, but no a-priori result exists on the convergence of Lagrangian multipliers involved in projected TRS's and the residual norm of approximate solution. In this paper, a general convergence theory of the GLTR method is established, and a-priori bounds are derived for the errors of the optimal Lagrangian multiplier, the optimal solution, the optimal objective value and the residual norm of approximate solution. Numerical experiments demonstrate that our bounds are realistic and predict the convergence rates of the three errors and residual norms accurately.
Submitted 6 August, 2019;
originally announced August 2019.
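The abstract above relies on the symmetric Lanczos process to build an orthonormal basis of a $k$ dimensional Krylov subspace. A minimal sketch of that building block follows, assuming a dense symmetric matrix stored as a list of rows; the function name `lanczos` is hypothetical, and the full reorthogonalization step is an addition made here for numerical safety rather than something stated in the listing. The trust-region solve on the projected problem is not shown.

```python
import math


def dot(x, y):
    """Euclidean inner product of two equal-length lists."""
    return sum(a * b for a, b in zip(x, y))


def matvec(A, x):
    """Compute A @ x for a dense matrix stored as a list of rows."""
    return [dot(row, x) for row in A]


def lanczos(A, g, k):
    """Run up to k steps of the symmetric Lanczos process started from g.

    Returns (Q, alphas, betas): Q holds the orthonormal basis vectors,
    alphas the diagonal and betas the off-diagonal of the tridiagonal
    projection T_k = Q^T A Q. Stops early if the Krylov subspace is exhausted.
    """
    norm_g = math.sqrt(dot(g, g))
    q = [v / norm_g for v in g]
    q_prev = [0.0] * len(g)
    beta = 0.0
    Q, alphas, betas = [], [], []
    for j in range(k):
        Q.append(q)
        w = matvec(A, q)
        alpha = dot(w, q)
        alphas.append(alpha)
        # Three-term recurrence: remove components along q_j and q_{j-1}.
        w = [wi - alpha * qi - beta * pi for wi, qi, pi in zip(w, q, q_prev)]
        # Assumption: full reorthogonalization against all previous vectors.
        for v in Q:
            c = dot(w, v)
            w = [wi - c * vi for wi, vi in zip(w, v)]
        beta = math.sqrt(dot(w, w))
        if j == k - 1 or beta < 1e-12:
            break
        betas.append(beta)
        q_prev, q = q, [wi / beta for wi in w]
    return Q, alphas, betas


if __name__ == "__main__":
    A = [[4.0, 1.0, 0.0, 0.0],
         [1.0, 3.0, 1.0, 0.0],
         [0.0, 1.0, 2.0, 1.0],
         [0.0, 0.0, 1.0, 1.0]]
    Q, alphas, betas = lanczos(A, [1.0, 1.0, 1.0, 1.0], 3)
    print("diagonal of T_k:", alphas)
    print("off-diagonal of T_k:", betas)
```

A Lanczos-based trust-region method would then solve the small subproblem defined by the tridiagonal matrix with entries `alphas` and `betas` and map the result back through `Q`.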
-
On choices of formulations of computing the generalized singular value decomposition of a large matrix pair
Authors:
**zhi Huang,
Zhongxiao Jia
Abstract:
For the computation of the generalized singular value decomposition (GSVD) of a large matrix pair $(A,B)$ of full column rank, the GSVD is commonly formulated as two mathematically equivalent generalized eigenvalue problems, so that a generalized eigensolver can be applied to one of them and the desired GSVD components are then recovered from the computed generalized eigenpairs. Our concern in this paper is, in finite precision arithmetic, which generalized eigenvalue formulation is numerically preferable to compute the desired GSVD components more accurately. We make a detailed perturbation analysis on the two formulations and show how to make a suitable choice between them. Numerical experiments illustrate the results obtained.
Submitted 10 January, 2020; v1 submitted 24 July, 2019;
originally announced July 2019.
-
Global existence of the solution to Einstein-Maxwell equations with small initial data
Authors:
Zonglin Jia,
Boling Guo
Abstract:
We study the global existence of Einstein-Maxwell (EM) equations on $\mathbb{R}^4$. We use a method that relies on wave and Lorentzian gauge conditions to obtain some exquisite estimates. Our main conclusion is that if the initial data is small enough, then the EM system has a global-in-time solution.
Submitted 5 September, 2019; v1 submitted 4 July, 2019;
originally announced July 2019.
-
Toward Solving 2-TBSG Efficiently
Authors:
Zeyu Jia,
Zaiwen Wen,
Yinyu Ye
Abstract:
2-TBSG is a two-player game model which aims to find Nash equilibria and is widely utilized in reinforcement learning and AI. Inspired by the fact that the simplex method for solving the deterministic discounted Markov decision processes (MDPs) is strongly polynomial independent of the discount factor, we try to answer the open problem of whether there is a similar algorithm for 2-TBSG. We develop a simplex strategy iteration where one player updates its strategy with a simplex step while the other player finds an optimal counterstrategy in turn, and a modified simplex strategy iteration. Both of them belong to a class of geometrically converging algorithms. We establish the strongly polynomial property of these algorithms by considering a strategy combined from the current strategy and the equilibrium strategy. Moreover, we present a method to transform general 2-TBSGs into special 2-TBSGs where each state has exactly two actions.
Submitted 8 June, 2019;
originally announced June 2019.