Search | arXiv e-print repository

arXiv:2405.20390 [pdf, other]

Quantitative Convergences of Lie Group Momentum Optimizers

Abstract: Explicit, momentum-based dynamics that optimize functions defined on Lie groups can be constructed via variational optimization and momentum trivialization. Structure preserving time discretizations can then turn this dynamics into optimization algorithms. This article investigates two types of discretization, Lie Heavy-Ball, which is a known splitting scheme, and Lie NAG-SC, which is newly propos… ▽ More Explicit, momentum-based dynamics that optimize functions defined on Lie groups can be constructed via variational optimization and momentum trivialization. Structure preserving time discretizations can then turn this dynamics into optimization algorithms. This article investigates two types of discretization, Lie Heavy-Ball, which is a known splitting scheme, and Lie NAG-SC, which is newly proposed. Their convergence rates are explicitly quantified under $L$-smoothness and local strong convexity assumptions. Lie NAG-SC provides acceleration over the momentumless case, i.e. Riemannian gradient descent, but Lie Heavy-Ball does not. When compared to existing accelerated optimizers for general manifolds, both Lie Heavy-Ball and Lie NAG-SC are computationally cheaper and easier to implement, thanks to their utilization of group structure. Only gradient oracle and exponential map are required, but not logarithm map or parallel transport which are computational costly. △ Less

Submitted 30 May, 2024; originally announced May 2024.

arXiv:2405.06889 [pdf, other]

Tuning parameter selection for the adaptive nuclear norm regularized trace regression

Authors: Pan Shang, Lingchen Kong, Yiting Ma

Abstract: Regularized models have been applied in lots of areas, with high-dimensional data sets being popular. Because tuning parameter decides the theoretical performance and computational efficiency of the regularized models, tuning parameter selection is a basic and important issue. We consider the tuning parameter selection for adaptive nuclear norm regularized trace regression, which achieves by the B… ▽ More Regularized models have been applied in lots of areas, with high-dimensional data sets being popular. Because tuning parameter decides the theoretical performance and computational efficiency of the regularized models, tuning parameter selection is a basic and important issue. We consider the tuning parameter selection for adaptive nuclear norm regularized trace regression, which achieves by the Bayesian information criterion (BIC). The proposed BIC is established with the help of an unbiased estimator of degrees of freedom. Under some regularized conditions, this BIC is proved to achieve the rank consistency of the tuning parameter selection. That is the model solution under selected tuning parameter converges to the true solution and has the same rank with that of the true solution in probability. Some numerical results are presented to evaluate the performance of the proposed BIC on tuning parameter selection. △ Less

Submitted 10 May, 2024; originally announced May 2024.

arXiv:2404.10262 [pdf, other]

Safe Feature Identification Rule for Fused Lasso by An Extra Dual Variable

Authors: Pan Shang, Huangyue Chen, Lingchen Kong

Abstract: Fused Lasso was proposed to characterize the sparsity of the coefficients and the sparsity of their successive differences for the linear regression. Due to its wide applications, there are many existing algorithms to solve fused Lasso. However, the computation of this model is time-consuming in high-dimensional data sets. To accelerate the calculation of fused Lasso in high-dimension data sets, w… ▽ More Fused Lasso was proposed to characterize the sparsity of the coefficients and the sparsity of their successive differences for the linear regression. Due to its wide applications, there are many existing algorithms to solve fused Lasso. However, the computation of this model is time-consuming in high-dimensional data sets. To accelerate the calculation of fused Lasso in high-dimension data sets, we build up the safe feature identification rule by introducing an extra dual variable. With a low computational cost, this rule can eliminate inactive features with zero coefficients and identify adjacent features with same coefficients in the solution. To the best of our knowledge, existing screening rules can not be applied to speed up the computation of fused Lasso and our work is the first one to deal with this problem. To emphasize our rule is a unique result that is capable of identifying adjacent features with same coefficients, we name the result as the safe feature identification rule. Numerical experiments on simulation and real data illustrate the efficiency of the rule, which means this rule can reduce the computational time of fused Lasso. In addition, our rule can be embedded into any efficient algorithm and speed up the computational process of fused Lasso. △ Less

Submitted 15 April, 2024; originally announced April 2024.

arXiv:2404.07459 [pdf, other]

Safe subspace screening for the adaptive nuclear norm regularized trace regression

Authors: Pan Shang, Lingchen Kong

Abstract: Matrix form data sets arise in many areas, so there are lots of works about the matrix regression models. One special model of these models is the adaptive nuclear norm regularized trace regression, which has been proven have good statistical performances. In order to accelerate the computation of this model, we consider the technique called screening rule. According to matrix decomposition and op… ▽ More Matrix form data sets arise in many areas, so there are lots of works about the matrix regression models. One special model of these models is the adaptive nuclear norm regularized trace regression, which has been proven have good statistical performances. In order to accelerate the computation of this model, we consider the technique called screening rule. According to matrix decomposition and optimal condition of the model, we develop a safe subspace screening rule that can be used to identify inactive subspace of the solution decomposition and reduce the dimension of the solution. To evaluate the efficiency of the safe subspace screening rule, we embed this result into the alternating direction method of multipliers algorithm under a sequence of the tuning parameters. Under this process, each solution under the tuning parameter provides a matrix decomposition space. Then, the safe subspace screening rule is applied to eliminate inactive subspace, reduce the solution dimension and accelerate the computation process. Some numerical experiments are implemented on simulation data sets and real data sets, which illustrate the efficiency of our screening rule. △ Less

Submitted 15 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

arXiv:2403.18528 [pdf, other]

Limited Attention Allocation in a Stochastic Linear Quadratic System with Multiplicative Noise

Authors: Xiangyu Cui, Jianjun Gao, Lingjie Kong

Abstract: This study addresses limited attention allocation in a stochastic linear quadratic system with multiplicative noise. Our approach enables strategic resource allocation to enhance noise estimation and improve control decisions. We provide analytical optimal control and propose a numerical method for optimal attention allocation. Additionally, we apply our ffndings to dynamic mean-variance portfolio… ▽ More This study addresses limited attention allocation in a stochastic linear quadratic system with multiplicative noise. Our approach enables strategic resource allocation to enhance noise estimation and improve control decisions. We provide analytical optimal control and propose a numerical method for optimal attention allocation. Additionally, we apply our ffndings to dynamic mean-variance portfolio selection, showing effective resource allocation across time periods and factors, providing valuable insights for investors. △ Less

Submitted 27 March, 2024; originally announced March 2024.

arXiv:2403.12012 [pdf, other]

Convergence of Kinetic Langevin Monte Carlo on Lie groups

Authors: Lingkai Kong, Molei Tao

Abstract: Explicit, momentum-based dynamics for optimizing functions defined on Lie groups was recently constructed, based on techniques such as variational optimization and left trivialization. We appropriately add tractable noise to the optimization dynamics to turn it into a sampling dynamics, leveraging the advantageous feature that the trivialized momentum variable is Euclidean despite that the potenti… ▽ More Explicit, momentum-based dynamics for optimizing functions defined on Lie groups was recently constructed, based on techniques such as variational optimization and left trivialization. We appropriately add tractable noise to the optimization dynamics to turn it into a sampling dynamics, leveraging the advantageous feature that the trivialized momentum variable is Euclidean despite that the potential function lives on a manifold. We then propose a Lie-group MCMC sampler, by delicately discretizing the resulting kinetic-Langevin-type sampling dynamics. The Lie group structure is exactly preserved by this discretization. Exponential convergence with explicit convergence rate for both the continuous dynamics and the discrete sampler are then proved under $W_2$ distance. Only compactness of the Lie group and geodesically $L$-smoothness of the potential function are needed. To the best of our knowledge, this is the first convergence result for kinetic Langevin on curved spaces, and also the first quantitative result that requires no convexity or, at least not explicitly, any common relaxation such as isoperimetry. △ Less

Submitted 17 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

arXiv:2403.07813 [pdf, ps, other]

Higher condensation theory

Authors: Liang Kong, Zhi-Hao Zhang, Jiaheng Zhao, Hao Zheng

Abstract: We develop a unified theory of defect condensations for topological orders in all dimensions based on higher categories, higher algebras and higher representations. We show that condensing a $k$-codimensional topological defect $A$ in an $n$+1D (potentially anomalous) topological order $\mathsf C^{n+1}$ amounts to a $k$-step process. In the first step, we condense $A$ along one of the transversal… ▽ More We develop a unified theory of defect condensations for topological orders in all dimensions based on higher categories, higher algebras and higher representations. We show that condensing a $k$-codimensional topological defect $A$ in an $n$+1D (potentially anomalous) topological order $\mathsf C^{n+1}$ amounts to a $k$-step process. In the first step, we condense $A$ along one of the transversal directions, thus obtaining a $(k-1)$-codimensional defect $ΣA$, which can be further condensed as the second step, so on and so forth. In the $k$-th step, condensing $Σ^{k-1}A$ along the only transversal direction defines a phase transition to a new phase $\mathsf D^{n+1}$. Mathematically, a $k$-codimensional defect $A$ is condensable if it is equipped with the structure of a condensable $E_k$-algebra. In this case, $ΣA$ is naturally a condensable $E_{k-1}$-algebra, thus it can be further condensed. The condensed phase $\mathsf D^{n+1}$ consists of all deconfined topological defects in $\mathsf C^{n+1}$. A $k$-codimensional topological defect is deconfined if and only if it is equipped with a $k$-dimensional $A$-action, which defines an $E_k$-module over $A$. When $\mathsf C^{n+1}$ is anomaly-free, the same condensation can be alternatively defined by replacing the last two steps by a single step of condensing the $E_2$-algebra $Σ^{k-2}A$ directly. The condensed phase $\mathsf D^{n+1}$ is determined by the category of $E_2$-modules over $Σ^{k-2}A$. When $n=2$, this modified last step is precisely a usual anyon condensation in a 2+1D topological order. The proofs of the most mathematical results will appear in a mathematical companion of this paper. We also briefly discuss some generalizations and applications that naturally arise from our condensation theory such as higher Morita theory, factorization homology and the condensation theory of non-topological defects. △ Less

Submitted 12 March, 2024; originally announced March 2024.

Comments: 120 pages. We are preparing the second version, in which more remarks, examples and references will be added. Comments are welcome

arXiv:2402.14877 [pdf, other]

Machine-learning prediction of tip** and collapse of the Atlantic Meridional Overturning Circulation

Authors: Shirin Panahi, Ling-Wei Kong, Mohammadamin Moradi, Zheng-Meng Zhai, Bryan Glaz, Mulugeta Haile, Ying-Cheng Lai

Abstract: Recent research on the Atlantic Meridional Overturning Circulation (AMOC) raised concern about its potential collapse through a tip** point due to the climate-change caused increase in the freshwater input into the North Atlantic. The predicted time window of collapse is centered about the middle of the century and the earliest possible start is approximately two years from now. More generally,… ▽ More Recent research on the Atlantic Meridional Overturning Circulation (AMOC) raised concern about its potential collapse through a tip** point due to the climate-change caused increase in the freshwater input into the North Atlantic. The predicted time window of collapse is centered about the middle of the century and the earliest possible start is approximately two years from now. More generally, anticipating a tip** point at which the system transitions from one stable steady state to another is relevant to a broad range of fields. We develop a machine-learning approach to predicting tip** in noisy dynamical systems with a time-varying parameter and test it on a number of systems including the AMOC, ecological networks, an electrical power system, and a climate model. For the AMOC, our prediction based on simulated fingerprint data and real data of the sea surface temperature places the time window of a potential collapse between the years 2040 and 2065. △ Less

Submitted 21 February, 2024; originally announced February 2024.

Comments: 6 pages, 3 figures

arXiv:2312.02421 [pdf, ps, other]

Inverse conductivity problem with one measurement: Uniqueness of multi-layer structures

Authors: Lingzheng Kong, Youjun Deng, Liyan Zhu

Abstract: In this paper, we study the recovery of multi-layer structures in inverse conductivity problem by using one measurement. First, we define the concept of Generalized Polarization Tensors (GPTs) for multi-layered medium and show some important properties of the proposed GPTs. With the help of GPTs, we present the perturbation formula for general multi-layered medium. Then we derive the perturbed ele… ▽ More In this paper, we study the recovery of multi-layer structures in inverse conductivity problem by using one measurement. First, we define the concept of Generalized Polarization Tensors (GPTs) for multi-layered medium and show some important properties of the proposed GPTs. With the help of GPTs, we present the perturbation formula for general multi-layered medium. Then we derive the perturbed electric potential for multi-layer concentric disks structure in terms of the so-called generalized polarization matrix, whose dimension is the same as the number of the layers. By delicate analysis, we derive an algebraic identity involving the geometric and material configurations of multi-layer concentric disks. This enables us to reconstruct the multi-layer structures by using only one partial-order measurement. △ Less

Submitted 4 December, 2023; originally announced December 2023.

MSC Class: 31A25; 35J05; 86A20

arXiv:2309.11470 [pdf, other]

doi 10.1038/s41467-023-41379-3

Model-free tracking control of complex dynamical trajectories with machine learning

Authors: Zheng-Meng Zhai, Mohammadamin Moradi, Ling-Wei Kong, Bryan Glaz, Mulugeta Haile, Ying-Cheng Lai

Abstract: Nonlinear tracking control enabling a dynamical system to track a desired trajectory is fundamental to robotics, serving a wide range of civil and defense applications. In control engineering, designing tracking control requires complete knowledge of the system model and equations. We develop a model-free, machine-learning framework to control a two-arm robotic manipulator using only partially obs… ▽ More Nonlinear tracking control enabling a dynamical system to track a desired trajectory is fundamental to robotics, serving a wide range of civil and defense applications. In control engineering, designing tracking control requires complete knowledge of the system model and equations. We develop a model-free, machine-learning framework to control a two-arm robotic manipulator using only partially observed states, where the controller is realized by reservoir computing. Stochastic input is exploited for training, which consists of the observed partial state vector as the first and its immediate future as the second component so that the neural machine regards the latter as the future state of the former. In the testing (deployment) phase, the immediate-future component is replaced by the desired observational vector from the reference trajectory. We demonstrate the effectiveness of the control framework using a variety of periodic and chaotic signals, and establish its robustness against measurement noise, disturbances, and uncertainties. △ Less

Submitted 20 September, 2023; originally announced September 2023.

Comments: 16 pages, 8 figures

Journal ref: Nat Commun 14, 5698 (2023)

arXiv:2306.13850 [pdf, other]

High-dimensional outlier detection and variable selection via adaptive weighted mean regression

Authors: Jiaqi Li, Linglong Kong, Bei Jiang, Wei Tu

Abstract: This paper proposes an adaptive penalized weighted mean regression for outlier detection of high-dimensional data. In comparison to existing approaches based on the mean shift model, the proposed estimators demonstrate robustness against outliers present in both response variables and/or covariates. By utilizing the adaptive Huber loss function, the proposed method is effective in high-dimensional… ▽ More This paper proposes an adaptive penalized weighted mean regression for outlier detection of high-dimensional data. In comparison to existing approaches based on the mean shift model, the proposed estimators demonstrate robustness against outliers present in both response variables and/or covariates. By utilizing the adaptive Huber loss function, the proposed method is effective in high-dimensional linear models characterized by heavy-tailed and heteroscedastic error distributions. The proposed framework enables simultaneous and collaborative estimation of regression parameters and outlier detection. Under regularity conditions, outlier detection consistency and oracle inequalities of robust estimates in high-dimensional settings are established. Additionally, theoretical robustness properties, such as the breakdown point and a smoothed limiting influence function, are ascertained. Extensive simulation studies and a breast cancer survival data are used to evaluate the numerical performance of the proposed method, demonstrating comparable or superior variable selection and outlier detection capabilities. △ Less

Submitted 23 June, 2023; originally announced June 2023.

Comments: 8 Tables, 4 figures

arXiv:2303.15464 [pdf, other]

Mathematical Challenges in Deep Learning

Authors: Vahid Partovi Nia, Guojun Zhang, Ivan Kobyzev, Michael R. Metel, Xinlin Li, Ke Sun, Sobhan Hemati, Masoud Asgharian, Linglong Kong, Wulong Liu, Boxing Chen

Abstract: Deep models are dominating the artificial intelligence (AI) industry since the ImageNet challenge in 2012. The size of deep models is increasing ever since, which brings new challenges to this field with applications in cell phones, personal computers, autonomous cars, and wireless base stations. Here we list a set of problems, ranging from training, inference, generalization bound, and optimizati… ▽ More Deep models are dominating the artificial intelligence (AI) industry since the ImageNet challenge in 2012. The size of deep models is increasing ever since, which brings new challenges to this field with applications in cell phones, personal computers, autonomous cars, and wireless base stations. Here we list a set of problems, ranging from training, inference, generalization bound, and optimization with some formalism to communicate these challenges with mathematicians, statisticians, and theoretical computer scientists. This is a subjective view of the research questions in deep learning that benefits the tech industry in long run. △ Less

Submitted 24 March, 2023; originally announced March 2023.

arXiv:2303.06595 [pdf, other]

A Convergent Single-Loop Algorithm for Relaxation of Gromov-Wasserstein in Graph Data

Authors: Jia** Li, Jianheng Tang, Lemin Kong, Huikang Liu, Jia Li, Anthony Man-Cho So, Jose Blanchet

Abstract: In this work, we present the Bregman Alternating Projected Gradient (BAPG) method, a single-loop algorithm that offers an approximate solution to the Gromov-Wasserstein (GW) distance. We introduce a novel relaxation technique that balances accuracy and computational efficiency, albeit with some compromises in the feasibility of the coupling map. Our analysis is based on the observation that the GW… ▽ More In this work, we present the Bregman Alternating Projected Gradient (BAPG) method, a single-loop algorithm that offers an approximate solution to the Gromov-Wasserstein (GW) distance. We introduce a novel relaxation technique that balances accuracy and computational efficiency, albeit with some compromises in the feasibility of the coupling map. Our analysis is based on the observation that the GW problem satisfies the Luo-Tseng error bound condition, which relates to estimating the distance of a point to the critical point set of the GW problem based on the optimality residual. This observation allows us to provide an approximation bound for the distance between the fixed-point set of BAPG and the critical point set of GW. Moreover, under a mild technical assumption, we can show that BAPG converges to its fixed point set. The effectiveness of BAPG has been validated through comprehensive numerical experiments in graph alignment and partition tasks, where it outperforms existing methods in terms of both solution quality and wall-clock time. △ Less

Submitted 12 March, 2023; originally announced March 2023.

Comments: Accepted by ICLR 2023

arXiv:2302.13983 [pdf, other]

Elastostatics with multi-layer metamaterial structures and an algebraic framework for polariton resonances

Authors: Youjun Deng, Lingzheng Kong, Hongyu Liu, Liyan Zhu

Abstract: Multi-layer structures are ubiquitous in constructing metamaterial devices to realise various frontier applications including super-resolution imaging and invisibility cloaking. In this paper, we develop a general mathematical framework for studying elastostatics within multi-layer material structures in $\mathbb{R}^d$, $d=2,3$. The multi-layer structure is formed by concentric balls and each laye… ▽ More Multi-layer structures are ubiquitous in constructing metamaterial devices to realise various frontier applications including super-resolution imaging and invisibility cloaking. In this paper, we develop a general mathematical framework for studying elastostatics within multi-layer material structures in $\mathbb{R}^d$, $d=2,3$. The multi-layer structure is formed by concentric balls and each layer is filled by either a regular elastic material or an elastic metamaterial. The number of layers can be arbitrary and the material parameters in each layer may be different from one another. In practice, the multi-layer structure can serve as the building block for various material devices. Considering the im**ement of an incident field on the multi-layer structure, we first derive the exact perturbed field in terms of an elastic momentum matrix, whose dimension is the same as the number of layers. By highly intricate and delicate analysis, we derive a comprehensive study of the spectral properties of the elastic momentum matrix. This enables us to establishe a handy algebraic framework for studying polariton resonances associated with multi-layer metamaterial structures, which forms the fundamental basis for many metamaterial applications. △ Less

Submitted 27 January, 2023; originally announced February 2023.

arXiv:2211.09955 [pdf, other]

Emergence of a stochastic resonance in machine learning

Authors: Zheng-Meng Zhai, Ling-Wei Kong, Ying-Cheng Lai

Abstract: Can noise be beneficial to machine-learning prediction of chaotic systems? Utilizing reservoir computers as a paradigm, we find that injecting noise to the training data can induce a stochastic resonance with significant benefits to both short-term prediction of the state variables and long-term prediction of the attractor of the system. A key to inducing the stochastic resonance is to include the… ▽ More Can noise be beneficial to machine-learning prediction of chaotic systems? Utilizing reservoir computers as a paradigm, we find that injecting noise to the training data can induce a stochastic resonance with significant benefits to both short-term prediction of the state variables and long-term prediction of the attractor of the system. A key to inducing the stochastic resonance is to include the amplitude of the noise in the set of hyperparameters for optimization. By so doing, the prediction accuracy, stability and horizon can be dramatically improved. The stochastic resonance phenomenon is demonstrated using two prototypical high-dimensional chaotic systems. △ Less

Submitted 15 November, 2022; originally announced November 2022.

Comments: 7 pages, 4 figures

arXiv:2211.02323 [pdf, ps, other]

Canonical nilpotent structure under bounded Ricci curvature and Reifenberg local covering geometry over regular limits

Authors: Zuohai Jiang, Lingling Kong, Shicheng Xu

Abstract: It is known that a closed collapsed Riemannian $n$-manifold $(M,g)$ of bounded Ricci curvature and Reifenberg local covering geometry admits a nilpotent structure in the sense of Cheeger-Fukaya-Gromov with respect to a smoothed metric $g(t)$. We prove that a canonical nilpotent structure over a regular limit space that describes the collapsing of original metric $g$ can be defined and uniquely det… ▽ More It is known that a closed collapsed Riemannian $n$-manifold $(M,g)$ of bounded Ricci curvature and Reifenberg local covering geometry admits a nilpotent structure in the sense of Cheeger-Fukaya-Gromov with respect to a smoothed metric $g(t)$. We prove that a canonical nilpotent structure over a regular limit space that describes the collapsing of original metric $g$ can be defined and uniquely determined up to a conjugation, and prove that the nilpotent structures arising from nearby metrics $g_ε$ with respect to $g_ε$'s sectional curvature bound are equivalent to the canonical one. △ Less

Submitted 4 November, 2022; originally announced November 2022.

Comments: 26 pages

MSC Class: 53C23; 53C21; 53C20

arXiv:2208.07865 [pdf, ps, other]

doi 10.4310/ATMP.2023.v27.n2.a5

String Condensations in 3+1D and Lagrangian Algebras

Authors: Jiaheng Zhao, Jia-Qi Lou, Zhi-Hao Zhang, Ling-Yan Hung, Liang Kong, Yin Tian

Abstract: We present three Lagrangian algebras in the modular 2-category associated to the 3+1D $\mathbb{Z}_2$ topological order and discuss their physical interpretations, connecting algebras with gapped boundary conditions, and interestingly, maps (braided autoequivalences) exchanging algebras with bulk domain walls. A Lagrangian algebra, together with its modules and local modules, encapsulates detailed… ▽ More We present three Lagrangian algebras in the modular 2-category associated to the 3+1D $\mathbb{Z}_2$ topological order and discuss their physical interpretations, connecting algebras with gapped boundary conditions, and interestingly, maps (braided autoequivalences) exchanging algebras with bulk domain walls. A Lagrangian algebra, together with its modules and local modules, encapsulates detailed physical data of strings condensing at a gapped boundary. In particular, the condensed strings can terminate at boundaries in non-trivial ways. This phenomenon has no lower dimensional analogue and corresponds to novel mathematical structures associated to higher algebras. We provide a layered construction and also explicit lattice realizations of these boundaries and illustrate the correspondence between physics and mathematics of these boundary conditions. This is a first detailed study of the mathematics of Lagrangian algebras in modular 2-categories and their corresponding physics, that brings together rich phenomena of string condensations, gapped boundaries and domain walls in 3+1D topological orders. △ Less

Submitted 6 February, 2023; v1 submitted 16 August, 2022; originally announced August 2022.

Comments: 7+17 pages, 16 figures. Comments are welcome

Journal ref: Adv. Theor. Math. Phys. Volume 27, Number 2, 583--622, 2023

arXiv:2205.14173 [pdf, other]

Momentum Stiefel Optimizer, with Applications to Suitably-Orthogonal Attention, and Optimal Transport

Authors: Lingkai Kong, Yuqing Wang, Molei Tao

Abstract: The problem of optimization on Stiefel manifold, i.e., minimizing functions of (not necessarily square) matrices that satisfy orthogonality constraints, has been extensively studied. Yet, a new approach is proposed based on, for the first time, an interplay between thoughtfully designed continuous and discrete dynamics. It leads to a gradient-based optimizer with intrinsically added momentum. This… ▽ More The problem of optimization on Stiefel manifold, i.e., minimizing functions of (not necessarily square) matrices that satisfy orthogonality constraints, has been extensively studied. Yet, a new approach is proposed based on, for the first time, an interplay between thoughtfully designed continuous and discrete dynamics. It leads to a gradient-based optimizer with intrinsically added momentum. This method exactly preserves the manifold structure but does not require additional operation to keep momentum in the changing (co)tangent space, and thus has low computational cost and pleasant accuracy. Its generalization to adaptive learning rates is also demonstrated. Notable performances are observed in practical tasks. For instance, we found that placing orthogonal constraints on attention heads of trained-from-scratch Vision Transformer [Dosovitskiy et al. 2022] could markedly improve its performance, when our optimizer is used, and it is better that each head is made orthogonal within itself but not necessarily to other heads. This optimizer also makes the useful notion of Projection Robust Wasserstein Distance [Paty & Cuturi 2019; Lin et al. 2020] for high-dim. optimal transport even more effective. △ Less

Submitted 2 March, 2023; v1 submitted 27 May, 2022; originally announced May 2022.

Comments: Code: https://github.com/konglk1203/VariationalStiefelOptimizer

Journal ref: ICLR 2023

arXiv:2205.05565 [pdf, ps, other]

An invitation to topological orders and category theory

Authors: Liang Kong, Zhi-Hao Zhang

Abstract: Although it has been a well-known fact, for more than two decades, that category theory is needed for the study of topological orders, it is still a non-trivial challenge for students and working physicists to master the abstract language of category theory. In this work, for those who have no background in category theory, we explain in great details how the structure of a (braided) fusion catego… ▽ More Although it has been a well-known fact, for more than two decades, that category theory is needed for the study of topological orders, it is still a non-trivial challenge for students and working physicists to master the abstract language of category theory. In this work, for those who have no background in category theory, we explain in great details how the structure of a (braided) fusion category naturally emerges from lattice models and physical intuitions. Moreover, we show that nearly all mathematical notions and constructions in fusion categories and its representation theory, such as (monoidal) functors, Drinfeld center, module categories, Morita equivalence, condensation completion and fusion 2-categories, naturally emerge from lattice models and physical intuitions. In this process, we also introduce some basic notions and important results of topological orders. △ Less

Submitted 31 May, 2022; v1 submitted 11 May, 2022; originally announced May 2022.

Comments: 138 pages, 73 figures. Correct some mistakes and imprecise sentences in section 3.4.6 in version 1. Comments are welcome

arXiv:2205.00373 [pdf, ps, other]

Convergence of Ricci-limit spaces under bounded Ricci curvature and local covering geometry I

Authors: Zuohai Jiang, Lingling Kong, Shicheng Xu

Abstract: We extend Cheeger-Gromov's and Anderson's convergence theorems to regular limit spaces of manifolds with bounded Ricci curvature and local covering geometry, by establishing the $C^{1,α}$-regularities that are the best one may expect on those Ricci-limit spaces. As an application we prove an optimal generalization of Fukaya's fibration theorem on collapsed manifolds with bounded Ricci curvature, w… ▽ More We extend Cheeger-Gromov's and Anderson's convergence theorems to regular limit spaces of manifolds with bounded Ricci curvature and local covering geometry, by establishing the $C^{1,α}$-regularities that are the best one may expect on those Ricci-limit spaces. As an application we prove an optimal generalization of Fukaya's fibration theorem on collapsed manifolds with bounded Ricci curvature, which also improves the original version to $C^{1,α}$ limit spaces. △ Less

Submitted 30 April, 2022; originally announced May 2022.

Comments: 39pages

MSC Class: 53C23; 53C21; 53C20; 5324

arXiv:2201.05726 [pdf, ps, other]

Categories of quantum liquids III

Authors: Liang Kong, Hao Zheng

Abstract: We continue our study of the categories of quantum liquids started in a previous work. We combine local quantum symmetries with topological skeletons into a single mathematical theory of topological nets and defect nets. In particular, we introduce the notion of a topological net, which is motivated from and generalizes that of a conformal net, and the notion of a defect net which generalizes that… ▽ More We continue our study of the categories of quantum liquids started in a previous work. We combine local quantum symmetries with topological skeletons into a single mathematical theory of topological nets and defect nets. In particular, we introduce the notion of a topological net, which is motivated from and generalizes that of a conformal net, and the notion of a defect net which generalizes that of a defect between conformal nets. We give explicit examples of them. Moreover, we construct the category of topological $n$-nets with $k$-morphisms defined by defect $n$-nets of codimension $k$, and show that the category of $n$D quantum liquids can be extracted from it and computed explicitly via the condensation theory of topological nets. △ Less

Submitted 14 January, 2022; originally announced January 2022.

Comments: 28 pages

arXiv:2112.05368 [pdf, other]

Sample Average Approximation for Stochastic Optimization with Dependent Data: Performance Guarantees and Tractability

Authors: Yafei Wang, Bo Pan, Wei Tu, Peng Liu, Bei Jiang, Chao Gao, Wei Lu, Shangling Jui, Linglong Kong

Abstract: Sample average approximation (SAA), a popular method for tractably solving stochastic optimization problems, enjoys strong asymptotic performance guarantees in settings with independent training samples. However, these guarantees are not known to hold generally with dependent samples, such as in online learning with time series data or distributed computing with Markovian training samples. In this… ▽ More Sample average approximation (SAA), a popular method for tractably solving stochastic optimization problems, enjoys strong asymptotic performance guarantees in settings with independent training samples. However, these guarantees are not known to hold generally with dependent samples, such as in online learning with time series data or distributed computing with Markovian training samples. In this paper, we show that SAA remains tractable when the distribution of unknown parameters is only observable through dependent instances and still enjoys asymptotic consistency and finite sample guarantees. Specifically, we provide a rigorous probability error analysis to derive $1 - β$ confidence bounds for the out-of-sample performance of SAA estimators and show that these estimators are asymptotically consistent. We then, using monotone operator theory, study the performance of a class of stochastic first-order algorithms trained on a dependent source of data. We show that approximation error for these algorithms is bounded and concentrates around zero, and establish deviation bounds for iterates when the underlying stochastic process is $φ$-mixing. The algorithms presented can be used to handle numerically inconvenient loss functions such as the sum of a smooth and non-smooth function or of non-smooth functions with constraints. To illustrate the usefulness of our results, we present several stochastic versions of popular algorithms such as stochastic proximal gradient descent (S-PGD), stochastic relaxed Peaceman--Rachford splitting algorithms (S-rPRS), and numerical experiment. △ Less

Submitted 10 December, 2021; originally announced December 2021.

arXiv:2110.08896 [pdf, other]

Damped Anderson Mixing for Deep Reinforcement Learning: Acceleration, Convergence, and Stabilization

Authors: Ke Sun, Yafei Wang, Yi Liu, Yingnan Zhao, Bo Pan, Shangling Jui, Bei Jiang, Linglong Kong

Abstract: Anderson mixing has been heuristically applied to reinforcement learning (RL) algorithms for accelerating convergence and improving the sampling efficiency of deep RL. Despite its heuristic improvement of convergence, a rigorous mathematical justification for the benefits of Anderson mixing in RL has not yet been put forward. In this paper, we provide deeper insights into a class of acceleration s… ▽ More Anderson mixing has been heuristically applied to reinforcement learning (RL) algorithms for accelerating convergence and improving the sampling efficiency of deep RL. Despite its heuristic improvement of convergence, a rigorous mathematical justification for the benefits of Anderson mixing in RL has not yet been put forward. In this paper, we provide deeper insights into a class of acceleration schemes built on Anderson mixing that improve the convergence of deep RL algorithms. Our main results establish a connection between Anderson mixing and quasi-Newton methods and prove that Anderson mixing increases the convergence radius of policy iteration schemes by an extra contraction factor. The key focus of the analysis roots in the fixed-point iteration nature of RL. We further propose a stabilization strategy by introducing a stable regularization term in Anderson mixing and a differentiable, non-expansive MellowMax operator that can allow both faster convergence and more stable behavior. Extensive experiments demonstrate that our proposed method enhances the convergence, stability, and performance of RL algorithms. △ Less

Submitted 20 October, 2021; v1 submitted 17 October, 2021; originally announced October 2021.

arXiv:2108.08835 [pdf, other]

doi 10.1007/JHEP03(2022)022

One dimensional gapped quantum phases and enriched fusion categories

Authors: Liang Kong, Xiao-Gang Wen, Hao Zheng

Abstract: In this work, we use Ising chain and Kitaev chain to check the validity of an earlier proposal in arXiv:2011.02859 that enriched fusion (higher) categories provide a unified categorical description of all gapped/gapless quantum liquid phases, including symmetry-breaking phases, topological orders, SPT/SET orders and certain gapless quantum phases. In particular, we show explicitly that, in each ga… ▽ More In this work, we use Ising chain and Kitaev chain to check the validity of an earlier proposal in arXiv:2011.02859 that enriched fusion (higher) categories provide a unified categorical description of all gapped/gapless quantum liquid phases, including symmetry-breaking phases, topological orders, SPT/SET orders and certain gapless quantum phases. In particular, we show explicitly that, in each gapped phase realized by these two models, the spacetime observables form a fusion category enriched in a braided fusion category. We also study the categorical descriptions of the boundaries of these models. In the end, we provide a classification of and the categorical descriptions of all 1-dimensional (the spatial dimension) gapped quantum phases with a finite onsite symmetry. △ Less

Submitted 10 March, 2022; v1 submitted 19 August, 2021; originally announced August 2021.

Comments: 27 pages. We add some remarks and references

Journal ref: J. High Energ. Phys. 2022, 22 (2022)

arXiv:2108.06605 [pdf, other]

doi 10.1016/j.cam.2022.114872

Gradient Projection Newton Algorithm for Sparse Collaborative Learning Using Synthetic and Real Datasets of Applications

Authors: Jun Sun, Lingchen Kong, Shenglong Zhou

Abstract: Exploring the relationship among multiple sets of data from one same group enables practitioners to make better decisions in medical science and engineering. In this paper, we propose a sparse collaborative learning (SCL) model, an optimization with double-sparsity constraints, to process the problem with two sets of data and a shared response variable. It is capable of dealing with the classifica… ▽ More Exploring the relationship among multiple sets of data from one same group enables practitioners to make better decisions in medical science and engineering. In this paper, we propose a sparse collaborative learning (SCL) model, an optimization with double-sparsity constraints, to process the problem with two sets of data and a shared response variable. It is capable of dealing with the classification problems or the regression problems dependent on the discreteness of the response variable as well as exploring the relationship between two datasets simultaneously. To solve SCL, we first present some necessary and sufficient optimality conditions and then design a gradient projection Newton algorithm which has proven to converge to a unique locally optimal solution globally with at least a quadratic convergence rate. Finally, the reported numerical experiments illustrate the efficiency of the proposed method. △ Less

Submitted 13 November, 2022; v1 submitted 14 August, 2021; originally announced August 2021.

Journal ref: Journal of Computational and Applied Mathematics 2022

arXiv:2107.03858 [pdf, ps, other]

Categories of quantum liquids II

Authors: Liang Kong, Hao Zheng

Abstract: We continue to develop the theory of separable higher categories, including center functors, higher centralizers, modular extensions and group theoretical higher fusion categories. Moreover, we outline a theory of orthogonal higher categories to treat anti-unitary symmetries. Using these results we derive a systematic classification of gapped quantum liquids and predict many new SPT orders in spac… ▽ More We continue to develop the theory of separable higher categories, including center functors, higher centralizers, modular extensions and group theoretical higher fusion categories. Moreover, we outline a theory of orthogonal higher categories to treat anti-unitary symmetries. Using these results we derive a systematic classification of gapped quantum liquids and predict many new SPT orders in spacetime dimension $\ge3$. △ Less

Submitted 29 November, 2022; v1 submitted 8 July, 2021; originally announced July 2021.

Comments: 27 pages. Major revision

arXiv:2106.11970 [pdf, other]

Learned Interpretable Residual Extragradient ISTA for Sparse Coding

Authors: Lin Kong, Wei Sun, Fanhua Shang, Yuanyuan Liu, Hongying Liu

Abstract: Recently, the study on learned iterative shrinkage thresholding algorithm (LISTA) has attracted increasing attentions. A large number of experiments as well as some theories have proved the high efficiency of LISTA for solving sparse coding problems. However, existing LISTA methods are all serial connection. To address this issue, we propose a novel extragradient based LISTA (ELISTA), which has a… ▽ More Recently, the study on learned iterative shrinkage thresholding algorithm (LISTA) has attracted increasing attentions. A large number of experiments as well as some theories have proved the high efficiency of LISTA for solving sparse coding problems. However, existing LISTA methods are all serial connection. To address this issue, we propose a novel extragradient based LISTA (ELISTA), which has a residual structure and theoretical guarantees. In particular, our algorithm can also provide the interpretability for Res-Net to a certain extent. From a theoretical perspective, we prove that our method attains linear convergence. In practice, extensive empirical results verify the advantages of our method. △ Less

Submitted 22 June, 2021; originally announced June 2021.

Comments: Accepted for presentation at the ICML Workshop on Theoretic Foundation, Criticism, and Application Trend of Explainable AI

arXiv:2104.14063 [pdf, ps, other]

Nonuniform Berry-Esseen bound for self-normalized martingales

Authors: Songqi Wu, Lingjie Kong

Abstract: We give a nonuniform Berry-Esseen bound for self-normalized martingales, which bridges the gap between the result of Haeusler (1988) and Fan and Shao (2018). The bound coincides with the nonuniform Berry-Esseen bound of Haeusler and Joos (1988) for standardized martingales. As a consequence, a Berry-Esseen bound is obtained. We give a nonuniform Berry-Esseen bound for self-normalized martingales, which bridges the gap between the result of Haeusler (1988) and Fan and Shao (2018). The bound coincides with the nonuniform Berry-Esseen bound of Haeusler and Joos (1988) for standardized martingales. As a consequence, a Berry-Esseen bound is obtained. △ Less

Submitted 28 April, 2021; originally announced April 2021.

arXiv:2104.11957 [pdf, ps, other]

Positive solutions for a coupled nonlinear Kirchhoff-type system with vanishing potentials

Authors: Lingzheng Kong, Haibo Chen

Abstract: In this paper, we consider the strongly coupled nonlinear Kirchhoff-type system with vanshing potentials: \begin{equation*}\begin{cases} -\left(a_1+b_1\int_{\mathbb{R}^3}|\nabla u|^2\dx\right)Δu+λV(x)u=\fracα{α+β}|u|^{α-2}u|v|^β,&x\in\mathbb{R}^3,\\ -\left(a_2+b_2\int_{\mathbb{R}^3}|\nabla v|^2\dx\right)Δv+λW(x)v=\fracβ{α+β}|u|^α|v|^{β-2}v,&x\in\mathbb{R}^3,\\ u,v\in \mathcal{D}^{1,2}(\R^3), \end{… ▽ More In this paper, we consider the strongly coupled nonlinear Kirchhoff-type system with vanshing potentials: \begin{equation*}\begin{cases} -\left(a_1+b_1\int_{\mathbb{R}^3}|\nabla u|^2\dx\right)Δu+λV(x)u=\fracα{α+β}|u|^{α-2}u|v|^β,&x\in\mathbb{R}^3,\\ -\left(a_2+b_2\int_{\mathbb{R}^3}|\nabla v|^2\dx\right)Δv+λW(x)v=\fracβ{α+β}|u|^α|v|^{β-2}v,&x\in\mathbb{R}^3,\\ u,v\in \mathcal{D}^{1,2}(\R^3), \end{cases}\end{equation*} where $a_i>0$ are constants, $λ,b_i>0$ are parameters for $i=1,2$, $α,β>1$ and $α+β\leqslant 4$, $V(x)$, $W(x)$ are nonnegative continuous potentials, the nonlinear term $F(x,u,v)=|u|^α|v|^β$ is not 4-superlinear at infinity. Such problem cannot be studied directly by standard variational methods, even by restricting the associated energy functional on the Nehari manifold, because Palais-Smale sequences may not be bounded. Combining some new detailed estimates with truncation technique, we obtain the existence of positive vector solutions for the above system when $b_1+b_2$ small and $λ$ large. Moreover, the asymptotic behavior of these vector solutions is also explored as $\textbf{b}=(b_1,b_2)\to \bf{0}$ and $λ\to\infty$. In particular, our results extend some known ones in previous papers that only deals with the case where $4<α+β<6$. △ Less

Submitted 2 October, 2022; v1 submitted 24 April, 2021; originally announced April 2021.

arXiv:2104.03121 [pdf, other]

Enriched monoidal categories I: centers

Authors: Liang Kong, Wei Yuan, Zhi-Hao Zhang, Hao Zheng

Abstract: This work is the first one in a series, in which we develop a mathematical theory of enriched (braided) monoidal categories and their representations. In this work, we introduce the notion of the $E_0$-center ($E_1$-center or $E_2$-center) of an enriched (monoidal or braided monoidal) category, and compute the centers explicitly when the enriched (braided monoidal or monoidal) categories are obtai… ▽ More This work is the first one in a series, in which we develop a mathematical theory of enriched (braided) monoidal categories and their representations. In this work, we introduce the notion of the $E_0$-center ($E_1$-center or $E_2$-center) of an enriched (monoidal or braided monoidal) category, and compute the centers explicitly when the enriched (braided monoidal or monoidal) categories are obtained from the canonical constructions. These centers have important applications in the mathematical theory of gapless boundaries of 2+1D topological orders and that of topological phase transitions in physics. They also play very important roles in the higher representation theory, which is the focus of the second work in the series. △ Less

Submitted 28 April, 2024; v1 submitted 7 April, 2021; originally announced April 2021.

Comments: 56 pages. published version

arXiv:2102.04814 [pdf, ps, other]

Categorical computation

Authors: Liang Kong, Hao Zheng

Abstract: In quantum computing, the computation is achieved by linear operators in or between Hilbert spaces. In this work, we explore a new computation scheme, in which the linear operators in quantum computing are replaced by (higher) functors between two (higher) categories. If from Turing computing to quantum computing is the first quantization of computation, then this new scheme can be viewed as the s… ▽ More In quantum computing, the computation is achieved by linear operators in or between Hilbert spaces. In this work, we explore a new computation scheme, in which the linear operators in quantum computing are replaced by (higher) functors between two (higher) categories. If from Turing computing to quantum computing is the first quantization of computation, then this new scheme can be viewed as the second quantization of computation. The fundamental problem in realizing this idea is how to realize a (higher) functor physically. We provide a theoretical idea of realizing (higher) functors physically based on the physics of topological orders. △ Less

Submitted 6 February, 2023; v1 submitted 9 February, 2021; originally announced February 2021.

Comments: 11 pages, comments are welcome

Journal ref: Front. Phys. 18(2), 21302 (2023)

arXiv:2012.09395 [pdf, other]

l1-norm quantile regression screening rule via the dual circumscribed sphere

Authors: Pan Shang, Lingchen Kong

Abstract: l1-norm quantile regression is a common choice if there exists outlier or heavy-tailed error in high-dimensional data sets. However, it is computationally expensive to solve this problem when the feature size of data is ultra high. As far as we know, existing screening rules can not speed up the computation of the l1-norm quantile regression, which dues to the non-differentiability of the quantile… ▽ More l1-norm quantile regression is a common choice if there exists outlier or heavy-tailed error in high-dimensional data sets. However, it is computationally expensive to solve this problem when the feature size of data is ultra high. As far as we know, existing screening rules can not speed up the computation of the l1-norm quantile regression, which dues to the non-differentiability of the quantile function/pinball loss. In this paper, we introduce the dual circumscribed sphere technique and propose a novel l1-norm quantile regression screening rule. Our rule is expressed as the closed-form function of given data and eliminates inactive features with a low computational cost. Numerical experiments on some simulation and real data sets show that this screening rule can be used to eliminate almost all inactive features. Moreover, this rule can help to reduce up to 23 times of computational time, compared with the computation without our screening rule. △ Less

Submitted 16 December, 2020; originally announced December 2020.

arXiv:2012.08978 [pdf, ps, other]

Semiclassical solutions for critical Schrödinger-Poisson systems involving multiple competing potentials

Authors: Lingzheng Kong, Haibo Chen

Abstract: In this paper, a class of Schrödinger-Poisson system involving multiple competing potentials and critical Sobolev exponent is considered. Such a problem cannot be studied with the same argument of the nonlinear term with only a positive potential, because the weight potentials set $\{Q_i(x)|1\le i \le m\}$ contains nonpositive, sign-changing, and nonnegative elements. By introducing the ground ene… ▽ More In this paper, a class of Schrödinger-Poisson system involving multiple competing potentials and critical Sobolev exponent is considered. Such a problem cannot be studied with the same argument of the nonlinear term with only a positive potential, because the weight potentials set $\{Q_i(x)|1\le i \le m\}$ contains nonpositive, sign-changing, and nonnegative elements. By introducing the ground energy function and subtle analysis, we first prove the existence of ground state solution $v_\varepsilon$ in the semiclassical limit via the Nehari manifold and concentration-compactness principle. Then we show that $v_\varepsilon$ converges to the ground state solution of the associated limiting problem and concentrates at a concrete set characterized by the potentials. At the same time, some properties for the ground state solution are also studied. Moreover, a sufficient condition for the nonexistence of the ground state solution is obtained. △ Less

Submitted 16 December, 2020; originally announced December 2020.

arXiv:2012.01545 [pdf, other]

Machine learning prediction of critical transition and system collapse

Authors: Ling-Wei Kong, Hua-Wei Fan, Celso Grebogi, Ying-Cheng Lai

Abstract: To predict a critical transition due to parameter drift without relying on model is an outstanding problem in nonlinear dynamics and applied fields. A closely related problem is to predict whether the system is already in or if the system will be in a transient state preceding its collapse. We develop a model free, machine learning based solution to both problems by exploiting reservoir computing… ▽ More To predict a critical transition due to parameter drift without relying on model is an outstanding problem in nonlinear dynamics and applied fields. A closely related problem is to predict whether the system is already in or if the system will be in a transient state preceding its collapse. We develop a model free, machine learning based solution to both problems by exploiting reservoir computing to incorporate a parameter input channel. We demonstrate that, when the machine is trained in the normal functioning regime with a chaotic attractor (i.e., before the critical transition), the transition point can be predicted accurately. Remarkably, for a parameter drift through the critical point, the machine with the input parameter channel is able to predict not only that the system will be in a transient state, but also the average transient time before the final collapse. △ Less

Submitted 2 December, 2020; originally announced December 2020.

Comments: 5 pages, 3 figures

arXiv:2011.10597 [pdf, other]

Synchronization within synchronization: transients and intermittency in ecological networks

Authors: Huawei Fan, Ling-Wei Kong, Xingang Wang, Alan Hastings, Ying-Cheng Lai

Abstract: Transients are fundamental to ecological systems with significant implications to management, conservation, and biological control. We uncover a type of transient synchronization behavior in spatial ecological networks whose local dynamics are of the chaotic, predator-prey type. In the parameter regime where there is phase synchronization among all the patches, complete synchronization (i.e., sync… ▽ More Transients are fundamental to ecological systems with significant implications to management, conservation, and biological control. We uncover a type of transient synchronization behavior in spatial ecological networks whose local dynamics are of the chaotic, predator-prey type. In the parameter regime where there is phase synchronization among all the patches, complete synchronization (i.e., synchronization in both phase and amplitude) can arise in certain pairs of patches as determined by the network symmetry - henceforth the phenomenon of "synchronization within synchronization." Distinct patterns of complete synchronization coexist but, due to intrinsic instability or noise, each pattern is a transient and there is random, intermittent switching among the patterns in the course of time evolution. The probability distribution of the transient time is found to follow an algebraic scaling law with a divergent average transient lifetime. Based on symmetry considerations, we develop a stability analysis to understand these phenomena. The general principle of symmetry can also be exploited to explain previously discovered, counterintuitive synchronization behaviors in ecological networks. △ Less

Submitted 20 November, 2020; originally announced November 2020.

Comments: 17 pages, 7 figures

arXiv:2011.02859 [pdf, ps, other]

doi 10.1007/JHEP08(2022)070

Categories of quantum liquids I

Authors: Liang Kong, Hao Zheng

Abstract: We develop a mathematical theory of separable higher categories based on Gaiotto and Johnson-Freyd's work on condensation completion. Based on this theory, we prove some fundamental results on $E_m$-multi-fusion higher categories and their higher centers. We also outline a theory of unitary higher categories based on a $*$-version of condensation completion. After these mathematical preparations,… ▽ More We develop a mathematical theory of separable higher categories based on Gaiotto and Johnson-Freyd's work on condensation completion. Based on this theory, we prove some fundamental results on $E_m$-multi-fusion higher categories and their higher centers. We also outline a theory of unitary higher categories based on a $*$-version of condensation completion. After these mathematical preparations, based on the idea of topological Wick rotation, we develop a unified mathematical theory of all quantum liquids, which include topological orders, SPT/SET orders, symmetry-breaking orders and CFT-like gapless phases. We explain that a quantum liquid consists of two parts, the topological skeleton and the local quantum symmetry, and show that all $n$D quantum liquids form a $*$-condensation complete higher category whose equivalence type can be computed explicitly from a simple coslice 1-category. △ Less

Submitted 15 August, 2023; v1 submitted 5 November, 2020; originally announced November 2020.

Comments: 36 pages, a minor revision, including a correction of an typo in Theorem 5.5, an illustrative picture in Hypothesis 5.3 and refinement of a few sentences

Journal ref: J. High Energ. Phys. 2022, 70 (2022)

arXiv:2009.06564 [pdf, other]

doi 10.1007/JHEP12(2020)078

Defects in the 3-dimensional toric code model form a braided fusion 2-category

Authors: Liang Kong, Yin Tian, Zhi-Hao Zhang

Abstract: It was well known that there are $e$-particles and $m$-strings in the 3-dimensional (spatial dimension) toric code model, which realizes the 3-dimensional $\mathbb{Z}_2$ topological order. Recent mathematical result, however, shows that there are additional string-like topological defects in the 3-dimensional $\mathbb{Z}_2$ topological order. In this work, we construct all topological defects of c… ▽ More It was well known that there are $e$-particles and $m$-strings in the 3-dimensional (spatial dimension) toric code model, which realizes the 3-dimensional $\mathbb{Z}_2$ topological order. Recent mathematical result, however, shows that there are additional string-like topological defects in the 3-dimensional $\mathbb{Z}_2$ topological order. In this work, we construct all topological defects of codimension 2 and higher, and show that they form a braided fusion 2-category satisfying a braiding non-degeneracy condition. △ Less

Submitted 27 September, 2020; v1 submitted 14 September, 2020; originally announced September 2020.

Comments: 30 pages, 29 figures. Comments are welcome

Journal ref: J. High Energ. Phys. 2020, 78 (2020)

arXiv:2004.04769 [pdf, other]

doi 10.1103/PhysRevResearch.2.023196

Scaling law of transient lifetime of chimera states under dimension-augmenting perturbations

Authors: Ling-Wei Kong, Ying-Cheng Lai

Abstract: Chimera states arising in the classic Kuramoto system of two-dimensional phase coupled oscillators are transient but they are "long" transients in the sense that the average transient lifetime grows exponentially with the system size. For reasonably large systems, e.g., those consisting of a few hundreds oscillators, it is infeasible to numerically calculate or experimentally measure the average l… ▽ More Chimera states arising in the classic Kuramoto system of two-dimensional phase coupled oscillators are transient but they are "long" transients in the sense that the average transient lifetime grows exponentially with the system size. For reasonably large systems, e.g., those consisting of a few hundreds oscillators, it is infeasible to numerically calculate or experimentally measure the average lifetime, so the chimera states are practically permanent. We find that small perturbations in the third dimension, which make system "slightly" three-dimensional, will reduce dramatically the transient lifetime. In particular, under such a perturbation, the practically infinite average transient lifetime will become extremely short, because it scales with the magnitude of the perturbation only logarithmically. Physically, this means that a reduction in the perturbation strength over many orders of magnitude, insofar as it is not zero, would result in only an incremental increase in the lifetime. The uncovered type of fragility of chimera states raises concerns about their observability in physical systems. △ Less

Submitted 13 April, 2020; v1 submitted 9 April, 2020; originally announced April 2020.

Comments: 15 pages, 13 figures

Journal ref: Phys. Rev. Research 2, 023196 (2020)

arXiv:2003.08898 [pdf, other]

doi 10.1007/JHEP09(2020)093

Classification of topological phases with finite internal symmetries in all dimensions

Authors: Liang Kong, Tian Lan, Xiao-Gang Wen, Zhi-Hao Zhang, Hao Zheng

Abstract: We develop a mathematical theory of symmetry protected trivial (SPT) orders and anomaly-free symmetry enriched topological (SET) orders in all dimensions via two different approaches with an emphasis on the second approach. The first approach is to gauge the symmetry in the same dimension by adding topological excitations as it was done in the 2d case, in which the gauging process is mathematicall… ▽ More We develop a mathematical theory of symmetry protected trivial (SPT) orders and anomaly-free symmetry enriched topological (SET) orders in all dimensions via two different approaches with an emphasis on the second approach. The first approach is to gauge the symmetry in the same dimension by adding topological excitations as it was done in the 2d case, in which the gauging process is mathematically described by the minimal modular extensions of unitary braided fusion 1-categories. This 2d result immediately generalizes to all dimensions except in 1d, which is treated with special care. The second approach is to use the 1-dimensional higher bulk of the SPT/SET order and the boundary-bulk relation. This approach also leads us to a precise mathematical description and a classification of SPT/SET orders in all dimensions. The equivalence of these two approaches, together with known physical results, provides us with many precise mathematical predictions. △ Less

Submitted 10 September, 2020; v1 submitted 19 March, 2020; originally announced March 2020.

Comments: 41 pages, 6 figures; add more results on anomalies; final version to appear in JHEP

Journal ref: J. High Energ. Phys. 2020, 93 (2020)

arXiv:2002.06189 [pdf, other]

Stochasticity of Deterministic Gradient Descent: Large Learning Rate for Multiscale Objective Function

Authors: Lingkai Kong, Molei Tao

Abstract: This article suggests that deterministic Gradient Descent, which does not use any stochastic gradient approximation, can still exhibit stochastic behaviors. In particular, it shows that if the objective function exhibit multiscale behaviors, then in a large learning rate regime which only resolves the macroscopic but not the microscopic details of the objective, the deterministic GD dynamics can b… ▽ More This article suggests that deterministic Gradient Descent, which does not use any stochastic gradient approximation, can still exhibit stochastic behaviors. In particular, it shows that if the objective function exhibit multiscale behaviors, then in a large learning rate regime which only resolves the macroscopic but not the microscopic details of the objective, the deterministic GD dynamics can become chaotic and convergent not to a local minimizer but to a statistical distribution. A sufficient condition is also established for approximating this long-time statistical limit by a rescaled Gibbs distribution. Both theoretical and numerical demonstrations are provided, and the theoretical part relies on the construction of a stochastic map that uses bounded noise (as opposed to discretized diffusions). △ Less

Submitted 2 November, 2020; v1 submitted 14 February, 2020; originally announced February 2020.

Comments: NeurIPS 2020. v1->v2: Weakened conditions needed for the theory. Added connections to neural network. Corrected typo

arXiv:1912.13168 [pdf, other]

doi 10.1007/s00220-020-03922-x

Pointed Drinfeld center functor

Authors: Liang Kong, Wei Yuan, Hao Zheng

Abstract: In this work, using the functoriality of Drinfeld center of fusion categories, we generalize an earlier result on the functoriality of full center of simple separable algebras in a fixed fusion category to all fusion categories. This generalization produces a new center functor, which involves both Drinfeld center and full center and will be called the pointed Drinfeld center functor. We prove tha… ▽ More In this work, using the functoriality of Drinfeld center of fusion categories, we generalize an earlier result on the functoriality of full center of simple separable algebras in a fixed fusion category to all fusion categories. This generalization produces a new center functor, which involves both Drinfeld center and full center and will be called the pointed Drinfeld center functor. We prove that this pointed Drinfeld center functor is a symmetric monoidal equivalence. It turns out that this functor provides a precise and rather complete mathematical formulation of the boundary-bulk relation of 1+1D rational conformal field theories (RCFT). In this process, we solve an old problem of computing the fusion of two 0D (or 1D) wall CFT's along a non-trivial 1+1D bulk RCFT. △ Less

Submitted 30 December, 2019; originally announced December 2019.

Comments: 34 pages, 8 figures, comments are welcome

Journal ref: Comm. Math. Phys. 381 (2021), no.3, 1409--1443

arXiv:1912.01760 [pdf, other]

doi 10.1016/j.nuclphysb.2021.115384

A mathematical theory of gapless edges of 2d topological orders. Part II

Authors: Liang Kong, Hao Zheng

Abstract: This is the second part of a two-part work on the unified mathematical theory of gapped and gapless edges of 2+1D topological orders. In Part I, we have developed the mathematical theory of chiral gapless edges. In Part II, we study boundary-bulk relation and non-chiral gapless edges. In particular, we explain how the notion of the center of an enriched monoidal category naturally emerges from the… ▽ More This is the second part of a two-part work on the unified mathematical theory of gapped and gapless edges of 2+1D topological orders. In Part I, we have developed the mathematical theory of chiral gapless edges. In Part II, we study boundary-bulk relation and non-chiral gapless edges. In particular, we explain how the notion of the center of an enriched monoidal category naturally emerges from the boundary-bulk relation. After the study of 0+1D gapless walls, we give the complete boundary-bulk relation for 2+1D topological orders with chiral gapless edges (including gapped edges) and 0d walls between edges. This relation is stated precisely and proved rigorously as a monoidal equivalence, which generalizes the functoriality of the usual Drinfeld center to an enriched setting. We also develop the mathematical theory of non-chiral gapless edges and 0+1D walls, and explain how to gap out certain non-chiral 1+1D gapless edges and 0+1D gapless walls categorically. In the end, we show that all anomaly-free 1+1D boundary-bulk rational CFT's can be recovered from 2d topological orders with chiral gapless edges via a dimensional reduction process. This provides physical meanings to some mysterious connections between mathematical results in fusion categories and those in rational CFT's. △ Less

Submitted 30 March, 2020; v1 submitted 3 December, 2019; originally announced December 2019.

Comments: 54 pages. In Section 7, we add some discussion of the implication of our work to the study of gapless phases in all dimensions

Journal ref: Nucl. Phys. B 966 (2021), 115384

arXiv:1909.12438 [pdf, ps, other]

On a discrete elliptic problem with a weight

Authors: Mohamed Ousbika, Zakaria El Allali, Lingju Kong

Abstract: Using the variational approach and the critical point theory, we established several criteria for the existence of at least one nontrivial solution for a discrete elliptic boundary value problem with a weight $p(\cdot, \cdot)$ and depending on a real parameter $λ$. Using the variational approach and the critical point theory, we established several criteria for the existence of at least one nontrivial solution for a discrete elliptic boundary value problem with a weight $p(\cdot, \cdot)$ and depending on a real parameter $λ$. △ Less

Submitted 26 September, 2019; originally announced September 2019.

arXiv:1909.06960 [pdf, other]

Regularization parameter selection for low rank matrix recovery

Authors: Pan Shang, Lingchen Kong

Abstract: Low rank matrix recovery is the focus of many applications, but it is a NP-hard problem. A popular way to deal with this problem is to solve its convex relaxation, the nuclear norm regularized minimization problem (NRM), which includes LASSO as a special case. There are some regularization parameter selection results for LASSO in vector case, such as screening rules, which improve the efficiency o… ▽ More Low rank matrix recovery is the focus of many applications, but it is a NP-hard problem. A popular way to deal with this problem is to solve its convex relaxation, the nuclear norm regularized minimization problem (NRM), which includes LASSO as a special case. There are some regularization parameter selection results for LASSO in vector case, such as screening rules, which improve the efficiency of the algorithms. However, there are no corresponding parameter selection results for NRM in matrix case. In this paper, we build up a novel rule to choose the regularization parameter for NRM under the help of duality theory. This rule claims that the regularization parameter can be easily chosen by feasible points of NRM and its dual problem, when the rank of the desired solution is no more than a given constant. In particular, we apply this idea to NRM with least square and Huber functions, and establish the easily calculated formula of regularization parameters. Finally, we report numerical results on some signal shapes, which state that our proposed rule shrinks the interval of the regularization parameter efficiently. △ Less

Submitted 15 September, 2019; originally announced September 2019.

arXiv:1905.04924 [pdf, other]

doi 10.1007/JHEP02(2020)150

A mathematical theory of gapless edges of 2d topological orders. Part I

Authors: Liang Kong, Hao Zheng

Abstract: This is the first part of a two-part work on a unified mathematical theory of gapped and gapless edges of 2d topological orders. We analyze all the possible observables on the 1+1D world sheet of a chiral gapless edge of a 2d topological order, and show that these observables form an enriched unitary fusion category, the Drinfeld center of which is precisely the unitary modular tensor category ass… ▽ More This is the first part of a two-part work on a unified mathematical theory of gapped and gapless edges of 2d topological orders. We analyze all the possible observables on the 1+1D world sheet of a chiral gapless edge of a 2d topological order, and show that these observables form an enriched unitary fusion category, the Drinfeld center of which is precisely the unitary modular tensor category associated to the bulk. This mathematical description of a chiral gapless edge automatically includes that of a gapped edge (i.e. a unitary fusion category) as a special case. Therefore, we obtain a unified mathematical description and a classification of both gapped and chiral gapless edges of a given 2d topological order. In the process of our analysis, we encounter an interesting and reoccurring phenomenon: spatial fusion anomaly, which leads us to propose the Principle of Universality at RG fixed points for all quantum field theories. Our theory also implies that all chiral gapless edges can be obtained from a so-called topological Wick rotations. This fact leads us to propose, at the end of this work, a surprising correspondence between gapped and gapless phases in all dimensions. △ Less

Submitted 29 March, 2020; v1 submitted 13 May, 2019; originally announced May 2019.

Comments: 52 pages, 39 figures, we add some discussion on spatial fusion anomalies, and propose how to define a phase transition between two gapless phases via topological Wick rotation, and discuss some interesting implications of our theory to higher dimensional topological orders

Journal ref: J. High Energ. Phys. 2020, 150 (2020)

arXiv:1905.04644 [pdf, ps, other]

doi 10.1016/j.aim.2019.106928

The center of monoidal 2-categories in 3+1D Dijkgraaf-Witten Theory

Authors: Liang Kong, Yin Tian, Shan Zhou

Abstract: In this work, for a finite group $G$ and a 4-cocycle $ω\in Z^4(G, \mathbf{k}^\times)$, we compute explicitly the center of the monoidal 2-category $\operatorname{2Vec}_G^ω$ of $ω$-twisted $G$-graded 1-categories of finite dimensional $\mathbf{k}$-vector spaces. This center gives a precise mathematical description of the topological defects in the associated 3+1D Dijkgraaf-Witten TQFT. We prove tha… ▽ More In this work, for a finite group $G$ and a 4-cocycle $ω\in Z^4(G, \mathbf{k}^\times)$, we compute explicitly the center of the monoidal 2-category $\operatorname{2Vec}_G^ω$ of $ω$-twisted $G$-graded 1-categories of finite dimensional $\mathbf{k}$-vector spaces. This center gives a precise mathematical description of the topological defects in the associated 3+1D Dijkgraaf-Witten TQFT. We prove that this center is a braided monoidal 2-category with a trivial sylleptic center. △ Less

Submitted 31 December, 2019; v1 submitted 12 May, 2019; originally announced May 2019.

Comments: 24 pages

Journal ref: Advances in Mathematics 360 (2020) 106928

arXiv:1903.12334 [pdf, other]

doi 10.1103/PhysRevB.102.045139

A topological phase transition on the edge of the 2d $\mathbb{Z}_2$ topological order

Authors: Wei-Qiang Chen, Chao-Ming Jian, Liang Kong, Yi-Zhuang You, Hao Zheng

Abstract: The unified mathematical theory of gapped and gapless edges of 2d topological orders was developed by two of the authors. It provides a powerful tool to study pure edge topological phase transitions on the edges of 2d topological orders (without altering the bulks). In particular, it implies that the critical points are described by enriched fusion categories. In this work, we illustrate this idea… ▽ More The unified mathematical theory of gapped and gapless edges of 2d topological orders was developed by two of the authors. It provides a powerful tool to study pure edge topological phase transitions on the edges of 2d topological orders (without altering the bulks). In particular, it implies that the critical points are described by enriched fusion categories. In this work, we illustrate this idea in a concrete example: the 2d $\mathbb{Z}_2$ topological order. In particular, we construct an enriched fusion category, which describes a gappable non-chiral gapless edge of the 2d $\mathbb{Z}_2$ topological order; then use an explicit lattice model construction to realize the critical point and, at the same time, all the ingredients of this enriched fusion category. △ Less

Submitted 28 March, 2019; originally announced March 2019.

Comments: 31 pages, 58 figures, Comments are welcome

Journal ref: Phys. Rev. B 102, 045139 (2020)

arXiv:1901.06478 [pdf, other]

Tuning parameter selection rules for nuclear norm regularized multivariate linear regression

Authors: Pan Shang, Lingchen Kong

Abstract: We consider the tuning parameter selection rules for nuclear norm regularized multivariate linear regression (NMLR) in high-dimensional setting. High-dimensional multivariate linear regression is widely used in statistics and machine learning, and regularization technique is commonly applied to deal with the special structures in high-dimensional data. As we know, how to select the tuning paramete… ▽ More We consider the tuning parameter selection rules for nuclear norm regularized multivariate linear regression (NMLR) in high-dimensional setting. High-dimensional multivariate linear regression is widely used in statistics and machine learning, and regularization technique is commonly applied to deal with the special structures in high-dimensional data. As we know, how to select the tuning parameter is an essential issue for regularization approach and it directly affects the model estimation performance. To the best of our knowledge, there are no rules about the tuning parameter selection for NMLR from the point of view of optimization. In order to establish such rules, we study the duality theory of NMLR. Then, we claim the choice of tuning parameter for NMLR is based on the sample data and the solution of NMLR dual problem, which is a projection on a nonempty, closed and convex set. Moreover, based on the (firm) nonexpansiveness and the idempotence of the projection operator, we build four tuning parameter selection rules PSR, PSRi, PSRfn and PSR+. Furthermore, we give a sequence of tuning parameters and the corresponding intervals for every rule, which states that the rank of the estimation coefficient matrix is no more than a fixed number for the tuning parameter in the given interval. The relationships between these rules are also discussed and PSR+ is the most efficient one to select the tuning parameter. Finally, the numerical results are reported on simulation and real data, which show that these four tuning parameter selection rules are valuable. △ Less

Submitted 19 January, 2019; originally announced January 2019.

arXiv:1810.05950 [pdf, ps, other]

A degenerate elliptic system with variable exponents

Authors: Lingju Kong

Abstract: We study a degenerate elliptic system with variable exponents. Using the variational approach and some recent theory on weighted Lebesgue and Sobolev spaces with variable exponents, we prove the existence of at least two distinct nontrivial weak solutions of the system. Several consequences of the main theorem are derived; in particular, the existence of at lease two distinct nontrivial nonnegativ… ▽ More We study a degenerate elliptic system with variable exponents. Using the variational approach and some recent theory on weighted Lebesgue and Sobolev spaces with variable exponents, we prove the existence of at least two distinct nontrivial weak solutions of the system. Several consequences of the main theorem are derived; in particular, the existence of at lease two distinct nontrivial nonnegative solution are established for a scalar degenerate problem. One example is provided to showthe applicability of our results. △ Less

Submitted 13 October, 2018; originally announced October 2018.

Comments: Accepted for publication in SCIENCE CHINA Mathematics

MSC Class: 35J70; 35J20; 35J25; 35J92

arXiv:1808.03774 [pdf, ps, other]

Collapsed Manifolds With Ricci Bounded Covering Geometry

Authors: Hongzhi Huang, Lingling Kong, Xiaochun Rong, Shicheng Xu

Abstract: We study collapsed manifolds with Ricci bounded covering geometry i.e., Ricci curvature is bounded below and the Riemannian universal cover is non-collapsed or consists of uniform Reifenberg points. Via Ricci flows' techniques, we partially extend the nilpotent structural results of Cheeger-Fukaya-Gromov, on collapsed manifolds with (sectional curvature) local bounded covering geometry, to manifol… ▽ More We study collapsed manifolds with Ricci bounded covering geometry i.e., Ricci curvature is bounded below and the Riemannian universal cover is non-collapsed or consists of uniform Reifenberg points. Via Ricci flows' techniques, we partially extend the nilpotent structural results of Cheeger-Fukaya-Gromov, on collapsed manifolds with (sectional curvature) local bounded covering geometry, to manifolds with (global) Ricci boundedcovering geometry. △ Less

Submitted 11 August, 2018; originally announced August 2018.

Showing 1–50 of 99 results for author: Kong, L