-
Detecting and Identifying Selection Structure in Sequential Data
Authors:
Yujia Zheng,
Zeyu Tang,
Yiwen Qiu,
Bernhard Schölkopf,
Kun Zhang
Abstract:
We argue that the selective inclusion of data points based on latent objectives is common in practical situations, such as music sequences. Since this selection process often distorts statistical analysis, previous work primarily views it as a bias to be corrected and proposes various methods to mitigate its effect. However, while controlling this bias is crucial, selection also offers an opportun…
▽ More
We argue that the selective inclusion of data points based on latent objectives is common in practical situations, such as music sequences. Since this selection process often distorts statistical analysis, previous work primarily views it as a bias to be corrected and proposes various methods to mitigate its effect. However, while controlling this bias is crucial, selection also offers an opportunity to provide a deeper insight into the hidden generation process, as it is a fundamental mechanism underlying what we observe. In particular, overlooking selection in sequential data can lead to an incomplete or overcomplicated inductive bias in modeling, such as assuming a universal autoregressive structure for all dependencies. Therefore, rather than merely viewing it as a bias, we explore the causal structure of selection in sequential data to delve deeper into the complete causal process. Specifically, we show that selection structure is identifiable without any parametric assumptions or interventional experiments. Moreover, even in cases where selection variables coexist with latent confounders, we still establish the nonparametric identifiability under appropriate structural conditions. Meanwhile, we also propose a provably correct algorithm to detect and identify selection structures as well as other types of dependencies. The framework has been validated empirically on both synthetic data and real-world music.
△ Less
Submitted 29 June, 2024;
originally announced July 2024.
-
Policy Optimization in Control: Geometry and Algorithmic Implications
Authors:
Shahriar Talebi,
Yang Zheng,
Spencer Kraisler,
Na Li,
Mehran Mesbahi
Abstract:
This survey explores the geometric perspective on policy optimization within the realm of feedback control systems, emphasizing the intrinsic relationship between control design and optimization. By adopting a geometric viewpoint, we aim to provide a nuanced understanding of how various ``complete parameterization'' -- referring to the policy parameters together with its Riemannian geometry -- of…
▽ More
This survey explores the geometric perspective on policy optimization within the realm of feedback control systems, emphasizing the intrinsic relationship between control design and optimization. By adopting a geometric viewpoint, we aim to provide a nuanced understanding of how various ``complete parameterization'' -- referring to the policy parameters together with its Riemannian geometry -- of control design problems, influence stability and performance of local search algorithms. The paper is structured to address key themes such as policy parameterization, the topology and geometry of stabilizing policies, and their implications for various (non-convex) dynamic performance measures. We focus on a few iconic control design problems, including the Linear Quadratic Regulator (LQR), Linear Quadratic Gaussian (LQG) control, and $\mathcal{H}_\infty$ control. In particular, we first discuss the topology and Riemannian geometry of stabilizing policies, distinguishing between their static and dynamic realizations. Expanding on this geometric perspective, we then explore structural properties of the aforementioned performance measures and their interplay with the geometry of stabilizing policies in presence of policy constraints; along the way, we address issues such as spurious stationary points, symmetries of dynamic feedback policies, and (non-)smoothness of the corresponding performance measures. We conclude the survey with algorithmic implications of policy optimization in feedback design.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Benign Nonconvex Landscapes in Optimal and Robust Control, Part II: Extended Convex Lifting
Authors:
Yang Zheng,
Chih-Fan Pai,
Yujie Tang
Abstract:
Many optimal and robust control problems are nonconvex and potentially nonsmooth in their policy optimization forms. In Part II of this paper, we introduce a new and unified Extended Convex Lifting (ECL) framework to reveal hidden convexity in classical optimal and robust control problems from a modern optimization perspective. Our ECL offers a bridge between nonconvex policy optimization and conv…
▽ More
Many optimal and robust control problems are nonconvex and potentially nonsmooth in their policy optimization forms. In Part II of this paper, we introduce a new and unified Extended Convex Lifting (ECL) framework to reveal hidden convexity in classical optimal and robust control problems from a modern optimization perspective. Our ECL offers a bridge between nonconvex policy optimization and convex reformulations, enabling convex analysis for nonconvex problems. Despite non-convexity and non-smoothness, the existence of an ECL not only reveals that minimizing the original function is equivalent to a convex problem but also certifies a class of first-order non-degenerate stationary points to be globally optimal. Therefore, no spurious stationarity exists in the set of non-degenerate policies. This ECL framework can cover many benchmark control problems, including state feedback linear quadratic regulator (LQR), dynamic output feedback linear quadratic Gaussian (LQG) control, and $\mathcal{H}_\infty$ robust control. ECL can also handle a class of distributed control problems when the notion of quadratic invariance (QI) holds. We further show that all static stabilizing policies are non-degenerate for state feedback LQR and $\mathcal{H}_\infty$ control under standard assumptions. We believe that the new ECL framework may be of independent interest for analyzing nonconvex problems beyond control.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
A geometric realization for maximal almost pre-rigid representations over type $\mathbb{D}$ quivers
Authors:
Jianmin Chen,
Yiting Zheng
Abstract:
We focus on a class of special representations over a type $\mathbb{D}$ quiver $Q_{D}$ with $n$ vertices and directional symmetry, namely, maximal almost pre-rigid representations. By using the equivariant theory of group actions, we give a geometric model for the category of finite dimensional representations over $Q_{D}$ via centrally-symmetric polygon $P(Q_{D})$ with a puncture, and show that t…
▽ More
We focus on a class of special representations over a type $\mathbb{D}$ quiver $Q_{D}$ with $n$ vertices and directional symmetry, namely, maximal almost pre-rigid representations. By using the equivariant theory of group actions, we give a geometric model for the category of finite dimensional representations over $Q_{D}$ via centrally-symmetric polygon $P(Q_{D})$ with a puncture, and show that the dimension of extension group between indecomposable representations can be interpreted as the crossing number on $P(Q_{D})$. Furthermore, we provide a geometric realization for maximal almost pre-rigid representations over $Q_{D}$. As an application, we illustrate their general form and prove that each maximal almost pre-rigid representation will determine two or four tilting objects over the path algebra $\Bbbk Q_{\overline{D}}$, where $Q_{\overline{D}}$ is a quiver obtained by adding $n-2$ new vertices and $n-2$ arrows to the quiver $Q_{D}$.
△ Less
Submitted 7 May, 2024; v1 submitted 6 May, 2024;
originally announced May 2024.
-
On the spectrality of a class of Moran measures
Authors:
Yali Zheng,
Yingqing Xiao
Abstract:
In this paper, we study the spectrality of a class of Moran measures $μ_{\mathcal{P},\mathcal{D}}$ on $\mathbb{R}$ generated by $\{(p_n,\mathcal{D}_n)\}_{n=1}^{\infty}$, where $\mathcal{P}=\{p_n\}_{n=1}^{\infty}$ is a sequence of positive integers with $p_n>1$ and $\mathcal{D}=\{\mathcal{D}_{n}\}_{n=1}^{\infty}$ is a sequence of digit sets of $\mathbb{N}$ with the cardinality…
▽ More
In this paper, we study the spectrality of a class of Moran measures $μ_{\mathcal{P},\mathcal{D}}$ on $\mathbb{R}$ generated by $\{(p_n,\mathcal{D}_n)\}_{n=1}^{\infty}$, where $\mathcal{P}=\{p_n\}_{n=1}^{\infty}$ is a sequence of positive integers with $p_n>1$ and $\mathcal{D}=\{\mathcal{D}_{n}\}_{n=1}^{\infty}$ is a sequence of digit sets of $\mathbb{N}$ with the cardinality $\#\mathcal{D}_{n}\in \{2,3,N_{n}\}$. We find a countable set $Λ\subset\mathbb{R}$ such that the set $\{e^{-2πi λx}|λ\inΛ\}$ is a orthonormal basis of $L^{2}(μ_{\mathcal{P},\mathcal{D}})$ under some conditions. As an application, we show that when $μ_{\mathcal{P},\mathcal{D}}$ is absolutely continuous, $μ_{\mathcal{P},\mathcal{D}}$ not only is a spectral measure, but also its support set tiles $\mathbb{R}$ with $\mathbb{Z}$.
△ Less
Submitted 3 May, 2024; v1 submitted 29 April, 2024;
originally announced April 2024.
-
Stability of Navier-Stokes equations with a free surface
Authors:
Xing Cheng,
Yunrui Zheng
Abstract:
We consider the viscous incompressible fluids in a three-dimensional horizontally periodic domain bounded below by a fixed smooth boundary and above by a free moving surface. The fluid dynamics are governed by the Navier-Stokes equations with the effect of gravity and surface tension on the free surface. We develop a global well-posedness theory by a nonlinear energy method in low regular Sobolev…
▽ More
We consider the viscous incompressible fluids in a three-dimensional horizontally periodic domain bounded below by a fixed smooth boundary and above by a free moving surface. The fluid dynamics are governed by the Navier-Stokes equations with the effect of gravity and surface tension on the free surface. We develop a global well-posedness theory by a nonlinear energy method in low regular Sobolev spaces with several techniques, including: the horizontal energy-dissipation estimates, a new tripled bootstrap argument inspired by Guo and Tice [Arch. Ration. Mech. Anal.(2018)]. Moreover, the solution decays asymptotically to the equilibrium in an exponential rate.
△ Less
Submitted 28 April, 2024; v1 submitted 25 April, 2024;
originally announced April 2024.
-
A characterization on trees $T$ with $m(T, λ)=p(T)-2$
Authors:
Sarula Chang,
Jianxi Li,
Yirong Zheng
Abstract:
Let $m(G,λ)$ be the multiplicity of an eigenvalue $λ$ of a connected graph $G$. Wang et al. [Linear Algebra Appl. 584(2020), 257-266] proved that for any connected graph $G\neq C_n$, $m(G, λ) \leq 2c(G) + p(G) -1$, where $c (G) = |E(G)| - |V (G)| + 1$ and $p(G)$ are the cyclomatic number and the number of pendant vertices of $G$, respectively. In the same paper, they proposed the problem to charac…
▽ More
Let $m(G,λ)$ be the multiplicity of an eigenvalue $λ$ of a connected graph $G$. Wang et al. [Linear Algebra Appl. 584(2020), 257-266] proved that for any connected graph $G\neq C_n$, $m(G, λ) \leq 2c(G) + p(G) -1$, where $c (G) = |E(G)| - |V (G)| + 1$ and $p(G)$ are the cyclomatic number and the number of pendant vertices of $G$, respectively. In the same paper, they proposed the problem to characterize all connected graphs $G$ with eigenvalue $λ$ such that $m(G, λ) =2c (G)+ p(G)-1$. Wong et al. [Discrete Math. 347(2024), 113845] solved this problem for the case when $G$ is a tree by characterizing all trees $T$ with eigenvalue $λ$ such that $m(T , λ) = p(T )-1$. In this paper, we further provide the structural characterization on trees $T$ with eigenvalue $λ$ such that $m(T , λ) = p(T )-2$.
△ Less
Submitted 26 March, 2024;
originally announced March 2024.
-
Adapprox: Adaptive Approximation in Adam Optimization via Randomized Low-Rank Matrices
Authors:
Pengxiang Zhao,
** Li,
Yingjie Gu,
Yi Zheng,
Stephan Ludger Kölker,
Zhefeng Wang,
Xiaoming Yuan
Abstract:
As deep learning models exponentially increase in size, optimizers such as Adam encounter significant memory consumption challenges due to the storage of first and second moment data. Current memory-efficient methods like Adafactor and CAME often compromise accuracy with their matrix factorization techniques. Addressing this, we introduce Adapprox, a novel approach that employs randomized low-rank…
▽ More
As deep learning models exponentially increase in size, optimizers such as Adam encounter significant memory consumption challenges due to the storage of first and second moment data. Current memory-efficient methods like Adafactor and CAME often compromise accuracy with their matrix factorization techniques. Addressing this, we introduce Adapprox, a novel approach that employs randomized low-rank matrix approximation for a more effective and accurate approximation of Adam's second moment. Adapprox features an adaptive rank selection mechanism, finely balancing accuracy and memory efficiency, and includes an optional cosine similarity guidance strategy to enhance stability and expedite convergence. In GPT-2 training and downstream tasks, Adapprox surpasses AdamW by achieving 34.5% to 49.9% and 33.8% to 49.9% memory savings for the 117M and 345M models, respectively, with the first moment enabled, and further increases these savings without the first moment. Besides, it enhances convergence speed and improves downstream task performance relative to its counterparts.
△ Less
Submitted 22 March, 2024;
originally announced March 2024.
-
Threshold solutions of the energy-critical complex Ginzburg-Landau equation
Authors:
Xing Cheng,
Yunrui Zheng
Abstract:
In this article, we consider energy-critical complex Ginzburg-Landau equation in three and four dimensions. We give the dynamics when the energy of the initial data is equal to the energy of the stationary solution.
In this article, we consider energy-critical complex Ginzburg-Landau equation in three and four dimensions. We give the dynamics when the energy of the initial data is equal to the energy of the stationary solution.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
Decentralized Robust Data-driven Predictive Control for Smoothing Mixed Traffic Flow
Authors:
Xu Shang,
Jiawei Wang,
Yang Zheng
Abstract:
In a mixed traffic with connected automated vehicles (CAVs) and human-driven vehicles (HDVs) coexisting, data-driven predictive control of CAVs promises system-wide traffic performance improvements. Yet, most existing approaches focus on a centralized setup, which is not computationally scalable while failing to protect data privacy. The robustness against unknown disturbances has not been well ad…
▽ More
In a mixed traffic with connected automated vehicles (CAVs) and human-driven vehicles (HDVs) coexisting, data-driven predictive control of CAVs promises system-wide traffic performance improvements. Yet, most existing approaches focus on a centralized setup, which is not computationally scalable while failing to protect data privacy. The robustness against unknown disturbances has not been well addressed either, causing safety concerns. In this paper, we propose a decentralized robust DeeP-LCC (Data-EnablEd Predictive Leading Cruise Control) approach for CAVs to smooth mixed traffic flow. In particular, each CAV computes its control input based on locally available data from its involved subsystem. Meanwhile, the interaction between neighboring subsystems is modeled as a bounded disturbance, for which appropriate estimation methods are proposed. Then, we formulate a robust optimization problem and present its tractable computational solutions. Compared with the centralized formulation, our method greatly reduces computation burden with better safety performance, while naturally preserving data privacy. Extensive traffic simulations validate its wave-dampening ability, safety performance, and computational benefits.
△ Less
Submitted 28 January, 2024;
originally announced January 2024.
-
On equivalence relations induced by Polish groups admitting compatible two-sided invariant metrics
Authors:
Longyun Ding,
Yang Zheng
Abstract:
Given a Polish group $G$, let $E(G)$ be the right coset equivalence relation $G^ω/c(G)$, where $c(G)$ is the group of all convergent sequences in $G$. We first established two results:
(1) Let $G,H$ be two Polish groups. If $H$ is TSI but $G$ is not, then $E(G)\not\le_BE(H)$.
(2) Let $G$ be a Polish group. Then the following are equivalent: (a) $G$ is TSI non-archimedean; (b)$E(G)\leq_B E_0^ω$;…
▽ More
Given a Polish group $G$, let $E(G)$ be the right coset equivalence relation $G^ω/c(G)$, where $c(G)$ is the group of all convergent sequences in $G$. We first established two results:
(1) Let $G,H$ be two Polish groups. If $H$ is TSI but $G$ is not, then $E(G)\not\le_BE(H)$.
(2) Let $G$ be a Polish group. Then the following are equivalent: (a) $G$ is TSI non-archimedean; (b)$E(G)\leq_B E_0^ω$; and (c) $E(G)\leq_B{\mathbb R}^ω/c_0$. In particular, $E(G)\sim_B E_0^ω$ iff $G$ is TSI uncountable non-archimedean.
A critical theorem presented in this article is as follows: Let $G$ be a TSI Polish group, and let $H$ be a closed subgroup of the product of a sequence of TSI strongly NSS Polish groups. If $E(G)\le_BE(H)$, then there exists a continuous homomorphism $S:G_0\rightarrow H$ such that $\ker(S)$ is non-archimedean, where $G_0$ is the connected component of the identity of $G$. The converse holds if $G$ is connected, $S(G)$ is closed in $H$, and the interval $[0,1]$ can be embedded into $H$.
As its applications, we prove several Rigid theorems for TSI Lie groups, locally compact Polish groups, separable Banach spaces, and separable Fréchet spaces, respectively.
△ Less
Submitted 27 January, 2024;
originally announced January 2024.
-
Robust Discrete Choice Model for Travel Behavior Prediction With Data Uncertainties
Authors:
Baichuan Mo,
Yunhan Zheng,
Xiaotong Guo,
Ruoyun Ma,
**hua Zhao
Abstract:
Discrete choice models (DCMs) are the canonical methods for travel behavior modeling and prediction. However, in many scenarios, the collected data for DCMs are subject to measurement errors. Previous studies on measurement errors mostly focus on "better estimating model parameters" with training data. In this study, we focus on "better predicting new samples' behavior" when there are measurement…
▽ More
Discrete choice models (DCMs) are the canonical methods for travel behavior modeling and prediction. However, in many scenarios, the collected data for DCMs are subject to measurement errors. Previous studies on measurement errors mostly focus on "better estimating model parameters" with training data. In this study, we focus on "better predicting new samples' behavior" when there are measurement errors in testing data. To this end, we propose a robust discrete choice model framework that is able to account for data uncertainties in both features and labels. The model is based on robust optimization theory that minimizes the worst-case loss over a set of uncertainty data scenarios. Specifically, for feature uncertainties, we assume that the $\ell_p$-norm of the measurement errors in features is smaller than a pre-established threshold. We model label uncertainties by limiting the number of mislabeled choices to at most $Γ$. Based on these assumptions, we derive a tractable robust counterpart for robust-feature and robust-label DCM models. The derived robust-feature binary logit (BNL) and the robust-label multinomial logit (MNL) models are exact. However, the formulation for the robust-feature MNL model is an approximation of the exact robust optimization problem. The proposed models are validated in a binary choice data set and a multinomial choice data set, respectively. Results show that the robust models (both features and labels) can outperform the conventional BNL and MNL models in prediction accuracy and log-likelihood. We show that the robustness works like "regularization" and thus has better generalizability.
△ Less
Submitted 6 January, 2024;
originally announced January 2024.
-
Error bounds, PL condition, and quadratic growth for weakly convex functions, and linear convergences of proximal point methods
Authors:
Feng-Yi Liao,
Lijun Ding,
Yang Zheng
Abstract:
Many machine learning problems lack strong convexity properties. Fortunately, recent studies have revealed that first-order algorithms also enjoy linear convergences under various weaker regularity conditions. While the relationship among different conditions for convex and smooth functions is well understood, it is not the case for the nonsmooth setting. In this paper, we go beyond convexity and…
▽ More
Many machine learning problems lack strong convexity properties. Fortunately, recent studies have revealed that first-order algorithms also enjoy linear convergences under various weaker regularity conditions. While the relationship among different conditions for convex and smooth functions is well understood, it is not the case for the nonsmooth setting. In this paper, we go beyond convexity and smoothness, and clarify the connections among common regularity conditions (including $\textit{strong convexity, restricted secant inequality, subdifferential error bound, Polyak-Łojasiewicz inequality, and quadratic growth}$) in the class of weakly convex functions. In addition, we present a simple and modular proof for the linear convergence of the $\textit{proximal point method}$ (PPM) for convex (possibly nonsmooth) optimization using these regularity conditions. The linear convergence also holds when the subproblems of PPM are solved inexactly with a proper control of inexactness.
△ Less
Submitted 27 December, 2023;
originally announced December 2023.
-
Convex Approximations for a Bi-level Formulation of Data-Enabled Predictive Control
Authors:
Xu Shang,
Yang Zheng
Abstract:
The Willems' fundamental lemma, which characterizes linear time-invariant (LTI) systems using input and output trajectories, has found many successful applications. Combining this with receding horizon control leads to a popular Data-EnablEd Predictive Control (DeePC) scheme. DeePC is first established for LTI systems and has been extended and applied for practical systems beyond LTI settings. How…
▽ More
The Willems' fundamental lemma, which characterizes linear time-invariant (LTI) systems using input and output trajectories, has found many successful applications. Combining this with receding horizon control leads to a popular Data-EnablEd Predictive Control (DeePC) scheme. DeePC is first established for LTI systems and has been extended and applied for practical systems beyond LTI settings. However, the relationship between different DeePC variants, involving regularization and dimension reduction, remains unclear. In this paper, we first introduce a new bi-level optimization formulation that combines a data pre-processing step as an inner problem (system identification) and predictive control as an outer problem (online control). We next discuss a series of convex approximations by relaxing some hard constraints in the bi-level optimization as suitable regularization terms, accounting for an implicit identification. These include some existing DeePC variants as well as two new variants, for which we establish their equivalence under appropriate settings. Notably, our analysis reveals a novel variant, called DeePC-SVD-Iter, which has remarkable empirical performance of direct methods on systems beyond deterministic LTI settings.
△ Less
Submitted 24 December, 2023;
originally announced December 2023.
-
Benign Nonconvex Landscapes in Optimal and Robust Control, Part I: Global Optimality
Authors:
Yang Zheng,
Chih-fan Pai,
Yujie Tang
Abstract:
Direct policy search has achieved great empirical success in reinforcement learning. Many recent studies have revisited its theoretical foundation for continuous control, which reveals elegant nonconvex geometry in various benchmark problems, especially in fully observable state-feedback cases. This paper considers two fundamental optimal and robust control problems with partial observability: the…
▽ More
Direct policy search has achieved great empirical success in reinforcement learning. Many recent studies have revisited its theoretical foundation for continuous control, which reveals elegant nonconvex geometry in various benchmark problems, especially in fully observable state-feedback cases. This paper considers two fundamental optimal and robust control problems with partial observability: the Linear Quadratic Gaussian (LQG) control with stochastic noises, and $\mathcal{H}_\infty$ robust control with adversarial noises. In the policy space, the former problem is smooth but nonconvex, while the latter one is nonsmooth and nonconvex. We highlight some interesting and surprising ``discontinuity'' of LQG and $\mathcal{H}_\infty$ cost functions around the boundary of their domains. Despite the lack of convexity (and possibly smoothness), our main results show that for a class of non-degenerate policies, all Clarke stationary points are globally optimal and there is no spurious local minimum for both LQG and $\mathcal{H}_\infty$ control. Our proof techniques rely on a new and unified framework of Extended Convex Lifting (ECL), which reconciles the gap between nonconvex policy optimization and convex reformulations. This ECL framework is of independent interest, and we will discuss its details in Part II of this paper.
△ Less
Submitted 23 December, 2023;
originally announced December 2023.
-
Finite-time singularity formation for the heat flow of the $H$-system
Authors:
Yannick Sire,
Juncheng Wei,
Youquan Zheng,
Yifu Zhou
Abstract:
We construct the first example of finite time blow-up solutions for the heat flow of the $H$-system, describing the evolution of surfaces with constant mean curvature \begin{equation*} \left\{ \begin{aligned} &u_t = Δu - 2u_{x_1}\wedge u_{x_2}~\quad\text{ in }~\mathbb{R}^2\times\mathbb{R}_+,\\ &u(\cdot, 0) = u_0~\qquad\qquad~\text{ in }~\mathbb{R}^2, \end{aligned} \right. \end{equation*} where…
▽ More
We construct the first example of finite time blow-up solutions for the heat flow of the $H$-system, describing the evolution of surfaces with constant mean curvature \begin{equation*} \left\{ \begin{aligned} &u_t = Δu - 2u_{x_1}\wedge u_{x_2}~\quad\text{ in }~\mathbb{R}^2\times\mathbb{R}_+,\\ &u(\cdot, 0) = u_0~\qquad\qquad~\text{ in }~\mathbb{R}^2, \end{aligned} \right. \end{equation*} where $u$: $\mathbb{R}^2\times\mathbb{R}_+\to \mathbb{R}^3$. The singularity at finite time forms as a scaled least energy $H$-bubble, denoted as $W$, exhibiting type II blow-up speed. One key observation is that the linearized operators around $W$ projected onto $W^\perp$ and in the $W$-direction are in fact decoupled. On $W^\perp$, the linearization is the linearized harmonic map heat flow, while in the $W$-direction, it is the linearized Liouville-type flow. Based on this, we also prove the non-degeneracy of the $H$-bubbles with any degree.
△ Less
Submitted 24 November, 2023;
originally announced November 2023.
-
A quantum-classical performance separation in nonconvex optimization
Authors:
Jiaqi Leng,
Yufan Zheng,
Xiaodi Wu
Abstract:
In this paper, we identify a family of nonconvex continuous optimization instances, each $d$-dimensional instance with $2^d$ local minima, to demonstrate a quantum-classical performance separation. Specifically, we prove that the recently proposed Quantum Hamiltonian Descent (QHD) algorithm [Leng et al., arXiv:2303.01471] is able to solve any $d$-dimensional instance from this family using…
▽ More
In this paper, we identify a family of nonconvex continuous optimization instances, each $d$-dimensional instance with $2^d$ local minima, to demonstrate a quantum-classical performance separation. Specifically, we prove that the recently proposed Quantum Hamiltonian Descent (QHD) algorithm [Leng et al., arXiv:2303.01471] is able to solve any $d$-dimensional instance from this family using $\widetilde{\mathcal{O}}(d^3)$ quantum queries to the function value and $\widetilde{\mathcal{O}}(d^4)$ additional 1-qubit and 2-qubit elementary quantum gates. On the other side, a comprehensive empirical study suggests that representative state-of-the-art classical optimization algorithms/solvers (including Gurobi) would require a super-polynomial time to solve such optimization instances.
△ Less
Submitted 1 November, 2023;
originally announced November 2023.
-
The limit theory of the energy-critical complex Ginzburg-Landau equation
Authors:
Xing. Cheng,
Chang-yu Guo,
Yunrui. Zheng
Abstract:
We study the limit behavior of the solutions to energy-critical complex Ginzburg-Landau equation. We give a rigorous theory of the zero-dispersion limit from energy-critical complex Ginzburg-Landau equation to energy-critical nonlinear heat equation for dimensions 3 and 4 in both the defocusing and focusing cases by energy method. Furthermore, we also show the invisicid limit of energy-critical co…
▽ More
We study the limit behavior of the solutions to energy-critical complex Ginzburg-Landau equation. We give a rigorous theory of the zero-dispersion limit from energy-critical complex Ginzburg-Landau equation to energy-critical nonlinear heat equation for dimensions 3 and 4 in both the defocusing and focusing cases by energy method. Furthermore, we also show the invisicid limit of energy-critical complex Ginzburg-Landau equation to energy-critical nonlinear Schrödinger equation for dimension 4 in the focusing case.
△ Less
Submitted 22 April, 2024; v1 submitted 1 November, 2023;
originally announced November 2023.
-
Smoothing Mixed Traffic with Robust Data-driven Predictive Control for Connected and Autonomous Vehicles
Authors:
Xu Shang,
Jiawei Wang,
Yang Zheng
Abstract:
The recently developed DeeP-LCC (Data-EnablEd Predictive Leading Cruise Control) method has shown promising performance for data-driven predictive control of Connected and Autonomous Vehicles (CAVs) in mixed traffic. However, its simplistic zero assumption of the future velocity errors for the head vehicle may pose safety concerns and limit its performance of smoothing traffic flow. In this paper,…
▽ More
The recently developed DeeP-LCC (Data-EnablEd Predictive Leading Cruise Control) method has shown promising performance for data-driven predictive control of Connected and Autonomous Vehicles (CAVs) in mixed traffic. However, its simplistic zero assumption of the future velocity errors for the head vehicle may pose safety concerns and limit its performance of smoothing traffic flow. In this paper, we propose a robust DeeP-LCC method to control CAVs in mixed traffic with enhanced safety performance. In particular, we first present a robust formulation that enforces a safety constraint for a range of potential velocity error trajectories, and then estimate all potential velocity errors based on the past data from the head vehicle. We also provide efficient computational approaches to solve the robust optimization for online predictive control. Nonlinear traffic simulations show that our robust DeeP-LCC can provide better traffic efficiency and stronger safety performance while requiring less offline data.
△ Less
Submitted 30 September, 2023;
originally announced October 2023.
-
Global weak solution of 3-D focusing energy-critical nonlinear Schrödinger equation
Authors:
Xing Cheng,
Chang-Yu Guo,
Yunrui Zheng
Abstract:
In this article, we prove the existence of global weak solutions to the three-dimensional focusing energy-critical nonlinear Schrödinger (NLS) equation in the non-radial case. Furthermore, we prove the weak-strong uniqueness for some class of initial data. The main ingredient of our new approach is to use solutions of an energy-critical Ginzburg-Landau equation as approximations for the correspond…
▽ More
In this article, we prove the existence of global weak solutions to the three-dimensional focusing energy-critical nonlinear Schrödinger (NLS) equation in the non-radial case. Furthermore, we prove the weak-strong uniqueness for some class of initial data. The main ingredient of our new approach is to use solutions of an energy-critical Ginzburg-Landau equation as approximations for the corresponding nonlinear Schördinger equation.
In our proofs, we first show the dichotomy of global well-posedness versus finite time blow-up of energy-critical Ginzburg-Landau equation in $\dot{H}^1( \mathbb{R}^d)$ for $d = 3,4 $ when the energy is less than the energy of the stationary solution $W$. We follow the strategy of C. E. Kenig and F. Merle [25,26], using a concentration-compactness/rigidity argument to reduce the global well-posedness to the exclusion of a critical element. The critical element is ruled out by dissipation of the Ginzburg-Landau equation, including local smoothness, backwards uniqueness and unique continuation. The existence of global weak solution of the three dimensional focusing energy-critical nonlinear Schrödinger equation in the non-radial case then follows from the global well-posedness of the energy-critical Ginzburg-Landau equation via a limitation argument. We also adapt the arguments of M. Struwe [37,38] to prove the weak-strong uniqueness when the $\dot{H}^1$-norm of the initial data is bounded by a constant depending on the stationary solution $W$.
△ Less
Submitted 2 August, 2023;
originally announced August 2023.
-
An Efficient Algorithm for Computational Protein Design Problem
Authors:
Yukai Zheng,
Weikun Chen,
Qingna Li
Abstract:
A protein is a sequence of basic blocks called amino acids, and it plays an important role in animals and human beings. The computational protein design (CPD) problem is to identify a protein that could perform some given functions. The CPD problem can be formulated as a quadratic semi-assigement problem (QSAP) and is extremely challenging due to its combinatorial properties over different amino a…
▽ More
A protein is a sequence of basic blocks called amino acids, and it plays an important role in animals and human beings. The computational protein design (CPD) problem is to identify a protein that could perform some given functions. The CPD problem can be formulated as a quadratic semi-assigement problem (QSAP) and is extremely challenging due to its combinatorial properties over different amino acid sequences. In this paper, we first show that the QSAP is equivalent to its continuous relaxation problem, the RQSAP, in the sense that the QSAP and RQSAP share the same optimal solution. Then we design an efficient quadratic penalty method to solve large-scale RQSAP. Numerical results on benchmark instances verify the superior performance of our approach over the state-of-the-art branch-and-cut solvers. In particular, our proposed algorithm outperforms the state-of-the-art solvers by three order of magnitude in CPU time in most cases while returns a high-quality solution.
△ Less
Submitted 1 August, 2023;
originally announced August 2023.
-
An Overview and Comparison of Spectral Bundle Methods for Primal and Dual Semidefinite Programs
Authors:
Feng-Yi Liao,
Lijun Ding,
Yang Zheng
Abstract:
The spectral bundle method developed by Helmberg and Rendl is well-established for solving large-scale semidefinite programs (SDPs) in the dual form, especially when the SDPs admit $\textit{low-rank primal solutions}$. Under mild regularity conditions, a recent result by Ding and Grimmer has established fast linear convergence rates when the bundle method captures…
▽ More
The spectral bundle method developed by Helmberg and Rendl is well-established for solving large-scale semidefinite programs (SDPs) in the dual form, especially when the SDPs admit $\textit{low-rank primal solutions}$. Under mild regularity conditions, a recent result by Ding and Grimmer has established fast linear convergence rates when the bundle method captures $\textit{the rank of primal solutions}$. In this paper, we present an overview and comparison of spectral bundle methods for solving both $\textit{primal}$ and $\textit{dual}$ SDPs. In particular, we introduce a new family of spectral bundle methods for solving SDPs in the $\textit{primal}$ form. The algorithm developments are parallel to those by Helmberg and Rendl, mirroring the elegant duality between primal and dual SDPs. The new family of spectral bundle methods also achieves linear convergence rates for primal feasibility, dual feasibility, and duality gap when the algorithm captures $\textit{the rank of the dual solutions}$. Therefore, the original spectral bundle method by Helmberg and Rendl is well-suited for SDPs with $\textit{low-rank primal solutions}$, while on the other hand, our new spectral bundle method works well for SDPs with $\textit{low-rank dual solutions}$. These theoretical findings are supported by a range of large-scale numerical experiments. Finally, we demonstrate that our new spectral bundle method achieves state-of-the-art efficiency and scalability for solving polynomial optimization compared to a set of baseline solvers $\textsf{SDPT3}$, $\textsf{MOSEK}$, $\textsf{CDCS}$, and $\textsf{SDPNAL+}$.
△ Less
Submitted 14 July, 2023;
originally announced July 2023.
-
Moving-horizon False Data Injection Attack Design against Cyber-Physical Systems
Authors:
Yu Zheng,
Sridhar Babu Mudhangulla,
Olugbenga Moses Anubi
Abstract:
Systematic attack design is essential to understanding the vulnerabilities of cyber-physical systems (CPSs), to better design for resiliency. In particular, false data injection attacks (FDIAs) are well-known and have been shown to be capable of bypassing bad data detection (BDD) while causing targeted biases in resulting state estimates. However, their effectiveness against moving horizon estimat…
▽ More
Systematic attack design is essential to understanding the vulnerabilities of cyber-physical systems (CPSs), to better design for resiliency. In particular, false data injection attacks (FDIAs) are well-known and have been shown to be capable of bypassing bad data detection (BDD) while causing targeted biases in resulting state estimates. However, their effectiveness against moving horizon estimators (MHE) is not well understood. In fact, this paper shows that conventional FDIAs are generally ineffective against MHE. One of the main reasons is that the moving window renders the static FDIA recursively infeasible. This paper proposes a new attack methodology, moving-horizon FDIA (MH-FDIA), by considering both the performance of historical attacks and the current system's status. Theoretical guarantees for successful attack generation and recursive feasibility are given. Numerical simulations on the IEEE-14 bus system further validate the theoretical claims and show that the proposed MH-FDIA outperforms state-of-the-art counterparts in both stealthiness and effectiveness. In addition, \textcolor{blue}{an experiment on} a path-tracking control system of an autonomous vehicle shows the feasibility of the MH-FDIA in real-world nonlinear systems.
△ Less
Submitted 23 April, 2023;
originally announced April 2023.
-
On the Relationship of Optimal State Feedback and Disturbance Response Controllers
Authors:
Runyu Zhang,
Yang Zheng,
Weiyu Li,
Na Li
Abstract:
This paper studies the relationship between state feedback policies and disturbance response policies for the standard Linear Quadratic Regulator (LQR). For open-loop stable plants, we establish a simple relationship between the optimal state feedback controller $u_t=K_\star x_t$ and the optimal disturbance response controller $u_t=L^{(H)}_{\star;1}w_{t-1}+\cdots+L^{(H)}_{\star;H}w_{t-H}$ with…
▽ More
This paper studies the relationship between state feedback policies and disturbance response policies for the standard Linear Quadratic Regulator (LQR). For open-loop stable plants, we establish a simple relationship between the optimal state feedback controller $u_t=K_\star x_t$ and the optimal disturbance response controller $u_t=L^{(H)}_{\star;1}w_{t-1}+\cdots+L^{(H)}_{\star;H}w_{t-H}$ with $H$-order. Here $x_t, w_t, u_t$ stands for the state, disturbance, control action of the system, respectively. Our result shows that $L_{\star,1}^{(H)}$ is a good approximation of $K_\star$ and the approximation error $\|K_\star - L_{\star,1}^{(H)}\|$ decays exponentially with $H$. We further extend this result to LQR for open-loop unstable systems, when a pre-stabilizing controller $K_0$ is available.
△ Less
Submitted 7 April, 2023;
originally announced April 2023.
-
On the Global Optimality of Direct Policy Search for Nonsmooth $H_\infty$ Output-Feedback Control
Authors:
Yujie Tang,
Yang Zheng
Abstract:
Direct policy search has achieved great empirical success in reinforcement learning. Recently, there has been increasing interest in studying its theoretical properties for continuous control, and fruitful results have been established for linear quadratic regulator (LQR) and linear quadratic Gaussian (LQG) control that are smooth and nonconvex. In this paper, we consider the standard $H_\infty$ r…
▽ More
Direct policy search has achieved great empirical success in reinforcement learning. Recently, there has been increasing interest in studying its theoretical properties for continuous control, and fruitful results have been established for linear quadratic regulator (LQR) and linear quadratic Gaussian (LQG) control that are smooth and nonconvex. In this paper, we consider the standard $H_\infty$ robust control for output feedback systems and investigate the global optimality of direct policy search. Unlike LQR or LQG, the $H_\infty$ cost function is nonsmooth in the policy space. Despite the lack of smoothness and convexity, our main result shows that for a class of non-degenerated stabilizing controllers, all Clarke stationary points of $H_\infty$ robust control are globally optimal and there is no spurious local minimum. Our proof technique is motivated by the idea of differentiable convex liftings (DCL), and we extend DCL to analyze the nonsmooth and nonconvex $H_\infty$ robust control via convex reformulation. Our result sheds some light on the analysis of direct policy search for solving nonsmooth and nonconvex robust control problems.
△ Less
Submitted 3 April, 2023;
originally announced April 2023.
-
Quotients of exact categories by pseudo-cluster tilting subcategories
Authors:
Jie Xu,
Yuefei Zheng
Abstract:
We introduce the concept of a pseudo-cluster tilting subcategory from the viewpoint of the fact that the quotient of an exact category by a cluster tilting subcategory is an abelian category. We prove that the quotients in the case of pseudo-cluster tilting are always semi-abelian. In addition, it is abelian if and only if some self-orthogonal conditions are satisfied. We revisit the abelian quoti…
▽ More
We introduce the concept of a pseudo-cluster tilting subcategory from the viewpoint of the fact that the quotient of an exact category by a cluster tilting subcategory is an abelian category. We prove that the quotients in the case of pseudo-cluster tilting are always semi-abelian. In addition, it is abelian if and only if some self-orthogonal conditions are satisfied. We revisit the abelian quotient category of conflations by splitting ones, and get that there exists a unique exact substructure such that it is a cluster quotient.
△ Less
Submitted 12 March, 2023;
originally announced March 2023.
-
$F$-zips with additional structure on splitting models of Shimura varieties
Authors:
Xu Shen,
Yuqiang Zheng
Abstract:
We construct universal $G$-zips on good reductions of the Pappas-Rapoport splitting models for PEL-type Shimura varieties. We study the induced Ekedahl-Oort stratification, which sheds new light on the mod $p$ geometry of splitting models. Building on the work of Lan on arithmetic compactifications of splitting models, we further extend these constructions to smooth toroidal compactifications. Com…
▽ More
We construct universal $G$-zips on good reductions of the Pappas-Rapoport splitting models for PEL-type Shimura varieties. We study the induced Ekedahl-Oort stratification, which sheds new light on the mod $p$ geometry of splitting models. Building on the work of Lan on arithmetic compactifications of splitting models, we further extend these constructions to smooth toroidal compactifications. Combined with the work of Goldring-Koskivirta on group theoretical Hasse invariants, we get an application to Galois representations associated to torsion classes in coherent cohomology in the ramified setting.
△ Less
Submitted 26 December, 2023; v1 submitted 28 December, 2022;
originally announced December 2022.
-
Multivariate Polynomial Regression of Euclidean Degree Extends the Stability for Fast Approximations of Trefethen Functions
Authors:
Sachin K. Thekke Veettil,
Yuxi Zheng,
Uwe Hernandez Acosta,
Damar Wicaksono,
Michael Hecht
Abstract:
We address classic multivariate polynomial regression tasks from a novel perspective resting on the notion of general polynomial $l_p$-degree, with total, Euclidean, and maximum degree being the centre of considerations. While ensuring stability is a theoretically known and empirically observable limitation of any computational scheme seeking for fast function approximation, we show that choosing…
▽ More
We address classic multivariate polynomial regression tasks from a novel perspective resting on the notion of general polynomial $l_p$-degree, with total, Euclidean, and maximum degree being the centre of considerations. While ensuring stability is a theoretically known and empirically observable limitation of any computational scheme seeking for fast function approximation, we show that choosing Euclidean degree resists the instability phenomenon best. Especially, for a class of analytic functions, we termed Trefethen functions, we extend recent argumentations that suggest this result to be genuine. We complement the novel regression scheme, presented herein, by an adaptive domain decomposition approach that extends the stability for fast function approximation even further.
△ Less
Submitted 22 December, 2022;
originally announced December 2022.
-
A note on $α$-permanent and loop soup
Authors:
Xiaodan Li,
Yushu Zheng
Abstract:
In this paper, it is shown that $α$-permanent in algebra is closely related to loop soup in probability. We give explicit expansions of $α$-permanents of the block matrices obtained from matrices associated to $*$-forests, which are a special class of matrices containing tridiagonal matrices. It is proved in two ways, one is the direct combinatorial proof, and the other is the probabilistic proof…
▽ More
In this paper, it is shown that $α$-permanent in algebra is closely related to loop soup in probability. We give explicit expansions of $α$-permanents of the block matrices obtained from matrices associated to $*$-forests, which are a special class of matrices containing tridiagonal matrices. It is proved in two ways, one is the direct combinatorial proof, and the other is the probabilistic proof via loop soup.
△ Less
Submitted 18 June, 2023; v1 submitted 12 December, 2022;
originally announced December 2022.
-
The Global Maximum Principle for Optimal Control of Partially Observed Stochastic Systems Driven by Fractional Brownian Motion
Authors:
Yueyang Zheng,
Yaozhong Hu
Abstract:
In this paper we study the stochastic control problem of partially observed (multi-dimensional) stochastic system driven by both Brownian motions and fractional Brownian motions. In the absence of the powerful tool of Girsanov transformation, we introduce and study new stochastic processes which are used to transform the original problem to a "classical one". The adjoint backward stochastic differ…
▽ More
In this paper we study the stochastic control problem of partially observed (multi-dimensional) stochastic system driven by both Brownian motions and fractional Brownian motions. In the absence of the powerful tool of Girsanov transformation, we introduce and study new stochastic processes which are used to transform the original problem to a "classical one". The adjoint backward stochastic differential equations and the necessary condition satisfied by the optimal control (maximum principle) are obtained.
△ Less
Submitted 19 August, 2023; v1 submitted 10 December, 2022;
originally announced December 2022.
-
On Controller Reduction in Linear Quadratic Gaussian Control with Performance Bounds
Authors:
Zhaolin Ren,
Yang Zheng,
Maryam Fazel,
Na Li
Abstract:
The problem of controller reduction has a rich history in control theory. Yet, many questions remain open. In particular, there exist very few results on the order reduction of general non-observer based controllers and the subsequent quantification of the closed-loop performance. Recent developments in model-free policy optimization for Linear Quadratic Gaussian (LQG) control have highlighted the…
▽ More
The problem of controller reduction has a rich history in control theory. Yet, many questions remain open. In particular, there exist very few results on the order reduction of general non-observer based controllers and the subsequent quantification of the closed-loop performance. Recent developments in model-free policy optimization for Linear Quadratic Gaussian (LQG) control have highlighted the importance of this question. In this paper, we first propose a new set of sufficient conditions ensuring that a perturbed controller remains internally stabilizing. Based on this result, we illustrate how to perform order reduction of general non-observer based controllers using balanced truncation and modal truncation. We also provide explicit bounds on the LQG performance of the reduced-order controller. Furthermore, for single-input-single-output (SISO) systems, we introduce a new controller reduction technique by truncating unstable modes. We illustrate our theoretical results with numerical simulations. Our results will serve as valuable tools to design direct policy search algorithms for control problems with partial observations.
△ Less
Submitted 29 November, 2022;
originally announced November 2022.
-
The statistical analysis for Sombor indices in a random polygonal chain networks
Authors:
Jia-Bao Liu,
Ya-Qian Zheng,
Xin-Bei Peng
Abstract:
The Sombor indices, a new category of degree-based topological molecular descriptors, have been widely investigated due to their excellent chemical applicability. This paper aims to establish Sombor indices distributions in random polygonal chain networks and to achieve expressions of the expected values and variances. The expected values and variances of the Sombor indices for polyonino, pentacha…
▽ More
The Sombor indices, a new category of degree-based topological molecular descriptors, have been widely investigated due to their excellent chemical applicability. This paper aims to establish Sombor indices distributions in random polygonal chain networks and to achieve expressions of the expected values and variances. The expected values and variances of the Sombor indices for polyonino, pentachain, polyphenyl, and cyclooctane chains are obtained. Since the end connection of a random chain network follows a binomial distribution, the Sombor indices of any chain network follow the normal distribution when the number of polygons connected by the chain, indicated by n, approaches infinity. Keywords: Degree distribution; Polygonal chains; Expected value; Variance; Sombor indices.
△ Less
Submitted 23 October, 2022;
originally announced October 2022.
-
On the one-sided boundedness of the local discrepancy of $\{nα\}$-sequences
Authors:
Jiangang Ying,
Yushu Zheng
Abstract:
The main interest of this article is the one-sided boundedness of the local discrepancy of $α\in\mathbb{R}\setminus\mathbb{Q}$ on the interval $(0,c)\subset(0,1)$ defined by \[D_n(α,c)=\sum_{j=1}^n 1_{\{\{jα\}<c\}}-cn.\] We focus on the special case $c\in (0,1)\cap\mathbb{Q}$. Several necessary and sufficient conditions on $α$ for $(D_n(α,c))$ to be one-side bounded are derived. Using these, certa…
▽ More
The main interest of this article is the one-sided boundedness of the local discrepancy of $α\in\mathbb{R}\setminus\mathbb{Q}$ on the interval $(0,c)\subset(0,1)$ defined by \[D_n(α,c)=\sum_{j=1}^n 1_{\{\{jα\}<c\}}-cn.\] We focus on the special case $c\in (0,1)\cap\mathbb{Q}$. Several necessary and sufficient conditions on $α$ for $(D_n(α,c))$ to be one-side bounded are derived. Using these, certain topological properties are given to describe the size of the set \[O_c=\{α\in \irr: (D_n(α,c)) \text{ is one-side bounded}\}.\]
△ Less
Submitted 16 October, 2022;
originally announced October 2022.
-
On some classification of finite-dimensional Hopf algebras over the Hopf algebra $H_{b:1}^*$ of Kashina
Authors:
Yiwei Zheng,
Yun Gao,
Naihong Hu,
Yuxing Shi
Abstract:
Let $H$ be the dual of $16$-dimensional nontrivial semisimple Hopf algebra $H_{b:1}$ in the classification work of Kashina \cite{K00}. We completely determine all finite-dimensional Nichols algebras satisfying $\mathcal{B}(N)\cong \bigotimes_{i\in I}\mathcal{B}(N_i)$, where $N=\bigoplus_{i\in I}N_i$, each $N_i$ is a simple object in $_H^H\mathcal{YD}$. Under this assumption, we classify all those…
▽ More
Let $H$ be the dual of $16$-dimensional nontrivial semisimple Hopf algebra $H_{b:1}$ in the classification work of Kashina \cite{K00}. We completely determine all finite-dimensional Nichols algebras satisfying $\mathcal{B}(N)\cong \bigotimes_{i\in I}\mathcal{B}(N_i)$, where $N=\bigoplus_{i\in I}N_i$, each $N_i$ is a simple object in $_H^H\mathcal{YD}$. Under this assumption, we classify all those Hopf algebras of finite-dimensional growth from the semisimple Hopf algebra $H$ via the relevant Nichols algebras $\mathcal B(N)$.
△ Less
Submitted 26 September, 2022;
originally announced September 2022.
-
On equivalence relations induced by locally compact abelian Polish groups
Authors:
Longyun Ding,
Yang Zheng
Abstract:
Given a Polish group $G$, let $E(G)$ be the right coset equivalence relation $G^ω/c(G)$, where $c(G)$ is the group of all convergent sequences in $G$. The connected component of the identity of a Polish group $G$ is denoted by $G_0$. Let $G,H$ be locally compact abelian Polish groups. If $E(G)\leq_B E(H)$, then there is a continuous homomorphism $S:G_0\rightarrow H_0$ such that $\ker(S)$ is non-ar…
▽ More
Given a Polish group $G$, let $E(G)$ be the right coset equivalence relation $G^ω/c(G)$, where $c(G)$ is the group of all convergent sequences in $G$. The connected component of the identity of a Polish group $G$ is denoted by $G_0$. Let $G,H$ be locally compact abelian Polish groups. If $E(G)\leq_B E(H)$, then there is a continuous homomorphism $S:G_0\rightarrow H_0$ such that $\ker(S)$ is non-archimedean. The converse is also true when $G$ is connected and compact. For $n\in{\mathbb N}^+$, the partially ordered set $P(ω)/\mbox{Fin}$ can be embedded into Borel equivalence relations between $E({\mathbb R}^n)$ and $E({\mathbb T}^n)$.
△ Less
Submitted 29 May, 2023; v1 submitted 25 September, 2022;
originally announced September 2022.
-
Symmetry TFTs for Non-Invertible Defects
Authors:
Justin Kaidi,
Kantaro Ohmori,
Yunqin Zheng
Abstract:
Given any symmetry acting on a $d$-dimensional quantum field theory, there is an associated $(d+1)$-dimensional topological field theory known as the Symmetry TFT (SymTFT). The SymTFT is useful for decoupling the universal quantities of quantum field theories, such as their generalized global symmetries and 't Hooft anomalies, from their dynamics. In this work, we explore the SymTFT for theories w…
▽ More
Given any symmetry acting on a $d$-dimensional quantum field theory, there is an associated $(d+1)$-dimensional topological field theory known as the Symmetry TFT (SymTFT). The SymTFT is useful for decoupling the universal quantities of quantum field theories, such as their generalized global symmetries and 't Hooft anomalies, from their dynamics. In this work, we explore the SymTFT for theories with Kramers-Wannier-like duality symmetry in both $(1+1)$d and $(3+1)$d quantum field theories. After constructing the SymTFT, we use it to reproduce the non-invertible fusion rules of duality defects, and along the way we generalize the concept of duality defects to \textit{higher} duality defects. We also apply the SymTFT to the problem of distinguishing intrinsically versus non-intrinsically non-invertible duality defects in $(1+1)$d.
△ Less
Submitted 21 October, 2023; v1 submitted 22 September, 2022;
originally announced September 2022.
-
Infinite time bubbling for the $SU(2)$ Yang-Mills heat flow on $\mathbb{R}^4$
Authors:
Yannick Sire,
Juncheng Wei,
Youquan Zheng
Abstract:
We investigate the long time behaviour of the Yang-Mills heat flow on the bundle $\mathbb{R}^4\times SU(2)$. Waldron \cite{Waldron2019} proved global existence and smoothness of the flow on closed $4-$manifolds, leaving open the issue of the behaviour in infinite time. We exhibit two types of long-time bubbling: first we construct an initial data and a globally defined solution which {\sl blows-up…
▽ More
We investigate the long time behaviour of the Yang-Mills heat flow on the bundle $\mathbb{R}^4\times SU(2)$. Waldron \cite{Waldron2019} proved global existence and smoothness of the flow on closed $4-$manifolds, leaving open the issue of the behaviour in infinite time. We exhibit two types of long-time bubbling: first we construct an initial data and a globally defined solution which {\sl blows-up} in infinite time at a given point in $\mathbb R^4$. Second, we prove the existence of {\sl bubble-tower} solutions, also in infinite time. This answers the basic dynamical properties of the heat flow of Yang-Mills connection in the critical dimension $4$ and shows in particular that in general one cannot expect that this gradient flow converges to a Yang-Mills connection. We emphasize that we do not assume for the first result any symmetry assumption; whereas the second result on the existence of the bubble-tower is in the $SO(4)$-equivariant class, but nevertheless new.
△ Less
Submitted 29 August, 2022;
originally announced August 2022.
-
Regular subspaces of symmetric stable processes
Authors:
Dongjian Qian,
Jiangang Ying,
Yushu Zheng
Abstract:
Roughly speaking, regular subspaces are regular Dirichlet forms that inherit the original forms with smaller domains. In this paper, regular subspaces of 1-dim symmetric $α$-stable processes are considered. The main result is that it admits proper regular subspaces if and only if $α\in [1,2]$. Moreover, for $α\in(1,2)$, the characterization of the regular subspaces is given. General 1-dim symmetri…
▽ More
Roughly speaking, regular subspaces are regular Dirichlet forms that inherit the original forms with smaller domains. In this paper, regular subspaces of 1-dim symmetric $α$-stable processes are considered. The main result is that it admits proper regular subspaces if and only if $α\in [1,2]$. Moreover, for $α\in(1,2)$, the characterization of the regular subspaces is given. General 1-dim symmetric Lévy processes will also be investigated. It will be shown that whether it has proper regular subspaces is closely related to whether its sample paths have finite variation.
△ Less
Submitted 8 March, 2023; v1 submitted 19 July, 2022;
originally announced July 2022.
-
On positive solutions of biharmonic elliptic inequalities on Riemannian manifolds
Authors:
Yuhua Sun,
Yadong Zheng
Abstract:
We investigate the non-existence and existence of positive solutions to biharmonic elliptic inequalities on manifolds. Using Green function and volume growth conditions, we establish the critical exponent for biharmonic problem.
We investigate the non-existence and existence of positive solutions to biharmonic elliptic inequalities on manifolds. Using Green function and volume growth conditions, we establish the critical exponent for biharmonic problem.
△ Less
Submitted 11 June, 2022;
originally announced June 2022.
-
Qualitative properties for elliptic problems with CKN operators
Authors:
Huyuan Chen,
Yishan Zheng
Abstract:
The purpose of this paper is to study basic property of the operator
$$\mathcal{L}_{μ_1,μ_2} u=-Δ+\frac{μ_1 }{|x|^2}x\cdot\nabla +\frac{μ_2 }{|x|^2},$$ which generates at the origin due to the critical gradient and the Hardy term, where $μ_1,μ_2$ are free parameters. This operator arises from the critical Caffarelli-Kohn-Nirenberg inequality. We analyze the fundamental solutions in a weighted di…
▽ More
The purpose of this paper is to study basic property of the operator
$$\mathcal{L}_{μ_1,μ_2} u=-Δ+\frac{μ_1 }{|x|^2}x\cdot\nabla +\frac{μ_2 }{|x|^2},$$ which generates at the origin due to the critical gradient and the Hardy term, where $μ_1,μ_2$ are free parameters. This operator arises from the critical Caffarelli-Kohn-Nirenberg inequality. We analyze the fundamental solutions in a weighted distributional identity and obtain the Liouville theorem for the Lane-Emden equation with that operator, by using the classification of isolated singular solutions of the related Poisson problem in a bounded domain $Ω\subset \mathbb{R}^N$ ($N \geq 2$) containing the origin.
△ Less
Submitted 8 June, 2022;
originally announced June 2022.
-
Inverting Ray-Knight identities on trees
Authors:
Xiaodan Li,
Yushu Zheng
Abstract:
In this paper, we first introduce the Ray-Knight identity and percolation Ray-Knight identity related to loop soup with intensity $α(\ge 0)$ on trees. Then we present the inversions of the above identities, which are expressed in terms of repelling jump processes. In particular, the inversion in the case of $α=0$ gives the conditional law of a Markov jump process given its local time field. We fur…
▽ More
In this paper, we first introduce the Ray-Knight identity and percolation Ray-Knight identity related to loop soup with intensity $α(\ge 0)$ on trees. Then we present the inversions of the above identities, which are expressed in terms of repelling jump processes. In particular, the inversion in the case of $α=0$ gives the conditional law of a Markov jump process given its local time field. We further show that the fine mesh limits of these repelling jump processes are the self-repelling diffusions \cite{Aidekon} involved in the inversion of the Ray-Knight identity on the corresponding metric graph. This is a generalization of results in \cite{2016Inverting,lupu2019inverting,LupuEJP657}, where the authors explore the case of $α=1/2$ on a general graph. Our construction is different from \cite{2016Inverting,lupu2019inverting} and based on the link between random networks and loop soups.
△ Less
Submitted 24 October, 2022; v1 submitted 6 June, 2022;
originally announced June 2022.
-
Constrained Langevin Algorithms with L-mixing External Random Variables
Authors:
Yu** Zheng,
Andrew Lamperski
Abstract:
Langevin algorithms are gradient descent methods augmented with additive noise, and are widely used in Markov Chain Monte Carlo (MCMC) sampling, optimization, and machine learning. In recent years, the non-asymptotic analysis of Langevin algorithms for non-convex learning has been extensively explored. For constrained problems with non-convex losses over a compact convex domain with IID data varia…
▽ More
Langevin algorithms are gradient descent methods augmented with additive noise, and are widely used in Markov Chain Monte Carlo (MCMC) sampling, optimization, and machine learning. In recent years, the non-asymptotic analysis of Langevin algorithms for non-convex learning has been extensively explored. For constrained problems with non-convex losses over a compact convex domain with IID data variables, the projected Langevin algorithm achieves a deviation of $O(T^{-1/4} (\log T)^{1/2})$ from its target distribution [27] in $1$-Wasserstein distance.
In this paper, we obtain a deviation of $O(T^{-1/2} \log T)$ in $1$-Wasserstein distance for non-convex losses with $L$-mixing data variables and polyhedral constraints (which are not necessarily bounded). This improves on the previous bound for constrained problems and matches the best-known bound for unconstrained problems.
△ Less
Submitted 7 January, 2023; v1 submitted 27 May, 2022;
originally announced May 2022.
-
A priori estimates and Liouville type results for quasilinear elliptic equations involving gradient terms
Authors:
Roberta Filippucci,
Yuhua Sun,
Yadong Zheng
Abstract:
In this article we study local and global properties of positive solutions of $-Δ_mu=|u|^{p-1}u+M|\nabla u|^q$ in a domain $Ω$ of $\mathbb R^N$, with $m>1$, $p,q>0$ and $M\in\mathbb R$. Following some ideas used in \cite{BV,Vron1}, and by using a direct Bernstein method combined with Keller-Osserman's estimate, we obtain several a priori estimates as well as Liouville type theorems. Moreover, we p…
▽ More
In this article we study local and global properties of positive solutions of $-Δ_mu=|u|^{p-1}u+M|\nabla u|^q$ in a domain $Ω$ of $\mathbb R^N$, with $m>1$, $p,q>0$ and $M\in\mathbb R$. Following some ideas used in \cite{BV,Vron1}, and by using a direct Bernstein method combined with Keller-Osserman's estimate, we obtain several a priori estimates as well as Liouville type theorems. Moreover, we prove a local Harnack inequality with the help of Serrin's classical results.
△ Less
Submitted 25 June, 2022; v1 submitted 16 May, 2022;
originally announced May 2022.
-
Non-spectrality of Moran measures with consecutive digits
Authors:
Ya-Li Zheng,
Wen-Hui Ai
Abstract:
Let $ρ=(\frac{p}{q})^{\frac{1}{r}}<1$ for some $p,q,r\in\mathbb{N}$ with $(p,q)=1$ and $\mathcal{D}_{n}=\{0,1,\cdot\cdot\cdot,N_{n}-1\}$, where $N_{n}$ is prime for all $n\in\mathbb{N}$, and denote $M=\sup\{N_{n}:n=1,2,3,\ldots\}<\infty$. The associated Borel probability measure $$μ_{ρ,\{\mathcal{D}_{n}\}}=δ_{ρ\mathcal{D}_{1}}*δ_{ρ^{2}\mathcal{D}_{2}}*δ_{ρ^{3}\mathcal{D}_{3}}*\cdots$$ is called a…
▽ More
Let $ρ=(\frac{p}{q})^{\frac{1}{r}}<1$ for some $p,q,r\in\mathbb{N}$ with $(p,q)=1$ and $\mathcal{D}_{n}=\{0,1,\cdot\cdot\cdot,N_{n}-1\}$, where $N_{n}$ is prime for all $n\in\mathbb{N}$, and denote $M=\sup\{N_{n}:n=1,2,3,\ldots\}<\infty$. The associated Borel probability measure $$μ_{ρ,\{\mathcal{D}_{n}\}}=δ_{ρ\mathcal{D}_{1}}*δ_{ρ^{2}\mathcal{D}_{2}}*δ_{ρ^{3}\mathcal{D}_{3}}*\cdots$$ is called a Moran measure. Recently, Deng and Li proved that $μ_{ρ,\{\mathcal{D}_{n}\}}$ is a spectral measure if and only if $\frac{1}{N_{n}ρ}$ is an integer for all $n\geq 2$. In this paper, we prove that if $L^{2}(μ_{ρ, \{\mathcal{D}_{n}\}})$ contains an infinite orthogonal exponential set, then there exist infinite positive integers $n_{l}$ such that $(q,N_{n_{l}})>1$. Contrastly, if $(q,N_{n})=1$ and $(p,N_{n})=1$ for all $n\in\mathbb{N}$, then there are at most $M$ mutually orthogonal exponential functions in $L^{2}(μ_{ρ, \{\mathcal{D}_{n}\}})$ and $M$ is the best possible. If $(q,N_{n})=1$ and $(p,N_{n})>1$ for all $n\in\mathbb{N}$, then there are any number of orthogonal exponential functions in $L^{2}(μ_{ρ, \{\mathcal{D}_{n}\}})$.
△ Less
Submitted 6 May, 2022;
originally announced May 2022.
-
Distributed Coverage Control of Multi-Agent Systems in Uncertain Environments using Heat Transfer Equations
Authors:
Yinan Zheng,
Chao Zhai
Abstract:
This paper addresses the coverage control problem of multi-agent systems in the uncertain environment. With the aid of Voronoi partition, a distributed coverage control formulation of multi-agent system is proposed to complete the workload in uncertain environments. Driven by the gradient of thermal field, each agent is able to move around for clearing the workload on its own subregion. Theoretica…
▽ More
This paper addresses the coverage control problem of multi-agent systems in the uncertain environment. With the aid of Voronoi partition, a distributed coverage control formulation of multi-agent system is proposed to complete the workload in uncertain environments. Driven by the gradient of thermal field, each agent is able to move around for clearing the workload on its own subregion. Theoretical analysis is conducted to ensure the completion of workload in finite time. Finally, numerical simulations are carried out to demonstrate the effectiveness and advantages of the proposed coverage control approach as compared to other existing approaches.
△ Less
Submitted 20 April, 2022;
originally announced April 2022.
-
Iterative Inner/outer Approximations for Scalable Semidefinite Programs using Block Factor-width-two Matrices
Authors:
Feng-Yi Liao,
Yang Zheng
Abstract:
In this paper, we propose iterative inner/outer approximations based on a recent notion of block factor-width-two matrices for solving semidefinite programs (SDPs). Our inner/outer approximating algorithms generate a sequence of upper/lower bounds of increasing accuracy for the optimal SDP cost. The block partition in our algorithms offers flexibility in terms of both numerical efficiency and solu…
▽ More
In this paper, we propose iterative inner/outer approximations based on a recent notion of block factor-width-two matrices for solving semidefinite programs (SDPs). Our inner/outer approximating algorithms generate a sequence of upper/lower bounds of increasing accuracy for the optimal SDP cost. The block partition in our algorithms offers flexibility in terms of both numerical efficiency and solution quality, which includes the approach of scaled diagonally dominance (SDD) approximation as a special case. We discuss both the theoretical results and numerical implementation in detail. Our main theorems guarantee that the proposed iterative algorithms generate monotonically decreasing upper (increasing lower) bounds. Extensive numerical results confirm our findings.
△ Less
Submitted 29 September, 2022; v1 submitted 14 April, 2022;
originally announced April 2022.
-
Lyapunov-Guided Representation of Recurrent Neural Network Performance
Authors:
Ryan Vogt,
Yang Zheng,
Eli Shlizerman
Abstract:
Recurrent Neural Networks (RNN) are ubiquitous computing systems for sequences and multivariate time series data. While several robust architectures of RNN are known, it is unclear how to relate RNN initialization, architecture, and other hyperparameters with accuracy for a given task. In this work, we propose to treat RNN as dynamical systems and to correlate hyperparameters with accuracy through…
▽ More
Recurrent Neural Networks (RNN) are ubiquitous computing systems for sequences and multivariate time series data. While several robust architectures of RNN are known, it is unclear how to relate RNN initialization, architecture, and other hyperparameters with accuracy for a given task. In this work, we propose to treat RNN as dynamical systems and to correlate hyperparameters with accuracy through Lyapunov spectral analysis, a methodology specifically designed for nonlinear dynamical systems. To address the fact that RNN features go beyond the existing Lyapunov spectral analysis, we propose to infer relevant features from the Lyapunov spectrum with an Autoencoder and an embedding of its latent representation (AeLLE). Our studies of various RNN architectures show that AeLLE successfully correlates RNN Lyapunov spectrum with accuracy. Furthermore, the latent representation learned by AeLLE is generalizable to novel inputs from the same task and is formed early in the process of RNN training. The latter property allows for the prediction of the accuracy to which RNN would converge when training is complete. We conclude that representation of RNN through Lyapunov spectrum along with AeLLE provides a novel method for organization and interpretation of variants of RNN architectures.
△ Less
Submitted 27 December, 2023; v1 submitted 11 April, 2022;
originally announced April 2022.
-
On equivalence relations induced by Polish groups
Authors:
Longyun Ding,
Yang Zheng
Abstract:
The motivation of this article is to introduce a kind of orbit equivalence relations which can well describe structures and properties of Polish groups from the perspective of Borel reducibility. Given a Polish group $G$, let $E(G)$ be the right coset equivalence relation $G^ω/c(G)$, where $c(G)$ is the group of all convergent sequences in $G$.
Let $G$ be a Polish group. (1) $G$ is a discrete co…
▽ More
The motivation of this article is to introduce a kind of orbit equivalence relations which can well describe structures and properties of Polish groups from the perspective of Borel reducibility. Given a Polish group $G$, let $E(G)$ be the right coset equivalence relation $G^ω/c(G)$, where $c(G)$ is the group of all convergent sequences in $G$.
Let $G$ be a Polish group. (1) $G$ is a discrete countable group containing at least two elements iff $E(G)\sim_BE_0$; (2) if $G$ is TSI uncountable non-archimedean, then $E(G)\sim_BE_0^ω$; (3) $G$ is non-archimedean iff $E(G)\le_B=^+$; (4) let $G$ be a non-CLI Polish group and $H$ a CLI Polish group, then $E(G)\not\le_BE_H^Y$ for any Polish $H$-space $Y$; (5) if $H$ is a non-archimedean Polish group but $G$ is not, then $E(G)\not\le_BE_H^Y$ for any Polish $H$-space $Y$.
The notion of $α$-unbalanced Polish group for $α<ω_1$ is introduced. Let $G,H$ be Polish groups, $0<α<ω_1$. If $G$ is $α$-unbalanced but $H$ is not, then $E(G)\not\le_B E(H)$.
For TSI Polish groups, the existence of Borel reduction is transformed into the existence of a well-behaved continuous map** between topological groups. As its applications, for any Lie group $G$, denote $G_0$ the connected component of the identity element $1_G$. Let $G$ and $H$ be two separable TSI Lie groups. If $E(G)\le_BE(H)$, then there exists a continuous locally injection $S:G_0\to H_0$. Moreover, if $G_0,H_0$ are abelian, $S$ is a group homomorphism. Particularly, for $c_0,e_0,c_1,e_1\in{\mathbb N}$, $E({\mathbb R}^{c_0}\times{\mathbb T}^{e_0})\le_BE({\mathbb R}^{c_1}\times{\mathbb T}^{e_1})$ iff $e_0\le e_1$ and $c_0+e_0\le c_1+e_1$.
△ Less
Submitted 27 September, 2022; v1 submitted 9 April, 2022;
originally announced April 2022.
-
Esca** High-order Saddles in Policy Optimization for Linear Quadratic Gaussian (LQG) Control
Authors:
Yang Zheng,
Yue Sun,
Maryam Fazel,
Na Li
Abstract:
First order policy optimization has been widely used in reinforcement learning. It guarantees to find the optimal policy for the state-feedback linear quadratic regulator (LQR). However, the performance of policy optimization remains unclear for the linear quadratic Gaussian (LQG) control where the LQG cost has spurious suboptimal stationary points. In this paper, we introduce a novel perturbed po…
▽ More
First order policy optimization has been widely used in reinforcement learning. It guarantees to find the optimal policy for the state-feedback linear quadratic regulator (LQR). However, the performance of policy optimization remains unclear for the linear quadratic Gaussian (LQG) control where the LQG cost has spurious suboptimal stationary points. In this paper, we introduce a novel perturbed policy gradient (PGD) method to escape a large class of bad stationary points (including high-order saddles). In particular, based on the specific structure of LQG, we introduce a novel reparameterization procedure which converts the iterate from a high-order saddle to a strict saddle, from which standard random perturbations in PGD can escape efficiently. We further characterize the high-order saddles that can be escaped by our algorithm.
△ Less
Submitted 2 April, 2022;
originally announced April 2022.
-
Convex Parameterization of Stabilizing Controllers and its LMI-based Computation via Filtering
Authors:
Mauricio C. de Oliveira,
Yang Zheng
Abstract:
Various new implicit parameterizations for stabilizing controllers that allow one to impose structural constraints on the controller have been proposed lately. They are convex but infinite-dimensional, formulated in the frequency domain with no available efficient methods for computation. In this paper, we introduce a kernel version of the Youla parameterization to characterize the set of stabiliz…
▽ More
Various new implicit parameterizations for stabilizing controllers that allow one to impose structural constraints on the controller have been proposed lately. They are convex but infinite-dimensional, formulated in the frequency domain with no available efficient methods for computation. In this paper, we introduce a kernel version of the Youla parameterization to characterize the set of stabilizing controllers. It features a single affine constraint, which allows us to recast the controller parameterization as a novel robust filtering problem. This makes it possible to derive the first efficient Linear Matrix Inequality (LMI) implicit parametrization of stabilizing controllers. Our LMI characterization not only admits efficient numerical computation, but also guarantees a full-order stabilizing dynamical controller that is efficient for practical deployment. Numerical experiments demonstrate that our LMI can be orders of magnitude faster to solve than the existing closed-loop parameterizations.
△ Less
Submitted 31 March, 2022;
originally announced March 2022.