-
Safe and Stable Filter Design Using a Relaxed Compatibitlity Control Barrier -- Lyapunov Condition
Authors:
Han Wang,
Kostas Margellos,
Antonis Papachristodoulou
Abstract:
In this paper, we propose a quadratic programming-based filter for safe and stable controller design, via a Control Barrier Function (CBF) and a Control Lyapunov Function (CLF). Our method guarantees safety and local asymptotic stability without the need for an asymptotically stabilizing control law. Feasibility of the proposed program is ensured under a mild regularity condition, termed relaxed c…
▽ More
In this paper, we propose a quadratic programming-based filter for safe and stable controller design, via a Control Barrier Function (CBF) and a Control Lyapunov Function (CLF). Our method guarantees safety and local asymptotic stability without the need for an asymptotically stabilizing control law. Feasibility of the proposed program is ensured under a mild regularity condition, termed relaxed compatibility between the CLF and CBF. The resulting optimal control law is guaranteed to be locally Lipschitz continuous. We also analyze the closed-loop behaviour by characterizing the equilibrium points, and verifying that there are no equilibrium points in the interior of the control invariant set except at the origin. For a polynomial system and a semi-algebraic safe set, we provide a sum-of-squares program to design a relaxed compatible pair of CLF and CBF. The proposed approach is compared with other methods in the literature using numerical examples, exhibits superior filter performance and guarantees safety and local stability.
△ Less
Submitted 29 June, 2024;
originally announced July 2024.
-
Efficient Low-rank Identification via Accelerated Iteratively Reweighted Nuclear Norm Minimization
Authors:
Hao Wang,
Ye Wang,
Xiangyu Yang
Abstract:
This paper considers the problem of minimizing the sum of a smooth function and the Schatten-$p$ norm of the matrix. Our contribution involves proposing accelerated iteratively reweighted nuclear norm methods designed for solving the nonconvex low-rank minimization problem. Two major novelties characterize our approach. Firstly, the proposed method possesses a rank identification property, enablin…
▽ More
This paper considers the problem of minimizing the sum of a smooth function and the Schatten-$p$ norm of the matrix. Our contribution involves proposing accelerated iteratively reweighted nuclear norm methods designed for solving the nonconvex low-rank minimization problem. Two major novelties characterize our approach. Firstly, the proposed method possesses a rank identification property, enabling the provable identification of the "correct" rank of the stationary point within a finite number of iterations. Secondly, we introduce an adaptive updating strategy for smoothing parameters. This strategy automatically fixes parameters associated with zero singular values as constants upon detecting the "correct" rank while quickly driving the rest of the parameters to zero. This adaptive behavior transforms the algorithm into one that effectively solves smooth problems after a few iterations, setting our work apart from existing iteratively reweighted methods for low-rank optimization. We prove the global convergence of the proposed algorithm, guaranteeing that every limit point of the iterates is a critical point. Furthermore, a local convergence rate analysis is provided under the Kurdyka-Łojasiewicz property. We conduct numerical experiments using both synthetic and real data to showcase our algorithm's efficiency and superiority over existing methods.
△ Less
Submitted 26 June, 2024; v1 submitted 21 June, 2024;
originally announced June 2024.
-
Enhancing supply chain security with automated machine learning
Authors:
Haibo Wang,
Lutfu S. Sua,
Bahram Alidaee
Abstract:
This study tackles the complexities of global supply chains, which are increasingly vulnerable to disruptions caused by port congestion, material shortages, and inflation. To address these challenges, we explore the application of machine learning methods, which excel in predicting and optimizing solutions based on large datasets. Our focus is on enhancing supply chain security through fraud detec…
▽ More
This study tackles the complexities of global supply chains, which are increasingly vulnerable to disruptions caused by port congestion, material shortages, and inflation. To address these challenges, we explore the application of machine learning methods, which excel in predicting and optimizing solutions based on large datasets. Our focus is on enhancing supply chain security through fraud detection, maintenance prediction, and material backorder forecasting. We introduce an automated machine learning framework that streamlines data analysis, model construction, and hyperparameter optimization for these tasks. By automating these processes, our framework improves the efficiency and effectiveness of supply chain security measures. Our research identifies key factors that influence machine learning performance, including sampling methods, categorical encoding, feature selection, and hyperparameter optimization. We demonstrate the importance of considering these factors when applying machine learning to supply chain challenges. Traditional mathematical programming models often struggle to cope with the complexity of large-scale supply chain problems. Our study shows that machine learning methods can provide a viable alternative, particularly when dealing with extensive datasets and complex patterns. The automated machine learning framework presented in this study offers a novel approach to supply chain security, contributing to the existing body of knowledge in the field. Its comprehensive automation of machine learning processes makes it a valuable contribution to the domain of supply chain management.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
Global-in-time energy stability: a powerful analysis tool for the gradient flow problem without maximum principle or Lipschitz assumption
Authors:
J. Sun,
H. Wang,
H. Zhang,
X. Qian,
S. Song
Abstract:
Before proving (unconditional) energy stability for gradient flows, most existing studies either require a strong Lipschitz condition regarding the non-linearity or certain $L^{\infty}$ bounds on the numerical solutions (the maximum principle). However, proving energy stability without such premises is a very challenging task. In this paper, we aim to develop a novel analytical tool, namely global…
▽ More
Before proving (unconditional) energy stability for gradient flows, most existing studies either require a strong Lipschitz condition regarding the non-linearity or certain $L^{\infty}$ bounds on the numerical solutions (the maximum principle). However, proving energy stability without such premises is a very challenging task. In this paper, we aim to develop a novel analytical tool, namely global-in-time energy stability, to demonstrate energy dissipation without assuming any strong Lipschitz condition or $L^{\infty}$ boundedness. The fourth-order-in-space Swift-Hohenberg equation is used to elucidate the theoretical results in detail. We also propose a temporal second-order accurate scheme for efficiently solving such a strongly stiff equation. Furthermore, we present the corresponding optimal $L^2$ error estimate and provide several numerical simulations to demonstrate the dynamics.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
A Unified Framework for Integer Programming Formulation of Graph Matching Problems
Authors:
Bahram Alidaee,
Haibo Wang,
Hugh Sloan
Abstract:
Graph theory has been a powerful tool in solving difficult and complex problems arising in all disciplines. In particular, graph matching is a classical problem in pattern analysis with enormous applications. Many graph problems have been formulated as a mathematical program and then solved using exact, heuristic, and/or approximated-guaranteed procedures. On the other hand, graph theory has been…
▽ More
Graph theory has been a powerful tool in solving difficult and complex problems arising in all disciplines. In particular, graph matching is a classical problem in pattern analysis with enormous applications. Many graph problems have been formulated as a mathematical program and then solved using exact, heuristic, and/or approximated-guaranteed procedures. On the other hand, graph theory has been a powerful tool in visualizing and understanding complex mathematical programming problems, especially integer programs. Formulating a graph problem as a natural integer program (IP) is often a challenging task. However, an IP formulation of the problem has many advantages. Several researchers have noted the need for natural IP formulation of graph theoretic problems. The present study aims to provide a unified framework for IP formulation of graph-matching problems. Although there are many surveys on graph matching problems, none is concerned with IP formulation. This paper is the first to provide a comprehensive IP formulation for such problems. The framework includes a variety of graph optimization problems in the literature. While these problems have been studied by different research communities, however, the framework presented here helps to bring efforts from different disciplines to tackle such diverse and complex problems. We hope the present study can significantly help to simplify some of the difficult problems arising in practice, especially in pattern analysis.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
Fast Adaptive Meta-Heuristic for Large-Scale Facility Location Problem
Authors:
Bahram Alidaee,
Haibo Wang
Abstract:
Facility location problems have been a major research area of interest in the last several decades. In particular, uncapacitated location problems (ULP) have enormous applications. Variations of ULP often appear, especially as large-scale subproblems in more complex combinatorial optimization problems. Although many researchers have studied different versions of ULP (e.g., uncapacitated facility l…
▽ More
Facility location problems have been a major research area of interest in the last several decades. In particular, uncapacitated location problems (ULP) have enormous applications. Variations of ULP often appear, especially as large-scale subproblems in more complex combinatorial optimization problems. Although many researchers have studied different versions of ULP (e.g., uncapacitated facility location problem (UCFLP) and p-Median problem), most of these authors have considered small to moderately sized problems. In this paper, we address the ULP and provide a fast adaptive meta-heuristic for large-scale problems. The approach is based on critical event memory tabu search. For the diversification component of the algorithm, we have chosen a procedure based on a sequencing problem commonly used for traveling salesman-type problems. The efficacy of this approach is evaluated across a diverse range of benchmark problems sourced from the Internet, with a comprehensive comparison against four prominent algorithms in the literature. The proposed adaptive critical event tabu search (ACETS) demonstrates remarkable effectiveness for large-scale problems. The algorithm successfully solved all problems optimally within a short computing time. Notably, ACETS discovered three best new solutions for benchmark problems, specifically for Asymmetric 500A-1, Asymmetric 750A-1, and Symmetric 750B-4, underscoring its innovative and robust nature.
△ Less
Submitted 17 June, 2024; v1 submitted 11 June, 2024;
originally announced June 2024.
-
Szemerédi-Trotter bounds for tubes and applications
Authors:
Ciprian Demeter,
Hong Wang
Abstract:
We prove sharp estimates for incidences involving planar tubes that satisfy packing conditions. We apply them to improve the estimates for the Fourier transform of fractal measures supported on planar curves.
We prove sharp estimates for incidences involving planar tubes that satisfy packing conditions. We apply them to improve the estimates for the Fourier transform of fractal measures supported on planar curves.
△ Less
Submitted 21 June, 2024; v1 submitted 10 June, 2024;
originally announced June 2024.
-
Boundedness for maximal operators over hypersurfaces in $\mathbb{R}^3$
Authors:
Wenjuan Li,
Huiju Wang
Abstract:
In this article, we study maximal functions related to hypersurfaces with vanishing Gaussian curvature in $\mathbb{R}^3$. Firstly, we characterize the $L^p\rightarrow L^q$ boundedness of local maximal operators along homogeneous hypersurfaces. Moreover, weighted $L^p$-estimates are obtained for the corresponding global operators. Secondly, for a class of hypersurfaces that lack a homogeneous struc…
▽ More
In this article, we study maximal functions related to hypersurfaces with vanishing Gaussian curvature in $\mathbb{R}^3$. Firstly, we characterize the $L^p\rightarrow L^q$ boundedness of local maximal operators along homogeneous hypersurfaces. Moreover, weighted $L^p$-estimates are obtained for the corresponding global operators. Secondly, for a class of hypersurfaces that lack a homogeneous structure and pass through the origin, we attempt to look for other geometric properties instead of height of hypersurfaces to characterize the optimal $L^p$-boundedness of the corresponding global maximal operators.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
Numerical approximation for variable-exponent fractional diffusion-wave equation
Authors:
Xiangcheng Zheng,
Hong Wang,
Wenlin Qiu
Abstract:
This work considers the variable-exponent fractional diffusion-wave equation, which describes, e.g. the propagation of mechanical diffusive waves in viscoelastic media with varying material properties. Rigorous mathematical and numerical analysis for this model is not available in the literature, partly because the variable-exponent Abel kernel may not be positive definite or monotonic. We overcom…
▽ More
This work considers the variable-exponent fractional diffusion-wave equation, which describes, e.g. the propagation of mechanical diffusive waves in viscoelastic media with varying material properties. Rigorous mathematical and numerical analysis for this model is not available in the literature, partly because the variable-exponent Abel kernel may not be positive definite or monotonic. We overcome these difficulties to design two numerical schemes and derive their stability and error estimate based on the proved solution regularity, with $α(0)$-order and second-order accuracy in time, respectively. Numerical experiments are presented to substantiate the theoretical findings.
△ Less
Submitted 2 July, 2024; v1 submitted 5 June, 2024;
originally announced June 2024.
-
Congruence properties of the coefficients of the classical modular polynomials
Authors:
Haiyang Wang
Abstract:
The classical modular polynomials $Φ_\ell(X,Y)$ give plane curve models for the modular curves $X_0(\ell)/\mathbb{Q}$ and have been extensively studied. In this article, we provide closed formulas for $\ell$ nontrivial coefficients of the classical modular polynomials $Φ_\ell(X,Y)$ in terms of the Fourier coefficients of the modular invariant function $j(z)$ for a prime $\ell$. Our interest in the…
▽ More
The classical modular polynomials $Φ_\ell(X,Y)$ give plane curve models for the modular curves $X_0(\ell)/\mathbb{Q}$ and have been extensively studied. In this article, we provide closed formulas for $\ell$ nontrivial coefficients of the classical modular polynomials $Φ_\ell(X,Y)$ in terms of the Fourier coefficients of the modular invariant function $j(z)$ for a prime $\ell$. Our interest in the formulas were motivated by our conjectures on congruences modulo powers of the primes $2,3$ and $5$ satisfied by the coefficients of these polynomials. We deduce congruences from these formulas supporting the conjectures.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
On the Kodaira types of elliptic curves with potentially good supersingular reduction
Authors:
Haiyang Wang
Abstract:
Let $\mathcal{O}_K$ be a Henselian discrete valuation domain with field of fractions $K$. Assume that $\mathcal{O}_K$ has algebraically closed residue field $k$. Let $E/K$ be an elliptic curve with additive reduction. The semi-stable reduction theorem asserts that there exists a minimal extension $L/K$ such that the base change $E_L/L$ has semi-stable reduction.
It is natural to wonder whether s…
▽ More
Let $\mathcal{O}_K$ be a Henselian discrete valuation domain with field of fractions $K$. Assume that $\mathcal{O}_K$ has algebraically closed residue field $k$. Let $E/K$ be an elliptic curve with additive reduction. The semi-stable reduction theorem asserts that there exists a minimal extension $L/K$ such that the base change $E_L/L$ has semi-stable reduction.
It is natural to wonder whether specific properties of the semi-stable reduction and of the extension $L/K$ impose restrictions on what types of Kodaira type the special fiber of $E/K$ may have. In this paper we study the restrictions imposed on the reduction type when the extension $L/K$ is wildly ramified of degree $2$, and the curve $E/K$ has potentially good supersingular reduction. We also analyze the possible reduction types of two isogenous elliptic curves with these properties.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
Profiled Transfer Learning for High Dimensional Linear Model
Authors:
Ziqian Lin,
Junlong Zhao,
Fang Wang,
Hansheng Wang
Abstract:
We develop here a novel transfer learning methodology called Profiled Transfer Learning (PTL). The method is based on the \textit{approximate-linear} assumption between the source and target parameters. Compared with the commonly assumed \textit{vanishing-difference} assumption and \textit{low-rank} assumption in the literature, the \textit{approximate-linear} assumption is more flexible and less…
▽ More
We develop here a novel transfer learning methodology called Profiled Transfer Learning (PTL). The method is based on the \textit{approximate-linear} assumption between the source and target parameters. Compared with the commonly assumed \textit{vanishing-difference} assumption and \textit{low-rank} assumption in the literature, the \textit{approximate-linear} assumption is more flexible and less stringent. Specifically, the PTL estimator is constructed by two major steps. Firstly, we regress the response on the transferred feature, leading to the profiled responses. Subsequently, we learn the regression relationship between profiled responses and the covariates on the target data. The final estimator is then assembled based on the \textit{approximate-linear} relationship. To theoretically support the PTL estimator, we derive the non-asymptotic upper bound and minimax lower bound. We find that the PTL estimator is minimax optimal under appropriate regularity conditions. Extensive simulation studies are presented to demonstrate the finite sample performance of the new method. A real data example about sentence prediction is also presented with very encouraging results.
△ Less
Submitted 5 June, 2024; v1 submitted 2 June, 2024;
originally announced June 2024.
-
Momentum for the Win: Collaborative Federated Reinforcement Learning across Heterogeneous Environments
Authors:
Han Wang,
Sihong He,
Zhili Zhang,
Fei Miao,
James Anderson
Abstract:
We explore a Federated Reinforcement Learning (FRL) problem where $N$ agents collaboratively learn a common policy without sharing their trajectory data. To date, existing FRL work has primarily focused on agents operating in the same or ``similar" environments. In contrast, our problem setup allows for arbitrarily large levels of environment heterogeneity. To obtain the optimal policy which maxim…
▽ More
We explore a Federated Reinforcement Learning (FRL) problem where $N$ agents collaboratively learn a common policy without sharing their trajectory data. To date, existing FRL work has primarily focused on agents operating in the same or ``similar" environments. In contrast, our problem setup allows for arbitrarily large levels of environment heterogeneity. To obtain the optimal policy which maximizes the average performance across all potentially completely different environments, we propose two algorithms: FedSVRPG-M and FedHAPG-M. In contrast to existing results, we demonstrate that both FedSVRPG-M and FedHAPG-M, both of which leverage momentum mechanisms, can exactly converge to a stationary point of the average performance function, regardless of the magnitude of environment heterogeneity. Furthermore, by incorporating the benefits of variance-reduction techniques or Hessian approximation, both algorithms achieve state-of-the-art convergence results, characterized by a sample complexity of $\mathcal{O}\left(ε^{-\frac{3}{2}}/N\right)$. Notably, our algorithms enjoy linear convergence speedups with respect to the number of agents, highlighting the benefit of collaboration among agents in finding a common policy.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Unisolver: PDE-Conditional Transformers Are Universal PDE Solvers
Authors:
Hang Zhou,
Yuezhou Ma,
Haixu Wu,
Haowen Wang,
Mingsheng Long
Abstract:
Deep models have recently emerged as a promising tool to solve partial differential equations (PDEs), known as neural PDE solvers. While neural solvers trained from either simulation data or physics-informed loss can solve the PDEs reasonably well, they are mainly restricted to a specific set of PDEs, e.g. a certain equation or a finite set of coefficients. This bottleneck limits the generalizabil…
▽ More
Deep models have recently emerged as a promising tool to solve partial differential equations (PDEs), known as neural PDE solvers. While neural solvers trained from either simulation data or physics-informed loss can solve the PDEs reasonably well, they are mainly restricted to a specific set of PDEs, e.g. a certain equation or a finite set of coefficients. This bottleneck limits the generalizability of neural solvers, which is widely recognized as its major advantage over numerical solvers. In this paper, we present the Universal PDE solver (Unisolver) capable of solving a wide scope of PDEs by leveraging a Transformer pre-trained on diverse data and conditioned on diverse PDEs. Instead of simply scaling up data and parameters, Unisolver stems from the theoretical analysis of the PDE-solving process. Our key finding is that a PDE solution is fundamentally under the control of a series of PDE components, e.g. equation symbols, coefficients, and initial and boundary conditions. Inspired by the mathematical structure of PDEs, we define a complete set of PDE components and correspondingly embed them as domain-wise (e.g. equation symbols) and point-wise (e.g. boundaries) conditions for Transformer PDE solvers. Integrating physical insights with recent Transformer advances, Unisolver achieves consistent state-of-the-art results on three challenging large-scale benchmarks, showing impressive gains and endowing favorable generalizability and scalability.
△ Less
Submitted 1 June, 2024; v1 submitted 27 May, 2024;
originally announced May 2024.
-
Sums of four polygonal numbers: precise formulas
Authors:
Jialin Li,
Haowu Wang
Abstract:
In this paper we give unified formulas for the numbers of representations of positive integers as sums of four generalized $m$-gonal numbers, and as restricted sums of four squares under a linear condition, respectively. These formulas are given as $\mathbb{Z}$-linear combinations of Hurwitz class numbers. As applications, we prove several Zhi-Wei Sun's conjectures. As by-products, we obtain formu…
▽ More
In this paper we give unified formulas for the numbers of representations of positive integers as sums of four generalized $m$-gonal numbers, and as restricted sums of four squares under a linear condition, respectively. These formulas are given as $\mathbb{Z}$-linear combinations of Hurwitz class numbers. As applications, we prove several Zhi-Wei Sun's conjectures. As by-products, we obtain formulas for expressing the Fourier coefficients of $\vartheta(τ,z)^4$, $η(τ)^{12}$, $η(τ)^4$ and $η(τ)^8η(2τ)^8$ in terms of the Hurwitz class numbers, respectively. The proof is based on the theory of Jacobi forms.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
The distributed biased min-consensus protocol revisited: pre-specified finite time control strategies and small-gain based analysis
Authors:
Yuanqiu Mo,
He Wang
Abstract:
Unlike the classical distributed consensus protocols enabling the group of agents as a whole to reach an agreement regarding a certain quantity of interest in a distributed fashion, the distributed biased min-consensus protocol (DBMC) has been proven to generate advanced complexity pertaining to solving the shortest path problem. As such a protocol is commonly incorporated as the first step of a h…
▽ More
Unlike the classical distributed consensus protocols enabling the group of agents as a whole to reach an agreement regarding a certain quantity of interest in a distributed fashion, the distributed biased min-consensus protocol (DBMC) has been proven to generate advanced complexity pertaining to solving the shortest path problem. As such a protocol is commonly incorporated as the first step of a hierarchical architecture in real applications, e.g., robots path planning, management of dispersed computing services, an impedance limiting the application potential of DBMC lies in, the lack of results regarding to its convergence within a user-assigned time. In this paper, we first propose two control strategies ensuring the state error of DBMC decrease exactly to zero or a desired level manipulated by the user, respectively. To compensate the high feedback gains incurred by these two control strategies, this paper further investigates the nominal DBMC itself. By leveraging small gain based stability tools, this paper also proves the global exponential input-to-state stability of DBMC, outperforming its current stability results. Simulations have been provided to validate the efficacy of our theoretical result.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
Convergence Rates of Online Critic Value Function Approximation in Native Spaces
Authors:
Shengyuan Niu,
Ali Bouland,
Haoran Wang,
Filippos Fotiadis,
Andrew Kurdila,
Andrea L'Afflitto,
Sai Tej Paruchuri,
Kyriakos G. Vamvoudakis
Abstract:
In this paper, the evolution equation that defines the online critic for the approximation of the optimal value function is cast in a general class of reproducing kernel Hilbert spaces (RKHSs). Exploiting some core tools of RKHS theory, this formulation allows deriving explicit bounds on the performance of the critic in terms of the kernel and definition of the RKHS, the number of basis functions,…
▽ More
In this paper, the evolution equation that defines the online critic for the approximation of the optimal value function is cast in a general class of reproducing kernel Hilbert spaces (RKHSs). Exploiting some core tools of RKHS theory, this formulation allows deriving explicit bounds on the performance of the critic in terms of the kernel and definition of the RKHS, the number of basis functions, and the location of centers used to define scattered bases. The performance of the critic is precisely measured in terms of the power function of the scattered basis used in approximations, and it can be used either in an a priori evaluation of potential bases or in an a posteriori assessments of value function error for basis enrichment or pruning. The most concise bounds in the paper describe explicitly how the critic performance depends on the placement of centers, as measured by their fill distance in a subset that contains the trajectory of the critic.
△ Less
Submitted 28 May, 2024; v1 submitted 9 May, 2024;
originally announced May 2024.
-
Higher Kazhdan projections and delocalised $\ell^ 2$-Betti numbers
Authors:
Sanaz Pooya,
Hang Wang
Abstract:
We provide an explicit description of the K-classes of higher Kazhdan projections in degrees greater than 0 for specific free product groups and Cartesian product groups. Employing this description, we obtain new calculations of Lott's delocalised $\ell^2$-Betti numbers. Notably, we establish the first non-vanishing results for infinite groups.
We provide an explicit description of the K-classes of higher Kazhdan projections in degrees greater than 0 for specific free product groups and Cartesian product groups. Employing this description, we obtain new calculations of Lott's delocalised $\ell^2$-Betti numbers. Notably, we establish the first non-vanishing results for infinite groups.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
Data-Driven Stable Neural Feedback Loop Design
Authors:
Zuxun Xiong,
Han Wang,
Liqun Zhao,
Antonis Papachristodoulou
Abstract:
This paper proposes a data-driven approach to design a feedforward Neural Network (NN) controller with a stability guarantee for systems with unknown dynamics. We first introduce data-driven representations of stability conditions for Neural Feedback Loops (NFLs) with linear plants. These conditions are then formulated into a semidefinite program (SDP). Subsequently, this SDP constraint is integra…
▽ More
This paper proposes a data-driven approach to design a feedforward Neural Network (NN) controller with a stability guarantee for systems with unknown dynamics. We first introduce data-driven representations of stability conditions for Neural Feedback Loops (NFLs) with linear plants. These conditions are then formulated into a semidefinite program (SDP). Subsequently, this SDP constraint is integrated into the NN training process resulting in a stable NN controller. We propose an iterative algorithm to solve this problem efficiently. Finally, we illustrate the effectiveness of the proposed method and its superiority compared to model-based methods via numerical examples.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
The Lp Polar bodies of shadow system and related inequalities
Authors:
Lujun Guo,
Hanxiao Wang
Abstract:
The $L_p$ versions of the support function and polar body are introduced by Berndtsson, Mastrantonis and Rubinstein in \cite{Berndtsson-Mastrantonis-Rubinstein-2023} recently. In this paper, we prove that the $L_p$-support function of the shadow system $K_t$ introduced by Rogers and Shephard in \cite{rogers-1958-02,shephard-1964} is convex and the volume of the section of $L_p$ polar bodies of…
▽ More
The $L_p$ versions of the support function and polar body are introduced by Berndtsson, Mastrantonis and Rubinstein in \cite{Berndtsson-Mastrantonis-Rubinstein-2023} recently. In this paper, we prove that the $L_p$-support function of the shadow system $K_t$ introduced by Rogers and Shephard in \cite{rogers-1958-02,shephard-1964} is convex and the volume of the section of $L_p$ polar bodies of $K_t$ is $\frac{1}{n}$-concave with respect to parameter $t$, and obtain some related inequalities. Finally, we present the reverse Rogers-Shephard type inequality for $L_p$-polar bodies.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
Accurate adaptive deep learning method for solving elliptic problems
Authors:
**gyong Ying,
Yaqi Xie,
Jiao Li,
Hongqiao Wang
Abstract:
Deep learning method is of great importance in solving partial differential equations. In this paper, inspired by the failure-informed idea proposed by Gao et.al. (SIAM Journal on Scientific Computing 45(4)(2023)) and as an improvement, a new accurate adaptive deep learning method is proposed for solving elliptic problems, including the interface problems and the convection-dominated problems. Bas…
▽ More
Deep learning method is of great importance in solving partial differential equations. In this paper, inspired by the failure-informed idea proposed by Gao et.al. (SIAM Journal on Scientific Computing 45(4)(2023)) and as an improvement, a new accurate adaptive deep learning method is proposed for solving elliptic problems, including the interface problems and the convection-dominated problems. Based on the failure probability framework, the piece-wise uniform distribution is used to approximate the optimal proposal distribution and an kernel-based method is proposed for efficient sampling. Together with the improved Levenberg-Marquardt optimization method, the proposed adaptive deep learning method shows great potential in improving solution accuracy. Numerical tests on the elliptic problems without interface conditions, on the elliptic interface problem, and on the convection-dominated problems demonstrate the effectiveness of the proposed method, as it reduces the relative errors by a factor varying from $10^2$ to $10^4$ for different cases.
△ Less
Submitted 29 April, 2024;
originally announced April 2024.
-
The inversion number of dijoins and blow-up digraphs
Authors:
Haozhe Wang,
Yuxuan Yang,
Mei Lu
Abstract:
For an oriented graph $D$, the $inversion$ of $X \subseteq V(D)$ in $D$ is the digraph obtained from $D$ by reversing the direction of all arcs with both ends in $X$. The inversion number of $D$, denoted by $inv(D)$, is the minimum number of inversions needed to transform $D$ into an acyclic digraph. In this paper, we first show that $inv (\overrightarrow{C_3} \Rightarrow D)= inv(D) +1$ for any or…
▽ More
For an oriented graph $D$, the $inversion$ of $X \subseteq V(D)$ in $D$ is the digraph obtained from $D$ by reversing the direction of all arcs with both ends in $X$. The inversion number of $D$, denoted by $inv(D)$, is the minimum number of inversions needed to transform $D$ into an acyclic digraph. In this paper, we first show that $inv (\overrightarrow{C_3} \Rightarrow D)= inv(D) +1$ for any oriented graph $\textit{D}$ with even inversion number $inv(D)$, where the dijoin $\overrightarrow{C_3} \Rightarrow D$ is the oriented graph obtained from the disjoint union of $\overrightarrow{C_3}$ and $D$ by adding all arcs from $\overrightarrow{C_3}$ to $D$. Thus we disprove the conjecture of Aubian el at. \cite{2212.09188} and the conjecture of Alon el at. \cite{2212.11969}. We also study the blow-up graph which is an oriented graph obtained from a tournament by replacing all vertices into oriented graphs. We construct a tournament $T$ with order $n$ and $inv(T)=\frac{n}{3}+1$ using blow-up graphs.
△ Less
Submitted 24 April, 2024; v1 submitted 23 April, 2024;
originally announced April 2024.
-
On the CR Nirenberg problem: density and multiplicity of solutions
Authors:
Zhongwei Tang,
Heming Wang,
Bingwei Zhang
Abstract:
We prove some results on the density and multiplicity of positive solutions to the prescribed Webster scalar curvature problem on the $(2n+1)$-dimensional standard unit CR sphere $(\mathbb{S} ^{2n+1},θ_0)$. Specifically, we construct arbitrarily many multi-bump solutions via the variational gluing method. In particular, we show the Webster scalar curvature functions of contact forms conformal to…
▽ More
We prove some results on the density and multiplicity of positive solutions to the prescribed Webster scalar curvature problem on the $(2n+1)$-dimensional standard unit CR sphere $(\mathbb{S} ^{2n+1},θ_0)$. Specifically, we construct arbitrarily many multi-bump solutions via the variational gluing method. In particular, we show the Webster scalar curvature functions of contact forms conformal to $θ_0$ are $C^{0}$-dense among bounded functions which are positive somewhere. Existence results of infinitely many positive solutions to the related equation $-Δ_{\mathbb{H}} u=R(ξ) u^{(n+2) /n}$ on the Heisenberg group $\Hn $ with $R(ξ)$ being asymptotically periodic with respect to left translation are also obtained. Our proofs make use of a refined analysis of bubbling behavior, gradient flow, Pohozaev identity, as well as blow up arguments.
△ Less
Submitted 21 April, 2024;
originally announced April 2024.
-
Quantitative homogenization and hydrodynamic limit of non-gradient exclusion process
Authors:
Tadahisa Funaki,
Chenlin Gu,
Han Wang
Abstract:
For the non-gradient exclusion process, we prove the quantitative homogenization in the approximation of the diffusion matrix and the conductivity by local functions. The proof relies on the renormalization approach developed by Armstrong, Kuusi, Mourrat, and Smart, while the new challenge here is the hard core constraint of particle number on every site. Therefore, a coarse-grained method is prop…
▽ More
For the non-gradient exclusion process, we prove the quantitative homogenization in the approximation of the diffusion matrix and the conductivity by local functions. The proof relies on the renormalization approach developed by Armstrong, Kuusi, Mourrat, and Smart, while the new challenge here is the hard core constraint of particle number on every site. Therefore, a coarse-grained method is proposed to lift the configuration to a larger space without exclusion, and a gradient coupling between two systems is applied to capture the spatial cancellation. We then strengthen the convergence rate to be uniform concerning the density and integrate it into the work by Funaki, Uchiyama, and Yau [IMA Vol. Math. Appl., 77 (1996), pp. 1-40.] to yield a quantitative hydrodynamic limit. Our new approach avoids showing the characterization of closed forms and provides stronger results. The extension is discussed for the model in the presence of disorder on the bonds.
△ Less
Submitted 23 May, 2024; v1 submitted 18 April, 2024;
originally announced April 2024.
-
$L^p$ weighted Fourier restriction estimates
Authors:
Xiumin Du,
Jianhui Li,
Hong Wang,
Ruixiang Zhang
Abstract:
We obtain some sharp $L^p$ weighted Fourier restriction estimates of the form $\|Ef\|_{L^p(B^{n+1}(0,R),Hdx)} \lessapprox R^β\|f\|_2$, where $E$ is the Fourier extension operator over the truncated paraboloid, and $H$ is a weight function on $\mathbb R^{n+1}$ which is $n$-dimensional up to scale $\sqrt R$.
We obtain some sharp $L^p$ weighted Fourier restriction estimates of the form $\|Ef\|_{L^p(B^{n+1}(0,R),Hdx)} \lessapprox R^β\|f\|_2$, where $E$ is the Fourier extension operator over the truncated paraboloid, and $H$ is a weight function on $\mathbb R^{n+1}$ which is $n$-dimensional up to scale $\sqrt R$.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
Asynchronous Heterogeneous Linear Quadratic Regulator Design
Authors:
Leonardo F. Toso,
Han Wang,
James Anderson
Abstract:
We address the problem of designing an LQR controller in a distributed setting, where M similar but not identical systems share their locally computed policy gradient (PG) estimates with a server that aggregates the estimates and computes a controller that, on average, performs well on all systems. Learning in a distributed setting has the potential to offer statistical benefits - multiple dataset…
▽ More
We address the problem of designing an LQR controller in a distributed setting, where M similar but not identical systems share their locally computed policy gradient (PG) estimates with a server that aggregates the estimates and computes a controller that, on average, performs well on all systems. Learning in a distributed setting has the potential to offer statistical benefits - multiple datasets can be leveraged simultaneously to produce more accurate policy gradient estimates. However, the interplay of heterogeneous trajectory data and varying levels of local computational power introduce bias to the aggregated PG descent direction, and prevents us from fully exploiting the parallelism in the distributed computation. The latter stems from synchronous aggregation, where straggler systems negatively impact the runtime. To address this, we propose an asynchronous policy gradient algorithm for LQR control design. By carefully controlling the "staleness" in the asynchronous aggregation, we show that the designed controller converges to each system's $ε$-near optimal controller up to a heterogeneity bias. Furthermore, we prove that our asynchronous approach obtains exact local convergence at a sub-linear rate.
△ Less
Submitted 13 April, 2024;
originally announced April 2024.
-
Early detection of disease outbreaks and non-outbreaks using incidence data
Authors:
Shan Gao,
Amit K. Chakraborty,
Russell Greiner,
Mark A. Lewis,
Hao Wang
Abstract:
Forecasting the occurrence and absence of novel disease outbreaks is essential for disease management. Here, we develop a general model, with no real-world training data, that accurately forecasts outbreaks and non-outbreaks. We propose a novel framework, using a feature-based time series classification method to forecast outbreaks and non-outbreaks. We tested our methods on synthetic data from a…
▽ More
Forecasting the occurrence and absence of novel disease outbreaks is essential for disease management. Here, we develop a general model, with no real-world training data, that accurately forecasts outbreaks and non-outbreaks. We propose a novel framework, using a feature-based time series classification method to forecast outbreaks and non-outbreaks. We tested our methods on synthetic data from a Susceptible-Infected-Recovered model for slowly changing, noisy disease dynamics. Outbreak sequences give a transcritical bifurcation within a specified future time window, whereas non-outbreak (null bifurcation) sequences do not. We identified incipient differences in time series of infectives leading to future outbreaks and non-outbreaks. These differences are reflected in 22 statistical features and 5 early warning signal indicators. Classifier performance, given by the area under the receiver-operating curve, ranged from 0.99 for large expanding windows of training data to 0.7 for small rolling windows. Real-world performances of classifiers were tested on two empirical datasets, COVID-19 data from Singapore and SARS data from Hong Kong, with two classifiers exhibiting high accuracy. In summary, we showed that there are statistical features that distinguish outbreak and non-outbreak sequences long before outbreaks occur. We could detect these differences in synthetic and real-world data sets, well before potential outbreaks occur.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Explicit Witt basis over the tensor product of Clifford algebras and octonions
Authors:
Yong Li,
Guangbin Ren,
Haiyan Wang
Abstract:
In this article, we investigate how the Witt basis serves as a link between real and complex variables in higher-dimensional spaces. Our focus is on the detailed construction of the Witt basis within the tensor product space combining Clifford algebra and multiple octonionic spaces. This construction effectively introduces complex coordinates. The technique is based on a specific subgroup of octon…
▽ More
In this article, we investigate how the Witt basis serves as a link between real and complex variables in higher-dimensional spaces. Our focus is on the detailed construction of the Witt basis within the tensor product space combining Clifford algebra and multiple octonionic spaces. This construction effectively introduces complex coordinates. The technique is based on a specific subgroup of octonionic automorphisms, distinguished by binary codes. This method allows us to perform a Hermitian analysis of the complex structures within the tensor product space.
△ Less
Submitted 4 April, 2024;
originally announced April 2024.
-
A Statistical Framework of Watermarks for Large Language Models: Pivot, Detection Efficiency and Optimal Rules
Authors:
Xiang Li,
Feng Ruan,
Huiyuan Wang,
Qi Long,
Weijie J. Su
Abstract:
Since ChatGPT was introduced in November 2022, embedding (nearly) unnoticeable statistical signals into text generated by large language models (LLMs), also known as watermarking, has been used as a principled approach to provable detection of LLM-generated text from its human-written counterpart. In this paper, we introduce a general and flexible framework for reasoning about the statistical effi…
▽ More
Since ChatGPT was introduced in November 2022, embedding (nearly) unnoticeable statistical signals into text generated by large language models (LLMs), also known as watermarking, has been used as a principled approach to provable detection of LLM-generated text from its human-written counterpart. In this paper, we introduce a general and flexible framework for reasoning about the statistical efficiency of watermarks and designing powerful detection rules. Inspired by the hypothesis testing formulation of watermark detection, our framework starts by selecting a pivotal statistic of the text and a secret key -- provided by the LLM to the verifier -- to enable controlling the false positive rate (the error of mistakenly detecting human-written text as LLM-generated). Next, this framework allows one to evaluate the power of watermark detection rules by obtaining a closed-form expression of the asymptotic false negative rate (the error of incorrectly classifying LLM-generated text as human-written). Our framework further reduces the problem of determining the optimal detection rule to solving a minimax optimization program. We apply this framework to two representative watermarks -- one of which has been internally implemented at OpenAI -- and obtain several findings that can be instrumental in guiding the practice of implementing watermarks. In particular, we derive optimal detection rules for these watermarks under our framework. These theoretically derived detection rules are demonstrated to be competitive and sometimes enjoy a higher power than existing detection approaches through numerical experiments.
△ Less
Submitted 1 April, 2024;
originally announced April 2024.
-
Complex generalized Gauss-Radau quadrature rules for Hankel transforms of integer order
Authors:
Haiyong Wang,
Menghan Wu
Abstract:
Complex Gaussian quadrature rules for oscillatory integral transforms have the advantage that they can achieve optimal asymptotic order. However, their existence for Hankel transform can only be guaranteed when the order of the transform belongs to $[0,1/2]$. In this paper we consider the construction of generalized Gauss-Radau quadrature rules for Hankel transform. We show that, if adding certain…
▽ More
Complex Gaussian quadrature rules for oscillatory integral transforms have the advantage that they can achieve optimal asymptotic order. However, their existence for Hankel transform can only be guaranteed when the order of the transform belongs to $[0,1/2]$. In this paper we consider the construction of generalized Gauss-Radau quadrature rules for Hankel transform. We show that, if adding certain value and derivative information at the left endpoint, then complex generalized Gauss-Radau quadrature rules for Hankel transform of integer order can be constructed with theoretical guarantees. Orthogonal polynomials that are closely related to such quadrature rules are investigated and their existence for even degrees is proved. Numerical experiments are presented to confirm our findings.
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
Online Prediction for Streaming Tensor Time Series
Authors:
Zhenting Luan,
Haoning Wang,
Li** Zhang,
Shansuo Liang,
Wei Han
Abstract:
Real-time prediction plays a vital role in various control systems, such as traffic congestion control and wireless channel resource allocation. In these scenarios, the predictor usually needs to track the evolution of the latent statistical patterns in the modern high-dimensional streaming time series continuously and quickly, which presents new challenges for traditional prediction methods. This…
▽ More
Real-time prediction plays a vital role in various control systems, such as traffic congestion control and wireless channel resource allocation. In these scenarios, the predictor usually needs to track the evolution of the latent statistical patterns in the modern high-dimensional streaming time series continuously and quickly, which presents new challenges for traditional prediction methods. This paper proposes a novel algorithm based on tensor factorization to predict streaming tensor time series online. The proposed algorithm updates the predictor in a low-complexity online manner to adapt to the time-evolving data. Additionally, an automatically adaptive version of the algorithm is presented to mitigate the negative impact of stale data. Simulation results demonstrate that our proposed methods achieve prediction accuracy similar to that of conventional offline tensor prediction methods, while being much faster than them during long-term online prediction. Therefore, our proposed algorithm provides an effective and efficient solution for the online prediction of streaming tensor time series.
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
Anderson acceleration of derivative-free projection methods for constrained monotone nonlinear equations
Authors:
Jiachen **,
Hongxia Wang,
Kangkang Deng
Abstract:
The derivative-free projection method (DFPM) is an efficient algorithm for solving monotone nonlinear equations. As problems grow larger, there is a strong demand for speeding up the convergence of DFPM. This paper considers the application of Anderson acceleration (AA) to DFPM for constrained monotone nonlinear equations. By employing a nonstationary relaxation parameter and interleaving with sli…
▽ More
The derivative-free projection method (DFPM) is an efficient algorithm for solving monotone nonlinear equations. As problems grow larger, there is a strong demand for speeding up the convergence of DFPM. This paper considers the application of Anderson acceleration (AA) to DFPM for constrained monotone nonlinear equations. By employing a nonstationary relaxation parameter and interleaving with slight modifications in each iteration, a globally convergent variant of AA for DFPM named as AA-DFPM is proposed. Further, the linear convergence rate is proved under some mild assumptions. Experiments on both mathematical examples and a real-world application show encouraging results of AA-DFPM and confirm the suitability of AA for accelerating DFPM in solving optimization problems.
△ Less
Submitted 21 March, 2024;
originally announced March 2024.
-
On Intermediate Exceptional Series
Authors:
Kimyeong Lee,
Kaiwen Sun,
Haowu Wang
Abstract:
The Freudenthal--Tits magic square $\mathfrak{m}(\mathbb{A}_1,\mathbb{A}_2)$ for $\mathbb{A}=\mathbb{R},\mathbb{C},\mathbb{H},\mathbb{O}$ of semi-simple Lie algebras can be extended by including the sextonions $\mathbb{S}$. A series of non-reductive Lie algebras naturally appear in the new row associated with the sextonions, which we will call the \textit{intermediate exceptional series}, with the…
▽ More
The Freudenthal--Tits magic square $\mathfrak{m}(\mathbb{A}_1,\mathbb{A}_2)$ for $\mathbb{A}=\mathbb{R},\mathbb{C},\mathbb{H},\mathbb{O}$ of semi-simple Lie algebras can be extended by including the sextonions $\mathbb{S}$. A series of non-reductive Lie algebras naturally appear in the new row associated with the sextonions, which we will call the \textit{intermediate exceptional series}, with the largest one as the intermediate Lie algebra $E_{7+1/2}$ constructed by Landsberg--Manivel. We study various aspects of the intermediate vertex operator (super)algebras associated with the intermediate exceptional series, including rationality, coset constructions, irreducible modules, (super)characters and modular linear differential equations. For all $\mathfrak{g}_I$ belonging to the intermediate exceptional series, the intermediate VOA $L_1(\mathfrak{g}_I)$ has characters of irreducible modules coinciding with those of the simple rational $C_2$-cofinite $W$-algebra $W_{-h^\vee/6}(\mathfrak{g},f_θ)$ studied by Kawasetsu, with $\mathfrak{g} $ belonging to the Cvitanović--Deligne exceptional series. We propose some new intermediate VOA $L_k(\mathfrak{g}_I)$ with integer level $k$ and investigate their properties. For example, for the intermediate Lie algebra $D_{6+1/2}$ between $D_6$ and $E_7$ in the subexceptional series and also in Vogel's projective plane, we find that the intermediate VOA $L_2(D_{6+1/2})$ has a simple current extension to a SVOA with four irreducible Neveu--Schwarz modules. We also provide some (super) coset constructions such as $L_2(E_7)/L_2(D_{6+1/2})$ and $L_1(D_{6+1/2})^{\otimes2}\!/L_2(D_{6+1/2})$. In the end, we find that the theta blocks associated with the intermediate exceptional series produce some new holomorphic Jacobi forms of critical weight and lattice index.
△ Less
Submitted 21 March, 2024;
originally announced March 2024.
-
Addressing complex boundary conditions of miscible flow and transport in two and three dimensions with application to optimal control
Authors:
Yiqun Li,
Hong Wang,
Xiangcheng Zheng
Abstract:
We investigate complex boundary conditions of the miscible displacement system in two and three space dimensions with the commonly-used Bear-Scheidegger diffusion-dispersion tensor, which describes, e.g., the porous medium flow processes in petroleum reservoir simulation or groundwater contaminant transport. Specifically, we incorporate the no-flux boundary condition for the Darcy velocity to prov…
▽ More
We investigate complex boundary conditions of the miscible displacement system in two and three space dimensions with the commonly-used Bear-Scheidegger diffusion-dispersion tensor, which describes, e.g., the porous medium flow processes in petroleum reservoir simulation or groundwater contaminant transport. Specifically, we incorporate the no-flux boundary condition for the Darcy velocity to prove that the general no-flux boundary condition for the transport equation is equivalent to the normal derivative boundary condition of the concentration, based on which we further prove several complex boundary conditions by the Bear-Scheidegger tensor and its derivative. The derived boundary conditions not only provide new insights and distinct properties of the Bear-Scheidegger diffusion-dispersion tensor, but accommodate the coupling and the nonlinearity of the miscible displacement system and the Bear-Scheidegger tensor in deriving the first-order optimality condition of the corresponding optimal control problem for practical application.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
Network-Aware Value Stacking of Community Battery via Asynchronous Distributed Optimization
Authors:
Canchen Jiang,
Hao Wang
Abstract:
Community battery systems have been widely deployed to provide services to the grid. Unlike a single battery storage system in the community, coordinating multiple community batteries can further unlock their value, enhancing the viability of community battery solutions. However, the centralized control of community batteries relies on the full information of the system, which is less practical an…
▽ More
Community battery systems have been widely deployed to provide services to the grid. Unlike a single battery storage system in the community, coordinating multiple community batteries can further unlock their value, enhancing the viability of community battery solutions. However, the centralized control of community batteries relies on the full information of the system, which is less practical and may even lead to privacy leakage. In this paper, we formulate a value-stacking optimization problem for community batteries to interact with local solar, buildings, and the grid, within distribution network constraints. We then propose a distributed algorithm using asynchronous distributed alternate direction method of multipliers (ADMM) to solve the problem. Our algorithm is robust to communication latency between community batteries and the grid while preserving the operational privacy. The simulation results demonstrate the convergence of our proposed asynchronous distributed ADMM algorithm. We also evaluate the electricity cost and the contribution of each value stream in the value-stacking problem for community batteries using real-world data.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
Safety-Aware Reinforcement Learning for Electric Vehicle Charging Station Management in Distribution Network
Authors:
Jiarong Fan,
Ariel Liebman,
Hao Wang
Abstract:
The increasing integration of electric vehicles (EVs) into the grid can pose a significant risk to the distribution system operation in the absence of coordination. In response to the need for effective coordination of EVs within the distribution network, this paper presents a safety-aware reinforcement learning (RL) algorithm designed to manage EV charging stations while ensuring the satisfaction…
▽ More
The increasing integration of electric vehicles (EVs) into the grid can pose a significant risk to the distribution system operation in the absence of coordination. In response to the need for effective coordination of EVs within the distribution network, this paper presents a safety-aware reinforcement learning (RL) algorithm designed to manage EV charging stations while ensuring the satisfaction of system constraints. Unlike existing methods, our proposed algorithm does not rely on explicit penalties for constraint violations, eliminating the need for penalty coefficient tuning. Furthermore, managing EV charging stations is further complicated by multiple uncertainties, notably the variability in solar energy generation and energy prices. To address this challenge, we develop an off-policy RL algorithm to efficiently utilize data to learn patterns in such uncertain environments. Our algorithm also incorporates a maximum entropy framework to enhance the RL algorithm's exploratory process, preventing convergence to local optimal solutions. Simulation results demonstrate that our algorithm outperforms traditional RL algorithms in managing EV charging in the distribution network.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
Sample Complexity of Offline Distributionally Robust Linear Markov Decision Processes
Authors:
He Wang,
Laixi Shi,
Yuejie Chi
Abstract:
In offline reinforcement learning (RL), the absence of active exploration calls for attention on the model robustness to tackle the sim-to-real gap, where the discrepancy between the simulated and deployed environments can significantly undermine the performance of the learned policy. To endow the learned policy with robustness in a sample-efficient manner in the presence of high-dimensional state…
▽ More
In offline reinforcement learning (RL), the absence of active exploration calls for attention on the model robustness to tackle the sim-to-real gap, where the discrepancy between the simulated and deployed environments can significantly undermine the performance of the learned policy. To endow the learned policy with robustness in a sample-efficient manner in the presence of high-dimensional state-action space, this paper considers the sample complexity of distributionally robust linear Markov decision processes (MDPs) with an uncertainty set characterized by the total variation distance using offline data. We develop a pessimistic model-based algorithm and establish its sample complexity bound under minimal data coverage assumptions, which outperforms prior art by at least $\widetilde{O}(d)$, where $d$ is the feature dimension. We further improve the performance guarantee of the proposed algorithm by incorporating a carefully-designed variance estimator.
△ Less
Submitted 26 June, 2024; v1 submitted 19 March, 2024;
originally announced March 2024.
-
Convex Co-Design of Control Barrier Function and Safe Feedback Controller Under Input Constraints
Authors:
Han Wang,
Kostas Margellos,
Antonis Papachristodoulou,
Claudio De Persis
Abstract:
We study the problem of co-designing control barrier functions (CBF) and linear state feedback controllers for continuous-time linear systems. We achieve this by means of a single semi-definite optimization program. Our formulation can handle mixed-relative degree problems without requiring an explicit safe controller. Different L-norm based input limitations can be introduced as convex constraint…
▽ More
We study the problem of co-designing control barrier functions (CBF) and linear state feedback controllers for continuous-time linear systems. We achieve this by means of a single semi-definite optimization program. Our formulation can handle mixed-relative degree problems without requiring an explicit safe controller. Different L-norm based input limitations can be introduced as convex constraints in the proposed program. We demonstrate our results on an omni-directional car numerical example.
△ Less
Submitted 18 March, 2024;
originally announced March 2024.
-
Quasi-Monte Carlo and importance sampling methods for Bayesian inverse problems
Authors:
Zhijian He,
He** Wang,
Xiaoqun Wang
Abstract:
Importance Sampling (IS), an effective variance reduction strategy in Monte Carlo (MC) simulation, is frequently utilized for Bayesian inference and other statistical challenges. Quasi-Monte Carlo (QMC) replaces the random samples in MC with low discrepancy points and has the potential to substantially enhance error rates. In this paper, we integrate IS with a randomly shifted rank-1 lattice rule,…
▽ More
Importance Sampling (IS), an effective variance reduction strategy in Monte Carlo (MC) simulation, is frequently utilized for Bayesian inference and other statistical challenges. Quasi-Monte Carlo (QMC) replaces the random samples in MC with low discrepancy points and has the potential to substantially enhance error rates. In this paper, we integrate IS with a randomly shifted rank-1 lattice rule, a widely used QMC method, to approximate posterior expectations arising from Bayesian Inverse Problems (BIPs) where the posterior density tends to concentrate as the intensity of noise diminishes. Within the framework of weighted Hilbert spaces, we first establish the convergence rate of the lattice rule for a large class of unbounded integrands. This method extends to the analysis of QMC combined with IS in BIPs. Furthermore, we explore the robustness of the IS-based randomly shifted rank-1 lattice rule by determining the quadrature error rate with respect to the noise level. The effects of using Gaussian distributions and $t$-distributions as the proposal distributions on the error rate of QMC are comprehensively investigated. We find that the error rate may deteriorate at low intensity of noise when using improper proposals, such as the prior distribution. To reclaim the effectiveness of QMC, we propose a new IS method such that the lattice rule with $N$ quadrature points achieves an optimal error rate close to $O(N^{-1})$, which is insensitive to the noise level. Numerical experiments are conducted to support the theoretical results.
△ Less
Submitted 17 March, 2024;
originally announced March 2024.
-
From habitat decline to collapse: a spatially explicit approach connecting habitat degradation to destruction
Authors:
Yurij Salmaniw,
Zhongwei Shen,
Hao Wang
Abstract:
Habitat loss, through degradation and destruction of viable habitat, has a well-documented and undeniable impact on the sustainability of ecosystems [Diaz et al. (2019), Pimm et al. (2014), Pimm et al. (2000)]. Moreover, most habitat loss is anthropogenic [Jacobson et al. (2019)]. Understanding the relationships between varying degrees of habitat degradation, movement strategies, and population dy…
▽ More
Habitat loss, through degradation and destruction of viable habitat, has a well-documented and undeniable impact on the sustainability of ecosystems [Diaz et al. (2019), Pimm et al. (2014), Pimm et al. (2000)]. Moreover, most habitat loss is anthropogenic [Jacobson et al. (2019)]. Understanding the relationships between varying degrees of habitat degradation, movement strategies, and population dynamics on species persistence is crucial. We establish a robust connection between habitat degradation and destruction using a reaction-diffusion equation framework. Motivated by the recent work [Salmaniw et al. (2022)], we consider an intrinsic growth rate function that features a logistic-type growth in the viable habitat but decays at rate $c\geq0$ in the degraded region(s). In the limit as $c \to \infty$, the solution to the habitat degradation problem converges uniformly to the solution of a related habitat destruction problem. When the habitat destruction problem predicts deterministic extinction, a unique value $c_0$ exists, the extinction threshold, from which any further habitat degradation leads to deterministic extinction. This extinction threshold can be bounded below by a constant depending on the size of the degraded region and the habitat quality in the undisturbed region. To show these results, we investigate the eigenvalue problems related to each model formulation and establish a convergence result between the principal eigenvalues and their associated eigenfunctions, providing a precise analytical connection between habitat degradation and destruction in a general setting applicable to any species adopting diffusive movement strategies.
△ Less
Submitted 17 March, 2024;
originally announced March 2024.
-
A Selective Review on Statistical Methods for Massive Data Computation: Distributed Computing, Subsampling, and Minibatch Techniques
Authors:
Xuetong Li,
Yuan Gao,
Hong Chang,
Danyang Huang,
Yingying Ma,
Rui Pan,
Haobo Qi,
Feifei Wang,
Shuyuan Wu,
Ke Xu,
**g Zhou,
Xuening Zhu,
Yingqiu Zhu,
Hansheng Wang
Abstract:
This paper presents a selective review of statistical computation methods for massive data analysis. A huge amount of statistical methods for massive data computation have been rapidly developed in the past decades. In this work, we focus on three categories of statistical computation methods: (1) distributed computing, (2) subsampling methods, and (3) minibatch gradient techniques. The first clas…
▽ More
This paper presents a selective review of statistical computation methods for massive data analysis. A huge amount of statistical methods for massive data computation have been rapidly developed in the past decades. In this work, we focus on three categories of statistical computation methods: (1) distributed computing, (2) subsampling methods, and (3) minibatch gradient techniques. The first class of literature is about distributed computing and focuses on the situation, where the dataset size is too huge to be comfortably handled by one single computer. In this case, a distributed computation system with multiple computers has to be utilized. The second class of literature is about subsampling methods and concerns about the situation, where the sample size of dataset is small enough to be placed on one single computer but too large to be easily processed by its memory as a whole. The last class of literature studies those minibatch gradient related optimization techniques, which have been extensively used for optimizing various deep learning models.
△ Less
Submitted 17 March, 2024;
originally announced March 2024.
-
Superlinear Optimization Algorithms
Authors:
Hongxia Wang,
Yeming Xu,
Ziyuan Guo,
Huanshui Zhang
Abstract:
This paper proposes several novel optimization algorithms for minimizing a nonlinear objective function. The algorithms are enlightened by the optimal state trajectory of an optimal control problem closely related to the minimized objective function. They are superlinear convergent when appropriate parameters are selected as required. Unlike Newton's method, all of them can be also applied in the…
▽ More
This paper proposes several novel optimization algorithms for minimizing a nonlinear objective function. The algorithms are enlightened by the optimal state trajectory of an optimal control problem closely related to the minimized objective function. They are superlinear convergent when appropriate parameters are selected as required. Unlike Newton's method, all of them can be also applied in the case of a singular Hessian matrix. More importantly, by reduction, some of them avoid calculating the inverse of the Hessian matrix or an identical dimension matrix and some of them need only the diagonal elements of the Hessian matrix. In these cases, these algorithms still outperform the gradient descent method. The merits of the proposed optimization algorithm are illustrated by numerical experiments.
△ Less
Submitted 17 March, 2024;
originally announced March 2024.
-
A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Authors:
Yudong Luo,
Yangchen Pan,
Han Wang,
Philip Torr,
Pascal Poupart
Abstract:
Reinforcement learning algorithms utilizing policy gradients (PG) to optimize Conditional Value at Risk (CVaR) face significant challenges with sample inefficiency, hindering their practical applications. This inefficiency stems from two main facts: a focus on tail-end performance that overlooks many sampled trajectories, and the potential of gradient vanishing when the lower tail of the return di…
▽ More
Reinforcement learning algorithms utilizing policy gradients (PG) to optimize Conditional Value at Risk (CVaR) face significant challenges with sample inefficiency, hindering their practical applications. This inefficiency stems from two main facts: a focus on tail-end performance that overlooks many sampled trajectories, and the potential of gradient vanishing when the lower tail of the return distribution is overly flat. To address these challenges, we propose a simple mixture policy parameterization. This method integrates a risk-neutral policy with an adjustable policy to form a risk-averse policy. By employing this strategy, all collected trajectories can be utilized for policy updating, and the issue of vanishing gradients is counteracted by stimulating higher returns through the risk-neutral component, thus lifting the tail and preventing flatness. Our empirical study reveals that this mixture parameterization is uniquely effective across a variety of benchmark domains. Specifically, it excels in identifying risk-averse CVaR policies in some Mujoco environments where the traditional CVaR-PG fails to learn a reasonable policy.
△ Less
Submitted 28 June, 2024; v1 submitted 16 March, 2024;
originally announced March 2024.
-
Hairer-Quastel universality for KPZ -- polynomial smoothing mechanisms, general nonlinearities and Poisson noise
Authors:
Fanhao Kong,
Haiyi Wang,
Weijun Xu
Abstract:
We consider a class of weakly asymmetric continuous microscopic growth models with polynomial smoothing mechanisms, general nonlinearities and a Poisson type noise. We show that they converge to the KPZ equation after proper rescaling and re-centering, where the coupling constant depends nontrivially on all details of the smoothing and growth mechanisms in the microscopic model. This confirms some…
▽ More
We consider a class of weakly asymmetric continuous microscopic growth models with polynomial smoothing mechanisms, general nonlinearities and a Poisson type noise. We show that they converge to the KPZ equation after proper rescaling and re-centering, where the coupling constant depends nontrivially on all details of the smoothing and growth mechanisms in the microscopic model. This confirms some of the predictions in [HQ18].
The proof builds on the general discretisation framework of regularity structures ([EH19]), and employs the idea of using the spectral gap inequality to control stochastic objects as developed and systematised in [LOTT21, HS24], together with a new observation on structures of the Malliavin derivatives in our situation.
△ Less
Submitted 10 March, 2024;
originally announced March 2024.
-
Low-rank Tensor Autoregressive Predictor for Third-Order Time-Series Forecasting
Authors:
Haoning Wang,
Li** Zhang,
Shengbo Eben Li
Abstract:
Recently, tensor time-series forecasting has gained increasing attention, whose core requirement is how to perform dimensionality reduction. Among all multidimensional data, third-order tensor is the most prevalent structure in real-world scenarios, such as RGB images and network traffic data. Previous studies in this field are mainly based on tensor Tucker decomposition and such methods have limi…
▽ More
Recently, tensor time-series forecasting has gained increasing attention, whose core requirement is how to perform dimensionality reduction. Among all multidimensional data, third-order tensor is the most prevalent structure in real-world scenarios, such as RGB images and network traffic data. Previous studies in this field are mainly based on tensor Tucker decomposition and such methods have limitations in terms of computational cost, with iteration complexity of approximately $O(2n^3r)$, where $n$ and $r$ are the dimension and rank of original tensor data. Moreover, many real-world data does not exhibit the low-rank property under Tucker decomposition, which may fail the dimensionality reduction. In this paper, we pioneer the application of tensor singular value decomposition (t-SVD) to third-order time-series, which builds an efficient forecasting algorithm, called Low-rank Tensor Autoregressive Predictor (LOTAP). We observe that tensor tubal rank in t-SVD is always less than Tucker rank, which leads to great benefit in computational complexity. By combining it with the autoregressive (AR) model, the forecasting problem is formulated as a least squares optimization. We divide such an optimization problem by fast Fourier transformation into four decoupled subproblems, whose variables include regressive coefficient, f-diagonal tensor, left and right orthogonal tensors. The alternating minimization algorithm is proposed with iteration complexity of about $O(n^3 + n^2r^2)$, in which each subproblem has a closed-form solution. Numerical experiments show that, compared to Tucker-decomposition-based algorithms, LOTAP achieves a speed improvement ranging from 2 to 6 times while maintaining accurate forecasting performance in all four baseline tasks. In addition, LOTAP is applicable to a wider range of tensor forecasting tasks due to its more effective dimensionality reduction ability.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
Newton Polyhedrons and Hodge Numbers of Non-degenerate Laurent Polynomials
Authors:
Haoxu Wang
Abstract:
Claude Sabbah has defined the Fourier transform $G$ of the Gauss-Manin system for a non-degenerate and convenient Laurent polynomial and has shown that there exists a polarized mixed Hodge structure on the vanishing cycle of $G$. In this article, we consider certain non-degenerate and convenient Laurent polynomials $f_{P,\mathbf{a}}$, whose Newton polyhedron at infinity is a simplicial polytope…
▽ More
Claude Sabbah has defined the Fourier transform $G$ of the Gauss-Manin system for a non-degenerate and convenient Laurent polynomial and has shown that there exists a polarized mixed Hodge structure on the vanishing cycle of $G$. In this article, we consider certain non-degenerate and convenient Laurent polynomials $f_{P,\mathbf{a}}$, whose Newton polyhedron at infinity is a simplicial polytope $P$. First, we consider the stacky fan $\boldsymbolΣ_P$ given by $P$ and show that for each quotient stacky fan of $\boldsymbolΣ_P$, there is a natural polarized mixed Hodge structure on the ring of conewise polynomial functions on it. Then, we describe the polarized mixed Hodge structure on the vanishing cycle associated to $f_{P,\mathbf{a}}$ using these rings of conewise polynomial functions. In particular, we compute the Hodge diamond of the vanishing cycle. As a further consequence, we can solve the Birkhoff problem of such a Laurent polynomial by using elementary methods.
△ Less
Submitted 5 March, 2024; v1 submitted 4 March, 2024;
originally announced March 2024.
-
Weighted least $\ell_p$ approximation on compact Riemannian manifolds
Authors:
Jiansong Li,
Yun Ling,
Jiaxin Geng,
He** Wang
Abstract:
Given a sequence of Marcinkiewicz-Zygmund inequalities in $L_2$ on a compact space, Gröchenig in \cite{G} discussed weighted least squares approximation and least squares quadrature. Inspired by this work, for all $1\le p\le\infty$, we develop weighted least $\ell_p$ approximation induced by a sequence of Marcinkiewicz-Zygmund inequalities in $L_p$ on a compact smooth Riemannian manifold $\Bbb M$…
▽ More
Given a sequence of Marcinkiewicz-Zygmund inequalities in $L_2$ on a compact space, Gröchenig in \cite{G} discussed weighted least squares approximation and least squares quadrature. Inspired by this work, for all $1\le p\le\infty$, we develop weighted least $\ell_p$ approximation induced by a sequence of Marcinkiewicz-Zygmund inequalities in $L_p$ on a compact smooth Riemannian manifold $\Bbb M$ with normalized Riemannian measure (typical examples are the torus and the sphere). In this paper we derive corresponding approximation theorems with the error measured in $L_q,\,1\le q\le\infty$, and least quadrature errors for both Sobolev spaces $H_p^r(\Bbb M), \, r>d/p$ generated by eigenfunctions associated with the Laplace-Beltrami operator and Besov spaces $B_{p,τ}^r(\Bbb M),\, 0<τ\le \infty, r>d/p $ defined by best polynomial approximation. Finally, we discuss the optimality of the obtained results by giving sharp estimates of sampling numbers and optimal quadrature errors for the aforementioned spaces.
△ Less
Submitted 29 February, 2024;
originally announced February 2024.
-
Temporal-Aware Deep Reinforcement Learning for Energy Storage Bidding in Energy and Contingency Reserve Markets
Authors:
**hao Li,
Changlong Wang,
Yanru Zhang,
Hao Wang
Abstract:
The battery energy storage system (BESS) has immense potential for enhancing grid reliability and security through its participation in the electricity market. BESS often seeks various revenue streams by taking part in multiple markets to unlock its full potential, but effective algorithms for joint-market participation under price uncertainties are insufficiently explored in the existing research…
▽ More
The battery energy storage system (BESS) has immense potential for enhancing grid reliability and security through its participation in the electricity market. BESS often seeks various revenue streams by taking part in multiple markets to unlock its full potential, but effective algorithms for joint-market participation under price uncertainties are insufficiently explored in the existing research. To bridge this gap, we develop a novel BESS joint bidding strategy that utilizes deep reinforcement learning (DRL) to bid in the spot and contingency frequency control ancillary services (FCAS) markets. Our approach leverages a transformer-based temporal feature extractor to effectively respond to price fluctuations in seven markets simultaneously and helps DRL learn the best BESS bidding strategy in joint-market participation. Additionally, unlike conventional "black-box" DRL model, our approach is more interpretable and provides valuable insights into the temporal bidding behavior of BESS in the dynamic electricity market. We validate our method using realistic market prices from the Australian National Electricity Market. The results show that our strategy outperforms benchmarks, including both optimization-based and other DRL-based strategies, by substantial margins. Our findings further suggest that effective temporal-aware bidding can significantly increase profits in the spot and contingency FCAS markets compared to individual market participation.
△ Less
Submitted 29 February, 2024;
originally announced February 2024.
-
Sparse gradient bounds for divergence form elliptic equations
Authors:
Olli Saari,
Hua-Yang Wang,
Yuanhong Wei
Abstract:
We provide sparse estimates for gradients of solutions to divergence form elliptic partial differential equations in terms of the source data. We give a general result of Meyers (or Gehring) type, a result for linear equations with VMO coefficients and a result for linear equations with Dini continuous coefficients. In addition, we provide an abstract theorem conditional on PDE estimates available…
▽ More
We provide sparse estimates for gradients of solutions to divergence form elliptic partial differential equations in terms of the source data. We give a general result of Meyers (or Gehring) type, a result for linear equations with VMO coefficients and a result for linear equations with Dini continuous coefficients. In addition, we provide an abstract theorem conditional on PDE estimates available. The linear results have the full range of weighted estimates with Muckenhoupt weights as a consequence.
△ Less
Submitted 23 May, 2024; v1 submitted 25 February, 2024;
originally announced February 2024.
-
Harish-Chandra Theorem for Two-parameter Quantum Groups
Authors:
Naihong Hu,
Hengyi Wang
Abstract:
The centre of two-parameter quantum groups $U_{r,s}(\mathfrak{g})$ is determined through the Harish-Chandra homomorphism. Based on the Rosso form and the representation theory of weight modules, we prove that when rank $\mathfrak{g}$ is even, the Harish-Chandra homomorphism is an isomorphism, and in particular, the centre of the quantum group $\breve{U}_{r,s}(\mathfrak{g})$ of the weight lattice t…
▽ More
The centre of two-parameter quantum groups $U_{r,s}(\mathfrak{g})$ is determined through the Harish-Chandra homomorphism. Based on the Rosso form and the representation theory of weight modules, we prove that when rank $\mathfrak{g}$ is even, the Harish-Chandra homomorphism is an isomorphism, and in particular, the centre of the quantum group $\breve{U}_{r,s}(\mathfrak{g})$ of the weight lattice type is a polynomial algebra $\mathbb{K}[z_{\varpi_1},\cdots,z_{\varpi_n}]$, where canonical central elements $z_λ\; (λ\in Λ^+)$ are turned out to be uniformly expressed. For rank $\mathfrak{g}$ to be odd, we figure out a new invertible extra central generator $z_*$, which doesn't survive in $U_q(\mathfrak g)$, and we get a larger centre containing $\mathbb{K}[z_{\varpi_1},\cdots,z_{\varpi_n}]\otimes_\mathbb K\mathbb K[z_*, z_*^{-1}]$.
△ Less
Submitted 23 March, 2024; v1 submitted 20 February, 2024;
originally announced February 2024.