-
Invertibility of Discrete-Time Linear Systems with Sparse Inputs
Authors:
Kyle Poe,
Enrique Mallada,
Rene Vidal
Abstract:
One of the fundamental problems of interest for discrete-time linear systems is whether its input sequence may be recovered given its output sequence, a.k.a. the left inversion problem. Many conditions on the state space geometry, dynamics, and spectral structure of a system have been used to characterize the well-posedness of this problem, without assumptions on the inputs. However, certain struc…
▽ More
One of the fundamental problems of interest for discrete-time linear systems is whether its input sequence may be recovered given its output sequence, a.k.a. the left inversion problem. Many conditions on the state space geometry, dynamics, and spectral structure of a system have been used to characterize the well-posedness of this problem, without assumptions on the inputs. However, certain structural assumptions, such as input sparsity, have been shown to translate to practical gains in the performance of inversion algorithms, surpassing classical guarantees. Establishing necessary and sufficient conditions for left invertibility of systems with sparse inputs is therefore a crucial step toward understanding the performance limits of system inversion under structured input assumptions. In this work, we provide the first necessary and sufficient characterizations of left invertibility for linear systems with sparse inputs, echoing classic characterizations for standard linear systems. The key insight in deriving these results is in establishing the existence of two novel geometric invariants unique to the sparse-input setting, the weakly unobservable and strongly reachable subspace arrangements. By means of a concrete example, we demonstrate the utility of these characterizations. We conclude by discussing extensions and applications of this framework to several related problems in sparse control.
△ Less
Submitted 29 March, 2024;
originally announced March 2024.
-
Dissipative Gradient Descent Ascent Method: A Control Theory Inspired Algorithm for Min-max Optimization
Authors:
Tianqi Zheng,
Nicolas Loizou,
Pengcheng You,
Enrique Mallada
Abstract:
Gradient Descent Ascent (GDA) methods for min-max optimization problems typically produce oscillatory behavior that can lead to instability, e.g., in bilinear settings. To address this problem, we introduce a dissipation term into the GDA updates to dampen these oscillations. The proposed Dissipative GDA (DGDA) method can be seen as performing standard GDA on a state-augmented and regularized sadd…
▽ More
Gradient Descent Ascent (GDA) methods for min-max optimization problems typically produce oscillatory behavior that can lead to instability, e.g., in bilinear settings. To address this problem, we introduce a dissipation term into the GDA updates to dampen these oscillations. The proposed Dissipative GDA (DGDA) method can be seen as performing standard GDA on a state-augmented and regularized saddle function that does not strictly introduce additional convexity/concavity. We theoretically show the linear convergence of DGDA in the bilinear and strongly convex-strongly concave settings and assess its performance by comparing DGDA with other methods such as GDA, Extra-Gradient (EG), and Optimistic GDA. Our findings demonstrate that DGDA surpasses these methods, achieving superior convergence rates. We support our claims with two numerical examples that showcase DGDA's effectiveness in solving saddle point problems.
△ Less
Submitted 14 March, 2024;
originally announced March 2024.
-
A Market Mechanism for a Two-stage Settlement Electricity Market with Energy Storage
Authors:
Rajni Kant Bansal,
Enrique Mallada,
Patricia Hidalgo-Gonzalez
Abstract:
Electricity markets typically clear in two stages: a day-ahead market and a real-time market. In this paper, we propose market mechanisms for a two-stage multi-interval electricity market with energy storage, generators, and demand uncertainties. We consider two possible mixed bidding strategies: storage first bids cycle depths in the day ahead and then charge-discharge power bids in the real-time…
▽ More
Electricity markets typically clear in two stages: a day-ahead market and a real-time market. In this paper, we propose market mechanisms for a two-stage multi-interval electricity market with energy storage, generators, and demand uncertainties. We consider two possible mixed bidding strategies: storage first bids cycle depths in the day ahead and then charge-discharge power bids in the real-time market for any last-minute adjustments. While the first strategy only considers day-ahead decisions from an individual participant's perspective as part of their individual optimization formulation, the second strategy accounts for both the market operator's and participants' perspectives. We demonstrate that the competitive equilibrium exists uniquely for both mechanisms. However, accounting for the day-ahead decisions in the bidding function has several advantages. Numerical experiments using New York ISO data provide bounds on the proposed market mechanism.
△ Less
Submitted 7 March, 2024;
originally announced March 2024.
-
Learning safety critics via a non-contractive binary bellman operator
Authors:
Agustin Castellano,
Hancheng Min,
Juan Andrés Bazerque,
Enrique Mallada
Abstract:
The inability to naturally enforce safety in Reinforcement Learning (RL), with limited failures, is a core challenge impeding its use in real-world applications. One notion of safety of vast practical relevance is the ability to avoid (unsafe) regions of the state space. Though such a safety goal can be captured by an action-value-like function, a.k.a. safety critics, the associated operator lacks…
▽ More
The inability to naturally enforce safety in Reinforcement Learning (RL), with limited failures, is a core challenge impeding its use in real-world applications. One notion of safety of vast practical relevance is the ability to avoid (unsafe) regions of the state space. Though such a safety goal can be captured by an action-value-like function, a.k.a. safety critics, the associated operator lacks the desired contraction and uniqueness properties that the classical Bellman operator enjoys. In this work, we overcome the non-contractiveness of safety critic operators by leveraging that safety is a binary property. To that end, we study the properties of the binary safety critic associated with a deterministic dynamical system that seeks to avoid reaching an unsafe region. We formulate the corresponding binary Bellman equation (B2E) for safety and study its properties. While the resulting operator is still non-contractive, we fully characterize its fixed points representing--except for a spurious solution--maximal persistently safe regions of the state space that can always avoid failure. We provide an algorithm that, by design, leverages axiomatic knowledge of safe data to avoid spurious fixed points.
△ Less
Submitted 23 January, 2024;
originally announced January 2024.
-
Recurrence of Nonlinear Control Systems: Entropy and Bit Rates
Authors:
Hussein Sibai,
Enrique Mallada
Abstract:
In this paper, we introduce the notion of recurrence entropy in the context of nonlinear control systems. A set is said to be ($τ$-)recurrent if every trajectory that starts in the set returns to it (within at most $τ$ units of time). Recurrence entropy quantifies the complexity of making a set $τ$-recurrent measured by the average rate of growth, as time increases, of the number of control signal…
▽ More
In this paper, we introduce the notion of recurrence entropy in the context of nonlinear control systems. A set is said to be ($τ$-)recurrent if every trajectory that starts in the set returns to it (within at most $τ$ units of time). Recurrence entropy quantifies the complexity of making a set $τ$-recurrent measured by the average rate of growth, as time increases, of the number of control signals required to achieve this goal. Our analysis reveals that, compared to invariance, recurrence is quantitatively less complex, meaning that the recurrence entropy of a set is no larger than, and often strictly smaller than, the invariance entropy. Our results further offer insights into the minimum data rate required for achieving recurrence. We also present an algorithm for achieving recurrence asymptotically.
△ Less
Submitted 13 November, 2023;
originally announced November 2023.
-
Closed-Loop Motion Planning for Differentially Flat Systems: A Time-Varying Optimization Framework
Authors:
Tianqi Zheng,
John W. Simpson-Porco,
Enrique Mallada
Abstract:
Motion planning and control are two core components of the robotic systems autonomy stack. The standard approach to combine these methodologies comprises an offline/open-loop stage, planning, that designs a feasible and safe trajectory to follow, and an online/closed-loop stage, tracking, that corrects for unmodeled dynamics and disturbances. Such an approach generally introduces conservativeness…
▽ More
Motion planning and control are two core components of the robotic systems autonomy stack. The standard approach to combine these methodologies comprises an offline/open-loop stage, planning, that designs a feasible and safe trajectory to follow, and an online/closed-loop stage, tracking, that corrects for unmodeled dynamics and disturbances. Such an approach generally introduces conservativeness into the planning stage, which becomes difficult to overcome as the model complexity increases and real-time decisions need to be made in a changing environment. This work addresses these challenges for the class of differentially flat nonlinear systems by integrating planning and control into a cohesive closed-loop task. Precisely, we develop an optimization-based framework that aims to steer a differentially flat system to a trajectory implicitly defined via a constrained time-varying optimization problem. To that end, we generalize the notion of feedback linearization, which makes non-linear systems behave as linear systems, and develop controllers that effectively transform a differentially flat system into an optimization algorithm that seeks to find the optimal solution of a (possibly time-varying) optimization problem. Under sufficient regularity assumptions, we prove global asymptotic convergence for the optimization dynamics to the minimizer of the time-varying optimization problem. We illustrate the effectiveness of our method with two numerical examples: a multi-robot tracking problem and a robot obstacle avoidance problem.
△ Less
Submitted 19 October, 2023;
originally announced October 2023.
-
Intercept Function and Quantity Bidding in Two-stage Electricity Market with Market Power Mitigation
Authors:
Rajni Kant Bansal,
Yue Chen,
Pengcheng You,
Enrique Mallada
Abstract:
Electricity markets typically operate in two stages, day-ahead and real-time. Despite best efforts striving efficiency, evidence of price manipulation has called for system-level market power mitigation (MPM) initiatives that substitute noncompetitive bids with default bids. Implementing these policies with a limited understanding of participant behavior may lead to unintended economic losses. In…
▽ More
Electricity markets typically operate in two stages, day-ahead and real-time. Despite best efforts striving efficiency, evidence of price manipulation has called for system-level market power mitigation (MPM) initiatives that substitute noncompetitive bids with default bids. Implementing these policies with a limited understanding of participant behavior may lead to unintended economic losses. In this paper, we model the competition between generators and inelastic loads in a two-stage market with stage-wise MPM policies. The loss of Nash equilibrium and lack of guarantee of stable market outcome in the case of conventional supply function bidding motivates the use of an alternative market mechanism where generators bid an intercept function. A Nash equilibrium analysis for a day-ahead MPM policy leads to a Stackelberg-Nash game with loads exercising market power at the expense of generators. A comparison of the resulting equilibrium with the standard market (not implementing any MPM policy) shows that a day-ahead policy completely mitigates the market power of generators. On the other hand, the real-time MPM policy increases demand allocation to real-time, contrary to current market practice with most electricity trades in the day-ahead market. Numerical studies illustrate the impact of the slope of the intercept function on the standard market.
△ Less
Submitted 10 August, 2023;
originally announced August 2023.
-
Early Neuron Alignment in Two-layer ReLU Networks with Small Initialization
Authors:
Hancheng Min,
Enrique Mallada,
René Vidal
Abstract:
This paper studies the problem of training a two-layer ReLU network for binary classification using gradient flow with small initialization. We consider a training dataset with well-separated input vectors: Any pair of input data with the same label are positively correlated, and any pair with different labels are negatively correlated. Our analysis shows that, during the early phase of training,…
▽ More
This paper studies the problem of training a two-layer ReLU network for binary classification using gradient flow with small initialization. We consider a training dataset with well-separated input vectors: Any pair of input data with the same label are positively correlated, and any pair with different labels are negatively correlated. Our analysis shows that, during the early phase of training, neurons in the first layer try to align with either the positive data or the negative data, depending on its corresponding weight on the second layer. A careful analysis of the neurons' directional dynamics allows us to provide an $\mathcal{O}(\frac{\log n}{\sqrtμ})$ upper bound on the time it takes for all neurons to achieve good alignment with the input data, where $n$ is the number of data points and $μ$ measures how well the data are separated. After the early alignment phase, the loss converges to zero at a $\mathcal{O}(\frac{1}{t})$ rate, and the weight matrix on the first layer is approximately low-rank. Numerical experiments on the MNIST dataset illustrate our theoretical findings.
△ Less
Submitted 25 March, 2024; v1 submitted 24 July, 2023;
originally announced July 2023.
-
Necessary and Sufficient Conditions for Simultaneous State and Input Recovery of Linear Systems with Sparse Inputs by $\ell_1$-Minimization
Authors:
Kyle Poe,
Enrique Mallada,
René Vidal
Abstract:
The study of theoretical conditions for recovering sparse signals from compressive measurements has received a lot of attention in the research community. In parallel, there has been a great amount of work characterizing conditions for the recovery both the state and the input to a linear dynamical system (LDS), including a handful of results on recovering sparse inputs. However, existing sufficie…
▽ More
The study of theoretical conditions for recovering sparse signals from compressive measurements has received a lot of attention in the research community. In parallel, there has been a great amount of work characterizing conditions for the recovery both the state and the input to a linear dynamical system (LDS), including a handful of results on recovering sparse inputs. However, existing sufficient conditions for recovering sparse inputs to an LDS are conservative and hard to interpret, while necessary and sufficient conditions have not yet appeared in the literature. In this work, we provide (1) the first characterization of necessary and sufficient conditions for the existence and uniqueness of sparse inputs to an LDS, (2) the first necessary and sufficient conditions for a linear program to recover both an unknown initial state and a sparse input, and (3) simple, interpretable recovery conditions in terms of the LDS parameters. We conclude with a numerical validation of these claims and discuss implications and future directions.
△ Less
Submitted 11 April, 2023;
originally announced April 2023.
-
A Frequency Domain Analysis of Slow Coherency in Networked Systems
Authors:
Hancheng Min,
Richard Pates,
Enrique Mallada
Abstract:
Network coherence generally refers to the emergence of simple aggregated dynamical behaviours, despite heterogeneity in the dynamics of the subsystems that constitute the network. In this paper, we develop a general frequency domain framework to analyze and quantify the level of network coherence that a system exhibits by relating coherence with a low-rank property of the system's input-output res…
▽ More
Network coherence generally refers to the emergence of simple aggregated dynamical behaviours, despite heterogeneity in the dynamics of the subsystems that constitute the network. In this paper, we develop a general frequency domain framework to analyze and quantify the level of network coherence that a system exhibits by relating coherence with a low-rank property of the system's input-output response. More precisely, for a networked system with linear dynamics and coupling, we show that, as the network's \emph{effective algebraic connectivity} grows, the system transfer matrix converges to a rank-one transfer matrix representing the coherent behavior. Interestingly, the non-zero eigenvalue of such a rank-one matrix is given by the harmonic mean of individual nodal dynamics, and we refer to it as the coherent dynamics. Our analysis unveils the frequency-dependent nature of coherence and a non-trivial interplay between dynamics and network topology. We further show that many networked systems can exhibit similar coherent behavior by establishing a concentration result in a setting with randomly chosen individual nodal dynamics.
△ Less
Submitted 16 February, 2023;
originally announced February 2023.
-
Market Power Mitigation in Two-stage Electricity Market with Supply Function and Quantity Bidding
Authors:
Rajni Kant Bansal,
Yue Chen,
Pengcheng You,
Enrique Mallada
Abstract:
The main goal of a sequential two-stage electricity market -- e.g., day-ahead and real-time markets -- is to operate efficiently. However, the price difference across stages due to inadequate competition and unforeseen circumstances leads to undesirable price manipulation. To mitigate this, some Independent System Operators (ISOs) proposed system-level market power mitigation (MPM) policies in add…
▽ More
The main goal of a sequential two-stage electricity market -- e.g., day-ahead and real-time markets -- is to operate efficiently. However, the price difference across stages due to inadequate competition and unforeseen circumstances leads to undesirable price manipulation. To mitigate this, some Independent System Operators (ISOs) proposed system-level market power mitigation (MPM) policies in addition to existing local policies. These policies aim to substitute noncompetitive bids with a default bid based on estimated generator costs. However, these policies may lead to unintended consequences when implemented without accounting for the conflicting interest of participants. In this paper, we model the competition between generators (bidding supply functions) and loads (bidding quantity) in a two-stage market with a stage-wise MPM policy. An equilibrium analysis shows that a real-time MPM policy leads to equilibrium loss, meaning no stable market outcome (Nash equilibrium) exists. A day-ahead MPM policy, besides, leads to a Stackelberg-Nash game with loads acting as leaders and generators as followers. In this setting, loads become winners, i.e., their aggregate payment is always less than competitive payments. Moreover, comparison with standard market equilibrium highlights that markets are better off without such policies. Finally, numerical studies highlight the impact of heterogeneity and load size on market equilibrium.
△ Less
Submitted 18 September, 2023; v1 submitted 10 January, 2023;
originally announced January 2023.
-
Constrained Reinforcement Learning via Dissipative Saddle Flow Dynamics
Authors:
Tianqi Zheng,
Pengcheng You,
Enrique Mallada
Abstract:
In constrained reinforcement learning (C-RL), an agent seeks to learn from the environment a policy that maximizes the expected cumulative reward while satisfying minimum requirements in secondary cumulative reward constraints. Several algorithms rooted in sampled-based primal-dual methods have been recently proposed to solve this problem in policy space. However, such methods are based on stochas…
▽ More
In constrained reinforcement learning (C-RL), an agent seeks to learn from the environment a policy that maximizes the expected cumulative reward while satisfying minimum requirements in secondary cumulative reward constraints. Several algorithms rooted in sampled-based primal-dual methods have been recently proposed to solve this problem in policy space. However, such methods are based on stochastic gradient descent ascent algorithms whose trajectories are connected to the optimal policy only after a mixing output stage that depends on the algorithm's history. As a result, there is a mismatch between the behavioral policy and the optimal one. In this work, we propose a novel algorithm for constrained RL that does not suffer from these limitations. Leveraging recent results on regularized saddle-flow dynamics, we develop a novel stochastic gradient descent-ascent algorithm whose trajectories converge to the optimal policy almost surely.
△ Less
Submitted 2 December, 2022;
originally announced December 2022.
-
Learning Coherent Clusters in Weakly-Connected Network Systems
Authors:
Hancheng Min,
Enrique Mallada
Abstract:
We propose a structure-preserving model-reduction methodology for large-scale dynamic networks with tightly-connected components. First, the coherent groups are identified by a spectral clustering algorithm on the graph Laplacian matrix that models the network feedback. Then, a reduced network is built, where each node represents the aggregate dynamics of each coherent group, and the reduced netwo…
▽ More
We propose a structure-preserving model-reduction methodology for large-scale dynamic networks with tightly-connected components. First, the coherent groups are identified by a spectral clustering algorithm on the graph Laplacian matrix that models the network feedback. Then, a reduced network is built, where each node represents the aggregate dynamics of each coherent group, and the reduced network captures the dynamic coupling between the groups. We provide an upper bound on the approximation error when the network graph is randomly generated from a weight stochastic block model. Finally, numerical experiments align with and validate our theoretical findings.
△ Less
Submitted 12 May, 2023; v1 submitted 28 November, 2022;
originally announced November 2022.
-
Spectral clustering and model reduction for weakly-connected coherent network systems
Authors:
Hancheng Min,
Enrique Mallada
Abstract:
We propose a novel model-reduction methodology for large-scale dynamic networks with tightly-connected components. First, the coherent groups are identified by a spectral clustering algorithm on the graph Laplacian matrix that models the network feedback. Then, a reduced network is built, where each node represents the aggregate dynamics of each coherent group, and the reduced network captures the…
▽ More
We propose a novel model-reduction methodology for large-scale dynamic networks with tightly-connected components. First, the coherent groups are identified by a spectral clustering algorithm on the graph Laplacian matrix that models the network feedback. Then, a reduced network is built, where each node represents the aggregate dynamics of each coherent group, and the reduced network captures the dynamic coupling between the groups. Our approach is theoretically justified under a random graph setting. Finally, numerical experiments align with and validate our theoretical findings.
△ Less
Submitted 3 October, 2022; v1 submitted 27 September, 2022;
originally announced September 2022.
-
Frequency sha** control for weakly-coupled grid-forming IBRs
Authors:
Bala Kameshwar Poolla,
Yashen Lin,
Andrey Bernstein,
Enrique Mallada,
Dominic Groß
Abstract:
We consider the problem of controlling the frequency of low-inertia power systems via inverter-based resources (IBRs) that are weakly connected to the grid. We propose a novel grid-forming control strategy, the so-called frequency sha** control, that aims to shape the frequency response of synchronous generators (SGs) to load perturbations so as to efficiently arrest sudden frequency drops. Our…
▽ More
We consider the problem of controlling the frequency of low-inertia power systems via inverter-based resources (IBRs) that are weakly connected to the grid. We propose a novel grid-forming control strategy, the so-called frequency sha** control, that aims to shape the frequency response of synchronous generators (SGs) to load perturbations so as to efficiently arrest sudden frequency drops. Our solution relaxes several existing assumptions in the literature and is able to navigate tradeoffs between peak power requirements and maximum frequency deviations. Finally, we analyze the robustness to imperfect knowledge of network parameters, while particularly highlighting the importance of accurate estimation of these parameters.
△ Less
Submitted 17 December, 2022; v1 submitted 26 September, 2022;
originally announced September 2022.
-
Model-free Learning of Regions of Attraction via Recurrent Sets
Authors:
Yue Shen,
Maxim Bichuch,
Enrique Mallada
Abstract:
We consider the problem of learning an inner approximation of the region of attraction (ROA) of an asymptotically stable equilibrium point without an explicit model of the dynamics. Rather than leveraging approximate models with bounded uncertainty to find a (robust) invariant set contained in the ROA, we propose to learn sets that satisfy a more relaxed notion of containment known as recurrence.…
▽ More
We consider the problem of learning an inner approximation of the region of attraction (ROA) of an asymptotically stable equilibrium point without an explicit model of the dynamics. Rather than leveraging approximate models with bounded uncertainty to find a (robust) invariant set contained in the ROA, we propose to learn sets that satisfy a more relaxed notion of containment known as recurrence. We define a set to be $τ$-recurrent (resp. $k$-recurrent) if every trajectory that starts within the set, returns to it after at most $τ$ seconds (resp. $k$ steps). We show that under mild assumptions a $τ$-recurrent set containing a stable equilibrium must be a subset of its ROA. We then leverage this property to develop algorithms that compute inner approximations of the ROA using counter-examples of recurrence that are obtained by sampling finite-length trajectories. Our algorithms process samples sequentially, which allow them to continue being executed even after an initial offline training stage. We further provide an upper bound on the number of counter-examples used by the algorithm, and almost sure convergence guarantees.
△ Less
Submitted 13 September, 2023; v1 submitted 21 April, 2022;
originally announced April 2022.
-
Closed-Form Minkowski Sum Approximations for Efficient Optimization-Based Collision Avoidance
Authors:
James Guthrie,
Marin Kobilarov,
Enrique Mallada
Abstract:
Motion planning methods for autonomous systems based on nonlinear programming offer great flexibility in incorporating various dynamics, objectives, and constraints. One limitation of such tools is the difficulty of efficiently representing obstacle avoidance conditions for non-trivial shapes. For example, it is possible to define collision avoidance constraints suitable for nonlinear programming…
▽ More
Motion planning methods for autonomous systems based on nonlinear programming offer great flexibility in incorporating various dynamics, objectives, and constraints. One limitation of such tools is the difficulty of efficiently representing obstacle avoidance conditions for non-trivial shapes. For example, it is possible to define collision avoidance constraints suitable for nonlinear programming solvers in the canonical setting of a circular robot navigating around M convex polytopes over N time steps. However, it requires introducing (2+L)MN additional constraints and LMN additional variables, with L being the number of halfplanes per polytope, leading to larger nonlinear programs with slower and less reliable solving time. In this paper, we overcome this issue by building closed-form representations of the collision avoidance conditions by outer-approximating the Minkowski sum conditions for collision. Our solution requires only MN constraints (and no additional variables), leading to a smaller nonlinear program. On motion planning problems for an autonomous car and quadcopter in cluttered environments, we achieve speedups of 4.8x and 8.7x respectively with significantly less variance in solve times and negligible impact on performance arising from the use of outer approximations.
△ Less
Submitted 29 March, 2022;
originally announced March 2022.
-
On the Stability, Economic Efficiency and Incentive Compatibility of Electricity Market Dynamics
Authors:
Pengcheng You,
Yan Jiang,
Enoch Yeung,
Dennice F. Gayme,
Enrique Mallada
Abstract:
This paper focuses on the operation of an electricity market that accounts for participants that bid at a sub-minute timescale. To that end, we model the market-clearing process as a dynamical system, called market dynamics, which is temporally coupled with the grid frequency dynamics and is thus required to guarantee system-wide stability while meeting the system operational constraints. We chara…
▽ More
This paper focuses on the operation of an electricity market that accounts for participants that bid at a sub-minute timescale. To that end, we model the market-clearing process as a dynamical system, called market dynamics, which is temporally coupled with the grid frequency dynamics and is thus required to guarantee system-wide stability while meeting the system operational constraints. We characterize participants as price-takers who rationally update their bids to maximize their utility in response to real-time schedules of prices and dispatch. For two common bidding mechanisms, based on quantity and price, we identify a notion of alignment between participants' behavior and planners' goals that leads to a saddle-based design of the market that guarantees convergence to a point meeting all operational constraints. We further explore cases where this alignment property does not hold and observe that misaligned participants' bidding can destabilize the closed-loop system. We thus design a regularized version of the market dynamics that recovers all the desirable stability and steady-state performance guarantees. Numerical tests validate our results on the IEEE 39-bus system.
△ Less
Submitted 10 December, 2021;
originally announced December 2021.
-
Reinforcement Learning with Almost Sure Constraints
Authors:
Agustin Castellano,
Hancheng Min,
Juan Bazerque,
Enrique Mallada
Abstract:
In this work we address the problem of finding feasible policies for Constrained Markov Decision Processes under probability one constraints. We argue that stationary policies are not sufficient for solving this problem, and that a rich class of policies can be found by endowing the controller with a scalar quantity, so called budget, that tracks how close the agent is to violating the constraint.…
▽ More
In this work we address the problem of finding feasible policies for Constrained Markov Decision Processes under probability one constraints. We argue that stationary policies are not sufficient for solving this problem, and that a rich class of policies can be found by endowing the controller with a scalar quantity, so called budget, that tracks how close the agent is to violating the constraint. We show that the minimal budget required to act safely can be obtained as the smallest fixed point of a Bellman-like operator, for which we analyze its convergence properties. We also show how to learn this quantity when the true kernel of the Markov decision process is not known, while providing sample-complexity bounds. The utility of knowing this minimal budget relies in that it can aid in the search of optimal or near-optimal policies by shrinking down the region of the state space the agent must navigate. Simulations illustrate the different nature of probability one constraints against the typically used constraints in expectation.
△ Less
Submitted 13 February, 2023; v1 submitted 9 December, 2021;
originally announced December 2021.
-
A Market Mechanism for Truthful Bidding with Energy Storage
Authors:
Rajni Kant Bansal,
Pengcheng You,
Dennice F. Gayme,
Enrique Mallada
Abstract:
This paper proposes a market mechanism for multi-interval electricity markets with generator and storage participants. Drawing ideas from supply function bidding, we introduce a novel bid structure for storage participation that allows storage units to communicate their cost to the market using energy-cycling functions that map prices to cycle depths. The resulting market-clearing process--impleme…
▽ More
This paper proposes a market mechanism for multi-interval electricity markets with generator and storage participants. Drawing ideas from supply function bidding, we introduce a novel bid structure for storage participation that allows storage units to communicate their cost to the market using energy-cycling functions that map prices to cycle depths. The resulting market-clearing process--implemented via convex programming--yields corresponding schedules and payments based on traditional energy prices for power supply and per-cycle prices for storage utilization. We illustrate the benefits of our solution by comparing the competitive equilibrium of the resulting mechanism to that of an alternative solution that uses prosumer-based bids. Our solution shows several advantages over the prosumer-based approach. It does not require a priori price estimation. It also incentivizes participants to reveal their truthful cost, thus leading to an efficient, competitive equilibrium. Numerical experiments using New York Independent System Operator (NYISO) data validate our findings.
△ Less
Submitted 14 October, 2021; v1 submitted 29 September, 2021;
originally announced September 2021.
-
Inner Approximations of the Positive-Semidefinite Cone via Grassmannian Packings
Authors:
Tianqi Zheng,
James Guthrie,
Enrique Mallada
Abstract:
We investigate the problem of finding inner ap-proximations of positive semidefinite (PSD) cones. We developa novel decomposition framework of the PSD cone by meansof conical combinations of smaller dimensional sub-cones. Weshow that many inner approximation techniques could besummarized within this framework, including the set of (scaled)diagonally dominant matrices, Factor-widthkmatrices, andCho…
▽ More
We investigate the problem of finding inner ap-proximations of positive semidefinite (PSD) cones. We developa novel decomposition framework of the PSD cone by meansof conical combinations of smaller dimensional sub-cones. Weshow that many inner approximation techniques could besummarized within this framework, including the set of (scaled)diagonally dominant matrices, Factor-widthkmatrices, andChordal Sparse matrices. Furthermore, we provide a moreflexible family of inner approximations of the PSD cone, wherewe aim to arrange the sub-cones so that they are maximallyseparated from each other. In doing so, these approximationstend to occupy large fractions of the volume of the PSD cone.The proposed approach is connected to a classical packingproblem in Riemannian Geometry. Precisely, we show thatthe problem of finding maximally distant sub-cones in anambient PSD cone is equivalent to the problem of packingsub-spaces in a Grassmannian Manifold. We further leverageexisting computational method for constructing packings inGrassmannian manifolds to build tighter approximations ofthe PSD cone. Numerical experiments show how the proposedframework can balance between accuracy and computationalcomplexity, to efficiently solve positive-semidefinite programs.
△ Less
Submitted 30 September, 2021; v1 submitted 25 May, 2021;
originally announced May 2021.
-
Learning to Act Safely with Limited Exposure and Almost Sure Certainty
Authors:
Agustin Castellano,
Hancheng Min,
Juan Bazerque,
Enrique Mallada
Abstract:
This paper puts forward the concept that learning to take safe actions in unknown environments, even with probability one guarantees, can be achieved without the need for an unbounded number of exploratory trials. This is indeed possible, provided that one is willing to navigate trade-offs between optimality, level of exposure to unsafe events, and the maximum detection time of unsafe actions. We…
▽ More
This paper puts forward the concept that learning to take safe actions in unknown environments, even with probability one guarantees, can be achieved without the need for an unbounded number of exploratory trials. This is indeed possible, provided that one is willing to navigate trade-offs between optimality, level of exposure to unsafe events, and the maximum detection time of unsafe actions. We illustrate this concept in two complementary settings. We first focus on the canonical multi-armed bandit problem and study the intrinsic trade-offs of learning safety in the presence of uncertainty. Under mild assumptions on sufficient exploration, we provide an algorithm that provably detects all unsafe machines in an (expected) finite number of rounds. The analysis also unveils a trade-off between the number of rounds needed to secure the environment and the probability of discarding safe machines. We then consider the problem of finding optimal policies for a Markov Decision Process (MDP) with almost sure constraints. We show that the action-value function satisfies a barrier-based decomposition which allows for the identification of feasible policies independently of the reward process. Using this decomposition, we develop a Barrier-learning algorithm, that identifies such unsafe state-action pairs in a finite expected number of steps. Our analysis further highlights a trade-off between the time lag for the underlying MDP necessary to detect unsafe actions, and the level of exposure to unsafe events. Simulations corroborate our theoretical findings, further illustrating the aforementioned trade-offs, and suggesting that safety constraints can speed up the learning process.
△ Less
Submitted 13 February, 2023; v1 submitted 18 May, 2021;
originally announced May 2021.
-
Convergence and Implicit Bias of Gradient Flow on Overparametrized Linear Networks
Authors:
Hancheng Min,
Salma Tarmoun,
Rene Vidal,
Enrique Mallada
Abstract:
Neural networks trained via gradient descent with random initialization and without any regularization enjoy good generalization performance in practice despite being highly overparametrized. A promising direction to explain this phenomenon is to study how initialization and overparametrization affect convergence and implicit bias of training algorithms. In this paper, we present a novel analysis…
▽ More
Neural networks trained via gradient descent with random initialization and without any regularization enjoy good generalization performance in practice despite being highly overparametrized. A promising direction to explain this phenomenon is to study how initialization and overparametrization affect convergence and implicit bias of training algorithms. In this paper, we present a novel analysis of single-hidden-layer linear networks trained under gradient flow, which connects initialization, optimization, and overparametrization. Firstly, we show that the squared loss converges exponentially to its optimum at a rate that depends on the level of imbalance and the margin of the initialization. Secondly, we show that proper initialization constrains the dynamics of the network parameters to lie within an invariant set. In turn, minimizing the loss over this set leads to the min-norm solution. Finally, we show that large hidden layer width, together with (properly scaled) random initialization, ensures proximity to such an invariant set during training, allowing us to derive a novel non-asymptotic upper-bound on the distance between the trained network and the min-norm solution.
△ Less
Submitted 16 May, 2022; v1 submitted 13 May, 2021;
originally announced May 2021.
-
Voltage Collapse Stabilization in Star DC Networks
Authors:
Charalampos Avraam,
Enrique Mallada
Abstract:
Voltage collapse is a type of blackout-inducing dynamic instability that occurs when power demand exceeds the maximum power that can be transferred through a network. The traditional (preventive) approach to avoid voltage collapse is based on ensuring that the network never reaches its maximum capacity. However, such an approach leads to inefficient use of network resources and does not account fo…
▽ More
Voltage collapse is a type of blackout-inducing dynamic instability that occurs when power demand exceeds the maximum power that can be transferred through a network. The traditional (preventive) approach to avoid voltage collapse is based on ensuring that the network never reaches its maximum capacity. However, such an approach leads to inefficient use of network resources and does not account for unforeseen events. To overcome this limitation, this paper seeks to initiate the study of voltage collapse stabilization, i.e., the design of load controllers aimed at stabilizing the point of voltage collapse. We formulate the problem of voltage stability for a star direct current network as a dynamic problem where each load seeks to achieve a constant power consumption by updating its conductance as the voltage changes. We introduce a voltage collapse stabilization controller and show that the high-voltage equilibrium is stabilized. More importantly, we are able to achieve proportional load shedding under extreme loading conditions. We further highlight the key features of our controller using numerical illustrations.
△ Less
Submitted 19 April, 2021;
originally announced April 2021.
-
Coherence and Concentration in Tightly-Connected Networks
Authors:
Hancheng Min,
Richard Pates,
Enrique Mallada
Abstract:
The ability to achieve coordinated behavior -- engineered or emergent -- on networked systems has attracted widespread interest over several fields. This interest has led to remarkable advances in develo** a theoretical understanding of the conditions under which agents within a network can reach an agreement (consensus) or develop coordinated behavior, such as synchronization. However, much les…
▽ More
The ability to achieve coordinated behavior -- engineered or emergent -- on networked systems has attracted widespread interest over several fields. This interest has led to remarkable advances in develo** a theoretical understanding of the conditions under which agents within a network can reach an agreement (consensus) or develop coordinated behavior, such as synchronization. However, much less understood is the phenomenon of network coherence. Network coherence generally refers to nodes' ability in a network to have a similar dynamic response despite heterogeneity in their individual behavior. In this paper, we develop a general framework to analyze and quantify the level of network coherence that a system exhibits by relating coherence with a low-rank property of the system. More precisely, for a networked system with linear dynamics and coupling, we show that, as the network connectivity grows, the system transfer matrix converges to a rank-one transfer matrix representing the coherent behavior. Interestingly, the non-zero eigenvalue of such a rank-one matrix is given by the harmonic mean of individual nodal dynamics, and we refer to it as the coherent dynamics. Our analysis unveils the frequency-dependent nature of coherence and a non-trivial interplay between dynamics and network topology. We further show that many networked systems can exhibit similar coherent behavior by establishing a concentration result in a setting with randomly chosen individual nodal dynamics.
△ Less
Submitted 25 March, 2022; v1 submitted 4 January, 2021;
originally announced January 2021.
-
Assured RL: Reinforcement Learning with Almost Sure Constraints
Authors:
Agustin Castellano,
Juan Bazerque,
Enrique Mallada
Abstract:
We consider the problem of finding optimal policies for a Markov Decision Process with almost sure constraints on state transitions and action triplets. We define value and action-value functions that satisfy a barrier-based decomposition which allows for the identification of feasible policies independently of the reward process. We prove that, given a policy π, certifying whether certain state-a…
▽ More
We consider the problem of finding optimal policies for a Markov Decision Process with almost sure constraints on state transitions and action triplets. We define value and action-value functions that satisfy a barrier-based decomposition which allows for the identification of feasible policies independently of the reward process. We prove that, given a policy π, certifying whether certain state-action pairs lead to feasible trajectories under π is equivalent to solving an auxiliary problem aimed at finding the probability of performing an unfeasible transition. Using this interpretation,we develop a Barrier-learning algorithm, based on Q-Learning, that identifies such unsafe state-action pairs. Our analysis motivates the need to enhance the Reinforcement Learning (RL) framework with an additional signal, besides rewards, called here damage function that provides feasibility information and enables the solution of RL problems with model-free constraints. Moreover, our Barrier-learning algorithm wraps around existing RL algorithms, such as Q-Learning and SARSA, giving them the ability to solve almost-surely constrained problems.
△ Less
Submitted 23 December, 2020;
originally announced December 2020.
-
On the Value of Energy Storage in Generation Cost Reduction
Authors:
Yue Shen,
Maxim Bichuch,
Enrique Mallada
Abstract:
This work seeks to quantify the benefits of using energy storage toward the reduction of the energy generation cost of a power system. A two-fold optimization framework is provided where the first optimization problem seeks to find the optimal storage schedule that minimizes operational costs. Since the operational cost depends on the storage capacity, a second optimization problem is then formula…
▽ More
This work seeks to quantify the benefits of using energy storage toward the reduction of the energy generation cost of a power system. A two-fold optimization framework is provided where the first optimization problem seeks to find the optimal storage schedule that minimizes operational costs. Since the operational cost depends on the storage capacity, a second optimization problem is then formulated with the aim of finding the optimal storage capacity to be deployed. Although, in general, these problems are difficult to solve, we provide a lower bound on the cost savings for a parametrized family of demand profiles. The optimization framework is numerically illustrated using real-world demand data from ISO New England. Numerical results show that energy storage can reduce energy generation costs by at least 2.5 %.
△ Less
Submitted 2 October, 2020;
originally announced October 2020.
-
Storage Degradation Aware Economic Dispatch
Authors:
R. K. Bansal,
P. You,
D. F. Gayme,
E. Mallada
Abstract:
In this paper, we formulate a cycling cost aware economic dispatch problem that co-optimizes generation and storage dispatch while taking into account cycle based storage degradation cost. Our approach exploits the Rainflow cycle counting algorithm to quantify storage degradation for each charging and discharging half-cycle based on its depth. We show that the dispatch is optimal for individual pa…
▽ More
In this paper, we formulate a cycling cost aware economic dispatch problem that co-optimizes generation and storage dispatch while taking into account cycle based storage degradation cost. Our approach exploits the Rainflow cycle counting algorithm to quantify storage degradation for each charging and discharging half-cycle based on its depth. We show that the dispatch is optimal for individual participants in the sense that it maximizes the profit of generators and storage units, under price taking assumptions. We further provide a condition under which the optimal storage response is unique for given market clearing prices. Simulations using data from the New York Independent System Operator (NYISO) illustrate the optimization framework. In particular, they show that the generation-centric dispatch that does not account for storage degradation is insufficient to guarantee storage profitability.
△ Less
Submitted 10 October, 2021; v1 submitted 1 October, 2020;
originally announced October 2020.
-
Learning to be safe, in finite time
Authors:
Agustin Castellano,
Juan Bazerque,
Enrique Mallada
Abstract:
This paper aims to put forward the concept that learning to take safe actions in unknown environments, even with probability one guarantees, can be achieved without the need for an unbounded number of exploratory trials, provided that one is willing to relax its optimality requirements mildly. We focus on the canonical multi-armed bandit problem and seek to study the exploration-preservation trade…
▽ More
This paper aims to put forward the concept that learning to take safe actions in unknown environments, even with probability one guarantees, can be achieved without the need for an unbounded number of exploratory trials, provided that one is willing to relax its optimality requirements mildly. We focus on the canonical multi-armed bandit problem and seek to study the exploration-preservation trade-off intrinsic within safe learning. More precisely, by defining a handicap metric that counts the number of unsafe actions, we provide an algorithm for discarding unsafe machines (or actions), with probability one, that achieves constant handicap. Our algorithm is rooted in the classical sequential probability ratio test, redefined here for continuing tasks. Under standard assumptions on sufficient exploration, our rule provably detects all unsafe machines in an (expected) finite number of rounds. The analysis also unveils a trade-off between the number of rounds needed to secure the environment and the probability of discarding safe machines. Our decision rule can wrap around any other algorithm to optimize a specific auxiliary goal since it provides a safe environment to search for (approximately) optimal policies. Simulations corroborate our theoretical findings and further illustrate the aforementioned trade-offs.
△ Less
Submitted 31 March, 2021; v1 submitted 1 October, 2020;
originally announced October 2020.
-
Saddle Flow Dynamics: Observable Certificates and Separable Regularization
Authors:
Pengcheng You,
Enrique Mallada
Abstract:
This paper proposes a certificate, rooted in observability, for asymptotic convergence of saddle flow dynamics of convex-concave functions to a saddle point. This observable certificate directly bridges the gap between the invariant set and the equilibrium set in a LaSalle argument, and generalizes conventional conditions such as strict convexity-concavity and proximal regularization. We further b…
▽ More
This paper proposes a certificate, rooted in observability, for asymptotic convergence of saddle flow dynamics of convex-concave functions to a saddle point. This observable certificate directly bridges the gap between the invariant set and the equilibrium set in a LaSalle argument, and generalizes conventional conditions such as strict convexity-concavity and proximal regularization. We further build upon this certificate to propose a separable regularization method for saddle flow dynamics that makes minimal requirements on convexity-concavity and yet still guarantees asymptotic convergence to a saddle point. Our results generalize to saddle flow dynamics with projections on the vector field and have an immediate application as a distributed solution to linear programs.
△ Less
Submitted 30 September, 2020;
originally announced September 2020.
-
Grid-forming frequency sha** control
Authors:
Yan Jiang,
Andrey Bernstein,
Petr Vorobev,
Enrique Mallada
Abstract:
As power systems transit to a state of high renewable penetration, little or no presence of synchronous generators makes the prerequisite of well-regulated frequency for grid-following inverters unrealistic. Thus, there is a trend to resort to grid-forming inverters which set frequency directly. We propose a novel grid-forming frequency sha** control that is able to shape the aggregate system fr…
▽ More
As power systems transit to a state of high renewable penetration, little or no presence of synchronous generators makes the prerequisite of well-regulated frequency for grid-following inverters unrealistic. Thus, there is a trend to resort to grid-forming inverters which set frequency directly. We propose a novel grid-forming frequency sha** control that is able to shape the aggregate system frequency dynamics into a first-order one with the desired steady-state frequency deviation and Rate of Change of Frequency (RoCoF) after a sudden power imbalance. The no overshoot property resulting from the first-order dynamics allows the system frequency to monotonically move towards its new steady-state without experiencing frequency Nadir, which largely improves frequency security. We prove that our grid-forming frequency-sha** control renders the system internally stable under mild assumptions. The performance of the proposed control is verified via numerical simulations on a modified Icelandic Power Network test case.
△ Less
Submitted 14 September, 2020;
originally announced September 2020.
-
Storage-Based Frequency Sha** Control
Authors:
Yan Jiang,
Eliza Cohn,
Petr Vorobev,
Enrique Mallada
Abstract:
With the decrease in system inertia, frequency security becomes an issue for power systems around the world. Energy storage systems (ESS), due to their excellent ram** capabilities, are considered as a natural choice for the improvement of frequency response following major contingencies. In this manuscript, we propose a new strategy for energy storage -- frequency sha** control -- that allows…
▽ More
With the decrease in system inertia, frequency security becomes an issue for power systems around the world. Energy storage systems (ESS), due to their excellent ram** capabilities, are considered as a natural choice for the improvement of frequency response following major contingencies. In this manuscript, we propose a new strategy for energy storage -- frequency sha** control -- that allows to completely eliminate the frequency Nadir, one of the main issue in frequency security, and at the same time tune the rate of change of frequency (RoCoF) to a desired value. With Nadir eliminated, the frequency security assessment can be performed via simple algebraic calculations, as opposed to dynamic simulations for conventional control strategies. Moreover, our proposed control is also very efficient in terms of the requirements on storage peak power, requiring up to 40% less power than conventional virtual inertia approach for the same performance.
△ Less
Submitted 25 May, 2020;
originally announced May 2020.
-
Performance of Single and Double-Integrator Networks over Directed Graphs
Authors:
H. Giray Oral,
Enrique Mallada,
Dennice F. Gayme
Abstract:
This paper provides a framework to evaluate the performance of single and double integrator networks over arbitrary directed graphs. Adopting vehicular network terminology, we consider quadratic performance metrics defined by the L2-norm of position and velocity based response functions given impulsive inputs to each vehicle. We exploit the spectral properties of weighted graph Laplacians and outp…
▽ More
This paper provides a framework to evaluate the performance of single and double integrator networks over arbitrary directed graphs. Adopting vehicular network terminology, we consider quadratic performance metrics defined by the L2-norm of position and velocity based response functions given impulsive inputs to each vehicle. We exploit the spectral properties of weighted graph Laplacians and output performance matrices to derive a novel method of computing the closed-form solutions for this general class of performance metrics, which include H2-norm based quantities as special cases. We then explore the effect of the interplay between network properties (e.g. edge directionality and connectivity) and the control strategy on the overall network performance. More precisely, for systems whose interconnection is described by graphs with normal Laplacian L, we characterize the role of directionality by comparing their performance with that of their undirected counterparts, represented by the Hermitian part of L. We show that, for single-integrator networks, directed and undirected graphs perform identically. However, for double-integrator networks, graph directionality -- expressed by the eigenvalues of L with nonzero imaginary part -- can significantly degrade performance. Interestingly in many cases, well-designed feedback can also exploit directionality to mitigate degradation or even improve the performance to exceed that of the undirected case. Finally we focus on a system coherence metric -- aggregate deviation from the state average -- to investigate the relationship between performance and degree of connectivity, leading to somewhat surprising findings. For example increasing the number of neighbors on a ω-nearest neighbor directed graph does not necessarily improve performance. Similarly, we demonstrate equivalence in performance between all-to-one and all-to-all communication graphs.
△ Less
Submitted 2 November, 2019;
originally announced November 2019.
-
What is the Largest Sparsity Pattern that Can Be Recovered by 1-Norm Minimization?
Authors:
Mustafa D. Kaba,
Mengnan Zhao,
Rene Vidal,
Daniel P. Robinson,
Enrique Mallada
Abstract:
Much of the existing literature in sparse recovery is concerned with the following question: given a sparsity pattern and a corresponding regularizer, derive conditions on the dictionary under which exact recovery is possible. In this paper, we study the opposite question: given a dictionary and the 1-norm regularizer, find the largest sparsity pattern that can be recovered. We show that such a pa…
▽ More
Much of the existing literature in sparse recovery is concerned with the following question: given a sparsity pattern and a corresponding regularizer, derive conditions on the dictionary under which exact recovery is possible. In this paper, we study the opposite question: given a dictionary and the 1-norm regularizer, find the largest sparsity pattern that can be recovered. We show that such a pattern is described by a mathematical object called a "maximum abstract simplicial complex", and provide two different characterizations of this object: one based on extreme points and the other based on vectors of minimal support. In addition, we show how this new framework is useful in the study of sparse recovery problems when the dictionary takes the form of a graph incidence matrix or a partial discrete Fourier transform. In case of incidence matrices, we show that the largest sparsity pattern that can be recovered is determined by the set of simple cycles of the graph. As a byproduct, we show that standard sparse recovery can be certified in polynomial time, although this is known to be NP-hard for general matrices. In the case of the partial discrete Fourier transform, our characterization of the largest sparsity pattern that can be recovered requires the unknown signal to be real and its dimension to be a prime number.
△ Less
Submitted 23 July, 2020; v1 submitted 12 October, 2019;
originally announced October 2019.
-
Dynamic Droop Approach for Storage-based Frequency Control
Authors:
Yan Jiang,
Eliza Cohn,
Petr Vorobev,
Enrique Mallada
Abstract:
Transient frequency dips that follow sudden power imbalances --frequency Nadir-- represent a big challenge for frequency stability of low-inertia power systems. Since low inertia is identified as one of the causes for deep frequency Nadir, virtual inertia, which is provided by energy storage units, is said to be one of the solutions to the problem. In the present paper, we propose a new method for…
▽ More
Transient frequency dips that follow sudden power imbalances --frequency Nadir-- represent a big challenge for frequency stability of low-inertia power systems. Since low inertia is identified as one of the causes for deep frequency Nadir, virtual inertia, which is provided by energy storage units, is said to be one of the solutions to the problem. In the present paper, we propose a new method for frequency control with energy storage systems (ESS), called dynamic droop control (iDroop), that can completely eliminate frequency Nadir during transients. Nadir elimination allows us to perform frequency stability assessment without the need for direct numerical simulations of system dynamics. We make a direct comparison of our developed strategy with the usual control approaches --virtual inertia (VI) and droop control (DC)-- and show that iDroop is more effective than both in eliminating the Nadir. More precisely, iDroop achieves the Nadir elimination under significantly lower gains than virtual inertia and requires almost $40\%$ less storage power capacity to implement the control. Moreover, we show that rather unrealistic control gains are required for virtual inertia in order to achieve Nadir elimination.
△ Less
Submitted 14 October, 2019; v1 submitted 10 October, 2019;
originally announced October 2019.
-
Implicit Trajectory Planning for Feedback Linearizable Systems: A Time-varying Optimization Approach
Authors:
Tianqi Zheng,
John Simpson-Porco,
Enrique Mallada
Abstract:
We develop an optimization-based framework for joint real-time trajectory planning and feedback control of feedback-linearizable systems. To achieve this goal, we define a target trajectory as the optimal solution of a time-varying optimization problem. In general, however, such trajectory may not be feasible due to , e.g., nonholonomic constraints. To solve this problem, we design a control law t…
▽ More
We develop an optimization-based framework for joint real-time trajectory planning and feedback control of feedback-linearizable systems. To achieve this goal, we define a target trajectory as the optimal solution of a time-varying optimization problem. In general, however, such trajectory may not be feasible due to , e.g., nonholonomic constraints. To solve this problem, we design a control law that generates feasible trajectories that asymptotically converge to the target trajectory. More precisely, for systems that are (dynamic) full-state linearizable, the proposed control law implicitly transforms the nonlinear system into an optimization algorithm of sufficiently high order. We prove global exponential convergence to the target trajectory for both the optimization algorithm and the original system. We illustrate the effectiveness of our proposed method on multi-target or multi-agent tracking problems with constraints.
△ Less
Submitted 13 March, 2020; v1 submitted 1 October, 2019;
originally announced October 2019.
-
Accurate Reduced-Order Models for Heterogeneous Coherent Generators
Authors:
Hancheng Min,
Fernando Paganini,
Enrique Mallada
Abstract:
We introduce a novel framework to approximate the aggregate frequency dynamics of coherent generators. By leveraging recent results on dynamics concentration of tightly connected networks, and frequency weighted balanced truncation, a hierarchy of reduced-order models is developed. This hierarchy provides increasing accuracy in the approximation of the aggregate system response, outperforming exis…
▽ More
We introduce a novel framework to approximate the aggregate frequency dynamics of coherent generators. By leveraging recent results on dynamics concentration of tightly connected networks, and frequency weighted balanced truncation, a hierarchy of reduced-order models is developed. This hierarchy provides increasing accuracy in the approximation of the aggregate system response, outperforming existing aggregation techniques.
△ Less
Submitted 16 November, 2020; v1 submitted 27 September, 2019;
originally announced September 2019.
-
Adversarial Model Predictive Control via Second-Order Cone Programming
Authors:
James Guthrie,
Enrique Mallada
Abstract:
We study the problem of designing attacks to safety-critical systems in which the adversary seeks to maximize the overall system cost within a model predictive control framework. Although in general this problem is NP-hard, we characterize a family of problems that can be solved in polynomial time via a second-order cone programming relaxation. In particular, we show that positive systems fall und…
▽ More
We study the problem of designing attacks to safety-critical systems in which the adversary seeks to maximize the overall system cost within a model predictive control framework. Although in general this problem is NP-hard, we characterize a family of problems that can be solved in polynomial time via a second-order cone programming relaxation. In particular, we show that positive systems fall under this family. We provide examples demonstrating the design of optimal attacks on an autonomous vehicle and a microgrid.
△ Less
Submitted 11 September, 2019;
originally announced September 2019.
-
Dynamic Droop Control in Low-inertia Power Systems
Authors:
Yan Jiang,
Richard Pates,
Enrique Mallada
Abstract:
A widely embraced approach to mitigate the dynamic degradation in low-inertia power systems is to mimic generation response using grid-connected inverters to restore the grid's stiffness. In this paper, we seek to challenge this approach and advocate for a principled design based on a systematic analysis of the performance trade-offs of inverter-based frequency control. With this aim, we perform a…
▽ More
A widely embraced approach to mitigate the dynamic degradation in low-inertia power systems is to mimic generation response using grid-connected inverters to restore the grid's stiffness. In this paper, we seek to challenge this approach and advocate for a principled design based on a systematic analysis of the performance trade-offs of inverter-based frequency control. With this aim, we perform a qualitative and quantitative study comparing the effect of conventional control strategies --droop control (DC) and virtual inertia (VI)-- on several performance metrics induced by $\mathcal L_2$ and $\mathcal L_\infty$ signal norms. By extending a recently proposed modal decomposition method, we capture the effect of step and stochastic power disturbances, and frequency measurement noise, on the overall transient and steady-state behavior of the system. Our analysis unveils several limitations of these solutions, such as the inability of DC to improve dynamic frequency response without increasing steady-state control effort, or the large frequency variance that VI introduces in the presence of measurement noise. We further propose a novel dynam-i-c Droop controller (iDroop) that overcomes the limitations of DC and VI. More precisely, we show that iDroop can be tuned to achieve high noise rejection, fast system-wide synchronization, or frequency overshoot (Nadir) elimination without affecting the steady-state control effort share, and propose a tuning recommendation that strikes a balance among these objectives. Extensive numerical experimentation shows that the proposed tuning is effective even when our proportionality assumptions are not valid, and that the particular tuning used for Nadir elimination strikes a good trade-off among various performance metrics.
△ Less
Submitted 28 August, 2019;
originally announced August 2019.
-
Global analysis of synchronization performance for power systems: bridging the theory-practice gap
Authors:
Fernando Paganini,
Enrique Mallada
Abstract:
The issue of synchronization in the power grid is receiving renewed attention, as new energy sources with different dynamics enter the picture. Global metrics have been proposed to evaluate performance and analyzed under highly simplified assumptions. In this paper, we extend this approach to more realistic network scenarios and more closely connect it with metrics used in power engineering practi…
▽ More
The issue of synchronization in the power grid is receiving renewed attention, as new energy sources with different dynamics enter the picture. Global metrics have been proposed to evaluate performance and analyzed under highly simplified assumptions. In this paper, we extend this approach to more realistic network scenarios and more closely connect it with metrics used in power engineering practice. In particular, our analysis covers networks with generators of heterogeneous ratings and richer dynamic models of machines. Under a suitable proportionality assumption in the parameters, we show that the step response of bus frequencies can be decomposed in two components. The first component is a {system-wide frequency} that captures the aggregate grid behavior, and the residual component represents the individual bus frequency deviations from the aggregate. Using this decomposition, we define --and compute in closed form-- several metrics that capture dynamic behaviors that are of relevance for power engineers. In particular, using the \emph{system frequency}, we define industry-style metrics (Nadir, RoCoF) that are evaluated through a representative machine. We further use the norm of the residual component to define a \emph{synchronization cost} that can appropriately quantify inter-area oscillations. Finally, we employ robustness analysis tools to evaluate deviations from our proportionality assumption. We show that the system frequency still captures the grid steady-state deviation, and becomes an accurate reduced-order model of the grid as the network connectivity grows. Simulation studies with practically relevant data are included to validate the theory and further illustrate the impact of network structure and parameters on synchronization. Our analysis gives conclusions of practical interest, sometimes challenging the conventional wisdom in the field.
△ Less
Submitted 16 May, 2019;
originally announced May 2019.
-
The Role of Strategic Load Participants in Two-Stage Settlement Electricity Markets
Authors:
Pengcheng You,
Dennice F. Gayme,
Enrique Mallada
Abstract:
Two-stage electricity market clearing is designed to maintain market efficiency under ideal conditions, e.g., perfect forecast and nonstrategic generation. This work demonstrates that the individual strategic behavior of inelastic load participants in a two-stage settlement electricity market can deteriorate efficiency. Our analysis further implies that virtual bidding can play a role in alleviati…
▽ More
Two-stage electricity market clearing is designed to maintain market efficiency under ideal conditions, e.g., perfect forecast and nonstrategic generation. This work demonstrates that the individual strategic behavior of inelastic load participants in a two-stage settlement electricity market can deteriorate efficiency. Our analysis further implies that virtual bidding can play a role in alleviating this loss of efficiency by mitigating the market power of strategic load participants. We use real-world market data from New York ISO to validate our theory.
△ Less
Submitted 15 September, 2019; v1 submitted 20 March, 2019;
originally announced March 2019.
-
Dynamics Concentration of Large-Scale Tightly-Connected Networks
Authors:
Hancheng Min,
Enrique Mallada
Abstract:
The ability to achieve coordinated behavior --engineered or emergent-- on networked systems has attracted widespread interest over several fields. This has led to remarkable advances on the development of a theoretical understanding of the conditions under which agents within a network can reach agreement (consensus) or develop coordinated behaviors such as synchronization. However, fewer advances…
▽ More
The ability to achieve coordinated behavior --engineered or emergent-- on networked systems has attracted widespread interest over several fields. This has led to remarkable advances on the development of a theoretical understanding of the conditions under which agents within a network can reach agreement (consensus) or develop coordinated behaviors such as synchronization. However, fewer advances have been made toward explaining another commonly observed phenomena in tightly-connected networks systems: output responses of nodes in the networks are almost identical to each other despite heterogeneity in their individual dynamics. In this paper, we leverage tools from high-dimensional probability to provide an initial answer to this phenomena. More precisely, we show that for linear networks of nodal random transfer functions, as the network size and connectivity grows, every node in the network follows the same response to an input or disturbance --irrespectively of the source of this input. We term this behavior as dynamics concentration since it stems from the fact that the network transfer matrix uniformly converges in probability, i.e., it concentrates, to a unique dynamic response determined by the distribution of the random transfer function of each node. We further discuss the implications of our analysis in the context of model reduction and robustness, and provide numerical evidence that similar phenomena occur in small deterministic networks over a properly defined frequency band.
△ Less
Submitted 12 September, 2019; v1 submitted 14 March, 2019;
originally announced March 2019.
-
Online Inventory Management with Application to Energy Procurement in Data Centers
Authors:
Lin Yang,
Mohammad H. Hajiesmaili,
Ramesh Sitaraman,
Enrique Mallada,
Wing S. Wong,
Adam Wierman
Abstract:
Motivated by the application of energy storage management in electricity markets, this paper considers the problem of online linear programming with inventory management constraints. Specifically, a decision maker should satisfy some units of an asset as her demand, either form a market with time-varying price or from her own inventory. The decision maker is presented a price in slot-by-slot manne…
▽ More
Motivated by the application of energy storage management in electricity markets, this paper considers the problem of online linear programming with inventory management constraints. Specifically, a decision maker should satisfy some units of an asset as her demand, either form a market with time-varying price or from her own inventory. The decision maker is presented a price in slot-by-slot manner, and must immediately decide the purchased amount with the current price to cover the demand or to store in inventory for covering the future demand. The inventory has a limited capacity and its critical role is to buy and store assets at low price and use the stored assets to cover the demand at high price. The ultimate goal of the decision maker is to cover the demands while minimizing the cost of buying assets from the market. We propose BatMan, an online algorithm for simple inventory models, and BatManRate, an extended version for the case with rate constraints. Both BatMan and BatManRate achieve optimal competitive ratios, meaning that no other online algorithm can achieve a better theoretical guarantee. To illustrate the results, we use the proposed algorithms to design and evaluate energy procurement and storage management strategies for data centers with a portfolio of energy sources including the electric grid, local renewable generation, and energy storage systems.
△ Less
Submitted 14 January, 2019;
originally announced January 2019.
-
Linear-Convex Optimal Steady-State Control
Authors:
Liam S. P. Lawrence,
John W. Simpson-Porco,
Enrique Mallada
Abstract:
We consider the problem of designing a feedback controller for a multivariable linear time-invariant system which regulates an arbitrary system output to the solution of an equality-constrained convex optimization problem despite unknown constant exogenous disturbances; we term this the linear-convex optimal steady-state (OSS) control problem. We introduce the notion of an optimality model, and sh…
▽ More
We consider the problem of designing a feedback controller for a multivariable linear time-invariant system which regulates an arbitrary system output to the solution of an equality-constrained convex optimization problem despite unknown constant exogenous disturbances; we term this the linear-convex optimal steady-state (OSS) control problem. We introduce the notion of an optimality model, and show that the existence of an optimality model is sufficient to reduce the OSS control problem to a stabilization problem. This yields a constructive design framework for optimal steady-state control that unifies and extends existing design methods in the literature. We illustrate the approach via an application to optimal frequency control of power networks, where our methodology recovers centralized and distributed controllers reported in the recent literature.
△ Less
Submitted 10 May, 2020; v1 submitted 30 October, 2018;
originally announced October 2018.
-
Voltage Collapse Stabilization: A Game Theory Viewpoint
Authors:
Charalampos Avraam,
Jesse Rines,
Aurik Sarker,
Fernando Paganini,
Enrique Mallada
Abstract:
Voltage collapse is a type of blackout-inducing dynamic instability that occurs when the power demand exceeds the maximum power that can be transferred through the network. The traditional (preventive) approach to avoid voltage collapse is based on ensuring that the network never reaches its maximum capacity. However, such an approach leads to inefficiencies as it prevents operators to fully utili…
▽ More
Voltage collapse is a type of blackout-inducing dynamic instability that occurs when the power demand exceeds the maximum power that can be transferred through the network. The traditional (preventive) approach to avoid voltage collapse is based on ensuring that the network never reaches its maximum capacity. However, such an approach leads to inefficiencies as it prevents operators to fully utilize the network resources and does not account for unprescribed events. To overcome this limitation, this paper seeks to initiate the study of voltage collapse stabilization.
More precisely, for a DC network, we formulate the problem of voltage stability as a dynamic problem where each load seeks to achieve a constant power consumption by updating its conductance as the voltage changes. We show that such a system can be interpreted as a dynamic game, where each player (load) seeks to myopically maximize their utility, and where every stable power flow solution amounts to a Local Nash Equilibrium.
Using this framework, we show that voltage collapse is equivalent to the non-existence of a Local Nash Equilibrium in the game and, as a result, it is caused by the lack of cooperation between loads. Finally, we propose a Voltage Collapse Stabilizer (VCS) controller that uses (flexible) loads that are willing to cooperate and provides a fair allocation of the curtailed demand. Our solution stabilizes voltage collapse even in the presence of non-cooperative loads. Numerical simulations validate several features of our controllers.
△ Less
Submitted 26 October, 2018; v1 submitted 23 October, 2018;
originally announced October 2018.
-
Optimal Steady-State Control for Linear Time-Invariant Systems
Authors:
Liam S. P. Lawrence,
Zachary E. Nelson,
Enrique Mallada,
John W. Simpson-Porco
Abstract:
We consider the problem of designing a feedback controller that guides the input and output of a linear time-invariant system to a minimizer of a convex optimization problem. The system is subject to an unknown disturbance that determines the feasible set defined by the system equilibrium constraints. Our proposed design enforces the Karush-Kuhn-Tucker optimality conditions in steady-state without…
▽ More
We consider the problem of designing a feedback controller that guides the input and output of a linear time-invariant system to a minimizer of a convex optimization problem. The system is subject to an unknown disturbance that determines the feasible set defined by the system equilibrium constraints. Our proposed design enforces the Karush-Kuhn-Tucker optimality conditions in steady-state without incorporating dual variables into the controller. We prove that the input and output variables achieve optimality in equilibrium and outline two procedures for designing controllers that stabilize the closed-loop system. We explore key ideas through simple examples and simulations.
△ Less
Submitted 8 October, 2018;
originally announced October 2018.
-
Robust Scale-Free Synthesis for Frequency Control in Power Systems
Authors:
Richard Pates,
Enrique Mallada
Abstract:
The AC frequency in electrical power systems is conventionally regulated by synchronous machines. The gradual replacement of these machines by asynchronous renewable-based generation, which provides little or no frequency control, increases system uncertainty and the risk of instability. This imposes hard limits on the proportion of renewables that can be integrated into the system. In this paper…
▽ More
The AC frequency in electrical power systems is conventionally regulated by synchronous machines. The gradual replacement of these machines by asynchronous renewable-based generation, which provides little or no frequency control, increases system uncertainty and the risk of instability. This imposes hard limits on the proportion of renewables that can be integrated into the system. In this paper we address this issue by develo** a framework for performing frequency control in power systems with arbitrary mixes of conventional and renewable generation. Our approach is based on a robust stability criterion that can be used to guarantee the stability of a full power system model on the basis of a set of decentralised tests, one for each component in the system. It can be applied even when using detailed heterogeneous component models, and can be verified using several standard frequency response, state-space, and circuit theoretic analysis tools. Furthermore the stability guarantees hold independently of the operating point, and remain valid even as components are added to and removed from the grid. By designing decentralised controllers for individual components to meet these decentralised tests, every component can contribute to the regulation of the system frequency in a simple and provable manner. Notably, our framework certifies the stability of several existing (non-passive) power system control schemes and models, and allows for the study of robustness with respect to delays.
△ Less
Submitted 12 June, 2019; v1 submitted 2 April, 2018;
originally announced April 2018.
-
Sparse Recovery over Graph Incidence Matrices
Authors:
Mengnan Zhao,
M. Devrim Kaba,
René Vidal,
Daniel P. Robinson,
Enrique Mallada
Abstract:
Classical results in sparse recovery guarantee the exact reconstruction of $s$-sparse signals under assumptions on the dictionary that are either too strong or NP-hard to check. Moreover, such results may be pessimistic in practice since they are based on a worst-case analysis. In this paper, we consider the sparse recovery of signals defined over a graph, for which the dictionary takes the form o…
▽ More
Classical results in sparse recovery guarantee the exact reconstruction of $s$-sparse signals under assumptions on the dictionary that are either too strong or NP-hard to check. Moreover, such results may be pessimistic in practice since they are based on a worst-case analysis. In this paper, we consider the sparse recovery of signals defined over a graph, for which the dictionary takes the form of an incidence matrix. We derive necessary and sufficient conditions for sparse recovery, which depend on properties of the cycles of the graph that can be checked in polynomial time. We also derive support-dependent conditions for sparse recovery that depend only on the intersection of the cycles of the graph with the support of the signal. Finally, we exploit sparsity properties on the measurements and the structure of incidence matrices to propose a specialized sub-graph-based recovery algorithm that outperforms the standard $\ell_1$-minimization approach.
△ Less
Submitted 25 October, 2018; v1 submitted 26 March, 2018;
originally announced March 2018.
-
Robust Decentralized Secondary Frequency Control in Power Systems: Merits and Trade-Offs
Authors:
Erik Weitenberg,
Yan Jiang,
Changhong Zhao,
Enrique Mallada,
Claudio De Persis,
Florian Dörfler
Abstract:
Frequency restoration in power systems is conventionally performed by broadcasting a centralized signal to local controllers. As a result of the energy transition, technological advances, and the scientific interest in distributed control and optimization methods, a plethora of distributed frequency control strategies have been proposed recently that rely on communication amongst local controllers…
▽ More
Frequency restoration in power systems is conventionally performed by broadcasting a centralized signal to local controllers. As a result of the energy transition, technological advances, and the scientific interest in distributed control and optimization methods, a plethora of distributed frequency control strategies have been proposed recently that rely on communication amongst local controllers.
In this paper we propose a fully decentralized leaky integral controller for frequency restoration that is derived from a classic lag element. We study steady-state, asymptotic optimality, nominal stability, input-to-state stability, noise rejection, transient performance, and robustness properties of this controller in closed loop with a nonlinear and multivariable power system model. We demonstrate that the leaky integral controller can strike an acceptable trade-off between performance and robustness as well as between asymptotic disturbance rejection and transient convergence rate by tuning its DC gain and time constant. We compare our findings to conventional decentralized integral control and distributed-averaging-based integral control in theory and simulations.
△ Less
Submitted 22 November, 2018; v1 submitted 20 November, 2017;
originally announced November 2017.
-
An integral quadratic constraint framework for real-time steady-state optimization of linear time-invariant systems
Authors:
Zachary E. Nelson,
Enrique Mallada
Abstract:
Achieving optimal steady-state performance in real-time is an increasingly necessary requirement of many critical infrastructure systems. In pursuit of this goal, this paper builds a systematic design framework of feedback controllers for Linear Time-Invariant (LTI) systems that continuously track the optimal solution of some predefined optimization problem. The proposed solution can be logically…
▽ More
Achieving optimal steady-state performance in real-time is an increasingly necessary requirement of many critical infrastructure systems. In pursuit of this goal, this paper builds a systematic design framework of feedback controllers for Linear Time-Invariant (LTI) systems that continuously track the optimal solution of some predefined optimization problem. The proposed solution can be logically divided into three components. The first component estimates the system state from the output measurements. The second component uses the estimated state and computes a drift direction based on an optimization algorithm. The third component computes an input to the LTI system that aims to drive the system toward the optimal steady-state.
We analyze the equilibrium characteristics of the closed-loop system and provide conditions for optimality and stability. Our analysis shows that the proposed solution guarantees optimal steady-state performance, even in the presence of constant disturbances. Furthermore, by leveraging recent results on the analysis of optimization algorithms using integral quadratic constraints (IQCs), the proposed framework is able to translate input-output properties of our optimization component into sufficient conditions, based on linear matrix inequalities (LMIs), for global exponential asymptotic stability of the closed loop system. We illustrate the versatility of our framework using several examples.
△ Less
Submitted 27 October, 2017;
originally announced October 2017.