-
A safe exploration approach to constrained Markov decision processes
Authors:
Tingting Ni,
Maryam Kamgarpour
Abstract:
We consider discounted infinite horizon constrained Markov decision processes (CMDPs) where the goal is to find an optimal policy that maximizes the expected cumulative reward subject to expected cumulative constraints. Motivated by the application of CMDPs in online learning of safety-critical systems, we focus on developing a model-free and simulator-free algorithm that ensures constraint satisfaction during learning. To this end, we develop an interior point approach based on the log barrier function of the CMDP. Under the commonly assumed conditions of Fisher non-degeneracy and bounded transfer error of the policy parameterization, we establish the theoretical properties of the algorithm. In particular, in contrast to existing CMDP approaches that ensure policy feasibility only upon convergence, our algorithm guarantees the feasibility of the policies during the learning process and converges to the $\varepsilon$-optimal policy with a sample complexity of $\tilde{\mathcal{O}}(\varepsilon^{-6})$. In comparison to the state-of-the-art policy gradient-based algorithm, C-NPG-PDA, our algorithm requires an additional $\mathcal{O}(\varepsilon^{-2})$ samples to ensure policy feasibility during learning with the same Fisher non-degenerate parameterization.
Submitted 23 May, 2024; v1 submitted 1 December, 2023;
originally announced December 2023.
-
Connections whose differential Galois groups are reductive of maximal degree
Authors:
Masoud Kamgarpour,
Daniel S. Sage
Abstract:
The differential Galois group of an $n^\mathrm{th}$ order linear differential equation is the symmetry group of its solutions; it is an algebraic subgroup of $\mathrm{GL}_n(\mathbb{C})$. More generally, if $G$ is a simple complex algebraic group, the differential Galois group of a $G$-connection is an algebraic subgroup of $G$. A connected reductive subgroup of $G$ is said to have maximal degree if it has a fundamental degree equal to the Coxeter number of $G$. We give a complete classification of these subgroups and generalise a theorem of Katz on linear differential equations by giving a criterion for the differential Galois group of a $G$-connection to be reductive of maximal degree. As an application, we determine the differential Galois groups of certain $G$-connections that play an important role in recent work on the geometric Langlands program: connections on $\mathbb{G}_m$ with an (irregular) "Coxeter" singularity and possibly an additional regular singular point.
Submitted 20 September, 2023;
originally announced September 2023.
-
Identifiability and Generalizability in Constrained Inverse Reinforcement Learning
Authors:
Andreas Schlaginhaufen,
Maryam Kamgarpour
Abstract:
Two main challenges in Reinforcement Learning (RL) are designing appropriate reward functions and ensuring the safety of the learned policy. To address these challenges, we present a theoretical framework for Inverse Reinforcement Learning (IRL) in constrained Markov decision processes. From a convex-analytic perspective, we extend prior results on reward identifiability and generalizability to both the constrained setting and a more general class of regularizations. In particular, we show that identifiability up to potential shaping (Cao et al., 2021) is a consequence of entropy regularization and may generally no longer hold for other regularizations or in the presence of safety constraints. We also show that to ensure generalizability to new transition laws and constraints, the true reward must be identified up to a constant. Additionally, we derive a finite sample guarantee for the suboptimality of the learned rewards, and validate our results in a gridworld environment.
Submitted 1 June, 2023;
originally announced June 2023.
-
On Linear Quadratic Potential Games
Authors:
Sara Hosseinirad,
Giulio Salizzoni,
Alireza Alian Porzani,
Maryam Kamgarpour
Abstract:
Our paper addresses characterizing conditions for a linear quadratic (LQ) game to be a potential game. The desired properties of potential games in finite action settings, such as convergence of learning dynamics to Nash equilibria, and the challenges of learning Nash equilibria in continuous state and action settings motivate us to characterize LQ potential games. Our first contribution is to show that the set of LQ games with full-state feedback that are potential games is very limited, essentially differing only slightly from an identical interest game. Given this finding, we restrict the class of LQ games to those with decoupled dynamics and decoupled state information structure. For this subclass, we show that the set of potential games strictly includes non-identical interest games and characterize conditions for the LQ games in this subclass to be potential. We further derive their corresponding potential function and prove the existence of a Nash equilibrium. Meanwhile, we highlight the challenges in the characterization and computation of Nash equilibrium for this class of potential LQ games.
Submitted 9 June, 2024; v1 submitted 22 May, 2023;
originally announced May 2023.
-
Convergence Rate of Learning a Strongly Variationally Stable Equilibrium
Authors:
Tatiana Tatarenko,
Maryam Kamgarpour
Abstract:
We derive the rate of convergence to the strongly variationally stable Nash equilibrium in a convex game, for a zeroth-order learning algorithm. Though we do not assume strong monotonicity of the game, our rates for the one-point feedback and for the two-point feedback match the best known rates for strongly monotone games under zeroth-order information.
Submitted 11 March, 2024; v1 submitted 5 April, 2023;
originally announced April 2023.
-
Safe Zeroth-Order Optimization Using Linear Programs
Authors:
Baiwei Guo,
Yang Wang,
Yuning Jiang,
Maryam Kamgarpour,
Giancarlo Ferrari-Trecate
Abstract:
To solve unmodeled optimization problems with hard constraints, this paper proposes a novel zeroth-order approach called Safe Zeroth-order Optimization using Linear Programs (SZO-LP). The SZO-LP method solves a linear program in each iteration to find a descent direction, followed by a step length determination. We prove that, under mild conditions, the iterates of SZO-LP have an accumulation point that is also the primal of a KKT pair. We then apply SZO-LP to solve an Optimal Power Flow (OPF) problem on the IEEE 30-bus system. The results demonstrate that SZO-LP requires less computation time and samples compared to state-of-the-art approaches.
Submitted 4 April, 2023;
originally announced April 2023.
-
Safe Zeroth-Order Optimization Using Quadratic Local Approximations
Authors:
Baiwei Guo,
Yuning Jiang,
Giancarlo Ferrari-Trecate,
Maryam Kamgarpour
Abstract:
This paper addresses black-box smooth optimization problems, where the objective and constraint functions are not explicitly known but can be queried. The main goal of this work is to generate a sequence of feasible points converging towards a KKT primal-dual pair. Assuming prior knowledge of the smoothness of the unknown objective and constraints, we propose a novel zeroth-order method that iteratively computes quadratic approximations of the constraint functions, constructs local feasible sets and optimizes over them. Under some mild assumptions, we prove that this method returns an $\eta$-KKT pair (a property reflecting how close a primal-dual pair is to the exact KKT condition) within $O(1/\eta^{2})$ iterations. Moreover, we numerically show that our method can achieve faster convergence compared with some state-of-the-art zeroth-order approaches. The effectiveness of the proposed approach is also illustrated by applying it to nonconvex optimization problems in optimal control and power system operation.
Submitted 24 April, 2024; v1 submitted 29 March, 2023;
originally announced March 2023.
-
Certification of Bottleneck Task Assignment with Shortest Path Criteria
Authors:
Tony A. Wood,
Maryam Kamgarpour
Abstract:
Minimising the longest travel distance for a group of mobile robots with interchangeable goals requires knowledge of the shortest length paths between all robots and goal destinations. Determining the exact length of the shortest paths in an environment with obstacles is, however, NP-hard. In this paper, we investigate when polynomial-time approximations of the shortest path search are sufficient to determine the optimal assignment of robots to goals. In particular, we propose an algorithm in which the accuracy of the path planning is iteratively increased. The approach provides a certificate when the uncertainties on estimates of the shortest paths become small enough to guarantee the optimality of the goal assignment. To this end, we apply results from assignment sensitivity assuming upper and lower bounds on the length of the shortest paths. We then provide polynomial-time methods to find such bounds by applying sampling-based path planning. The upper bounds are given by feasible paths; the lower bounds are obtained by expanding the sample set and leveraging the knowledge of the sample dispersion. We demonstrate the application of the proposed method with a multi-robot path-planning case study.
Submitted 8 June, 2023; v1 submitted 24 December, 2022;
originally announced December 2022.
-
Safe Zeroth-Order Convex Optimization Using Quadratic Local Approximations
Authors:
Baiwei Guo,
Yuning Jiang,
Maryam Kamgarpour,
Giancarlo Ferrari-Trecate
Abstract:
We address black-box convex optimization problems, where the objective and constraint functions are not explicitly known but can be sampled within the feasible set. The challenge is thus to generate a sequence of feasible points converging towards an optimal solution. By leveraging the knowledge of the smoothness properties of the objective and constraint functions, we propose a novel zeroth-order method, SZO-QQ, that iteratively computes quadratic approximations of the constraint functions, constructs local feasible sets and optimizes over them. We prove convergence of the sequence of the objective values generated at each iteration to the minimum. Through experiments, we show that our method can achieve faster convergence compared with state-of-the-art zeroth-order approaches to convex optimization.
Submitted 7 November, 2022; v1 submitted 4 November, 2022;
originally announced November 2022.
-
Arithmetic geometry of character varieties with regular monodromy, I
Authors:
Masoud Kamgarpour,
GyeongHyeon Nam,
Anna Puskás
Abstract:
We study character varieties arising as moduli of representations of an orientable surface group into a reductive group $G$. We first show that if $G/Z$ acts freely on the representation variety, then both the representation variety and the character variety are smooth and equidimensional. Next, we count points on a family of smooth character varieties; namely, those involving both regular semisimple and regular unipotent monodromy. In particular, we show that these varieties are polynomial count and obtain an explicit expression for their $E$-polynomials. Finally, by analysing the $E$-polynomial, we determine certain topological invariants of these varieties such as the Euler characteristic and the number of connected components. As an application, we give an example of a cohomologically rigid representation which is not physically rigid.
Submitted 14 September, 2023; v1 submitted 5 September, 2022;
originally announced September 2022.
-
Log Barriers for Safe Black-box Optimization with Application to Safe Reinforcement Learning
Authors:
Ilnura Usmanova,
Yarden As,
Maryam Kamgarpour,
Andreas Krause
Abstract:
Optimizing noisy functions online, when evaluating the objective requires experiments on a deployed system, is a crucial task arising in manufacturing, robotics and many others. Often, constraints on safe inputs are unknown ahead of time, and we only obtain noisy information, indicating how close we are to violating the constraints. Yet, safety must be guaranteed at all times, not only for the final output of the algorithm.
We introduce a general approach for seeking a stationary point in high dimensional non-linear stochastic optimization problems in which maintaining safety during learning is crucial. Our approach called LB-SGD is based on applying stochastic gradient descent (SGD) with a carefully chosen adaptive step size to a logarithmic barrier approximation of the original problem. We provide a complete convergence analysis of non-convex, convex, and strongly-convex smooth constrained problems, with first-order and zeroth-order feedback. Our approach yields efficient updates and scales better with dimensionality compared to existing approaches.
We empirically compare the sample complexity and the computational cost of our method with existing safe learning approaches. Beyond synthetic benchmarks, we demonstrate the effectiveness of our approach on minimizing constraint violation in policy search tasks in safe reinforcement learning (RL).
Submitted 2 June, 2023; v1 submitted 21 July, 2022;
originally announced July 2022.
-
Character stacks are PORC count
Authors:
Nick Bridger,
Masoud Kamgarpour
Abstract:
We compute the number of points over finite fields of the character stack associated to a compact surface group and a reductive group with connected centre. We find that the answer is a Polynomial On Residue Classes (PORC). The key ingredients in the proof are Lusztig's Jordan decomposition of complex characters and Deriziotis's results on genus numbers of finite reductive groups. As a consequence of our main theorem, we obtain an expression for the $E$-polynomial of the character stack.
Submitted 14 September, 2022; v1 submitted 8 March, 2022;
originally announced March 2022.
-
On the Rate of Convergence of Payoff-based Algorithms to Nash Equilibrium in Strongly Monotone Games
Authors:
Tatiana Tatarenko,
Maryam Kamgarpour
Abstract:
We derive the rate of convergence to Nash equilibria for the payoff-based algorithm proposed in \cite{tat_kam_TAC}. These rates are achieved under the standard assumption of convexity of the game, strong monotonicity and differentiability of the pseudo-gradient. In particular, we show the algorithm achieves $O(\frac{1}{T})$ in the two-point function evaluating setting and $O(\frac{1}{\sqrt{T}})$ in the one-point function evaluation under additional requirement of Lipschitz continuity of the pseudo-gradient. These rates are to our knowledge the best known rates for the corresponding problem classes.
Submitted 22 February, 2022;
originally announced February 2022.
-
Hypergeometric sheaves for classical groups via geometric Langlands
Authors:
Masoud Kamgarpour,
Daxin Xu,
Lingfei Yi
Abstract:
In a previous paper, the first and third authors gave an explicit realization of the geometric Langlands correspondence for hypergeometric sheaves, considered as $\textrm{GL}_n$-local systems. Certain hypergeometric local systems admit a symplectic or orthogonal structure, which can be viewed as $\check{G}$-local systems, for a classical group $\check{G}$. This article aims to realize the geometric Langlands correspondence for these $\check{G}$-local systems. We study this problem from two aspects. In the first approach, we define the hypergeometric automorphic data for a classical group $G$ in the framework of Yun, one of whose local components is a new class of euphotic representations in the sense of Jakob-Yun. We prove the rigidity of hypergeometric automorphic data under natural assumptions, which allows us to define $\check{G}$-local systems $\mathcal{E}_{\check{G}}$ on $\mathbb{G}_m$ as Hecke eigenvalues (in both $\ell$-adic and de Rham setting). In the second approach (which works only in the de Rham setting), we quantize an enhanced ramified Hitchin system, following Beilinson-Drinfeld and Zhu, and identify $\mathcal{E}_{\check{G}}$ with certain $\check{G}$-opers on $\mathbb{G}_m$. Finally, we compare these $\check{G}$-opers with hypergeometric local systems.
Submitted 20 January, 2022;
originally announced January 2022.
-
Distributionally Robust Bootstrap Optimization
Authors:
Tyler Summers,
Maryam Kamgarpour
Abstract:
Control architectures and autonomy stacks for complex engineering systems are often divided into layers to decompose a complex problem and solution into distinct, manageable sub-problems. To simplify designs, uncertainties are often ignored across layers, an approach with deep roots in classical notions of separation and certainty equivalence. But to develop robust architectures, especially as interactions between data-driven learning layers and model-based decision-making layers grow more intricate, more sophisticated interfaces between layers are required. We propose a basic architecture that couples a statistical parameter estimation layer with a constrained optimization layer. We show how the layers can be tightly integrated by combining bootstrap resampling with distributionally robust optimization. The approach allows a finite-data out-of-sample safety guarantee and an exact reformulation as a tractable finite-dimensional convex optimization problem.
Submitted 27 December, 2021;
originally announced December 2021.
-
Airy sheaves for reductive groups
Authors:
Konstantin Jakob,
Masoud Kamgarpour,
Lingfei Yi
Abstract:
We construct a class of $\ell$-adic local systems on $\mathbb{A}^1$ that generalizes the Airy sheaves defined by N. Katz to reductive groups. These sheaves are finite field analogues of generalizations of the classical Airy equation $y''(z)=zy(z)$. We employ the geometric Langlands correspondence to construct the sought-after local systems as eigenvalues of certain rigid Hecke eigensheaves, following the methods developed by Heinloth, Ngô and Yun. The construction is motivated by a special case of Adler and Yu's construction of tame supercuspidal representations. The representations that we consider can be viewed as deeper analogues of simple supercuspidals. For $\mathrm{GL}_n$, we compute the Frobenius trace of the local systems in question and show that they agree with Katz's Airy sheaves. We make precise conjectures about the ramification behaviour of the local systems at $\infty$. These conjectures in particular imply cohomological rigidity of Airy sheaves.
Submitted 3 November, 2021;
originally announced November 2021.
-
Fast Projection Onto Convex Smooth Constraints
Authors:
Ilnura Usmanova,
Maryam Kamgarpour,
Andreas Krause,
Kfir Yehuda Levy
Abstract:
The Euclidean projection onto a convex set is an important problem that arises in numerous constrained optimization tasks. Unfortunately, in many cases, computing projections is computationally demanding. In this work, we focus on projection problems where the constraints are smooth and the number of constraints is significantly smaller than the dimension. The runtime of existing approaches to solving such problems is either cubic in the dimension or polynomial in the inverse of the target accuracy. Conversely, we propose a simple and efficient primal-dual approach, with a runtime that scales only linearly with the dimension, and only logarithmically in the inverse of the target accuracy. We empirically demonstrate its performance, and compare it with standard baselines.
Submitted 20 September, 2021;
originally announced September 2021.
-
Trajectory Planning Under Environmental Uncertainty With Finite-Sample Safety Guarantees
Authors:
Vasileios Lefkopoulos,
Maryam Kamgarpour
Abstract:
We tackle the problem of trajectory planning in an environment comprised of a set of obstacles with uncertain time-varying locations. The uncertainties are modeled using widely accepted Gaussian distributions, resulting in a chance-constrained program. Contrary to previous approaches however, we do not assume perfect knowledge of the moments of the distribution, and instead estimate them through finite samples available from either sensors or past data. We derive tight concentration bounds on the error of these estimates to sufficiently tighten the chance-constraint program. As such, we provide provable guarantees on satisfaction of the chance-constraints corresponding to the nominal yet unknown moments. We illustrate our results with two autonomous vehicle trajectory planning case studies.
Submitted 13 August, 2021;
originally announced August 2021.
-
Actuator Placement for Structural Controllability beyond Strong Connectivity and towards Robustness
Authors:
Baiwei Guo,
Orcun Karaca,
Sepide Azhdari,
Maryam Kamgarpour,
Giancarlo Ferrari-Trecate
Abstract:
Actuator placement is a fundamental problem in control design for large-scale networks. In this paper, we study the problem of finding a set of actuator positions by minimizing a given metric, while satisfying a structural controllability requirement and a constraint on the number of actuators. We first extend the classical forward greedy algorithm for applications to graphs that are not necessarily strongly connected. We then improve this greedy algorithm by extending its horizon. This is done by evaluating the actuator position set expansions at the further steps of the classical greedy algorithm. We prove that this new method attains a better performance, when this evaluation considers the final actuator position set. Moreover, we study the problem of minimal backup placements. The goal is to ensure that the system stays structurally controllable even when any of the selected actuators goes offline, with minimum number of backup actuators. We show that this problem is equivalent to the well-studied NP-hard hitting set problem. Our results are verified by a numerical case study.
Submitted 10 April, 2021; v1 submitted 5 April, 2021;
originally announced April 2021.
-
Multi-robot task allocation for safe planning against stochastic hazard dynamics
Authors:
Daniel Tihanyi,
Yimeng Lu,
Orcun Karaca,
Maryam Kamgarpour
Abstract:
We address multi-robot safe mission planning in uncertain dynamic environments. This problem arises in several applications including safety-critical exploration, surveillance, and emergency rescue missions. Computation of a multi-robot optimal control policy is challenging not only because of the complexity of incorporating dynamic uncertainties while planning, but also because of the exponential growth in problem size as a function of number of robots. Leveraging recent works obtaining a tractable safety maximizing plan for a single robot, we propose a scalable two-stage framework to solve the problem at hand. Specifically, the problem is split into a low-level single-agent control problem and a high-level task allocation problem. The low-level problem uses an efficient approximation of stochastic reachability for a Markov decision process to derive the optimal control policy under dynamic uncertainty. The task allocation is solved using polynomial-time forward and reverse greedy heuristics and in a distributed auction-based manner. By leveraging the properties of our safety objective function, we provide provable performance bounds on the safety of the approximate solutions proposed by these two heuristics. We evaluate the theory with extensive numerical case studies.
Submitted 13 November, 2022; v1 submitted 2 March, 2021;
originally announced March 2021.
-
Performance guarantees of forward and reverse greedy algorithms for minimizing nonsupermodular nonsubmodular functions on a matroid
Authors:
Orcun Karaca,
Daniel Tihanyi,
Maryam Kamgarpour
Abstract:
This letter studies the problem of minimizing increasing set functions, or equivalently, maximizing decreasing set functions, over the base of a matroid. This setting has received great interest, since it generalizes several applied problems including actuator and sensor placement problems in control theory, multi-robot task allocation problems, video summarization, and many others. We study two greedy heuristics, namely, the forward and the reverse greedy algorithms. We provide two novel performance guarantees for the approximate solutions obtained by these heuristics depending on both the submodularity ratio and the curvature.
Submitted 1 March, 2021;
originally announced March 2021.
-
A market-based approach for enabling inter-area reserve exchange
Authors:
Orcun Karaca,
Stefanos Delikaraoglou,
Maryam Kamgarpour
Abstract:
Considering the sequential clearing of energy and reserves in Europe, enabling inter-area reserve exchange requires optimally allocating inter-area transmission capacities between these two markets. To achieve this, we provide a market-based allocation framework and derive payments with desirable properties. The proposed min-max least core selecting payments achieve individual rationality, budget balance, and approximate incentive compatibility and coalitional stability. The results extend the works on private discrete items to a network of continuous public choices.
Submitted 17 February, 2021;
originally announced February 2021.
-
Sample Complexity of Linear Quadratic Gaussian (LQG) Control for Output Feedback Systems
Authors:
Yang Zheng,
Luca Furieri,
Maryam Kamgarpour,
Na Li
Abstract:
This paper studies a class of partially observed Linear Quadratic Gaussian (LQG) problems with unknown dynamics. We establish an end-to-end sample complexity bound on learning a robust LQG controller for open-loop stable plants. This is achieved using a robust synthesis procedure, where we first estimate a model from a single input-output trajectory of finite length, identify an H-infinity bound on the estimation error, and then design a robust controller using the estimated model and its quantified uncertainty. Our synthesis procedure leverages a recent control tool called Input-Output Parameterization (IOP) that enables robust controller design using convex optimization. For open-loop stable systems, we prove that the LQG performance degrades linearly with respect to the model estimation error using the proposed synthesis procedure. Despite the hidden states in the LQG problem, the achieved scaling matches previous results on learning Linear Quadratic Regulator (LQR) controllers with full state observations.
Submitted 25 June, 2021; v1 submitted 19 November, 2020;
originally announced November 2020.
-
Bandit Learning in Convex Non-Strictly Monotone Games
Authors:
Tatiana Tatarenko,
Maryam Kamgarpour
Abstract:
We address learning Nash equilibria in convex games under the payoff information setting. We consider the case in which the game pseudo-gradient is monotone but not necessarily strictly monotone. This relaxation of strict monotonicity enables application of learning algorithms to a larger class of games, such as a zero-sum game with a merely convex-concave cost function. We derive an algorithm whose iterates provably converge to the least-norm Nash equilibrium in this setting. From the perspective of a single player using the proposed algorithm, we view the game as an instance of online optimization. Through this lens, we quantify the regret rate of the algorithm and provide an approach to choose the algorithm's parameters to minimize the regret rate.
Submitted 16 August, 2023; v1 submitted 8 September, 2020;
originally announced September 2020.
-
Geometric Langlands for hypergeometric sheaves
Authors:
Masoud Kamgarpour,
Lingfei Yi
Abstract:
Generalised hypergeometric sheaves are rigid local systems on the punctured projective line with remarkable properties. Their study originated in the seminal work of Riemann on the Euler--Gauss hypergeometric function and has blossomed into an active field with connections to many areas of mathematics. In this paper, we construct the Hecke eigensheaves whose eigenvalues are the irreducible hypergeometric local systems, thus confirming a central conjecture of the geometric Langlands program for hypergeometrics. The key new concept is the notion of hypergeometric automorphic data. We prove that this automorphic data is generically rigid (in the sense of Zhiwei Yun) and identify the resulting Hecke eigenvalue with hypergeometric sheaves. The definition of hypergeometric automorphic data in the tame case involves the mirabolic subgroup, while in the wild case, semistable (but not necessarily stable) vectors coming from principal gradings intervene.
Submitted 18 June, 2020;
originally announced June 2020.
-
Enabling inter-area reserve exchange through stable benefit allocation mechanisms
Authors:
Orcun Karaca,
Stefanos Delikaraoglou,
Gabriela Hug,
Maryam Kamgarpour
Abstract:
The establishment of a single European day-ahead market has accomplished the integration of the regional day-ahead markets. However, the reserves provision and activation remain an exclusive responsibility of regional operators. This limited spatial coordination and the sequential market structure hinder the efficient utilization of flexible generation and transmission, since their capacities have to be ex-ante allocated between energy and reserves. To promote reserve exchange, recent work has proposed a preemptive model that defines the optimal inter-area transmission capacities for energy and reserves reducing the expected system cost. This decision-support tool, formulated as a stochastic bilevel program, respects the current architecture but does not suggest area-specific costs that guarantee sufficient incentives for all areas to accept the proposed solution. To this end, we formulate a preemptive model in a framework that allows the application of coalitional game theory methods to obtain a stable benefit allocation, i.e., an outcome immune to coalitional deviations ensuring willingness of all areas to coordinate. We show that benefit allocation mechanisms can be formulated either at the day-ahead or the real-time stages, in order to distribute the expected or the scenario-specific benefits, respectively. For both games, the proposed benefits achieve minimal stability violation, while allowing for a tractable computation with limited queries to the bilevel program. Our case studies, based on an illustrative and a more realistic test case, compare our method with well-studied benefit allocations, namely, the Shapley value and the nucleolus. We show that our method performs better in stability, tractability, and fairness, which would potentially be dictated by a criterion chosen by the regulator.
Submitted 20 December, 2019;
originally announced December 2019.
-
Log Barriers for Safe Non-convex Black-box Optimization
Authors:
Ilnura Usmanova,
Andreas Krause,
Maryam Kamgarpour
Abstract:
We address the problem of minimizing a smooth function $f^0(x)$ over a compact set $D$ defined by smooth functional constraints $f^i(x)\leq 0,~ i = 1,\ldots, m$ given noisy value measurements of $f^i(x)$. This problem arises in safety-critical applications, where certain parameters need to be adapted online in a data-driven fashion, such as in personalized medicine, robotics, manufacturing, etc. In such cases, it is important to ensure constraints are not violated while taking measurements and seeking the minimum of the cost function. We propose a new algorithm, s0-LBM, which provides provably feasible iterates with high probability and applies to the challenging case of an uncertain zeroth-order oracle. We also analyze the convergence rate of the algorithm, and empirically demonstrate its effectiveness.
Submitted 19 December, 2019;
originally announced December 2019.
-
Safe non-smooth black-box optimization with application to policy search
Authors:
Ilnura Usmanova,
Andreas Krause,
Maryam Kamgarpour
Abstract:
For safety-critical black-box optimization tasks, observations of the constraints and the objective are often noisy and available only for the feasible points. We propose an approach based on log barriers to find a local solution of a non-convex non-smooth black-box optimization problem $\min f^0(x)$ subject to $f^i(x)\leq 0,~ i = 1,\ldots, m$, at the same time, guaranteeing constraint satisfaction while learning an optimal solution with high probability. Our proposed algorithm exploits noisy observations to iteratively improve on an initial safe point until convergence. We derive the convergence rate and prove safety of our algorithm. We demonstrate its performance in an application to an iterative control design problem.
Submitted 23 February, 2021; v1 submitted 19 December, 2019;
originally announced December 2019.
-
Learning the Globally Optimal Distributed LQ Regulator
Authors:
Luca Furieri,
Yang Zheng,
Maryam Kamgarpour
Abstract:
We study model-free learning methods for the output-feedback Linear Quadratic (LQ) control problem in finite-horizon subject to subspace constraints on the control policy. Subspace constraints naturally arise in the field of distributed control and present a significant challenge in the sense that standard model-based optimization and learning lead to intractable numerical programs in general. Building upon recent results in zeroth-order optimization, we establish model-free sample-complexity bounds for the class of distributed LQ problems where a local gradient dominance constant exists on any sublevel set of the cost function. We prove that a fundamental class of distributed control problems - commonly referred to as Quadratically Invariant (QI) problems - as well as others possess this property. To the best of our knowledge, our result is the first sample-complexity bound guarantee on learning globally optimal distributed output-feedback control policies.
Submitted 30 May, 2020; v1 submitted 18 December, 2019;
originally announced December 2019.
-
Actuator Placement under Structural Controllability using Forward and Reverse Greedy Algorithms
Authors:
Baiwei Guo,
Orcun Karaca,
Tyler Summers,
Maryam Kamgarpour
Abstract:
Actuator placement is an active field of research which has received significant attention for its applications in complex dynamical networks. In this paper, we study the problem of finding a set of actuator placements minimizing the metric that measures the average energy consumed for state transfer by the controller, while satisfying a structural controllability requirement and a cardinality constraint on the number of actuators allowed. As no computationally efficient methods are known to solve such combinatorial set function optimization problems, two greedy algorithms, forward and reverse, are proposed to obtain approximate solutions. We first show that the constraint sets these algorithms explore can be characterized by matroids. We then obtain performance guarantees for the forward and reverse greedy algorithms applied to the general class of matroid optimization problems by exploiting properties of the objective function such as the submodularity ratio and the curvature. Finally, we propose feasibility check methods for both algorithms based on maximum flow problems on certain auxiliary graphs originating from the network graph. Our results are verified with case studies over large networks.
Submitted 29 October, 2020; v1 submitted 11 December, 2019;
originally announced December 2019.
-
A comment on "Performance guarantees of a greedy algorithm for minimizing a supermodular set function on comatroid"
Authors:
Orcun Karaca,
Baiwei Guo,
Maryam Kamgarpour
Abstract:
We provide a counterexample to the performance guarantee obtained in the paper "Performance guarantees of a greedy algorithm for minimizing a supermodular set function on comatroid", which was published in Volume 171 of the European Journal of Operational Research. We comment on where this error originates from in the proof of the main theorem.
Submitted 10 December, 2019;
originally announced December 2019.
-
Rigid connections on $\mathbb{P}^1$ via the Bruhat-Tits building
Authors:
Masoud Kamgarpour,
Daniel S. Sage
Abstract:
We apply the theory of fundamental strata of Bremer and Sage to find cohomologically rigid $G$-connections on the projective line, generalising the work of Frenkel and Gross. In this theory, one studies the leading term of a formal connection with respect to the Moy-Prasad filtration associated to a point in the Bruhat-Tits building. If the leading term is regular semisimple with centraliser a (not necessarily split) maximal torus $S$, then we have an $S$-toral connection. In this language, the irregular singularity of the Frenkel-Gross connection gives rise to the homogeneous toral connection of minimal slope associated to the Coxeter torus $\mathcal{C}$. In the present paper, we consider connections on $\mathbb{G}_m$ which have an irregular homogeneous $\mathcal{C}$-toral singularity at zero of slope $i/h$, where $h$ is the Coxeter number and $i$ is a positive integer coprime to $h$, and a regular singularity at infinity with unipotent monodromy. Our main result is the characterisation of all such connections which are rigid.
Submitted 19 May, 2020; v1 submitted 30 September, 2019;
originally announced October 2019.
-
System-level, Input-output and New Parameterizations of Stabilizing Controllers, and Their Numerical Computation
Authors:
Yang Zheng,
Luca Furieri,
Maryam Kamgarpour,
Na Li
Abstract:
It is known that the set of internally stabilizing controllers $\mathcal{C}_{\text{stab}}$ is non-convex, but it admits convex characterizations using certain closed-loop maps: a classical result is the Youla parameterization, and two recent notions are the system-level parameterization (SLP) and the input-output parameterization (IOP). In this paper, we address the existence of new convex parameterizations and discuss potential tradeoffs of each parameterization in different scenarios. Our main contributions are: 1) We reveal that only four groups of stable closed-loop transfer matrices are equivalent to internal stability: one of them is used in the SLP, another one is used in the IOP, and the other two are new, leading to two new convex parameterizations of $\mathcal{C}_{\text{stab}}$. 2) We investigate the properties of these parameterizations after imposing the finite impulse response (FIR) approximation, revealing that the IOP has the best ability of approximating $\mathcal{C}_{\text{stab}}$ given FIR constraints. 3) These four parameterizations require no \emph{a priori} doubly-coprime factorization of the plant, but impose a set of equality constraints. However, these equality constraints will never be satisfied exactly in floating-point arithmetic computation and/or implementation. We prove that the IOP is numerically robust for open-loop stable plants, in the sense that small mismatches in the equality constraints do not compromise the closed-loop stability; but a direct IOP implementation will fail to stabilize open-loop unstable systems in practice. The SLP is known to enjoy numerical robustness in the state feedback case; here, we show that numerical robustness of the four-block SLP controller requires case-by-case analysis even if the plant is open-loop stable.
Submitted 4 February, 2022; v1 submitted 26 September, 2019;
originally announced September 2019.
-
First Order Methods For Globally Optimal Distributed Controllers Beyond Quadratic Invariance
Authors:
Luca Furieri,
Maryam Kamgarpour
Abstract:
We study the distributed Linear Quadratic Gaussian (LQG) control problem in discrete-time and finite-horizon, where the controller depends linearly on the history of the outputs and it is required to lie in a given subspace, e.g. to possess a certain sparsity pattern. It is well-known that this problem can be solved with convex programming within the Youla domain if and only if a condition known as Quadratic Invariance (QI) holds. In this paper, we first show that given QI sparsity constraints, one can directly descend the gradient of the cost function within the domain of output-feedback controllers and converge to a global optimum. Note that convergence is guaranteed despite non-convexity of the cost function. Second, we characterize a class of Uniquely Stationary (US) problems, for which first-order methods are guaranteed to converge to a global optimum. We show that the class of US problems is strictly larger than that of strongly QI problems and that it is not included in that of QI problems. We refer to Figure 1 for details. Finally, we propose a tractable test for the US property.
Submitted 24 September, 2019;
originally announced September 2019.
-
On the Equivalence of Youla, System-level and Input-output Parameterizations
Authors:
Yang Zheng,
Luca Furieri,
Antonis Papachristodoulou,
Na Li,
Maryam Kamgarpour
Abstract:
A convex parameterization of internally stabilizing controllers is fundamental for many controller synthesis procedures. The celebrated Youla parameterization relies on a doubly-coprime factorization of the system, while the recent system-level and input-output characterizations require no doubly-coprime factorization but a set of equality constraints for achievable closed-loop responses. In this paper, we present explicit affine mappings among Youla, system-level and input-output parameterizations. Two direct implications of the affine mappings are 1) any convex problem in Youla, system level, or input-output parameters can be equivalently and convexly formulated in any other one of these frameworks, including the convex system-level synthesis (SLS); 2) the condition of quadratic invariance (QI) is sufficient and necessary for the classical distributed control problem to admit an equivalent convex reformulation in terms of Youla, system-level, or input-output parameters.
Submitted 14 July, 2019;
originally announced July 2019.
-
Sparsity Invariance for Convex Design of Distributed Controllers
Authors:
Luca Furieri,
Yang Zheng,
Antonis Papachristodoulou,
Maryam Kamgarpour
Abstract:
We address the problem of designing optimal linear time-invariant (LTI) sparse controllers for LTI systems, which corresponds to minimizing a norm of the closed-loop system subject to sparsity constraints on the controller structure. This problem is NP-hard in general and motivates the development of tractable approximations. We characterize a class of convex restrictions based on a new notion of Sparsity Invariance (SI). The underlying idea of SI is to design sparsity patterns for transfer matrices $Y(s)$ and $X(s)$ such that any corresponding controller $K(s)=Y(s)X(s)^{-1}$ exhibits the desired sparsity pattern. For sparsity constraints, the approach of SI goes beyond the notion of Quadratic Invariance (QI): 1) the SI approach always yields a convex restriction; 2) the solution via the SI approach is guaranteed to be globally optimal when QI holds and performs at least as well as considering a nearest QI subset. Moreover, the notion of SI naturally applies to designing structured static controllers, where QI cannot be used. Numerical examples show that even for non-QI cases, SI can recover solutions that are 1) globally optimal and 2) strictly better performing than previous methods.
Submitted 11 July, 2020; v1 submitted 16 June, 2019;
originally announced June 2019.
-
Using Uncertainty Data in Chance-Constrained Trajectory Planning
Authors:
Vasileios Lefkopoulos,
Maryam Kamgarpour
Abstract:
We consider the problem of trajectory planning in an environment comprised of a set of obstacles with uncertain locations. While previous approaches model the uncertainties with a prescribed Gaussian distribution, we consider the realistic case in which the distribution's moments are unknown and are learned online. We derive tight concentration bounds on the error of the estimated moments. These bounds are then used to derive a tractable and tight mixed-integer convex reformulation of the trajectory planning problem, assuming linear dynamics and polyhedral constraints. The solution of the resulting optimization program is a feasible solution for the original problem with high confidence. We illustrate the approach with a case study from autonomous driving.
Submitted 10 January, 2021; v1 submitted 29 April, 2019;
originally announced April 2019.
-
Actuator Placement for Optimizing Network Performance under Controllability Constraints
Authors:
Baiwei Guo,
Orcun Karaca,
Tyler Summers,
Maryam Kamgarpour
Abstract:
With the rising importance of large-scale network control, the problem of actuator placement has received increasing attention. Our goal in this paper is to find a set of actuators minimizing the metric that measures the average energy consumption of the control inputs while ensuring structural controllability of the network. As this problem is intractable, a greedy algorithm can be used to obtain an approximate solution. To provide a performance guarantee for this approach, we first define the submodularity ratio for the metric under consideration and then reformulate the structural controllability constraint as a matroid constraint. This shows that the problem under study can be characterized by a matroid optimization involving a weakly submodular objective function. Then, we derive a novel performance guarantee for the greedy algorithm applied to this class of optimization problems. Finally, we show that the matroid feasibility check for the greedy algorithm can be cast as a maximum matching problem in a certain auxiliary bipartite graph related to the network graph.
Submitted 19 March, 2019;
originally announced March 2019.
-
Safe Convex Learning under Uncertain Constraints
Authors:
Ilnura Usmanova,
Andreas Krause,
Maryam Kamgarpour
Abstract:
We address the problem of minimizing a convex smooth function $f(x)$ over a compact polyhedral set $D$ given a stochastic zeroth-order constraint feedback model. This problem arises in safety-critical machine learning applications, such as personalized medicine and robotics. In such cases, one needs to ensure constraints are satisfied while exploring the decision space to find the optimum of the loss function. We propose a new variant of the Frank-Wolfe algorithm, which applies to the case of uncertain linear constraints. Using robust optimization, we provide the convergence rate of the algorithm while guaranteeing feasibility of all iterates, with high probability.
Submitted 9 December, 2019; v1 submitted 11 March, 2019;
originally announced March 2019.
-
On Separable Quadratic Lyapunov Functions for Convex Design of Distributed Controllers
Authors:
Luca Furieri,
Yang Zheng,
Antonis Papachristodoulou,
Maryam Kamgarpour
Abstract:
We consider the problem of designing a stabilizing and optimal static controller with a pre-specified sparsity pattern. Since this problem is NP-hard in general, it is necessary to resort to approximation approaches. In this paper, we characterize a class of convex restrictions of this problem that are based on designing a separable quadratic Lyapunov function for the closed-loop system. This approach generalizes previous results based on optimizing over diagonal Lyapunov functions, thus allowing for improved feasibility and performance. Moreover, we suggest a simple procedure to compute favourable structures for the Lyapunov function yielding high-performance distributed controllers. Numerical examples validate our results.
Submitted 10 March, 2019;
originally announced March 2019.
-
An Input-Output Parametrization of Stabilizing Controllers: amidst Youla and System Level Synthesis
Authors:
Luca Furieri,
Yang Zheng,
Antonis Papachristodoulou,
Maryam Kamgarpour
Abstract:
This paper proposes a novel input-output parametrization of the set of internally stabilizing output-feedback controllers for linear time-invariant (LTI) systems. Our underlying idea is to directly treat the closed-loop transfer matrices from disturbances to input and output signals as design parameters and exploit their affine relationships. This input-output perspective is particularly effective when a doubly-coprime factorization is difficult to compute, or an initial stabilizing controller is challenging to find; most previous work requires one of these pre-computation steps. Instead, our approach can bypass such pre-computations, in the sense that a stabilizing controller is computed by directly solving a linear program (LP). Furthermore, we show that the proposed input-output parametrization allows for computing norm-optimal controllers subject to quadratically invariant (QI) constraints using convex programming.
Submitted 11 July, 2020; v1 submitted 9 March, 2019;
originally announced March 2019.
-
Core-Selecting Mechanisms in Electricity Markets
Authors:
Orcun Karaca,
Maryam Kamgarpour
Abstract:
Due to its theoretical virtues, several recent works propose the use of the incentive-compatible Vickrey-Clarke-Groves (VCG) mechanism for electricity markets. Coalitions of participants, however, can influence the VCG outcome to obtain higher collective profit. To address this issue, we propose core-selecting mechanisms for their coalition-proofness. We show that core-selecting mechanisms generalize the economic rationale of the locational marginal pricing (LMP) mechanism. Namely, these mechanisms are the exact class of mechanisms that ensure the existence of a competitive equilibrium in linear/nonlinear prices. This implies that the LMP mechanism is also core-selecting, and hence coalition-proof. In contrast to the LMP mechanism, core-selecting mechanisms exist for a broad class of electricity markets, such as ones involving nonconvex costs and nonconvex constraint sets. In addition, they can approximate truthfulness without the price-taking assumption of the LMP mechanism. Finally, we show that they are also budget-balanced. Our results are verified with case studies based on optimal power flow test systems and the Swiss reserve market.
Submitted 23 November, 2018;
originally announced November 2018.
-
Minimizing Regret of Bandit Online Optimization in Unconstrained Action Spaces
Authors:
Tatiana Tatarenko,
Maryam Kamgarpour
Abstract:
We consider online convex optimization with zero-order oracle feedback. In particular, the decision maker does not know the explicit representation of the time-varying cost functions, or their gradients. At each time step, she observes the value of the corresponding cost function evaluated at her chosen action (zero-order oracle). The objective is to minimize the regret, that is, the difference between the sum of the costs she accumulates and that of a static optimal action had she known the sequence of cost functions a priori. We present a novel algorithm to minimize regret in unconstrained action spaces. Our algorithm hinges on a classical idea of one-point estimation of the gradients of the cost functions based on their observed values. The algorithm is independent of problem parameters. Letting $T$ denote the number of queries of the zero-order oracle and $n$ the problem dimension, the regret rate achieved is $O(n^{2/3}T^{2/3})$. Moreover, we adapt the presented algorithm to the setting with two-point feedback and demonstrate that the adapted procedure achieves the theoretical lower bound on the regret, of order $n^{1/2}T^{1/2}$.
Submitted 2 May, 2020; v1 submitted 13 June, 2018;
originally announced June 2018.
-
On Maximizing Safety in Stochastic Aircraft Trajectory Planning with Uncertain Thunderstorm Development
Authors:
Daniel Hentzen,
Maryam Kamgarpour,
Manuel Soler,
Daniel González-Arribas
Abstract:
Dealing with meteorological uncertainty poses a major challenge in air traffic management (ATM). Convective weather (commonly referred to as storms or thunderstorms) in particular represents a significant safety hazard that is responsible for one quarter of weather-related ATM delays in the US. With commercial air traffic on the rise and the risk of potentially critical capacity bottlenecks looming, it is vital that future trajectory planning tools are able to account for meteorological uncertainty. We propose an approach to model the uncertainty inherent to forecasts of convective weather regions using statistical analysis of state-of-the-art forecast data. The developed stochastic storm model is tailored for use in an optimal control algorithm that maximizes the probability of reaching a waypoint while avoiding hazardous storm regions. Both the aircraft and the thunderstorms are modeled stochastically. The performance of the approach is illustrated and validated through simulated case studies based on recent nowcast data and storm observations.
Submitted 6 June, 2018;
originally announced June 2018.
-
Exploiting Weak Supermodularity for Coalition-Proof Mechanisms
Authors:
Orcun Karaca,
Maryam Kamgarpour
Abstract:
Under the incentive-compatible Vickrey-Clarke-Groves mechanism, coalitions of participants can influence the auction outcome to obtain higher collective profit. These manipulations were proven to be eliminated if and only if the market objective is supermodular. Nevertheless, several auctions do not satisfy the stringent conditions for supermodularity. These auctions include electricity markets, which are the main motivation of our study. To characterize nonsupermodular functions, we introduce the supermodularity ratio and the weak supermodularity. We show that these concepts provide us with tight bounds on the profitability of collusion and shill bidding. We then derive an analytical lower bound on the supermodularity ratio. Our results are verified with case studies based on the IEEE test systems.
Submitted 23 November, 2018; v1 submitted 29 March, 2018;
originally announced March 2018.
-
Scalable analysis of linear networked systems via chordal decomposition
Authors:
Yang Zheng,
Maryam Kamgarpour,
Aivar Sootla,
Antonis Papachristodoulou
Abstract:
This paper introduces a chordal decomposition approach for scalable analysis of linear networked systems, including stability, $\mathcal{H}_2$ and $\mathcal{H}_{\infty}$ performance. Our main strategy is to exploit any sparsity within these analysis problems and use chordal decomposition. We first show that Grone's and Agler's theorems can be generalized to block matrices with any partition. This facilitates networked systems analysis, allowing one to solely focus on the physical connections of networked systems to exploit scalability. Then, by choosing Lyapunov functions with appropriate sparsity patterns, we decompose large positive semidefinite constraints in all of the analysis problems into multiple smaller ones depending on the maximal cliques of the system graph. This makes the solutions more computationally efficient via a recent first-order algorithm. Numerical experiments demonstrate the efficiency and scalability of the proposed method.
Submitted 15 March, 2018;
originally announced March 2018.
-
Robust Distributed Control Beyond Quadratic Invariance
Authors:
Luca Furieri,
Maryam Kamgarpour
Abstract:
The problem of robust distributed control arises in several large-scale systems, such as transportation networks and power grid systems. In many practical scenarios controllers might not have enough information to make globally optimal decisions in a tractable way. We propose a novel class of tractable optimization problems whose solution is a controller complying with any specified information structure. The approach we suggest is based on decomposing intractable information constraints into two subspace constraints in the disturbance feedback domain. We discuss how to perform the decomposition in an optimized way. The resulting control policy is globally optimal when a condition known as Quadratic Invariance (QI) holds, whereas it is feasible and it provides a provable upper bound on the minimum cost when QI does not hold. Finally, we show that our method can lead to improved performance guarantees with respect to previous approaches, by applying the developed techniques to the platooning of autonomous vehicles.
Submitted 9 March, 2019; v1 submitted 14 March, 2018;
originally announced March 2018.
-
From Uncertainty Data to Robust Policies for Temporal Logic Planning
Authors:
Pier Giuseppe Sessa,
Damian Frick,
Tony A. Wood,
Maryam Kamgarpour
Abstract:
We consider the problem of synthesizing robust disturbance feedback policies for systems performing complex tasks. We formulate the tasks as linear temporal logic specifications and encode them into an optimization framework via mixed-integer constraints. Both the system dynamics and the specifications are known but affected by uncertainty. The distribution of the uncertainty is unknown; however, realizations can be obtained. We introduce a data-driven approach where the constraints are fulfilled for a set of realizations and provide probabilistic generalization guarantees as a function of the number of considered realizations. We use separate chance constraints for the satisfaction of the specification and operational constraints. This allows us to quantify their violation probabilities independently. We compute disturbance feedback policies as solutions of mixed-integer linear or quadratic optimization problems. By using feedback we can exploit information of past realizations and provide feasibility for a wider range of situations compared to static input sequences. We demonstrate the proposed method on two robust motion-planning case studies for autonomous driving.
Submitted 27 August, 2018; v1 submitted 11 January, 2018;
originally announced January 2018.
-
Exploiting structure of chance constrained programs via submodularity
Authors:
Damian Frick,
Pier Giuseppe Sessa,
Tony A. Wood,
Maryam Kamgarpour
Abstract:
We introduce a novel approach to reduce the computational effort of solving mixed-integer convex chance constrained programs through the scenario approach. Instead of reducing the number of required scenarios, we directly minimize the computational cost of the scenario program. We exploit the problem structure by efficiently partitioning the constraint function and considering a multiple chance constrained program that gives the same probabilistic guarantees as the original single chance constrained problem. We formulate the problem of finding the optimal partition, a partition achieving the lowest computational cost, as an optimization problem with nonlinear objective and combinatorial constraints. By using submodularity of the support rank of a set of constraints, we propose a polynomial-time algorithm to find suboptimal solutions to this partitioning problem and we give approximation guarantees for special classes of cost metrics. We illustrate that the resulting computational cost savings can be arbitrarily large and demonstrate our approach on two case studies from production and multi-agent planning.
Submitted 18 September, 2018; v1 submitted 10 January, 2018;
originally announced January 2018.
-
Performance guarantees for greedy maximization of non-submodular controllability metrics
Authors:
Tyler Summers,
Maryam Kamgarpour
Abstract:
A key problem in emerging complex cyber-physical networks is the design of information and control topologies, including sensor and actuator selection and communication network design. These problems can be posed as combinatorial set function optimization problems to maximize a dynamic performance metric for the network. Some systems and control metrics feature a property called submodularity, which allows simple greedy algorithms to obtain provably near-optimal topology designs. However, many important metrics lack submodularity and therefore lack provable guarantees for using a greedy optimization approach. Here we show that performance guarantees can be obtained for greedy maximization of certain non-submodular functions of the controllability and observability Gramians. Our results are based on two key quantities: the submodularity ratio, which quantifies how far a set function is from being submodular, and the curvature, which quantifies how far a set function is from being supermodular.
Submitted 19 November, 2018; v1 submitted 11 December, 2017;
originally announced December 2017.