
Best Response Sequences and Tradeoffs in Submodular Resource Allocation Games

Authors: Rohit Konda, Rahul Chandan, David Grimsman, Jason R. Marden

Abstract: Deriving competitive, distributed solutions to multi-agent problems is crucial for many developing application domains; game theory has emerged as a useful framework to design such algorithms. However, much of the attention within this framework is on the study of equilibrium behavior, whereas transient behavior is often ignored. Therefore, in this paper we study the transient efficiency guarantees of best response processes in the context of submodular resource allocation games, which find application in various engineering contexts. Specifically, the main focus of this paper is on characterizing the optimal short-term system-level behavior under the best-response process. Interestingly, the resulting transient performance guarantees are relatively close to the optimal asymptotic performance guarantees. Furthermore, we characterize the trade-offs that result when optimizing for both asymptotic and transient efficiency through various utility designs.

Submitted 30 May, 2024; originally announced June 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:2204.10364

arXiv:2405.20433 [pdf, other]

Efficient Industrial Refrigeration Scheduling with Peak Pricing

Authors: Rohit Konda, Jordan Prescott, Vikas Chandan, Jesse Crossno, Blake Pollard, Dan Walsh, Rick Bohonek, Jason R. Marden

Abstract: The widespread use of industrial refrigeration systems across various sectors contributes significantly to global energy consumption, highlighting substantial opportunities for energy conservation through intelligent control design. As such, this work focuses on control algorithm design in industrial refrigeration that minimizes operational costs and provides efficient heat extraction. By adopting tools from inventory control, we characterize the structure of these optimal control policies, exploring the impact of different energy cost-rate structures such as time-of-use (TOU) pricing and peak pricing. While classical threshold policies are optimal under TOU costs, introducing peak pricing challenges their optimality, emphasizing the need for carefully designed control strategies in the presence of significant peak costs. We provide theoretical findings and simulation studies on this phenomenon, offering insights for more efficient industrial refrigeration management.

Submitted 30 May, 2024; originally announced May 2024.

arXiv:2403.07831 [pdf, other]

Utilizing Load Shifting for Optimal Compressor Sequencing in Industrial Refrigeration

Authors: Rohit Konda, Vikas Chandan, Jesse Crossno, Blake Pollard, Dan Walsh, Rick Bohonek, Jason R. Marden

Abstract: The ubiquity and energy needs of industrial refrigeration have prompted several research studies investigating various control opportunities for reducing energy demand. This work focuses on one such opportunity, termed compressor sequencing, which entails intelligently selecting the operational state of the compressors to service the required refrigeration load with the least possible work. We first study the static compressor sequencing problem and observe that deriving the optimal compressor operational state is computationally challenging and can vary dramatically based on the refrigeration load. Thus, we introduce load shifting in conjunction with compressor sequencing, which entails strategically precooling the facility to allow for more efficient compressor operation. Interestingly, we show that load shifting not only provides benefits in computing the optimal compressor operational state, but also can lead to significant energy savings. Our results are based on and compared to real-world sensor data from an operating industrial refrigeration site of Butterball LLC located in Huntsville, AR, which demonstrated that without load shifting, even optimal compressor operation results in compressors often running at intermediate capacity levels, which can lead to inefficiencies. Through collected data, we demonstrate that a load shifting approach for compressor sequencing has the potential to reduce energy use of the compressors by up to 20% compared to optimal sequencing without load shifting.

Submitted 12 March, 2024; originally announced March 2024.

arXiv:2311.01379 [pdf, other]

Collaborative Decision-Making and the k-Strong Price of Anarchy in Common Interest Games

Authors: Bryce L. Ferguson, Dario Paccagnan, Bary S. R. Pradelski, Jason R. Marden

Abstract: The control of large-scale, multi-agent systems often entails distributing decision-making across the system components. However, with advances in communication and computation technologies, we can consider new collaborative decision-making paradigms that exist somewhere between centralized and distributed control. In this work, we seek to understand the benefits and costs of increased collaborative communication in multi-agent systems. We specifically study this in the context of common interest games in which groups of up to k agents can coordinate their actions in maximizing the common objective function. The equilibria that emerge in these systems are the k-strong Nash equilibria of the common interest game; studying the properties of these states can provide relevant insights into the efficacy of inter-agent collaboration. Our contributions are threefold: we 1) provide bounds on how well k-strong Nash equilibria approximate the optimal system welfare, formalized by the k-strong price of anarchy, 2) study the run-time and transient performance of collaborative agent-based dynamics, and 3) consider the task of redesigning objectives for groups of agents to improve system performance. We study these three facets generally as well as in the context of resource allocation problems, in which we provide tractable linear programs that give tight bounds on the k-strong price of anarchy.

Submitted 2 November, 2023; originally announced November 2023.

Comments: arXiv admin note: text overlap with arXiv:2308.08045

arXiv:2308.14714 [pdf, other]

A Stochastic Surveillance Stackelberg Game: Co-Optimizing Defense Placement and Patrol Strategy

Authors: Yohan John, Gilberto Diaz-Garcia, Xiaoming Duan, Jason R. Marden, Francesco Bullo

Abstract: Stochastic patrol routing is known to be advantageous in adversarial settings; however, the optimal choice of stochastic routing strategy is dependent on a model of the adversary. We adopt a worst-case omniscient adversary model from the literature and extend the formulation to accommodate heterogeneous defenses at the various nodes of the graph. Introducing this heterogeneity leads to interesting new patrol strategies. We identify efficient methods for computing these strategies in certain classes of graphs. We assess the effectiveness of these strategies via comparison to an upper bound on the value of the game. Finally, we leverage the heterogeneous defense formulation to develop novel defense placement algorithms that complement the patrol strategies.

Submitted 20 February, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

Comments: 9 pages, 1 figure, submitted as a technical note to the IEEE Transactions on Automatic Control. Replaced to fix inaccuracies

arXiv:2308.14299 [pdf, other]

Reinforcement Strategies in General Lotto Games

Authors: Keith Paarporn, Rahul Chandan, Mahnoosh Alizadeh, Jason R. Marden

Abstract: Strategic decisions are often made over multiple periods of time, wherein decisions made earlier impact a competitor's success in later stages. In this paper, we study these dynamics in General Lotto games, a class of models describing the competitive allocation of resources between two opposing players. We propose a two-stage formulation where one of the players has reserved resources that can be strategically pre-allocated across the battlefields in the first stage of the game as reinforcements. The players then simultaneously allocate their remaining real-time resources, which can be randomized, in a decisive final stage. Our main contributions provide complete characterizations of the optimal reinforcement strategies and resulting equilibrium payoffs in these multi-stage General Lotto games. Interestingly, we determine that real-time resources are at least twice as effective as reinforcement resources when considering equilibrium payoffs.

Submitted 28 August, 2023; originally announced August 2023.

Comments: 14 pages. arXiv admin note: text overlap with arXiv:2209.06090

arXiv:2308.08045 [pdf, other]

Collaborative Coalitions in Multi-Agent Systems: Quantifying the Strong Price of Anarchy for Resource Allocation Games

Authors: Bryce L. Ferguson, Dario Paccagnan, Bary S. R. Pradelski, Jason R. Marden

Abstract: The emergence of new communication technologies allows us to expand our understanding of distributed control and consider collaborative decision-making paradigms. With collaborative algorithms, certain local decision-making entities (or agents) are enabled to communicate and collaborate on their actions with one another to attain better system behavior. By limiting the amount of communication, these algorithms exist somewhere between centralized and fully distributed approaches. To understand the possible benefits of this inter-agent collaboration, we model a multi-agent system as a common-interest game in which groups of agents can collaborate on their actions to jointly increase the system welfare. We specifically consider $k$-strong Nash equilibria as the emergent behavior of these systems and address how well these states approximate the system optimal, formalized by the $k$-strong price of anarchy ratio. Our main contributions are in generating tight bounds on the $k$-strong price of anarchy in finite resource allocation games as the solution to a tractable linear program. By varying $k$, the maximum size of a collaborative coalition, we observe exactly how much performance is gained from inter-agent collaboration. To investigate further opportunities for improvement, we generate upper bounds on the maximum attainable $k$-strong price of anarchy when the agents' utility function can be designed.

Submitted 15 August, 2023; originally announced August 2023.

arXiv:2306.12603 [pdf, other]

The Cost of Informing Decision-Makers in Multi-Agent Maximum Coverage Problems with Random Resource Values

Authors: Bryce L. Ferguson, Dario Paccagnan, Jason R. Marden

Abstract: The emergent behavior of a distributed system is conditioned by the information available to the local decision-makers. Therefore, one may expect that providing decision-makers with more information will improve system performance; in this work, we find that this is not necessarily the case. In multi-agent maximum coverage problems, we find that even when agents' objectives are aligned with the global welfare, informing agents about the realization of the resource's random values can reduce equilibrium performance by a factor of 1/2. This affirms an important aspect of designing distributed systems: information needs to be shared carefully. We further this understanding by providing lower and upper bounds on the ratio of system welfare when information is (fully or partially) revealed and when it is not, termed the value-of-informing. We then identify a trade-off that emerges when optimizing the performance of the best-case and worst-case equilibrium.

Submitted 21 June, 2023; originally announced June 2023.

Comments: To appear: LCSS

arXiv:2304.02068 [pdf, other]

Battlefield transfers in coalitional Blotto games

Authors: Vade Shah, Jason R. Marden

Abstract: In this work, we consider the framework of coalitional Blotto games in which two players compete against a common adversary by allocating their budgeted resources across disjoint sets of valued battlefields; the agent that allocates a higher amount wins the corresponding battlefield value. At the beginning of the game, the budgets of the agents and the values of the battlefields are specified. In the first stage, the players are allowed to perform a battlefield transfer in which one player offloads a number of its battlefields onto the other player. In the second stage, the adversary observes this transfer and determines how to allocate their budget accordingly. Finally, in the third stage, the players and the adversary allocate their budgets to their battlefields, the game is played, and their payoffs are realized. We provide necessary and sufficient conditions for the existence of a battlefield transfer that strictly increases the payoff of each player. We then augment the model, allowing players to not only transfer subsets of battlefields, but also portions of their budget, in the first stage. We also provide sufficient conditions for the existence of a joint transfer of battlefields and budgets. The results demonstrate that in almost all game instances, both players would benefit from such a joint transfer.

Submitted 4 April, 2023; originally announced April 2023.

arXiv:2301.02646 [pdf, other]

Trajectories for the Optimal Collection of Information

Authors: Matthew R. Kirchner, David Grimsman, Joao P. Hespanha, Jason R. Marden

Abstract: We study a scenario where an aircraft has multiple heterogeneous sensors collecting measurements to track a target vehicle of unknown location. The measurements are sampled along the flight path and our goal is to optimize sensor placement to minimize estimation error. We select as a metric the Fisher Information Matrix (FIM), as "minimizing" the inverse of the FIM is required to achieve small estimation error. We propose to generate the optimal path from the Hamilton-Jacobi (HJ) partial differential equation (PDE) as it is the necessary and sufficient condition for optimality. A traditional method of lines (MOL) approach, based on a spatial grid, lends itself well to the highly non-linear and non-convex structure of the problem induced by the FIM. However, the sensor placement problem results in a state space dimension that renders a naive MOL approach intractable. We present a new hybrid approach, whereby we decompose the state space into two parts: a smaller subspace that still uses a grid and takes advantage of the robustness to non-linearities and non-convexities, and the remaining state space that can be found efficiently from a system of ODEs, avoiding formation of a spatial grid.

Submitted 12 January, 2023; v1 submitted 6 January, 2023; originally announced January 2023.

Comments: Updated to comply with conference formatting. To appear in the proceedings of the 2023 IEEE Aerospace Conference

Report number: UCLA CAM 23-01

arXiv:2209.07055 [pdf, other]

Valid Utility Games with Information Sharing Constraints

Authors: David Grimsman, Philip N. Brown, Jason R. Marden

Abstract: The use of game theoretic methods for control in multiagent systems has been an important topic in recent research. Valid utility games in particular have been used to model real-world problems; such games have the convenient property that the value of any decision set which is a Nash equilibrium of the game is guaranteed to be within 1/2 of the value of the optimal decision set. However, an implicit assumption in this guarantee is that each agent is aware of the decisions of all other agents. In this work, we first describe how this guarantee degrades as agents are only aware of a subset of the decisions of other agents. We then show that this loss can be mitigated by restriction to a relevant subclass of games.

Submitted 15 September, 2022; originally announced September 2022.

arXiv:2209.06090 [pdf, other]

Strategic investments in multi-stage General Lotto games

Authors: Rahul Chandan, Keith Paarporn, Mahnoosh Alizadeh, Jason R. Marden

Abstract: In adversarial interactions, one is often required to make strategic decisions over multiple periods of time, wherein decisions made earlier impact a player's competitive standing as well as how choices are made in later stages. In this paper, we study such scenarios in the context of General Lotto games, which model the competitive allocation of resources over multiple battlefields between two players. We propose a two-stage formulation where one of the players has reserved resources that can be strategically pre-allocated across the battlefields in the first stage. The pre-allocation then becomes binding and is revealed to the other player. In the second stage, the players engage by simultaneously allocating their real-time resources against each other. The main contribution in this paper provides complete characterizations of equilibrium payoffs in the two-stage game, revealing the interplay between performance and the amount of resources expended in each stage of the game. We find that real-time resources are at least twice as effective as pre-allocated resources. We then determine the player's optimal investment when there are linear costs associated with purchasing each type of resource before play begins, and there is a limited monetary budget.

Submitted 13 September, 2022; originally announced September 2022.

Comments: 8 pages, 2 figures, to appear in proceedings of the 2022 IEEE Conference on Decision and Control

arXiv:2204.10364 [pdf, other]

Optimal Utility Design of Greedy Algorithms in Resource Allocation Games

Authors: Rohit Konda, Rahul Chandan, David Grimsman, Jason R. Marden

Abstract: Designing distributed algorithms for multi-agent problems is vital for many emerging application domains, and game-theoretic approaches are emerging as a useful paradigm to design such algorithms. However, much of the emphasis of the game-theoretic approach is on the study of equilibrium behavior, whereas transient behavior is often less explored. Therefore, in this paper we study the transient efficiency guarantees of best response processes in the context of resource-allocation games, which are used to model a variety of engineering applications. Specifically, the main focus of the paper is on designing utility functions of agents to induce optimal short-term system-level behavior under a best-response process. Interestingly, the resulting transient performance guarantees are relatively close to the optimal asymptotic performance guarantees. Furthermore, we characterize a trade-off that results when optimizing for both asymptotic and transient efficiency through various utility designs.

Submitted 2 May, 2024; v1 submitted 21 April, 2022; originally announced April 2022.

Comments: This is an outdated arxiv version, please refer to the TAC version for the complete work

arXiv:2204.06046 [pdf, other]

doi 10.1109/CDC51059.2022.9992777

Avoiding Unintended Consequences: How Incentives Aid Information Provisioning in Bayesian Congestion Games

Authors: Bryce L. Ferguson, Philip N. Brown, Jason R. Marden

Abstract: When users lack specific knowledge of various system parameters, their uncertainty may lead them to make undesirable deviations in their decision making. To alleviate this, an informed system operator may elect to signal information to uninformed users with the hope of persuading them to take more preferable actions. In this work, we study public and truthful signalling mechanisms in the context of Bayesian congestion games on parallel networks. We provide bounds on the possible benefit a signalling policy can provide with and without the concurrent use of monetary incentives. We find that though revealing information can reduce system cost in some settings, it can also be detrimental and cause worse performance than not signalling at all. However, by utilizing both signalling and incentive mechanisms, the system operator can guarantee that revealing information does not worsen performance while offering similar opportunities for improvement. These findings emerge from the closed form bounds we derive on the benefit a signalling policy can provide. We provide a numerical example which illustrates the phenomenon that revealing more information can degrade performance when incentives are not used and improves performance when incentives are used.

Submitted 30 March, 2023; v1 submitted 12 April, 2022; originally announced April 2022.

arXiv:2204.04176 [pdf, other]

Path Defense in Dynamic Defender-Attacker Blotto Games (dDAB) with Limited Information

Authors: Austin K. Chen, Bryce L. Ferguson, Daigo Shishika, Michael Dorothy, Jason R. Marden, George J. Pappas, Vijay Kumar

Abstract: We consider a path guarding problem in dynamic Defender-Attacker Blotto games (dDAB), where a team of robots must defend a path in a graph against adversarial agents. Multi-robot systems are particularly well suited to this application, as recent work has shown the effectiveness of these systems in related areas such as perimeter defense and surveillance. When designing a defender policy that guarantees the defense of a path, information about the adversary and the environment can be helpful and may reduce the number of resources required by the defender to achieve a sufficient level of security. In this work, we characterize the necessary and sufficient number of assets needed to guarantee the defense of a shortest path between two nodes in dDAB games when the defender can only detect assets within $k$-hops of a shortest path. By characterizing the relationship between sensing horizon and required resources, we show that increasing the sensing capability of the defender greatly reduces the number of defender assets needed to defend the path.

Submitted 25 May, 2023; v1 submitted 8 April, 2022; originally announced April 2022.

arXiv:2110.12099 [pdf, other]

doi 10.1109/TAC.2024.3367651

Strategically revealing intentions in General Lotto games

Authors: Keith Paarporn, Rahul Chandan, Dan Kovenock, Mahnoosh Alizadeh, Jason R. Marden

Abstract: Strategic decision-making in uncertain and adversarial environments is crucial for the security of modern systems and infrastructures. A salient feature of many optimal decision-making policies is a level of unpredictability, or randomness, which helps to keep an adversary uncertain about the system's behavior. This paper seeks to explore decision-making policies on the other end of the spectrum: namely, whether there are benefits in revealing one's strategic intentions to an opponent before engaging in competition. We study these scenarios in a well-studied model of competitive resource allocation known as General Lotto games. In the classic formulation, two competing players simultaneously allocate their assets to a set of battlefields, and the resulting payoffs are derived in a zero-sum fashion. Here, we consider a multi-step extension where one of the players has the option to publicly pre-commit assets in a binding fashion to battlefields before play begins. In response, the opponent decides which of these battlefields to secure (or abandon) by matching the pre-commitment with its own assets. They then engage in a General Lotto game over the remaining set of battlefields. Interestingly, this paper highlights many scenarios where strategically revealing intentions can actually significantly improve one's payoff. This runs contrary to the conventional wisdom that randomness should be a central component of decision-making in adversarial environments.

Submitted 3 December, 2021; v1 submitted 22 October, 2021; originally announced October 2021.

Comments: 12 pages

arXiv:2107.06331 [pdf, other]

The Unintended Consequences of Minimizing the Price of Anarchy in Congestion Games

Authors: Rahul Chandan, Dario Paccagnan, Jason R. Marden

Abstract: This work focuses on the design of taxes in atomic congestion games, a commonly studied model for competitive resource sharing. While most related studies focus on optimizing either the worst- or best-case performance (i.e., Price of Anarchy (PoA) or Price of Stability (PoS)), we investigate whether optimizing for the PoA has consequences on the PoS. Perhaps surprisingly, our results reveal a fundamental trade-off between the two performance metrics. Our main result demonstrates that the taxation rule that optimizes the PoA inherits a matching PoS, implying that the best outcome is no better than the worst outcome under such a design choice. We then study this trade-off in terms of the Pareto frontier between the PoA and PoS. Our results also establish that any taxes with PoS equal to 1 incur a much higher PoA, and that, in several well-studied cases, the untaxed setting lies strictly above the Pareto frontier.

Submitted 12 April, 2023; v1 submitted 13 July, 2021; originally announced July 2021.

Comments: 34 pages, 3 figures, 1 table

arXiv:2106.12133 [pdf, other]

A General Lotto game with asymmetric budget uncertainty

Authors: Keith Paarporn, Rahul Chandan, Mahnoosh Alizadeh, Jason R. Marden

Abstract: The General Lotto game is a popular variant of the famous Colonel Blotto game, in which two opposing players allocate limited resources over many battlefields. In this paper, we consider incomplete and asymmetric information formulations regarding the resource budgets of the players. In particular, one player's resource budget is common knowledge while the other player's is private. We provide complete equilibrium characterizations in the scenario where the private resource budget is drawn from an arbitrary Bernoulli distribution. We then show that these characterizations can be used to analyze a multi-stage resource assignment problem where a commander must decide how to assign resources to sub-colonels that compete against opponents in separate General Lotto games. While optimal deterministic assignments have been characterized in the literature, we broaden the context by deriving optimal (Bernoulli) randomized assignments, which induce asymmetric information General Lotto games to be played. We demonstrate that randomizing can offer a four-fold improvement in the commander's performance over deterministic assignments.

Submitted 14 October, 2022; v1 submitted 22 June, 2021; originally announced June 2021.

arXiv:2106.04031 [pdf, other]

Balancing Asymptotic and Transient Efficiency Guarantees in Set Covering Games

Authors: Rohit Konda, Rahul Chandan, David Grimsman, Jason R. Marden

Abstract: Game theoretic approaches have gained traction as robust methodologies for designing distributed local algorithms that induce a desired overall system configuration in multi-agent settings. However, much of the emphasis in these approaches is on providing asymptotic guarantees on the performance of a network of agents, and there is a gap in the study of efficiency guarantees along transients of these distributed algorithms. Therefore, in this paper, we study the transient efficiency guarantees of a natural game-theoretic algorithm in the class of set covering games, which have been used to model a variety of applications. Our main results characterize the optimal utility design that maximizes the guaranteed efficiency along the transient of the natural dynamics. Furthermore, we characterize the Pareto-optimal frontier with regard to guaranteed efficiency in the transient and the asymptote under a class of game-theoretic designs. Surprisingly, we show that there exists an extreme trade-off between the long-term and short-term guarantees in that an asymptotically optimal game-theoretic design can perform arbitrarily badly in the transient.

Submitted 2 November, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

arXiv:2106.04029 [pdf, other]

Mission Level Uncertainty in Multi-Agent Resource Allocation

Authors: Rohit Konda, Rahul Chandan, Jason R. Marden

Abstract: In recent years, a significant research effort has been devoted to the design of distributed protocols for the control of multi-agent systems, as the scale and limited communication bandwidth characteristic of such systems render centralized control impossible. Given the strict operating conditions, it is unlikely that every agent in a multi-agent system will have local information that is consistent with the true system state. Yet, the majority of works in the literature assume that agents share perfect knowledge of their environment. This paper focuses on understanding the impact that inconsistencies in agents' local information can have on the performance of multi-agent systems. More specifically, we consider the design of multi-agent operations under a game theoretic lens where individual agents are assigned utilities that guide their local decision making. We provide a tractable procedure for designing utilities that optimize the efficiency of the resulting collective behavior (i.e., price of anarchy) for classes of set covering games where the extent of the information inconsistencies is known. In the setting where the extent of the informational inconsistencies is not known, we show -- perhaps surprisingly -- that underestimating the level of uncertainty leads to a better price of anarchy than overestimating it.

Submitted 7 June, 2021; originally announced June 2021.

arXiv:2103.16688 [pdf, other]

The Division of Assets in Multiagent Systems: A Case Study in Team Blotto Games

Authors: Keith Paarporn, Rahul Chandan, Mahnoosh Alizadeh, Jason R. Marden

Abstract: Multi-agent systems are designed to concurrently accomplish a diverse set of tasks at unprecedented scale. Here, the central problems faced by a system operator are to decide (i) how to divide available resources amongst the agents assigned to tasks and (ii) how to coordinate the behavior of the agents to optimize the efficiency of the resulting collective behavior. The focus of this paper is on problem (i), where we seek to characterize the impact of the division of resources on the best-case efficiency of the resulting collective behavior. Specifically, we focus on a team Colonel Blotto game where there are two sub-colonels competing against a common adversary in a two-battlefield environment. Here, each sub-colonel is assigned a given resource budget and is required to allocate these resources independently of the other sub-colonel. However, their success is dependent on the allocation strategy of both sub-colonels. The central focus of this manuscript is on how to divide a common pool of resources between the two sub-colonels to optimize the resulting best-case efficiency guarantees. Intuitively, one would imagine that the more balanced the division of resources, the worse the performance, as such divisions restrict the sub-colonels' ability to employ joint randomized strategies that tend to be necessary for optimizing performance guarantees. However, the main result of this paper demonstrates that this intuition is actually incorrect. A more balanced division of resources can offer better performance guarantees than a more centralized division. Hence, this paper demonstrates that the resource division problem is highly non-trivial in such enmeshed environments and worthy of significant future research efforts.

Submitted 30 March, 2021; originally announced March 2021.

Comments: 7 pages, 2 figures

arXiv:2102.09655 [pdf, other]

doi 10.1109/TAC.2021.3088412

The Effectiveness of Subsidies and Tolls in Congestion Games

Authors: Bryce L. Ferguson, Philip N. Brown, Jason R. Marden

Abstract: Are rewards or penalties more effective in influencing user behavior? This work compares the effectiveness of subsidies and tolls in incentivizing user behavior in congestion games. The predominantly studied method of influencing user behavior in network routing problems is to institute taxes which alter users' observed costs in a manner that causes their self-interested choices to more closely align with a system-level objective. Another conceivable method to accomplish the same goal is to subsidize the users' actions that are preferable from a system-level perspective. We show that, when users behave similarly and predictably, subsidies offer superior performance guarantees to tolls under similar budgetary constraints; however, in the presence of unknown player heterogeneity, subsidies fail to offer the same robustness as tolls.

Submitted 18 February, 2021; originally announced February 2021.

Comments: arXiv admin note: substantial text overlap with arXiv:1910.02343

arXiv:2102.04542 [pdf, other]

Tractable mechanisms for computing near-optimal utility functions

Authors: Rahul Chandan, Dario Paccagnan, Jason R. Marden

Abstract: Large-scale multiagent systems must rely on distributed decision making, as centralized coordination is either impractical or impossible. Recent works approach this problem under a game theoretic lens, whereby utility functions are assigned to each of the agents with the hope that their local optimization approximates the centralized optimal solution. Yet, formal guarantees on the resulting performance cannot be obtained for broad classes of problems without compromising on their accuracy. In this work, we address this concern relative to the well-studied problem of resource allocation with nondecreasing concave welfare functions. We show that optimally designed local utilities achieve an approximation ratio (price of anarchy) of 1-c/e, where c is the function's curvature and e is Euler's number. The upshot of our contributions is the design of approximation algorithms that are distributed and efficient, and whose performance matches that of the best existing polynomial-time (and centralized) schemes.

Submitted 8 February, 2021; originally announced February 2021.

Comments: 13 pages, 3 figures, to appear in Proceedings of the 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2021)
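
As a quick numerical reading of the 1-c/e guarantee quoted in the abstract above, the following sketch evaluates it for a few curvature values. The function name `optimal_poa` and the sample curvatures are illustrative assumptions and do not come from the paper; the curvature c is assumed to lie in [0, 1].

```python
import math


def optimal_poa(c: float) -> float:
    """Evaluate 1 - c/e, the approximation ratio quoted in the abstract.

    Hypothetical helper: the name and the range check are assumptions.
    """
    if not 0.0 <= c <= 1.0:
        raise ValueError("curvature is assumed to lie in [0, 1]")
    return 1.0 - c / math.e


# Print the guarantee for a few sample curvatures (chosen for illustration).
for c in (0.0, 0.5, 1.0):
    print(f"c = {c:.1f}: guarantee = {optimal_poa(c):.3f}")
```

At c = 1 the expression reduces to 1 - 1/e, roughly 0.632; at c = 0 it equals 1.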

arXiv:2009.05018 [pdf, other]

The Cost of Denied Observation in Multiagent Submodular Optimization

Authors: David Grimsman, Joshua H. Seaton, Jason R. Marden, Philip N. Brown

Abstract: A popular formalism for multiagent control applies tools from game theory, casting a multiagent decision problem as a cooperation-style game in which individual agents make local choices to optimize their own local utility functions in response to the observable choices made by other agents. When the system-level objective is submodular maximization, it is known that if every agent can observe the action choice of all other agents, then all Nash equilibria of a large class of resulting games are within a factor of $2$ of optimal; that is, the price of anarchy is $1/2$. However, little is known if agents cannot observe the action choices of other relevant agents. To study this, we extend the standard game-theoretic model to one in which a subset of agents either become \emph{blind} (unable to observe others' choices) or \emph{isolated} (blind, and also invisible to other agents), and we prove exact expressions for the price of anarchy as a function of the number of compromised agents. When $k$ agents are compromised (in any combination of blind or isolated), we show that the price of anarchy for a large class of utility functions is exactly $1/(2+k)$. We then show that if agents use marginal-cost utility functions and at least $1$ of the compromised agents is blind (rather than isolated), the price of anarchy improves to $1/(1+k)$. We also provide simulation results demonstrating the effects of these observation denials in a dynamic setting.

Submitted 24 September, 2020; v1 submitted 10 September, 2020; originally announced September 2020.
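
To make the two expressions in the abstract above concrete, the sketch below tabulates $1/(2+k)$ and $1/(1+k)$ for small $k$. The function name `poa_compromised` and its flag are hypothetical, introduced only for illustration; the formulas themselves are the ones stated in the abstract.

```python
from fractions import Fraction


def poa_compromised(k: int, marginal_cost_with_blind: bool = False) -> Fraction:
    """Price of anarchy with k compromised agents, as quoted in the abstract.

    Hypothetical helper. With marginal_cost_with_blind=True it returns
    1/(1+k), which the abstract states for marginal-cost utilities when at
    least one compromised agent is blind; otherwise it returns 1/(2+k).
    """
    if k < 0:
        raise ValueError("k must be non-negative")
    if marginal_cost_with_blind:
        if k < 1:
            raise ValueError("this case needs at least one compromised agent")
        return Fraction(1, 1 + k)
    return Fraction(1, 2 + k)


# Compare the two guarantees for a few values of k.
for k in range(1, 4):
    print(k, poa_compromised(k), poa_compromised(k, marginal_cost_with_blind=True))
```

Setting k = 0 in the first expression gives 1/2, which agrees with the full-observation price of anarchy mentioned in the same abstract.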

arXiv:2004.03050 [pdf, other]

doi 10.1109/TCNS.2022.3187078

The Impact of Message Passing in Agent-Based Submodular Maximization

Authors: David Grimsman, Matthew R. Kirchner, João P. Hespanha, Jason R. Marden

Abstract: This paper considers a set of sensors, which as a group are tasked with taking measurements of the environment and sending a small subset of the measurements to a centralized data fusion center, where the measurements will be used to estimate the overall state of the environment. The sensors' goal is to send the most informative set of measurements so that the estimate is as accurate as possible. This problem is formulated as a submodular maximization problem, for which there exists a well-studied greedy algorithm, where each sensor sequentially chooses a set of measurements from its own local set, and communicates its decision to the future sensors in the sequence. In this work, sensors can additionally share measurements with one another, in order to augment the decision set of each sensor. We explore how this increase in communication can be exploited to improve the results of the nominal greedy algorithm. Specifically, we show that this measurement passing can improve the quality of the resulting measurement set by up to a factor of $n+1$, where $n$ is the number of sensors.

Submitted 7 July, 2022; v1 submitted 6 April, 2020; originally announced April 2020.

Journal ref: IEEE Transactions on Control of Network Systems, 2022

arXiv:2003.05882 [pdf, other]

Stackelberg Equilibria for Two-Player Network Routing Games on Parallel Networks

Authors: David Grimsman, Joao P Hespanha, Jason R Marden

Abstract: We consider a two-player zero-sum network routing game in which a router wants to maximize the amount of legitimate traffic that flows from a given source node to a destination node and an attacker wants to block as much legitimate traffic as possible by flooding the network with malicious traffic. We address scenarios with asymmetric information, in which the router must reveal its policy before the attacker decides how to distribute the malicious traffic among the network links, which is naturally modeled by the notion of Stackelberg equilibria. The paper focuses on parallel networks, and includes three main contributions: we show that computing the optimal attack policy against a given routing policy is an NP-hard problem; we establish conditions under which the Stackelberg equilibria lead to no regret; and we provide a metric that can be used to quantify how uncertainty about the attacker's capabilities limits the router's performance.

Submitted 12 March, 2020; originally announced March 2020.

arXiv:2003.04364 [pdf, other]

Distributed Submodular Maximization with Parallel Execution

Authors: Haoyuan Sun, David Grimsman, Jason R Marden

Abstract: The submodular maximization problem is widely applicable in many engineering problems where objectives exhibit diminishing returns. While this problem is known to be NP-hard for certain subclasses of objective functions, there is a greedy algorithm which guarantees an approximation of at least 1/2 of the optimal solution. This greedy algorithm can be implemented with a set of agents, each making a decision sequentially based on the choices of all prior agents. In this paper, we consider a generalization of the greedy algorithm in which agents can make decisions in parallel, rather than strictly in sequence. In particular, we are interested in partitioning the agents, where a set of agents in the partition all make a decision simultaneously based on the choices of prior agents, so that the algorithm terminates in limited iterations. We provide bounds on the performance of this parallelized version of the greedy algorithm and show that dividing the agents evenly among the sets in the partition yields an optimal structure. We additionally show that this optimal structure is still near-optimal when the objective function exhibits a certain monotone property. Lastly, we show that the same performance guarantees can be achieved in the parallelized greedy algorithm even when agents can only observe the decisions of a subset of prior agents.

Submitted 30 June, 2020; v1 submitted 9 March, 2020; originally announced March 2020.
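
The sequential greedy scheme described in the abstract above, where each agent decides based on the choices of all prior agents, can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the paper's algorithm or its parallelized variant: the objective is assumed to be set coverage (one example of a monotone submodular function), and the names `coverage`, `sequential_greedy`, and the toy action sets are hypothetical.

```python
from typing import FrozenSet, List


def coverage(chosen: List[FrozenSet[int]]) -> int:
    """Assumed objective: number of distinct elements covered."""
    covered = set()
    for action in chosen:
        covered |= action
    return len(covered)


def sequential_greedy(action_sets: List[List[FrozenSet[int]]]) -> List[FrozenSet[int]]:
    """Each agent, in order, picks the action with the best resulting coverage
    given the choices already made by all prior agents."""
    chosen: List[FrozenSet[int]] = []
    for actions in action_sets:
        best = max(actions, key=lambda a: coverage(chosen + [a]))
        chosen.append(best)
    return chosen


# Toy instance (made up for illustration): three agents, two actions each.
agents = [
    [frozenset({1, 2}), frozenset({3})],
    [frozenset({1, 2}), frozenset({4})],
    [frozenset({3, 4, 5}), frozenset({1})],
]
result = sequential_greedy(agents)
print(result, coverage(result))
```

In this toy instance the second agent avoids duplicating the first agent's choice because it observes that choice before deciding, which is the information structure the abstract relaxes when agents act in parallel.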

arXiv:2002.11648 [pdf, ps, other]

When showing your hand pays off: Announcing strategic intentions in Colonel Blotto games

Authors: Rahul Chandan, Keith Paarporn, Jason R. Marden

Abstract: In competitive adversarial environments, it is often advantageous to obfuscate one's strategies or capabilities. However, revealing one's strategic intentions may shift the dynamics of the competition in complex ways. Can it ever be advantageous to reveal strategic intentions to an opponent? In this paper, we consider three-stage Colonel Blotto games in which one player can choose whether or not to pre-commit resources to a single battlefield before play begins. This pre-commitment is public knowledge. In response, the opponent can either secure the battlefield by matching the pre-commitment with its own forces, or withdraw. In a two-player setting, we show that a weaker player never has an incentive to pre-commit any amount of resources to a battlefield regardless of how valuable it is. We then consider a three-player setting in which two players fight against a common adversary on separate fronts. Only one of the two players facing the adversary has the option of pre-committing. We find there are instances where this player benefits from pre-committing. The analysis indicates that under non-cooperative team settings and no possibility of forming alliances, there can be incentives to publicly announce one's strategic intentions to an adversary.

Submitted 13 March, 2020; v1 submitted 26 February, 2020; originally announced February 2020.

Comments: 8 pages, 5 figures, to appear in the Proceedings of the 2020 American Control Conference

arXiv:1911.10181 [pdf, other]

Can Taxes Improve Congestion on All Networks?

Authors: Philip N. Brown, Jason R. Marden

Abstract: We ask if it is possible to positively influence social behavior with no risk of unintentionally incentivizing pathological behavior. In network routing problems, if network traffic is composed of many individual agents, it is known that self-interested behavior among the agents can lead to suboptimal network congestion. We study situations in which a system planner charges monetary tolls for the use of network links in an effort to incentivize efficient routing choices by the users, but in which the users' sensitivity to tolls is heterogeneous and unknown. We seek locally-computed tolls that are guaranteed not to incentivize worse network routing than in the un-influenced case. Our main result is to show that if networks are sufficiently complex and populations sufficiently diverse, perverse incentives cannot be systematically avoided: any taxation mechanism that improves outcomes on one network must necessarily degrade them on another. Nonetheless, for the simple class of parallel networks, non-perverse taxes do exist; we fully characterize all such taxation mechanisms, showing that they are a generalized version of traditional marginal-cost tolls.

Submitted 22 November, 2019; originally announced November 2019.

arXiv:1911.09806 [pdf, other]

Optimal Taxes in Atomic Congestion Games

Authors: Dario Paccagnan, Rahul Chandan, Bryce L Ferguson, Jason R Marden

Abstract: How can we design mechanisms to promote efficient use of shared resources? Here, we answer this question in relation to the well-studied class of atomic congestion games, used to model a variety of problems, including traffic routing. Within this context, a methodology for designing tolling mechanisms that minimize the system inefficiency (price of anarchy) exploiting solely local information is so far missing in spite of the scientific interest. In this manuscript we resolve this problem through a tractable linear programming formulation that applies to and beyond polynomial congestion games. When specializing our approach to the polynomial case, we obtain tight values for the optimal price of anarchy and corresponding tolls, uncovering an unexpected link with load balancing games. We also derive optimal tolling mechanisms that are constant with the congestion level, generalizing the results of Caragiannis et al. [ACM Transactions on Algorithms, 2010] to polynomial congestion games and beyond. Finally, we apply our techniques to compute the efficiency of the marginal cost mechanism. Surprisingly, optimal tolling mechanisms using only local information perform closely to existing mechanisms that utilize global information [Bilò and Vinci, ACM Transactions on Economics and Computation, 2019], while the marginal cost mechanism, known to be optimal in the continuous-flow model, has lower efficiency than that encountered levying no toll. All results are tight for pure Nash equilibria, and extend to coarse correlated equilibria.

Submitted 11 March, 2021; v1 submitted 21 November, 2019; originally announced November 2019.

Comments: To appear in ACM Transactions on Economics and Computation, 32 pages, 5 figures

arXiv:1911.07823 [pdf, other]

When Smoothness is Not Enough: Toward Exact Quantification and Optimization of the Price of Anarchy

Authors: Rahul Chandan, Dario Paccagnan, Jason R. Marden

Abstract: The price of anarchy (PoA) is a popular metric for analyzing the inefficiency of self-interested decision making. Although its study is widespread, characterizing the PoA can be challenging. A commonly employed approach is based on the smoothness framework, which provides tight PoA values under the assumption that the system objective consists in the sum of the agents' individual welfares. Unfortunately, several important classes of problems do not satisfy this requirement (e.g., taxation in congestion games), and our first result demonstrates that the smoothness framework does *not* tightly characterize the PoA for such settings. Motivated by this observation, this work develops a framework that achieves two chief objectives: i) to tightly characterize the PoA for such scenarios, and ii) to do so through a tractable approach. As a direct consequence, the proposed framework recovers and generalizes many existing PoA results, and enables efficient computation of incentives that optimize the PoA. We conclude by highlighting the applicability of our contributions to incentive design in congestion games and utility design in distributed welfare games.

Submitted 24 May, 2021; v1 submitted 18 November, 2019; originally announced November 2019.

Comments: 25 pages, 5 figures, 1 table, in review

arXiv:1909.03382 [pdf, other]

Characterizing the interplay between information and strength in Blotto games

Authors: Keith Paarporn, Rahul Chandan, Mahnoosh Alizadeh, Jason R. Marden

Abstract: In this paper, we investigate informational asymmetries in the Colonel Blotto game, a game-theoretic model of competitive resource allocation between two players over a set of battlefields. The battlefield valuations are subject to randomness. One of the two players knows the valuations with certainty. The other knows only a distribution on the battlefield realizations. However, the informed player has fewer resources to allocate. We characterize unique equilibrium payoffs in a two-battlefield setup of the Colonel Blotto game. We then focus on a three-battlefield setup in the General Lotto game, a popular variant of the Colonel Blotto game. We characterize the unique equilibrium payoffs and mixed equilibrium strategies. We quantify the value of information - the difference in equilibrium payoff between the asymmetric information game and complete information game. We find information strictly improves the informed player's performance guarantee. However, the magnitude of improvement varies with the informed player's strength as well as the game parameters. Our analysis highlights the interplay between strength and information in adversarial environments.

Submitted 13 September, 2019; v1 submitted 8 September, 2019; originally announced September 2019.

Comments: 8 pages, 2 figures. Accepted for presentation at 58th Conference on Decision and Control (CDC), 2019

arXiv:1909.02671 [pdf, other]

doi 10.1109/TCNS.2020.3038842

The Impact of Complex and Informed Adversarial Behavior in Graphical Coordination Games

Authors: Keith Paarporn, Brian Canty, Philip N. Brown, Mahnoosh Alizadeh, Jason R. Marden

Abstract: How does system-level information impact the ability of an adversary to degrade performance in a networked control system? How does the complexity of an adversary's strategy affect its ability to degrade performance? This paper focuses on these questions in the context of graphical coordination games where an adversary can influence a given fraction of the agents in the system, and the agents follow log-linear learning, a well-known distributed learning algorithm. Focusing on a class of homogeneous ring graphs of various connectivity, we begin by demonstrating that minimally connected ring graphs are the most susceptible to adversarial influence. We then proceed to characterize how both (i) the sophistication of the attack strategies (static vs dynamic) and (ii) the informational awareness about the network structure can be leveraged by an adversary to degrade system performance. Focusing on the set of adversarial policies that induce stochastically stable states, our findings demonstrate that the relative importance between sophistication and information changes depending on the influencing power of the adversary. In particular, sophistication far outweighs informational awareness with regard to degrading system-level damage when the adversary's influence power is relatively weak. However, the opposite is true when an adversary's influence power is more substantial.

Submitted 3 October, 2020; v1 submitted 5 September, 2019; originally announced September 2019.

Comments: 15 pages, 4 figures

arXiv:1906.01142 [pdf, other]

doi 10.1109/TAC.2020.3002499

A risk-security tradeoff in graphical coordination games

Authors: Keith Paarporn, Mahnoosh Alizadeh, Jason R. Marden

Abstract: A system relying on the collective behavior of decision-makers can be vulnerable to a variety of adversarial attacks. How well can a system operator protect performance in the face of these risks? We frame this question in the context of graphical coordination games, where the agents in a network choose among two conventions and derive benefits from coordinating neighbors, and system performance is measured in terms of the agents' welfare. In this paper, we assess an operator's ability to mitigate two types of adversarial attacks - 1) broad attacks, where the adversary incentivizes all agents in the network and 2) focused attacks, where the adversary can force a selected subset of the agents to commit to a prescribed convention. As a mitigation strategy, the system operator can implement a class of distributed algorithms that govern the agents' decision-making process. Our main contribution characterizes the operator's fundamental trade-off between security against worst-case broad attacks and vulnerability from focused attacks. We show that this tradeoff significantly improves when the operator selects a decision-making process at random. Our work highlights the design challenges a system operator faces in maintaining resilience of networked distributed systems.

Submitted 6 June, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

Comments: 13 pages, double column, 4 figures. Submitted for journal publication

arXiv:1904.10915 [pdf, ps, other]

When Smoothness is Not Enough: Toward Exact Quantification and Optimization of the Price-of-Anarchy

Authors: Rahul Chandan, Dario Paccagnan, Jason R. Marden

Abstract: Today's multiagent systems have grown too complex to rely on centralized controllers, prompting increasing interest in the design of distributed algorithms. In this respect, game theory has emerged as a valuable tool to complement more traditional techniques. The fundamental idea behind this approach is the assignment of agents' local cost functions, such that their selfish minimization attains, or is provably close to, the global objective. Any algorithm capable of computing an equilibrium of the corresponding game inherits an approximation ratio that is, in the worst case, equal to its price-of-anarchy. Therefore, a successful application of the game design approach hinges on the possibility to quantify and optimize the equilibrium performance. Toward this end, we introduce the notion of generalized smoothness, and show that the resulting efficiency bounds are significantly tighter compared to those obtained using the traditional smoothness approach. Leveraging this newly-introduced notion, we quantify the equilibrium performance for the class of local resource allocation games. Finally, we show how the agents' local decision rules can be designed in order to optimize the efficiency of the corresponding equilibria, by means of a tractable linear program.

Submitted 18 February, 2020; v1 submitted 24 April, 2019; originally announced April 2019.

Comments: 9 pages, double column, 1 figure, 1 table, to appear in the proceedings of the 2019 IEEE Conference on Decision and Control

arXiv:1903.06288 [pdf, ps, other]

Optimal Price of Anarchy in Cost-Sharing Games

Authors: Rahul Chandan, Dario Paccagnan, Jason R. Marden

Abstract: The design of distributed algorithms is central to the study of multiagent systems control. In this paper, we consider a class of combinatorial cost-minimization problems and propose a framework for designing distributed algorithms with a priori performance guarantees that are near-optimal. We approach this problem from a game-theoretic perspective, assigning agents cost functions such that the equilibrium efficiency (price of anarchy) is optimized. Once agents' cost functions have been specified, any algorithm capable of computing a Nash equilibrium of the system inherits a performance guarantee matching the price of anarchy. Towards this goal, we formulate the problem of computing the price of anarchy as a tractable linear program. We then present a framework for designing agents' local cost functions in order to optimize for the worst-case equilibrium efficiency. Finally, we investigate the implications of our findings when this framework is applied to systems with convex, nondecreasing costs.

Submitted 14 March, 2019; originally announced March 2019.

Comments: 8 pages, double column, 1 figure, 2 tables, submitted to 2019 American Control Conference

arXiv:1807.10639 [pdf, other]

doi 10.1109/TCNS.2018.2889005

The Impact of Information in Greedy Submodular Maximization

Authors: David Grimsman, Mohd. Shabbir Ali, João P. Hespanha, Jason R. Marden

Abstract: The maximization of submodular functions is an NP-Hard problem for certain subclasses of functions, for which a simple greedy algorithm has been shown to guarantee a solution whose quality is within 1/2 of the optimal. When this algorithm is implemented in a distributed way, agents sequentially make decisions based on the decisions of all previous agents. This work explores how limited access to the decisions of previous agents affects the quality of the solution of the greedy algorithm. Specifically, we provide tight upper and lower bounds on how well the algorithm performs, as a function of the information available to each agent. Intuitively, the results show that performance roughly degrades proportionally to the size of the largest group of agents which make decisions independently. Additionally, we consider the case where a system designer is given a set of agents and a global limit on the amount of information that can be accessed. Our results show that the best designs partition the agents into equally-sized sets and allow agents to access the decisions of all previous agents within the same set.

Submitted 9 January, 2019; v1 submitted 25 July, 2018; originally announced July 2018.

Journal ref: IEEE Transactions on Control of Network Systems, December 2018

arXiv:1711.00609 [pdf, other]

Security Against Impersonation Attacks in Distributed Systems

Authors: Philip N. Brown, Holly Borowski, Jason R. Marden

Abstract: In a multi-agent system, transitioning from a centralized to a distributed decision-making strategy can introduce vulnerability to adversarial manipulation. We study the potential for adversarial manipulation in a class of graphical coordination games where the adversary can pose as a friendly agent in the game, thereby influencing the decision-making rules of a subset of agents. The adversary's influence can cascade throughout the system, indirectly influencing other agents' behavior and significantly impacting the emergent collective behavior. The main results in this paper focus on characterizing conditions under which the adversary's local influence can dramatically impact the emergent global behavior, e.g., destabilize efficient Nash equilibria.

Submitted 1 November, 2017; originally announced November 2017.

arXiv:1710.08500 [pdf, other]

Are Multiagent Systems Resilient to Communication Failures?

Authors: Philip N. Brown, Holly P. Borowski, Jason R. Marden

Abstract: A challenge in multiagent control systems is to ensure that they are appropriately resilient to communication failures between the various agents. In many common game-theoretic formulations of these types of systems, it is implicitly assumed that all agents have access to as much information about other agents' actions as needed. This paper endeavors to augment these game-theoretic methods with policies that would allow agents to react on-the-fly to losses of this information. Unfortunately, we show that even if a single agent loses communication with one other weakly-coupled agent, this can cause arbitrarily-bad system states to emerge as various solution concepts of an associated game, regardless of how the agent accounts for the communication failure and regardless of how weakly coupled the agents are. Nonetheless, we show that the harm that communication failures can cause is limited by the structure of the problem; when agents' action spaces are richer, problems are more susceptible to these types of pathologies. Finally, we undertake an initial study into how a system designer might prevent these pathologies, and explore a few limited settings in which communication failures cannot cause harm.

Submitted 23 October, 2017; originally announced October 2017.

arXiv:1710.07460 [pdf, other]

doi 10.1109/TAC.2018.2878397

The Importance of System-Level Information in Multiagent Systems Design: Cardinality and Covering Problems

Authors: Dario Paccagnan, Jason R. Marden

Abstract: A fundamental challenge in multiagent systems is to design local control algorithms to ensure a desirable collective behaviour. The information available to the agents, gathered either through communication or sensing, naturally restricts the achievable performance. Hence, it is fundamental to identify what piece of information is valuable and can be exploited to design control laws with enhanced performance guarantees. This paper studies the case when such information is uncertain or inaccessible for a class of submodular resource allocation problems termed covering problems. In the first part of this work we pinpoint a fundamental risk-reward tradeoff faced by the system operator when conditioning the control design on a valuable but uncertain piece of information, which we refer to as the cardinality, that represents the maximum number of agents that can simultaneously select any given resource. Building on this analysis, we propose a distributed algorithm that allows agents to learn the cardinality while adjusting their behaviour over time. This algorithm is proved to perform on par with or better than the optimal design obtained when the exact cardinality is known a priori.

Submitted 28 October, 2018; v1 submitted 20 October, 2017; originally announced October 2017.

Comments: 14 pages, double column, 7 figures

arXiv:1710.01409 [pdf, other]

Multiagent Maximum Coverage Problems: The Trade-off Between Anarchy and Stability

Authors: Vinod Ramaswamy, Dario Paccagnan, Jason R. Marden

Abstract: The price of anarchy and price of stability are two well-studied performance metrics that seek to characterize the inefficiency of equilibria in distributed systems. The distinction between these two performance metrics centers on the equilibria that they focus on: the price of anarchy characterizes the quality of the worst-performing equilibria, while the price of stability characterizes the quality of the best-performing equilibria. While much of the literature focuses on these metrics from an analysis perspective, in this work we consider these performance metrics from a design perspective. Specifically, we focus on the setting where a system operator is tasked with designing local utility functions to optimize these performance metrics in a class of games termed covering games. Our main result characterizes a fundamental trade-off between the price of anarchy and price of stability in the form of a fully explicit Pareto frontier. Within this setup, optimizing the price of anarchy comes directly at the expense of the price of stability (and vice versa). Our second result demonstrates how a system operator could incorporate an additional piece of system-level information into the design of the agents' utility functions to breach these limitations and improve the system's performance. This valuable piece of system-level information pertains to the performance of the worst-performing agent in the system.

Submitted 14 March, 2020; v1 submitted 3 October, 2017; originally announced October 2017.

Comments: 14 pages, 4 figures

arXiv:1402.3610 [pdf, ps, other]

Potential Games are Necessary to Ensure Pure Nash Equilibria in Cost Sharing Games

Authors: Ragavendran Gopalakrishnan, Jason R. Marden, Adam Wierman

Abstract: We consider the problem of designing distribution rules to share "welfare" (cost or revenue) among individually strategic agents. There are many known distribution rules that guarantee the existence of a (pure) Nash equilibrium in this setting, e.g., the Shapley value and its weighted variants; however, a characterization of the space of distribution rules that guarantee the existence of a Nash equilibrium is unknown. Our work provides an exact characterization of this space for a specific class of scalable and separable games, which includes a variety of applications such as facility location, routing, network formation, and coverage games. Given arbitrary local welfare functions W, we prove that a distribution rule guarantees equilibrium existence for all games (i.e., all possible sets of resources, agent action sets, etc.) if and only if it is equivalent to a generalized weighted Shapley value on some "ground" welfare functions W', which can be distinct from W. However, if budget-balance is required in addition to the existence of a Nash equilibrium, then W' must be the same as W. We also provide an alternate characterization of this space in terms of "generalized" marginal contributions, which is more appealing from the point of view of computational tractability. A possibly surprising consequence of our result is that, in order to guarantee equilibrium existence in all games with any fixed local welfare functions, it is necessary to work within the class of potential games.

Submitted 14 February, 2014; originally announced February 2014.

Comments: Presented at various conferences in 2013. Accepted for journal publication in December 2013

MSC Class: Primary: 91A10; secondary: 91A40; 91A80

Showing 1–42 of 42 results for author: Marden, J R