Collaborative Decision-Making and the k𝑘kitalic_k-Strong Price of Anarchy in Common Interest Games

Bryce L. Ferguson    \IEEEmembershipStudent Member, IEEE    Dario Paccagnan    \IEEEmembershipMember, IEEE    Bary S. R. Pradelski    and Jason R. Marden    \IEEEmembershipSenior Member, IEEE This work was supported in part by the Office of Naval Research under Grant # N00014-20-1-2359, the Air Force Office of Scientific Research under Grants # FA95550-20-1-0054 and # FA9550-21-1-0203, and the French National Research Agency (ANR) under Grant # ANR-19-CE48-0018-01.Bryce L. Ferguson and Jason R. Marden are with the Department of Electrical and Computer Engineering at the University of California, Santa Barbara, CA. {blferguson,jrmarden}@ece.ucsb.edu.Dario Paccagnan is with the Department of Computing at Imperial College London, UK. [email protected].Bary S. R. Pradelski is with the National Centre for Scientific Research (CNRS) and the Department of Economics at Oxford, UK.[email protected].
Abstract

The control of large-scale, multi-agent systems often entails distributing decision-making across the system components. However, with advances in communication and computation technologies, we can consider new collaborative decision-making paradigms that bridge centralized and distributed control architectures. In this work, we seek to understand the benefits and costs of increased collaborative communication in multi-agent systems. We specifically study this in the context of common interest games in which groups of up to k𝑘kitalic_k agents can coordinate their actions in maximizing a common objective function. The equilibria that emerge in these systems are the k𝑘kitalic_k-strong Nash equilibria of the common interest game; studying the properties of these states provides relevant insights into the efficacy of inter-agent collaboration. Our contributions come threefold: 1) provide bounds on how well k𝑘kitalic_k-strong Nash equilibria approximate the optimal system welfare, formalized by the k𝑘kitalic_k-strong price of anarchy, 2) prove the run-time and transient performance of collaborative agent-based dynamics, and 3) introduce techniques of redesigning objectives for groups of agents which improve system performance. We study these three facets generally as well as in the context of resource allocation problems, in which we provide tractable linear programs that give tight bounds on the k𝑘kitalic_k-strong price of anarchy.

1 Introduction

Large-scale systems such as transportation services [1], robotic fleets [2], supply chains [3], or cloud computing services [4] can be challenging to design effective control schemes for due to their many components and vast scale. The two prevailing paradigms to design control schemes are centralized control [5, 6, 7], which guides behavior across the entire system and distributed control  [8, 9, 10], which allows local components to guide their own behavior. Each of these approaches possesses respective pros and cons: centralization allows for more direct manipulation of system behavior at the cost of greater communication and computation requirements, while decentralization reduces the communication and computation requirements but cannot always attain the desired system behavior. Advancements in embedded communication and computation [11, 12, 13, 14] enable the design of new paradigms that exist between centralized and distributed control.

Specifically, we study the efficacy of learning in multi-agent systems when individual system components (or agents) can partially communicate and thus coordinate their behavior. Many engineering domains are on the precipice of enabling these collaborative paradigms; for example, autonomous vehicle platoons with connected cruise control [15], unmanned aerial surveillance vehicles with range-limited communication [16], and cloud computing networks with emerging distributed learning techniques [17]. In each of these settings, inter-agent communication and collaboration offer the opportunity to improve the performance attainable by the system as a whole; however, implementing these frameworks incurs costs that are both monetary–in the form of the additional technology required–and computational–in the form of more complex decision-making algorithms. In this work, we provide tools to help better understand the benefits and costs associated with collaborative communication in multi-agent systems.

Refer to caption
Figure 1: Illustration of the k𝑘kitalic_k-strong Nash equilibrium local optimality guarantee for a three-agent common-interest game where k{1,2,3}𝑘123k\in\{1,2,3\}italic_k ∈ { 1 , 2 , 3 }. In each case, if the dark cube is a k𝑘kitalic_k-strong Nash equilibrium, then it is optimal over the highlighted region with respect to the shared objective function W𝑊Witalic_W. As k𝑘kitalic_k (the size of collaborative groups) increases, the local optimality is strengthened by holding overall k𝑘kitalic_k-lateral deviations.

We model a multi-agent system as a common interest game where some (but not all) groups of agents can collaborate in selecting their actions to maximize the system welfare. We particularly focus on the case where a collaborative action takes the form of a group best response, i.e., a group of agents updating their actions in response to the remaining players’ actions. As the size and number of these collaborative groups increase, a coordinated group decision has a larger impact on system behavior. To range the level of collaboration between the fully distributed setting (where no agents can collaborate) and the fully centralized setting (where all agents can collaborate collectively), we consider the cases where groups of up to k𝑘kitalic_k agents can collaborate. In these collaborative environments, a stable state of the system is that of the k𝑘kitalic_k-strong Nash equilibrium [18]. Researchers have studied the existence [19] and computation [20] of strong Nash equilibria in settings including congestion games [21], lexicographical games [22], and Markov games [23]. This work applies these concepts to multi-agent systems. To understand the possible benefits of collaboration to system performance, we quantify how well k𝑘kitalic_k-strong Nash equilibria approximate the optimal welfare, termed the k𝑘kitalic_k-strong price of anarchy [24, 25]. To understand the possible cost of collaboration, we analyze the running time and transient performance of agent-based dynamics, which converge to k𝑘kitalic_k-strong Nash equilibria.

Distributed learning in games has been a widely studied area in controls [26], but the ability to reach equilibrium with coalitional best responses has not yet been studied; we thus study the added run time of collaborative algorithms. Quantifying the k𝑘kitalic_k-strong price of anarchy has been studied in network formation games [24, 25] and load balancing games [27, 28, 29], as well as more general utility maximizing games [30, 31]. In many of these, the bounds are either not tight (particularly for finitely many players) or hold for equilibria which need not exist. By focusing on the class of common interest games, we guarantee the existence of collaborative equilibria, provide tight approximation bounds, and develop new insights into collaborative multi-agent optimization.

Organization - This work provides tools to understand the benefits and costs of collaborative communication by studying the qualities of k𝑘kitalic_k-strong Nash equilibria. In Section 3.1, we consider the case where groups of agents are designed to maximize the system welfare and introduce the notion of (λ,μ)𝜆𝜇(\lambda,\mu)( italic_λ , italic_μ )-k𝑘kitalic_k-coalitionally smooth games (a generalization of smooth games [32] and coalitionally smooth games [30]), and provide bounds on the k𝑘kitalic_k-strong price of anarchy. Then, in Section 3.2, we focus on the well-studied setting of distributed resource allocation problems [33, 34, 35, 36, 37, 38], and provide tight bounds on the k𝑘kitalic_k-strong price of anarchy via the solution of a tractable linear program. Fig. 3 plots these bounds and demonstrates how increased collaboration improves efficiency guarantees in several classes of resource allocation problems. In Section 4, we consider the effects of group decision-making on agent-based dynamics; specifically, we show the added run-time complexity of coalitional round-robin dynamics and provide transient performance guarantees of asynchronous best response dynamics. We support our findings with numerical examples. In Section 5, we consider that the system operator may be able to design the agents’ objective separately from the system welfare; we provide a generalized technique for bounding the k𝑘kitalic_k-strong price of anarchy in this setting. In Section 5.2, we again focus on the setting of resource allocation and provide two linear programs to lower and upper bound the attainable k𝑘kitalic_k-strong price of anarchy guarantee via utility design.

2 Preliminaries

Throughout, we will denote [n]={1,,n}delimited-[]𝑛1𝑛[n]=\{1,\ldots,n\}[ italic_n ] = { 1 , … , italic_n }. We will regularly use the binomial coefficient (nk)=n!(nk)!k!binomial𝑛𝑘𝑛𝑛𝑘𝑘\binom{n}{k}=\frac{n!}{(n-k)!k!}( FRACOP start_ARG italic_n end_ARG start_ARG italic_k end_ARG ) = divide start_ARG italic_n ! end_ARG start_ARG ( italic_n - italic_k ) ! italic_k ! end_ARG in constructing optimization problems; we define this value as 00 when n<k𝑛𝑘n<kitalic_n < italic_k for ease of notation.

2.1 Collaborative Decision Making

Consider a finite set of agents N={1,,n}𝑁1𝑛N=\{1,\ldots,n\}italic_N = { 1 , … , italic_n }. Each agent iN𝑖𝑁i\in Nitalic_i ∈ italic_N selects an action aisubscript𝑎𝑖a_{i}italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT from a finite action set 𝒜isubscript𝒜𝑖\mathcal{A}_{i}caligraphic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. When each agent selects an action, we will denote their joint action by the tuple a=(a1,,an)𝒜=𝒜1××𝒜n𝑎subscript𝑎1subscript𝑎𝑛𝒜subscript𝒜1subscript𝒜𝑛a=(a_{1},\ldots,a_{n})\in\mathcal{A}=\mathcal{A}_{1}\times\cdots\times\mathcal% {A}_{n}italic_a = ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ∈ caligraphic_A = caligraphic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT × ⋯ × caligraphic_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. Let G=(N,𝒜)𝐺𝑁𝒜G=(N,\mathcal{A})italic_G = ( italic_N , caligraphic_A ) be a tuple encoding the components of the agent environment. The system’s performance is dictated by the agents’ actions; as such, for each joint-action a𝑎aitalic_a, we assign a system welfare W(a)𝑊𝑎W(a)italic_W ( italic_a ) where W:𝒜0:𝑊𝒜subscriptabsent0W:\mathcal{A}\rightarrow\mathbb{R}_{\geq 0}italic_W : caligraphic_A → blackboard_R start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT is the system designer’s objective function. With this, we let the tuple (G,W)𝐺𝑊(G,W)( italic_G , italic_W ) denote a multi-agent system (often referred to as a system), which defines the primitives of the system designer’s problem of designing an effective control algorithm.

The system designer would like to configure the agents to reach a joint action that maximizes the system welfare, i.e.,

aoptargmaxa𝒜W(a).superscript𝑎optsubscriptargmax𝑎𝒜𝑊𝑎a^{\rm opt}\in\operatorname*{arg\,max}_{a\in\mathcal{A}}W(a).italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ∈ start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_W ( italic_a ) . (1)

Though this system state is ideal, it may be difficult to attain as 1) solving for the optimal allocation can be combinatorial and in some cases (including those from Section 3.2) NP-hard [33], and 2) it requires a centralized authority to control all agents, which may be practically or logistically difficult. To resolve this, we will consider that agents make decisions in a decentralized manner.

Fully distributing the decision-making involves designing each agent to update their action locally and has been widely studied and developed to guarantee reasonable system behavior [8]; however, fully distributing decision-making may often become unnecessary as emerging communication technologies enable collaborative inter-agent decision-making[11]. To implement one such collaborative system architecture, a system operator must make two decisions: 1) which group of agents can collaborate on their decisions (possibly subject to some operational constraints), and 2) how the agents should collaborate on their decisions. A natural choice for the latter is a group best response. Let ΓNΓ𝑁\Gamma\subseteq Nroman_Γ ⊆ italic_N be a group of agents endowed with the ability to collaboratively select a group action aΓ𝒜Γ=iΓ𝒜isubscript𝑎Γsubscript𝒜Γsubscriptproduct𝑖Γsubscript𝒜𝑖a_{\Gamma}\in\mathcal{A}_{\Gamma}={\prod}_{i\in\Gamma}\mathcal{A}_{i}italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT ∈ caligraphic_A start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT = ∏ start_POSTSUBSCRIPT italic_i ∈ roman_Γ end_POSTSUBSCRIPT caligraphic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, which they select by maximizing the system welfare over their group action-set,

aΓargmaxaΓ𝒜ΓW(aΓ,aΓ),subscript𝑎Γsubscriptargmaxsuperscriptsubscript𝑎Γsubscript𝒜Γ𝑊superscriptsubscript𝑎Γsubscript𝑎Γa_{\Gamma}\in\operatorname*{arg\,max}_{a_{\Gamma}^{\prime}\in\mathcal{A}_{% \Gamma}}W(a_{\Gamma}^{\prime},a_{-\Gamma}),italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT ∈ start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_a start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT ) , (2)

where aΓsubscript𝑎Γa_{-\Gamma}italic_a start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT denotes the actions of the players iNΓ𝑖𝑁Γi\in N\setminus\Gammaitalic_i ∈ italic_N ∖ roman_Γ. If there are multiple elements in the argmax, the group breaks them at random unless they can remain with their current action.

Intuitively, a group best responding and collaboratively maximizing the system welfare should lead to direct improvements to system performance; however, one can consider other group decision-making rules as well. In particular, in Section 5, we will consider that the system designer can design the agents’ objective separately from the system objective as a means to further shape system behavior. In either case, one would imagine that the greater the collaborative structure, the greater the impact on emergent behavior.

For the system operator’s decision over which groups should collaborate, let 𝒞2N𝒞superscript2𝑁\mathcal{C}\subseteq 2^{N}caligraphic_C ⊆ 2 start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT denote the collaboration set, or the set of groups of agents (Γ𝒞Γ𝒞\Gamma\in\mathcal{C}roman_Γ ∈ caligraphic_C) able to collaborate their decisions. These collaborations can overlap–where agents can partake in multiple, disparate collaborations–and vary in size. For example, if agents send signals through a communication network [39], we will have 𝒞={(i,j)N2(i,j)E}𝒞conditional-set𝑖𝑗superscript𝑁2𝑖𝑗𝐸\mathcal{C}=\{(i,j)\in N^{2}\mid(i,j)\in E\}caligraphic_C = { ( italic_i , italic_j ) ∈ italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∣ ( italic_i , italic_j ) ∈ italic_E } where E𝐸Eitalic_E are the edges in a communication graph. If agents are allowed to communicate with each other one at a time and make pairwise decisions [40], then 𝒞={(i,j)N2}𝒞𝑖𝑗superscript𝑁2\mathcal{C}=\{(i,j)\in N^{2}\}caligraphic_C = { ( italic_i , italic_j ) ∈ italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT }. If agents can only communicate with others within a local proximity [41], then 𝒞={ΓNρ(i,j)di,jΓ}𝒞conditional-setΓ𝑁formulae-sequence𝜌𝑖𝑗𝑑for-all𝑖𝑗Γ\mathcal{C}=\{\Gamma\subseteq N\mid\rho(i,j)\leq d\leavevmode\nobreak\ \forall i% ,j\in\Gamma\}caligraphic_C = { roman_Γ ⊆ italic_N ∣ italic_ρ ( italic_i , italic_j ) ≤ italic_d ∀ italic_i , italic_j ∈ roman_Γ } where ρ𝜌\rhoitalic_ρ measures the distance between two agents and d𝑑ditalic_d is a maximum communication range. Once the system operator decides on the collaborative structure and the group decision-making protocol, the agents’ decision-making process forms a collaborative multi-agent system, denoted by the tuple (G,W,𝒞)𝐺𝑊𝒞(G,W,\mathcal{C})( italic_G , italic_W , caligraphic_C ).

As we vary the number and size of collaborative sets, we can consider control paradigms somewhere between centralized (i.e., {N}𝒞𝑁𝒞\{N\}\in\mathcal{C}{ italic_N } ∈ caligraphic_C) and fully distributed (i.e., 𝒞={{1},{2},,{n}}𝒞12𝑛\mathcal{C}=\left\{\{1\},\{2\},\ldots,\{n\}\right\}caligraphic_C = { { 1 } , { 2 } , … , { italic_n } }. This work seeks to understand the efficacy of different levels of communication/collaboration. To more effectively quantify this, we consider a specific type of collaboration set in which we can range between the centralized and distributed extremes.

2.2 k-Strong Nash Equilibria

We consider the collaboration sets that contain groups of agents up to size k𝑘kitalic_k. Let 𝒞k={ΓN|Γ|=k}subscript𝒞𝑘conditional-setΓ𝑁Γ𝑘\mathcal{C}_{k}=\{\Gamma\subseteq N\mid|\Gamma|=k\}caligraphic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = { roman_Γ ⊆ italic_N ∣ | roman_Γ | = italic_k } denote the subsets of exactly k𝑘kitalic_k agents and 𝒞[k]=ζ[k]𝒞ζsubscript𝒞delimited-[]𝑘subscript𝜁delimited-[]𝑘subscript𝒞𝜁\mathcal{C}_{[k]}=\bigcup_{\zeta\in[k]}\mathcal{C}_{\zeta}caligraphic_C start_POSTSUBSCRIPT [ italic_k ] end_POSTSUBSCRIPT = ⋃ start_POSTSUBSCRIPT italic_ζ ∈ [ italic_k ] end_POSTSUBSCRIPT caligraphic_C start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT be the subsets that contain at most k𝑘kitalic_k agents. When k=1𝑘1k=1italic_k = 1, we recover the fully distributed setting, and when k=n𝑘𝑛k=nitalic_k = italic_n, we recover the fully centralized setting. As we vary k𝑘kitalic_k between 1111 and n𝑛nitalic_n, we sweep through different levels of communication and collaboration.

Refer to caption
Figure 2: Strong Price of Anarchy in resource covering games with n=20𝑛20n=20italic_n = 20 players and coalitions up to size k𝑘kitalic_k (horizontal axis). As the size of groups that are allowed to collaborate grows, so too does the approximation ratio (i.e., strong price of anarchy) of a k𝑘kitalic_k-strong Nash equilibrium. The efficiency of an equilibrium can be further improved by designing the utility functions agents are set to maximize. The solid green line is the k𝑘kitalic_k-strong price of anarchy when agents maximize the system objective (generated by Theorem 3.3). The dashed red line is an upper bound on the k𝑘kitalic_k-strong price of anarchy while using an optimal utility design (generated by Proposition 5.3).

In the game-theoretic approach to multi-agent systems, a Nash equilibrium is a joint action where no agent can unilaterally deviate their action to improve the system welfare [18]. We generalize this concept to the setting of collaborative decision-making by considering a k𝑘kitalic_k-strong Nash equilibrium as a joint action where no group of k𝑘kitalic_k agents can deviate their group’s actions to improve the welfare.

Definition 1.

A joint-action akSNE𝒜superscript𝑎𝑘SNE𝒜a^{k{\rm SNE}}\in\mathcal{A}italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ∈ caligraphic_A is a k𝑘kitalic_k-strong Nash equilibrium for the common-interest game (G,W,𝒞[k])𝐺𝑊subscript𝒞delimited-[]𝑘(G,W,\mathcal{C}_{[k]})( italic_G , italic_W , caligraphic_C start_POSTSUBSCRIPT [ italic_k ] end_POSTSUBSCRIPT ) if

W(akSNE)W(aΓ,aΓkSNE),aΓ𝒜Γ,Γ𝒞[k].formulae-sequence𝑊superscript𝑎𝑘SNE𝑊subscriptsuperscript𝑎Γsubscriptsuperscript𝑎𝑘SNEΓformulae-sequencefor-allsubscriptsuperscript𝑎Γsubscript𝒜ΓΓsubscript𝒞delimited-[]𝑘W(a^{k{\rm SNE}})\geq W(a^{\prime}_{\Gamma},a^{k{\rm SNE}}_{-\Gamma}),% \leavevmode\nobreak\ \forall a^{\prime}_{\Gamma}\in\mathcal{A}_{\Gamma},% \leavevmode\nobreak\ \Gamma\in\mathcal{C}_{[k]}.italic_W ( italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ) ≥ italic_W ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT ) , ∀ italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT ∈ caligraphic_A start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT , roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT [ italic_k ] end_POSTSUBSCRIPT . (3)

Let kSNE(G,W)𝒜𝑘SNE𝐺𝑊𝒜k{\rm SNE}(G,W)\subseteq\mathcal{A}italic_k roman_SNE ( italic_G , italic_W ) ⊆ caligraphic_A denote the set of all k𝑘kitalic_k-strong Nash equilibria. Note that when k=1𝑘1k=1italic_k = 1, we recover the classical definition of a Nash equilibrium, and when k=n𝑘𝑛k=nitalic_k = italic_n, the equilibrium condition implies global optimality. Definition 1 differs slightly from the literature, where k𝑘kitalic_k-strong Nash equilibria are defined by no group of agents deviating to a new group action that is Pareto-optimal for the group (i.e., no agent receives a lower payoff with respect to their individual utility function) [18]; when the agents respond to a common interest objective, the definitions are equivalent. Additionally, in general games, k𝑘kitalic_k-strong Nash equilibria need not exist; however, that is not the case in our setting due to the common-interest structure we impose on agent decision-making.

Proposition 2.1.

In a system (G,W)𝐺𝑊(G,W)( italic_G , italic_W ) with collaboration set 𝒞[k]subscript𝒞delimited-[]𝑘\mathcal{C}_{[k]}caligraphic_C start_POSTSUBSCRIPT [ italic_k ] end_POSTSUBSCRIPT for any k[n]𝑘delimited-[]𝑛k\in[n]italic_k ∈ [ italic_n ], a k𝑘kitalic_k-strong Nash equilibrium exists.

The proof appears in the appendix.

The main focus of this work is understanding how equilibrium performance changes with the level of collaborative communication. Notice that (3) serves as a local optimality guarantee in the neighborhood of k𝑘kitalic_k-lateral deviations. Fig. 1 depicts this for a three-player matrix game; when k=1𝑘1k=1italic_k = 1, a 1111-strong Nash equilibrium (or just Nash equilibrium) is optimal over the unilateral deviations, when k=2𝑘2k=2italic_k = 2 a 2222-strong Nash equilibrium is optimal over the bilateral deviations, and when k=3=n𝑘3𝑛k=3=nitalic_k = 3 = italic_n, the 3333-strong Nash equilibrium is optimal over the whole joint-action space. From this, we observe that the local optimality guarantee is strengthened as we increase the level of collaboration k𝑘kitalic_k (i.e., kSNEkSNEsuperscript𝑘SNE𝑘SNEk^{\prime}{\rm SNE}\subseteq k{\rm SNE}italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT roman_SNE ⊆ italic_k roman_SNE for k>ksuperscript𝑘𝑘k^{\prime}>kitalic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT > italic_k).

To quantify the effect of varying k𝑘kitalic_k on equilibrium performance, we consider the ratio of worst-case equilibrium welfare and the optimal attainable welfare, termed the k𝑘kitalic_k-strong price of anarchy.

SPoAk(G,W)=minakSNEkSNE(G,W)W(akSNE)maxaopt𝒜W(aopt)[0,1],subscriptSPoA𝑘𝐺𝑊subscriptsuperscript𝑎𝑘SNE𝑘SNE𝐺𝑊𝑊superscript𝑎𝑘SNEsubscriptsuperscript𝑎opt𝒜𝑊superscript𝑎opt01\mathrm{SPoA}_{k}(G,W)=\frac{\min_{a^{k{\rm SNE}}\in k{\rm SNE}(G,W)}W(a^{k{% \rm SNE}})}{\max_{a^{\rm opt}\in\mathcal{A}}W(a^{\rm opt})}\in[0,1],roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_G , italic_W ) = divide start_ARG roman_min start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ∈ italic_k roman_SNE ( italic_G , italic_W ) end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ) end_ARG start_ARG roman_max start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) end_ARG ∈ [ 0 , 1 ] , (4)

where we let 0/0000/00 / 0 be defined as 1111 to ignore the degenerate case when no welfare is attainable. In the multi-agent system (G,W)𝐺𝑊(G,W)( italic_G , italic_W ) with communication structure 𝒞[k]subscript𝒞delimited-[]𝑘\mathcal{C}_{[k]}caligraphic_C start_POSTSUBSCRIPT [ italic_k ] end_POSTSUBSCRIPT, every k𝑘kitalic_k-strong Nash equilibrium approximates the optimal solution at least as well as SPoAk(G,W)subscriptSPoA𝑘𝐺𝑊\mathrm{SPoA}_{k}(G,W)roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_G , italic_W ). Accordingly, we will use the k𝑘kitalic_k-strong price of anarchy to understand the efficiency associated with collaborative decision-making. For example, in Fig. 2, we depict the k𝑘kitalic_k-strong price of anarchy in resource covering games [33] for 1kn1𝑘𝑛1\leq k\leq n1 ≤ italic_k ≤ italic_n, illustrating the performance guarantees attainable between centralized and distributed control paradigms.

2.3 Summary of Contributions

This work studies the benefits and costs of increased collaborative communication within multi-agent systems. Our contributions come threefold:
1) In Section 3, we provide tools to quantify the k𝑘kitalic_k-strong price of anarchy when agents optimize the system objective. We introduce (λ,μ)𝜆𝜇(\lambda,\mu)( italic_λ , italic_μ )-k𝑘kitalic_k-coalitionally smooth games and provide a k𝑘kitalic_k-strong price of anarchy guarantee using the parameters λ𝜆\lambdaitalic_λ and μ𝜇\muitalic_μ. We then focus on the class of resource allocation games, where in Proposition 3.2, we show that these parameters can be found via the solution to a tractable linear program. In Theorem 3.3, we show that combining the constraints of each of the k𝑘kitalic_k linear programs gives a tight bound. Figure 3 depicts the k𝑘kitalic_k-strong price of anarchy for several classes of resource allocation games.
2) In Section 4, we study collaborative dynamics that reach these equilibria. In Section 4.1, we introduce the coalitional round-robin dynamics and show that an equilibrium is reached in a finite number of best responses and that the number of welfare comparisons grows with a small-base exponential of k𝑘kitalic_k. In Section 4.2, we introduce the asynchronous coalitional best response dynamics, which we show converge almost surely. Further, if the game is (λ,μ)𝜆𝜇(\lambda,\mu)( italic_λ , italic_μ )-k𝑘kitalic_k-coalitionally smooth, then we provide a bound on the transient performance (or the cumulative welfare along the dynamics). We support these findings with a numerical study in Section 4.3.
3) In Section 5, we consider how to improve the design of a group’s decision-making process. By providing the agents with a new, designed objective function, the system designer may alter the set of equilibria and ideally increase the k𝑘kitalic_k-strong price of anarchy. In Section 5.1, we generalize the notion of coalitional smoothness to the setting where the agents’ objective differs from the system welfare, and in Theorem 5.2, we show how we can construct an optimal utility rule. Fig. 5 shows the k𝑘kitalic_k-strong price of anarchy under the optimal utility design for resource allocation games, demonstrating the added benefit of designing how groups of agents make decisions.

Refer to caption
Figure 3: Tight k𝑘kitalic_k-strong price of anarchy bounds for resource allocation games with various welfare functions. We illustrate four settings of local welfare function (top, left to right), and for each, we use Theorem 3.3 to generate tight bounds on the k𝑘kitalic_k-strong price of anarchy for all 1kn1𝑘𝑛1\leq k\leq n1 ≤ italic_k ≤ italic_n. The bottom figures show these bounds and illustrate how increased inter-agent collaboration increases our efficiency guarantees on equilibrium system welfare.

3 Quantifying k𝑘kitalic_k-Strong Price of Anarchy

3.1 Coalitionally Smooth Games

We first consider the efficiency of k𝑘kitalic_k-strong Nash equilibria for general multi-agent systems. This efficiency–quantified by the k𝑘kitalic_k-strong price of anarchy–is conditioned on the system welfare W𝑊Witalic_W and the agent decision-making environment G𝐺Gitalic_G. In Definition 2, we provide a condition on a system (G,W)𝐺𝑊(G,W)( italic_G , italic_W ) that will be useful in bounding the k𝑘kitalic_k-strong price of anarchy.

Definition 2.

A system (G,W)𝐺𝑊(G,W)( italic_G , italic_W ) is (λ,μ)𝜆𝜇(\lambda,\mu)( italic_λ , italic_μ )-k𝑘kitalic_k-coalitionally smooth, where λ,μ0k𝜆𝜇superscriptsubscriptabsent0𝑘\lambda,\mu\in\mathbb{R}_{\geq 0}^{k}italic_λ , italic_μ ∈ blackboard_R start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT, if for all a,a𝒜𝑎superscript𝑎𝒜a,a^{\prime}\in\mathcal{A}italic_a , italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A

1(nζ)Γ𝒞ζW(aΓ,aΓ)λζW(a)μζW(a),ζ[k].formulae-sequence1binomial𝑛𝜁subscriptΓsubscript𝒞𝜁𝑊superscriptsubscript𝑎Γsubscript𝑎Γsubscript𝜆𝜁𝑊superscript𝑎subscript𝜇𝜁𝑊𝑎for-all𝜁delimited-[]𝑘\frac{1}{\binom{n}{\zeta}}\sum_{\Gamma\in\mathcal{C}_{\zeta}}W(a_{\Gamma}^{% \prime},a_{-\Gamma})\geq\lambda_{\zeta}W(a^{\prime})-\mu_{\zeta}W(a),% \leavevmode\nobreak\ \forall\zeta\in[k].divide start_ARG 1 end_ARG start_ARG ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) end_ARG ∑ start_POSTSUBSCRIPT roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_a start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT ) ≥ italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_W ( italic_a ) , ∀ italic_ζ ∈ [ italic_k ] . (5)

In (5), we provide a constraint on the welfare function stating that the average effect of a group of size ζ𝜁\zetaitalic_ζ deviating their action from a𝑎aitalic_a to asuperscript𝑎a^{\prime}italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is lower bounded by a linear combination of the welfare of a𝑎aitalic_a and asuperscript𝑎a^{\prime}italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. The term smooth is in reference to the welfare function’s change over the joint-action space being bounded by (5). Additionally, Definition 2 extends the classic notion of smooth games [32] and coalitional smoothness for strong equilibria [30] to the setting of k𝑘kitalic_k-coalitions in common interest games.

In effect, every system (G,W)𝐺𝑊(G,W)( italic_G , italic_W ) is smooth with λζ=μζ=0subscript𝜆𝜁subscript𝜇𝜁0\lambda_{\zeta}=\mu_{\zeta}=0italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT = italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT = 0 for all ζ[k]𝜁delimited-[]𝑘\zeta\in[k]italic_ζ ∈ [ italic_k ], but some parameters (λ,μ)𝜆𝜇(\lambda,\mu)( italic_λ , italic_μ ) are more useful than others. In Proposition 3.1, we show that the parameters λ𝜆\lambdaitalic_λ and μ𝜇\muitalic_μ from Definition 2 can be used to lower bound the k𝑘kitalic_k-strong price of anarchy.

Proposition 3.1.

A system (G,W)𝐺𝑊(G,W)( italic_G , italic_W ) that is (λ,μ)𝜆𝜇(\lambda,\mu)( italic_λ , italic_μ )-k𝑘kitalic_k-coalitionally smooth has k𝑘kitalic_k-strong price of anarchy satisfying

SPoAk(G,W)λζ1+μζ,ζ[k].formulae-sequencesubscriptSPoA𝑘𝐺𝑊subscript𝜆𝜁1subscript𝜇𝜁for-all𝜁delimited-[]𝑘\mathrm{SPoA}_{k}(G,W)\geq\frac{\lambda_{\zeta}}{1+\mu_{\zeta}},\leavevmode% \nobreak\ \forall\zeta\in[k].roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_G , italic_W ) ≥ divide start_ARG italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_ARG start_ARG 1 + italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_ARG , ∀ italic_ζ ∈ [ italic_k ] . (6)
Proof.

Let akSNE𝒜superscript𝑎𝑘SNE𝒜a^{k{\rm SNE}}\in\mathcal{A}italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ∈ caligraphic_A denote a k𝑘kitalic_k-strong Nash equilibrium in the system (G,W)𝐺𝑊(G,W)( italic_G , italic_W ) (i.e., satisfying Definition 1), and let aoptargmaxa𝒜W(a)superscript𝑎optsubscriptargmax𝑎𝒜𝑊𝑎a^{\rm opt}\in\operatorname*{arg\,max}_{a\in\mathcal{A}}W(a)italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ∈ start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_W ( italic_a ) denote an optimal joint action. For any ζ[k]𝜁delimited-[]𝑘\zeta\in[k]italic_ζ ∈ [ italic_k ], we have

W(akSNE)𝑊superscript𝑎𝑘SNE\displaystyle W(a^{k{\rm SNE}})italic_W ( italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ) =1(nζ)Γ𝒞ζW(akSNE)absent1binomial𝑛𝜁subscriptΓsubscript𝒞𝜁𝑊superscript𝑎𝑘SNE\displaystyle=\frac{1}{\binom{n}{\zeta}}\sum_{\Gamma\in\mathcal{C}_{\zeta}}W(a% ^{k{\rm SNE}})= divide start_ARG 1 end_ARG start_ARG ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) end_ARG ∑ start_POSTSUBSCRIPT roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ) (7a)
1(nζ)Γ𝒞ζW(aΓopt,aΓkSNE)absent1binomial𝑛𝜁subscriptΓsubscript𝒞𝜁𝑊subscriptsuperscript𝑎optΓsubscriptsuperscript𝑎𝑘SNEΓ\displaystyle\geq\frac{1}{\binom{n}{\zeta}}\sum_{\Gamma\in\mathcal{C}_{\zeta}}% W(a^{\rm opt}_{\Gamma},a^{k{\rm SNE}}_{-\Gamma})≥ divide start_ARG 1 end_ARG start_ARG ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) end_ARG ∑ start_POSTSUBSCRIPT roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT ) (7b)
λζW(aopt)μζW(akSNE).absentsubscript𝜆𝜁𝑊superscript𝑎optsubscript𝜇𝜁𝑊superscript𝑎𝑘SNE\displaystyle\geq\lambda_{\zeta}W(a^{\rm opt})-\mu_{\zeta}W(a^{k{\rm SNE}}).≥ italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) - italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ) . (7c)

Where (7a) holds from |𝒞ζ|=(nζ)subscript𝒞𝜁binomial𝑛𝜁|\mathcal{C}_{\zeta}|=\binom{n}{\zeta}| caligraphic_C start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT | = ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ), (7b) holds from Definition 1, and (7c) holds from Definition 2. Rearranging, we get W(akSNE)/W(aopt)λζ/(1+μζ)𝑊superscript𝑎𝑘SNE𝑊superscript𝑎optsubscript𝜆𝜁1subscript𝜇𝜁W(a^{k{\rm SNE}})/W(a^{\rm opt})\geq\lambda_{\zeta}/(1+\mu_{\zeta})italic_W ( italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ) / italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) ≥ italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT / ( 1 + italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ). ∎

(6) provides k𝑘kitalic_k lower bounds on the k𝑘kitalic_k-strong price of anarchy; accordingly, a k𝑘kitalic_k-strong Nash equilibrium approximates the system optimal at least as well as maxζ[k]{λζ/(1+μζ)}subscript𝜁delimited-[]𝑘subscript𝜆𝜁1subscript𝜇𝜁\max_{\zeta\in[k]}\{\lambda_{\zeta}/(1+\mu_{\zeta})\}roman_max start_POSTSUBSCRIPT italic_ζ ∈ [ italic_k ] end_POSTSUBSCRIPT { italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT / ( 1 + italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ) }. Often, the best lower bound is provided by ζ=k𝜁𝑘\zeta=kitalic_ζ = italic_k; however, this is not true in general. As such, we must consider each of the constraints in (5) to derive the best bounds.

The efficiency bounds of this form are valuable for several reasons, including: 1) they can be used to provide insights on the transient guarantees of various multi-agent dynamics (see Section 4, 2) they easily generalize to broader equilibrium concepts (subject of future work), and 3) if parameters (λ,μ)𝜆𝜇(\lambda,\mu)( italic_λ , italic_μ ) can be shown to satisfy (5) for a set of systems 𝒮𝒮\mathcal{S}caligraphic_S, then each system (G,W)𝒮𝐺𝑊𝒮(G,W)\in\mathcal{S}( italic_G , italic_W ) ∈ caligraphic_S inherits the efficiency guarantee of (6). This last point is particularly pertinent, as system models may be subject to noise, mischaracterizations, or changes over time. If the efficiency guarantee holds across many similar systems, then the guarantees are essentially robust to these issues.

In the Section 3.2, we will provide methods to find coalitional smoothness parameters for classes of resource allocation games via tractable linear programs.

3.2 Resource Allocation Games

In this subsection, we consider the well-studied class of resource allocation games [35, 33, 36, 37, 42]. Consider a set of resources or tasks ={1,,R}1𝑅\mathcal{R}=\{1,\ldots,R\}caligraphic_R = { 1 , … , italic_R }, to which agents are assigned, i.e., agent iN𝑖𝑁i\in Nitalic_i ∈ italic_N selects a subset of these resources as its action aisubscript𝑎𝑖a_{i}\subseteq\mathcal{R}italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⊆ caligraphic_R from a constrained set of subsets 𝒜i2subscript𝒜𝑖superscript2\mathcal{A}_{i}\subseteq 2^{\mathcal{R}}caligraphic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⊆ 2 start_POSTSUPERSCRIPT caligraphic_R end_POSTSUPERSCRIPT. Each resource r𝑟r\in\mathcal{R}italic_r ∈ caligraphic_R has a value vr0subscript𝑣𝑟0v_{r}\geq 0italic_v start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ≥ 0; the welfare contributed by a resource is vrw(|a|r)subscript𝑣𝑟𝑤subscript𝑎𝑟v_{r}w(|a|_{r})italic_v start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_w ( | italic_a | start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ), where w:{0,,n}0:𝑤0𝑛subscriptabsent0w:\{0,\ldots,n\}\rightarrow\mathbb{R}_{\geq 0}italic_w : { 0 , … , italic_n } → blackboard_R start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT captures the added benefit of having multiple agents assigned to the same resources and |a|rsubscript𝑎𝑟|a|_{r}| italic_a | start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT is the number of agents assigned to r𝑟ritalic_r in allocation a𝑎aitalic_a. Assume that w(0)=0𝑤00w(0)=0italic_w ( 0 ) = 0 as no welfare is contributed by resources assigned to zero agents and further that w(y)>0𝑤𝑦0w(y)>0italic_w ( italic_y ) > 0 for all y>0𝑦0y>0italic_y > 0. The system welfare is thus

W(a)=rvrw(|a|r).𝑊𝑎subscript𝑟subscript𝑣𝑟𝑤subscript𝑎𝑟W(a)=\sum_{r\in\mathcal{R}}v_{r}w(|a|_{r}).italic_W ( italic_a ) = ∑ start_POSTSUBSCRIPT italic_r ∈ caligraphic_R end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_w ( | italic_a | start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ) . (8)

For ease of notation, we will refer to the system welfare by the local welfare rule w𝑤witalic_w, noting in the agent-environment G𝐺Gitalic_G, it generates a welfare function W𝑊Witalic_W via (8).

As discussed in 3.1, we wish to find efficiency bounds that hold over a class of resource allocation problems. Let G=(,N,𝒜,{vr}r)𝐺𝑁𝒜subscriptsubscript𝑣𝑟𝑟G=(\mathcal{R},N,\mathcal{A},\{v_{r}\}_{r\in\mathcal{R}})italic_G = ( caligraphic_R , italic_N , caligraphic_A , { italic_v start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_r ∈ caligraphic_R end_POSTSUBSCRIPT ) denote a resource allocation problem, and let 𝒢nsubscript𝒢𝑛\mathcal{G}_{n}caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT denote the set of all such resource allocation problems with at most n𝑛nitalic_n agents. In Proposition 3.2, we propose a tractable linear program whose solution provides parameters (λ,μ)𝜆𝜇(\lambda,\mu)( italic_λ , italic_μ ) which satisfy Definition 2 for every system (G,w)𝒢n×{w}𝐺𝑤subscript𝒢𝑛𝑤(G,w)\in\mathcal{G}_{n}\times\{w\}( italic_G , italic_w ) ∈ caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT × { italic_w }. From Proposition 3.1, this also provides a lower bound on the k𝑘kitalic_k-strong price of anarchy for the class of resource allocation problems with local welfare w𝑤witalic_w.

Proposition 3.2.

Each resource allocation problem (G,w)𝒢n×{w}𝐺𝑤subscript𝒢𝑛𝑤(G,w)\in\mathcal{G}_{n}\times\{w\}( italic_G , italic_w ) ∈ caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT × { italic_w } is (λ,μ)𝜆𝜇(\lambda,\mu)( italic_λ , italic_μ )-k𝑘kitalic_k-coalitionally smooth with λζ=1/νζsubscript𝜆𝜁1superscriptsubscript𝜈𝜁\lambda_{\zeta}=1/\nu_{\zeta}^{\star}italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT = 1 / italic_ν start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT and μζ=ρζ/νζ1subscript𝜇𝜁superscriptsubscript𝜌𝜁superscriptsubscript𝜈𝜁1\mu_{\zeta}=\rho_{\zeta}^{\star}/\nu_{\zeta}^{\star}-1italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT = italic_ρ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT / italic_ν start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT - 1, where (ρζ,νζ)superscriptsubscript𝜌𝜁superscriptsubscript𝜈𝜁(\rho_{\zeta}^{\star},\nu_{\zeta}^{\star})( italic_ρ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , italic_ν start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) is a solution to the linear program (Pζ𝜁\zetaitalic_ζ):

(ρζ,νζ)argminρν0ρsuperscriptsubscript𝜌𝜁superscriptsubscript𝜈𝜁subscriptargmin𝜌𝜈0𝜌\displaystyle(\rho_{\zeta}^{\star},\nu_{\zeta}^{\star})\in\operatorname*{arg\,% min}_{\rho\geq\nu\geq 0}\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode% \nobreak\ \rho( italic_ρ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , italic_ν start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ∈ start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT italic_ρ ≥ italic_ν ≥ 0 end_POSTSUBSCRIPT italic_ρ
s.t.0w(o+x)ρw(e+x)+formulae-sequencest0𝑤𝑜𝑥limit-from𝜌𝑤𝑒𝑥\displaystyle{\rm s.t.}\hskip 8.0pt0\geq w(o+x)-\rho w(e+x)+roman_s . roman_t . 0 ≥ italic_w ( italic_o + italic_x ) - italic_ρ italic_w ( italic_e + italic_x ) +
ν((nζ)w(e+x)0αe0βoα+βζ(eα)(oβ)(neoζαβ)w(e+x+βα))𝜈binomial𝑛𝜁𝑤𝑒𝑥subscript0𝛼𝑒0𝛽𝑜𝛼𝛽𝜁binomial𝑒𝛼binomial𝑜𝛽binomial𝑛𝑒𝑜𝜁𝛼𝛽𝑤𝑒𝑥𝛽𝛼\displaystyle\leavevmode\nobreak\ \leavevmode\nobreak\ \nu\left(\binom{n}{% \zeta}w(e+x)-\sum_{\begin{subarray}{c}0\leq\alpha\leq e\\ 0\leq\beta\leq o\\ \alpha+\beta\leq\zeta\end{subarray}}{e\choose\alpha}{o\choose\beta}\binom{n-e-% o}{\zeta-\alpha-\beta}w(e+x+\beta-\alpha)\right)italic_ν ( ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) italic_w ( italic_e + italic_x ) - ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL 0 ≤ italic_α ≤ italic_e end_CELL end_ROW start_ROW start_CELL 0 ≤ italic_β ≤ italic_o end_CELL end_ROW start_ROW start_CELL italic_α + italic_β ≤ italic_ζ end_CELL end_ROW end_ARG end_POSTSUBSCRIPT ( binomial start_ARG italic_e end_ARG start_ARG italic_α end_ARG ) ( binomial start_ARG italic_o end_ARG start_ARG italic_β end_ARG ) ( FRACOP start_ARG italic_n - italic_e - italic_o end_ARG start_ARG italic_ζ - italic_α - italic_β end_ARG ) italic_w ( italic_e + italic_x + italic_β - italic_α ) )
(e,x,o)for-all𝑒𝑥𝑜\displaystyle\hskip 160.0pt\forall(e,x,o)\in\mathcal{I}∀ ( italic_e , italic_x , italic_o ) ∈ caligraphic_I (Pζ𝜁\zetaitalic_ζ)

The constraints are parameterized by the triples :={(e,x,o)031e+x+on}assignconditional-set𝑒𝑥𝑜superscriptsubscriptabsent031𝑒𝑥𝑜𝑛\mathcal{I}:=\{(e,x,o)\in\mathbb{N}_{\geq 0}^{3}\mid 1\leq e+x+o\leq n\}caligraphic_I := { ( italic_e , italic_x , italic_o ) ∈ blackboard_N start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ∣ 1 ≤ italic_e + italic_x + italic_o ≤ italic_n }. With the possibility of collaboration, an equilibrium becomes more difficult to characterize than in a fully distributed setting. We circumvent this by introducing a parameterization which allows us to generalize the 𝒪(ζ=1k(nζ)mζ)𝒪superscriptsubscript𝜁1𝑘binomial𝑛𝜁superscript𝑚𝜁\mathcal{O}\left(\sum_{\zeta=1}^{k}{n\choose\zeta}m^{\zeta}\right)caligraphic_O ( ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ( binomial start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) italic_m start_POSTSUPERSCRIPT italic_ζ end_POSTSUPERSCRIPT ) comparisons of (3) (where m:=maxiN|𝒜i|assign𝑚subscript𝑖𝑁subscript𝒜𝑖m:=\max_{i\in N}|\mathcal{A}_{i}|italic_m := roman_max start_POSTSUBSCRIPT italic_i ∈ italic_N end_POSTSUBSCRIPT | caligraphic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT |) into 𝒪(n3)𝒪superscript𝑛3\mathcal{O}(n^{3})caligraphic_O ( italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) linear inequalities. Further, satisfying these inequalities provides parameters (λζ,μζ)subscript𝜆𝜁subscript𝜇𝜁(\lambda_{\zeta},\mu_{\zeta})( italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT , italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ) that satisfy Definition 2, leading to (Pζ𝜁\zetaitalic_ζ) as a search for such parameters with the best k𝑘kitalic_k-strong price of anarchy guarantee.

Proof of Proposition 3.2: The proof largely relies on introducing a parameterization that lets us treat (5) as a set of linear constraints. Consider a resource allocation game (G,w)𝒢n×{w}𝐺𝑤subscript𝒢𝑛𝑤(G,w)\in\mathcal{G}_{n}\times\{w\}( italic_G , italic_w ) ∈ caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT × { italic_w } and any two actions a,a𝒜𝑎superscript𝑎𝒜a,a^{\prime}\in\mathcal{A}italic_a , italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A. To each resource r𝑟r\in\mathcal{R}italic_r ∈ caligraphic_R, we assign a label (er,xr,or)subscript𝑒𝑟subscript𝑥𝑟subscript𝑜𝑟(e_{r},x_{r},o_{r})( italic_e start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT , italic_o start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ), where

ersubscript𝑒𝑟\displaystyle e_{r}italic_e start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT =|{iNraiai}|absentconditional-set𝑖𝑁𝑟subscript𝑎𝑖superscriptsubscript𝑎𝑖\displaystyle=\lvert\{i\in N\mid r\in a_{i}\setminus a_{i}^{\prime}\}\rvert= | { italic_i ∈ italic_N ∣ italic_r ∈ italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∖ italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT } |
xrsubscript𝑥𝑟\displaystyle x_{r}italic_x start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT =|{iNraiai}|absentconditional-set𝑖𝑁𝑟subscript𝑎𝑖superscriptsubscript𝑎𝑖\displaystyle=\lvert\{i\in N\mid r\in a_{i}\cap a_{i}^{\prime}\}\rvert= | { italic_i ∈ italic_N ∣ italic_r ∈ italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∩ italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT } |
orsubscript𝑜𝑟\displaystyle o_{r}italic_o start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT =|{iNraiai}|.absentconditional-set𝑖𝑁𝑟superscriptsubscript𝑎𝑖subscript𝑎𝑖\displaystyle=\lvert\{i\in N\mid r\in a_{i}^{\prime}\setminus a_{i}\}\rvert.= | { italic_i ∈ italic_N ∣ italic_r ∈ italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∖ italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } | .

This is to say, ersubscript𝑒𝑟e_{r}italic_e start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT denotes the number of agents utilizing resource r𝑟ritalic_r in joint action a𝑎aitalic_a but not asuperscript𝑎a^{\prime}italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, orsubscript𝑜𝑟o_{r}italic_o start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT is the number that uses resource r𝑟ritalic_r in joint action asuperscript𝑎a^{\prime}italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT but not a𝑎aitalic_a, and xrsubscript𝑥𝑟x_{r}italic_x start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT is the number that uses r𝑟ritalic_r in both a𝑎aitalic_a and asuperscript𝑎a^{\prime}italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. In the set of games 𝒢nsubscript𝒢𝑛\mathcal{G}_{n}caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, let ={(e,x,o)031e+x+on}conditional-set𝑒𝑥𝑜superscriptsubscriptabsent031𝑒𝑥𝑜𝑛\mathcal{I}=\{(e,x,o)\in\mathbb{N}_{\geq 0}^{3}\mid 1\leq e+x+o\leq n\}caligraphic_I = { ( italic_e , italic_x , italic_o ) ∈ blackboard_N start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ∣ 1 ≤ italic_e + italic_x + italic_o ≤ italic_n } denote the set of possible labels, and θ(e,x,o):=r(e,x,o)vr,assign𝜃𝑒𝑥𝑜subscript𝑟subscript𝑒𝑥𝑜subscript𝑣𝑟\theta(e,x,o):=\sum_{r\in\mathcal{R}_{(e,x,o)}}v_{r},italic_θ ( italic_e , italic_x , italic_o ) := ∑ start_POSTSUBSCRIPT italic_r ∈ caligraphic_R start_POSTSUBSCRIPT ( italic_e , italic_x , italic_o ) end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT , where (e,x,o)={rer=e,xr=x,or=o}subscript𝑒𝑥𝑜conditional-set𝑟formulae-sequencesubscript𝑒𝑟𝑒formulae-sequencesubscript𝑥𝑟𝑥subscript𝑜𝑟𝑜\mathcal{R}_{(e,x,o)}=\{r\in\mathcal{R}\mid e_{r}=e,x_{r}=x,o_{r}=o\}caligraphic_R start_POSTSUBSCRIPT ( italic_e , italic_x , italic_o ) end_POSTSUBSCRIPT = { italic_r ∈ caligraphic_R ∣ italic_e start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT = italic_e , italic_x start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT = italic_x , italic_o start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT = italic_o } denotes the set of resources with label (e,x,o)𝑒𝑥𝑜(e,x,o)( italic_e , italic_x , italic_o ). The parameter θ0||𝜃superscriptsubscriptabsent0\theta\in\mathbb{R}_{\geq 0}^{|\mathcal{I}|}italic_θ ∈ blackboard_R start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT | caligraphic_I | end_POSTSUPERSCRIPT is a vector with elements for each label.

We will now express the terms in (5) using this parameterization. Because W(a)=rvrw(|a|r)𝑊𝑎subscript𝑟subscript𝑣𝑟𝑤subscript𝑎𝑟W(a)=\sum_{r\in\mathcal{R}}v_{r}w(|a|_{r})italic_W ( italic_a ) = ∑ start_POSTSUBSCRIPT italic_r ∈ caligraphic_R end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_w ( | italic_a | start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ) depends only on the number of agents utilizing a resource; we can represent |a|r=er+xrsubscript𝑎𝑟subscript𝑒𝑟subscript𝑥𝑟|a|_{r}=e_{r}+x_{r}| italic_a | start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT = italic_e start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT + italic_x start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT and write the system welfare as

W(a)𝑊𝑎\displaystyle W(a)italic_W ( italic_a ) =rvrw(er+xr)absentsubscript𝑟subscript𝑣𝑟𝑤subscript𝑒𝑟subscript𝑥𝑟\displaystyle=\sum_{r\in\mathcal{R}}v_{r}w(e_{r}+x_{r})= ∑ start_POSTSUBSCRIPT italic_r ∈ caligraphic_R end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_w ( italic_e start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT + italic_x start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT )
=(e,x,o)(re,x,ovr)w(e+x)absentsubscript𝑒𝑥𝑜subscript𝑟subscript𝑒𝑥𝑜subscript𝑣𝑟𝑤𝑒𝑥\displaystyle=\sum_{(e,x,o)\in\mathcal{I}}\Bigg{(}\sum_{r\in\mathcal{R}_{e,x,o% }}v_{r}\Bigg{)}w(e+x)= ∑ start_POSTSUBSCRIPT ( italic_e , italic_x , italic_o ) ∈ caligraphic_I end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_r ∈ caligraphic_R start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ) italic_w ( italic_e + italic_x )
=e,x,oθ(e,x,o)w(e+x).absentsubscript𝑒𝑥𝑜𝜃𝑒𝑥𝑜𝑤𝑒𝑥\displaystyle=\sum_{e,x,o}\theta(e,x,o)w(e+x).= ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT italic_θ ( italic_e , italic_x , italic_o ) italic_w ( italic_e + italic_x ) .

When not stated, the sum over (e,x,o)𝑒𝑥𝑜(e,x,o)( italic_e , italic_x , italic_o ) is implied to be for each label in \mathcal{I}caligraphic_I. Similar steps can be followed to show W(a)=e,x,oθ(e,x,o)w(o+x)𝑊superscript𝑎subscript𝑒𝑥𝑜𝜃𝑒𝑥𝑜𝑤𝑜𝑥W(a^{\prime})=\sum_{e,x,o}\theta(e,x,o)w(o+x)italic_W ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT italic_θ ( italic_e , italic_x , italic_o ) italic_w ( italic_o + italic_x ).

Finally, the term Γ𝒞ζW(aΓ,aΓ)subscriptΓsubscript𝒞𝜁𝑊superscriptsubscript𝑎Γsubscript𝑎Γ\sum_{\Gamma\in\mathcal{C}_{\zeta}}W(a_{\Gamma}^{\prime},a_{-\Gamma})∑ start_POSTSUBSCRIPT roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_a start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT ) can similarly be transcribed by this parameterization:

Γ𝒞ζW(aΓ,aΓ)subscriptΓsubscript𝒞𝜁𝑊subscriptsuperscript𝑎Γsubscript𝑎Γ\displaystyle\sum_{\Gamma\in\mathcal{C}_{\zeta}}W(a^{\prime}_{\Gamma},a_{-% \Gamma})∑ start_POSTSUBSCRIPT roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT )
=Γ𝒞ζe,x,or(e,x,o)vrw(|aΓ,aΓ|r)\displaystyle=\sum_{\Gamma\in\mathcal{C}_{\zeta}}\sum_{e,x,o}\sum_{r\in% \mathcal{R}_{(e,x,o)}}v_{r}w(|a^{\prime}_{\Gamma},a_{-\Gamma}|_{r})= ∑ start_POSTSUBSCRIPT roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_r ∈ caligraphic_R start_POSTSUBSCRIPT ( italic_e , italic_x , italic_o ) end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_w ( | italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT | start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT )
=e,x,or(e,x,o)vrΓ𝒞ζw(|aΓ,aΓ|r)\displaystyle=\sum_{e,x,o}\sum_{r\in\mathcal{R}_{(e,x,o)}}v_{r}\sum_{\Gamma\in% \mathcal{C}_{\zeta}}w(|a^{\prime}_{\Gamma},a_{-\Gamma}|_{r})= ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_r ∈ caligraphic_R start_POSTSUBSCRIPT ( italic_e , italic_x , italic_o ) end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_w ( | italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT | start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT )
=e,x,or(e,x,o)vr0αe0βoα+βζ(eα)(oβ)(neoζαβ)w(e+x+βα)absentsubscript𝑒𝑥𝑜subscript𝑟subscript𝑒𝑥𝑜subscript𝑣𝑟subscript0𝛼𝑒0𝛽𝑜𝛼𝛽𝜁binomial𝑒𝛼binomial𝑜𝛽binomial𝑛𝑒𝑜𝜁𝛼𝛽𝑤𝑒𝑥𝛽𝛼\displaystyle=\sum_{e,x,o}\sum_{r\in\mathcal{R}_{(e,x,o)}}v_{r}\sum_{\begin{% subarray}{c}0\leq\alpha\leq e\\ 0\leq\beta\leq o\\ \alpha+\beta\leq\zeta\end{subarray}}{e\choose\alpha}{o\choose\beta}\binom{n-e-% o}{\zeta-\alpha-\beta}w(e+x+\beta-\alpha)= ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_r ∈ caligraphic_R start_POSTSUBSCRIPT ( italic_e , italic_x , italic_o ) end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL 0 ≤ italic_α ≤ italic_e end_CELL end_ROW start_ROW start_CELL 0 ≤ italic_β ≤ italic_o end_CELL end_ROW start_ROW start_CELL italic_α + italic_β ≤ italic_ζ end_CELL end_ROW end_ARG end_POSTSUBSCRIPT ( binomial start_ARG italic_e end_ARG start_ARG italic_α end_ARG ) ( binomial start_ARG italic_o end_ARG start_ARG italic_β end_ARG ) ( FRACOP start_ARG italic_n - italic_e - italic_o end_ARG start_ARG italic_ζ - italic_α - italic_β end_ARG ) italic_w ( italic_e + italic_x + italic_β - italic_α )
=e,x,oθ(e,x,o)0αe0βoα+βζ(eα)(oβ)(neoζαβ)w(e+x+βα)absentsubscript𝑒𝑥𝑜𝜃𝑒𝑥𝑜subscript0𝛼𝑒0𝛽𝑜𝛼𝛽𝜁binomial𝑒𝛼binomial𝑜𝛽binomial𝑛𝑒𝑜𝜁𝛼𝛽𝑤𝑒𝑥𝛽𝛼\displaystyle=\sum_{e,x,o}\theta(e,x,o)\sum_{\begin{subarray}{c}0\leq\alpha% \leq e\\ 0\leq\beta\leq o\\ \alpha+\beta\leq\zeta\end{subarray}}{e\choose\alpha}{o\choose\beta}\binom{n-e-% o}{\zeta-\alpha-\beta}w(e+x+\beta-\alpha)= ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT italic_θ ( italic_e , italic_x , italic_o ) ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL 0 ≤ italic_α ≤ italic_e end_CELL end_ROW start_ROW start_CELL 0 ≤ italic_β ≤ italic_o end_CELL end_ROW start_ROW start_CELL italic_α + italic_β ≤ italic_ζ end_CELL end_ROW end_ARG end_POSTSUBSCRIPT ( binomial start_ARG italic_e end_ARG start_ARG italic_α end_ARG ) ( binomial start_ARG italic_o end_ARG start_ARG italic_β end_ARG ) ( FRACOP start_ARG italic_n - italic_e - italic_o end_ARG start_ARG italic_ζ - italic_α - italic_β end_ARG ) italic_w ( italic_e + italic_x + italic_β - italic_α )

where the set of coalitions 𝒞ζsubscript𝒞𝜁\mathcal{C}_{\zeta}caligraphic_C start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT was partitioned according to the action profile of the agents in each coalition. We let α𝛼\alphaitalic_α denote the number of agents in ΓΓ\Gammaroman_Γ that utilize resource r𝑟ritalic_r only in joint action a𝑎aitalic_a and β𝛽\betaitalic_β the number of agents in ΓΓ\Gammaroman_Γ that utilize r𝑟ritalic_r only in joint action asuperscript𝑎a^{\prime}italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. By simple counting arguments, there are exactly (eα)(oβ)(neoζαβ)binomial𝑒𝛼binomial𝑜𝛽binomial𝑛𝑒𝑜𝜁𝛼𝛽{e\choose\alpha}{o\choose\beta}\binom{n-e-o}{\zeta-\alpha-\beta}( binomial start_ARG italic_e end_ARG start_ARG italic_α end_ARG ) ( binomial start_ARG italic_o end_ARG start_ARG italic_β end_ARG ) ( FRACOP start_ARG italic_n - italic_e - italic_o end_ARG start_ARG italic_ζ - italic_α - italic_β end_ARG ) coalitions grouped with the same α𝛼\alphaitalic_α and β𝛽\betaitalic_β. This decomposition is possible as the number of agents utilizing resource r𝑟ritalic_r after a group ΓΓ\Gammaroman_Γ deviates is precisely e+x+βα𝑒𝑥𝛽𝛼e+x+\beta-\alphaitalic_e + italic_x + italic_β - italic_α.

The smoothness constraint (5) is satisfied only if

1(nζ)e,x,oθ(e,x,o)0αe0βoα+βζ(eα)(oβ)(neoζαβ)w(e+x+βα)λζe,x,oθ(e,x,o)w(o+x)μζe,x,oθ(e,x,o)w(e+x).1binomial𝑛𝜁subscript𝑒𝑥𝑜𝜃𝑒𝑥𝑜subscript0𝛼𝑒0𝛽𝑜𝛼𝛽𝜁binomial𝑒𝛼binomial𝑜𝛽binomial𝑛𝑒𝑜𝜁𝛼𝛽𝑤𝑒𝑥𝛽𝛼subscript𝜆𝜁subscript𝑒𝑥𝑜𝜃𝑒𝑥𝑜𝑤𝑜𝑥subscript𝜇𝜁subscript𝑒𝑥𝑜𝜃𝑒𝑥𝑜𝑤𝑒𝑥\frac{1}{\binom{n}{\zeta}}\sum_{e,x,o}\theta(e,x,o)\sum_{\begin{subarray}{c}0% \leq\alpha\leq e\\ 0\leq\beta\leq o\\ \alpha+\beta\leq\zeta\end{subarray}}{e\choose\alpha}{o\choose\beta}\binom{n-e-% o}{\zeta-\alpha-\beta}w(e+x+\beta-\alpha)\\ \geq\lambda_{\zeta}\sum_{e,x,o}\theta(e,x,o)w(o+x)-\mu_{\zeta}\sum_{e,x,o}% \theta(e,x,o)w(e+x).start_ROW start_CELL divide start_ARG 1 end_ARG start_ARG ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) end_ARG ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT italic_θ ( italic_e , italic_x , italic_o ) ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL 0 ≤ italic_α ≤ italic_e end_CELL end_ROW start_ROW start_CELL 0 ≤ italic_β ≤ italic_o end_CELL end_ROW start_ROW start_CELL italic_α + italic_β ≤ italic_ζ end_CELL end_ROW end_ARG end_POSTSUBSCRIPT ( binomial start_ARG italic_e end_ARG start_ARG italic_α end_ARG ) ( binomial start_ARG italic_o end_ARG start_ARG italic_β end_ARG ) ( FRACOP start_ARG italic_n - italic_e - italic_o end_ARG start_ARG italic_ζ - italic_α - italic_β end_ARG ) italic_w ( italic_e + italic_x + italic_β - italic_α ) end_CELL end_ROW start_ROW start_CELL ≥ italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT italic_θ ( italic_e , italic_x , italic_o ) italic_w ( italic_o + italic_x ) - italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT italic_θ ( italic_e , italic_x , italic_o ) italic_w ( italic_e + italic_x ) . end_CELL end_ROW

As θ(e,x,o)0𝜃𝑒𝑥𝑜0\theta(e,x,o)\geq 0italic_θ ( italic_e , italic_x , italic_o ) ≥ 0 for all (e,x,o)𝑒𝑥𝑜(e,x,o)\in\mathcal{I}( italic_e , italic_x , italic_o ) ∈ caligraphic_I, it is sufficient to satisfy

1(nζ)0αe0βoα+βζ(eα)(oβ)(neoζαβ)w(e+x+βα)λζw(o+x)μζw(e+x),(e,x,o).\frac{1}{\binom{n}{\zeta}}\sum_{\begin{subarray}{c}0\leq\alpha\leq e\\ 0\leq\beta\leq o\\ \alpha+\beta\leq\zeta\end{subarray}}{e\choose\alpha}{o\choose\beta}\binom{n-e-% o}{\zeta-\alpha-\beta}w(e+x+\beta-\alpha)\\ \geq\lambda_{\zeta}w(o+x)-\mu_{\zeta}w(e+x),\quad\forall(e,x,o)\in\mathcal{I}.start_ROW start_CELL divide start_ARG 1 end_ARG start_ARG ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) end_ARG ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL 0 ≤ italic_α ≤ italic_e end_CELL end_ROW start_ROW start_CELL 0 ≤ italic_β ≤ italic_o end_CELL end_ROW start_ROW start_CELL italic_α + italic_β ≤ italic_ζ end_CELL end_ROW end_ARG end_POSTSUBSCRIPT ( binomial start_ARG italic_e end_ARG start_ARG italic_α end_ARG ) ( binomial start_ARG italic_o end_ARG start_ARG italic_β end_ARG ) ( FRACOP start_ARG italic_n - italic_e - italic_o end_ARG start_ARG italic_ζ - italic_α - italic_β end_ARG ) italic_w ( italic_e + italic_x + italic_β - italic_α ) end_CELL end_ROW start_ROW start_CELL ≥ italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_w ( italic_o + italic_x ) - italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_w ( italic_e + italic_x ) , ∀ ( italic_e , italic_x , italic_o ) ∈ caligraphic_I . end_CELL end_ROW (9)

Observe that (9) is independent of a𝑎aitalic_a, asuperscript𝑎a^{\prime}italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, and G𝐺Gitalic_G. As such, this set of constraints serves as a sufficient condition that any G𝒢n𝐺subscript𝒢𝑛G\in\mathcal{G}_{n}italic_G ∈ caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT satisfies (5) for all respective a,a𝒜𝑎superscript𝑎𝒜a,a^{\prime}\in\mathcal{A}italic_a , italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A.

To find parameters λζsubscript𝜆𝜁\lambda_{\zeta}italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT and μζsubscript𝜇𝜁\mu_{\zeta}italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT that provide the best k𝑘kitalic_k-strong price of anarchy guarantee, we formulate the following optimization problem:

maxλζ,μζ0subscriptsubscript𝜆𝜁subscript𝜇𝜁0\displaystyle\max_{\lambda_{\zeta},\mu_{\zeta}\geq 0}\quadroman_max start_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT , italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT λζ1+μζsubscript𝜆𝜁1subscript𝜇𝜁\displaystyle\frac{\lambda_{\zeta}}{1+\mu_{\zeta}}divide start_ARG italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_ARG start_ARG 1 + italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_ARG (P1ζ𝜁\zetaitalic_ζ)
s.t.formulae-sequencest\displaystyle{\rm s.t.}\quadroman_s . roman_t . (9)italic-(9italic-)\displaystyle\leavevmode\nobreak\ \leavevmode\nobreak\ \eqref{eq:param_constraint}italic_( italic_)

We restrict λζsubscript𝜆𝜁\lambda_{\zeta}italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT to be non-negative, though this constraint is not active except in degenerate cases. Finally, we transform (P1ζ𝜁\zetaitalic_ζ) by substituting new decision variables ρ=(1+μζ)/λζ𝜌1subscript𝜇𝜁subscript𝜆𝜁\rho=(1+\mu_{\zeta})/\lambda_{\zeta}italic_ρ = ( 1 + italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ) / italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT and ν=1/((nζ)λζ)0𝜈1binomial𝑛𝜁subscript𝜆𝜁0\nu=1/\left(\binom{n}{\zeta}\lambda_{\zeta}\right)\geq 0italic_ν = 1 / ( ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ) ≥ 0. The new objective becomes 1/ρ1𝜌1/\rho1 / italic_ρ. Note that the constraint (e,x,o)=(1,0,0)𝑒𝑥𝑜100(e,x,o)=(1,0,0)( italic_e , italic_x , italic_o ) = ( 1 , 0 , 0 ) implies ρ0𝜌0\rho\geq 0italic_ρ ≥ 0; we can thus invert the objective and change the minimization to a maximization, giving (Pζ𝜁\zetaitalic_ζ). ∎

P=superscript𝑃absent\displaystyle P^{\star}=italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT = minρ,{νζ0}ζ[k]subscript𝜌subscriptsubscript𝜈𝜁0𝜁delimited-[]𝑘\displaystyle\min_{\rho,\{\nu_{\zeta}\geq 0\}_{\zeta\in[k]}}roman_min start_POSTSUBSCRIPT italic_ρ , { italic_ν start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ≥ 0 } start_POSTSUBSCRIPT italic_ζ ∈ [ italic_k ] end_POSTSUBSCRIPT end_POSTSUBSCRIPT ρ𝜌\displaystyle\rhoitalic_ρ
s.t.formulae-sequencest\displaystyle\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ {\rm s.t.}roman_s . roman_t . 0w(o+x)ρw(e+x)+ζ[k]νζ((nζ)w(e+x)0αe0βoα+βζ(eα)(oβ)(neoζαβ)w(e+x+βα))0𝑤𝑜𝑥𝜌𝑤𝑒𝑥subscript𝜁delimited-[]𝑘subscript𝜈𝜁binomial𝑛𝜁𝑤𝑒𝑥subscript0𝛼𝑒0𝛽𝑜𝛼𝛽𝜁binomial𝑒𝛼binomial𝑜𝛽binomial𝑛𝑒𝑜𝜁𝛼𝛽𝑤𝑒𝑥𝛽𝛼\displaystyle 0\geq w(o+x)-\rho w(e+x)+\sum_{\zeta\in[k]}\nu_{\zeta}\left(% \binom{n}{\zeta}w(e+x)-\sum_{\begin{subarray}{c}0\leq\alpha\leq e\\ 0\leq\beta\leq o\\ \alpha+\beta\leq\zeta\end{subarray}}{e\choose\alpha}{o\choose\beta}\binom{n-e-% o}{\zeta-\alpha-\beta}w(e+x+\beta-\alpha)\right)0 ≥ italic_w ( italic_o + italic_x ) - italic_ρ italic_w ( italic_e + italic_x ) + ∑ start_POSTSUBSCRIPT italic_ζ ∈ [ italic_k ] end_POSTSUBSCRIPT italic_ν start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ( ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) italic_w ( italic_e + italic_x ) - ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL 0 ≤ italic_α ≤ italic_e end_CELL end_ROW start_ROW start_CELL 0 ≤ italic_β ≤ italic_o end_CELL end_ROW start_ROW start_CELL italic_α + italic_β ≤ italic_ζ end_CELL end_ROW end_ARG end_POSTSUBSCRIPT ( binomial start_ARG italic_e end_ARG start_ARG italic_α end_ARG ) ( binomial start_ARG italic_o end_ARG start_ARG italic_β end_ARG ) ( FRACOP start_ARG italic_n - italic_e - italic_o end_ARG start_ARG italic_ζ - italic_α - italic_β end_ARG ) italic_w ( italic_e + italic_x + italic_β - italic_α ) )
(e,x,o)for-all𝑒𝑥𝑜\displaystyle\hskip 303.53377pt\forall(e,x,o)\in\mathcal{I}∀ ( italic_e , italic_x , italic_o ) ∈ caligraphic_I (P[k]delimited-[]𝑘[k][ italic_k ])

The smoothness parameters found via Proposition 3.2 can be used with Proposition 3.1 to generate lower bounds on the k𝑘kitalic_k-strong price of anarchy. However, these bounds need not be tight, i.e., there may be no system in the class 𝒢n×{w}subscript𝒢𝑛𝑤\mathcal{G}_{n}\times\{w\}caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT × { italic_w } that attains this inefficiency, and better bounds may be possible. To study what efficiency we can guarantee across a class of resource allocation problems, we define the k𝑘kitalic_k-strong price of anarchy bound for (𝒢n,w)subscript𝒢𝑛𝑤(\mathcal{G}_{n},w)( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w ) as

SPoAk(𝒢n,w)=minG𝒢nSPoAk(G,w).subscriptSPoA𝑘subscript𝒢𝑛𝑤subscript𝐺subscript𝒢𝑛subscriptSPoA𝑘𝐺𝑤\mathrm{SPoA}_{k}(\mathcal{G}_{n},w)=\min_{G\in\mathcal{G}_{n}}\mathrm{SPoA}_{% k}(G,w).roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w ) = roman_min start_POSTSUBSCRIPT italic_G ∈ caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_G , italic_w ) . (10)

This performance ratio is parameterized by our choice of welfare function w𝑤witalic_w and the size of collaborative coalitions k𝑘kitalic_k. In Theorem 3.3, we provide a linear program whose value provides an exact value of SPoAk(𝒢n,w)subscriptSPoA𝑘subscript𝒢𝑛𝑤\mathrm{SPoA}_{k}(\mathcal{G}_{n},w)roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w ). We do this by showing that the constraints of the k𝑘kitalic_k linear programs in Proposition 3.2 can be combined to give an exact quantification of the k𝑘kitalic_k-strong price of anarchy bound.

Theorem 3.3.

For the class of resource allocation problems 𝒢nsubscript𝒢𝑛\mathcal{G}_{n}caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT with welfare function w𝑤witalic_w, when groups maximize the common interest welfare, then

SPoAk(𝒢n,w)=1/P(n,w,k),subscriptSPoA𝑘subscript𝒢𝑛𝑤1superscript𝑃𝑛𝑤𝑘\mathrm{SPoA}_{k}(\mathcal{G}_{n},w)=1/P^{\star}(n,w,k),roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w ) = 1 / italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_n , italic_w , italic_k ) , (11)

where P(n,w,k)superscript𝑃𝑛𝑤𝑘P^{\star}(n,w,k)italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_n , italic_w , italic_k ) is the solution to (P[k]delimited-[]𝑘[k][ italic_k ]).

The proof appears in the appendix.

In Fig. 3, we consider four welfare functions and plot the tight bounds on the k𝑘kitalic_k-strong price of anarchy for 1kn1𝑘𝑛1\leq k\leq n1 ≤ italic_k ≤ italic_n. As expected, we observe that increased communication improves efficiency guarantees; the amount of this increase is useful in determining the benefits of inter-agent communication/collaboration. However, this collaboration comes at a cost; in Section 4, we will study the complexity of distributed dynamics reaching k𝑘kitalic_k-strong Nash equilibria.

4 Coalitional Dynamics

Section 3 provided several tools for quantifying the efficiency guarantees of k𝑘kitalic_k-strong Nash equilibria. In this section, we will study the qualities of group-based dynamics that reach these equilibria. In particular, we will discuss the convergence rate and transient performance when agents follow the Coalitional round-robin and Asynchronous Best Response, respectively. We will denote atsuperscript𝑎𝑡a^{t}italic_a start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT as the joint action occurring at time t𝑡t\in\mathbb{N}italic_t ∈ blackboard_N and ΓtNsuperscriptΓ𝑡𝑁\Gamma^{t}\subseteq Nroman_Γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ⊆ italic_N as the group of agents updating their action at time t𝑡titalic_t.

4.1 Round Robin

We first consider the k𝑘kitalic_k-coalitional round robin agent dynamics, in which each group of k𝑘kitalic_k agents updates their actions sequentially, following a set order σΣ(nk)𝜎subscriptΣbinomial𝑛𝑘\sigma\in\Sigma_{\binom{n}{k}}italic_σ ∈ roman_Σ start_POSTSUBSCRIPT ( FRACOP start_ARG italic_n end_ARG start_ARG italic_k end_ARG ) end_POSTSUBSCRIPT, where σ(z)𝜎𝑧\sigma(z)italic_σ ( italic_z ) for z{1,,(nk)}𝑧1binomial𝑛𝑘z\in\{1,\ldots,\binom{n}{k}\}italic_z ∈ { 1 , … , ( FRACOP start_ARG italic_n end_ARG start_ARG italic_k end_ARG ) } is the index of a group Γ𝒞[k]Γsubscript𝒞delimited-[]𝑘\Gamma\in\mathcal{C}_{[k]}roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT [ italic_k ] end_POSTSUBSCRIPT. We will call a round one pass through σ𝜎\sigmaitalic_σ in which each group updates their action. At their turn, the group ΓtsuperscriptΓ𝑡\Gamma^{t}roman_Γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT selects their best response to the current action, i.e., aΓtt+1argmaxaΓt𝒜ΓtW(aΓt,aΓt)superscriptsubscript𝑎superscriptΓ𝑡𝑡1subscriptargmaxsubscript𝑎superscriptΓ𝑡subscript𝒜superscriptΓ𝑡𝑊subscript𝑎superscriptΓ𝑡subscript𝑎superscriptΓ𝑡a_{\Gamma^{t}}^{t+1}\in\operatorname*{arg\,max}_{a_{\Gamma^{t}}\in\mathcal{A}_% {\Gamma^{t}}}W(a_{\Gamma^{t}},a_{-\Gamma^{t}})italic_a start_POSTSUBSCRIPT roman_Γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ∈ start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT roman_Γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_A start_POSTSUBSCRIPT roman_Γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUBSCRIPT roman_Γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT - roman_Γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ), where ties are broken uniformly at random unless aΓttargmaxaΓt𝒜ΓtW(aΓt,aΓt)superscriptsubscript𝑎superscriptΓ𝑡𝑡subscriptargmaxsubscript𝑎superscriptΓ𝑡subscript𝒜superscriptΓ𝑡𝑊subscript𝑎superscriptΓ𝑡subscript𝑎superscriptΓ𝑡a_{\Gamma^{t}}^{t}\in\operatorname*{arg\,max}_{a_{\Gamma^{t}}\in\mathcal{A}_{% \Gamma^{t}}}W(a_{\Gamma^{t}},a_{-\Gamma^{t}})italic_a start_POSTSUBSCRIPT roman_Γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∈ start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT roman_Γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_A start_POSTSUBSCRIPT roman_Γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUBSCRIPT roman_Γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT - roman_Γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ), in which case the group selects their current action aΓtt+1=aΓttsuperscriptsubscript𝑎superscriptΓ𝑡𝑡1superscriptsubscript𝑎superscriptΓ𝑡𝑡a_{\Gamma^{t}}^{t+1}=a_{\Gamma^{t}}^{t}italic_a start_POSTSUBSCRIPT roman_Γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT = italic_a start_POSTSUBSCRIPT roman_Γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT. The dynamics are more formally described in Algorithm 1.

Algorithm 1 k𝑘kitalic_k-Round-Robin Dynamics
procedure k𝑘kitalic_kRoundRobin(W,𝒜,N,σ,a𝑊𝒜𝑁𝜎𝑎W,\mathcal{A},N,\sigma,aitalic_W , caligraphic_A , italic_N , italic_σ , italic_a)
     a¯¯𝑎absent\overline{a}\leftarrowover¯ start_ARG italic_a end_ARG ← NULL
     while a¯a¯𝑎𝑎\overline{a}\neq aover¯ start_ARG italic_a end_ARG ≠ italic_a do
         a¯a¯𝑎𝑎\overline{a}\leftarrow aover¯ start_ARG italic_a end_ARG ← italic_a
         for z{1,,(nk)}𝑧1binomial𝑛𝑘z\in\{1,\ldots,\binom{n}{k}\}italic_z ∈ { 1 , … , ( FRACOP start_ARG italic_n end_ARG start_ARG italic_k end_ARG ) } do
              Γ𝒞k(σ(z))Γsubscript𝒞𝑘𝜎𝑧\Gamma\leftarrow\mathcal{C}_{k}(\sigma(z))roman_Γ ← caligraphic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_σ ( italic_z ) ) \triangleright Get group
              for aΓ+𝒜ΓaΓsuperscriptsubscript𝑎Γsubscript𝒜Γsubscript𝑎Γa_{\Gamma}^{+}\in\mathcal{A}_{\Gamma}\setminus a_{\Gamma}italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ∈ caligraphic_A start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT ∖ italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT do \triangleright Group deviations
                  if W(aΓ+,aΓ)>W(a)𝑊superscriptsubscript𝑎Γsubscript𝑎Γ𝑊𝑎W(a_{\Gamma}^{+},a_{-\Gamma})>W(a)italic_W ( italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT , italic_a start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT ) > italic_W ( italic_a ) then
                       a(aΓ+,aΓ)𝑎superscriptsubscript𝑎Γsubscript𝑎Γa\leftarrow(a_{\Gamma}^{+},a_{-\Gamma})italic_a ← ( italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT , italic_a start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT )                                               

These dynamics are synchronous (in that agents must follow a set order) but provide an understanding of how groups of agents can make decisions in a localized manner, and we can analyze the equilibrium hitting time. In the fully distributed setting (k=1𝑘1k=1italic_k = 1), it has been shown that these dynamics reach a Nash equilibrium in finite time and require 𝒪(mn)𝒪superscript𝑚𝑛\mathcal{O}(m^{n})caligraphic_O ( italic_m start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) welfare evaluations [43]. In Proposition 4.1, we find that in the coalitional settings, we maintain the finite convergence time and incur a small base exponential gain in the number of welfare comparisons required. Recent work has shown that the examples that realize these worst-case hitting times are fragile and that equilibria can be computed in polynomial-time under smoothed running-time analysis [44]. As a first step, we consider the worst-case run time, but the authors believe that similar findings on the added complexity of group decision-making will hold under smoothed running-time analysis, though this is the subject of ongoing work. Recall n=|N|𝑛𝑁n=|N|italic_n = | italic_N | and m:=maxiN|𝒜i|assign𝑚subscript𝑖𝑁subscript𝒜𝑖m:=\max_{i\in N}|\mathcal{A}_{i}|italic_m := roman_max start_POSTSUBSCRIPT italic_i ∈ italic_N end_POSTSUBSCRIPT | caligraphic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT |.

Proposition 4.1.

The k𝑘kitalic_k-Coalitional-Round-Robin dynamics converge in finite time and requires 𝒪(mn(111/m)k)𝒪superscript𝑚𝑛superscript111𝑚𝑘\mathcal{O}\left(m^{n}\left(\frac{1}{1-\nicefrac{{1}}{{m}}}\right)^{k}\right)caligraphic_O ( italic_m start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( divide start_ARG 1 end_ARG start_ARG 1 - / start_ARG 1 end_ARG start_ARG italic_m end_ARG end_ARG ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ) welfare evaluations.

Proof.

First, we verify that the output of Algorithm 1 is a k𝑘kitalic_k-strong Nash equilibrium, then we consider how long it takes Algorithm 1 to converge. Algorithm 1 terminates after a round in which no group Γ𝒞kΓsubscript𝒞𝑘\Gamma\in\mathcal{C}_{k}roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT can select a new action in which the welfare increases, i.e., W(a)W(aΓ,aΓ)𝑊𝑎𝑊subscript𝑎Γsubscript𝑎ΓW(a)\geq W(a_{\Gamma},a_{-\Gamma})italic_W ( italic_a ) ≥ italic_W ( italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT ) for all aΓ𝒜Γsubscript𝑎Γsubscript𝒜Γa_{\Gamma}\in\mathcal{A}_{\Gamma}italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT ∈ caligraphic_A start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT and Γ𝒞kΓsubscript𝒞𝑘\Gamma\in\mathcal{C}_{k}roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT where a𝑎aitalic_a is the output of Algorithm 1. A deviation for a any subgroup Γ𝒞[k]superscriptΓsubscript𝒞delimited-[]𝑘\Gamma^{\prime}\in\mathcal{C}_{[k]}roman_Γ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUBSCRIPT [ italic_k ] end_POSTSUBSCRIPT is subsumed by the joint action (aΓ,aΓΓ)𝒜Γsubscript𝑎superscriptΓsubscript𝑎ΓsuperscriptΓsubscript𝒜Γ(a_{\Gamma^{\prime}},a_{\Gamma\setminus\Gamma^{\prime}})\in\mathcal{A}_{\Gamma}( italic_a start_POSTSUBSCRIPT roman_Γ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT roman_Γ ∖ roman_Γ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) ∈ caligraphic_A start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT. As such, a state a𝑎aitalic_a terminates Algorithm 1 if and only if it satisfies (3) and is a k𝑘kitalic_k-strong Nash equilibrium.

Without loss of generality, we assume each agent possesses m𝑚mitalic_m actions; for each agent, i𝑖iitalic_i that has fewer actions, assign m|𝒜i|𝑚subscript𝒜𝑖m-|\mathcal{A}_{i}|italic_m - | caligraphic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | dummy actions with minimum welfare. In one round of the k𝑘kitalic_k-Round-Robin dynamics, each group of agents is given the opportunity to deviate their action. First, we note that no group ΓΓ\Gammaroman_Γ will respond to the same complimentary group action aΓsubscript𝑎Γa_{-\Gamma}italic_a start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT in two consecutive rounds unless a𝑎aitalic_a is a k𝑘kitalic_k-strong Nash equilibrium. If the group ΓΓ\Gammaroman_Γ rejects a group action aΓsubscript𝑎Γa_{\Gamma}italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT in response to aΓsubscript𝑎Γa_{-\Gamma}italic_a start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT, the joint action (aΓ,aΓ)subscript𝑎Γsubscript𝑎Γ(a_{\Gamma},a_{-\Gamma})( italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT ) is eliminated from consideration as an output of Algorithm 1. Accounting for overlaps between the groups, in any round that does not start in a k𝑘kitalic_k-strong Nash equilibrium, at least y=ζ=1k(nζ)(m1)ζ,𝑦superscriptsubscript𝜁1𝑘binomial𝑛𝜁superscript𝑚1𝜁y=\sum_{\zeta=1}^{k}\binom{n}{\zeta}(m-1)^{\zeta},italic_y = ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) ( italic_m - 1 ) start_POSTSUPERSCRIPT italic_ζ end_POSTSUPERSCRIPT , joint actions are eliminated as possible outputs of Algorithm 1. As there are mnsuperscript𝑚𝑛m^{n}italic_m start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT joint actions in total, there can be at most rmny+1𝑟superscript𝑚𝑛𝑦1r\leq\lfloor\frac{m^{n}}{y}\rfloor+1italic_r ≤ ⌊ divide start_ARG italic_m start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_ARG start_ARG italic_y end_ARG ⌋ + 1 rounds that do not start in a k𝑘kitalic_k-strong Nash equilibrium; this proves the finite convergence time. In each round, there are exactly (nk)mkbinomial𝑛𝑘superscript𝑚𝑘\binom{n}{k}m^{k}( FRACOP start_ARG italic_n end_ARG start_ARG italic_k end_ARG ) italic_m start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT welfare checks; thus, the total number of welfare checks is no more than (mny+1)(nk)mksuperscript𝑚𝑛𝑦1binomial𝑛𝑘superscript𝑚𝑘(\frac{m^{n}}{y}+1)\binom{n}{k}m^{k}( divide start_ARG italic_m start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_ARG start_ARG italic_y end_ARG + 1 ) ( FRACOP start_ARG italic_n end_ARG start_ARG italic_k end_ARG ) italic_m start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT. Removing lower order terms from y𝑦yitalic_y gives the stated bound. ∎

From Proposition 4.1, we observe two things: 1) the coalitional dynamics do not require drastically more welfare evaluations than the fully distributed round-robin, but 2) the convergence rate is slow regardless of k𝑘kitalic_k. In light of this, we turn our focus to understanding the transient performance of collaborative decision-making dynamics. Further, in many settings, it is desirable to allow agents or groups to update their actions asynchronously. In Section 4.2, we will consider both of these factors in the asynchronous best response dynamics.

4.2 Asynchronous Best-Response Dynamics

Motivated by settings where agents (or groups of agents) perform action revisions asynchronously or on their own time scales, we consider a dynamical system where the next group of agents to update is random.

We define the Asynchronous k𝑘kitalic_k-Coalitional Best-Response Dynamics as follows: let t0𝑡0t\geq 0italic_t ≥ 0 denote the number of agent (or group) updates that have yet occurred111Counting time steps in terms of the number of updates subsumes cases where agents (or groups) update with respect to individual and independent random clocks. The rate of each clock is analogous to the selection probability for different groups.. The updating group ΓtsuperscriptΓ𝑡\Gamma^{t}roman_Γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT is selected at random, such that the size of the group ζ𝜁\zetaitalic_ζ is picked with probability pζ=𝔼[|Γt|=ζ]subscript𝑝𝜁𝔼delimited-[]superscriptΓ𝑡𝜁p_{\zeta}=\mathbb{E}[|\Gamma^{t}|=\zeta]italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT = blackboard_E [ | roman_Γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT | = italic_ζ ] and the specific agents in the group are drawn uniformly at random. Once formed, the updating group, ΓtsuperscriptΓ𝑡\Gamma^{t}roman_Γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT, chooses their best response in the same manner as the coalitional round robin described in Section 4.1.

From their distributed decision-making and asynchronicity, these dynamics capture the behavior of real-time multi-agent systems components. In Theorem 4.2, we show these dynamics converge almost surely to a k𝑘kitalic_k-strong Nash equilibrium, and further, if the system is (λ,μ)𝜆𝜇(\lambda,\mu)( italic_λ , italic_μ )-k𝑘kitalic_k-coalitionally smooth, we provide a bound on the cumulative welfare relative to the optimal.

Theorem 4.2.

The Asynchronous k𝑘kitalic_k-Coalitional Best-Response Dynamics converge almost surely to the set of k𝑘kitalic_k-strong Nash equilibrium. Further, if (G,W)𝐺𝑊(G,W)( italic_G , italic_W ) is a (λ,μ)𝜆𝜇(\lambda,\mu)( italic_λ , italic_μ )-k𝑘kitalic_k-coalitionally smooth system, then after T1𝑇1T\geq 1italic_T ≥ 1 update steps, the cumulative expected welfare satisfies

𝔼[1Tt=1TW(at)]T12Tζ=1kpζλζ1+ζ=1kpζμζW(aopt),𝔼delimited-[]1𝑇superscriptsubscript𝑡1𝑇𝑊superscript𝑎𝑡𝑇12𝑇superscriptsubscript𝜁1𝑘subscript𝑝𝜁subscript𝜆𝜁1superscriptsubscript𝜁1𝑘subscript𝑝𝜁subscript𝜇𝜁𝑊superscript𝑎opt\mathbb{E}\left[\frac{1}{T}\sum_{t=1}^{T}W(a^{t})\right]\geq\frac{T-1}{2T}% \frac{\sum_{\zeta=1}^{k}p_{\zeta}\lambda_{\zeta}}{1+\sum_{\zeta=1}^{k}p_{\zeta% }\mu_{\zeta}}W(a^{\rm opt}),blackboard_E [ divide start_ARG 1 end_ARG start_ARG italic_T end_ARG ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ] ≥ divide start_ARG italic_T - 1 end_ARG start_ARG 2 italic_T end_ARG divide start_ARG ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_ARG start_ARG 1 + ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_ARG italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) , (12)

where pζsubscript𝑝𝜁p_{\zeta}italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT is the probability a group of size ζ𝜁\zetaitalic_ζ best responds.

Interestingly, the bound on the average transient welfare depends on how frequently groups of different sizes are sampled to perform their best response. When the agents are designed to more regularly collaborate in larger groups, the transient guarantee will often be better.

Proof.

First, we show that the Asynchronous k𝑘kitalic_k-Coalitional Best-Response Dynamics converges in general. A group ΓΓ\Gammaroman_Γ revises their action only to one of strictly higher payoff if one exists. Consider the resulting Markov chain \mathcal{M}caligraphic_M with states 𝒜𝒜\mathcal{A}caligraphic_A. Any state a𝒜kSNE𝑎𝒜𝑘SNEa\in\mathcal{A}\setminus k{\rm SNE}italic_a ∈ caligraphic_A ∖ italic_k roman_SNE has an outgoing edge with positive probability as there exists some group Γ𝒞[k]Γsubscript𝒞delimited-[]𝑘\Gamma\in\mathcal{C}_{[k]}roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT [ italic_k ] end_POSTSUBSCRIPT that is selected with probability p|Γ|/|𝒞|Γ||>0subscript𝑝Γsubscript𝒞Γ0p_{|\Gamma|}/|\mathcal{C}_{|\Gamma|}|>0italic_p start_POSTSUBSCRIPT | roman_Γ | end_POSTSUBSCRIPT / | caligraphic_C start_POSTSUBSCRIPT | roman_Γ | end_POSTSUBSCRIPT | > 0 which would revise their action. Any state akSNE𝑎𝑘SNEa\in k{\rm SNE}italic_a ∈ italic_k roman_SNE has no outgoing edges with positive probability as no group Γ𝒞[k]Γsubscript𝒞delimited-[]𝑘\Gamma\in\mathcal{C}_{[k]}roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT [ italic_k ] end_POSTSUBSCRIPT can revise their action to strictly increase the welfare. Finally, there are no cycles (excluding self-loops) in \mathcal{M}caligraphic_M, as every outgoing edge is directed from a joint action of lower welfare to one of strictly higher welfare. As such, the set kSNE𝑘SNEk{\rm SNE}italic_k roman_SNE is absorbing and [limtatkSNE]=1delimited-[]subscript𝑡superscript𝑎𝑡𝑘SNE1\mathbb{P}[\lim_{t\rightarrow\infty}a^{t}\in k{\rm SNE}]=1blackboard_P [ roman_lim start_POSTSUBSCRIPT italic_t → ∞ end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∈ italic_k roman_SNE ] = 1.

Now, consider that the system (G,W)𝐺𝑊(G,W)( italic_G , italic_W ) is (λ,μ)𝜆𝜇(\lambda,\mu)( italic_λ , italic_μ )-k𝑘kitalic_k-coalitionally smooth. As the selection of the updating group is random, the welfare at time t+1𝑡1t+1italic_t + 1 is a random variable, even when conditioned on atsuperscript𝑎𝑡a^{t}italic_a start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT; the expectation of the succeeding welfare can be written

𝔼[W(at+1)\displaystyle\mathbb{E}[W(a^{t+1})\midblackboard_E [ italic_W ( italic_a start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ) ∣ at=a]=ζ=1kpζΓ𝒞ζ1(nζ)W(aΓ+,aΓ)\displaystyle a^{t}=a]=\sum_{\zeta=1}^{k}p_{\zeta}\sum_{\Gamma\in\mathcal{C}_{% \zeta}}\frac{1}{\binom{n}{\zeta}}W(a_{\Gamma}^{+},a_{-\Gamma})italic_a start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT = italic_a ] = ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) end_ARG italic_W ( italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT , italic_a start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT )
ζ=1kpζΓ𝒞ζ1(nζ)W(aΓopt,aΓ)absentsuperscriptsubscript𝜁1𝑘subscript𝑝𝜁subscriptΓsubscript𝒞𝜁1binomial𝑛𝜁𝑊subscriptsuperscript𝑎optΓsubscript𝑎Γ\displaystyle\geq\sum_{\zeta=1}^{k}p_{\zeta}\sum_{\Gamma\in\mathcal{C}_{\zeta}% }\frac{1}{\binom{n}{\zeta}}W(a^{\rm opt}_{\Gamma},a_{-\Gamma})≥ ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) end_ARG italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT )
ζ=1kpζ(λζW(aopt)μζW(a))absentsuperscriptsubscript𝜁1𝑘subscript𝑝𝜁subscript𝜆𝜁𝑊superscript𝑎optsubscript𝜇𝜁𝑊𝑎\displaystyle\geq\sum_{\zeta=1}^{k}p_{\zeta}\left(\lambda_{\zeta}W(a^{\rm opt}% )-\mu_{\zeta}W(a)\right)≥ ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ( italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) - italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_W ( italic_a ) )
=(ζ=1kpζλζ)W(aopt)(ζ=1kpζμζ)W(a),absentsuperscriptsubscript𝜁1𝑘subscript𝑝𝜁subscript𝜆𝜁𝑊superscript𝑎optsuperscriptsubscript𝜁1𝑘subscript𝑝𝜁subscript𝜇𝜁𝑊𝑎\displaystyle=\left(\sum_{\zeta=1}^{k}p_{\zeta}\lambda_{\zeta}\right)W(a^{\rm opt% })-\left(\sum_{\zeta=1}^{k}p_{\zeta}\mu_{\zeta}\right)W(a),= ( ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ) italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) - ( ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ) italic_W ( italic_a ) ,

where aΓ+argmaxaΓ𝒜ΓW(aΓ,aΓ)superscriptsubscript𝑎Γsubscriptargmaxsubscript𝑎Γsubscript𝒜Γ𝑊subscript𝑎Γsubscript𝑎Γa_{\Gamma}^{+}\in\operatorname*{arg\,max}_{a_{\Gamma}\in\mathcal{A}_{\Gamma}}W% (a_{\Gamma},a_{-\Gamma})italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ∈ start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT ∈ caligraphic_A start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT ) is the update state for the group ΓΓ\Gammaroman_Γ following the dynamics; the welfare for each possible updated joint action is the same, so determining which group action is selected is irrelevant. As aΓ+superscriptsubscript𝑎Γa_{\Gamma}^{+}italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT is a best response, the welfare is no better for selecting a different action, namely aΓoptsubscriptsuperscript𝑎optΓa^{\rm opt}_{\Gamma}italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT. The final inequality holds from (5). Taking the expectation of 𝔼[W(at+1)at=a]𝔼delimited-[]conditional𝑊superscript𝑎𝑡1superscript𝑎𝑡𝑎\mathbb{E}[W(a^{t+1})\mid a^{t}=a]blackboard_E [ italic_W ( italic_a start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ) ∣ italic_a start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT = italic_a ] over atsuperscript𝑎𝑡a^{t}italic_a start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT gives

𝔼[W(at+1)](ζ=1kpζλζ)W(aopt)(ζ=1kpζμζ)𝔼[W(at)].𝔼delimited-[]𝑊superscript𝑎𝑡1superscriptsubscript𝜁1𝑘subscript𝑝𝜁subscript𝜆𝜁𝑊superscript𝑎optsuperscriptsubscript𝜁1𝑘subscript𝑝𝜁subscript𝜇𝜁𝔼delimited-[]𝑊superscript𝑎𝑡\mathbb{E}\left[W(a^{t+1})\right]\geq\left(\sum_{\zeta=1}^{k}p_{\zeta}\lambda_% {\zeta}\right)W(a^{\rm opt})-\left(\sum_{\zeta=1}^{k}p_{\zeta}\mu_{\zeta}% \right)\mathbb{E}\left[W(a^{t})\right].blackboard_E [ italic_W ( italic_a start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ) ] ≥ ( ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ) italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) - ( ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ) blackboard_E [ italic_W ( italic_a start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ] .

Rearranging terms shows

𝔼[W(at+1)]ζ=1kpζλζ1+ζ=1kpζμζW(aopt)(ζ=1kpζμζ)(ζ=1kpζλζ1+ζ=1kpζμζW(aopt)𝔼[W(at)]).𝔼delimited-[]𝑊superscript𝑎𝑡1superscriptsubscript𝜁1𝑘subscript𝑝𝜁subscript𝜆𝜁1superscriptsubscript𝜁1𝑘subscript𝑝𝜁subscript𝜇𝜁𝑊superscript𝑎optsuperscriptsubscript𝜁1𝑘subscript𝑝𝜁subscript𝜇𝜁superscriptsubscript𝜁1𝑘subscript𝑝𝜁subscript𝜆𝜁1superscriptsubscript𝜁1𝑘subscript𝑝𝜁subscript𝜇𝜁𝑊superscript𝑎opt𝔼delimited-[]𝑊superscript𝑎𝑡\mathbb{E}\left[W(a^{t+1})\right]-\frac{\sum_{\zeta=1}^{k}p_{\zeta}\lambda_{% \zeta}}{1+\sum_{\zeta=1}^{k}p_{\zeta}\mu_{\zeta}}W(a^{\rm opt})\\ \geq\left(\sum_{\zeta=1}^{k}p_{\zeta}\mu_{\zeta}\right)\left(\frac{\sum_{\zeta% =1}^{k}p_{\zeta}\lambda_{\zeta}}{1+\sum_{\zeta=1}^{k}p_{\zeta}\mu_{\zeta}}W(a^% {\rm opt})-\mathbb{E}\left[W(a^{t})\right]\right).start_ROW start_CELL blackboard_E [ italic_W ( italic_a start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ) ] - divide start_ARG ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_ARG start_ARG 1 + ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_ARG italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL ≥ ( ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ) ( divide start_ARG ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_ARG start_ARG 1 + ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_ARG italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) - blackboard_E [ italic_W ( italic_a start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ] ) . end_CELL end_ROW

Observe that either 𝔼[W(at)]ζ=1kpζλζ1+ζ=1kpζμζW(aopt)𝔼delimited-[]𝑊superscript𝑎𝑡superscriptsubscript𝜁1𝑘subscript𝑝𝜁subscript𝜆𝜁1superscriptsubscript𝜁1𝑘subscript𝑝𝜁subscript𝜇𝜁𝑊superscript𝑎opt\mathbb{E}\left[W(a^{t})\right]\geq\frac{\sum_{\zeta=1}^{k}p_{\zeta}\lambda_{% \zeta}}{1+\sum_{\zeta=1}^{k}p_{\zeta}\mu_{\zeta}}W(a^{\rm opt})blackboard_E [ italic_W ( italic_a start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ] ≥ divide start_ARG ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_ARG start_ARG 1 + ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_ARG italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) or 𝔼[W(at+1)]ζ=1kpζλζ1+ζ=1kpζμζW(aopt)0𝔼delimited-[]𝑊superscript𝑎𝑡1superscriptsubscript𝜁1𝑘subscript𝑝𝜁subscript𝜆𝜁1superscriptsubscript𝜁1𝑘subscript𝑝𝜁subscript𝜇𝜁𝑊superscript𝑎opt0\mathbb{E}\left[W(a^{t+1})\right]-\frac{\sum_{\zeta=1}^{k}p_{\zeta}\lambda_{% \zeta}}{1+\sum_{\zeta=1}^{k}p_{\zeta}\mu_{\zeta}}W(a^{\rm opt})\geq 0blackboard_E [ italic_W ( italic_a start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ) ] - divide start_ARG ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_ARG start_ARG 1 + ∑ start_POSTSUBSCRIPT italic_ζ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_ARG italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) ≥ 0. Accordingly, in expectation, every other update must satisfy the bound, giving the average cumulative welfare bound in (12). ∎

Theorem 4.2 shows that the transient efficiency changes with the frequency with which different group sizes perform best responses. To attain the best transient guarantee, we can select p𝑝pitalic_p carefully.

Corollary 1.

If a system (G,w)𝐺𝑤(G,w)( italic_G , italic_w ) is a resource allocation problem in 𝒢n×{w}subscript𝒢𝑛𝑤\mathcal{G}_{n}\times\{w\}caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT × { italic_w }, then selecting pζνζψ[k](nψ)νψproportional-tosubscript𝑝𝜁superscriptsubscript𝜈𝜁subscript𝜓delimited-[]𝑘binomial𝑛𝜓superscriptsubscript𝜈𝜓p_{\zeta}\propto\frac{\nu_{\zeta}^{\star}}{\sum_{\psi\in[k]}\binom{n}{\psi}\nu% _{\psi}^{\star}}italic_p start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ∝ divide start_ARG italic_ν start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_ψ ∈ [ italic_k ] end_POSTSUBSCRIPT ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ψ end_ARG ) italic_ν start_POSTSUBSCRIPT italic_ψ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_ARG for all ζ[k]𝜁delimited-[]𝑘\zeta\in[k]italic_ζ ∈ [ italic_k ] gives

𝔼[1Tt=1TW(at)]T12TSPoAk(𝒢n,w)W(aopt).𝔼delimited-[]1𝑇superscriptsubscript𝑡1𝑇𝑊superscript𝑎𝑡𝑇12𝑇subscriptSPoA𝑘subscript𝒢𝑛𝑤𝑊superscript𝑎opt\mathbb{E}\left[\frac{1}{T}\sum_{t=1}^{T}W(a^{t})\right]\geq\frac{T-1}{2T}% \mathrm{SPoA}_{k}(\mathcal{G}_{n},w)W(a^{\rm opt}).blackboard_E [ divide start_ARG 1 end_ARG start_ARG italic_T end_ARG ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ] ≥ divide start_ARG italic_T - 1 end_ARG start_ARG 2 italic_T end_ARG roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w ) italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) .

The proof is omitted as it is straightforward by rearranging terms in the constraints of (D).

Together, Theorem 4.2 and Corollary 1 provide insight into the transient performance of non-deterministic multi-agent dynamical systems with collaborative communication. Future work will study the traits of non-best-response dynamics, namely regret-based decision-making.

4.3 Numerical Example

We support the findings of Section 4.2 by numerical example. We randomly generate resource allocation problems and simulate the coalitional asynchronous best response dynamics when groups of size k{1,2,3,4,5}𝑘12345k\in\{1,2,3,4,5\}italic_k ∈ { 1 , 2 , 3 , 4 , 5 } update.

The resource allocation problems are generated by creating 100 resources with values independently drawn uniformly at random on [0,1]01[0,1][ 0 , 1 ]. Each of the 25 agents is endowed with between 1 and 10 actions (also sampled uniformly at random). For each action of each player, each resource is included in that particular action with probability 0.25. This defines a tuple G𝐺Gitalic_G. We use the local welfare function w(x)=xex/5𝑤𝑥𝑥superscript𝑒𝑥5w(x)=xe^{-x/5}italic_w ( italic_x ) = italic_x italic_e start_POSTSUPERSCRIPT - italic_x / 5 end_POSTSUPERSCRIPT to capture some added benefit from having multiple agents use the same resource and eventual diminishing returns and increased cost from over congestion.

We select a random initial condition and run the asynchronous best response dynamics with pk=1subscript𝑝𝑘1p_{k}=1italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 1 for one value k{1,2,3,4,5}𝑘12345k\in\{1,2,3,4,5\}italic_k ∈ { 1 , 2 , 3 , 4 , 5 } (i.e., only groups of exactly size k𝑘kitalic_k are sampled, but the simulation is repeated for 1k51𝑘51\leq k\leq 51 ≤ italic_k ≤ 5. We ran this simulation 100 times.

In Fig. 4(a), we plot the average welfare across the simulations over the number of group action revisions. We observe that the larger coalitions provide superior transient and long-run performance. However, a single group action revision requires more computation for larger coalitions. In Fig. 4(b), for each coalition size k{1,,5}𝑘15k\in\{1,\ldots,5\}italic_k ∈ { 1 , … , 5 }, we show a scatter plot of the number of cumulative welfare evaluations and the attained system welfare, along with a trend line fit to the data within two standard deviations of the average number of welfare evaluations. Here, we observe that for lower values of welfare, the smaller coalitions can attain similar welfare with fewer welfare evaluations but that the larger coalitions reach higher welfare much more regularly.

These conclusions help to identify the trade-off in designing systems with collaborative communication: better performance is attainable at the cost of greater computation.

Refer to caption
(a) Group revisions
Refer to caption
(b) Welfare checks
Figure 4: Numerical example of the coalitional asynchronous best response dynamics. In Fig. 4(a), the system welfare is plotted over the number of group action revisions, and in Fig. 4(b) it is plotted over the number of welfare evaluations. From this data, we can observe that group revisions offer superior system transient and long-term performance but require more welfare evaluations to compute group actions.
Refer to caption
Figure 5: Bounds on the k𝑘kitalic_k-strong Price of Anarchy using the optimal utility function in the class of resource allocation games with welfare function w𝑤witalic_w. Upper bound on SPoAk(𝒢n,w)superscriptsubscriptSPoA𝑘subscript𝒢𝑛𝑤\mathrm{SPoA}_{k}^{\star}(\mathcal{G}_{n},w)roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w ) generated by Proposition 5.3 and lower bound and utility rule that attains it generated by Theorem 5.2. Compared with the k𝑘kitalic_k-strong price of anarchy when agents optimize the system welfare (lighter line), we demonstrate the possible and guaranteed gain in equilibrium performance attainable by designing group decision-making for collaborative multi-agent systems.

5 Utility Design

Up until this point, agents and groups of agents have been set to optimize the system welfare W𝑊Witalic_W over their respective individual or group actions. Though this is a reasonable approach, the system designer may seek to further improve system performance by designing how a group of agents makes a decision. Consider that groups of agents instead maximize the objective function U:𝒜0:𝑈𝒜subscriptabsent0U:\mathcal{A}\rightarrow\mathbb{R}_{\geq 0}italic_U : caligraphic_A → blackboard_R start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT (henceforth referred to as the utility function), i.e.,

aΓargmaxaΓ𝒜ΓU(aΓ,aΓ),subscript𝑎Γsubscriptargmaxsuperscriptsubscript𝑎Γsubscript𝒜Γ𝑈superscriptsubscript𝑎Γsubscript𝑎Γa_{\Gamma}\in\operatorname*{arg\,max}_{a_{\Gamma}^{\prime}\in\mathcal{A}_{% \Gamma}}U(a_{\Gamma}^{\prime},a_{-\Gamma}),italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT ∈ start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_U ( italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_a start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT ) , (13)

where ties are still broken at random unless the current group action is in the argmax. By designing the utility function U𝑈Uitalic_U, the system operator can alter how groups of agents make decisions and, ideally, improve the performance of the system. A multi-agent system is now captured by the tuple (G,W,U)𝐺𝑊𝑈(G,W,U)( italic_G , italic_W , italic_U ), where the previous results are the special case when U=W𝑈𝑊U=Witalic_U = italic_W.

By redefining the objective functions groups of agents seek to maximize, we additionally alter the equilibria that emerge from collaborative decision-making. We alter the definition of k𝑘kitalic_k-strong Nash equilibria to hold with respect to the utility function, i.e.,

U(akSNE)U(aΓ,aΓkSNE),aΓ𝒜Γ,Γ𝒞[k].formulae-sequence𝑈superscript𝑎𝑘SNE𝑈subscriptsuperscript𝑎Γsubscriptsuperscript𝑎𝑘SNEΓformulae-sequencefor-allsubscriptsuperscript𝑎Γsubscript𝒜ΓΓsubscript𝒞delimited-[]𝑘U(a^{k{\rm SNE}})\geq U(a^{\prime}_{\Gamma},a^{k{\rm SNE}}_{-\Gamma}),% \leavevmode\nobreak\ \forall a^{\prime}_{\Gamma}\in\mathcal{A}_{\Gamma},% \leavevmode\nobreak\ \Gamma\in\mathcal{C}_{[k]}.italic_U ( italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ) ≥ italic_U ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT ) , ∀ italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT ∈ caligraphic_A start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT , roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT [ italic_k ] end_POSTSUBSCRIPT . (14)

Let kSNE(G,U)𝑘SNE𝐺𝑈k{\rm SNE}(G,U)italic_k roman_SNE ( italic_G , italic_U ) denote the set of k𝑘kitalic_k-strong Nash equilibria when agents optimize the objective U𝑈Uitalic_U. The new set of equilibria implies the equilibrium performance guarantee may also change. As such, we redefine the k𝑘kitalic_k-strong price of anarchy as the approximation of the optimal welfare provided the system equilibria under objective function U𝑈Uitalic_U,

SPoAk(G,W,U)=minakSNEkSNE(G,U)W(akSNE)maxaopt𝒜W(aopt).subscriptSPoA𝑘𝐺𝑊𝑈subscriptsuperscript𝑎𝑘SNE𝑘SNE𝐺𝑈𝑊superscript𝑎𝑘SNEsubscriptsuperscript𝑎opt𝒜𝑊superscript𝑎opt\mathrm{SPoA}_{k}(G,W,U)=\frac{\min_{a^{k{\rm SNE}}\in k{\rm SNE}(G,U)}W(a^{k{% \rm SNE}})}{\max_{a^{\rm opt}\in\mathcal{A}}W(a^{\rm opt})}.roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_G , italic_W , italic_U ) = divide start_ARG roman_min start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ∈ italic_k roman_SNE ( italic_G , italic_U ) end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ) end_ARG start_ARG roman_max start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) end_ARG . (15)

With this new design opportunity, we identify two goals in understanding the new attainable performance of collaborative decision-making: 1) quantifying the performance of a prescribed utility function, and 2) finding a utility function that provides the greatest k𝑘kitalic_k-strong price of anarchy guarantees. We address these two points in general in Section 5.1 and more thoroughly within resource allocation problems in Section 5.2.

5.1 Generalized Coalitionally Smooth Games

In this section, we consider the general setting and particularly focus on quantifying the k𝑘kitalic_k-strong price of anarchy of a system (G,W,U)𝐺𝑊𝑈(G,W,U)( italic_G , italic_W , italic_U ). As in Section 3.1, we introduce a notion of smooth systems now generalized to the setting where the agent objective U𝑈Uitalic_U differs from the system objective W𝑊Witalic_W.

Definition 3.

A system (G,W,U)𝐺𝑊𝑈(G,W,U)( italic_G , italic_W , italic_U ) is (λ,μ)𝜆𝜇(\lambda,\mu)( italic_λ , italic_μ )-k𝑘kitalic_k-generalized-coalitionally smooth, where λ,μ0k𝜆𝜇superscriptsubscriptabsent0𝑘\lambda,\mu\in\mathbb{R}_{\geq 0}^{k}italic_λ , italic_μ ∈ blackboard_R start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT, if for all a,a𝒜𝑎superscript𝑎𝒜a,a^{\prime}\in\mathcal{A}italic_a , italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A

1(nζ)Γ𝒞ζU(aΓ,aΓ)U(a)+W(a)λζW(a)μζW(a),ζ[k].formulae-sequence1binomial𝑛𝜁subscriptΓsubscript𝒞𝜁𝑈superscriptsubscript𝑎Γsubscript𝑎Γ𝑈𝑎𝑊𝑎subscript𝜆𝜁𝑊superscript𝑎subscript𝜇𝜁𝑊𝑎for-all𝜁delimited-[]𝑘\frac{1}{\binom{n}{\zeta}}\sum_{\Gamma\in\mathcal{C}_{\zeta}}U(a_{\Gamma}^{% \prime},a_{-\Gamma})-U(a)+W(a)\\ \geq\lambda_{\zeta}W(a^{\prime})-\mu_{\zeta}W(a),\leavevmode\nobreak\ \forall% \zeta\in[k].start_ROW start_CELL divide start_ARG 1 end_ARG start_ARG ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) end_ARG ∑ start_POSTSUBSCRIPT roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_U ( italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_a start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT ) - italic_U ( italic_a ) + italic_W ( italic_a ) end_CELL end_ROW start_ROW start_CELL ≥ italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_W ( italic_a ) , ∀ italic_ζ ∈ [ italic_k ] . end_CELL end_ROW (16)

Like (5), (16) provides a bound on average deviation effect of a group of size ζ𝜁\zetaitalic_ζ but on the utility function instead of the welfare. In Proposition 5.1, we show that (λ,μ)𝜆𝜇(\lambda,\mu)( italic_λ , italic_μ )-k𝑘kitalic_k-generalized-coalitionally smooth system permits a bound on the k𝑘kitalic_k-strong price of anarchy.

Proposition 5.1.

A system (G,W,U)𝐺𝑊𝑈(G,W,U)( italic_G , italic_W , italic_U ) that is (λ,μ)𝜆𝜇(\lambda,\mu)( italic_λ , italic_μ )-k𝑘kitalic_k-generalized-coalitionally smooth has k𝑘kitalic_k-strong price of anarchy satisfying

SPoAk(G,W,U)λζ1+μζ,ζ[k].formulae-sequencesubscriptSPoA𝑘𝐺𝑊𝑈subscript𝜆𝜁1subscript𝜇𝜁for-all𝜁delimited-[]𝑘\mathrm{SPoA}_{k}(G,W,U)\geq\frac{\lambda_{\zeta}}{1+\mu_{\zeta}},\leavevmode% \nobreak\ \forall\zeta\in[k].roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_G , italic_W , italic_U ) ≥ divide start_ARG italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_ARG start_ARG 1 + italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_ARG , ∀ italic_ζ ∈ [ italic_k ] . (17)
Proof.

Let akSNE𝒜superscript𝑎𝑘SNE𝒜a^{k{\rm SNE}}\in\mathcal{A}italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ∈ caligraphic_A denote a k𝑘kitalic_k-strong Nash equilibrium when agents follow objective function U𝑈Uitalic_U, and let aoptargmaxa𝒜W(a)superscript𝑎optsubscriptargmax𝑎𝒜𝑊𝑎a^{\rm opt}\in\operatorname*{arg\,max}_{a\in\mathcal{A}}W(a)italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ∈ start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_W ( italic_a ) denote an optimal joint action. For any ζ[k]𝜁delimited-[]𝑘\zeta\in[k]italic_ζ ∈ [ italic_k ], we have

W(akSNE)𝑊superscript𝑎𝑘SNE\displaystyle W(a^{k{\rm SNE}})italic_W ( italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ) 1(nζ)Γ𝒞ζU(aΓopt,aΓkSNE)U(akSNE)+W(akSNE)absent1binomial𝑛𝜁subscriptΓsubscript𝒞𝜁𝑈subscriptsuperscript𝑎optΓsubscriptsuperscript𝑎𝑘SNEΓ𝑈superscript𝑎𝑘SNE𝑊superscript𝑎𝑘SNE\displaystyle\geq\frac{1}{\binom{n}{\zeta}}\sum_{\Gamma\in\mathcal{C}_{\zeta}}% U(a^{\rm opt}_{\Gamma},a^{k{\rm SNE}}_{-\Gamma})-U(a^{k{\rm SNE}})+W(a^{k{\rm SNE% }})≥ divide start_ARG 1 end_ARG start_ARG ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) end_ARG ∑ start_POSTSUBSCRIPT roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_U ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT ) - italic_U ( italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ) + italic_W ( italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ) (18a)
λζW(aopt)μζW(akSNE).absentsubscript𝜆𝜁𝑊superscript𝑎optsubscript𝜇𝜁𝑊superscript𝑎𝑘SNE\displaystyle\geq\lambda_{\zeta}W(a^{\rm opt})-\mu_{\zeta}W(a^{k{\rm SNE}}).≥ italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) - italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ) . (18b)

Where (18a) holds from 1(nζ)Γ𝒞ζU(aΓopt,aΓkSNE)U(akSNE)01binomial𝑛𝜁subscriptΓsubscript𝒞𝜁𝑈subscriptsuperscript𝑎optΓsubscriptsuperscript𝑎𝑘SNEΓ𝑈superscript𝑎𝑘SNE0\frac{1}{\binom{n}{\zeta}}\sum_{\Gamma\in\mathcal{C}_{\zeta}}U(a^{\rm opt}_{% \Gamma},a^{k{\rm SNE}}_{-\Gamma})-U(a^{k{\rm SNE}})\geq 0divide start_ARG 1 end_ARG start_ARG ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) end_ARG ∑ start_POSTSUBSCRIPT roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_U ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT ) - italic_U ( italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ) ≥ 0 by akSNEsuperscript𝑎𝑘SNEa^{k{\rm SNE}}italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT being a k𝑘kitalic_k-strong Nash equilibrium and (7c) holds from Definition 3. Rearranging, we get W(akSNE)/W(aopt)λζ/(1+μζ)𝑊superscript𝑎𝑘SNE𝑊superscript𝑎optsubscript𝜆𝜁1subscript𝜇𝜁W(a^{k{\rm SNE}})/W(a^{\rm opt})\geq\lambda_{\zeta}/(1+\mu_{\zeta})italic_W ( italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ) / italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) ≥ italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT / ( 1 + italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ). ∎

Beyond quantifying the k𝑘kitalic_k-strong price of anarchy for a system (G,W,U)𝐺𝑊𝑈(G,W,U)( italic_G , italic_W , italic_U ), one may wish to find the utility function which provides the best efficiency guarantee, i.e.,

UargmaxU:𝒜0SPoAk(G,W,U).𝑈subscriptargmax:superscript𝑈𝒜subscriptabsent0subscriptSPoA𝑘𝐺𝑊superscript𝑈U\in\operatorname*{arg\,max}_{U^{\prime}:\mathcal{A}\rightarrow\mathbb{R}_{% \geq 0}}\mathrm{SPoA}_{k}(G,W,U^{\prime}).italic_U ∈ start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_U start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT : caligraphic_A → blackboard_R start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_G , italic_W , italic_U start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) .

For a specific problem (G,W)𝐺𝑊(G,W)( italic_G , italic_W ), it is possible to design a utility function which guarantees that a system optimal aoptsuperscript𝑎opta^{\rm opt}italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT is a unique equilibrium and provides SPoAk(G,W,U)=1subscriptSPoA𝑘𝐺𝑊𝑈1\mathrm{SPoA}_{k}(G,W,U)=1roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_G , italic_W , italic_U ) = 1 (e.g., U(a)=iN𝟙[ai=aiopt]𝑈𝑎subscript𝑖𝑁1delimited-[]subscript𝑎𝑖subscriptsuperscript𝑎opt𝑖U(a)=\sum_{i\in N}{\mathds{1}}[a_{i}=a^{\rm opt}_{i}]italic_U ( italic_a ) = ∑ start_POSTSUBSCRIPT italic_i ∈ italic_N end_POSTSUBSCRIPT blackboard_1 [ italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ]). However, this would require knowing the optimal allocations a priori, which poses several problems, including: 1) computing an optimal allocation can be intractable, and 2) system parameters may be subject to modeling errors, noise, or changes over time, causing the optimal allocations to change. As such, we will consider the design of utility rules, which provide a set of instructions to construct a utility function across a class of systems and eliminate the computational burden of solving for a new utility function for each system while maintaining improved performance guarantees. Luckily, the approach in Proposition 5.1 is amenable to generating performance guarantees across a class of systems, and in Section 5.2, we will investigate optimal utility rules more thoroughly in resource allocation problems.

5.2 Resource Allocation Games

In this section, we consider the k𝑘kitalic_k-strong price of anarchy in classes of resource allocation problems when the agents’ objective is derived from a utility rule u0n+1𝑢subscriptsuperscript𝑛1absent0u\in\mathbb{R}^{n+1}_{\geq 0}italic_u ∈ blackboard_R start_POSTSUPERSCRIPT italic_n + 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT. In an agent environment G=(N,𝒜,,{vr}r)𝐺𝑁𝒜subscriptsubscript𝑣𝑟𝑟G=(N,\mathcal{A},\mathcal{R},\{v_{r}\}_{r\in\mathcal{R}})italic_G = ( italic_N , caligraphic_A , caligraphic_R , { italic_v start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_r ∈ caligraphic_R end_POSTSUBSCRIPT ), the utility rule u𝑢uitalic_u can be applied to derive the utility function

U(a)=rvru(|a|r).𝑈𝑎subscript𝑟subscript𝑣𝑟𝑢subscript𝑎𝑟U(a)=\sum_{r\in\mathcal{R}}v_{r}u(|a|_{r}).italic_U ( italic_a ) = ∑ start_POSTSUBSCRIPT italic_r ∈ caligraphic_R end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_u ( | italic_a | start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ) .

To normalize the utility function, we set u(0)=0𝑢00u(0)=0italic_u ( 0 ) = 0. We ultimately consider the performance of a utility rule u𝑢uitalic_u across all agent environments G𝒢n𝐺subscript𝒢𝑛G\in\mathcal{G}_{n}italic_G ∈ caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT with welfare function w𝑤witalic_w. We slightly abuse notation to refer to a system by the tuple (G,w,u)𝐺𝑤𝑢(G,w,u)( italic_G , italic_w , italic_u ). To quantify this performance, we generalize the k𝑘kitalic_k-strong price of anarchy bound defined in Section 4.1 to hold for cases where groups of agents optimize the utility function.

SPoAk(𝒢n,w,u)=minG𝒢nSPoAk(G,w,u).subscriptSPoA𝑘subscript𝒢𝑛𝑤𝑢subscript𝐺subscript𝒢𝑛subscriptSPoA𝑘𝐺𝑤𝑢\mathrm{SPoA}_{k}(\mathcal{G}_{n},w,u)=\min_{G\in\mathcal{G}_{n}}\mathrm{SPoA}% _{k}(G,w,u).roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w , italic_u ) = roman_min start_POSTSUBSCRIPT italic_G ∈ caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_G , italic_w , italic_u ) . (19)

The performance ratio is parameterized by the pair (w,u)𝑤𝑢(w,u)( italic_w , italic_u ); as such, we will discuss the effectiveness of a utility rule u𝑢uitalic_u with respect to a given welfare function w𝑤witalic_w.

Taking the utility rule approach completely eliminates the computational cost of deriving a utility function for each problem instance; now we seek to understand the capabilities of this approach in two ways: 1) in Theorem 5.2 we demonstrate how we can construct utility rules with good performance guarantees, and 2) in Proposition 5.1 we provide an upper bound on the best attainable performance a utility rule can provide. In Corollary 2, we provide a formal condition on when the constructed utility rule is optimal.

Theorem 5.2.

Any resource allocation problem (G,W)𝒢n×{w}𝐺𝑊subscript𝒢𝑛𝑤(G,W)\in\mathcal{G}_{n}\times\{w\}( italic_G , italic_W ) ∈ caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT × { italic_w } with the utility rule u~ζsubscript~𝑢𝜁\widetilde{u}_{\zeta}over~ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT is (1,ρ~ζ1)1subscript~𝜌𝜁1(1,\widetilde{\rho}_{\zeta}-1)( 1 , over~ start_ARG italic_ρ end_ARG start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT - 1 )-k𝑘kitalic_k-generalized-coallitionally smooth, where u~ζsubscript~𝑢𝜁\widetilde{u}_{\zeta}over~ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT and ρ~ζsubscript~𝜌𝜁\widetilde{\rho}_{\zeta}over~ start_ARG italic_ρ end_ARG start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT are solutions to the linear program,

(ρ~ζ,u~ζ)argminρ0,u0n+1ρsubscript~𝜌𝜁subscript~𝑢𝜁subscriptargminformulae-sequence𝜌0𝑢subscriptsuperscript𝑛1absent0𝜌\displaystyle(\widetilde{\rho}_{\zeta},\widetilde{u}_{\zeta})\in\operatorname*% {arg\,min}_{\rho\geq 0,u\in\mathbb{R}^{n+1}_{\geq 0}}\leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \rho( over~ start_ARG italic_ρ end_ARG start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT , over~ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ) ∈ start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT italic_ρ ≥ 0 , italic_u ∈ blackboard_R start_POSTSUPERSCRIPT italic_n + 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_ρ
s.t.0w(o+x)ρw(e+x)+formulae-sequencest0𝑤𝑜𝑥limit-from𝜌𝑤𝑒𝑥\displaystyle{\rm s.t.}\hskip 8.0pt0\geq w(o+x)-\rho w(e+x)+roman_s . roman_t . 0 ≥ italic_w ( italic_o + italic_x ) - italic_ρ italic_w ( italic_e + italic_x ) +
((nζ)u(e+x)0αe0βoα+βζ(eα)(oβ)(neoζαβ)u(e+x+βα))binomial𝑛𝜁𝑢𝑒𝑥subscript0𝛼𝑒0𝛽𝑜𝛼𝛽𝜁binomial𝑒𝛼binomial𝑜𝛽binomial𝑛𝑒𝑜𝜁𝛼𝛽𝑢𝑒𝑥𝛽𝛼\displaystyle\leavevmode\nobreak\ \leavevmode\nobreak\ \left(\binom{n}{\zeta}u% (e+x)-\sum_{\begin{subarray}{c}0\leq\alpha\leq e\\ 0\leq\beta\leq o\\ \alpha+\beta\leq\zeta\end{subarray}}{e\choose\alpha}{o\choose\beta}\binom{n-e-% o}{\zeta-\alpha-\beta}u(e+x+\beta-\alpha)\right)( ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) italic_u ( italic_e + italic_x ) - ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL 0 ≤ italic_α ≤ italic_e end_CELL end_ROW start_ROW start_CELL 0 ≤ italic_β ≤ italic_o end_CELL end_ROW start_ROW start_CELL italic_α + italic_β ≤ italic_ζ end_CELL end_ROW end_ARG end_POSTSUBSCRIPT ( binomial start_ARG italic_e end_ARG start_ARG italic_α end_ARG ) ( binomial start_ARG italic_o end_ARG start_ARG italic_β end_ARG ) ( FRACOP start_ARG italic_n - italic_e - italic_o end_ARG start_ARG italic_ζ - italic_α - italic_β end_ARG ) italic_u ( italic_e + italic_x + italic_β - italic_α ) )
(e,x,o).for-all𝑒𝑥𝑜\displaystyle\hskip 160.0pt\forall(e,x,o)\in\mathcal{I}.∀ ( italic_e , italic_x , italic_o ) ∈ caligraphic_I . (Qζ𝜁\zetaitalic_ζ)
Proof.

Consider the parameterization described in the proof of Proposition 3.2, where for any two actions a,a𝒜𝑎superscript𝑎𝒜a,a^{\prime}\in\mathcal{A}italic_a , italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A, we can rewrite W(a)=e,x,oθ(e,x,o)w(e+x)𝑊𝑎subscript𝑒𝑥𝑜𝜃𝑒𝑥𝑜𝑤𝑒𝑥W(a)=\sum_{e,x,o}\theta(e,x,o)w(e+x)italic_W ( italic_a ) = ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT italic_θ ( italic_e , italic_x , italic_o ) italic_w ( italic_e + italic_x ) and W(a)=e,x,oθ(e,x,o)w(o+x)𝑊superscript𝑎subscript𝑒𝑥𝑜𝜃𝑒𝑥𝑜𝑤𝑜𝑥W(a^{\prime})=\sum_{e,x,o}\theta(e,x,o)w(o+x)italic_W ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT italic_θ ( italic_e , italic_x , italic_o ) italic_w ( italic_o + italic_x ). Now, we can additionally rewrite U(a)=e,x,oθ(e,x,o)u(e+x)𝑈𝑎subscript𝑒𝑥𝑜𝜃𝑒𝑥𝑜𝑢𝑒𝑥U(a)=\sum_{e,x,o}\theta(e,x,o)u(e+x)italic_U ( italic_a ) = ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT italic_θ ( italic_e , italic_x , italic_o ) italic_u ( italic_e + italic_x ) and

Γ𝒞ζW(aΓ,aΓ)=e,x,oθ(e,x,o)0αe0βoα+βζ(eα)(oβ)(neoζαβ)w(e+x+βα).subscriptΓsubscript𝒞𝜁𝑊subscriptsuperscript𝑎Γsubscript𝑎Γsubscript𝑒𝑥𝑜𝜃𝑒𝑥𝑜subscript0𝛼𝑒0𝛽𝑜𝛼𝛽𝜁binomial𝑒𝛼binomial𝑜𝛽binomial𝑛𝑒𝑜𝜁𝛼𝛽𝑤𝑒𝑥𝛽𝛼\sum_{\Gamma\in\mathcal{C}_{\zeta}}W(a^{\prime}_{\Gamma},a_{-\Gamma})\\ =\sum_{e,x,o}\theta(e,x,o)\sum_{\begin{subarray}{c}0\leq\alpha\leq e\\ 0\leq\beta\leq o\\ \alpha+\beta\leq\zeta\end{subarray}}{e\choose\alpha}{o\choose\beta}\binom{n-e-% o}{\zeta-\alpha-\beta}w(e+x+\beta-\alpha).start_ROW start_CELL ∑ start_POSTSUBSCRIPT roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT ) end_CELL end_ROW start_ROW start_CELL = ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT italic_θ ( italic_e , italic_x , italic_o ) ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL 0 ≤ italic_α ≤ italic_e end_CELL end_ROW start_ROW start_CELL 0 ≤ italic_β ≤ italic_o end_CELL end_ROW start_ROW start_CELL italic_α + italic_β ≤ italic_ζ end_CELL end_ROW end_ARG end_POSTSUBSCRIPT ( binomial start_ARG italic_e end_ARG start_ARG italic_α end_ARG ) ( binomial start_ARG italic_o end_ARG start_ARG italic_β end_ARG ) ( FRACOP start_ARG italic_n - italic_e - italic_o end_ARG start_ARG italic_ζ - italic_α - italic_β end_ARG ) italic_w ( italic_e + italic_x + italic_β - italic_α ) . end_CELL end_ROW

We can now write out (16), the (λ,μ)𝜆𝜇(\lambda,\mu)( italic_λ , italic_μ )-k𝑘kitalic_k-generalized-coalitionally smooth constraint, as

e,x,oθ(e,x,o)(1(nζ)0αe0βoα+βζ(eα)(oβ)(neoζαβ)u(e+x+βα)u(e+x))e,x,oθ(e,x,o)(λζw(o+x)(μζ+1)w(e+x)).subscript𝑒𝑥𝑜𝜃𝑒𝑥𝑜1binomial𝑛𝜁subscript0𝛼𝑒0𝛽𝑜𝛼𝛽𝜁binomial𝑒𝛼binomial𝑜𝛽binomial𝑛𝑒𝑜𝜁𝛼𝛽𝑢𝑒𝑥𝛽𝛼𝑢𝑒𝑥subscript𝑒𝑥𝑜𝜃𝑒𝑥𝑜subscript𝜆𝜁𝑤𝑜𝑥subscript𝜇𝜁1𝑤𝑒𝑥\sum_{e,x,o}\theta(e,x,o)\Bigg{(}\frac{1}{\binom{n}{\zeta}}\sum_{\begin{% subarray}{c}0\leq\alpha\leq e\\ 0\leq\beta\leq o\\ \alpha+\beta\leq\zeta\end{subarray}}{e\choose\alpha}{o\choose\beta}\binom{n-e-% o}{\zeta-\alpha-\beta}u(e+x+\beta-\alpha)\\ -u(e+x)\Bigg{)}\geq\sum_{e,x,o}\theta(e,x,o)\left(\lambda_{\zeta}w(o+x)-(\mu_{% \zeta}+1)w(e+x)\right).start_ROW start_CELL ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT italic_θ ( italic_e , italic_x , italic_o ) ( divide start_ARG 1 end_ARG start_ARG ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) end_ARG ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL 0 ≤ italic_α ≤ italic_e end_CELL end_ROW start_ROW start_CELL 0 ≤ italic_β ≤ italic_o end_CELL end_ROW start_ROW start_CELL italic_α + italic_β ≤ italic_ζ end_CELL end_ROW end_ARG end_POSTSUBSCRIPT ( binomial start_ARG italic_e end_ARG start_ARG italic_α end_ARG ) ( binomial start_ARG italic_o end_ARG start_ARG italic_β end_ARG ) ( FRACOP start_ARG italic_n - italic_e - italic_o end_ARG start_ARG italic_ζ - italic_α - italic_β end_ARG ) italic_u ( italic_e + italic_x + italic_β - italic_α ) end_CELL end_ROW start_ROW start_CELL - italic_u ( italic_e + italic_x ) ) ≥ ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT italic_θ ( italic_e , italic_x , italic_o ) ( italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_w ( italic_o + italic_x ) - ( italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT + 1 ) italic_w ( italic_e + italic_x ) ) . end_CELL end_ROW

As before, we can observe that this constraint is sufficiently satisfied when

1(nζ)0αe0βoα+βζ(eα)(oβ)(neoζαβ)u(e+x+βα)u(e+x)λζw(o+x)(μζ+1)w(e+x),(e,x,o).\frac{1}{\binom{n}{\zeta}}\sum_{\begin{subarray}{c}0\leq\alpha\leq e\\ 0\leq\beta\leq o\\ \alpha+\beta\leq\zeta\end{subarray}}{e\choose\alpha}{o\choose\beta}\binom{n-e-% o}{\zeta-\alpha-\beta}u(e+x+\beta-\alpha)-u(e+x)\\ \geq\lambda_{\zeta}w(o+x)-(\mu_{\zeta}+1)w(e+x),\quad\forall(e,x,o)\in\mathcal% {I}.start_ROW start_CELL divide start_ARG 1 end_ARG start_ARG ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) end_ARG ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL 0 ≤ italic_α ≤ italic_e end_CELL end_ROW start_ROW start_CELL 0 ≤ italic_β ≤ italic_o end_CELL end_ROW start_ROW start_CELL italic_α + italic_β ≤ italic_ζ end_CELL end_ROW end_ARG end_POSTSUBSCRIPT ( binomial start_ARG italic_e end_ARG start_ARG italic_α end_ARG ) ( binomial start_ARG italic_o end_ARG start_ARG italic_β end_ARG ) ( FRACOP start_ARG italic_n - italic_e - italic_o end_ARG start_ARG italic_ζ - italic_α - italic_β end_ARG ) italic_u ( italic_e + italic_x + italic_β - italic_α ) - italic_u ( italic_e + italic_x ) end_CELL end_ROW start_ROW start_CELL ≥ italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_w ( italic_o + italic_x ) - ( italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT + 1 ) italic_w ( italic_e + italic_x ) , ∀ ( italic_e , italic_x , italic_o ) ∈ caligraphic_I . end_CELL end_ROW (20)

The task of finding smoothness parameters that give the best price of anarchy guarantee becomes the same problem as (P1ζ𝜁\zetaitalic_ζ) but now with constraint set (20). By substituting the decision variables ρ=(1+μζ)/λζ𝜌1subscript𝜇𝜁subscript𝜆𝜁\rho=(1+\mu_{\zeta})/\lambda_{\zeta}italic_ρ = ( 1 + italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ) / italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT and ν=1/((nζ)λζ)0𝜈1binomial𝑛𝜁subscript𝜆𝜁0\nu=1/\left(\binom{n}{\zeta}\lambda_{\zeta}\right)\geq 0italic_ν = 1 / ( ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ) ≥ 0, we attain the new constraint set

0w(o+x)ρw(e+x)+ν((nζ)u(e+x)0αe0βoα+βζ(eα)(oβ)(neoζαβ)u(e+x+βα))(e,x,o).0𝑤𝑜𝑥𝜌𝑤𝑒𝑥𝜈binomial𝑛𝜁𝑢𝑒𝑥subscript0𝛼𝑒0𝛽𝑜𝛼𝛽𝜁binomial𝑒𝛼binomial𝑜𝛽binomial𝑛𝑒𝑜𝜁𝛼𝛽𝑢𝑒𝑥𝛽𝛼for-all𝑒𝑥𝑜0\geq w(o+x)-\rho w(e+x)+\\ \nu\left(\binom{n}{\zeta}u(e+x)-\sum_{\begin{subarray}{c}0\leq\alpha\leq e\\ 0\leq\beta\leq o\\ \alpha+\beta\leq\zeta\end{subarray}}{e\choose\alpha}{o\choose\beta}\binom{n-e-% o}{\zeta-\alpha-\beta}u(e+x+\beta-\alpha)\right)\\ \forall(e,x,o)\in\mathcal{I}.start_ROW start_CELL 0 ≥ italic_w ( italic_o + italic_x ) - italic_ρ italic_w ( italic_e + italic_x ) + end_CELL end_ROW start_ROW start_CELL italic_ν ( ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) italic_u ( italic_e + italic_x ) - ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL 0 ≤ italic_α ≤ italic_e end_CELL end_ROW start_ROW start_CELL 0 ≤ italic_β ≤ italic_o end_CELL end_ROW start_ROW start_CELL italic_α + italic_β ≤ italic_ζ end_CELL end_ROW end_ARG end_POSTSUBSCRIPT ( binomial start_ARG italic_e end_ARG start_ARG italic_α end_ARG ) ( binomial start_ARG italic_o end_ARG start_ARG italic_β end_ARG ) ( FRACOP start_ARG italic_n - italic_e - italic_o end_ARG start_ARG italic_ζ - italic_α - italic_β end_ARG ) italic_u ( italic_e + italic_x + italic_β - italic_α ) ) end_CELL end_ROW start_ROW start_CELL ∀ ( italic_e , italic_x , italic_o ) ∈ caligraphic_I . end_CELL end_ROW (21)

The new objective222As an aside, the transformed program up to this point can be used to evaluate the performance of a specified utility rule. becomes 1/ρ1𝜌1/\rho1 / italic_ρ.

Finally, we let u0n𝑢subscriptsuperscript𝑛absent0u\in\mathbb{R}^{n}_{\geq 0}italic_u ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT become a decision variable in the program. Observe that every occurrence of u𝑢uitalic_u is multiplied by ν𝜈\nuitalic_ν, and every occurrence of ν𝜈\nuitalic_ν multiplies u𝑢uitalic_u. As such, we can define the new decision variable u=νusuperscript𝑢𝜈𝑢u^{\prime}=\nu uitalic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_ν italic_u and retrieve the linear program (Qζ𝜁\zetaitalic_ζ). ∎

The utility rule u^ζsubscript^𝑢𝜁\hat{u}_{\zeta}over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT that (Qζ𝜁\zetaitalic_ζ) provides us some guarantee on attainable performance from designing group decision-making in collaborative systems. However, it is not yet clear if these are the best possible utility rules. To understand what the best possible performance is of a collaborative system, we define the optimal k𝑘kitalic_k-strong price of anarchy as

SPoAk(𝒢n,w)=supu:[n]R0SPoAk(𝒢n,w,u).superscriptsubscriptSPoA𝑘subscript𝒢𝑛𝑤subscriptsupremum:𝑢delimited-[]𝑛subscript𝑅absent0subscriptSPoA𝑘subscript𝒢𝑛𝑤𝑢\mathrm{SPoA}_{k}^{\star}(\mathcal{G}_{n},w)=\sup_{u:[n]\rightarrow{R}_{\geq 0% }}\mathrm{SPoA}_{k}(\mathcal{G}_{n},w,u).roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w ) = roman_sup start_POSTSUBSCRIPT italic_u : [ italic_n ] → italic_R start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w , italic_u ) . (22)

This upper bound informs us of what efficiency is possible to hope for out of a collaborative system. In Proposition 5.3, we bound this quantity.

Proposition 5.3.

For the class of resource allocation problems 𝒢n×{w}subscript𝒢𝑛𝑤\mathcal{G}_{n}\times\{w\}caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT × { italic_w }, when agents maximize the optimal utility design objective usuperscript𝑢u^{\star}italic_u start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT,

SPoAk(𝒢n,w)1/Q(n,w,k),superscriptsubscriptSPoA𝑘subscript𝒢𝑛𝑤1superscript𝑄𝑛𝑤𝑘\mathrm{SPoA}_{k}^{\star}(\mathcal{G}_{n},w)\leq 1/Q^{\star}(n,w,k),roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w ) ≤ 1 / italic_Q start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_n , italic_w , italic_k ) , (23)

where Q(n,w,k)superscript𝑄𝑛𝑤𝑘Q^{\star}(n,w,k)italic_Q start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_n , italic_w , italic_k ) is value of the linear program

Q(n,w,k)=minρ0,{uζ0n+1}ζ[k]ρsuperscript𝑄𝑛𝑤𝑘subscript𝜌0subscriptsubscript𝑢𝜁subscriptsuperscript𝑛1absent0𝜁delimited-[]𝑘𝜌\displaystyle Q^{\star}(n,w,k)=\min_{\rho\geq 0,\{u_{\zeta}\in\mathbb{R}^{n+1}% _{\geq 0}\}_{\zeta\in[k]}}\leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \rhoitalic_Q start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_n , italic_w , italic_k ) = roman_min start_POSTSUBSCRIPT italic_ρ ≥ 0 , { italic_u start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n + 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_ζ ∈ [ italic_k ] end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_ρ
s.t.0w(o+x)ρw(e+x)+formulae-sequencest0𝑤𝑜𝑥limit-from𝜌𝑤𝑒𝑥\displaystyle{\rm s.t.}\hskip 8.0pt0\geq w(o+x)-\rho w(e+x)+roman_s . roman_t . 0 ≥ italic_w ( italic_o + italic_x ) - italic_ρ italic_w ( italic_e + italic_x ) +
ζ[k]((nζ)uζ(e+x)0αe0βoα+βζ(eα)(oβ)(neoζαβ)uζ(e+x+βα))subscript𝜁delimited-[]𝑘binomial𝑛𝜁subscript𝑢𝜁𝑒𝑥subscript0𝛼𝑒0𝛽𝑜𝛼𝛽𝜁binomial𝑒𝛼binomial𝑜𝛽binomial𝑛𝑒𝑜𝜁𝛼𝛽subscript𝑢𝜁𝑒𝑥𝛽𝛼\displaystyle\sum_{\zeta\in[k]}\left(\binom{n}{\zeta}u_{\zeta}(e+x)-\sum_{% \begin{subarray}{c}0\leq\alpha\leq e\\ 0\leq\beta\leq o\\ \alpha+\beta\leq\zeta\end{subarray}}{e\choose\alpha}{o\choose\beta}\binom{n-e-% o}{\zeta-\alpha-\beta}u_{\zeta}(e+x+\beta-\alpha)\right)∑ start_POSTSUBSCRIPT italic_ζ ∈ [ italic_k ] end_POSTSUBSCRIPT ( ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) italic_u start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ( italic_e + italic_x ) - ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL 0 ≤ italic_α ≤ italic_e end_CELL end_ROW start_ROW start_CELL 0 ≤ italic_β ≤ italic_o end_CELL end_ROW start_ROW start_CELL italic_α + italic_β ≤ italic_ζ end_CELL end_ROW end_ARG end_POSTSUBSCRIPT ( binomial start_ARG italic_e end_ARG start_ARG italic_α end_ARG ) ( binomial start_ARG italic_o end_ARG start_ARG italic_β end_ARG ) ( FRACOP start_ARG italic_n - italic_e - italic_o end_ARG start_ARG italic_ζ - italic_α - italic_β end_ARG ) italic_u start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ( italic_e + italic_x + italic_β - italic_α ) )
(e,x,o).for-all𝑒𝑥𝑜\displaystyle\hskip 140.0pt\forall(e,x,o)\in\mathcal{I}.∀ ( italic_e , italic_x , italic_o ) ∈ caligraphic_I . (Q[k]delimited-[]𝑘[k][ italic_k ])

The proof appears in the appendix.

Note that Theorem 5.2 provides a utility rule with associated performance guarantee which lower bounds SPoAk(𝒢n,w)superscriptsubscriptSPoA𝑘subscript𝒢𝑛𝑤\mathrm{SPoA}_{k}^{\star}(\mathcal{G}_{n},w)roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w ), and Proposition 5.3 provides an upper bound. In Corollary 2, we note that when these two bounds match, we have a tight bound on SPoAk(𝒢n,w)superscriptsubscriptSPoA𝑘subscript𝒢𝑛𝑤\mathrm{SPoA}_{k}^{\star}(\mathcal{G}_{n},w)roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w ) as well as an optimal utility rule.

Corollary 2.

For the class of resource allocation problems 𝒢n×{w}subscript𝒢𝑛𝑤\mathcal{G}_{n}\times\{w\}caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT × { italic_w }, if the value of (Qζ𝜁\zetaitalic_ζ) satisfies ρζ=Q(n,w,k)subscriptsuperscript𝜌𝜁superscript𝑄𝑛𝑤𝑘\rho^{\star}_{\zeta}=Q^{\star}(n,w,k)italic_ρ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT = italic_Q start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_n , italic_w , italic_k ), then SPoAk(𝒢n,w)=1/Q(n,w,k)superscriptsubscriptSPoA𝑘subscript𝒢𝑛𝑤1superscript𝑄𝑛𝑤𝑘\mathrm{SPoA}_{k}^{\star}(\mathcal{G}_{n},w)=1/Q^{\star}(n,w,k)roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w ) = 1 / italic_Q start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_n , italic_w , italic_k ) is a tight bound and a solution u~ζsubscript~𝑢𝜁\widetilde{u}_{\zeta}over~ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT to (Qζ𝜁\zetaitalic_ζ) is an optimal utility rule.

Proof.

This follows immediately from 1/ρζ=λζ1+μζ1superscriptsubscript𝜌𝜁subscript𝜆𝜁1subscript𝜇𝜁1/\rho_{\zeta}^{\star}=\frac{\lambda_{\zeta}}{1+\mu_{\zeta}}1 / italic_ρ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT = divide start_ARG italic_λ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_ARG start_ARG 1 + italic_μ start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_ARG being a lower bound on SPoAk(𝒢n,w)superscriptsubscriptSPoA𝑘subscript𝒢𝑛𝑤\mathrm{SPoA}_{k}^{\star}(\mathcal{G}_{n},w)roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w ) and the reciprocal of the value of (Q[k]delimited-[]𝑘[k][ italic_k ]), 1/Q1superscript𝑄1/Q^{\star}1 / italic_Q start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT being an upper bound. When the two match, the bound must be tight. ∎

The two bounds coinciding is not guaranteed but does occur at the extremes (k=1𝑘1k=1italic_k = 1 and k=n𝑘𝑛k=nitalic_k = italic_n); further, the gap between the two bounds (if present) is often small, and the lower bound attained by the utility rule constructed in Theorem 5.2 often demonstrates a significant improvement over the setting where agents simply optimize the system objective. Consider the four welfare functions from Fig. 3 again; for each, we find that the utility rule computed using Theorem 5.2 and the upper bound on SPoAk(𝒢n,w)superscriptsubscriptSPoA𝑘subscript𝒢𝑛𝑤\mathrm{SPoA}_{k}^{\star}(\mathcal{G}_{n},w)roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w ) using Proposition 5.3. In Fig. 5 we plot these lower and upper bounds on SPoAk(𝒢20,w)superscriptsubscriptSPoA𝑘subscript𝒢20𝑤\mathrm{SPoA}_{k}^{\star}(\mathcal{G}_{20},w)roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( caligraphic_G start_POSTSUBSCRIPT 20 end_POSTSUBSCRIPT , italic_w ) for each utility function and for each value of 1kn1𝑘𝑛1\leq k\leq n1 ≤ italic_k ≤ italic_n; these values are juxtaposed with the k𝑘kitalic_k-strong price of anarchy when agents optimize the system objective w𝑤witalic_w to demonstrate the possible gain in performance from designing the agents’ objective in collaborative systems.

6 Conclusion

In this work, we provided a variety of tools for evaluating the benefits and costs of collaborative communication in multi-agent systems. A collaborative multi-agent system was modeled by a common interest game where groups of players collaboratively perform their best responses simultaneously. We specifically considered the k𝑘kitalic_k-strong Nash equilibrium as a relevant equilibrium concept to gain insights into system behavior between the fully centralized and fully distributed settings. We introduced the notion of (λ,μ)𝜆𝜇(\lambda,\mu)( italic_λ , italic_μ )-k𝑘kitalic_k-coalitionally smooth systems and derived bounds on how well the k𝑘kitalic_k-strong Nash equilibrium approximates the optimum in such systems. Further analysis studied the running time of collaborative multi-agent decision dynamics and their transient performance, as well as the possible performance gains from designing agents’ objectives separately from the system objective. Finally, we underwent a more thorough study in the class of resource allocation games, in which we provided tractable linear programs whose solutions give tight bounds on the k𝑘kitalic_k-strong price of anarchy in resource allocation games. Future work will study less extensive communication paradigms and dynamical systems that emerge when agents learn together.

References

References

  • [1] S. Wollenstein-Betech, A. Houshmand, M. Salazar, M. Pavone, C. G. Cassandras, and I. C. Paschalidis, “Congestion-aware Routing and Rebalancing of Autonomous Mobility-on-Demand Systems in Mixed Traffic,” in 2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC), 2020, pp. 1–7.
  • [2] A. Khamis, A. Hussein, and A. Elmogy, “Multi-robot task allocation: A review of the state-of-the-art,” Cooperative robots and sensor networks 2015, pp. 31–51, 2015.
  • [3] V. Ranganathan, P. Kumar, U. Kaur, S. H. Li, T. Chakraborty, and R. Chandra, “Re-Inventing the Food Supply Chain with IoT: A Data-Driven Solution to Reduce Food Loss,” IEEE Internet of Things Magazine, vol. 5, no. 1, pp. 41–47, Mar. 2022.
  • [4] K. Tsakalozos, H. Kllapi, E. Sitaridi, M. Roussopoulos, D. Paparas, and A. Delis, “Flexible use of cloud resources through profit maximization and price discrimination,” in Proc. International Conference on Data Engineering, 2011, pp. 75–86.
  • [5] F. G. Filip, “Decision support and control for large-scale complex systems,” Annual Reviews in Control, vol. 32, no. 1, pp. 61–70, Apr. 2008.
  • [6] C. Daini, P. Goatin, M. L. D. Monache, and A. Ferrara, “Centralized Traffic Control via Small Fleets of Connected and Automated Vehicles,” in 2022 European Control Conference (ECC), Jul. 2022, pp. 371–376.
  • [7] L. Fang and H. Li, “Centralized resource allocation based on the cost–revenue analysis,” Computers & Industrial Engineering, vol. 85, pp. 395–401, Jul. 2015.
  • [8] G. Antonelli, “Interconnected dynamic systems: An overview on distributed control,” IEEE Control Systems Magazine, vol. 33, no. 1, pp. 76–88, 2013.
  • [9] J. R. Marden, Gü. Arslan, and J. S. Shamma, “Cooperative Control and Potential Games,” IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), vol. 39, no. 6, pp. 1393–1407, 2009.
  • [10] R. M. Murray, “Recent research in cooperative control of multivehicle systems,” Journal of Dynamic Systems, Measurement, and Control, vol. 129, no. 5, pp. 571–583, May 2007.
  • [11] A. Das, T. Gervet, J. Romoff, D. Batra, D. Parikh, M. Rabbat, and J. Pineau, “Tarmac: Targeted multi-agent communication,” in International Conference on Machine Learning.   PMLR, 2019, pp. 1538–1546.
  • [12] B. L. Ferguson, D. Paccagnan, and J. R. Marden, “The cost of informing decision-makers in multi-agent maximum coverage problems with random resource values,” IEEE Control Systems Letters, vol. 7, pp. 2928–2933, 2023.
  • [13] D. D. Šiljak and A. I. Zečević, “Control of large-scale systems: Beyond decentralized feedback,” Annual Reviews in Control, vol. 29, no. 2, pp. 169–179, Jan. 2005.
  • [14] Z. Xu and V. Tzoumas, “Resource-Aware Distributed Submodular Maximization: A Paradigm for Multi-Robot Decision-Making,” in 2022 IEEE 61st Conference on Decision and Control (CDC), Dec. 2022, pp. 5959–5966.
  • [15] G. Orosz, “Connected cruise control: Modelling, delay effects, and nonlinear behaviour,” Vehicle System Dynamics, vol. 54, no. 8, pp. 1147–1176, 2016.
  • [16] H. Nawaz, H. M. Ali, and A. A. Laghari, “UAV Communication Networks Issues: A Review,” Archives of Computational Methods in Engineering, vol. 28, no. 3, pp. 1349–1369, May 2021.
  • [17] A. Lazaridou and M. Baroni, “Emergent multi-agent communication in the deep learning era,” arXiv preprint arXiv:2006.02419, 2020.
  • [18] R. J. Aumann, “Acceptable points in general cooperative n-person games,” Contributions to the Theory of Games, vol. 4, pp. 287–324, 1959.
  • [19] R. Nessah and G. Tian, “On the existence of strong Nash equilibria,” Journal of Mathematical Analysis and Applications, vol. 414, no. 2, pp. 871–885, 2014.
  • [20] N. Gatti, M. Rocco, and T. Sandholm, “On the verification and computation of strong Nash equilibrium,” arXiv preprint arXiv:1711.06318, 2017.
  • [21] R. Holzman and N. Law-Yone, “Strong equilibrium in congestion games,” Games and economic behavior, vol. 21, no. 1-2, pp. 85–101, 1997.
  • [22] T. Harks, M. Klimm, and R. H. Möhring, “Strong Nash equilibria in games with the lexicographical improvement property,” in Internet and Network Economics: 5th International Workshop, WINE 2009, Rome, Italy, December 14-18, 2009. Proceedings 5.   Springer, 2009, pp. 463–470.
  • [23] J. B. Clempner and A. S. Poznyak, “Finding the strong nash equilibrium: Computation, existence and characterization for markov games,” Journal of Optimization Theory and Applications, vol. 186, pp. 1029–1052, 2020.
  • [24] A. Epstein, M. Feldman, and Y. Mansour, “Strong equilibrium in cost sharing connection games,” in Proceedings of the 8th ACM Conference on Electronic Commerce, ser. EC ’07.   New York, NY, USA: Association for Computing Machinery, Jun. 2007, pp. 84–92.
  • [25] N. Andelman, M. Feldman, and Y. Mansour, “Strong price of anarchy,” Games and Economic Behavior, vol. 65, no. 2, pp. 289–317, Mar. 2009.
  • [26] J. Barreiro-Gomez, G. Obando, and N. Quijano, “Distributed Population Dynamics: Optimization and Control Applications,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, pp. 1–11, 2016.
  • [27] A. Fiat, H. Kaplan, M. Levy, and S. Olonetsky, “Strong price of anarchy for machine load balancing,” in ICALP, vol. 4596.   Springer, 2007, pp. 583–594.
  • [28] L. Epstein and R. van Stee, “The price of anarchy on uniformly related machines revisited,” Information and Computation, vol. 212, pp. 37–54, 2012.
  • [29] S. Chien and A. Sinclair, “Strong and pareto price of anarchy in congestion games.” in ICALP (1).   Citeseer, 2009, pp. 279–291.
  • [30] Y. Bachrach, V. Syrgkanis, É. Tardos, and M. Vojnović, “Strong Price of Anarchy, Utility Games and Coalitional Dynamics,” in Algorithmic Game Theory, ser. Lecture Notes in Computer Science, R. Lavi, Ed.   Berlin, Heidelberg: Springer, 2014, pp. 218–230.
  • [31] M. Feldman and O. Friedler, “A unified framework for strong price of anarchy in clustering games,” in Automata, Languages, and Programming: 42nd International Colloquium, ICALP 2015, Kyoto, Japan, July 6-10, 2015, Proceedings, Part II.   Springer, 2015, pp. 601–613.
  • [32] T. Roughgarden, “Intrinsic robustness of the price of anarchy,” Communications of the ACM, vol. 55, no. 7, pp. 116–123, 2012.
  • [33] M. Gairing, “Covering games: Approximation through non-cooperation,” in Internet and Economics, 2009, pp. 184–195.
  • [34] V. Bilò and C. Vinci, “Dynamic taxes for polynomial congestion games,” in EC 2016 - Proceedings of the 2016 ACM Conference on Economics and Computation.   New York, New York, USA: ACM Press, 2016, pp. 839–856.
  • [35] D. Paccagnan, R. Chandan, and J. R. Marden, “Utility Design for Distributed Resource Allocation—Part I: Characterizing and Optimizing the Exact Price of Anarchy,” IEEE Transactions on Automatic Control, vol. 65, no. 11, pp. 4616–4631, Nov. 2020.
  • [36] R. Zhang, Y. Zhang, R. Konda, B. Ferguson, J. Marden, and N. Li, “Markov Games with Decoupled Dynamics: Price of Anarchy and Sample Complexity,” Apr. 2023.
  • [37] R. Konda, R. Chandan, D. Grimsman, and J. R. Marden, “Optimal Design of Best Response Dynamics in Resource Allocation Games,” Apr. 2022.
  • [38] B. L. Ferguson and J. R. Marden, “Robust Utility Design in Distributed Resource Allocation Problems with Defective Agents,” Dynamic Games and Applications, pp. 1–23, Aug. 2022.
  • [39] W. Saad, Z. Han, M. Debbah, A. Hjorungnes, and T. Basar, “Coalitional game theory for communication networks,” Ieee signal processing magazine, vol. 26, no. 5, pp. 77–97, 2009.
  • [40] H. Bayram and H. I. Bozma, “Multirobot communication network topology via centralized pairwise games,” in 2013 IEEE International Conference on Robotics and Automation.   IEEE, 2013, pp. 2521–2526.
  • [41] D. Cappello and T. Mylvaganam, “Distributed differential games for control of multi-agent systems,” IEEE Transactions on Control of Network Systems, vol. 9, no. 2, pp. 635–646, 2021.
  • [42] A. Vetta, “Nash equilibria in competitive societies, with applications to facility location, traffic routing and auctions,” The 43rd Annual IEEE Symposium on Foundations of Computer Science, 2002. Proceedings., pp. 416–425, 2002.
  • [43] S. Durand and B. Gaujal, “Complexity and Optimality of the Best Response Algorithm in Random Potential Games,” in Algorithmic Game Theory, M. Gairing and R. Savani, Eds.   Berlin, Heidelberg: Springer, 2016, pp. 40–51.
  • [44] Y. Giannakopoulos, “A Smoothed FPTAS for Equilibria in Congestion Games,” Jul. 2023.

Proof of Proposition 2.1: To show existence, we can simply observe that aoptargmaxa𝒜W(a)superscript𝑎optsubscriptargmax𝑎𝒜𝑊𝑎a^{\rm opt}\in\operatorname*{arg\,max}_{a\in\mathcal{A}}W(a)italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ∈ start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_W ( italic_a ) is a k𝑘kitalic_k-strong Nash equilibrium for any k[n]𝑘delimited-[]𝑛k\in[n]italic_k ∈ [ italic_n ]. Because W(aopt)W(a)𝑊superscript𝑎opt𝑊superscript𝑎W(a^{\rm opt})\geq W(a^{\prime})italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) ≥ italic_W ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) for all a𝒜superscript𝑎𝒜a^{\prime}\in\mathcal{A}italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A, the global optimal satisfies W(aopt)W(aΓ,aΓopt),aΓ𝒜Γ,Γ𝒞[k]formulae-sequence𝑊superscript𝑎opt𝑊superscriptsubscript𝑎Γsubscriptsuperscript𝑎optΓformulae-sequencefor-allsuperscriptsubscript𝑎Γsubscript𝒜ΓΓsubscript𝒞delimited-[]𝑘W(a^{\rm opt})\geq W(a_{\Gamma}^{\prime},a^{\rm opt}_{-\Gamma}),\leavevmode% \nobreak\ \forall a_{\Gamma}^{\prime}\in\mathcal{A}_{\Gamma},\leavevmode% \nobreak\ \Gamma\in\mathcal{C}_{[k]}italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) ≥ italic_W ( italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT ) , ∀ italic_a start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT , roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT [ italic_k ] end_POSTSUBSCRIPT. ∎

Proof of Theorem 3.3: The proof can be outlined in four parts: first, the problem of finding SPoAk(𝒢n,w)subscriptSPoA𝑘subscript𝒢𝑛𝑤\mathrm{SPoA}_{k}(\mathcal{G}_{n},w)roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w ) is transformed and relaxed; second, the parameterization used in the proof of Proposition 3.2 is used to turn the relaxed problem into a linear program. Next, an example is constructed to show the linear program provides a tight bound. Finally, we take the dual of said linear program.

Refer to caption
Figure 6: Game construction for worst-case k𝑘kitalic_k-strong price of anarchy. Three of the n𝑛nitalic_n players’ action sets are shown (color-coded in red, green, and blue, respectively) on three of n!𝑛n!italic_n ! rings for the label (e,x,o)=(2,1,1)𝑒𝑥𝑜211(e,x,o)=(2,1,1)( italic_e , italic_x , italic_o ) = ( 2 , 1 , 1 ). A ring has n𝑛nitalic_n positions, one for each player. For a label (e,x,o)𝑒𝑥𝑜(e,x,o)( italic_e , italic_x , italic_o ) we generate n!𝑛n!italic_n ! rings for all the orderings of players over positions. This is repeated for each label. Players still only have two actions, but each action covers resources from each ring. The value of a resource is equal to the value of θsuperscript𝜃\theta^{\star}italic_θ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT, a solution to (D), for the label with which it is associated.
maxθ0||subscript𝜃subscriptsuperscriptabsent0\displaystyle\max_{\theta\in\mathbb{R}^{|\mathcal{I}|}_{\geq 0}}\quadroman_max start_POSTSUBSCRIPT italic_θ ∈ blackboard_R start_POSTSUPERSCRIPT | caligraphic_I | end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT e,x,ow(o+x)θ(e,x,o)subscript𝑒𝑥𝑜𝑤𝑜𝑥𝜃𝑒𝑥𝑜\displaystyle\sum_{e,x,o}w(o+x)\theta(e,x,o)∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT italic_w ( italic_o + italic_x ) italic_θ ( italic_e , italic_x , italic_o )
s.t.formulae-sequencest\displaystyle\leavevmode\nobreak\ \leavevmode\nobreak\ {\rm s.t.}roman_s . roman_t . e,x,o((nζ)w(e+x)0αe0βoα+βζ(eα)(oβ)(neoζαβ)w(e+x+βα))θ(e,x,o)0subscript𝑒𝑥𝑜binomial𝑛𝜁𝑤𝑒𝑥subscript0𝛼𝑒0𝛽𝑜𝛼𝛽𝜁binomial𝑒𝛼binomial𝑜𝛽binomial𝑛𝑒𝑜𝜁𝛼𝛽𝑤𝑒𝑥𝛽𝛼𝜃𝑒𝑥𝑜0\displaystyle\sum_{e,x,o}\left(\binom{n}{\zeta}w(e+x)-\sum_{\begin{subarray}{c% }0\leq\alpha\leq e\\ 0\leq\beta\leq o\\ \alpha+\beta\leq\zeta\end{subarray}}{e\choose\alpha}{o\choose\beta}\binom{n-e-% o}{\zeta-\alpha-\beta}w(e+x+\beta-\alpha)\right)\theta(e,x,o)\geq 0∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT ( ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) italic_w ( italic_e + italic_x ) - ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL 0 ≤ italic_α ≤ italic_e end_CELL end_ROW start_ROW start_CELL 0 ≤ italic_β ≤ italic_o end_CELL end_ROW start_ROW start_CELL italic_α + italic_β ≤ italic_ζ end_CELL end_ROW end_ARG end_POSTSUBSCRIPT ( binomial start_ARG italic_e end_ARG start_ARG italic_α end_ARG ) ( binomial start_ARG italic_o end_ARG start_ARG italic_β end_ARG ) ( FRACOP start_ARG italic_n - italic_e - italic_o end_ARG start_ARG italic_ζ - italic_α - italic_β end_ARG ) italic_w ( italic_e + italic_x + italic_β - italic_α ) ) italic_θ ( italic_e , italic_x , italic_o ) ≥ 0
ζ{1,,k}for-all𝜁1𝑘\displaystyle\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode% \nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode% \nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode% \nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode% \nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode% \nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode% \nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode% \nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode% \nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode% \nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode% \nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode% \nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode% \nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode% \nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \forall\zeta\in% \{1,\ldots,k\}∀ italic_ζ ∈ { 1 , … , italic_k }
e,x,ow(e+x)θ(e,x,o)=1subscript𝑒𝑥𝑜𝑤𝑒𝑥𝜃𝑒𝑥𝑜1\displaystyle\sum_{e,x,o}w(e+x)\theta(e,x,o)=1∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT italic_w ( italic_e + italic_x ) italic_θ ( italic_e , italic_x , italic_o ) = 1 (D)

1) Relaxing the problem: Quantifying SPoAk(𝒢n,w)subscriptSPoA𝑘subscript𝒢𝑛𝑤\mathrm{SPoA}_{k}(\mathcal{G}_{n},w)roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w ) can be expressed as taking the minimum k𝑘kitalic_k-strong price of anarchy over all games in 𝒢nsubscript𝒢𝑛\mathcal{G}_{n}caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, i.e.,

minG𝒢nminakSNEkSNE(G)W(akSNE)maxaopt𝒜W(aopt)𝐺subscript𝒢𝑛minsubscriptsuperscript𝑎𝑘SNE𝑘SNE𝐺𝑊superscript𝑎𝑘SNEsubscriptsuperscript𝑎opt𝒜𝑊superscript𝑎opt\displaystyle\underset{\displaystyle\scriptstyle G\in\mathcal{G}_{n}}{\mathrm{% min}}\quad\frac{\min_{a^{k{\rm SNE}}\in k{\rm SNE}(G)}W(a^{k{\rm SNE}})}{\max_% {a^{\rm opt}\in\mathcal{A}}W(a^{\rm opt})}\hfil\hfil\hfil\hfilstart_UNDERACCENT italic_G ∈ caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_UNDERACCENT start_ARG roman_min end_ARG divide start_ARG roman_min start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ∈ italic_k roman_SNE ( italic_G ) end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ) end_ARG start_ARG roman_max start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) end_ARG (D1)

To make this problem more approachable, we introduce several transformations and relaxations. First, rather than searching over the entire set of game 𝒢nsubscript𝒢𝑛\mathcal{G}_{n}caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, we search over the set of games 𝒢^nsubscript^𝒢𝑛\hat{\mathcal{G}}_{n}over^ start_ARG caligraphic_G end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, in which each agent has exactly two actions. This reduction of the search space can be done without loss of generality, i.e., SPoAk(𝒢n,w)=SPoAk(𝒢^n,w)subscriptSPoA𝑘subscript𝒢𝑛𝑤subscriptSPoA𝑘subscript^𝒢𝑛𝑤\mathrm{SPoA}_{k}(\mathcal{G}_{n},w)=\mathrm{SPoA}_{k}(\hat{\mathcal{G}}_{n},w)roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w ) = roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( over^ start_ARG caligraphic_G end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w ). Trivially, 𝒢^n𝒢nsubscript^𝒢𝑛subscript𝒢𝑛\hat{\mathcal{G}}_{n}\subset\mathcal{G}_{n}over^ start_ARG caligraphic_G end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⊂ caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. Further, consider any game G𝒢n𝐺subscript𝒢𝑛G\in\mathcal{G}_{n}italic_G ∈ caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT; if for every player, each of their actions is removed except their action in the optimal allocation aioptsubscriptsuperscript𝑎opt𝑖a^{\rm opt}_{i}italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and their action in their worst k𝑘kitalic_k-strong Nash equilibrium aikSNEsubscriptsuperscript𝑎𝑘SNE𝑖a^{k{\rm SNE}}_{i}italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, the new problem will maintain the same k𝑘kitalic_k-strong price of anarchy, but will now exist in 𝒢^nsubscript^𝒢𝑛\hat{\mathcal{G}}_{n}over^ start_ARG caligraphic_G end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. With this reduction, we will denote each player’s action set as 𝒜^i={aiopt,aikSNE}subscript^𝒜𝑖subscriptsuperscript𝑎opt𝑖subscriptsuperscript𝑎𝑘SNE𝑖\hat{\mathcal{A}}_{i}=\{a^{\rm opt}_{i},a^{k{\rm SNE}}_{i}\}over^ start_ARG caligraphic_A end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT }. Second, we normalize each resource value vrsubscript𝑣𝑟v_{r}italic_v start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT such that the equilibrium welfare is one. This, too, can be done without loss of generality by scaling each resource identically, thus not altering the SPoASPoA\mathrm{SPoA}roman_SPoA ratio. Third, we invert the objective and consider the maximization of W(aopt)/W(akSNE)𝑊superscript𝑎opt𝑊superscript𝑎𝑘SNEW(a^{\rm opt})/W(a^{k{\rm SNE}})italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) / italic_W ( italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ). Finally, we sum over each of the k𝑘kitalic_k-coalition equilibrium constraints. For each ζ[k]𝜁delimited-[]𝑘\zeta\in[k]italic_ζ ∈ [ italic_k ], rather than satisfying each inequality in (3), sum over every combination of the ζ𝜁\zetaitalic_ζ out of n𝑛nitalic_n players, denoted 𝒞ζsubscript𝒞𝜁\mathcal{C}_{\zeta}caligraphic_C start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT. Applying these reductions to (D1) gives,

maxG𝒢^nW(aopt)𝐺subscript^𝒢𝑛max𝑊superscript𝑎opt\displaystyle\underset{\displaystyle\scriptstyle G\in\hat{\mathcal{G}}_{n}}{% \mathrm{max}}\quad W(a^{\rm opt})\hfil\hfil\hfil\hfilstart_UNDERACCENT italic_G ∈ over^ start_ARG caligraphic_G end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_UNDERACCENT start_ARG roman_max end_ARG italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) (D2)
s.t.formulae-sequencest\displaystyle\mathmakebox[width("$\underset{\displaystyle\phantom{\scriptstyle G% \in\hat{\mathcal{G}}_{n}}}{\mathrm{max}}$")][c]{\mathmakebox[width("$\mathrm{% max}$")][l]{\mathrm{\kern 1.00006pts.t.}}}\quadroman_s . roman_t . (nζ)W(akSNE)Γ𝒞ζW(aΓopt,aΓkSNE),ζ[k]formulae-sequencebinomial𝑛𝜁𝑊superscript𝑎𝑘SNEsubscriptΓsubscript𝒞𝜁𝑊subscriptsuperscript𝑎optΓsubscriptsuperscript𝑎𝑘SNEΓfor-all𝜁delimited-[]𝑘\displaystyle\binom{n}{\zeta}W(a^{k{\rm SNE}})\geq\sum_{\Gamma\in\mathcal{C}_{% \zeta}}W(a^{\rm opt}_{\Gamma},a^{k{\rm SNE}}_{-\Gamma}),\leavevmode\nobreak\ % \forall\zeta\in[k]\hfil\hfil( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) italic_W ( italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ) ≥ ∑ start_POSTSUBSCRIPT roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT ) , ∀ italic_ζ ∈ [ italic_k ]
W(akSNE)=1𝑊superscript𝑎𝑘SNE1\displaystyle W(a^{k{\rm SNE}})=1italic_W ( italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ) = 1

(D2) provides a lower bound on SPoAk(𝒢n,w)subscriptSPoA𝑘subscript𝒢𝑛𝑤\mathrm{SPoA}_{k}(\mathcal{G}_{n},w)roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w ) as the feasible set was expanded. Later, we will show that the bound is tight by constructing an example that realizes it.

2) Parameterization: We use the parameterization introduced in the proof of Proposition 3.2 with respect to the joint actions a=akSNE𝑎superscript𝑎𝑘SNEa=a^{k{\rm SNE}}italic_a = italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT and a=aoptsuperscript𝑎superscript𝑎opta^{\prime}=a^{\rm opt}italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT. By considering any θ0||𝜃superscriptsubscriptabsent0\theta\in\mathbb{R}_{\geq 0}^{|\mathcal{I}|}italic_θ ∈ blackboard_R start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT | caligraphic_I | end_POSTSUPERSCRIPT, we can parameterize any game G𝒢^n𝐺subscript^𝒢𝑛G\in\hat{\mathcal{G}}_{n}italic_G ∈ over^ start_ARG caligraphic_G end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT; to find the worst-case price of anarchy, we search over all such parameters, i.e., look over the entire class of games. The linear program (D) is the result of the search for the vector θ𝜃\thetaitalic_θ that results in the highest price of anarchy.

3) Constructing an example: Consider the following resource allocation problem: for each label (e,x,o)𝑒𝑥𝑜(e,x,o)\in\mathcal{I}( italic_e , italic_x , italic_o ) ∈ caligraphic_I and permutation of the n𝑛nitalic_n player σΣn𝜎subscriptΣ𝑛\sigma\in{\Sigma}_{n}italic_σ ∈ roman_Σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, define a ring of n𝑛nitalic_n resources. Total, there are nn!||𝑛𝑛nn!|\mathcal{I}|italic_n italic_n ! | caligraphic_I | resources. Let ri,j(e,x,o)superscriptsubscript𝑟𝑖𝑗𝑒𝑥𝑜r_{i,j}^{(e,x,o)}italic_r start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_e , italic_x , italic_o ) end_POSTSUPERSCRIPT denote the resource with label (e,x,o)𝑒𝑥𝑜(e,x,o)( italic_e , italic_x , italic_o ) at position i𝑖iitalic_i in the j𝑗jitalic_jth ring. Consider, for instance, the n!𝑛n!italic_n ! rings associated with the label (e,x,o)=(2,1,1)𝑒𝑥𝑜211(e,x,o)=(2,1,1)( italic_e , italic_x , italic_o ) = ( 2 , 1 , 1 ) as depicted in Fig. 6. We will construct the actions aikSNEsubscriptsuperscript𝑎𝑘SNE𝑖a^{k{\rm SNE}}_{i}italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and aioptsubscriptsuperscript𝑎opt𝑖a^{\rm opt}_{i}italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT so that for each resource in these rings, e+x=3𝑒𝑥3e+x=3italic_e + italic_x = 3 agents have it in only their equilibrium action, and x+o=2𝑥𝑜2x+o=2italic_x + italic_o = 2 agents have it only in their optimal action. In the first ring (with the monotonic permutation σ=(1,2,3,,n)𝜎123𝑛\sigma=(1,2,3,\ldots,n)italic_σ = ( 1 , 2 , 3 , … , italic_n )), agent i𝑖iitalic_i has actions aikSNE={ri,1(2,1,1),ri+1%n,1(2,1,1),ri+2%n,1(2,1,1)}subscriptsuperscript𝑎𝑘SNE𝑖subscriptsuperscript𝑟211𝑖1subscriptsuperscript𝑟211𝑖percent1𝑛1subscriptsuperscript𝑟211𝑖percent2𝑛1a^{k{\rm SNE}}_{i}=\{r^{(2,1,1)}_{i,1},r^{(2,1,1)}_{i+1\%n,1},r^{(2,1,1)}_{i+2% \%n,1}\}italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { italic_r start_POSTSUPERSCRIPT ( 2 , 1 , 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT , italic_r start_POSTSUPERSCRIPT ( 2 , 1 , 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i + 1 % italic_n , 1 end_POSTSUBSCRIPT , italic_r start_POSTSUPERSCRIPT ( 2 , 1 , 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i + 2 % italic_n , 1 end_POSTSUBSCRIPT } and aiopt={ri+2,1(2,1,1),ri+3%n,1(2,1,1)}subscriptsuperscript𝑎opt𝑖subscriptsuperscript𝑟211𝑖21subscriptsuperscript𝑟211𝑖percent3𝑛1a^{\rm opt}_{i}=\{r^{(2,1,1)}_{i+2,1},r^{(2,1,1)}_{i+3\%n,1}\}italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { italic_r start_POSTSUPERSCRIPT ( 2 , 1 , 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i + 2 , 1 end_POSTSUBSCRIPT , italic_r start_POSTSUPERSCRIPT ( 2 , 1 , 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i + 3 % italic_n , 1 end_POSTSUBSCRIPT }, where %percent\%% denotes the modulo operator so the selected resources wrap around the ring. This pattern continues for each ring j[n!]𝑗delimited-[]𝑛j\in[n!]italic_j ∈ [ italic_n ! ] with a different permutation of players σΣn𝜎subscriptΣ𝑛\sigma\in{\Sigma}_{n}italic_σ ∈ roman_Σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. At a ring with label (e,x,o)𝑒𝑥𝑜(e,x,o)( italic_e , italic_x , italic_o ) and permutation σ𝜎\sigmaitalic_σ, player i𝑖iitalic_i has the actions aikSNE={rσ(i),j(e,x,o),,rσ(i)+e+x1%n,j(e,x,o)}subscriptsuperscript𝑎𝑘SNE𝑖subscriptsuperscript𝑟𝑒𝑥𝑜𝜎𝑖𝑗subscriptsuperscript𝑟𝑒𝑥𝑜𝜎𝑖𝑒𝑥percent1𝑛𝑗a^{k{\rm SNE}}_{i}=\{r^{(e,x,o)}_{\sigma(i),j},\ldots,r^{(e,x,o)}_{\sigma(i)+e% +x-1\%n,j}\}italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { italic_r start_POSTSUPERSCRIPT ( italic_e , italic_x , italic_o ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ ( italic_i ) , italic_j end_POSTSUBSCRIPT , … , italic_r start_POSTSUPERSCRIPT ( italic_e , italic_x , italic_o ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ ( italic_i ) + italic_e + italic_x - 1 % italic_n , italic_j end_POSTSUBSCRIPT } and aiopt={rσ(i)+e%n,j(e,x,o),,rσ(i)+e+x+o1%n,j(e,x,o)}subscriptsuperscript𝑎opt𝑖subscriptsuperscript𝑟𝑒𝑥𝑜𝜎𝑖percent𝑒𝑛𝑗subscriptsuperscript𝑟𝑒𝑥𝑜𝜎𝑖𝑒𝑥𝑜percent1𝑛𝑗a^{\rm opt}_{i}=\{r^{(e,x,o)}_{\sigma(i)+e\%n,j},\ldots,r^{(e,x,o)}_{\sigma(i)% +e+x+o-1\%n,j}\}italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { italic_r start_POSTSUPERSCRIPT ( italic_e , italic_x , italic_o ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ ( italic_i ) + italic_e % italic_n , italic_j end_POSTSUBSCRIPT , … , italic_r start_POSTSUPERSCRIPT ( italic_e , italic_x , italic_o ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ ( italic_i ) + italic_e + italic_x + italic_o - 1 % italic_n , italic_j end_POSTSUBSCRIPT }. Finally, each resource of type (e,x,o)𝑒𝑥𝑜(e,x,o)( italic_e , italic_x , italic_o ) has a value θ(e,x,o)𝜃𝑒𝑥𝑜\theta(e,x,o)italic_θ ( italic_e , italic_x , italic_o ) where θ𝜃\thetaitalic_θ is a fixed parameter. The function which encodes the welfare from player overlap is w𝑤witalic_w.

In the joint action akSNEsuperscript𝑎𝑘SNEa^{k{\rm SNE}}italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT, each resource is covered by exactly e+x𝑒𝑥e+xitalic_e + italic_x agents, and the system welfare can be written

W(akSNE)=e,x,onn!θ(e,x,o)w(e+x).𝑊superscript𝑎𝑘SNEsubscript𝑒𝑥𝑜𝑛𝑛𝜃𝑒𝑥𝑜𝑤𝑒𝑥W(a^{k{\rm SNE}})=\sum_{e,x,o}nn!\theta(e,x,o)w(e+x).italic_W ( italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT italic_n italic_n ! italic_θ ( italic_e , italic_x , italic_o ) italic_w ( italic_e + italic_x ) . (24)

Similarly, joint action aoptsuperscript𝑎opta^{\rm opt}italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT satisfies

W(aopt)=e,x,onn!θ(e,x,o)w(o+x).𝑊superscript𝑎optsubscript𝑒𝑥𝑜𝑛𝑛𝜃𝑒𝑥𝑜𝑤𝑜𝑥W(a^{\rm opt})=\sum_{e,x,o}nn!\theta(e,x,o)w(o+x).italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT italic_n italic_n ! italic_θ ( italic_e , italic_x , italic_o ) italic_w ( italic_o + italic_x ) . (25)

Now, consider a coalition Γ𝒞[k]Γsubscript𝒞delimited-[]𝑘\Gamma\in\mathcal{C}_{[k]}roman_Γ ∈ caligraphic_C start_POSTSUBSCRIPT [ italic_k ] end_POSTSUBSCRIPT and denote by ζ𝜁\zetaitalic_ζ its cardinality. The system welfare of this group deviating their action to aΓoptsubscriptsuperscript𝑎optΓa^{\rm opt}_{\Gamma}italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT is

W(aΓopt,aΓkSNE)=e,x,oj=1n!i=1nθ(e,x,o)w(|aΓopt,aΓkSNE|r)\displaystyle W(a^{\rm opt}_{\Gamma},a^{k{\rm SNE}}_{-\Gamma})=\sum_{e,x,o}% \sum_{j=1}^{n!}\sum_{i=1}^{n}\theta(e,x,o)w(|a^{\rm opt}_{\Gamma},a^{k{\rm SNE% }}_{-\Gamma}|_{r})italic_W ( italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n ! end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_θ ( italic_e , italic_x , italic_o ) italic_w ( | italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - roman_Γ end_POSTSUBSCRIPT | start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT )
=e,x,oθ(e,x,o)0αe0βoα+βζnn!(eα)(oβ)(neoζαβ)w(e+x+βα)absentsubscript𝑒𝑥𝑜𝜃𝑒𝑥𝑜subscript0𝛼𝑒0𝛽𝑜𝛼𝛽𝜁𝑛𝑛binomial𝑒𝛼binomial𝑜𝛽binomial𝑛𝑒𝑜𝜁𝛼𝛽𝑤𝑒𝑥𝛽𝛼\displaystyle=\sum_{e,x,o}\theta(e,x,o)\sum_{\begin{subarray}{c}0\leq\alpha% \leq e\\ 0\leq\beta\leq o\\ \alpha+\beta\leq\zeta\end{subarray}}nn!{e\choose\alpha}{o\choose\beta}\binom{n% -e-o}{\zeta-\alpha-\beta}w(e+x+\beta-\alpha)= ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT italic_θ ( italic_e , italic_x , italic_o ) ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL 0 ≤ italic_α ≤ italic_e end_CELL end_ROW start_ROW start_CELL 0 ≤ italic_β ≤ italic_o end_CELL end_ROW start_ROW start_CELL italic_α + italic_β ≤ italic_ζ end_CELL end_ROW end_ARG end_POSTSUBSCRIPT italic_n italic_n ! ( binomial start_ARG italic_e end_ARG start_ARG italic_α end_ARG ) ( binomial start_ARG italic_o end_ARG start_ARG italic_β end_ARG ) ( FRACOP start_ARG italic_n - italic_e - italic_o end_ARG start_ARG italic_ζ - italic_α - italic_β end_ARG ) italic_w ( italic_e + italic_x + italic_β - italic_α ) (26)

where we let r𝑟ritalic_r be the shorthand for ri,j(e,x,o)superscriptsubscript𝑟𝑖𝑗𝑒𝑥𝑜r_{i,j}^{(e,x,o)}italic_r start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_e , italic_x , italic_o ) end_POSTSUPERSCRIPT. The second equality holds by defining α𝛼\alphaitalic_α and β𝛽\betaitalic_β as the number of players in ΓΓ\Gammaroman_Γ who invested in resource r𝑟ritalic_r exclusively in their action akSNEsuperscript𝑎𝑘SNEa^{k{\rm SNE}}italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT or aoptsuperscript𝑎opta^{\rm opt}italic_a start_POSTSUPERSCRIPT roman_opt end_POSTSUPERSCRIPT respectively. By counting arguments, there are exactly (eα)(oβ)(neoζαβ)binomial𝑒𝛼binomial𝑜𝛽binomial𝑛𝑒𝑜𝜁𝛼𝛽\binom{e}{\alpha}\binom{o}{\beta}\binom{n-e-o}{\zeta-\alpha-\beta}( FRACOP start_ARG italic_e end_ARG start_ARG italic_α end_ARG ) ( FRACOP start_ARG italic_o end_ARG start_ARG italic_β end_ARG ) ( FRACOP start_ARG italic_n - italic_e - italic_o end_ARG start_ARG italic_ζ - italic_α - italic_β end_ARG ) positions for the players in ΓΓ\Gammaroman_Γ which yield the profile (α,β)𝛼𝛽(\alpha,\beta)( italic_α , italic_β ) for a resource at some fixed position in the ring, there are ζ!𝜁\zeta!italic_ζ ! ways to order the players in ΓΓ\Gammaroman_Γ, (nζ)!𝑛𝜁(n-\zeta)!( italic_n - italic_ζ ) ! ways to order the players not in ΓΓ\Gammaroman_Γ, and n𝑛nitalic_n resource in each ring.

Verifying akSNEsuperscript𝑎𝑘SNEa^{k{\rm SNE}}italic_a start_POSTSUPERSCRIPT italic_k roman_SNE end_POSTSUPERSCRIPT is a k𝑘kitalic_k-strong Nash equilibrium boils down to showing (24) is greater than or equal to (26). We can see that this holds whenever θ𝜃\thetaitalic_θ is a feasible point in (D). Accordingly, the k𝑘kitalic_k-strong price of anarchy satisfies

1QSPoAk(𝒢n,w)1e,x,oθ(e,x,o)w(o+x),1superscript𝑄subscriptSPoA𝑘subscript𝒢𝑛𝑤1subscript𝑒𝑥𝑜𝜃𝑒𝑥𝑜𝑤𝑜𝑥\frac{1}{Q^{\star}}\leq\mathrm{SPoA}_{k}(\mathcal{G}_{n},w)\leq\frac{1}{\sum_{% e,x,o}\theta(e,x,o)w(o+x)},divide start_ARG 1 end_ARG start_ARG italic_Q start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_ARG ≤ roman_SPoA start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( caligraphic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_w ) ≤ divide start_ARG 1 end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_e , italic_x , italic_o end_POSTSUBSCRIPT italic_θ ( italic_e , italic_x , italic_o ) italic_w ( italic_o + italic_x ) end_ARG , (27)

where the first inequality holds from the reductions made in part 1, and the second holds as the k𝑘kitalic_k-strong price of anarchy is upper bounded by any particular problem; comparing (24) and (25) gives the final expression. Letting θ𝜃\thetaitalic_θ take on the solution to (D) shows the bound is tight.

4) Taking the Dual: Before considering the dual program to (D), we first show that the primal is feasible. It is easy to verify the feasible set is non-empty by considering the point θ(1,0,0)=1/w(1)𝜃1001𝑤1\theta(1,0,0)=1/w(1)italic_θ ( 1 , 0 , 0 ) = 1 / italic_w ( 1 ) and zero otherwise. Now, we must show that the feasible set is compact, and thus, the value of (D) is bounded. From the equality constraint, we can obtain

1miny>0w(y)e,x,oe+x>0θ(e,x,o).1subscript𝑦0𝑤𝑦subscript𝑒𝑥𝑜𝑒𝑥0𝜃𝑒𝑥𝑜1\geq\min_{y>0}w(y)\sum_{\begin{subarray}{c}e,x,o\\ e+x>0\end{subarray}}\theta(e,x,o).1 ≥ roman_min start_POSTSUBSCRIPT italic_y > 0 end_POSTSUBSCRIPT italic_w ( italic_y ) ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_e , italic_x , italic_o end_CELL end_ROW start_ROW start_CELL italic_e + italic_x > 0 end_CELL end_ROW end_ARG end_POSTSUBSCRIPT italic_θ ( italic_e , italic_x , italic_o ) .

Because we assume w(y)>0𝑤𝑦0w(y)>0italic_w ( italic_y ) > 0 for all y>0𝑦0y>0italic_y > 0, we show that each value of θ(e,x,o)𝜃𝑒𝑥𝑜\theta(e,x,o)italic_θ ( italic_e , italic_x , italic_o ) such that e+x>0𝑒𝑥0e+x>0italic_e + italic_x > 0 is bounded. For the remaining values of θ(0,0,o)𝜃00𝑜\theta(0,0,o)italic_θ ( 0 , 0 , italic_o ), consider the equilibrium constraint333The ζ=1𝜁1\zeta=1italic_ζ = 1 constraint is present in (D) for all k1𝑘1k\geq 1italic_k ≥ 1. when ζ=1𝜁1\zeta=1italic_ζ = 1. By rearranging terms and observing the bounded terms from the previous argument, we observe Lw(1)o[n]oθ(0,0,o)𝐿𝑤1subscript𝑜delimited-[]𝑛𝑜𝜃00𝑜L\geq w(1)\sum_{o\in[n]}o\theta(0,0,o)italic_L ≥ italic_w ( 1 ) ∑ start_POSTSUBSCRIPT italic_o ∈ [ italic_n ] end_POSTSUBSCRIPT italic_o italic_θ ( 0 , 0 , italic_o ), where L𝐿Litalic_L is a bounded value. Because w(1)>0𝑤10w(1)>0italic_w ( 1 ) > 0, the remaining decision variables are also bounded, and thus the feasible set is finite.

Now, we find the dual program to (D). Because (D) is a linear program, we can rewrite it in the more concise form

maxθ||bθ𝜃superscriptmaxsuperscript𝑏top𝜃\displaystyle\underset{\displaystyle\scriptstyle\theta\in\mathbb{R}^{|\mathcal% {I}|}}{\mathrm{max}}\quad b^{\top}\theta\hfil\hfil\hfil\hfilstart_UNDERACCENT italic_θ ∈ blackboard_R start_POSTSUPERSCRIPT | caligraphic_I | end_POSTSUPERSCRIPT end_UNDERACCENT start_ARG roman_max end_ARG italic_b start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_θ
s.t.formulae-sequencest\displaystyle\mathmakebox[width("$\underset{\displaystyle\phantom{\scriptstyle% \theta\in\mathbb{R}^{|\mathcal{I}|}}}{\mathrm{max}}$")][c]{\mathmakebox[width(% "$\mathrm{max}$")][l]{\mathrm{\kern 1.00006pts.t.}}}\quadroman_s . roman_t . cζθsuperscriptsubscript𝑐𝜁top𝜃\displaystyle c_{\zeta}^{\top}\thetaitalic_c start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_θ 0,ζ[k]formulae-sequenceabsent0for-all𝜁delimited-[]𝑘\displaystyle\geq 0,\leavevmode\nobreak\ \forall\zeta\in[k]≥ 0 , ∀ italic_ζ ∈ [ italic_k ] (νζ)subscript𝜈𝜁\displaystyle\quad\quad(\nu_{\zeta})( italic_ν start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT )
dθ1superscript𝑑top𝜃1\displaystyle d^{\top}\theta-1italic_d start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_θ - 1 =0absent0\displaystyle=0= 0 (ρ)𝜌\displaystyle\quad\quad(\rho)( italic_ρ )
θ𝜃\displaystyle\thetaitalic_θ 0absent0\displaystyle\geq 0≥ 0 (ϕ)italic-ϕ\displaystyle\quad\quad(\phi)( italic_ϕ )

where ν0𝜈0\nu\geq 0italic_ν ≥ 0, ρ𝜌\rhoitalic_ρ, and ϕ0italic-ϕ0\phi\geq 0italic_ϕ ≥ 0 are the associated dual variables. The Lagrangian function is defined as (θ,ν,ρ,ϕ)=bθ+(ζ[k]νζcζθ)ρ(dθ1)+ϕθ𝜃𝜈𝜌italic-ϕsuperscript𝑏top𝜃subscript𝜁delimited-[]𝑘subscript𝜈𝜁superscriptsubscript𝑐𝜁top𝜃𝜌superscript𝑑top𝜃1superscriptitalic-ϕtop𝜃\mathcal{L}(\theta,\nu,\rho,\phi)=b^{\top}\theta+(\sum_{\zeta\in[k]}\nu_{\zeta% }c_{\zeta}^{\top}\theta)-\rho(d^{\top}\theta-1)+\phi^{\top}\thetacaligraphic_L ( italic_θ , italic_ν , italic_ρ , italic_ϕ ) = italic_b start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_θ + ( ∑ start_POSTSUBSCRIPT italic_ζ ∈ [ italic_k ] end_POSTSUBSCRIPT italic_ν start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_θ ) - italic_ρ ( italic_d start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_θ - 1 ) + italic_ϕ start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_θ. Let g(ν,ρ,ϕ)=supθ||(θ,ν,ρ,ϕ)𝑔𝜈𝜌italic-ϕsubscriptsupremum𝜃superscript𝜃𝜈𝜌italic-ϕg(\nu,\rho,\phi)=\sup_{\theta\in\mathbb{R}^{|\mathcal{I}|}}\mathcal{L}(\theta,% \nu,\rho,\phi)italic_g ( italic_ν , italic_ρ , italic_ϕ ) = roman_sup start_POSTSUBSCRIPT italic_θ ∈ blackboard_R start_POSTSUPERSCRIPT | caligraphic_I | end_POSTSUPERSCRIPT end_POSTSUBSCRIPT caligraphic_L ( italic_θ , italic_ν , italic_ρ , italic_ϕ ) serve as an upper bound to (D). The dual program is derived by minimizing g(ν,ρ,ϕ)𝑔𝜈𝜌italic-ϕg(\nu,\rho,\phi)italic_g ( italic_ν , italic_ρ , italic_ϕ ); note that this value is only unbounded above unless b+ζ[k]νζcζρd+ϕ=0superscript𝑏topsubscript𝜁delimited-[]𝑘subscript𝜈𝜁superscriptsubscript𝑐𝜁top𝜌superscript𝑑topsuperscriptitalic-ϕtop0b^{\top}+\sum_{\zeta\in[k]}\nu_{\zeta}c_{\zeta}^{\top}-\rho d^{\top}+\phi^{% \top}=0italic_b start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_ζ ∈ [ italic_k ] end_POSTSUBSCRIPT italic_ν start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT - italic_ρ italic_d start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT + italic_ϕ start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT = 0. Substituting this into the objective and removing the free variable ϕitalic-ϕ\phiitalic_ϕ so that the equality constraint becomes an inequality, the dual problem becomes

minρ,{νζ0}ζ[k]ρ𝜌subscriptsubscript𝜈𝜁subscriptabsent0𝜁delimited-[]𝑘min𝜌\displaystyle\underset{\displaystyle\scriptstyle\rho,\{\nu_{\zeta}\in\mathbb{R% }_{\geq 0}\}_{\zeta\in[k]}}{\mathrm{min}}\quad\rho\hfil\hfil\hfil\hfilstart_UNDERACCENT italic_ρ , { italic_ν start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_ζ ∈ [ italic_k ] end_POSTSUBSCRIPT end_UNDERACCENT start_ARG roman_min end_ARG italic_ρ (P1)
s.t.formulae-sequencest\displaystyle\mathmakebox[width("$\underset{\displaystyle\phantom{\scriptstyle% \rho,\{\nu_{\zeta}\in\mathbb{R}_{\geq 0}\}_{\zeta\in[k]}}}{\mathrm{min}}$")][c% ]{\mathmakebox[width("$\mathrm{min}$")][l]{\mathrm{\kern 1.00006pts.t.}}}\quadroman_s . roman_t . bρd+ζ[k]νζcζsuperscript𝑏top𝜌superscript𝑑topsubscript𝜁delimited-[]𝑘subscript𝜈𝜁subscript𝑐𝜁\displaystyle b^{\top}-\rho d^{\top}+\sum_{\zeta\in[k]}\nu_{\zeta}c_{\zeta}italic_b start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT - italic_ρ italic_d start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_ζ ∈ [ italic_k ] end_POSTSUBSCRIPT italic_ν start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT 0absent0\displaystyle\leq 0\hfil\hfil≤ 0

From strong duality, (P1) provides the same value as (D). Expanding terms show that (P1) is equivalent to (P[k]delimited-[]𝑘[k][ italic_k ]). ∎

Proof of Proposition 5.3: The proof is straightforward and simply requires generalizing the constraint set of (P[k]delimited-[]𝑘[k][ italic_k ]). Consider taking the same steps as the proof of Theorem 3.3 but with the equilibrium constraint defined by the utility rule u𝑢uitalic_u. This will result in the same linear program as in (P[k]delimited-[]𝑘[k][ italic_k ]), but now with the constraint set

0w(o+x)ρw(e+x)+ζ[k]νζ((nζ)u(e+x)0αe0βoα+βζ(eα)(oβ)(neoζαβ)u(e+x+βα))(e,x,o).0𝑤𝑜𝑥𝜌𝑤𝑒𝑥subscript𝜁delimited-[]𝑘subscript𝜈𝜁binomial𝑛𝜁𝑢𝑒𝑥subscript0𝛼𝑒0𝛽𝑜𝛼𝛽𝜁binomial𝑒𝛼binomial𝑜𝛽binomial𝑛𝑒𝑜𝜁𝛼𝛽𝑢𝑒𝑥𝛽𝛼for-all𝑒𝑥𝑜0\geq w(o+x)-\rho w(e+x)+\\ \sum_{\zeta\in[k]}\nu_{\zeta}\left(\binom{n}{\zeta}u(e+x)-\sum_{\begin{% subarray}{c}0\leq\alpha\leq e\\ 0\leq\beta\leq o\\ \alpha+\beta\leq\zeta\end{subarray}}{e\choose\alpha}{o\choose\beta}\binom{n-e-% o}{\zeta-\alpha-\beta}u(e+x+\beta-\alpha)\right)\\ \forall(e,x,o)\in\mathcal{I}.start_ROW start_CELL 0 ≥ italic_w ( italic_o + italic_x ) - italic_ρ italic_w ( italic_e + italic_x ) + end_CELL end_ROW start_ROW start_CELL ∑ start_POSTSUBSCRIPT italic_ζ ∈ [ italic_k ] end_POSTSUBSCRIPT italic_ν start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ( ( FRACOP start_ARG italic_n end_ARG start_ARG italic_ζ end_ARG ) italic_u ( italic_e + italic_x ) - ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL 0 ≤ italic_α ≤ italic_e end_CELL end_ROW start_ROW start_CELL 0 ≤ italic_β ≤ italic_o end_CELL end_ROW start_ROW start_CELL italic_α + italic_β ≤ italic_ζ end_CELL end_ROW end_ARG end_POSTSUBSCRIPT ( binomial start_ARG italic_e end_ARG start_ARG italic_α end_ARG ) ( binomial start_ARG italic_o end_ARG start_ARG italic_β end_ARG ) ( FRACOP start_ARG italic_n - italic_e - italic_o end_ARG start_ARG italic_ζ - italic_α - italic_β end_ARG ) italic_u ( italic_e + italic_x + italic_β - italic_α ) ) end_CELL end_ROW start_ROW start_CELL ∀ ( italic_e , italic_x , italic_o ) ∈ caligraphic_I . end_CELL end_ROW (28)

At this point, the new linear program will provide tight bounds on a specified utility rule u𝑢uitalic_u.

Finally, we substitute the new decision variable uζ0nsubscript𝑢𝜁subscriptsuperscript𝑛absent0u_{\zeta}\in\mathbb{R}^{n}_{\geq 0}italic_u start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT into each occurrence of νζusubscript𝜈𝜁𝑢\nu_{\zeta}uitalic_ν start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT italic_u. This enlarges the feasible set, which now subsumes all the feasible points that would evaluate a utility rule u𝑢uitalic_u by satisfying u=uζ𝑢subscript𝑢𝜁u=u_{\zeta}italic_u = italic_u start_POSTSUBSCRIPT italic_ζ end_POSTSUBSCRIPT for all ζ[k]𝜁delimited-[]𝑘\zeta\in[k]italic_ζ ∈ [ italic_k ]. As we do not enforce this constraint, the value of the final program (Q[k]delimited-[]𝑘[k][ italic_k ]) provides a lower bound on the original program, or its reciprocal provides an upper bound on the k𝑘kitalic_k-strong price of anarchy under the optimal utility design. ∎