License: arXiv.org perpetual non-exclusive license
arXiv:2312.11391v1 [cs.AI] 18 Dec 2023

FedCompetitors: Harmonious Collaboration in Federated Learning with Competing Participants

Shanli Tan1*, Hao Cheng2*, Xiaohu Wu1*★, Han Yu3*, Tiantian He4★, Yew-Soon Ong3,4,
Chongjun Wang2, Xiaofeng Tao1
* Equal contribution
Abstract

Federated learning (FL) provides a privacy-preserving approach for collaborative training of machine learning models. Given the potential data heterogeneity, it is crucial to select appropriate collaborators for each FL participant (FL-PT) based on data complementarity. Recent studies have addressed this challenge. Similarly, it is imperative to consider the inter-individual relationships among FL-PTs where some FL-PTs engage in competition. Although FL literature has acknowledged the significance of this scenario, practical methods for establishing FL ecosystems remain largely unexplored. In this paper, we extend a principle from the balance theory, namely “the friend of my enemy is my enemy”, to ensure the absence of conflicting interests within an FL ecosystem. The extended principle and the resulting problem are formulated via graph theory and integer linear programming. A polynomial-time algorithm is proposed to determine the collaborators of each FL-PT. The solution guarantees high scalability, allowing even competing FL-PTs to smoothly join the ecosystem without conflict of interest. The proposed framework jointly considers competition and data heterogeneity. Extensive experiments on real-world and synthetic data demonstrate its efficacy compared to five alternative approaches, and its ability to establish efficient collaboration networks among FL-PTs.

Introduction

Federated Learning (FL) represents a paradigm within distributed machine learning (ML) that facilitates the collaborative training of ML models by leveraging data from multiple parties while upholding privacy considerations (Yang et al. 2019). Each participant in FL (referred to as FL-PT) acts as a custodian of data and directly employs its dataset to locally train a model. In the well-established Federated Averaging (FedAvg) framework (McMahan et al. 2017), a central server (CS) periodically gathers model updates from individual FL-PTs, which are then aggregated to refine a global model. Similarly, each FL-PT regularly acquires the latest global model from the CS and further enhances it through local training. This iterative interplay between the CS and FL-PTs persists until the global model achieves convergence. FL has demonstrated significant promise across diverse domains, including healthcare, digital banking, ridesharing, recommender systems, and drug discovery (Sheller et al. 2020; Long et al. 2020; Yang et al. 2020; Wang et al. 2022; Oldenhof et al. 2023; Sun et al. 2023).

For example, consider a clinical research network of multiple hospitals (Fleurence et al. 2014). These hospitals possess the capacity to collaboratively construct ML models. In an optimal setting, the global model derived from FL should outperform models crafted by individual FL-PTs. However, a potential complication arises from the non-independent and non-identically distributed (Non-IID) nature of data across these FL-PTs (Zhu et al. 2021). Each FL-PT undertakes local model training, which might lead it to a distinct local optimum that diverges from the global optimum. Consequently, the model performance of an FL-PT might degrade due to the FL process (Wang et al. 2019). The diversity in data characteristics among FL-PTs can be represented by a directed benefit graph denoted as $\mathcal{G}_b$ (Cui et al. 2022). In this graph, an edge from FL-PT $v_i$ to $v_j$ signifies that the data from $v_i$ can potentially enhance the learning outcomes of $v_j$ through the FL process.

Besides data heterogeneity, another important factor is the relationships among FL-PTs. For instance, hospitals located in different cities serve distinct populations. As depicted in Figure 1, the hospital in city $C$ solely focuses on improving its own ML model, and its utility is independent of any FL-PT in other cities. Two such FL-PTs are considered “independent”, where the shared global model in FL functions as a public good, similar to a radio signal where each individual only values the received signal quality (Tang and Wong 2021). In contrast, hospitals within the same city (e.g., city $B$), which can include both public and private hospitals, serve the same population. Competition then arises, where the utility of an FL-PT also depends on the model performance of its competitor (Brekke, Siciliani, and Straume 2011). Such FL-PTs are considered “competitive”. The inter-individual relationship between any two FL-PTs can be represented by an undirected graph $\mathcal{G}_c$.

Figure 1: Illustration of the Relationships among Hospitals: the black line denotes the competing relationship between two hospitals.

In the presence of both data heterogeneity and competition, selecting suitable collaborators for each FL-PT is a crucial challenge. Recently, Cui et al. (2022) consider the data heterogeneity case (i.e., the edge set of $\mathcal{G}_b$ is non-empty and the edge set of $\mathcal{G}_c$ is empty) and leverage the concept of core-stable coalition from cooperative games to address it. All FL-PTs are partitioned into disjoint groups/coalitions. Let $\pi(i)$ denote the coalition to which $v_i$ belongs, where $\pi$ is called a coalition structure, and $v_i$’s utility depends on the FL-PTs in $\pi(i)$. For a core-stable coalition structure $\pi$, there is no other coalition $\mathcal{C}$ such that every FL-PT $v_i$ in $\mathcal{C}$ prefers $\mathcal{C}$ over $\pi(i)$ (Aziz and Savani 2016). Nevertheless, no existing work addresses competition among a subset of FL-PTs when establishing collaborations in FL ecosystems.

In this paper, we propose the FedCompetitors approach to bridge this gap. It is general in the sense that (i) the edge set of $\mathcal{G}_c$ may be empty or non-empty, excluding only the case where $\mathcal{G}_c$ is a complete graph, and (ii) the edge set of $\mathcal{G}_b$ is non-empty. The presence of competing FL-PTs has been recognized as an important aspect in the FL literature (Kairouz et al. 2021; Zhan et al. 2022; Shi, Yu, and Leung 2023). In balance theory, the principle that “the friend of my enemy is my enemy” can avoid conflict of interest (Leskovec, Huttenlocher, and Kleinberg 2010; Cartwright and Harary 1956). We apply an extended version of it to establish collaboration among FL-PTs. Specifically, suppose $v_i$ and $v_k$ compete, and $v_j$ is the friend of $v_i$ (i.e., $v_i$ benefits from the data of $v_j$ in FL training). The FL-PT $v_i$, its friend $v_j$, and other FL-PTs who benefit $v_i$ and $v_j$ are in an alliance. Then, the CS regulates that $v_k$ will not make a contribution to any FL-PT in the alliance, which ensures that no FL-PT directly or indirectly assists its competitors. If two FL-PTs can collaborate, they are independent of each other. In a group of independent FL-PTs, an FL-PT can freely collaborate with other FL-PTs in the group, thereby maximizing the social welfare of the entire FL ecosystem.

The extended principle and the resulting problem above can be formulated via graph theory and integer linear programming. We further propose a polynomial-time algorithm to determine the collaborators of each FL-PT. Using the proposed solution, even competing FL-PTs can seamlessly join without conflict of interest; the FL ecosystem thus exhibits a high level of scalability and is trusted by FL-PTs with conflicting interests (Tariq et al. 2023; Yu et al. 2014). Extensive experiments on both synthetic and real-world datasets demonstrate the effectiveness of FedCompetitors over the state of the art.

Related Work

We focus on the context of cross-silo FL, where FL-PTs are typically companies or organizations that both contribute their data and utilize the trained ML models. In the existing research, two scenarios have been extensively investigated: (i) any two FL-PTs in the FL ecosystem are independent of each other and an FL-PT solely focuses on improving its own model performance, without considering the potential competition, and (ii) any two FL-PTs in the FL ecosystem compete against each other, i.e., $\mathcal{G}_c$ is a complete graph. In this paper, we mainly consider the scenario where competition exists among a subset of FL-PTs, and an FL-PT will not collaborate with its competitors or with other FL-PTs with which it has a potential conflict of interest.

Firstly, in the independent scenario, prior studies focus on alleviating the side effects of data heterogeneity. Using hedonic games, a type of cooperative game (Aziz and Savani 2016), these studies seek stable coalition structures to establish collaboration among FL-PTs. Donahue and Kleinberg (2021) provide an analytical understanding of which partitions of FL-PTs lead to a stable coalition structure for mean estimation and linear regression. Chaudhury et al. (2022) treat all FL-PTs as a grand coalition and optimize a common model for all FL-PTs, which is considered core-stable if there is no other coalition $\mathcal{S}$ of FL-PTs that could significantly benefit by training a model with only their data. Another line of work learns personalized models for FL-PTs as follows (Tan et al. 2022): (i) use the CS to train a global model, and (ii) adapt the model to the local data of FL-PTs. Several approaches, such as meta-learning and multi-task learning, have been employed for personalization (Fallah, Mokhtari, and Ozdaglar 2020; Smith et al. 2017). Ding and Wang (2022) study the case where the FL ecosystem expands to have numerous independent FL-PTs. A group of FL-PTs that have similar contributors forms a group of collaboration partners. The authors propose to partition all FL-PTs into $K$ groups and adaptively learn a small number $K$ of models for $n$ FL-PTs, where $1 \ll K \ll n$.

Secondly, in the competition scenario, all FL-PTs are assumed to offer the same service in a given market. Wu and Yu (2022) aim to maintain a negligible change in market share after FL-PTs join the FL ecosystem (Farris et al. 2010; Wu, De Pellegrini, and Casale 2023), and analyze the achievability of this objective. Afterwards, two other works study the profitability of FL-PTs in the given market after FL-PTs join the FL ecosystem, but under different assumptions on the source of the extra profit brought by FL. Specifically, Tsoy and Konstantinov (2023) use the following assumptions: (i) each consumer has a fixed budget that is allocated to multiple services from different markets, and (ii) if an FL-PT has a higher model quality, its service quality is higher and the consumer will allocate more of its budget to consume the service. Huang, Ke, and Liu (2023) consider duopoly business competition between two FL-PTs and assume that, if the model-related service can be improved by FL, customers will be willing to pay more and FL-PTs thus have opportunities to increase their profits.

Model and Assumptions

We use graph theory to describe our model of interest and mathematically formulate the extended principle. Specifically, let us consider a set of $n$ FL-PTs denoted by $\mathcal{V} = \{v_1, v_2, \cdots, v_n\}$. Each FL-PT $v_i$ possesses a local dataset $\mathcal{D}_i$. The FL-PTs contemplate joining a collaborative FL network, facilitated by the CS. However, challenges such as data heterogeneity and competition arise among the FL-PTs. To characterize the various relationships among the FL-PTs, three graphs are employed.

Competing graph $\mathcal{G}_c$. An undirected graph $\mathcal{G}_c = (\mathcal{V}, E_c)$ is used to represent the competing relations between any two FL-PTs, where $\mathcal{V}$ is the set of nodes/FL-PTs and $E_c$ is the set of edges. An edge $(v_i, v_j) \in E_c$ signifies a competitive relationship between FL-PTs $v_i$ and $v_j$. The adjacency matrix of $\mathcal{G}_c$ is denoted as $S_{n \times n}$: its main diagonal elements are set to zero, i.e., $s_{i,i} = 0$; when $i \neq j$, $s_{i,j} = 1$ if $v_i$ competes with $v_j$, and $s_{i,j} = 0$ if $v_i$ is independent of $v_j$. Each FL-PT $v_i$ will report its competitors to the CS, as it hopes that the CS will correctly utilize this information to prevent its competitors from benefiting from its data. Thus, the CS has knowledge of $\mathcal{G}_c$.
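
As an illustration of the definition above, the matrix $S$ could be assembled from the reported competitor lists roughly as follows. This is a minimal sketch rather than the authors' implementation: the function name `build_competing_matrix`, the 0-based indexing, the dictionary input format, and the choice to treat a one-sided report as a symmetric edge are all assumptions made here.

```python
def build_competing_matrix(n, reported_competitors):
    """Build the symmetric 0/1 adjacency matrix S of the competing graph.

    reported_competitors maps an FL-PT index i to the indices that v_i
    reports as its competitors (hypothetical input format, 0-based).
    """
    S = [[0] * n for _ in range(n)]
    for i, competitors in reported_competitors.items():
        for j in competitors:
            if i != j:           # the main diagonal stays zero
                S[i][j] = 1
                S[j][i] = 1      # assumption: competition is mutual (undirected edge)
    return S


# Toy example with four FL-PTs: v_0 competes with v_1, and v_2 with v_3.
# Only v_2 reports the second pair; the edge is still stored symmetrically.
reports = {0: [1], 1: [0], 2: [3]}
S = build_competing_matrix(4, reports)
```

Storing $S$ as a dense matrix mirrors the notation $s_{i,j}$ used in the text; a set of unordered pairs would serve equally well for sparse competition.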

Benefit graph $\mathcal{G}_b$. A benefit graph is employed to depict the impact of sample distribution discrepancies among the $n$ FL-PTs. For any two FL-PTs $v_i$ and $v_j$, if $w_{j,i} = 0$, it indicates that $v_i$ cannot benefit from the data of $v_j$. Conversely, if $w_{j,i} > 0$, it implies that $v_i$ can benefit from $v_j$’s data, with larger values of $w_{j,i}$ signifying greater benefit to $v_i$. These values $w_{j,i}$ define a directed graph denoted as $\mathcal{G}_b = (\mathcal{V}, E_b)$, referred to as the benefit graph: $(v_j, v_i) \in E_b$ if and only if $i \neq j$ and $w_{j,i} > 0$. The adjacency matrix of $\mathcal{G}_b$ is denoted as $W_{n \times n}$, where the $i$-th column comprises the weights $w_{1,i}, w_{2,i}, \cdots, w_{n,i}$, representing the importance of the $n$ FL-PTs to $v_i$. The level of potential (LoP) of an FL-PT $v_i$ contributing to the other FL-PTs $\mathcal{V} - \{v_i\}$ is defined as

$$w_i = \sum\nolimits_{j \neq i} w_{i,j}, \tag{1}$$

which measures the importance of $v_i$ to the FL ecosystem. The graph $\mathcal{G}_b$ can be obtained by the hypernetwork technique in (Cui et al. 2022; Navon et al. 2021).
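
To make Eq. (1) concrete, the following sketch computes the LoP of every FL-PT as the off-diagonal sum of the corresponding row of $W$. The helper name `level_of_potential` and the toy weights are hypothetical and chosen only for illustration; they are not values from the paper.

```python
def level_of_potential(W):
    """Return [w_0, ..., w_{n-1}], where w_i is the sum of W[i][j] over j != i.

    W[i][j] holds w_{i,j}: the benefit that v_i's data brings to v_j.
    """
    n = len(W)
    return [sum(W[i][j] for j in range(n) if j != i) for i in range(n)]


# Toy benefit weights for three FL-PTs (hypothetical values, 0-based indices).
W = [
    [0.0, 0.5, 0.2],   # v_0 benefits v_1 and v_2
    [0.0, 0.0, 0.7],   # v_1 benefits v_2 only
    [0.3, 0.0, 0.0],   # v_2 benefits v_0 only
]
lop = level_of_potential(W)
```

In this toy matrix the first two FL-PTs have an LoP of about 0.7 and the third about 0.3. Note that the LoP is a row sum (what $v_i$ gives), whereas the $i$-th column of $W$ describes what $v_i$ receives.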

Data usage graph $\mathcal{G}_u$. Although $v_i$ may benefit from $v_j$’s data ($w_{j,i} > 0$), the CS has the authority to determine whether $v_i$ can actually utilize $v_j$’s local model update information (i.e., indirectly use $v_j$’s data) in the FL training process. Let $X = (x_{j,i})$ be an $n \times n$ matrix where

$$x_{j,i} \in \{0, 1\} \tag{2}$$

is a decision variable: for two different FL-PTs $v_i$ and $v_j$, $x_{j,i}$ is set to one if $v_j$ will contribute to $v_i$ (i.e., $v_i$ will utilize $v_j$’s local model update information) in the FL training process, and $x_{j,i}$ is set to zero otherwise. $X$ defines a directed graph $\mathcal{G}_u = (\mathcal{V}, E_u)$, called the data usage graph: $(v_j, v_i) \in E_u$ if and only if $j \neq i$ and $x_{j,i} = 1$; then, $v_j$ is said to be a collaborator or friend of $v_i$. Consider any pair of FL-PTs $v_i$ and $v_j$. If $v_j$’s data cannot benefit $v_i$ ($w_{j,i} = 0$), we set $x_{j,i} = 0$. Only when $v_j$’s data can benefit $v_i$ is there a possibility that $x_{j,i} = 1$. Consequently, $E_u$ is a subset of $E_b$, leading directly to the following conclusion.

Lemma 1.

For any two nodes $v_j$ and $v_i$, if there is no path from $v_j$ to $v_i$ in the benefit graph $\mathcal{G}_b$, then there is also no such path in the data usage graph $\mathcal{G}_u$.
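
Lemma 1 can be checked numerically on a small instance. The sketch below first forces $x_{j,i} = 0$ wherever $w_{j,i} = 0$, so that $E_u \subseteq E_b$, and then verifies that every node reachable from $v_j$ in $\mathcal{G}_u$ is also reachable from $v_j$ in $\mathcal{G}_b$, which is an equivalent way of stating the lemma. All function names and the toy matrices are assumptions introduced for this illustration.

```python
from collections import deque


def reachable_from(adj, src):
    """Nodes reachable from src via a directed path (BFS), excluding src itself."""
    seen = {src}
    queue = deque([src])
    while queue:
        u = queue.popleft()
        for v, has_edge in enumerate(adj[u]):
            if has_edge and v not in seen:
                seen.add(v)
                queue.append(v)
    seen.discard(src)
    return seen


def benefit_edges(W):
    """Boolean adjacency of G_b: edge (v_j, v_i) iff j != i and w_{j,i} > 0."""
    n = len(W)
    return [[j != i and W[j][i] > 0 for i in range(n)] for j in range(n)]


def mask_usage(X, W):
    """Boolean adjacency of G_u, with x_{j,i} forced to 0 where w_{j,i} = 0."""
    n = len(W)
    return [[bool(X[j][i]) and j != i and W[j][i] > 0 for i in range(n)]
            for j in range(n)]


# Hypothetical weights: G_b has edges v_0 -> v_1, v_1 -> v_2, and v_3 -> v_0.
W = [
    [0.0, 0.4, 0.0, 0.0],
    [0.0, 0.0, 0.6, 0.0],
    [0.0, 0.0, 0.0, 0.0],
    [0.2, 0.0, 0.0, 0.0],
]
# Proposed usage: keep v_0 -> v_1 and v_3 -> v_0, drop v_1 -> v_2,
# and (invalidly) request v_0 -> v_2, which the mask removes.
X_proposed = [
    [0, 1, 1, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
    [1, 0, 0, 0],
]
Gb = benefit_edges(W)
Gu = mask_usage(X_proposed, W)
lemma_holds = all(reachable_from(Gu, j) <= reachable_from(Gb, j)
                  for j in range(len(W)))
```

Because $\mathcal{G}_u$ is obtained from $\mathcal{G}_b$ only by deleting edges, the subset test cannot fail after masking; the check is useful mainly as a sanity test for code that constructs $X$.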

Principle for avoiding conflict of interest

Below, we extend the principle that “the friend of my enemy is my enemy”.

Assumption 1.

For any two competing FL-PTs $v_i$ and $v_j$ (i.e., $(v_i, v_j) \in E_c$), $v_j$ cannot reach $v_i$ in the data usage graph $\mathcal{G}_u$, i.e., $\mathcal{G}_u$ contains no path from $v_j$ to $v_i$.

Figure 2: Illustration of Assumption 1: $v_j$ can reach $v_i$ in $\mathcal{G}_b$, while $v_i$ and $v_j$ compete against each other.

Assumption 1 is implemented while establishing the collaboration relationships among FL-PTs. Suppose there is a path from $v_j$ to $v_i$ in the benefit graph $\mathcal{G}_b$ whose length is $p_{i,j}$. We use Figure 2 to explain the implication of Assumption 1. If $p_{i,j} = 1$, it posits that an FL-PT refuses to contribute to its competitor. If $p_{i,j} = 2$, we use $v_k$ to denote the intermediate node between $v_j$ and $v_i$. If $v_i$ benefits from $v_k$, then $v_k$ is $v_i$’s friend; $v_j$ is not willing to see the enhancement of $v_i$’s model and will threaten not to contribute to $v_k$. Assumption 1 posits that, if $(v_k, v_i) \in E_u$, then $(v_j, v_k) \notin E_u$, i.e., $v_j$ does not help the friend $v_k$ of its enemy $v_i$. Generally, for any $p_{i,j}$, the path from $v_j$ to $v_i$ in $\mathcal{G}_b$ is denoted as

$$P_j^i = (v_{j_0}, v_{j_1}, \cdots, v_{j_{p_{i,j}}}), \tag{3}$$

where $j_0 = j$ and $j_{p_{i,j}} = i$. If such an integer exists, let $t$ be the minimum integer in $[1, p_{i,j} - 1]$ such that $(v_{j_l}, v_{j_{l+1}}) \in E_u$ for every $l \in [t, p_{i,j} - 1]$, i.e., $v_{j_l}$ helps $v_{j_{l+1}}$. Then, FL-PTs $v_{j_t}, v_{j_{t+1}}, \cdots, v_{j_{p_{i,j}}}$ are said to be in an alliance, and $v_j$ will not help any member of this alliance. Assumption 1 follows a common logic in reality that nobody wants to see others help its enemy and its enemy’s friends. By applying Assumption 1, it is strictly guaranteed that no FL-PT contributes to its competitors, directly or indirectly.
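
Whether a given data usage matrix $X$ satisfies Assumption 1 can be tested directly from its definition: for every competing pair, no directed path may connect one FL-PT to the other in $\mathcal{G}_u$. The sketch below is one straightforward way to do this with a breadth-first search per pair; it is not the polynomial-time algorithm proposed in this paper, and the names `can_reach` and `violations` as well as the three-node example are hypothetical.

```python
from collections import deque


def can_reach(adj, src, dst):
    """True if a directed path from src to dst exists in the 0/1 matrix adj."""
    seen = {src}
    queue = deque([src])
    while queue:
        u = queue.popleft()
        for v, has_edge in enumerate(adj[u]):
            if has_edge and v not in seen:
                if v == dst:
                    return True
                seen.add(v)
                queue.append(v)
    return False


def violations(S, X):
    """Ordered competing pairs (j, i) for which v_j reaches v_i in G_u."""
    n = len(S)
    return [(j, i) for j in range(n) for i in range(n)
            if S[j][i] == 1 and can_reach(X, j, i)]


# Case p_{i,j} = 2 with 0-based indices: v_0 plays v_j, v_1 plays v_k, v_2 plays v_i.
S = [[0, 0, 1],
     [0, 0, 0],
     [1, 0, 0]]          # v_0 and v_2 compete
X_bad = [[0, 1, 0],
         [0, 0, 1],
         [0, 0, 0]]      # v_0 -> v_1 -> v_2: v_0 indirectly helps its competitor
X_ok = [[0, 0, 0],
        [0, 0, 1],
        [0, 0, 0]]       # v_0 no longer helps v_1, the friend of v_2
bad_pairs = violations(S, X_bad)
ok_pairs = violations(S, X_ok)
```

Here `X_ok` corresponds to the rule stated above: once $(v_k, v_i) \in E_u$, the edge $(v_j, v_k)$ is removed, and the alliance $\{v_k, v_i\}$ receives nothing from $v_j$.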

(a) $v_i$ is reachable to the red node in the oval, which is also the competitor of the blue nodes.
(b) $v_j$ is reachable from the red node in the oval, which is also the competitor of the golden nodes.
Figure 3: Effect on Assumption 1 after adding an edge $(v_j,v_i)$ in the data usage graph $\mathcal{G}_u$.

For any competing FL-PTs $v_i$ and $v_j$, let $\mathcal{P}_{j,i}$ denote the set of all reachable paths from $v_j$ to $v_i$ in the graph $\mathcal{G}_b$. Assumption 1 can be characterized by $\mathcal{G}_c$, $\mathcal{G}_b$, and $\mathcal{G}_u$.

Proposition 1.

Assumption 1 holds if and only if the following condition is satisfied:

$$x_{j,j_1}+x_{j_1,j_2}+\cdots+x_{j_{p_{i,j}},i}\leqslant p_{i,j}-1,\qquad \forall (v_i,v_j)\in E_c,\ \forall P_j^i\in\mathcal{P}_{j,i}. \tag{4}$$
Proof.

Firstly, we prove the reverse direction. By Lemma 1, to satisfy Assumption 1, we only need to focus on such $v_j$ and $v_i$ that are reachable in $\mathcal{G}_b$. $P_j^i$ is defined in Eq. (3). If Eq. (4) holds, then, for any $P_j^i\in\mathcal{P}_{j,i}$ there exist two adjacent nodes $v_{j_l}$ and $v_{j_{l+1}}$ in $P_j^i$, where $l\in[0,p_{i,j}-1]$, such that $x_{j_l,j_{l+1}}=0$ and $(v_{j_l},v_{j_{l+1}})\notin E_u$. Thus, there are no reachable paths from $v_j$ to $v_i$ in $\mathcal{G}_u$ and Assumption 1 is satisfied. Secondly, we prove the forward direction by contradiction. The length of $P_j^i$ is $p_{i,j}$. If Eq. (4) does not hold for some $P_j^i\in\mathcal{P}_{j,i}$, then the sum of its $p_{i,j}$ binary terms exceeds $p_{i,j}-1$, so for every $l\in[0,p_{i,j}-1]$, $x_{j_l,j_{l+1}}=1$ and there exists an edge from $v_{j_l}$ to $v_{j_{l+1}}$ in the graph $\mathcal{G}_u$, which contradicts Assumption 1 where $v_j$ is not reachable to $v_i$ in $\mathcal{G}_u$. ∎
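
Proposition 1 reduces Assumption 1 to a reachability condition: no competing pair may be joined by a directed path of helping edges in $\mathcal{G}_u$. The following Python sketch illustrates such a check and is not the authors' code. The function names `reachability` and `satisfies_assumption_1` are hypothetical, the matrices are assumed to be nested lists with `x[j][i] == 1` meaning $v_j$ helps $v_i$ and `s[i][j] == 1` meaning $v_i$ and $v_j$ compete, and the edges of $\mathcal{G}_u$ are assumed to be a subset of those of $\mathcal{G}_b$.

```python
from typing import List


def reachability(x: List[List[int]]) -> List[List[bool]]:
    """Transitive closure of the data usage graph G_u (Warshall's algorithm).

    x[j][i] == 1 is read as "v_j helps v_i", i.e. edge (v_j, v_i) is in E_u.
    The diagonal of x (self-usage) is ignored when seeding the closure.
    """
    n = len(x)
    reach = [[bool(x[a][b]) and a != b for b in range(n)] for a in range(n)]
    for k in range(n):
        for a in range(n):
            if reach[a][k]:
                for b in range(n):
                    if reach[k][b]:
                        reach[a][b] = True
    return reach


def satisfies_assumption_1(x: List[List[int]], s: List[List[int]]) -> bool:
    """Return True if no v_j can reach a competitor v_i in G_u."""
    n = len(x)
    reach = reachability(x)
    return not any(s[i][j] and reach[j][i] for i in range(n) for j in range(n))
```

For three FL-PTs where $v_0$ and $v_2$ compete, the chain $v_0 \to v_1 \to v_2$ fails the check, while keeping only the edge $v_0 \to v_1$ passes it.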

In this paper, we aim to propose a framework that can construct an FL ecosystem without conflict of interest. Mathematically, our problem is to determine the matrix $X_{n\times n}$ of decision variables that satisfy Eq. (2) and (4), which determines the collaborators of FL-PTs. Eq. (4) is equivalent to Assumption 1 by Proposition 1. The absence of conflicting interests among FL-PTs is guaranteed by Eq. (4).

Polynomial-Time Algorithm

We propose a polynomial-time algorithm to determine the matrix $X_{n\times n}$ of decision variables subject to Eq. (2) and (4). We begin by describing the algorithm’s initial states. The LoP $w_i$ in Eq. (1) measures the importance of $v_i$ to the FL ecosystem. We sort the LoPs of all FL-PTs in non-increasing order, and without loss of generality, we assume:

$$w_1\geqslant w_2\geqslant\cdots\geqslant w_n. \tag{5}$$

The initial values of $X_{n\times n}$ are set as follows:

$$x_{j,i}=1 \text{ if } i=j, \text{ and } x_{j,i}=0 \text{ if } i\neq j. \tag{6}$$

This defines the initial $\mathcal{G}_u$, which will be updated as the algorithm runs. We also define a connectivity matrix $C_{n\times n}$ of $\mathcal{G}_u$: when $i\neq j$, $c_{j,i}=1$ if there is a path from $v_j$ to $v_i$ and $c_{j,i}=0$ otherwise; $c_{i,i}$ is always set to one trivially. Initially, $C_{n\times n}$ is set as an identity matrix, i.e., a diagonal matrix whose main diagonal elements are all one.

Data: $S_{n\times n}$ and $W_{n\times n}$
Result: $X_{n\times n}$
1 Initialize $X_{n\times n}$ by Eq. (6) and $C_{n\times n}$ to be an identity matrix;
2 Generate the sorted sequence (i.e., Eq. (5));
3 for $v_i$ in the sorted sequence do
4       Solve the ILP problem (7) by Algorithm 2;
Algorithm 1 Collaborator Selection

The proposed algorithm is presented as Algorithm 1. The $n$ FL-PTs are considered sequentially from $v_1$ to $v_n$ (line 3). At the step for $v_i$ (line 4), the decision variables to be determined are $\{x_{j,i}\}_{j\neq i}$ and we maximize the benefit of $v_i$:

$$\text{maximize}\enskip\sum\nolimits_{j\neq i} w_{j,i}\cdot x_{j,i} \tag{7}$$

subject to Eq. (2) and (4). Afterwards, $X_{n\times n}$ is updated and the collaborators of $v_i$ are determined. Next, we solve the integer linear programming (ILP) problem (7). Let $\mathcal{B}_i$ denote all FL-PTs that can benefit $v_i$ but are independent of $v_i$, which can be defined by the adjacency matrix $W_{n\times n}$ of $\mathcal{G}_b$ and the adjacency matrix $S_{n\times n}$ of $\mathcal{G}_c$:

$$\mathcal{B}_i=\left\{v_j\in\mathcal{V}\,|\,j\neq i,\ w_{j,i}>0,\ s_{j,i}=0\right\}. \tag{8}$$

$\mathcal{B}_i$ includes all possible collaborators of $v_i$.
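
As a small illustration of Eq. (8), the candidate set $\mathcal{B}_i$ can be read directly off column $i$ of the two adjacency matrices. The sketch below assumes 0-based indices and nested-list matrices; the function name `candidate_collaborators` is hypothetical.

```python
from typing import List


def candidate_collaborators(i: int, w: List[List[float]], s: List[List[int]]) -> List[int]:
    """Indices j in B_i (Eq. (8)): j != i, w[j][i] > 0 and s[j][i] == 0.

    w[j][i] is the benefit v_j brings to v_i; s[j][i] == 1 marks competitors.
    """
    return [j for j in range(len(w)) if j != i and w[j][i] > 0 and s[j][i] == 0]
```

The scan touches each of the $n$ entries of one column once, which matches the $\mathcal{O}(n)$ cost stated for finding $\mathcal{B}_i$.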

For any $v_j\in\mathcal{B}_i$, let $\mathcal{V}_j^-$ denote a set consisting of all nodes that are reachable to $v_j$ in $\mathcal{G}_u$, as well as $v_j$ itself, which can be defined by the connectivity matrix $C_{n\times n}$:

$$\mathcal{V}_j^-=\left\{v_k\in\mathcal{V}\,|\,c_{k,j}=1\right\}. \tag{9}$$

Let $\mathcal{S}_j^-$ denote all competitors of the nodes in $\mathcal{V}_j^-$, and $\mathcal{S}_{i,j}^-$ denote the nodes of $\mathcal{S}_j^-$ that are reachable from $v_i$ in $\mathcal{G}_u$:

\begin{align}
\mathcal{S}_j^- &=\left\{v_k\in\mathcal{V}\,|\,\exists v_p\in\mathcal{V}_j^-: s_{k,p}=1\right\}, \tag{10}\\
\mathcal{S}_{i,j}^- &=\left\{v_k\in\mathcal{S}_j^-\,|\,c_{i,k}=1\right\}\subseteq\mathcal{S}_j^-. \tag{11}
\end{align}

As illustrated in Figure 3(a), if $\mathcal{S}_{i,j}^-\neq\emptyset$, we have $x_{j,i}=0$; otherwise, some nodes in $\mathcal{V}_j^-$ will be reachable to its competitor (e.g., the node in the oval) in $\mathcal{G}_u$, which violates Eq. (4). Let $\mathcal{V}_i^+$ denote a set consisting of all nodes that are reachable from $v_i$ in $\mathcal{G}_u$, as well as $v_i$ itself:

$$\mathcal{V}_i^+=\{v_k\in\mathcal{V}\,|\,c_{i,k}=1\}. \tag{12}$$

Let $\mathcal{S}_i^+$ denote all competitors of the nodes in $\mathcal{V}_i^+$, and $\mathcal{S}_{i,j}^+$ denote the nodes of $\mathcal{S}_i^+$ that are reachable to $v_j$ in $\mathcal{G}_u$:

\begin{align}
\mathcal{S}_i^+ &=\{v_k\in\mathcal{V}\,|\,\exists v_p\in\mathcal{V}_i^+: s_{p,k}=1\}, \tag{13}\\
\mathcal{S}_{i,j}^+ &=\{v_k\in\mathcal{S}_i^+\,|\,c_{k,j}=1\}\subseteq\mathcal{S}_i^+. \tag{14}
\end{align}

Here, by Eq. (9), (11), (12), and (14), we have

$$\mathcal{S}_{i,j}^-=\mathcal{V}_i^+\cap\mathcal{S}_j^-\subseteq\mathcal{V}_i^+ \text{ and } \mathcal{S}_{i,j}^+=\mathcal{V}_j^-\cap\mathcal{S}_i^+\subseteq\mathcal{V}_j^-. \tag{15}$$

As illustrated in Figure 3(b), if $\mathcal{S}_{i,j}^+\neq\emptyset$, then $x_{j,i}=0$; otherwise, some nodes in $\mathcal{V}_i^+$ will be reachable from its competitor (e.g., the node in the oval) in $\mathcal{G}_u$, violating Eq. (4).
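
The two emptiness conditions can be sketched with Python sets built from the connectivity matrix $C_{n\times n}$ and the competition matrix $S_{n\times n}$, following Eq. (9) to (15) term by term. This is an illustrative sketch under the assumption of 0-based indices and nested-list matrices; the helper names (`upstream`, `downstream`, `can_add_edge`) are hypothetical and do not come from the paper.

```python
from typing import List, Set


def upstream(j: int, c: List[List[int]]) -> Set[int]:
    """V_j^- (Eq. (9)): nodes k with c[k][j] == 1, including j itself."""
    return {k for k in range(len(c)) if c[k][j] == 1}


def downstream(i: int, c: List[List[int]]) -> Set[int]:
    """V_i^+ (Eq. (12)): nodes k with c[i][k] == 1, including i itself."""
    return {k for k in range(len(c)) if c[i][k] == 1}


def can_add_edge(j: int, i: int, c: List[List[int]], s: List[List[int]]) -> bool:
    """True if both S_{i,j}^- and S_{i,j}^+ are empty, so x[j][i] may be set to 1."""
    n = len(c)
    v_minus = upstream(j, c)
    v_plus = downstream(i, c)
    # S_j^- (Eq. (10)) and S_i^+ (Eq. (13)): competitors of each node set.
    s_minus = {k for k in range(n) if any(s[k][p] == 1 for p in v_minus)}
    s_plus = {k for k in range(n) if any(s[p][k] == 1 for p in v_plus)}
    # Eq. (15): both sets are plain intersections.
    s_minus_ij = v_plus & s_minus
    s_plus_ij = v_minus & s_plus
    return not s_minus_ij and not s_plus_ij
```

In a three-node example where $v_0$ and $v_2$ compete and $v_1$ already helps $v_2$, adding the edge $(v_0,v_1)$ is rejected because $v_0$ would then reach $v_2$, whereas adding $(v_1,v_0)$ is accepted.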

Based on the above understanding, we propose Algorithm 2 to solve the ILP problem (7). For a node $v_j\in\mathcal{B}_i$, $w_{j,i}$ represents the importance of $v_j$ to $v_i$. We sort the nodes of $\mathcal{B}_i$ in non-increasing order of their values $w_{j,i}$ (line 1). The nodes of $\mathcal{B}_i$ are considered sequentially in this order (line 2). For each node $v_j\in\mathcal{B}_i$, if $\mathcal{S}_{i,j}^+=\emptyset$ and $\mathcal{S}_{i,j}^-=\emptyset$, the algorithm sets $v_j$ as the collaborator of $v_i$ (i.e., $x_{j,i}=1$), and the connectivity from $v_j$ to $v_i$ is updated (lines 3-4). Finally, we consider the effect of setting $x_{j,i}=1$ on the connectivity between any two nodes $v_p$ and $v_q$ in the graph $\mathcal{G}_u$, except $(v_j,v_i)$ (line 5). If, before executing line 4, $v_p$ is not reachable to $v_q$ in $\mathcal{G}_u$, $v_p$ is reachable to $v_j$, and $v_i$ is reachable to $v_q$, then $v_p$ becomes reachable to $v_q$ (lines 6-7).
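
The connectivity update described above (lines 4-7 of Algorithm 2) can be sketched as follows. The function name `add_edge_and_update` is hypothetical, indices are 0-based, and both matrices are assumed to be nested lists that are modified in place.

```python
from typing import List


def add_edge_and_update(j: int, i: int, x: List[List[int]], c: List[List[int]]) -> None:
    """Record that v_j helps v_i and propagate the new reachability in c.

    Any v_p that already reaches v_j, combined with any v_q already reached
    from v_i, yields a new path v_p -> ... -> v_j -> v_i -> ... -> v_q.
    """
    n = len(c)
    x[j][i] = 1
    c[j][i] = 1
    for p in range(n):
        for q in range(n):
            if p == q or (p, q) == (j, i):
                continue
            if c[p][q] == 0 and c[p][j] == 1 and c[i][q] == 1:
                c[p][q] = 1
```

Updating `c` in place is safe here because the loop only reads column `j` and row `i`, and the condition `c[p][q] == 0` prevents it from ever writing to an entry it also reads. The double loop costs $\mathcal{O}(n^2)$ per added edge.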

Lemma 2.

Given $W_{n\times n}$, $S_{n\times n}$ and $C_{n\times n}$, the time complexity of finding $\mathcal{B}_i$ is $\mathcal{O}(n)$ while the time complexity of finding $\mathcal{S}_{i,j}^-$ or $\mathcal{S}_{i,j}^+$ is $\mathcal{O}(n^2)$.

Proof.

By Eq. (8), the time complexity of finding $\mathcal{B}_i$ is $\mathcal{O}(n)$ where $|\mathcal{B}_i|\leqslant n$. By Eq. (9), the time complexity of finding $\mathcal{V}_j^-$ is $\mathcal{O}(n)$ where $|\mathcal{V}_j^-|\leqslant n$. By Eq. (10), $\mathcal{S}_j^-$ can be found by (i) checking every $v_k\in\mathcal{V}$ and (ii) judging whether there exists a node $v_p\in\mathcal{V}_j^-$ such that $s_{k,p}=1$; the resulting time complexity is $\mathcal{O}(n^2)$; here, $|\mathcal{S}_j^-|\leqslant n$. Given $\mathcal{S}_j^-$, by Eq. (11), the time complexity of extracting $\mathcal{S}_{i,j}^-$ from it is $\mathcal{O}(n)$. Finally, the overall time complexity of finding $\mathcal{S}_{i,j}^-$ is $\mathcal{O}(n^2)$. Similarly to $\mathcal{S}_{i,j}^-$, the time complexity of finding $\mathcal{S}_{i,j}^+$ is also $\mathcal{O}(n^2)$. ∎

Data: $W_{n\times n}$, $S_{n\times n}$, and $C_{n\times n}$
Result: the updated $X_{n\times n}$ and $C_{n\times n}$
1 Sort the nodes of $\mathcal{B}_i$ in non-increasing order of their values $w_{j,i}$, generating a sorted sequence;
2 for $v_j$ in the sorted sequence do
3       if $\mathcal{S}_{i,j}^+=\emptyset \wedge \mathcal{S}_{i,j}^-=\emptyset$ then
4             $x_{j,i}\leftarrow 1$, $c_{j,i}\leftarrow 1$;
5             for any two integers $p\in[1,n]$ and $q\in[1,n]$ with $p\neq q$ and $(p,q)\neq(j,i)$ do
6                   if $c_{p,q}=0 \wedge c_{p,j}=1 \wedge c_{i,q}=1$ then
7                         $c_{p,q}\leftarrow 1$;
Algorithm 2 ILP Solver
Proposition 2.

Suppose $X_{n\times n}$ satisfies Eq. (2) and (4) before $v_i$ is considered. Algorithm 2 gives a feasible solution to the ILP problem (7) with a time complexity of $\mathcal{O}(n^{3})$ when $v_i$ is considered.

Proof.

By Proposition 1, Eq. (4) is equivalent to Assumption 1. Firstly, we prove by contradiction that Algorithm 2 gives a feasible solution. Before $v_i$ is considered, no two competitors in $\mathcal{V}$ are reachable in $\mathcal{G}_u$ by Assumption 1. Setting $x_{j,i} = 1$ is equivalent to adding an edge $(v_j, v_i)$ in $\mathcal{G}_u$. By the definition of $\mathcal{V}_j^{-}$ and $\mathcal{V}_i^{+}$, the addition of $(v_j, v_i)$ can only affect the reachability from the nodes of $\mathcal{V}_j^{-}$ to the nodes of $\mathcal{V}_i^{+}$ in $\mathcal{G}_u$. Suppose there exists a node $v_j \in \mathcal{B}_i$ satisfying $\mathcal{S}_{i,j}^{+} = \emptyset$ and $\mathcal{S}_{i,j}^{-} = \emptyset$ such that Assumption 1 is violated after setting $x_{j,i} = 1$. Then, after the addition of $(v_j, v_i)$, some node of $\mathcal{V}_j^{-}$ can reach, and competes with, some node of $\mathcal{V}_i^{+}$ in $\mathcal{G}_u$. Hence, there exists a node $v_k$ such that either $v_k \in \mathcal{V}_j^{-}$ and $v_k$ is a competitor of some node in $\mathcal{V}_i^{+}$ (i.e., $v_k \in \mathcal{S}_{i,j}^{+}$ by Eq. (13) and (15)), or $v_k \in \mathcal{V}_j^{+}$ and $v_k$ is a competitor of the nodes of $\mathcal{V}_i^{-}$ (i.e., $v_k \in \mathcal{S}_{i,j}^{-}$ by Eq. (10) and (15)). In either case, at least one of $\mathcal{S}_{i,j}^{-}$ and $\mathcal{S}_{i,j}^{+}$ is non-empty, which contradicts the condition in line 3 that leads to $x_{j,i} = 1$.

Secondly, we show the complexity of Algorithm 2. Given $\mathcal{B}_i$, the time complexity of sorting the nodes of $\mathcal{B}_i$ is $\mathcal{O}(n\log n)$, e.g., using the mergesort algorithm. Thus, by Lemma 2, the time complexity in line 1 is $\mathcal{O}(n\log n)$. The for-loop in line 2 has $\mathcal{O}(n)$ iterations since $|\mathcal{B}_i| \leqslant n$; by Lemma 2, the time complexity in line 3 is $\mathcal{O}(n^{2})$. For the for-loop in line 5, the time complexity is $\mathcal{O}(n^{2})$. The total time complexity in lines 2–7 is therefore $\mathcal{O}(n) \cdot \mathcal{O}(n^{2}) = \mathcal{O}(n^{3})$. Finally, Algorithm 2 has a time complexity of $\mathcal{O}(n^{3})$. ∎
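The greedy step analysed above can be illustrated with a short Python sketch. It is not the authors' implementation: the sets $\mathcal{S}_{i,j}^{+}$ and $\mathcal{S}_{i,j}^{-}$ are defined by equations outside this passage, so the test in line 3 is replaced here by a direct check that adding the edge $(v_j, v_i)$ would not let any node reach one of its competitors. The function name `greedy_step`, the 0-based indices, and passing the candidate set $\mathcal{B}_i$ as an explicit list are all assumptions made for illustration.

```python
from typing import List, Sequence

Matrix = List[List[int]]


def greedy_step(i: int, candidates: Sequence[int],
                w: Sequence[Sequence[float]], s: Matrix,
                x: Matrix, c: Matrix) -> None:
    """Process node v_i, updating x (data usage) and c (reachability) in place.

    s[p][q] == 1 means v_p and v_q compete; c[p][q] == 1 means v_p reaches v_q.
    `candidates` plays the role of B_i and must not contain i itself.
    """
    n = len(c)
    # Line 1: visit candidates in non-increasing order of w[j][i].
    for j in sorted(candidates, key=lambda k: w[k][i], reverse=True):
        # Nodes that already reach v_j (plus v_j itself), and nodes
        # already reachable from v_i (plus v_i itself).
        sources = [p for p in range(n) if p == j or c[p][j] == 1]
        targets = [q for q in range(n) if q == i or c[i][q] == 1]
        # Assumed stand-in for line 3: reject the edge if it would
        # connect any pair of competitors.
        if any(s[p][q] == 1 or s[q][p] == 1
               for p in sources for q in targets):
            continue
        # Line 4: accept the edge (v_j, v_i).
        x[j][i] = 1
        # Lines 5-7: every source now reaches every target.
        for p in sources:
            for q in targets:
                if p != q:
                    c[p][q] = 1
```

Both the conflict check and the reachability update touch at most $n^{2}$ pairs per candidate, so the sketch stays within the $\mathcal{O}(n^{3})$ bound. For instance, if $v_0$ competes with $v_2$ and $v_1$ already uses the data of $v_0$, then processing $v_2$ rejects both $v_1$ and $v_0$ as contributors.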

We show the correctness of Algorithm 1. At the beginning of Algorithm 1, $X_{n\times n}$ satisfies Eq. (2) and (4) by Eq. (6). After each step for $v_i$ in line 4, $X_{n\times n}$ still satisfies these constraints by Proposition 2. When Algorithm 1 ends, the final collaborating relationship among all FL-PTs is determined by $X_{n\times n}$. By Eq. (1), the time complexity of computing $w_i$ for each FL-PT $v_i$ is $\mathcal{O}(n)$; thus, the time complexity of computing $w_1, w_2, \cdots, w_n$ is $\mathcal{O}(n^{2})$. The time complexity of sorting $w_1, w_2, \cdots, w_n$ is $\mathcal{O}(n\log n)$. Thus, the time complexity in line 2 of Algorithm 1 is $\mathcal{O}(n^{2})$. By Proposition 2, the time complexity in lines 3-4 is $\mathcal{O}(n^{4})$. Thus, the time complexity of Algorithm 1 is $\mathcal{O}(n^{4})$.
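The bound can be read off by adding the costs of the individual steps. The display below only restates the figures given above, assuming that lines 3-4 of Algorithm 1 invoke Algorithm 2 once for each of the $n$ nodes, which is what the $\mathcal{O}(n^{4})$ figure implies:

```latex
\underbrace{\mathcal{O}(n^{2})}_{\text{computing } w_1,\dots,w_n}
\;+\; \underbrace{\mathcal{O}(n\log n)}_{\text{sorting}}
\;+\; \underbrace{n \cdot \mathcal{O}(n^{3})}_{\text{Algorithm 2, once per } v_i}
\;=\; \mathcal{O}(n^{4}).
```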

Table 1: Experiments with synthetic data under fixed competing graphs

Weakly Non-IID setting (MSE)

| Method | $v_1$ | $v_2$ | $v_3$ | $v_4$ | $v_5$ | $v_6$ | $v_7$ | $v_8$ |
|---|---|---|---|---|---|---|---|---|
| Local | 0.23±0.08 | 0.23±0.09 | 0.87±0.41 | 0.82±0.26 | 0.23±0.10 | 0.23±0.07 | 0.82±0.24 | 0.78±0.30 |
| FedAvg | 0.20±0.06 | 0.20±0.06 | 0.20±0.10 | 0.19±0.07 | 0.19±0.06 | 0.19±0.06 | 0.19±0.08 | 0.19±0.10 |
| FedProx | 0.16±0.06 | 0.17±0.07 | 0.15±0.09 | 0.17±0.08 | 0.17±0.06 | 0.17±0.06 | 0.16±0.09 | 0.18±0.07 |
| SCAFFOLD | 0.17±0.07 | 0.17±0.07 | 0.16±0.09 | 0.16±0.07 | 0.18±0.06 | 0.18±0.07 | 0.18±0.08 | 0.18±0.08 |
| CE | 0.14±0.10 | 0.14±0.11 | 1.14±0.67 | 1.20±0.88 | 0.15±0.08 | 0.16±0.09 | 1.23±0.37 | 1.22±0.81 |
| FedCompetitors | 0.14±0.12 | 0.14±0.07 | 0.13±0.06 | 0.15±0.06 | 0.15±0.08 | 0.14±0.06 | 0.14±0.07 | 0.14±0.07 |

Strongly Non-IID setting (MSE)

| Method | $v_1$ | $v_2$ | $v_3$ | $v_4$ | $v_5$ | $v_6$ | $v_7$ | $v_8$ |
|---|---|---|---|---|---|---|---|---|
| Local | 0.23±0.08 | 0.23±0.08 | 0.22±0.07 | 0.23±0.08 | 0.23±0.06 | 0.22±0.06 | 0.22±0.08 | 0.23±0.07 |
| FedAvg | 24.47±4.98 | 24.85±4.82 | 24.85±5.03 | 24.73±5.67 | 24.15±3.00 | 24.47±2.78 | 24.17±4.40 | 24.97±3.81 |
| FedProx | 17.80±7.54 | 17.82±6.42 | 17.88±7.68 | 17.86±7.64 | 17.69±7.14 | 17.76±6.23 | 17.68±5.94 | 17.73±7.04 |
| SCAFFOLD | 17.22±2.85 | 17.44±2.17 | 17.39±4.02 | 17.20±3.58 | 16.87±2.75 | 17.13±2.79 | 17.00±2.41 | 17.33±2.59 |
| CE | 0.15±0.12 | 0.14±0.11 | 0.14±0.07 | 0.14±0.07 | 0.14±0.06 | 0.14±0.06 | 0.12±0.05 | 0.12±0.05 |
| FedCompetitors | 0.14±0.07 | 0.13±0.06 | 0.13±0.06 | 0.14±0.09 | 0.13±0.07 | 0.14±0.06 | 0.11±0.04 | 0.13±0.07 |

Experimental Evaluation

We conduct experiments on synthetic data and the CIFAR-10 dataset. To investigate the practicality of FedCompetitors, we also adopt the electronic health record (EHR) dataset eICU (Pollard et al. 2018) to illustrate the collaboration relationships of FL-PTs on a real-world network of multiple hospitals.

Comparison baselines

In contrast to the approach proposed in the last section, we now give a more intuitive procedure for handling the competing relationships among FL-PTs. This procedure makes the previous FL approaches (e.g., FedAvg) applicable to the scenario of this paper. At a high level, we partition all FL-PTs into several disjoint groups such that the FL-PTs in each group are independent of each other, without conflict of interest. Baselines are then generated by directly applying the previous FL approaches to each group of FL-PTs. Specifically, the competing graph $\mathcal{G}_c$ describes the competing relationships among FL-PTs. Let $\mathcal{G}_c^{-}$ denote the complement of $\mathcal{G}_c$: there is an edge $(v_i, v_j)$ in $\mathcal{G}_c^{-}$ if and only if there is no edge between $v_i$ and $v_j$ in $\mathcal{G}_c$. Each edge in $\mathcal{G}_c^{-}$ indicates that the two FL-PTs connected by this edge are independent. A clique is a subset of nodes of $\mathcal{G}_c^{-}$ such that every two nodes in the clique are adjacent, that is, a clique induces a complete subgraph. A clique cover of $\mathcal{G}_c^{-}$ is a partition of all nodes into cliques, so that every two nodes in the same clique are adjacent in $\mathcal{G}_c^{-}$ and hence independent of each other (Tomita, Tanaka, and Takahashi 2006). A minimum clique cover is a clique cover that uses as few cliques as possible.
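The grouping idea can be sketched in Python. Finding a minimum clique cover is NP-hard in general, and this passage does not state which procedure the authors use, so the sketch below swaps in a simple greedy heuristic: it always returns a valid clique cover of $\mathcal{G}_c^{-}$ (no group contains two competitors), but not necessarily a minimum one. The function name and the integer node labels are illustrative.

```python
from typing import Dict, Iterable, List, Set, Tuple


def greedy_clique_cover(nodes: Iterable[int],
                        competing_edges: Iterable[Tuple[int, int]]
                        ) -> List[List[int]]:
    """Partition nodes into groups that contain no competing pair.

    Each group is a clique of the complement graph G_c^-.
    Greedy heuristic: valid, but not guaranteed to be minimum.
    """
    rivals: Dict[int, Set[int]] = {v: set() for v in nodes}
    for a, b in competing_edges:
        rivals[a].add(b)
        rivals[b].add(a)
    groups: List[List[int]] = []
    for v in rivals:  # visits nodes in the order they were given
        for group in groups:
            # v may join a group only if it competes with none of its members.
            if rivals[v].isdisjoint(group):
                group.append(v)
                break
        else:
            groups.append([v])
    return groups


# Hypothetical toy graph: 1-2, 2-3 and 4-5 compete.
cover = greedy_clique_cover([1, 2, 3, 4, 5], [(1, 2), (2, 3), (4, 5)])
```

On this toy graph the heuristic yields the two groups `[1, 3, 4]` and `[2, 5]`, each of which is free of competing pairs.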

The FL-PTs in each clique are grouped together for FL training, without involving the FL-PTs from other cliques. We apply four typical FL approaches directly to the nodes of each clique: FedAvg, CE, FedProx (Li et al. 2020) and SCAFFOLD (Karimireddy et al. 2020), which yields four baselines. The collaboration equilibrium (CE) approach is proposed in (Cui et al. 2022), where each coalition is defined as a strongly connected component of the benefit graph; its effectiveness has been well validated against several other approaches. FedProx and SCAFFOLD are two typical approaches that keep the aggregated model at the CS close to the global optimum, and they are two benchmarks in (Li et al. 2022) for evaluating FL performance under Non-IID data settings. The fifth baseline is Local, where each FL-PT performs local ML training without collaboration.
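As a rough illustration of applying an existing FL approach to each clique separately, the sketch below performs one FedAvg-style aggregation round per group, i.e., a sample-weighted average of the local parameter vectors of the group's members. Local training, communication, and the authors' actual training loop are omitted; the function name and the dictionary-based inputs are assumptions made for illustration.

```python
from typing import Dict, List, Sequence


def fedavg_per_group(groups: Sequence[Sequence[int]],
                     params: Dict[int, List[float]],
                     num_samples: Dict[int, int]) -> List[List[float]]:
    """Return one sample-weighted average parameter vector per group.

    params[v] is the local parameter vector of FL-PT v and
    num_samples[v] is its number of training samples.
    """
    averaged: List[List[float]] = []
    for group in groups:
        total = sum(num_samples[v] for v in group)
        dim = len(params[group[0]])
        # Weighted mean, coordinate by coordinate, within this group only.
        avg = [sum(num_samples[v] * params[v][d] for v in group) / total
               for d in range(dim)]
        averaged.append(avg)
    return averaged
```

Because the average is taken within each group only, no parameter information crosses from one clique to another.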

General experimental setting. Like (Cui et al. 2022), the hypernetwork technique in (Navon et al. 2021) is used to compute the benefit graph $\mathcal{G}_b$, and the hypernetwork is constructed by a multilayer perceptron (MLP). For each specific dataset, all approaches use the same network structure for each FL-PT to execute the learning tasks.

Synthetic experiments

We show the experimental results on synthetic data with fixed competing graphs. Specifically, let us consider 8 FL-PTs $\{v_1, v_2, \cdots, v_8\}$. The synthetic features are generated by $x \sim \mathcal{U}[-1.0, 1.0]$. Given the FL-PT $v_i$, the ground truth weights $u_{i,l} = v_l + r_{i,l}$ are sampled as $v \sim \mathcal{U}[0.0, 1.0]$ and $r_{i,l} \sim \mathcal{N}(0.0, \rho^{2})$ where $l \in \{1, 2, 3\}$; the noise $\epsilon \sim \mathcal{N}(0.0, 0.1^{2})$ is added to each label.

Weakly Non-IID setting. $\rho^{2}$ measures the data distribution discrepancy among FL-PTs. We set $\rho = 0.01$, which means that the generated data are weakly Non-IID in terms of sample features and labels. All FL-PTs learn the same type of polynomial regression task, and the synthetic labels are defined as $y = \sum_{l=1}^{3} u_{i,l}^{T} x^{l} + \epsilon$. The network used for predicting the label at each FL-PT is an MLP with one hidden layer. FL-PTs $v_1$, $v_2$, $v_5$ and $v_6$ have 2000 samples, while the other FL-PTs have 100 samples. Thus, there exists quantity skew, i.e., a significant difference in the sample quantities of FL-PTs. The two large FL-PTs $v_1$ and $v_2$ are independent of each other and compete with the other two large FL-PTs $v_5$ and $v_6$, which are also independent of each other. Each small FL-PT competes with one large FL-PT: $(v_1, v_7)$, $(v_2, v_8)$, $(v_3, v_5)$, and $(v_4, v_6)$ are edges in the competing graph $\mathcal{G}_c$. Such a $\mathcal{G}_c$ leads to a unique clique cover. Under this setting, the minimum clique cover of $\mathcal{G}_c^{-}$ is $\{v_i\}_{i=1}^{4}$ and $\{v_i\}_{i=5}^{8}$, and small FL-PTs benefit large FL-PTs little. The experimental results, measured by mean squared error (MSE), are given in Table 1. On average, CE has the worst performance since the small FL-PTs $v_3$, $v_4$, $v_7$ and $v_8$ cannot benefit from large FL-PTs. FedCompetitors has the best performance compared with the five baselines.
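The data-generating process for this setting can be sketched with the Python standard library. Several details are assumptions rather than facts from the paper: the feature dimension `dim` is not given in this passage, $x^{l}$ is interpreted as an elementwise power, the shared weights $v_l$ are drawn once for all FL-PTs, and the helper name `make_client_data` is hypothetical.

```python
import random
from typing import List, Sequence, Tuple


def make_client_data(shared_v: Sequence[Sequence[float]], rho: float,
                     num_samples: int, rng: random.Random
                     ) -> Tuple[List[List[float]], List[float]]:
    """Generate (features, labels) for one FL-PT.

    shared_v[l] holds the shared weight vector for degree l + 1.
    """
    dim = len(shared_v[0])
    # u_{i,l} = v_l + r_{i,l}, with r_{i,l} ~ N(0, rho^2) drawn per FL-PT.
    u = [[v + rng.gauss(0.0, rho) for v in v_l] for v_l in shared_v]
    xs: List[List[float]] = []
    ys: List[float] = []
    for _ in range(num_samples):
        x = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
        # y = sum_l u_{i,l}^T x^l + eps, with x^l taken elementwise (assumed).
        clean = sum(u[l][d] * x[d] ** (l + 1)
                    for l in range(3) for d in range(dim))
        xs.append(x)
        ys.append(clean + rng.gauss(0.0, 0.1))  # label noise N(0, 0.1^2)
    return xs, ys


rng = random.Random(0)
dim = 5  # hypothetical feature dimension
shared_v = [[rng.uniform(0.0, 1.0) for _ in range(dim)] for _ in range(3)]
# Quantity skew: v1, v2, v5, v6 are large; the others are small.
sizes = {i: 2000 if i in (1, 2, 5, 6) else 100 for i in range(1, 9)}
data = {i: make_client_data(shared_v, 0.01, m, rng) for i, m in sizes.items()}
```

With $\rho = 0.01$ the per-FL-PT weights differ only slightly from the shared ones, which is what makes the setting weakly Non-IID; the main heterogeneity comes from the sample counts.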

Table 2: Experiments with eICU under a fixed competing graph

| AUC | $v_1$ | $v_2$ | $v_3$ | $v_4$ | $v_5$ | $v_6$ | $v_7$ | $v_8$ | $v_9$ | $v_{10}$ |
|---|---|---|---|---|---|---|---|---|---|---|
| Local | 76.12 | 69.46 | 68.94 | 68.04 | 76.46 | 40.00 | 69.30 | 60.53 | 56.94 | 49.12 |
| FedAvg | 75.26 | 72.09 | 68.87 | 74.13 | 83.72 | 41.67 | 79.37 | 54.41 | 66.67 | 38.10 |
| CE | 83.53 | 75.64 | 74.38 | 74.46 | 80.89 | 82.61 | 71.43 | 66.67 | 66.67 | 80.00 |
| FedCompetitors | 81.50 | 78.23 | 69.18 | 83.52 | 85.91 | 89.58 | 80.70 | 68.89 | 90.48 | 95.24 |

Table 3: Experiments with CIFAR-10 under randomly generated competing graphs

| Method | MTA |
|---|---|
| Local | 86.46 ± 4.12 |
| FedAvg | 52.99 ± 4.38 |
| FedProx | 51.13 ± 7.10 |
| SCAFFOLD | 51.20 ± 7.09 |
| CE | 87.80 ± 7.18 |
| FedCompetitors | 91.33 ± 4.14 |

Strongly Non-IID setting. This setting is the same as the setting above except in three aspects. Firstly, each FL-PT has 2000 samples and there is no quantity skew. Secondly, we generate conflicting learning tasks by flipping the labels of some FL-PTs: $y = -\sum_{l=1}^{3} u_{i,l}^{T} x^{l} + \epsilon$ for $i \in \{5, 6, 7, 8\}$, which makes the data of the eight FL-PTs strongly Non-IID in terms of the labels. Thirdly, we test on a different competing graph where there are two independent groups of FL-PTs $\{v_i\}_{i=1}^{4}$ and $\{v_i\}_{i=5}^{8}$: for $i \in \{1, 5\}$, the FL-PTs $v_i$ and $v_{i+1}$ are independent of each other and compete with $v_{i+2}$ and $v_{i+3}$, which are also independent of each other. Under this setting, all FL-PTs in the same group can benefit each other; the minimum clique cover of $\mathcal{G}_c^{-}$ is $\{v_1, v_2, v_5, v_6\}$ and $\{v_3, v_4, v_7, v_8\}$. The experimental results are given in Table 1. FedAvg, FedProx, and SCAFFOLD perform the worst since training a global model cannot simultaneously satisfy the FL-PTs in the same clique with conflicting learning tasks. FedCompetitors has the best performance compared with the five baselines.
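A short calculation suggests why a single shared model does poorly inside such a clique. It rests on simplifying assumptions that are not stated in the paper: squared loss, two tasks with targets $f(x)$ and $-f(x)$, identical feature distributions, equal sample counts, an unrestricted model $g$, and independent zero-mean label noise $\epsilon, \epsilon'$ of variance $\sigma^{2}$.

```latex
\begin{aligned}
g^{\star} &= \arg\min_{g}\;
  \tfrac{1}{2}\,\mathbb{E}\big[(g(x) - f(x) - \epsilon)^{2}\big]
  + \tfrac{1}{2}\,\mathbb{E}\big[(g(x) + f(x) - \epsilon')^{2}\big] \\
&\Longrightarrow\; g^{\star}(x) = \tfrac{1}{2} f(x) + \tfrac{1}{2}\big(-f(x)\big) = 0, \\
\text{MSE of } g^{\star} \text{ on either task}
  &= \mathbb{E}\big[f(x)^{2}\big] + \sigma^{2}.
\end{aligned}
```

Under these assumptions the shared model's error is governed by the magnitude of the signal itself rather than by the noise level. This agrees in direction with the large FedAvg, FedProx, and SCAFFOLD errors in Table 1, although the calculation does not reproduce the reported values.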

Benchmark experiments

We conduct experiments on CIFAR-10 with competing graphs that are generated randomly. CIFAR-10 is an image classification dataset and has 10 classes, each with 6000 images. We follow the setting in (Cui et al. 2022) for CIFAR-10 to construct Non-IID data and network structures, and to measure performance. There are 10 FL-PTs, and each FL-PT randomly obtains 2 of the 10 classes to simulate the Non-IID setting. The model performance is measured by the mean test accuracy (MTA). To simulate competition, we set the probability of two FL-PTs competing against each other to 0.2, thus generating a random competing graph $\mathcal{G}_c$, which constrains the collaboration between some FL-PTs. Table 3 shows the experimental results. FedCompetitors has the best performance. FedAvg, FedProx, and SCAFFOLD perform the worst since training a global model cannot simultaneously satisfy the FL-PTs in the same clique with data heterogeneity. FedCompetitors outperforms CE by 3.53 percentage points in MTA (91.33 vs. 87.80).
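A random competing graph of this kind can be generated as follows. The sketch assumes that each unordered pair of FL-PTs is drawn independently, which the text does not state explicitly; the function name and the fixed seed are illustrative.

```python
import random
from typing import List


def random_competing_graph(n: int, p: float,
                           rng: random.Random) -> List[List[int]]:
    """Symmetric 0/1 adjacency matrix of a random competing graph.

    Each unordered pair of distinct FL-PTs competes with probability p.
    """
    s = [[0] * n for _ in range(n)]
    for a in range(n):
        for b in range(a + 1, n):
            if rng.random() < p:
                s[a][b] = s[b][a] = 1  # competition is mutual
    return s


s = random_competing_graph(10, 0.2, random.Random(0))
num_edges = sum(s[a][b] for a in range(10) for b in range(a + 1, 10))
```

With 10 FL-PTs there are 45 unordered pairs, so a probability of 0.2 gives 9 competing pairs in expectation; the count in any single draw varies around that value.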

Hospital collaboration example

eICU is a dataset that collects the EHRs of patients admitted to intensive care units (ICUs) in many hospitals across the United States. The task is to predict mortality during hospitalization. We use this dataset to illustrate a benefit graph $\mathcal{G}_b$ and a data usage graph $\mathcal{G}_u$ in the real world. The setting here is the same as the setting in (Cui et al. 2022) for eICU, including the data pre-processing procedure, the way of choosing hospitals, the network structures, and the performance metric. There are 10 hospitals, among which the first 5 hospitals $\{v_i\}_{i=1}^{5}$ are large with about 1000 patients per hospital and the others are small with about 100 patients per hospital. Label imbalance occurs since more than 90% of the samples have negative labels; thus, AUC is used to measure the utility of each FL-PT. The generated benefit graph $\mathcal{G}_b$ is illustrated in Figure 4(a).

Figure 4: Illustration of hospital collaboration. (a) Benefit graph $\mathcal{G}_b$. (b) Data usage graph $\mathcal{G}_u$.

Consider the case where more than one large hospital may be located in the same city, while small hospitals are dispersed in rural areas with lower population densities; competition occurs mainly among large hospitals. We assume that $v_2$ competes with $v_5$, while $v_3$ competes with both $v_4$ and $v_5$. For all baselines except the local approach, the clique cover is generated independently of $\mathcal{G}_b$, and the FL-PTs in each clique collaborate with one another; the generated clique cover consists of $\{v_4, v_5\}$ and $\{v_i\}_{i=1}^{3} \cup \{v_i\}_{i=6}^{10}$. For FedCompetitors, the data usage graph $\mathcal{G}_u$ generated by Algorithm 1 is illustrated in Figure 4(b); it fully utilizes the information in $\mathcal{G}_b$.
Compared with the baselines, Figure 4(b) shows that the local model updates of $v_4$ and $v_5$ can also be utilized by the other FL-PTs $\{v_1, v_7, v_8, v_9, v_{10}\}$, while $v_4$ and $v_5$ can likewise benefit from $v_1$ during FL training. This is an advantage of FedCompetitors, and it is reflected in the experimental results given in Table 2. Overall, FedCompetitors achieves the best performance.
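To make the competition constraint in this example concrete, the following minimal Python sketch checks whether a grouping of FL-PTs places any two competitors in the same collaboration group. The competition pairs and the baseline clique cover are taken from the text above; the function name `competing_pairs_within` and the single-group contrast case are hypothetical and introduced only for illustration. This is a simplified check of direct conflicts, not a reproduction of Algorithm 1 or of the extended balance-theory principle.

```python
from itertools import combinations

# Competition pairs stated in the text: v2-v5, v3-v4, v3-v5.
COMPETITION = {frozenset(p) for p in [(2, 5), (3, 4), (3, 5)]}


def competing_pairs_within(groups, competition=COMPETITION):
    """Return the sorted competing pairs that share a collaboration group."""
    conflicts = []
    for group in groups:
        # Examine every unordered pair of FL-PT indices inside the group.
        for a, b in combinations(sorted(group), 2):
            if frozenset((a, b)) in competition:
                conflicts.append((a, b))
    return sorted(conflicts)


# Clique cover reported for the baselines: {v4, v5} and {v1..v3} U {v6..v10}.
baseline_cover = [{4, 5}, {1, 2, 3, 6, 7, 8, 9, 10}]

# Hypothetical contrast case (not from the paper): all ten hospitals in one group.
single_group = [set(range(1, 11))]

print(competing_pairs_within(baseline_cover))  # []
print(competing_pairs_within(single_group))    # [(2, 5), (3, 4), (3, 5)]
```

Under these assumptions, the baseline clique cover contains no directly competing pair, but it isolates $v_4$ and $v_5$ from the remaining FL-PTs, which is the limitation the paragraph above contrasts with the data usage graph of FedCompetitors.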

Conclusions

In this paper, we consider an open research problem in which a subset of the FL-PTs in an FL ecosystem compete with one another. We extend a principle from balance theory, “the friend of my enemy is my enemy”, to guarantee that no conflict of interest arises among FL-PTs. The resulting FL ecosystem is highly scalable, since even competing FL-PTs can join it smoothly. We formulate the problem and show that it can be solved in polynomial time, and we propose an efficient algorithm to determine the collaboration relationships among FL-PTs. The framework is also general in that it jointly considers competition and data heterogeneity, the latter being another important aspect of FL. Extensive experiments demonstrate the effectiveness of the proposed framework.

Acknowledgments

This research was supported in part by the National Key R&D Program of China (No. 2022YFB2902900). This research/project is also supported, in part, by the National Research Foundation Singapore and DSO National Laboratories under the AI Singapore Programme (AISG Award No: AISG2-RP-2020-019); the RIE 2020 Advanced Manufacturing and Engineering (AME) Programmatic Fund (No. A20G8b0102), Singapore; and the Center for Frontier AI Research (CFAR), Agency for Science, Technology and Research (A*STAR), Singapore. The work of Hao Cheng and Chongjun Wang was supported by the National Natural Science Foundation of China (Grant No. 62192783, 62376117). The work of Shanli Tan was done when he was a research intern with Xiaohu Wu at the National Engineering Research Center of Mobile Network Technologies, Beijing University of Posts and Telecommunications, China.
