• Distributed Delta-Coloring under Bandwidth Limitations



    Magnús M. Halldórsson\cdot[email protected]\cdotReykjavik University, Iceland

    Yannic Maus111Supported by the Austrian Science Fund (FWF), Grant P36280-N.\cdot[email protected]\cdotTU Graz, Austria

Abstract

We consider the problem of coloring graphs of maximum degree ΔΔ\Deltaroman_Δ with ΔΔ\Deltaroman_Δ colors in the distributed setting with limited bandwidth. Specifically, we give a polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n-round randomized algorithm in the 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST model. This is close to the lower bound of Ω(loglogn)Ω𝑛\Omega(\log\log n)roman_Ω ( roman_log roman_log italic_n ) rounds from [Brandt et al., STOC ’16], which holds also in the more powerful 𝖫𝖮𝖢𝖠𝖫𝖫𝖮𝖢𝖠𝖫\mathsf{LOCAL}sansserif_LOCAL model. The core of our algorithm is a reduction to several special instances of the constructive Lovász local lemma (LLL) and the deg+1𝑑𝑒𝑔1deg+1italic_d italic_e italic_g + 1-list coloring problem.

1 Introduction

The objective in the c𝑐citalic_c-coloring problem is to color the vertices of a graph with c𝑐citalic_c such that any two adjacent vertices receive different colors. In the distributed setting, the Δ+1Δ1\Delta+1roman_Δ + 1-coloring problem has long been the focus of interest as the natural local coloring problem: any partial solution can be extended to a valid full solution. It has fast poly(loglogn)poly𝑛\operatorname{\text{{\rm poly}}}(\log\log n)poly ( roman_log roman_log italic_n )-round algorithms, both in 𝖫𝖮𝖢𝖠𝖫𝖫𝖮𝖢𝖠𝖫\mathsf{LOCAL}sansserif_LOCAL [13] and 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST [29], and so does the more general deg+1-list coloring problem (d1LC), which is what remains when a subset of the nodes has been Δ+1Δ1\Delta+1roman_Δ + 1-colored [30, 34].

The ΔΔ\Deltaroman_Δ-coloring problem, on the other hand, is non-local: fixing the colors of just two nodes can make it impossible to form a proper ΔΔ\Deltaroman_Δ-coloring, see Figure 1 for an example. Due to its simplicity, it has become the prototypical problem for the frontier of the unknown [27, 3]. Even the existence of such colorings is non-trivial: a celebrated result by Brooks from the ’40s shows that ΔΔ\Deltaroman_Δ-colorings exist for any connected graph that is neither an odd cycle nor a clique on Δ+1Δ1\Delta+1roman_Δ + 1 nodes [10].

A poly(loglogn)poly𝑛\operatorname{\text{{\rm poly}}}(\log\log n)poly ( roman_log roman_log italic_n )-round ΔΔ\Deltaroman_Δ-coloring algorithm was recently given in 𝖫𝖮𝖢𝖠𝖫𝖫𝖮𝖢𝖠𝖫\mathsf{LOCAL}sansserif_LOCAL [22], but no non-trivial algorithm is known in 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST. It is of natural interest to examine if the transition from local to non-local problems behaves differently in 𝖫𝖮𝖢𝖠𝖫𝖫𝖮𝖢𝖠𝖫\mathsf{LOCAL}sansserif_LOCAL and in 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST. Thus, we set out to answer the following question:

Is there a sublogarithmic time distributed ΔΔ\Deltaroman_Δ-coloring algorithm using small messages?

In this work, we answer the question in the affirmative. We prove the following theorem.

Theorem 1.1.

There is a randomized polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n-round 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST algorithm to ΔΔ\Deltaroman_Δ-color any graph with maximum degree Δ3Δ3\Delta\geq 3roman_Δ ≥ 3. The algorithm works with high probability.

Theorem 1.1 nearly matches the lower bound of Ω(loglogn)Ω𝑛\Omega(\log\log n)roman_Ω ( roman_log roman_log italic_n ) that holds in 𝖫𝖮𝖢𝖠𝖫𝖫𝖮𝖢𝖠𝖫\mathsf{LOCAL}sansserif_LOCAL [8]. In [3], the authors claim that in order to make progress in our understanding of distributed complexity theory, we require a ΔΔ\Deltaroman_Δ-coloring algorithm that is genuinely different from the approaches in [42, 27]. This is due to the fact that the current state-of-the-art runtime for ΔΔ\Deltaroman_Δ-coloring lies exactly in the regime that is poorly understood. The approaches of [42, 27] are based on brute-forcing solutions on carefully chosen subgraphs of super-constant diameter. In contrast, our results are based on a bandwidth-efficient deterministic reduction to a constant number of ‘simple’ Lovász Local Lemma (LLL) instances and O(logΔ)𝑂ΔO(\log\Delta)italic_O ( roman_log roman_Δ ) instances of d1LC; the LLL is a general solution method applicable to a wide range of problems.

It is known that LLL is complete for sublogarithmic computation on constant-degree graphs, but its role on general graphs is widely open [15]. Our algorithm adds to the small list of problems (see the related work section in [32]) that can be solved in sublogarithmic time with an LLL-type approach, even under the presence of bandwidth restrictions. Before continuing further, let us first detail the computational model.

In the 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST model, a communication network is abstracted as an n𝑛nitalic_n-node graph of maximum degree ΔΔ\Deltaroman_Δ, where nodes serve as computing entities and edges represent communication links. Initially, a node is unaware of the topology of the graph G𝐺Gitalic_G, nodes can communicate with their neighbors in order to coordinate their actions. This communication happens in synchronous rounds where, in each round, a node can perform arbitrary local computations and send one message of O(logn)𝑂𝑛O(\log n)italic_O ( roman_log italic_n ) bits over each incident edge. At the end of the algorithm, each node outputs its own portion of the solution, e.g., its color in coloring problems. The 𝖫𝖮𝖢𝖠𝖫𝖫𝖮𝖢𝖠𝖫\mathsf{LOCAL}sansserif_LOCAL model is identical, except without restrictions on message size.

1.1 Technical Overview on Previous Approaches

Previous fast distributed ΔΔ\Deltaroman_Δ-coloring algorithms either use huge bandwidth [42, 27] or use limited bandwidth but only work in the extreme cases of either very high-degree [22] or super low-degree graphs [40]. Optimally, we would like to take any of these solutions and run them with minor modifications to obtain an algorithm that uses low bandwidth and works for all degrees. This approach is entirely infeasible for the highly specialized algorithms in [42, 27, 28]. These works crucially rely on learning the full topology of non-constant diameter subgraphs, which is impossible in 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST.

For graphs of super-low degree, i.e., at most polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n, an efficient ΔΔ\Deltaroman_Δ-coloring algorithm with low bandwidth can be deduced from the results in [40]. In fact, the paper takes a complexity-theoretic approach and shows that any problem can be solved in sublogarithmic time with low bandwidth as long as 1) the problem is defined on low-degree graphs, 2) a given solution can be checked efficiently for correctness by a distributed algorithm, and 3) the problem admits a sublogarithmic time 𝖫𝖮𝖢𝖠𝖫𝖫𝖮𝖢𝖠𝖫\mathsf{LOCAL}sansserif_LOCAL model algorithm. As such, the results are not very constructive for any specific problem like the ΔΔ\Deltaroman_Δ-coloring problem. In fact, it is known that these generic techniques cannot be extended to problems defined on graphs with larger degrees [4], which is the main target of our work.

Our best hope is then the polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n-round 𝖫𝖮𝖢𝖠𝖫𝖫𝖮𝖢𝖠𝖫\mathsf{LOCAL}sansserif_LOCAL model algorithm of [22]. We discuss it in detail throughout the next few pages as it motivates the design choices of our solution. Unfortunately, for maximum degrees that are at most poly-logarithmic, it relies on the prior O(logΔ)+polyloglogn𝑂Δpoly𝑛O(\log\Delta)+\operatorname{\text{{\rm poly}}}\log\log nitalic_O ( roman_log roman_Δ ) + poly roman_log roman_log italic_n-round 𝖫𝖮𝖢𝖠𝖫𝖫𝖮𝖢𝖠𝖫\mathsf{LOCAL}sansserif_LOCAL model algorithm from [27] in a black-box manner. For large maximum degrees, however, when ΔΔ\Deltaroman_Δ is ω(log3n)𝜔superscript3𝑛\omega(\log^{3}n)italic_ω ( roman_log start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_n ), they provide a sophisticated constant-round randomized reduction to the deg+1𝑑𝑒𝑔1deg+1italic_d italic_e italic_g + 1-list coloring problem (d1LC) that also works with low bandwidth. The central ingredient in this reduction is the notion of slack.

Slack.

To reduce the ΔΔ\Deltaroman_Δ-coloring problem to d1LC, it suffices to obtain a unit amount of slack for each node. Namely, if two neighbors of a node are assigned the same color, there are then more colors available to the node than its number of uncolored neighbors. Slack can be easily generated w.h.p. (for most, but not all, kinds of nodes) with a simple single-round procedure termed 𝖲𝗅𝖺𝖼𝗄𝖦𝖾𝗇𝖾𝗋𝖺𝗍𝗂𝗈𝗇𝖲𝗅𝖺𝖼𝗄𝖦𝖾𝗇𝖾𝗋𝖺𝗍𝗂𝗈𝗇\mathsf{SlackGeneration}sansserif_SlackGeneration, as long as the graph has high degree. This observation has been used in countless papers on various coloring problems, e.g., [19, 35, 13, 29, 22]. For intermediate-degree graphs, this slack generation problem can be formulated as an instance of the constructive Lovász Local Lemma (LLL), but one that seems inherently non-implementable in 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST, as we explain later.

Recall that the LLL is a general solution method applicable to a wide range of problems. Defined over a set of independent random variables, it asks for an assignment of the variables that avoids a set of ”bad” events. The original theorem [18] shows that such an assignment exists as long as the probability of the events to occur is sufficiently small in relation to the dependence degree of the events, i.e., the number of other events that share a variable. There is now a general 𝖫𝖮𝖢𝖠𝖫𝖫𝖮𝖢𝖠𝖫\mathsf{LOCAL}sansserif_LOCAL algorithm running in O(logn)𝑂𝑛O(\log n)italic_O ( roman_log italic_n ) rounds of 𝖫𝖮𝖢𝖠𝖫𝖫𝖮𝖢𝖠𝖫\mathsf{LOCAL}sansserif_LOCAL [39, 16], but superfast poly(loglogn)poly𝑛\operatorname{\text{{\rm poly}}}(\log\log n)poly ( roman_log roman_log italic_n ) algorithms are only known for restricted cases [20, 26, 17]. Even less is known about solvability in 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST [31, 32].

In the presented slack generation LLL, there is a bad event for each node that holds if the respective node does not obtain slack. The mentioned 𝖲𝗅𝖺𝖼𝗄𝖦𝖾𝗇𝖾𝗋𝖺𝗍𝗂𝗈𝗇𝖲𝗅𝖺𝖼𝗄𝖦𝖾𝗇𝖾𝗋𝖺𝗍𝗂𝗈𝗇\mathsf{SlackGeneration}sansserif_SlackGeneration works as follows. Each node gets activated with a constant probability, picks a random candidate color that it keeps if no neighbor wants to get the same color and discards otherwise (see Algorithm 3 in Section 3 for details). Hence, there are random variables for each node depicting its activation status and candidate color choice. The main reason why this LLL cannot be directly implemented in 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST is that events involve values of variables at distance 2 in the communication graph. This makes it impossible for an event node to obtain full information on the status of all its variables, an ingredient that essentially is crucial in all known sublogarithmic-time LLL algorithms. The formal meaning of the word ‘essential’ in that sentence is extremely technical and is captured by the notion of a simulatable LLL (see Definition 5.2). In essence, it says that the LLL is easy enough such that event nodes can learn enough information about their variables to execute some simple primitives such as evaluating their status (does the event hold or not), resampling their variables, and computing certain conditional probabilities for the event to hold under partial variable assignments. The latter condition is the most challenging one to ensure.

1.2 Our Technical Approach

What we have discussed so far is only half the truth. In fact, the slack generation process only works for sparse nodes, i.e., nodes with many non-edges in their neighborhood. If the graph is locally too dense, then slack cannot be obtained via this LLL. Thus, the algorithm of [22] carefully analyzes the topological structure of the hard instances for ΔΔ\Deltaroman_Δ-coloring, combining several different (deterministic and randomized) methods to create slack. Such a treatment seems to be inherent to the ΔΔ\Deltaroman_Δ-coloring problem as a very similar classification was independently and currently discovered in the streaming model [1]. Additionally, it has also been shown to be useful in different models of computation. In the aftermath of these works, it has been used to obtain efficient massively parallel algorithms for the problem [11].

Our algorithm is based on a fine-grained version of this classification equipped with a sequence of various LLLs for eventual slack generation. Each LLL is easier to solve in the 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST model than the aforementioned slack generation LLL. In the following, we use the terminology of [22], and explain their algorithm and our solution in more detail.

Refer to caption
Figure 1: This is an example of an almost clique (AC). The depicted nice AC is a clique on Δ+1Δ1\Delta+1roman_Δ + 1 nodes with a single missing (red) edge. It is essential that the two nodes incident to the missing edge receive the same color to solve the ΔΔ\Deltaroman_Δ-coloring problem. All non-nice ACs form proper cliques.

Like in all recent randomized distributed graph coloring algorithms, they divide the graph into sparse and dense parts that are referred to as ”almost-cliques” (ACs). Then, they partition the ACs further into different types – ordinary, nice, difficult – each of which admits a different coloring approach. See Figure 1 for an example of an AC. One challenge is that all these different types of tricky subgraphs may appear in the same graph and close to each other. For this overview it is best to imagine each AC as a proper clique on almost ΔΔ\Deltaroman_Δ nodes in which each node has a few external neighbors residing in other ACs and creating lots of dependencies between different ACs. Thus, their algorithm is fragile with regard to the order in which different types of ACs are colored. The starting point of our work is that the core step of their algorithm does not work in low-degree graphs. More detailed, the first step of their algorithm executes SlackGeneration (see Algorithm 3 in Section 3) on a carefully selected subset of nodes to achieve three objectives: a) giving slack to all sparse nodes, b) providing a slack-toehold222A slack-toehold for an AC is an uncolored node that can be stalled to be colored later. All of its neighbors then lose one competitor for the remaining colors, providing them with temporal slack. for a subclass of the difficult ACs that the authors term ”runaway”, and c) providing each ordinary clique with a node that has slack. Each of these probabilistic guarantees holds w.h.p. as long as Δ=ω(log3n)Δ𝜔superscript3𝑛\Delta=\omega(\log^{3}n)roman_Δ = italic_ω ( roman_log start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_n ). Their proof shows that, in essence, all three cases are LLLs but ones that are far from being simulatable. We discuss our solutions for a)–c), separately.

Solution for a): Providing slack to sparse graphs is the main application of the LLL algorithm in [32]. In essence, we adapt their techniques to provide slack to sparse nodes but provide additional guarantees that are needed for other parts of the graph.

Refer to caption
Figure 2: for part b): The illustration depicts three difficult cliques of different layers. The external degree of C1subscript𝐶1C_{1}italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is 1111, the external degree of C2subscript𝐶2C_{2}italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT is 2222 and the external degree of C3subscript𝐶3C_{3}italic_C start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT is 4444. C1subscript𝐶1C_{1}italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT has the lowest layer and its special node (the red node) is part of C2subscript𝐶2C_{2}italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. The blue special node of C2subscript𝐶2C_{2}italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT is part of C3subscript𝐶3C_{3}italic_C start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT. So when we color C1subscript𝐶1C_{1}italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT the red node serves as an uncolored toehold providing slack to two gray nodes of C1subscript𝐶1C_{1}italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Stalling the coloring of these gray nodes provides slack to the white nodes of C1subscript𝐶1C_{1}italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT so that they can be colored, followed by the gray ones. For illustration purposes, we chose ΔΔ\Deltaroman_Δ to be 9999, but note that this would actually not classify C3subscript𝐶3C_{3}italic_C start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT as a difficult clique. A special node of C3subscript𝐶3C_{3}italic_C start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT would need 2eC3=82subscript𝑒subscript𝐶382e_{C_{3}}=82 italic_e start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = 8 neighbors in C3subscript𝐶3C_{3}italic_C start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT, which is impossible due to C3subscript𝐶3C_{3}italic_C start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT’s size.

Solution for b): For the difficult cliques we propose a solution that eliminates randomness and solely colors all the nodes via a sequence of d1LC instances. See Figure 2 for an illustration of our solution. First, we adjust the classification of difficult almost-cliques from [22]. All nodes in a given difficult clique have the same external degree. We associate with each such AC C𝐶Citalic_C a special node sCsubscript𝑠𝐶s_{C}italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT on its outside that has many neighbors on the inside (namely, more than twice the external degree of C𝐶Citalic_C’s nodes).

From here, we assign each difficult clique a layer that determines the step in which it gets colored. Those with a special node that is not contained in another difficult clique are treated separately and assigned to layer \infty, to be dealt with at the very end. The other difficult cliques are assigned to layers indexed by the base-2 logarithm of their external degree. The crucial property that follows is that the cliques in a given layer have their special node in a higher layer. This allows us to color the cliques layer by layer, starting with smaller layers. The special node sCsubscript𝑠𝐶s_{C}italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT is stalled to be colored later, providing a toehold for C𝐶Citalic_C. This way, we color the cliques and special nodes in all layers besides \infty.

This leaves the problem of coloring ACs the \infty layer and their still uncolored special nodes. In this exposition, we assume that special nodes are not shared by multiple difficult cliques. In that case, we pair the special node sCsubscript𝑠𝐶s_{C}italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT up with some node uCCsubscript𝑢𝐶𝐶u_{C}\in Citalic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ∈ italic_C that is not adjacent to sCsubscript𝑠𝐶s_{C}italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT with the objective to same-color the nodes: assigning both the same color. This is done via a virtual coloring problem capturing the dependencies between all selected pairs in the participating difficult cliques and the restrictions imposed by already colored vertices of the graph. We show that this virtual coloring instance is indeed a d1LC instance and can be solved efficiently in 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST despite being a problem on a virtual graph. As a result, the clique C𝐶Citalic_C obtains an uncolored node yCsubscript𝑦𝐶y_{C}italic_y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT that is adjacent to both sCsubscript𝑠𝐶s_{C}italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT and uCsubscript𝑢𝐶u_{C}italic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT, has slack due to two same-colored neighbors, and can serve as a toehold for C𝐶Citalic_C.

Besides removing the need for randomization to solve the difficult cliques, our classification of difficult cliques also captures significantly more ACs than the definition of difficult cliques in [22]. The additional structure provided to the remaining ACs is exploited down the line in the most challenging part of the algorithm, dealing with the ordinary cliques in part c).

Refer to caption
Figure 3: for Part c): For the large ordinary cliques, we find triples of nodes consisting of a yellow (striped), a light yellow (dotted), and a gray (solid) node. The two yellow nodes are non-adjacent while the gray node is adjacent to both of them. The goal is to same-color the pairs of yellow/light-yellow nodes, to which end we form a virtual coloring instance consisting of all pairs and their dependencies. After same-coloring the yellow nodes, the gray node provides a slack-toehold for the clique. An important aspect is that triples of different ordinary ACs are non-overlap** and no neighborhood of the graph contains too many nodes in such pairs, as otherwise we may run into unsolvable subinstances down the line. We find these triples by a sequence of ‘simple’ LLLs.

Solution for c): The most involved part by far is dealing with case c). We split the ordinary cliques into the small (of size less than ΔΔ/polyloglog(n)ΔΔpoly𝑛\Delta-\Delta/\operatorname{\text{{\rm poly}}}\log\log(n)roman_Δ - roman_Δ / poly roman_log roman_log ( italic_n )) and large. The small ones can be handled just like the sparse nodes, as one can show that their induced neighborhoods are relatively sparse. The main effort then is to manually create slack for the large ordinary cliques. For this exposition, it is best to imagine an ordinary clique to be a clique on ΔΔ\Deltaroman_Δ nodes in which each node of the clique has exactly one external neighbor that is again a member of a large ordinary clique. See Figure 3 for an illustration.

In order to create slack-toehold in each large AC C𝐶Citalic_C, we compute a ”vee-shaped” triple (xC,yC,zC)subscript𝑥𝐶subscript𝑦𝐶subscript𝑧𝐶(x_{C},y_{C},z_{C})( italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) of nodes, with xC,yCCsubscript𝑥𝐶subscript𝑦𝐶𝐶x_{C},y_{C}\in Citalic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ∈ italic_C and zCCsubscript𝑧𝐶𝐶z_{C}\notin Citalic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ∉ italic_C, but zCN(yC)subscript𝑧𝐶𝑁subscript𝑦𝐶z_{C}\in N(y_{C})italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ∈ italic_N ( italic_y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) and zCsubscript𝑧𝐶z_{C}italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT is also a non-neighbor of xCsubscript𝑥𝐶x_{C}italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT. Then, we set up a virtual list coloring instance with a node for each such pair with the objective to same-color the pairs (xC,zC)subscript𝑥𝐶subscript𝑧𝐶(x_{C},z_{C})( italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ). As we ensure that yCsubscript𝑦𝐶y_{C}italic_y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT is uncolored, it serves as a slack-toehold for the AC. As many of the important ACs can be mutually adjacent, the main difficulty lies in finding non-overlap** triples for the ACs. We ensure this by first computing a suitable candidate set Z𝑍Zitalic_Z from which we then pick the third node zCsubscript𝑧𝐶z_{C}italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT of the triple. Finding the set Z𝑍Zitalic_Z can be modeled as an ‘easy’ LLL fitting the framework of [32]. Finding the node zCZsubscript𝑧𝐶𝑍z_{C}\in Zitalic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ∈ italic_Z can also be modeled as a different type of ‘easy’ LLL. In essence, the first LLL is easy (in 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST) as its bad events only consist of simple bounds on the number of neighbors in Z𝑍Zitalic_Z. Next, we elaborate on our LLL for finding zCZsubscript𝑧𝐶𝑍z_{C}\in Zitalic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ∈ italic_Z with slightly more detail; due to further technicalities of the existing LLL algorithms from which we spare you in this technical overview, our actual solution differs slightly from the one presented here.

With a given set Z𝑍Zitalic_Z, we model the problem of selecting zCZsubscript𝑧𝐶𝑍z_{C}\in Zitalic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ∈ italic_Z as an LLL as follows. Each AC C𝐶Citalic_C sends a proposal (to serve as its zCsubscript𝑧𝐶z_{C}italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT node) to each outside neighbor inside Z𝑍Zitalic_Z with probability polyloglogn/Δpoly𝑛Δ\operatorname{\text{{\rm poly}}}\log\log n/\Deltapoly roman_log roman_log italic_n / roman_Δ. The proposal is successful if no other AC proposes to that node. We show that with a constant probability, no other AC proposes to the same node and that this is independent for different nodes in Z𝑍Zitalic_Z. Since we ensure C𝐶Citalic_C has many neighbors in Z𝑍Zitalic_Z, we obtain that the probability that none of C𝐶Citalic_C’s proposals are successful is bounded above by p=exp(Ω(polylogn))𝑝Ωpoly𝑛p=\exp(-\Omega(\operatorname{\text{{\rm poly}}}\log n))italic_p = roman_exp ( - roman_Ω ( poly roman_log italic_n ) ). The main benefit is that this LLL and also the LLL for finding the set Z𝑍Zitalic_Z are simple enough to be simulatable (in contrast to LLLs based on randomized slack generation for those ACs that can be derived from the proofs in [32]).

Once we have found zCsubscript𝑧𝐶z_{C}italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT, the structure of large ordinary ACs implies that we can deterministically find the other two nodes xCsubscript𝑥𝐶x_{C}italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT and yCsubscript𝑦𝐶y_{C}italic_y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT of the triple. Additional complications arise in ensuring that the list coloring instance of the pairs is a d1LC instance, i.e., that the size of the joint available color palette of xCsubscript𝑥𝐶x_{C}italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT and zCsubscript𝑧𝐶z_{C}italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT exceeds the maximum degree in the virtual graph induced by the pairs. The last difficulty that appears is solving the d1LC instance, as the bandwidth between the nodes within a pair is very limited and existing d1LC algorithms cannot be run in a black-box manner.

Further related work.

Graph coloring is fundamental to distributed computing as an elegant way of breaking symmetry and avoiding contention, and was, in fact, the topic of the original paper introducing the 𝖫𝖮𝖢𝖠𝖫𝖫𝖮𝖢𝖠𝖫\mathsf{LOCAL}sansserif_LOCAL model [37]. There is an abundance of efficient deterministic and randomized Δ+1Δ1\Delta+1roman_Δ + 1-coloring algorithms in 𝖫𝖮𝖢𝖠𝖫𝖫𝖮𝖢𝖠𝖫\mathsf{LOCAL}sansserif_LOCAL and 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST for various settings, e.g., [2, 35, 21, 13, 9, 43, 39, 29, 33, 30, 24]. The excellent monograph on distributed graph coloring by Barenboim and Elkin is still a great resource for older results [5].

There are significantly fewer results for coloring with fewer than Δ+1Δ1\Delta+1roman_Δ + 1 colors. A 𝖫𝖮𝖢𝖠𝖫𝖫𝖮𝖢𝖠𝖫\mathsf{LOCAL}sansserif_LOCAL algorithm is known for ΔkΔ𝑘\Delta-kroman_Δ - italic_k-coloring in graphs not containing too large cliques [6]. An O(loglogn)𝑂𝑛O(\log\log n)italic_O ( roman_log roman_log italic_n )-round ΔΔ\Deltaroman_Δ-coloring algorithm in the 𝖫𝖮𝖢𝖠𝖫𝖫𝖮𝖢𝖠𝖫\mathsf{LOCAL}sansserif_LOCAL model is known for trees [12], matching the lower bound [8] within a constant factor. Additionally, there are works coloring special graph classes such as coloring planar graphs with 6666 or 5555 colors in O(logn)𝑂𝑛O(\log n)italic_O ( roman_log italic_n ) rounds with a deterministic 𝖫𝖮𝖢𝖠𝖫𝖫𝖮𝖢𝖠𝖫\mathsf{LOCAL}sansserif_LOCAL algorithm [14, 41].

Outline.

In Section 2, we define the notion of slack and state required results from prior work on solving d1LC and computing an almost clique decomposition (ACD). In Section 3, we present our ΔΔ\Deltaroman_Δ-coloring algorithm with essentially all proofs. The algorithm consists of 5555 phases and all phases except for Phases 1 (ACD computation) and Phase 2 are deterministic reductions to various d1LC instances. In Phase 2, we provide slack to sparse nodes and the nodes in ordinary cliques; this refers to part a) and part c) described in Section 1.2. For ease of presentation, the (involved) Phase 2 is presented in two consecutive sections, where we first reduce Phase 2 to solving four different subproblems in Section 4 and then solve each of these subproblems via an instance of the constructive Lovász Local Lemma in Section 5.

2 Preliminaries: d1LC, Slack, Almost-Clique Decomposition, Graytone

In the deg+1𝑑𝑒𝑔1deg+1italic_d italic_e italic_g + 1-list coloring (d1LC) problem, each node of a graph receives as input a list of allowaed colors whose size exceeds its degree. The goal is to compute a proper vertex coloring in which each node outputs a color from its list. The problem can be solved with a simple centralized greedy algorithm, and it also admits efficient distributed algorithms.

Lemma 2.1 (List coloring [30, 34]).

There is a randomized 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST algorithm to (deg+1)𝑑𝑒𝑔1(deg+1)( italic_d italic_e italic_g + 1 )-list-color (d1LC) any graph in O(log5logn)𝑂superscript5𝑛O(\log^{5}\log n)italic_O ( roman_log start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT roman_log italic_n ) rounds, w.h.p. This reduces to O(log3logn)𝑂superscript3𝑛O(\log^{3}\log n)italic_O ( roman_log start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT roman_log italic_n ) rounds when the degrees and the size of the color space is poly(logn)poly𝑛\operatorname{\text{{\rm poly}}}(\log n)poly ( roman_log italic_n ).

The slack of a node (potentially in a subgraph) is defined as the difference between the size of its palette and the number of uncolored neighbors (in the subgraph).

Definition 2.2 (Slack).

Let v𝑣vitalic_v be a node with color palette Ψ(v)Ψ𝑣\Psi(v)roman_Ψ ( italic_v ) in a subgraph H𝐻Hitalic_H of G𝐺Gitalic_G. The slack of v𝑣vitalic_v in H𝐻Hitalic_H is the difference |Ψ(v)|dΨ𝑣𝑑|\Psi(v)|-d| roman_Ψ ( italic_v ) | - italic_d, where d𝑑ditalic_d is the number of uncolored neighbors of v𝑣vitalic_v in H𝐻Hitalic_H.

We use the following helpful terminology.

Definition 2.3 (Graytone [22]).

Consider an arbitrary step of the algorithm. A node is gray if it has unit-slack or a neighbor that will be colored in a later step of the algorithm. A node is grayish if it is not gray but has a gray neighbor. A set of gray and grayish nodes is said to be graytone.

Any graytone set can be colored as two d1LC instances: first the grayish nodes and then the gray. We emphasize that the graytone property depends on the order in which nodes are processed. It always refers to a certain step of the algorithm in which we color the respective set. Throughout our algorithm we aim at making more and more nodes graytone.

The following construction is central to our approach.

Lemma 2.4 (ACD computation [1, 22]).

For any graph G=(V,E)𝐺𝑉𝐸G=(V,E)italic_G = ( italic_V , italic_E ), there is a partition (almost-clique decomposition (ACD) of V𝑉Vitalic_V into sets Vsparsesubscript𝑉𝑠𝑝𝑎𝑟𝑠𝑒V_{sparse}italic_V start_POSTSUBSCRIPT italic_s italic_p italic_a italic_r italic_s italic_e end_POSTSUBSCRIPT and C1,C2,,Ctsubscript𝐶1subscript𝐶2subscript𝐶𝑡C_{1},C_{2},\ldots,C_{t}italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT such that each node in Vsparsesubscript𝑉𝑠𝑝𝑎𝑟𝑠𝑒V_{sparse}italic_V start_POSTSUBSCRIPT italic_s italic_p italic_a italic_r italic_s italic_e end_POSTSUBSCRIPT is Ω(ϵ2Δ)Ωsuperscriptitalic-ϵ2Δ\Omega(\epsilon^{2}\Delta)roman_Ω ( italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Δ )-sparse and for every i[t]𝑖delimited-[]𝑡i\in[t]italic_i ∈ [ italic_t ],

  1. (i)

    (1ε/4)Δ|Ci|(1+ε)Δ1𝜀4Δsubscript𝐶𝑖1𝜀Δ(1-\varepsilon/4)\Delta\leq|C_{i}|\leq(1+\varepsilon)\Delta( 1 - italic_ε / 4 ) roman_Δ ≤ | italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | ≤ ( 1 + italic_ε ) roman_Δ ,

  2. (ii)

    Each vCi𝑣subscript𝐶𝑖v\in C_{i}italic_v ∈ italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT has at least (1ε)Δ1𝜀Δ(1-\varepsilon)\Delta( 1 - italic_ε ) roman_Δ neighbors in Cisubscript𝐶𝑖C_{i}italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT: |N(v)Ci|(1ε)Δ𝑁𝑣subscript𝐶𝑖1𝜀Δ|N(v)\cap C_{i}|\geq(1-\varepsilon)\Delta| italic_N ( italic_v ) ∩ italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | ≥ ( 1 - italic_ε ) roman_Δ ,

  3. (iii)

    Each node uCi𝑢subscript𝐶𝑖u\not\in C_{i}italic_u ∉ italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT has at most (1ε/2)Δ1𝜀2Δ(1-\varepsilon/2)\Delta( 1 - italic_ε / 2 ) roman_Δ neighbors in Cisubscript𝐶𝑖C_{i}italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT: |N(u)Ci|(1ε/2)Δ𝑁𝑢subscript𝐶𝑖1𝜀2Δ|N(u)\cap C_{i}|\leq(1-\varepsilon/2)\Delta| italic_N ( italic_u ) ∩ italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | ≤ ( 1 - italic_ε / 2 ) roman_Δ.

Further, there is an O(1)𝑂1O(1)italic_O ( 1 )-round 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST algorithm to compute a valid ACD, w.h.p.

We adapt a proof from [22] that, as stated, applies only to the case when ΔΔ\Deltaroman_Δ is sufficiently large. Technically, the argument differs only in that we build on [34] instead of [29] in the first step of the argument, where we compute a decomposition with weaker properties. We have opted to rephrase it, given the different constants in the definitions of these works and in order to make it more self-contained.

Proof of Lemma 2.4.

We first use a O(1)𝑂1O(1)italic_O ( 1 )-round 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST algorithm of [HNT22] to compute a weaker form of ACD with parameter ε/4𝜀4\varepsilon/4italic_ε / 4.333While such a statement is used in the paper, it is not explicitly stated. Alternatively, we may use an alternative (slower) implementation (Lemma 4.4) in [Flin et al (FGHKN22), arXiv:2301.06457]] that runs O(loglogn)𝑂𝑛O(\log\log n)italic_O ( roman_log roman_log italic_n ) rounds for Δ=poly(logn)Δpoly𝑛\Delta=\operatorname{\text{{\rm poly}}}(\log n)roman_Δ = poly ( roman_log italic_n ) and still suffices for our main result. The slowdown in [FGHKN22] comes from working with sparsified graphs, while a 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST version also runs in O(1)𝑂1O(1)italic_O ( 1 ) rounds. Namely, it computes w.h.p. a partition (V,D1,D2,,Dk)superscript𝑉subscript𝐷1subscript𝐷2subscript𝐷𝑘(V^{\prime},D_{1},D_{2},\ldots,D_{k})( italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_D start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_D start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) where nodes in Vsuperscript𝑉V^{\prime}italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT are Ω(Δ)ΩΔ\Omega(\Delta)roman_Ω ( roman_Δ )-sparse and we have, for each i[k]𝑖delimited-[]𝑘i\in[k]italic_i ∈ [ italic_k ]:

  1. (a)

    |Di|(1+ε/4)Δsubscript𝐷𝑖1𝜀4Δ|D_{i}|\leq(1+\varepsilon/4)\Delta| italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | ≤ ( 1 + italic_ε / 4 ) roman_Δ, and

  2. (b)

    |N(v)Di|(1ε/4)Δ𝑁𝑣subscript𝐷𝑖1𝜀4Δ|N(v)\cap D_{i}|\geq(1-\varepsilon/4)\Delta| italic_N ( italic_v ) ∩ italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | ≥ ( 1 - italic_ε / 4 ) roman_Δ, for each vDi𝑣subscript𝐷𝑖v\in D_{i}italic_v ∈ italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

What this construction does not satisfy is condition (iii).

We form a modified decomposition (Vsparse,C1,,Ck)subscript𝑉𝑠𝑝𝑎𝑟𝑠𝑒subscript𝐶1subscript𝐶𝑘(V_{sparse},C_{1},\cdots,C_{k})( italic_V start_POSTSUBSCRIPT italic_s italic_p italic_a italic_r italic_s italic_e end_POSTSUBSCRIPT , italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) as follows. For each i[t]𝑖delimited-[]𝑡i\in[t]italic_i ∈ [ italic_t ], let Cisubscript𝐶𝑖C_{i}italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT consist of Disubscript𝐷𝑖D_{i}italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT along with the nodes in Vsuperscript𝑉V^{\prime}italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT with at least (1ε)Δ1𝜀Δ(1-\varepsilon)\Delta( 1 - italic_ε ) roman_Δ neighbors in Disubscript𝐷𝑖D_{i}italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Let Vsparse=ViCiV_{sparse}=V\setminus\cup_{i}C_{i}italic_V start_POSTSUBSCRIPT italic_s italic_p italic_a italic_r italic_s italic_e end_POSTSUBSCRIPT = italic_V ∖ ∪ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Observe that the decomposition is well-defined, as a node uV𝑢superscript𝑉u\in V^{\prime}italic_u ∈ italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT cannot have (1ε)Δ>Δ/21𝜀ΔΔ2(1-\varepsilon)\Delta>\Delta/2( 1 - italic_ε ) roman_Δ > roman_Δ / 2 neighbors in more than one Disubscript𝐷𝑖D_{i}italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

We first bound from above the number of nodes added to each part Cisubscript𝐶𝑖C_{i}italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Each node in Disubscript𝐷𝑖D_{i}italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT has at most εΔ/4𝜀Δ4\varepsilon\Delta/4italic_ε roman_Δ / 4 outside neighbors, so the number of edges with exactly one endpoint in Disubscript𝐷𝑖D_{i}italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is at most εΔ|Di|/4ε(1+ε/4)Δ2/4𝜀Δsubscript𝐷𝑖4𝜀1𝜀4superscriptΔ24\varepsilon\Delta|D_{i}|/4\leq\varepsilon(1+\varepsilon/4)\Delta^{2}/4italic_ε roman_Δ | italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | / 4 ≤ italic_ε ( 1 + italic_ε / 4 ) roman_Δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 4, using (a) to bound |Di|subscript𝐷𝑖|D_{i}|| italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT |. Each node in CiDisubscript𝐶𝑖subscript𝐷𝑖C_{i}\setminus D_{i}italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∖ italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is incident on at least (1ε)Δ1𝜀Δ(1-\varepsilon)\Delta( 1 - italic_ε ) roman_Δ such edges (by definition). Thus,

|CiDi|ε/4(1+ε/4)Δ/(1ε)εΔ/2.subscript𝐶𝑖subscript𝐷𝑖𝜀41𝜀4Δ1𝜀𝜀Δ2|C_{i}\setminus D_{i}|\leq\varepsilon/4\cdot(1+\varepsilon/4)\Delta/(1-% \varepsilon)\leq\varepsilon\Delta/2\ .| italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∖ italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | ≤ italic_ε / 4 ⋅ ( 1 + italic_ε / 4 ) roman_Δ / ( 1 - italic_ε ) ≤ italic_ε roman_Δ / 2 . (1)

Now, (iii) holds since a node outside Cisubscript𝐶𝑖C_{i}italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT has at most (1ε)Δ1𝜀Δ(1-\varepsilon)\Delta( 1 - italic_ε ) roman_Δ neighbors in Disubscript𝐷𝑖D_{i}italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (by the definition of Cisubscript𝐶𝑖C_{i}italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT) and at most |CiDi|ϵΔ/2subscript𝐶𝑖subscript𝐷𝑖italic-ϵΔ2|C_{i}\setminus D_{i}|\leq\epsilon\Delta/2| italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∖ italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | ≤ italic_ϵ roman_Δ / 2 other neighbors in Cisubscript𝐶𝑖C_{i}italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (by Eq. 1). Also, (ii) holds for nodes in Disubscript𝐷𝑖D_{i}italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT by (b) and for nodes in CiDisubscript𝐶𝑖subscript𝐷𝑖C_{i}\setminus D_{i}italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∖ italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT by the definition of Cisubscript𝐶𝑖C_{i}italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. For the lower bound in (i), |Ci||Di|(1ε/4)Δsubscript𝐶𝑖subscript𝐷𝑖1𝜀4Δ|C_{i}|\geq|D_{i}|\geq(1-\varepsilon/4)\Delta| italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | ≥ | italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | ≥ ( 1 - italic_ε / 4 ) roman_Δ, by (b). For the upper bound of (i), we have |Ci||Di|+|CiDi|(1+3ϵ/4)Δsubscript𝐶𝑖subscript𝐷𝑖subscript𝐶𝑖subscript𝐷𝑖13italic-ϵ4Δ|C_{i}|\leq|D_{i}|+|C_{i}\setminus D_{i}|\leq(1+3\epsilon/4)\Delta| italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | ≤ | italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | + | italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∖ italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | ≤ ( 1 + 3 italic_ϵ / 4 ) roman_Δ (by (a) and Eq. 1).

Finally, the claim about Vsparsesubscript𝑉𝑠𝑝𝑎𝑟𝑠𝑒V_{sparse}italic_V start_POSTSUBSCRIPT italic_s italic_p italic_a italic_r italic_s italic_e end_POSTSUBSCRIPT follows from the definition of Vsuperscript𝑉V^{\prime}italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, as VsparseVsubscript𝑉𝑠𝑝𝑎𝑟𝑠𝑒superscript𝑉V_{sparse}\subseteq V^{\prime}italic_V start_POSTSUBSCRIPT italic_s italic_p italic_a italic_r italic_s italic_e end_POSTSUBSCRIPT ⊆ italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. ∎

We say that nodes in Vsparsesubscript𝑉𝑠𝑝𝑎𝑟𝑠𝑒V_{sparse}italic_V start_POSTSUBSCRIPT italic_s italic_p italic_a italic_r italic_s italic_e end_POSTSUBSCRIPT are sparse and other nodes are dense. It is immediate from Lemma 2.4 that each dense node has external degree (or neighbors outside its AC) at most εΔ𝜀Δ\varepsilon\Deltaitalic_ε roman_Δ and at most 2εΔ2𝜀Δ2\varepsilon\Delta2 italic_ε roman_Δ non-neighbors in its AC. Also, any pair of nodes in Cisubscript𝐶𝑖C_{i}italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT have at least (13ε)Δ3Δ/413𝜀Δ3Δ4(1-3\varepsilon)\Delta\geq 3\Delta/4( 1 - 3 italic_ε ) roman_Δ ≥ 3 roman_Δ / 4 common neighbors in Cisubscript𝐶𝑖C_{i}italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

Notation. For a graph G=(V,E)𝐺𝑉𝐸G=(V,E)italic_G = ( italic_V , italic_E ) and two nodes u,vV𝑢𝑣𝑉u,v\in Vitalic_u , italic_v ∈ italic_V, let distG(u,v)subscriptdist𝐺𝑢𝑣\operatorname{dist}_{G}(u,v)roman_dist start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_u , italic_v ) denote the length of a shortest (unweighted) path between u𝑢uitalic_u and v𝑣vitalic_v in G𝐺Gitalic_G. For a set SV𝑆𝑉S\subseteq Vitalic_S ⊆ italic_V we denote distG(v,S)=minuSdistG(v,u)subscriptdist𝐺𝑣𝑆subscript𝑢𝑆subscriptdist𝐺𝑣𝑢\operatorname{dist}_{G}(v,S)=\min_{u\in S}\operatorname{dist}_{G}(v,u)roman_dist start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v , italic_S ) = roman_min start_POSTSUBSCRIPT italic_u ∈ italic_S end_POSTSUBSCRIPT roman_dist start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v , italic_u ). N(v)𝑁𝑣N(v)italic_N ( italic_v ) denotes the set of neighbors of a node vV𝑣𝑉v\in Vitalic_v ∈ italic_V.

3 ΔΔ\Deltaroman_Δ-Coloring in CONGEST

In this subsection, we prove the following theorem.

See 1.1

The extreme cases of very large ΔΔ\Deltaroman_Δ and very small ΔΔ\Deltaroman_Δ can be solved in the claimed runtime with prior work [22, 40], see the proof of Theorem 1.1 in Section 3.3. Here, we present an algorithm for the most challenging regime where ΔO(polylogn)Ω(polyloglogn)Δ𝑂poly𝑛Ωpoly𝑛\Delta\in O(\operatorname{\text{{\rm poly}}}\log n)\cap\Omega(\operatorname{% \text{{\rm poly}}}\log\log n)roman_Δ ∈ italic_O ( poly roman_log italic_n ) ∩ roman_Ω ( poly roman_log roman_log italic_n ).

In the extreme case that Δ=ω(log21n)Δ𝜔superscript21𝑛\Delta=\omega(\log^{21}n)roman_Δ = italic_ω ( roman_log start_POSTSUPERSCRIPT 21 end_POSTSUPERSCRIPT italic_n ), the ΔΔ\Deltaroman_Δ-coloring algorithm from [22] even runs in O(logn)𝑂superscript𝑛O(\log^{*}n)italic_O ( roman_log start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT italic_n ) rounds. A lower bound of Ω(logΔlogn)ΩsubscriptΔ𝑛\Omega(\log_{\Delta}\log n)roman_Ω ( roman_log start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT roman_log italic_n ) rounds in the 𝖫𝖮𝖢𝖠𝖫𝖫𝖮𝖢𝖠𝖫\mathsf{LOCAL}sansserif_LOCAL model for the ΔΔ\Deltaroman_Δ-coloring problem [8] rules out a O(logn)𝑂superscript𝑛O(\log^{*}n)italic_O ( roman_log start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT italic_n ) algorithm for small ΔΔ\Deltaroman_Δ. Hence, in this section, we aim for an algorithm using polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n rounds. In fact, we reduce the ΔΔ\Deltaroman_Δ-coloring problem to a few list coloring instances and a few LLL instances, each of which we solve in polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n rounds.

3.1 Fine-Grained ACD Partition

The following definitions of types of almost-cliques are crucial for all results of the paper. The reader is hereby warned to read them slowly!

Definition 3.1 (Types of almost-cliques).

For an AC C𝐶Citalic_C, let eC=Δ|C|+1subscript𝑒𝐶Δ𝐶1e_{C}=\Delta-|C|+1italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT = roman_Δ - | italic_C | + 1. An AC is easy if it contains a non-edge or a node of degree less than ΔΔ\Deltaroman_Δ. A node vC𝑣𝐶v\notin Citalic_v ∉ italic_C is an intrusive neighbor of a non-easy C𝐶Citalic_C if v𝑣vitalic_v has at least 2eC2subscript𝑒𝐶2e_{C}2 italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT neighbors in C𝐶Citalic_C. A non-easy AC is difficult if it has an intrusive neighbor. Each difficult AC C𝐶Citalic_C arbitrarily selects one of its intrusive neighbors as its special node sCsubscript𝑠𝐶s_{C}italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT. An AC is nice if it is easy or if it is both non-difficult and contains a special node (necessarily for another AC). An AC is ordinary if it is neither nice nor difficult.

Note that all ACs except the easy are proper cliques and all nodes in such a clique C𝐶Citalic_C have external degree eCsubscript𝑒𝐶e_{C}italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT. We say that a node is ordinary (difficult, nice) if it belongs to an ordinary (difficult, nice) AC, respectively. The difficult ACs are divided into levels.

Definition 3.2 (Levels of difficult ACs).

The maximum level \infty contains all difficult ACs whose special node is not contained in a difficult AC. A difficult AC C𝐶Citalic_C that is not at the maximum level has level (C)=log2eC𝐶subscript2subscript𝑒𝐶\ell(C)=\lceil\log_{2}e_{C}\rceilroman_ℓ ( italic_C ) = ⌈ roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ⌉.

Observe that (C)log2Δ=O(loglogn)𝐶subscript2Δ𝑂𝑛\ell(C)\leq\log_{2}\Delta=O(\log\log n)roman_ℓ ( italic_C ) ≤ roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT roman_Δ = italic_O ( roman_log roman_log italic_n ) for all difficult ACs.

Definition 3.3 (Node classification).
  1. The nodes are partitioned into the following sets:

  2. 1.

    𝒮𝒮\mathcal{S}caligraphic_S: the set of special nodes that are not in difficult ACs,

  3. 2.

    𝒟subscript𝒟\mathcal{D}_{\ell}caligraphic_D start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT: nodes in difficult ACs of level \ellroman_ℓ, [lgΔ]{}delimited-[]lgΔ\ell\in[\lg\Delta]\cup\{\infty\}roman_ℓ ∈ [ roman_lg roman_Δ ] ∪ { ∞ } (might include special nodes),

  4. 3.

    𝒩𝒩\mathcal{N}caligraphic_N: nodes in nice ACs, excluding those in 𝒮𝒮\mathcal{S}caligraphic_S,

  5. 4.

    𝒪𝒪\mathcal{O}caligraphic_O: nodes in ordinary ACs, and

  6. 5.

    Vsubscript𝑉V_{*}italic_V start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT: nodes in Vsparsesubscript𝑉𝑠𝑝𝑎𝑟𝑠𝑒V_{sparse}italic_V start_POSTSUBSCRIPT italic_s italic_p italic_a italic_r italic_s italic_e end_POSTSUBSCRIPT, excluding those in 𝒮𝒮\mathcal{S}caligraphic_S.

Our classification is built on [22] but is subtly different and more fine-grained. We are driven by a need to limit the reach of probabilistic arguments, being that we are in the challenging sub-logarithmic degree range. Thus, a strictly smaller set of dense nodes (the ordinary) needs probabilistic slack in our formulation. On the other hand, the easy, difficult, and nice definitions are more inclusive here. The difficult ones are here divided into super-constant number of levels, as opposed to only two types in [22].

The underlying idea is to ensure that every node gets at least one unit of slack, ensuring that it can be colored as part of a d1LC instance. Easy nodes have such slack from the start; difficult ones get it from their special nodes (special nodes are used in several different ways to provide slack); sparse and ordinary nodes get it from probabilistic slack generation; and non-easy nice ones get it from same-coloring a non-edge it contains. The most challenging part of the low-degree regime is the probabilistic part. That has guided our definition, resulting in the ordinary ACs being defined as restrictively as possible and, in fact, much more restrictive than the ordinary ACs in [22].

3.2 Algorithm for ΔΔ\Deltaroman_Δ-coloring

Our ΔΔ\Deltaroman_Δ-coloring algorithm consists of the following five phases.

Algorithm 1 ΔΔ\Deltaroman_Δ-coloring
1:  Compute an ACD (ε=1/172𝜀1172\varepsilon=1/172italic_ε = 1 / 172) and form the ordered partition of the nodes.
2:  Color sparse nodes Vsubscript𝑉V_{*}italic_V start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT and ordinary nodes 𝒪𝒪\mathcal{O}caligraphic_O
3:  Color nice nodes 𝒩𝒩\mathcal{N}caligraphic_N
4:  For increasing 1<11\leq\ell<\infty1 ≤ roman_ℓ < ∞ :  Color difficult nodes 𝒟subscript𝒟\mathcal{D}_{\ell}caligraphic_D start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT in level \ellroman_ℓ
5:  Color difficult nodes in 𝒟subscript𝒟\mathcal{D}_{\infty}caligraphic_D start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT and special nodes in 𝒮𝒮\mathcal{S}caligraphic_S

The remainder of the paper describes these phases in detail. Only Phases 1 and 2 are randomized. Phase 2 is also the most involved part of our algorithm. For ease of presentation, we defer its details when ΔΔ\Deltaroman_Δ is at most logarithmic to Sections 4 and 5. In this section, we present Phase 2 in the case of ΔclognΔ𝑐𝑛\Delta\geq c\log nroman_Δ ≥ italic_c roman_log italic_n for a sufficiently large constant c𝑐citalic_c, where Phase 2 does not require any LLL and which is sufficient to understand how Phase 2 interacts with the remaining phases. The remaining phases are identical in both cases.

3.2.1 Phase 1: Partitioning the Nodes

We first apply Lemma 2.4 to compute an ACD for ε=1/172𝜀1172\varepsilon=1/172italic_ε = 1 / 172 and break the graph into nice ACs, difficult ACs, ordinary ACs, and the remaining nodes in Vsubscript𝑉V_{*}italic_V start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT according to Definition 3.3.

3.2.2 Phase 2: Sparse and Ordinary Nodes (Δlognmuch-greater-thanΔ𝑛\Delta\gg\log nroman_Δ ≫ roman_log italic_n)

In this subsection, we prove the following lemma.

Lemma 3.4.

There exists a polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n-round 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST algorithm that w.h.p. colors the sparse nodes and nodes in ordinary cliques if ΔclognΔ𝑐𝑛\Delta\geq c\log nroman_Δ ≥ italic_c roman_log italic_n for a sufficiently large constant c𝑐citalic_c.

Lemma 3.4 essentially follows from the proof of Lemma 3.5 in [22, arxiv version]. However, as we have changed the definition of ordinary cliques, we spell out the required details.

Slack generation is based on trying a random color for a subset of nodes. Sample a set of nodes and a random color for each of the sampled nodes. Nodes keep the random color if none of their neighbors choose the same color. See Algorithm 3 for a pseudocode. If there are enough non-edges in a node’s neighborhood, then it probabilistically gets significant slack.

Algorithm 2 Phase 2: Coloring Sparse and Ordinary Nodes (when Δlognmuch-greater-thanΔ𝑛\Delta\gg\log nroman_Δ ≫ roman_log italic_n)
1:  Run SlackGeneration on V𝒪subscript𝑉𝒪V_{*}\cup\mathcal{O}italic_V start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT ∪ caligraphic_O
2:  Color the remaining ordinary nodes 𝒪𝒪\mathcal{O}caligraphic_O
3:  Color the remaining sparse nodes Vsubscript𝑉V_{*}italic_V start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT
Algorithm 3 SlackGeneration

Input: SV𝑆𝑉S\subseteq Vitalic_S ⊆ italic_V

1:  Each node in vS𝑣𝑆v\in Sitalic_v ∈ italic_S is active w.p. 1/201201/201 / 20
2:  Each active node v𝑣vitalic_v samples a color rvsubscript𝑟𝑣r_{v}italic_r start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT u.a.r. from [χ]delimited-[]𝜒[\chi][ italic_χ ].
3:  v𝑣vitalic_v keeps the color rvsubscript𝑟𝑣r_{v}italic_r start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT if no neighbor tried the same color.

We also require the following lemma from [22].

Lemma 3.5 ([22]).

Let C𝐶Citalic_C be a non-easy AC, SV𝑆𝑉S\subseteq Vitalic_S ⊆ italic_V be a subset of nodes containing C𝐶Citalic_C, and M𝑀Mitalic_M be an arbitrary matching between C𝐶Citalic_C and N(C)C𝑁𝐶𝐶N(C)\setminus Citalic_N ( italic_C ) ∖ italic_C. Then, after SlackGeneration is run on S𝑆Sitalic_S, C𝐶Citalic_C contains Ω(|M|)Ω𝑀\Omega(|M|)roman_Ω ( | italic_M | ) uncolored nodes with unit-slack in G[S]𝐺delimited-[]𝑆G[S]italic_G [ italic_S ], with probability 1exp(Ω(|M|))1Ω𝑀1-\exp(-\Omega(|M|))1 - roman_exp ( - roman_Ω ( | italic_M | ) ).

There exists a large matching satisfying the hypothesis of Lemma 3.5,

Lemma 3.6.

For each ordinary AC C𝐶Citalic_C, there exists a matching MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT between C𝐶Citalic_C and N(C)C𝑁𝐶𝐶N(C)\setminus Citalic_N ( italic_C ) ∖ italic_C of size 2Δ/52Δ52\Delta/52 roman_Δ / 5.

Proof.

We use the following combinatorial result.

Claim 3.7.

Let B=(Y,U,EB)𝐵𝑌𝑈subscript𝐸𝐵B=(Y,U,E_{B})italic_B = ( italic_Y , italic_U , italic_E start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT ) be a bipartite graph where nodes in Y𝑌Yitalic_Y have degree at least k𝑘kitalic_k and nodes in U𝑈Uitalic_U have degree at most 2k2𝑘2k2 italic_k. There exists a matching of size |Y|/2𝑌2|Y|/2| italic_Y | / 2 in B𝐵Bitalic_B.

Proof.

Let M𝑀Mitalic_M be a maximum matching in B𝐵Bitalic_B and suppose that more than half the nodes in Y𝑌Yitalic_Y are unmatched. Let S𝑆Sitalic_S be the set of nodes reachable from the unmatched nodes YV(M)𝑌𝑉𝑀Y\setminus V(M)italic_Y ∖ italic_V ( italic_M ). Since M𝑀Mitalic_M has no augmenting path, S𝑆Sitalic_S contains no unmatched node of U𝑈Uitalic_U. All of the |YS|k𝑌𝑆𝑘|Y\cap S|\cdot k| italic_Y ∩ italic_S | ⋅ italic_k edges incident on YS𝑌𝑆Y\cap Sitalic_Y ∩ italic_S have their other endpoint in US𝑈𝑆U\cap Sitalic_U ∩ italic_S. By the degree bound on U𝑈Uitalic_U, there are fewer than |US|2k𝑈𝑆2𝑘|U\cap S|2k| italic_U ∩ italic_S | 2 italic_k such edges. Thus, |YS|<2|US|𝑌𝑆2𝑈𝑆|Y\cap S|<2|U\cap S|| italic_Y ∩ italic_S | < 2 | italic_U ∩ italic_S |. Every node in US𝑈𝑆U\cap Sitalic_U ∩ italic_S is matched to a node in YS𝑌𝑆Y\cap Sitalic_Y ∩ italic_S, while all unmatched nodes in Y𝑌Yitalic_Y are in YS𝑌𝑆Y\cap Sitalic_Y ∩ italic_S. Thus, the number of unmatched nodes in Y𝑌Yitalic_Y is at most |YS||US|<|US||M|𝑌𝑆𝑈𝑆𝑈𝑆𝑀|Y\cap S|-|U\cap S|<|U\cap S|\leq|M|| italic_Y ∩ italic_S | - | italic_U ∩ italic_S | < | italic_U ∩ italic_S | ≤ | italic_M |. This is a contradiction, and hence, at least half the nodes in Y𝑌Yitalic_Y are matched. \hfill\blacksquare

As C𝐶Citalic_C is not easy, all its nodes have external degree eCsubscript𝑒𝐶e_{C}italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT, while nodes in N(C)C𝑁𝐶𝐶N(C)\setminus Citalic_N ( italic_C ) ∖ italic_C are by assumption not intrusive neighbors of C𝐶Citalic_C, so they have at most 2eC2subscript𝑒𝐶2e_{C}2 italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT neighbors in C𝐶Citalic_C. 3.7 then implies that there exists a matching between C𝐶Citalic_C and N(C)C𝑁𝐶𝐶N(C)\setminus Citalic_N ( italic_C ) ∖ italic_C of size |C|/2(1ϵ)Δ/22Δ/5𝐶21italic-ϵΔ22Δ5|C|/2\geq(1-\epsilon)\Delta/2\geq 2\Delta/5| italic_C | / 2 ≥ ( 1 - italic_ϵ ) roman_Δ / 2 ≥ 2 roman_Δ / 5. \Box

The properties of Phase 2 are summarized in the following lemma.

Lemma 3.8.

If ΔclognΔ𝑐𝑛\Delta\geq c\log nroman_Δ ≥ italic_c roman_log italic_n for a sufficiently large constant c𝑐citalic_c, the following properties hold w.h.p. after Step 1 of Algorithm 2:

  • (†)

    Each sparse node has unit-slack in G[V]𝐺delimited-[]subscript𝑉G[V_{*}]italic_G [ italic_V start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT ],

  • (††)

    Each ordinary AC has an uncolored unit-slack node in G[V𝒪]𝐺delimited-[]subscript𝑉𝒪G[V_{*}\cup\mathcal{O}]italic_G [ italic_V start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT ∪ caligraphic_O ].

Proof.

We run SlackGeneration on the node set S=V𝒪𝑆superscript𝑉𝒪S=V^{*}\cup\mathcal{O}italic_S = italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∪ caligraphic_O. Nodes with neighbors outside V𝒪superscript𝑉𝒪V^{*}\cup\mathcal{O}italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∪ caligraphic_O have slack while the rest of the graph is stalled. We focus on the remaining nodes. Each sparse node gets the respective slack with probability at least 1exp(Ω(Δ))1ΩΔ1-\exp(\Omega(\Delta))1 - roman_exp ( roman_Ω ( roman_Δ ) ) [19, Lemma 3.1], implying (†). By Lemma 3.6, there is a matching between C𝐶Citalic_C and N(C)C𝑁𝐶𝐶N(C)\setminus Citalic_N ( italic_C ) ∖ italic_C of size 2Δ/52Δ52\Delta/52 roman_Δ / 5. Thus, ()({\dagger}{\dagger})( † † ) holds with probability at least 1exp(Ω(Δ))1ΩΔ1-\exp(-\Omega(\Delta))1 - roman_exp ( - roman_Ω ( roman_Δ ) ), by Lemma 3.5.

Both probabilities become w.h.p. guarantees if ΔclognΔ𝑐𝑛\Delta\geq c\log nroman_Δ ≥ italic_c roman_log italic_n for a sufficiently large constant c𝑐citalic_c. For ΔΔ0ΔsubscriptΔ0\Delta\geq\Delta_{0}roman_Δ ≥ roman_Δ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT for a sufficiently large constant Δ0subscriptΔ0\Delta_{0}roman_Δ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT we obtain an LLL. ∎

Proof of Lemma 3.4.

By Lemma 3.8 w.h.p. all sparse nodes become gray as they have unit slack. Also, the unit-slack node in each ordinary AC becomes gray and all other nodes of the AC become grayish as ordinary ACs induce cliques. This is sufficient to color all nodes with O(1)𝑂1O(1)italic_O ( 1 ) d1LC instances. ∎

Forward pointer: The main difficulty of Phase 2 for smaller values of ΔΔ\Deltaroman_Δ is to mimic the properties of Lemma 3.8. Sections 4 and 5 are devoted to ensuring these properties via several LLLs and d1LC instances that can be solved in a bandwidth-efficient manner.

3.2.3 Phase 3: Nice ACs

We give a simpler treatment than [22]. We want a toehold in each nice AC: a node with permanent or temporary slack. With a toehold, the rest is easy. Namely, ACs have all nodes of internal degree at least (1ε)Δ1𝜀Δ(1-\varepsilon)\Delta( 1 - italic_ε ) roman_Δ, of which none are colored in previous phases. The neighbors of a toehold are gray, and there are at least (1ε/4)Δ1𝜀4Δ(1-\varepsilon/4)\Delta( 1 - italic_ε / 4 ) roman_Δ of them by Lemma 2.4, all uncolored. The remaining nodes in the AC are then grayish, so the AC is graytone.

Nice ACs come in three types, depending on if they contain a special node, a non-edge, or a degree-below-ΔΔ\Deltaroman_Δ node. The first and third types immediately give us a toehold. It remains then to consider nice ACs with a non-edge but with no special node, which we call hollow.

For a hollow AC C𝐶Citalic_C, we identify an arbitrary non-edge (uC,wC)subscript𝑢𝐶subscript𝑤𝐶(u_{C},w_{C})( italic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) and call it the pair for C𝐶Citalic_C. We color the pairs for hollow ACs as a d1LC instance. The two nodes in a pair have at least Δ/2Δ2\Delta/2roman_Δ / 2 common neighbors within C𝐶Citalic_C and any of them can function as a toehold. It remains to argue that we can find a valid coloring of the pairs efficiently.

Lemma 3.9.

The pairs of hollow ACs can be colored in the 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST model in O(log3logn)𝑂superscript3𝑛O(\log^{3}\log n)italic_O ( roman_log start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT roman_log italic_n ) rounds.

Proof.

As the nodes of a hollow C𝐶Citalic_C were uncolored, the only nodes that can conflict with the coloring of the pair are the at most 2εΔΔ/22𝜀ΔΔ22\cdot\varepsilon\Delta\leq\Delta/22 ⋅ italic_ε roman_Δ ≤ roman_Δ / 2 external neighbors. The Δ+1Δ1\Delta+1roman_Δ + 1 colors we have to work with significantly exceed that. Thus, the pairs are deg+1𝑑𝑒𝑔1deg+1italic_d italic_e italic_g + 1-list colorable.

Both nodes of the pair (uC,wC)subscript𝑢𝐶subscript𝑤𝐶(u_{C},w_{C})( italic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) have at least (1ϵ)Δ1italic-ϵΔ(1-\epsilon)\Delta( 1 - italic_ϵ ) roman_Δ neighbors in C𝐶Citalic_C, so they have at least (1ϵ)Δ(|C|(1ϵ)Δ)>(13ϵ)ΔΔ/21italic-ϵΔ𝐶1italic-ϵΔ13italic-ϵΔΔ2(1-\epsilon)\Delta-(|C|-(1-\epsilon)\Delta)>(1-3\epsilon)\Delta\geq\Delta/2( 1 - italic_ϵ ) roman_Δ - ( | italic_C | - ( 1 - italic_ϵ ) roman_Δ ) > ( 1 - 3 italic_ϵ ) roman_Δ ≥ roman_Δ / 2 common neighbors in C𝐶Citalic_C. They provide the bandwidth to transmit to one node all the colors adjacent to the other node. Also, all messages to and from uCsubscript𝑢𝐶u_{C}italic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT vis-a-vis its external neighbors can be forwarded in two rounds. Hence, we can simulate any 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST coloring algorithm on the pairs with O(1)𝑂1O(1)italic_O ( 1 )-factor slowdown; in particular, we can simulate the algorithm from Lemma 2.1. ∎

3.2.4 Phase 4: Difficult ACs in a Non-Maximum Level

By Definition 3.2, the special node sCsubscript𝑠𝐶s_{C}italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT of any difficult AC C𝐶Citalic_C at a level other than Dsubscript𝐷D_{\infty}italic_D start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT is contained in another difficult AC CCsuperscript𝐶𝐶C^{\prime}\neq Citalic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≠ italic_C. The next lemma shows that the level of Csuperscript𝐶C^{\prime}italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT must be strictly larger than the level of C𝐶Citalic_C, which allows us to color C𝐶Citalic_C fast while Csuperscript𝐶C^{\prime}italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT remains uncolored.

Claim 3.10.

For an AC C𝐶Citalic_C with (C)<𝐶\ell(C)<\inftyroman_ℓ ( italic_C ) < ∞, let Csuperscript𝐶C^{\prime}italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT be the difficult AC that contains the special node sCsubscript𝑠𝐶s_{C}italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT. Then we have (C)<(C)𝐶superscript𝐶\ell(C)<\ell(C^{\prime})roman_ℓ ( italic_C ) < roman_ℓ ( italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ).

Proof.

The special node sCsubscript𝑠𝐶s_{C}italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT has external degree of at least 2eC2subscript𝑒𝐶2e_{C}2 italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT as it is connected to at least 2eC2subscript𝑒𝐶2e_{C}2 italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT nodes of C𝐶Citalic_C that do not lie within Csuperscript𝐶C^{\prime}italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. Hence, we obtain that the external degree eCsubscript𝑒superscript𝐶e_{C^{\prime}}italic_e start_POSTSUBSCRIPT italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT in AC Csuperscript𝐶C^{\prime}italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is at least eC2eCsubscript𝑒superscript𝐶2subscript𝑒𝐶e_{C^{\prime}}\geq 2e_{C}italic_e start_POSTSUBSCRIPT italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≥ 2 italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT, so (C)>(C)superscript𝐶𝐶\ell(C^{\prime})>\ell(C)roman_ℓ ( italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) > roman_ℓ ( italic_C ). ∎

We color all ACs of a level in parallel, in increasing order of levels. Due to the previous claim, the special node of an AC is contained in a difficult clique in a larger level or not contained in a difficult clique at all. Hence, the special node is uncolored when the clique is processed. So, when processing some level 1iO(loglogn)1𝑖𝑂𝑛1\leq i\leq O(\log\log n)1 ≤ italic_i ≤ italic_O ( roman_log roman_log italic_n ), we color all nodes in ACs of that level, but we do not color their respective special nodes. Thus, the respective special node provides a toehold for the respective clique.

3.2.5 Phase 5: Difficult ACs in the Maximum Level

The maximum level is processed last and differently from the other levels. By definition, the special node sCsubscript𝑠𝐶s_{C}italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT of an AC in \infty level is not contained in a difficult AC. Also, all nodes in Dsubscript𝐷D_{\infty}italic_D start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT and their special nodes are still uncolored at the beginning of this phase.

The algorithm has four steps: (1) Form pairs of selected non-adjacent nodes, (2) Color the nodes in each pair consistently, (3) Graytone color the remaining nodes of the AC, and (4) Color the special nodes 𝒮𝒮\mathcal{S}caligraphic_S. We explain each step in detail.

First, we form the following pairs. For each special node sCsubscript𝑠𝐶s_{C}italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT that is special for only one AC C𝐶Citalic_C at level \infty: Form a type-1 pair Ts=(sC,uC)subscript𝑇𝑠subscript𝑠𝐶subscript𝑢𝐶T_{s}=(s_{C},u_{C})italic_T start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = ( italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) with a non-neighbor of sCsubscript𝑠𝐶s_{C}italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT in C𝐶Citalic_C. For each special node s𝑠sitalic_s that is special for more than one ACs at level \infty, form a type-2 pair Ts=(w1,w2)subscript𝑇𝑠subscript𝑤1subscript𝑤2T_{s}=(w_{1},w_{2})italic_T start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = ( italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ), where w1subscript𝑤1w_{1}italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and w2subscript𝑤2w_{2}italic_w start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are arbitrary non-adjacent nodes in two of the ACs for which s𝑠sitalic_s is special. Let \mathcal{E}caligraphic_E be the set of the latter special nodes.

Claim 3.11.

The pairs can be properly formed.

Proof.

Type-1: An (uncolored) non-neighbor uCsubscript𝑢𝐶u_{C}italic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT of pCsubscript𝑝𝐶p_{C}italic_p start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT exists as pCsubscript𝑝𝐶p_{C}italic_p start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT can have at most (1ε/2)Δ1𝜀2Δ(1-\varepsilon/2)\Delta( 1 - italic_ε / 2 ) roman_Δ neighbors in C𝐶Citalic_C by Lemma 2.4 (4), but the AC C𝐶Citalic_C has at least (1ε/4)Δ1𝜀4Δ(1-\varepsilon/4)\Delta( 1 - italic_ε / 4 ) roman_Δ vertices.

Type-2: Let C1subscript𝐶1C_{1}italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and C2subscript𝐶2C_{2}italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT be two ACs at level \infty for which s𝑠sitalic_s is special, where e(C1)e(C2)𝑒subscript𝐶1𝑒subscript𝐶2e(C_{1})\leq e(C_{2})italic_e ( italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ≤ italic_e ( italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ). By definition, s𝑠sitalic_s has at least 2e(C1)2𝑒subscript𝐶12e(C_{1})2 italic_e ( italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) (2e(C2)2𝑒subscript𝐶22e(C_{2})2 italic_e ( italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT )) neighbors in C1subscript𝐶1C_{1}italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT (C2subscript𝐶2C_{2}italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT), respectively. Pick w1subscript𝑤1w_{1}italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT to be any neighbor of s𝑠sitalic_s in C1subscript𝐶1C_{1}italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Node w1subscript𝑤1w_{1}italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT has at most e(C1)𝑒subscript𝐶1e(C_{1})italic_e ( italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) neighbors in C1subscript𝐶1C_{1}italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Thus, there are at least 2e(C2)e(C1)>02𝑒subscript𝐶2𝑒subscript𝐶102e(C_{2})-e(C_{1})>02 italic_e ( italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - italic_e ( italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) > 0 nodes in C2subscript𝐶2C_{2}italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT that are neighbors of s𝑠sitalic_s and non-neighbors of w1subscript𝑤1w_{1}italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, and we can pick any such node as w2subscript𝑤2w_{2}italic_w start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. ∎

Lemma 3.12.

Coloring the pairs is a (deg+1)𝑑𝑒𝑔1(deg+1)( italic_d italic_e italic_g + 1 )-list coloring instance that can be solved in polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n rounds in 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST, w.h.p.

Proof.

Type-1 pair T={sC,uC}𝑇subscript𝑠𝐶subscript𝑢𝐶T=\{s_{C},u_{C}\}italic_T = { italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT }, sCCsubscript𝑠𝐶𝐶s_{C}\notin Citalic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ∉ italic_C, uCCsubscript𝑢𝐶𝐶u_{C}\in Citalic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ∈ italic_C: We say that a node conflicts with the pair {sC,uC}subscript𝑠𝐶subscript𝑢𝐶\{s_{C},u_{C}\}{ italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT } if the node is already colored or is contained in an adjacent pair of the same phase. As C𝐶Citalic_C does not contain a special node, uCsubscript𝑢𝐶u_{C}italic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT is the only node of C𝐶Citalic_C participating in the phase and all other nodes of C𝐶Citalic_C are still uncolored. The node uCsubscript𝑢𝐶u_{C}italic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT can only be adjacent to eCsubscript𝑒𝐶e_{C}italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT conflicting nodes as it has external degree at most eCsubscript𝑒𝐶e_{C}italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT. As sCsubscript𝑠𝐶s_{C}italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT has at least 2eC2subscript𝑒𝐶2e_{C}2 italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT neighbors in C𝐶Citalic_C, it can conflict with at most Δ2eCΔ2subscript𝑒𝐶\Delta-2e_{C}roman_Δ - 2 italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT nodes. Thus, the pair conflicts with at most eC+Δ2eC=ΔeCsubscript𝑒𝐶Δ2subscript𝑒𝐶Δsubscript𝑒𝐶e_{C}+\Delta-2e_{C}=\Delta-e_{C}italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT + roman_Δ - 2 italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT = roman_Δ - italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT nodes, which is less than ΔΔ\Deltaroman_Δ, the number of colors initially available. Thus, the problem of coloring such pairs is a (deg+1)𝑑𝑒𝑔1(deg+1)( italic_d italic_e italic_g + 1 )-list coloring problem.

Type-2 pair T={w1,w2}𝑇subscript𝑤1subscript𝑤2T=\{w_{1},w_{2}\}italic_T = { italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT }: Each such pair (w1,w2)subscript𝑤1subscript𝑤2(w_{1},w_{2})( italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) is adjacent to at most e(C1)+e(C2)2ϵΔ𝑒subscript𝐶1𝑒subscript𝐶22italic-ϵΔe(C_{1})+e(C_{2})\leq 2\epsilon\Deltaitalic_e ( italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) + italic_e ( italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ≤ 2 italic_ϵ roman_Δ nodes in other ACs. Further, all nodes in the ACs C1subscript𝐶1C_{1}italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and C2subscript𝐶2C_{2}italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are still uncolored, so both nodes have at least (12ϵ)Δ12italic-ϵΔ(1-2\epsilon)\Delta( 1 - 2 italic_ϵ ) roman_Δ colors in their palette, and each pair is adjacent to at most 2εΔ2𝜀Δ2\varepsilon\Delta2 italic_ε roman_Δ other pairs or already colored neighbors, that is, the palette exceeds the degree.

𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST Implementation. A type-1 pair has at least e(C)𝑒𝐶e(C)italic_e ( italic_C ) common neighbors (the special node sCsubscript𝑠𝐶s_{C}italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT has 2e(C)2𝑒𝐶2e(C)2 italic_e ( italic_C ) neighbors inside the clique by its definition that are all connected to uCsubscript𝑢𝐶u_{C}italic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT), which suffices to communicate the colors and all messages of external neighbors of uCsubscript𝑢𝐶u_{C}italic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT to sCsubscript𝑠𝐶s_{C}italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT (uCsubscript𝑢𝐶u_{C}italic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT has at most eCsubscript𝑒𝐶e_{C}italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT external neighbors). Hence, the coloring can be achieved in 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST.

Let s𝑠sitalic_s be the common special node of a type-2 pair {w1,w2}subscript𝑤1subscript𝑤2\{w_{1},w_{2}\}{ italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } and let C1subscript𝐶1C_{1}italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and C2subscript𝐶2C_{2}italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT be the respective cliques. For i=1,2𝑖12i=1,2italic_i = 1 , 2 the node wisubscript𝑤𝑖w_{i}italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT has at most eCisubscript𝑒subscript𝐶𝑖e_{C_{i}}italic_e start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT outside neighbors and s𝑠sitalic_s has 2eCieCi2subscript𝑒subscript𝐶𝑖subscript𝑒subscript𝐶𝑖2e_{C_{i}}\geq e_{C_{i}}2 italic_e start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≥ italic_e start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT neighbors in Cisubscript𝐶𝑖C_{i}italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, denote these by Xisubscript𝑋𝑖X_{i}italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. We simulate the pair by s𝑠sitalic_s. The node wisubscript𝑤𝑖w_{i}italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT can forward all initial colors of outside neighbors as well as all messages from them to s𝑠sitalic_s by relaying them through Xisubscript𝑋𝑖X_{i}italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. ∎

After coloring the pairs, each difficult AC C𝐶Citalic_C has a node with unit-slack in G[V]𝐺delimited-[]𝑉G[V\setminus\mathcal{E}]italic_G [ italic_V ∖ caligraphic_E ], either because the clique contains an uncolored node with two neighbors appearing in a consistently colored type-1 pair T={sC,uC}𝑇subscript𝑠𝐶subscript𝑢𝐶T=\{s_{C},u_{C}\}italic_T = { italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT }, or because it contains an uncolored node with a neighbor in \mathcal{E}caligraphic_E. In the former case, the uncolored node exists because sCsubscript𝑠𝐶s_{C}italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT has at least one neighbor in C𝐶Citalic_C that is also a neighbor of uCsubscript𝑢𝐶u_{C}italic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT. In the latter case, the special node s𝑠sitalic_s with type-2 pair T={w1,w2}𝑇subscript𝑤1subscript𝑤2T=\{w_{1},w_{2}\}italic_T = { italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } has by definition further neighbors besides w1subscript𝑤1w_{1}italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and w2subscript𝑤2w_{2}italic_w start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT in each clique that are all uncolored.

Thus, we color all nodes in difficult cliques via the graytone property. At the end, we color the nodes in \mathcal{E}caligraphic_E, which have unit-slack as they are adjacent to a type-2222 pair.

3.3 Proof of Theorem 1.1

Proof of Theorem 1.1.

There are five cases, depending on the relation of ΔΔ\Deltaroman_Δ and n𝑛nitalic_n. Generally, we use Lemma 2.1 to solve d1LC instances in polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n rounds. Whenever the d1LC instances require additional arguments to be solved in the respective time, e.g., because they are defined on a virtual graph, we reason their runtime when they are introduced.

  • If Δ=ω(log4n)Δ𝜔superscript4𝑛\Delta=\omega(\log^{4}n)roman_Δ = italic_ω ( roman_log start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_n ), we use the algorithm from [22] to ΔΔ\Deltaroman_Δ-color the graph.

  • For clognΔ=O(log4n)𝑐𝑛Δ𝑂superscript4𝑛c\log n\leq\Delta=O(\log^{4}n)italic_c roman_log italic_n ≤ roman_Δ = italic_O ( roman_log start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_n ) for a sufficiently large constant c𝑐citalic_c, the result follows by executing Algorithm 1 with the arguments of this section. Phases 11113333 only require O(1)𝑂1O(1)italic_O ( 1 ) rounds and a constant number of d1LC instances. In Phase 4, we iterate through the O(logΔ)=O(loglogn)𝑂Δ𝑂𝑛O(\log\Delta)=O(\log\log n)italic_O ( roman_log roman_Δ ) = italic_O ( roman_log roman_log italic_n ) levels and solve a constant number of d1LC instances for each level. Phase 5 can be executed in polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n time by Lemma 3.12.

  • When polyloglognΔclognpoly𝑛Δ𝑐𝑛\operatorname{\text{{\rm poly}}}\log\log n\leq\Delta\leq c\log npoly roman_log roman_log italic_n ≤ roman_Δ ≤ italic_c roman_log italic_n, we use Algorithm 1 from this section and replace Phase 2 with Algorithm 4 (presented in Section 4) whose correctness and runtime we prove in Sections 4 and 5.

  • If Δ0ΔpolyloglognsubscriptΔ0Δpoly𝑛\Delta_{0}\leq\Delta\leq\operatorname{\text{{\rm poly}}}\log\log nroman_Δ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≤ roman_Δ ≤ poly roman_log roman_log italic_n, we use the algorithm of this section together with the LLL representation from the proof of Lemma 3.8. The LLL can be solved with the 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST LLL solver of [40] in polyΔpolyloglogn=polyloglognpolyΔpoly𝑛poly𝑛\operatorname{\text{{\rm poly}}}\Delta\operatorname{\text{{\rm poly}}}\log\log n% =\operatorname{\text{{\rm poly}}}\log\log npoly roman_Δ poly roman_log roman_log italic_n = poly roman_log roman_log italic_n rounds. Here, Δ0subscriptΔ0\Delta_{0}roman_Δ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is a sufficiently large constant such that the LLL guarantees from Lemma 3.8 hold.

  • If 3ΔΔ03ΔsubscriptΔ03\leq\Delta\leq\Delta_{0}3 ≤ roman_Δ ≤ roman_Δ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, that is, for constant ΔΔ\Deltaroman_Δ, there is an existing algorithm from [40].

In all cases, the algorithm runs in polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n rounds. ∎

4 Phase 2 (Δ=O(logn)Δ𝑂𝑛\Delta=O(\log n)roman_Δ = italic_O ( roman_log italic_n )): Sparse Nodes and Ordinary Cliques

In this section, we deal with Phase 2 for the most challenging regime of ΔO(logn)Ω(polyloglogn)Δ𝑂𝑛Ωpoly𝑛\Delta\in O(\log n)\cap\Omega(\operatorname{\text{{\rm poly}}}\log\log n)roman_Δ ∈ italic_O ( roman_log italic_n ) ∩ roman_Ω ( poly roman_log roman_log italic_n ). The following lemma follows from all proofs in this section, together with Lemmas 4.3, 4.5 and 4.6 all proven in Section 5.

Lemma 4.1 (Phase 2).

There exists a polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n-round 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST algorithm that w.h.p. color the sparse nodes and nodes in ordinary cliques if log10lognΔO(logn)superscript10𝑛Δ𝑂𝑛\log^{10}\log n\leq\Delta\leq O(\log n)roman_log start_POSTSUPERSCRIPT 10 end_POSTSUPERSCRIPT roman_log italic_n ≤ roman_Δ ≤ italic_O ( roman_log italic_n ).

We first give high-level ideas of our method. We divide the ordinary cliques into the small, of size at most Δ(11/(10log3logn))Δ1110superscript3𝑛\Delta(1-1/(10\log^{3}\log n))roman_Δ ( 1 - 1 / ( 10 roman_log start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT roman_log italic_n ) ), and the large. Nodes in small ordinary cliques have significant sparsity (i.e., non-edges in their induced neighborhood), which means that the one-round procedure of trying a random color has a good probability of successfully generating slack. The natural LLL formulation of that step is therefore well-behaved enough that it can be solved fast in 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST with a few additional tweaks, see Section 5.2. Large nodes need a different approach.

For each large AC, we produce unit slack for a single node. See Figure 3 for an illustration of the process we will describe. We identify for each such AC a triplet of nodes (x,y,z)𝑥𝑦𝑧(x,y,z)( italic_x , italic_y , italic_z ) with the objective to color x𝑥xitalic_x and z𝑧zitalic_z with the same color, while y𝑦yitalic_y remains uncolored. This way, y𝑦yitalic_y receives unit slack, which gives us a toehold to color the whole AC.

Computing such triplets is non-trivial. We do so by breaking it into three steps, each solvable by a different LLL formulation. In brief, we first compute a set Z𝑍Zitalic_Z of candidate z𝑧zitalic_z-nodes; next partition Z𝑍Zitalic_Z into two sets; and then select the actual z𝑧zitalic_z-nodes to be used from these two sets. The split of Z𝑍Zitalic_Z into two sets is required to make the process of finally finding the z𝑧zitalic_z-nodes fit the LLL solver from Theorem 5.3. The properties of the set Z𝑍Zitalic_Z imply that it is then much easier to identify compatible x𝑥xitalic_x- and y𝑦yitalic_y-nodes, and once we find such triplets, we set up a virtual coloring instance for same-coloring x𝑥xitalic_x- and z𝑧zitalic_z-nodes in each triple. We show that this instance is d1LC and can be solved with low bandwidth despite being defined on a virtual graph. This provides a slack-toehold to the y𝑦yitalic_y-node of each triple and the coloring can be extended via d1LC instances to the whole instance.

Algorithm.

The first step of the algorithm is to compute a large matching MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT between each ordinary clique C𝐶Citalic_C and N(C)C𝑁𝐶𝐶N(C)\setminus Citalic_N ( italic_C ) ∖ italic_C in parallel. We then classify the ordinary cliques as follows. Fix the parameter q(n)=10log3logn𝑞𝑛10superscript3𝑛q(n)=10\log^{3}\log nitalic_q ( italic_n ) = 10 roman_log start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT roman_log italic_n throughout this section.

Definition 4.2 (Small, Large, Unimportant and Important Ordinary cliques.).

An ordinary AC is large if it contains more than ΔΔ/q(n)ΔΔ𝑞𝑛\Delta-\Delta/q(n)roman_Δ - roman_Δ / italic_q ( italic_n ) nodes, and small otherwise. A large AC is important if |(V(MC)C)𝒪l|Δ/12𝑉subscript𝑀𝐶𝐶subscript𝒪𝑙Δ12|(V(M_{C})\setminus C)\cap\mathcal{O}_{l}|\geq\Delta/12| ( italic_V ( italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) ∖ italic_C ) ∩ caligraphic_O start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT | ≥ roman_Δ / 12, and unimportant otherwise.

We say that a node is small/large/important/unimportant if it belongs to an AC of the corresponding type. Let 𝒪isubscript𝒪𝑖\mathcal{O}_{i}caligraphic_O start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, 𝒪usubscript𝒪𝑢\mathcal{O}_{u}caligraphic_O start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT, 𝒪l=𝒪i𝒪usubscript𝒪𝑙subscript𝒪𝑖subscript𝒪𝑢\mathcal{O}_{l}=\mathcal{O}_{i}\cup\mathcal{O}_{u}caligraphic_O start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT = caligraphic_O start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∪ caligraphic_O start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT, and 𝒪ssubscript𝒪𝑠\mathcal{O}_{s}caligraphic_O start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT be the set of important, unimportant, large, and small nodes, respectively.

Next, we present our full solution. The algorithm has the following steps, which are explained in detail below.

Algorithm 4 Phase 2: Coloring Sparse and Ordinary Nodes (Δ=O(logn)Δ𝑂𝑛\Delta=O(\log n)roman_Δ = italic_O ( roman_log italic_n ))
1:  Step 0: For each ordinary AC C𝐶Citalic_C in parallel, compute a matching MCC×(N(C)C)subscript𝑀𝐶𝐶𝑁𝐶𝐶M_{C}\subseteq C\times(N(C)\setminus C)italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ⊆ italic_C × ( italic_N ( italic_C ) ∖ italic_C ). Classify ordinary ACs into important, unimportant, and small ACs.
2:  Step 1: Generate slack for sparse and small nodes (via LLL, Section 5.2)
3:  Step 2: Compute candidate sets Z=Z1Z2𝒪l𝑍subscript𝑍1subscript𝑍2subscript𝒪𝑙Z=Z_{1}\cup Z_{2}\subseteq\mathcal{O}_{l}italic_Z = italic_Z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∪ italic_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⊆ caligraphic_O start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT (via LLL, Section 5.3)
4:  Step 3: Form triples (xC,yC,zC)C×C×Zsubscript𝑥𝐶subscript𝑦𝐶subscript𝑧𝐶𝐶𝐶𝑍(x_{C},y_{C},z_{C})\in C\times C\times Z( italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) ∈ italic_C × italic_C × italic_Z (via LLL, Section 5.4)
5:  Step 4: Same-color (x,z)𝑥𝑧(x,z)( italic_x , italic_z )-pairs via virtual coloring instance
6:  Step 5: Color the remainder of V𝒪superscript𝑉𝒪V^{*}\cup\mathcal{O}italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∪ caligraphic_O (via d1LC instances).

Step 0: Classifying ACs and computing matchings.

We compute a matching MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT for each ordinary clique C𝐶Citalic_C between the vertices in C𝐶Citalic_C and the ones in N(C)C𝑁𝐶𝐶N(C)\setminus Citalic_N ( italic_C ) ∖ italic_C. We use a 2.5-approximate algorithm of [23] running in O(log2Δ+logn)=O(log2logn)𝑂superscript2Δsuperscript𝑛𝑂superscript2𝑛O(\log^{2}\Delta+\log^{*}n)=O(\log^{2}\log n)italic_O ( roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Δ + roman_log start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT italic_n ) = italic_O ( roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_log italic_n ) rounds, obtaining that |MC|(2Δ/5)/2.5=Δ/10subscript𝑀𝐶2Δ52.5Δ10|M_{C}|\geq(2\Delta/5)/2.5=\Delta/10| italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT | ≥ ( 2 roman_Δ / 5 ) / 2.5 = roman_Δ / 10, using Lemma 3.6.

We view the edges of MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT as being directed arcs with a head in C𝐶Citalic_C and tail in VC𝑉𝐶V\setminus Citalic_V ∖ italic_C. Each AC can determine its size and the size of V(MC)𝒪l𝑉subscript𝑀𝐶subscript𝒪𝑙V(M_{C})\cap\mathcal{O}_{l}italic_V ( italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) ∩ caligraphic_O start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT in O(1)𝑂1O(1)italic_O ( 1 ) rounds and hence the classification of Definition 4.2 can be computed in O(1)𝑂1O(1)italic_O ( 1 ) rounds.

Step 1: Slack for sparse and small nodes.

In this step, we create slack for sparse nodes and all nodes in 𝒪ssubscript𝒪𝑠\mathcal{O}_{s}caligraphic_O start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT. The key property of small nodes is that they are relatively sparse (with many non-edges in their neighborhoods), so randomly trying colors is likely to produce slack. That leads to an LLL formulation that we can make simulatable and can therefore implement in 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST.

The properties are summarized by the following lemma. Besides providing slack to all sparse nodes and the nodes in small ordinary ACs, it also guarantees that each neighborhood (and hence also each AC) does not have too many nodes colored and that the matching MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT of each AC does not get too many nodes colored. The proof is in Section 5.2.

Lemma 4.3.

Assume that we are given a matching MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT of size at least Δ/10Δ10\Delta/10roman_Δ / 10 between C𝐶Citalic_C and N(C)C𝑁𝐶𝐶N(C)\setminus Citalic_N ( italic_C ) ∖ italic_C for each ordinary AC C𝐶Citalic_C. There is a polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n-round (LLL-based) 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST algorithm that w.h.p. colors a subset SV𝒪𝑆superscript𝑉𝒪S\subseteq V^{*}\cup\mathcal{O}italic_S ⊆ italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∪ caligraphic_O and ensures that:

  1. 1.

    Each uncolored node in V𝒪ssuperscript𝑉subscript𝒪𝑠V^{*}\cup\mathcal{O}_{s}italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∪ caligraphic_O start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT has unit-slack in G[V𝒪]𝐺delimited-[]superscript𝑉𝒪G[V^{*}\cup\mathcal{O}]italic_G [ italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∪ caligraphic_O ].

  2. 2.

    In each of the following subsets, at most O(log4lognlogΔ)𝑂superscript4𝑛ΔO(\log^{4}\log n\cdot\log\Delta)italic_O ( roman_log start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT roman_log italic_n ⋅ roman_log roman_Δ ) nodes are colored: N(v)𝑁𝑣N(v)italic_N ( italic_v ) for each vV𝒪𝑣superscript𝑉𝒪v\in V^{*}\cup\mathcal{O}italic_v ∈ italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∪ caligraphic_O and V(MC)𝑉subscript𝑀𝐶V(M_{C})italic_V ( italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) for each AC C𝐶Citalic_C .

Step 2: Compute triple candidate set via LLL.

Let X=𝒪l{v𝒪l:v colored in Step 1}𝑋subscript𝒪𝑙conditional-set𝑣subscript𝒪𝑙v colored in Step 1X=\mathcal{O}_{l}\setminus\{v\in\mathcal{O}_{l}:\text{$v$ colored in Step~{}1}\}italic_X = caligraphic_O start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ∖ { italic_v ∈ caligraphic_O start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT : italic_v colored in Step 1 }.

The goal of this step is to compute two disjoint sets Z1,Z2subscript𝑍1subscript𝑍2Z_{1},Z_{2}italic_Z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT of uncolored nodes such that each important AC has sufficiently many matching edges satisfying the following definition of usefulness.

Definition 4.4 (useful edge).

Given a subset ZX𝑍𝑋Z\subseteq Xitalic_Z ⊆ italic_X and important AC C𝐶Citalic_C, a matched arc vuMC𝑣𝑢subscript𝑀𝐶\overrightarrow{vu}\in M_{C}over→ start_ARG italic_v italic_u end_ARG ∈ italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT is useful for C𝐶Citalic_C if v(XZ)𝑣𝑋𝑍v\in(X\setminus Z)italic_v ∈ ( italic_X ∖ italic_Z ) and uZ𝑢𝑍u\in Zitalic_u ∈ italic_Z. Refer to Z𝑍Zitalic_Z as the 𝖻𝗅𝖺𝖼𝗄𝖻𝗅𝖺𝖼𝗄\mathsf{black}sansserif_black nodes and XZ𝑋𝑍X\setminus Zitalic_X ∖ italic_Z as the 𝗐𝗁𝗂𝗍𝖾𝗐𝗁𝗂𝗍𝖾\mathsf{white}sansserif_white nodes. An edge is 𝗐𝗁𝗂𝗍𝖾𝗐𝗁𝗂𝗍𝖾\mathsf{white}sansserif_white if both endpoints are 𝗐𝗁𝗂𝗍𝖾𝗐𝗁𝗂𝗍𝖾\mathsf{white}sansserif_white.

An arc vz𝑣𝑧\overrightarrow{vz}over→ start_ARG italic_v italic_z end_ARG cannot be useful for the AC containing v𝑣vitalic_v; only the one containing z𝑧zitalic_z.

Formally, Step 2 provides the following lemma that we prove in Section 5.3. For an AC C𝐶Citalic_C and set Z𝑍Zitalic_Z, let U(C,Z)𝑈𝐶𝑍U(C,Z)italic_U ( italic_C , italic_Z ) denote the arcs of MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT with one endpoint in Z𝑍Zitalic_Z (and the other in C𝐶Citalic_C).

Lemma 4.5.

Let q=1/30𝑞130q=1/30italic_q = 1 / 30. There is a polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n-round (LLL-based) 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST algorithm computing disjoint subsets Z1,Z2𝒪lsubscript𝑍1subscript𝑍2subscript𝒪𝑙Z_{1},Z_{2}\subseteq\mathcal{O}_{l}italic_Z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⊆ caligraphic_O start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT satisfying the following properties, w.h.p.:

  1. 1.

    |U(C,Zi)|q2(1q)3Δ/60𝑈𝐶subscript𝑍𝑖superscript𝑞2superscript1𝑞3Δ60|U(C,Z_{i})|\geq q^{2}(1-q)^{3}\Delta/60| italic_U ( italic_C , italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) | ≥ italic_q start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( 1 - italic_q ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT roman_Δ / 60, for i=1,2𝑖12i=1,2italic_i = 1 , 2 and for each important AC C𝐶Citalic_C, and

  2. 2.

    |(Z1Z2)N(v)|Δ/10subscript𝑍1subscript𝑍2𝑁𝑣Δ10|(Z_{1}\cup Z_{2})\cap N(v)|\leq\Delta/10| ( italic_Z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∪ italic_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ∩ italic_N ( italic_v ) | ≤ roman_Δ / 10, for all v𝒪𝑣𝒪v\in\mathcal{O}italic_v ∈ caligraphic_O.

Step 3: Forming triples via LLL.

The goal of this step is to compute a triple (xC,yC,zC)C×C×Zsubscript𝑥𝐶subscript𝑦𝐶subscript𝑧𝐶𝐶𝐶𝑍(x_{C},y_{C},z_{C})\in C\times C\times Z( italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) ∈ italic_C × italic_C × italic_Z of nodes that satisfy the conditions of the next lemma. These triple nodes are distinct for different ACs.

Lemma 4.6.

Given sets Z1,Z2𝒪lsubscript𝑍1subscript𝑍2subscript𝒪𝑙Z_{1},Z_{2}\subseteq\mathcal{O}_{l}italic_Z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⊆ caligraphic_O start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT with the properties as in Lemma 4.5, there is a polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n-round (LLL-based) 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST algorithm that computes for each large important AC C𝐶Citalic_C a triple (xC,yC,zC)subscript𝑥𝐶subscript𝑦𝐶subscript𝑧𝐶(x_{C},y_{C},z_{C})( italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) of uncolored nodes such that w.h.p.:

  1. 1.

    xC,yCCsubscript𝑥𝐶subscript𝑦𝐶𝐶x_{C},y_{C}\in Citalic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ∈ italic_C and zCCsubscript𝑧𝐶𝐶z_{C}\notin Citalic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ∉ italic_C,

  2. 2.

    yCxCsubscript𝑦𝐶subscript𝑥𝐶y_{C}x_{C}italic_y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT, yCzCEsubscript𝑦𝐶subscript𝑧𝐶𝐸y_{C}z_{C}\in Eitalic_y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ∈ italic_E, xCzCEsubscript𝑥𝐶subscript𝑧𝐶𝐸x_{C}z_{C}\not\in Eitalic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ∉ italic_E (xCsubscript𝑥𝐶x_{C}italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT and zCsubscript𝑧𝐶z_{C}italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT are non-adjacent; ycsubscript𝑦𝑐y_{c}italic_y start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT is adjacent to both xCsubscript𝑥𝐶x_{C}italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT and zCsubscript𝑧𝐶z_{C}italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT) and

  3. 3.

    the graph induced by {zC:C is important}conditional-setsubscript𝑧𝐶𝐶 is important\{z_{C}:C\text{ is important}\}{ italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT : italic_C is important } has maximum degree Δ/10absentΔ10\leq\Delta/10≤ roman_Δ / 10.

We model the problem of selecting zCsubscript𝑧𝐶z_{C}italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT for each important AC C𝐶Citalic_C as a disjoint variable set LLL. The proof of the lemma is given in Section 5.4.

Step 4: Same-coloring (xC,zC)subscript𝑥𝐶subscript𝑧𝐶(x_{C},z_{C})( italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) pairs.

Given a triple (xC,yC,zC)x_{C},y_{C},z_{C})italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ), we will create a toehold for the AC C𝐶Citalic_C at yCsubscript𝑦𝐶y_{C}italic_y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT by coloring its non-adjacent neighbors xCsubscript𝑥𝐶x_{C}italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT and zCsubscript𝑧𝐶z_{C}italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT with the same color.

Let HPsubscript𝐻𝑃H_{P}italic_H start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT (P𝑃Pitalic_P for pair) be the virtual graph consisting of one vertex for each pair (sC,zC)subscript𝑠𝐶subscript𝑧𝐶(s_{C},z_{C})( italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) and an edge between two pairs (sC,zC)subscript𝑠𝐶subscript𝑧𝐶(s_{C},z_{C})( italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) and (sC,zC)subscript𝑠superscript𝐶subscript𝑧superscript𝐶(s_{C^{\prime}},z_{C^{\prime}})( italic_s start_POSTSUBSCRIPT italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) if there is any edge in G𝐺Gitalic_G between {sC,zC}subscript𝑠𝐶subscript𝑧𝐶\{s_{C},z_{C}\}{ italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT } and {sC,zC}subscript𝑠superscript𝐶subscript𝑧superscript𝐶\{s_{C^{\prime}},z_{C^{\prime}}\}{ italic_s start_POSTSUBSCRIPT italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT }. The list of available colors L((sC,zC))𝐿subscript𝑠𝐶subscript𝑧𝐶L((s_{C},z_{C}))italic_L ( ( italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) ) consists of all colors that are not used by the already colored neighbors in G𝐺Gitalic_G of sCsubscript𝑠𝐶s_{C}italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT and zCsubscript𝑧𝐶z_{C}italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT.

Lemma 4.7.

The maximum degree ΔHPsubscriptΔsubscript𝐻𝑃\Delta_{H_{P}}roman_Δ start_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT end_POSTSUBSCRIPT of HPsubscript𝐻𝑃H_{P}italic_H start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT is upper bounded by Δ/9Δ9\Delta/9roman_Δ / 9.

Proof.

By Lemma 4.6, each node has at most Δ/10Δ10\Delta/10roman_Δ / 10 neighbors in Z𝑍Zitalic_Z. Define the set X={xC:C is an important AC}superscript𝑋conditional-setsubscript𝑥𝐶𝐶 is an important ACX^{\prime}=\{x_{C}:C\text{ is an important AC}\}italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = { italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT : italic_C is an important AC }. As Xsuperscript𝑋X^{\prime}italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT contains at most one node per AC, the number of neighbors that a node in 𝒪lsubscript𝒪𝑙\mathcal{O}_{l}caligraphic_O start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT can have in U𝑈Uitalic_U is upper bounded by its external degree plus 1111, which is upper bounded by Δ/q(n)+1Δ𝑞𝑛1\Delta/q(n)+1roman_Δ / italic_q ( italic_n ) + 1. Thus, the maximum degree ΔHPsubscriptΔsubscript𝐻𝑃\Delta_{H_{P}}roman_Δ start_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT end_POSTSUBSCRIPT of the virtual graph HPsubscript𝐻𝑃H_{P}italic_H start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT is at most Δ/10+Δ/q(n)+1Δ/9Δ10Δ𝑞𝑛1Δ9\Delta/10+\Delta/q(n)+1\leq\Delta/9roman_Δ / 10 + roman_Δ / italic_q ( italic_n ) + 1 ≤ roman_Δ / 9 for sufficiently large n𝑛nitalic_n. ∎

Lemma 4.8.

Coloring HPsubscript𝐻𝑃H_{P}italic_H start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT – i.e., same-coloring the pairs – is a deg+1𝑑𝑒𝑔1deg+1italic_d italic_e italic_g + 1-list coloring instance.

Proof.

By Lemma 4.7 we obtain ΔHPΔ/9subscriptΔsubscript𝐻𝑃Δ9\Delta_{H_{P}}\leq\Delta/9roman_Δ start_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≤ roman_Δ / 9. As we colored at most x=O(log5logn)𝑥𝑂superscript5𝑛x=O(\log^{5}\log n)italic_x = italic_O ( roman_log start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT roman_log italic_n ) vertices in each neighborhood in Step 1, the list of available colors of each pair has at least Δ2xΔ/9=ΔHPmuch-greater-thanΔ2𝑥Δ9subscriptΔsubscript𝐻𝑃\Delta-2x\gg\Delta/9=\Delta_{H_{P}}roman_Δ - 2 italic_x ≫ roman_Δ / 9 = roman_Δ start_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT end_POSTSUBSCRIPT colors available in their joint list. Hence, we obtain a deg+1𝑑𝑒𝑔1deg+1italic_d italic_e italic_g + 1-list coloring instance. ∎

𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST implementation. Our algorithm is based on the deg+1𝑑𝑒𝑔1deg+1italic_d italic_e italic_g + 1-list coloring algorithm from [25, 7]. Before we show how to color the nodes in HPsubscript𝐻𝑃H_{P}italic_H start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT, we need to define a slow (it takes O(logn)𝑂𝑛O(\log n)italic_O ( roman_log italic_n ) rounds) randomized algorithm. The algorithm is used in our analysis and it works as follows. In each iteration, each uncolored pair executes the following procedure that may result in the pair to try to get colored with a color or to not try a color (also see Algorithm 5 for pseudocode of the algorithm). Throughout the algorithm, nodes xCsubscript𝑥𝐶x_{C}italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT and zCsubscript𝑧𝐶z_{C}italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT maintain lists L(xC)𝐿subscript𝑥𝐶L(x_{C})italic_L ( italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) and L(zC)𝐿subscript𝑧𝐶L(z_{C})italic_L ( italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) consisting of all colors not used by their respective neighbors in G𝐺Gitalic_G. Then, in one iteration node xCsubscript𝑥𝐶x_{C}italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT selects a color c𝑐citalic_c u.a.r.  from its list of available colors L(xC)𝐿subscript𝑥𝐶L(x_{C})italic_L ( italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ), and sends it to the other endpoint through node yCsubscript𝑦𝐶y_{C}italic_y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT. The other endpoint zCsubscript𝑧𝐶z_{C}italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT checks whether cL(zC)𝑐𝐿subscript𝑧𝐶c\in L(z_{C})italic_c ∈ italic_L ( italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ); if so, both nodes agree on trying color c𝑐citalic_c, and the color is sent to their neighbors. If no incident pair tries the same color, the pair gets permanently colored with the color. Lastly, both nodes individually update their lists by removing colors from adjacent vertices that got colored from their respective list. There is no explicit coordination between the two vertices in maintaining a joint list of available colors.

Algorithm 5 Randomized Pair Coloring
1:  Each node xCsubscript𝑥𝐶x_{C}italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT selects a color c𝑐citalic_c u.a.r.  from L(xC)𝐿subscript𝑥𝐶L(x_{C})italic_L ( italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) and sends c𝑐citalic_c to zCsubscript𝑧𝐶z_{C}italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT
2:  If cL(zC)𝑐𝐿subscript𝑧𝐶c\in L(z_{C})italic_c ∈ italic_L ( italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) then TryColor(c)
3:  Update lists L(xC)L(xC){c(v):vNG(xC)}𝐿subscript𝑥𝐶𝐿subscript𝑥𝐶conditional-set𝑐𝑣𝑣subscript𝑁𝐺subscript𝑥𝐶L(x_{C})\leftarrow L(x_{C})\setminus\{c(v):v\in N_{G}(x_{C})\}italic_L ( italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) ← italic_L ( italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) ∖ { italic_c ( italic_v ) : italic_v ∈ italic_N start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) } and L(zC)L(zC){c(v):vNG(zC)}𝐿subscript𝑧𝐶𝐿subscript𝑧𝐶conditional-set𝑐𝑣𝑣subscript𝑁𝐺subscript𝑧𝐶L(z_{C})\leftarrow L(z_{C})\setminus\{c(v):v\in N_{G}(z_{C})\}italic_L ( italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) ← italic_L ( italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) ∖ { italic_c ( italic_v ) : italic_v ∈ italic_N start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) }

The next lemma shows that each pair gets colored with constant probability.

Lemma 4.9.

Consider an arbitrary iteration of Algorithm 5 and an arbitrary pair (xC,zC)subscript𝑥𝐶subscript𝑧𝐶(x_{C},z_{C})( italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) for a hiding AC C𝐶Citalic_C that is uncolored at the start of the iteration. Then, we have

Pr((xC,zC) gets colored in the iteration)1/2.Prsubscript𝑥𝐶subscript𝑧𝐶 gets colored in the iteration12\displaystyle\Pr((x_{C},z_{C})\text{ gets colored in the iteration})\geq 1/2~{}.roman_Pr ( ( italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) gets colored in the iteration ) ≥ 1 / 2 . (2)

The bound on the probability holds regardless of the outcome of previous iterations.

Proof.

Note 444The constants in this proof are not chosen optimally in order to improve readability. that throughout the execution of Algorithm 5 the respective lists of nodes xCsubscript𝑥𝐶x_{C}italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT and zCsubscript𝑧𝐶z_{C}italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT are always of size at least ΔΔHPΩ(log5logn)4Δ/5ΔsubscriptΔsubscript𝐻𝑃Ωsuperscript5𝑛4Δ5\Delta-\Delta_{H_{P}}-\Omega(\log^{5}\log n)\geq 4\Delta/5roman_Δ - roman_Δ start_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT end_POSTSUBSCRIPT - roman_Ω ( roman_log start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT roman_log italic_n ) ≥ 4 roman_Δ / 5 as Δ=ω(log5logn)Δ𝜔superscript5𝑛\Delta=\omega(\log^{5}\log n)roman_Δ = italic_ω ( roman_log start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT roman_log italic_n ) and ΔHPΔ/9subscriptΔsubscript𝐻𝑃Δ9\Delta_{H_{P}}\leq\Delta/9roman_Δ start_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≤ roman_Δ / 9, by Lemma 4.7. Note, that both nodes keep their individual list of available colors in which they only remove the colors of immediate neighbors in G𝐺Gitalic_G from the list of available colors. Thus, at all times we have |L(xC)|L(zC)|3Δ/5|L(x_{C})|\cap L(z_{C})|\geq 3\Delta/5| italic_L ( italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) | ∩ italic_L ( italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) | ≥ 3 roman_Δ / 5. Let X𝑋Xitalic_X be the set of colors tried by one of the ΔHPΔ/9subscriptΔsubscript𝐻𝑃Δ9\Delta_{H_{P}}\leq\Delta/9roman_Δ start_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≤ roman_Δ / 9 pairs incident to (sC,zC)subscript𝑠𝐶subscript𝑧𝐶(s_{C},z_{C})( italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) in the current iteration. We obtain |(L(sC)L(zC))X|Δ/2𝐿subscript𝑠𝐶𝐿subscript𝑧𝐶𝑋Δ2|(L(s_{C})\cap L(z_{C}))\setminus X|\geq\Delta/2| ( italic_L ( italic_s start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) ∩ italic_L ( italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) ) ∖ italic_X | ≥ roman_Δ / 2. As these colors are at least half of L(xC)𝐿subscript𝑥𝐶L(x_{C})italic_L ( italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT )’s palette, the probability that the pair (xC,zC)subscript𝑥𝐶subscript𝑧𝐶(x_{C},z_{C})( italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) gets colored is at least 1/2121/21 / 2. ∎

Lemma 4.10.

There is a randomized polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n-round 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST algorithm that w.h.p. colors the pairs of HPsubscript𝐻𝑃H_{P}italic_H start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT.

Proof.

Consider the well-understood color trial algorithm in which nodes repeatedly try a color from their list of available colors, keep their color permanently if no neighbor tries the same color, and remove colors of permanently colored neighbors from their list of available colors. It is known that this algorithm colors each node with a constant probability in each iteration [7, 36]. Thus, it requires O(logn)𝑂𝑛O(\log n)italic_O ( roman_log italic_n ) rounds to color all vertices of a graph. The shattering-based 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST algorithm from [25] for d1LC runs in polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n rounds. It requires three subroutines: a) A color trial algorithm like the one from [7, 36], b) a network decomposition algorithm that can run on small subgraphs (the ones in [43, 40, 38] do the job), and c) the possibility to run O(logn)𝑂𝑛O(\log n)italic_O ( roman_log italic_n ) instances of the color trial algorithm in parallel. In our setting we want to solve the same problem, but on the virtual graph Hpsubscript𝐻𝑝H_{p}italic_H start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT while the communication network is still the original graph G𝐺Gitalic_G. The subroutine for part b) can be taken from prior work as the same issue is dealt with formally in [40, 38, 32]. We refer to these works for the details and also the definition of a network decomposition. Let us sketch the main ingredient for the informed reader. Instead of computing a network decomposition of small subgraphs of HPsubscript𝐻𝑃H_{P}italic_H start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT, the subgraphs are first projected to G𝐺Gitalic_G, and a network decomposition of G𝐺Gitalic_G is computed afterwards. This only requires an increased distance between clusters such that the preimage of the decomposition induces a proper network decomposition of HPsubscript𝐻𝑃H_{P}italic_H start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT.

For ingredients a) and c), we observe that Ghaffari’s algorithm only requires the following properties for the color trial algorithm: i) one iteration can be executed in constant time and with polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n bandwidth, allowing to execute O(logn)𝑂𝑛O(\log n)italic_O ( roman_log italic_n ) instances in parallel in the 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST model, and ii) each node gets colored with a constant probability in each iteration. Thus, we can replace the color trial algorithm with the color trial algorithm for HPsubscript𝐻𝑃H_{P}italic_H start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT given in Algorithm 5. We have already argued that it can be implemented with polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n bandwidth showing i)i)italic_i ) and Lemma 4.9 provides its constant success probability for ii). ∎

Step 5: Completing the coloring.

To finish the coloring, we first color the unimportant nodes and then the important, small, and sparse nodes.

Lemma 4.11.

Unimportant nodes are graytone as long as the other ordinary nodes (small, sparse, important) are inactive.

Proof.

The only steps so far in which we colored vertices are Steps 1 and 4. In Step 1 we color at most O(log5logn)𝑂superscript5𝑛O(\log^{5}\log n)italic_O ( roman_log start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT roman_log italic_n ) vertices per AC and per matching MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT of each ordinary AC C𝐶Citalic_C. In Step 4 we only color (a subset of) the vertices in Z𝑍Zitalic_Z and one vertex per important AC (the vertex xCsubscript𝑥𝐶x_{C}italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT for AC C𝐶Citalic_C). As |ZC|Δ/10𝑍𝐶Δ10|Z\cap C|\leq\Delta/10| italic_Z ∩ italic_C | ≤ roman_Δ / 10, we color at most Δ/10+O(log5logn)Δ/9Δ10𝑂superscript5𝑛Δ9\Delta/10+O(\log^{5}\log n)\leq\Delta/9roman_Δ / 10 + italic_O ( roman_log start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT roman_log italic_n ) ≤ roman_Δ / 9 vertices in each unimportant AC.

Fix some unimportant AC C𝐶Citalic_C. Recall that the algorithm of [23] finds a 2.5-approximate matching, which by Lemma 3.6 implies that |MC|Δ/10subscript𝑀𝐶Δ10|M_{C}|\geq\Delta/10| italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT | ≥ roman_Δ / 10. As an unimportant AC has fewer than Δ/12Δ12\Delta/12roman_Δ / 12 nodes in (V(MC)C)𝒪l𝑉subscript𝑀𝐶𝐶subscript𝒪𝑙(V(M_{C})\setminus C)\cap\mathcal{O}_{l}( italic_V ( italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) ∖ italic_C ) ∩ caligraphic_O start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT, we obtain that V(MC)C𝑉subscript𝑀𝐶𝐶V(M_{C})\setminus Citalic_V ( italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) ∖ italic_C contains at least Δ/10Δ/12=7Δ/60Δ10Δ127Δ60\Delta/10-\Delta/12=7\Delta/60roman_Δ / 10 - roman_Δ / 12 = 7 roman_Δ / 60 nodes that are not contained in 𝒪lsubscript𝒪𝑙\mathcal{O}_{l}caligraphic_O start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT. By Lemma 4.3, at most O(log5logn)𝑂superscript5𝑛O(\log^{5}\log n)italic_O ( roman_log start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT roman_log italic_n ) of these get colored in Step 1; denote the uncolored nodes of these by S𝑆Sitalic_S and let S=N(S)Csuperscript𝑆𝑁𝑆𝐶S^{\prime}=N(S)\cap Citalic_S start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_N ( italic_S ) ∩ italic_C. By the earlier argument, at most Δ/9Δ9\Delta/9roman_Δ / 9 nodes of Ssuperscript𝑆S^{\prime}italic_S start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT are already colored, that is, there exists some vS𝑣superscript𝑆v\in S^{\prime}italic_v ∈ italic_S start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT that is still uncolored and has an uncolored neighbor u𝒪l𝑢subscript𝒪𝑙u\notin\mathcal{O}_{l}italic_u ∉ caligraphic_O start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT. As u𝑢uitalic_u is stalled to be colored later, v𝑣vitalic_v is gray and other nodes of the AC are grayish. ∎

Lemma 4.12.

Small, sparse, and important nodes are graytone.

Proof.

By Lemma 4.3, each small or sparse node has slack in G[V𝒪]𝐺delimited-[]superscript𝑉𝒪G[V^{*}\cup\mathcal{O}]italic_G [ italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∪ caligraphic_O ] and is therefore gray (and stays gray until colored).

For an important AC C𝐶Citalic_C with triple (xC,yC,zC)subscript𝑥𝐶subscript𝑦𝐶subscript𝑧𝐶(x_{C},y_{C},z_{C})( italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ), the node yCsubscript𝑦𝐶y_{C}italic_y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT is gray as xCsubscript𝑥𝐶x_{C}italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT and zCsubscript𝑧𝐶z_{C}italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT are same-colored. Hence, the remaining uncolored nodes of C𝐶Citalic_C are either already colored or graytone as they are adjacent to v𝑣vitalic_v. ∎

5 Solving Subproblems of Phase 2 via LLL

We show how the probabilistic subproblems of Section 4 can be solved via a fast LLL algorithm. We show for all four problems that they can be captured with the 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST framework of [32]. We start by reviewing the framework of [32] and then solve each of the subproblems in respective subsections.

5.1 Framework for LLL in CONGEST

In this section, we present 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST model LLL solvers from [32]. The definitions, theorems, and selected textual excerpts in this section have been sourced from [32].

Constructive Lovász Local Lemma (LLL).

An instance =(𝒱,)𝒱\mathcal{L}=(\mathcal{V},\mathcal{B})caligraphic_L = ( caligraphic_V , caligraphic_B ) of the distributed Lovász local lemma (LLL) is given by a a set 𝒱={x1,,xk𝒱}𝒱subscript𝑥1subscript𝑥subscript𝑘𝒱\mathcal{V}=\{x_{1},\ldots,x_{k_{\mathcal{V}}}\}caligraphic_V = { italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT caligraphic_V end_POSTSUBSCRIPT end_POSTSUBSCRIPT } of independent random variables and a family \mathcal{B}caligraphic_B of ”bad” events {1,,k}subscript1subscriptsubscript𝑘\{\mathcal{E}_{1},\ldots,\mathcal{E}_{k_{\mathcal{B}}}\}{ caligraphic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , caligraphic_E start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT caligraphic_B end_POSTSUBSCRIPT end_POSTSUBSCRIPT } over these variables. Let vbl()vbl\textsf{vbl}(\mathcal{E})vbl ( caligraphic_E ) denote the set of variables involving the event \mathcal{E}caligraphic_E and note that \mathcal{E}caligraphic_E is a binary function of vbl()vbl\textsf{vbl}(\mathcal{E})vbl ( caligraphic_E ). The dependency graph =(,F)subscript𝐹\mathcal{H}_{\mathcal{L}}=(\mathcal{B},F)caligraphic_H start_POSTSUBSCRIPT caligraphic_L end_POSTSUBSCRIPT = ( caligraphic_B , italic_F ) is a graph with a vertex for each event and an edge (,)Fsuperscript𝐹(\mathcal{E},\mathcal{E}^{\prime})\in F( caligraphic_E , caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∈ italic_F whenever vbl()vbl()vblvblsuperscript\textsf{vbl}(\mathcal{E})\cap\textsf{vbl}(\mathcal{E}^{\prime})\neq\emptysetvbl ( caligraphic_E ) ∩ vbl ( caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ≠ ∅. The dependency degree d=d𝑑subscript𝑑d=d_{\mathcal{L}}italic_d = italic_d start_POSTSUBSCRIPT caligraphic_L end_POSTSUBSCRIPT is the maximum degree of Hsubscript𝐻H_{\mathcal{L}}italic_H start_POSTSUBSCRIPT caligraphic_L end_POSTSUBSCRIPT. We omit the subscript \mathcal{L}caligraphic_L when the considered LLL is unambiguous. The Lovász Local Lemma [18] states that Pr(¯)>0Prsubscript¯0\Pr(\cap_{\mathcal{E}\in\mathcal{B}}\bar{\mathcal{E}})>0roman_Pr ( ∩ start_POSTSUBSCRIPT caligraphic_E ∈ caligraphic_B end_POSTSUBSCRIPT over¯ start_ARG caligraphic_E end_ARG ) > 0 holds if epd<1𝑒𝑝𝑑1epd<1italic_e italic_p italic_d < 1, or in other words, there exists an assignment to the variables that avoids all bad events.

In the constructive Lovász local lemma one aims to compute such an feasible assignment, avoiding all bad events. This is often under much stronger conditions on the relation of p𝑝pitalic_p and d𝑑ditalic_d. The relation of p𝑝pitalic_p and d𝑑ditalic_d is referred to as the LLL criterion.

Constructive Distributed Lovász Local Lemma

In the distributed setting, the LLL instance \mathcal{L}caligraphic_L is mapped to a communication network G=(V,E)𝐺𝑉𝐸G=(V,E)italic_G = ( italic_V , italic_E ). We are given a function :𝒱V:𝒱𝑉\ell:\mathcal{B}\cup\mathcal{V}\rightarrow Vroman_ℓ : caligraphic_B ∪ caligraphic_V → italic_V that assigns each variable and each bad event to a node of the communication network. We assume that for each variable x𝒱𝑥𝒱x\in\mathcal{V}italic_x ∈ caligraphic_V, the node (x)𝑥\ell(x)roman_ℓ ( italic_x ) knows the distribution of x𝑥xitalic_x, including the range 𝗋𝖺𝗇𝗀𝖾(x)𝗋𝖺𝗇𝗀𝖾𝑥\mathsf{range}(x)sansserif_range ( italic_x ) of the variable. We also say that node (x)𝑥\ell(x)roman_ℓ ( italic_x ) simulates the variable/event x𝑥xitalic_x. For a vertex vV𝑣𝑉v\in Vitalic_v ∈ italic_V, we call l(v)=|1(v)|𝑙𝑣superscript1𝑣l(v)=|\ell^{-1}(v)|italic_l ( italic_v ) = | roman_ℓ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_v ) | the load of vertex v𝑣vitalic_v. The (maximum) vertex load of an LLL instance is l=maxvVl(v)𝑙subscript𝑣𝑉𝑙𝑣l=\max_{v\in V}l(v)italic_l = roman_max start_POSTSUBSCRIPT italic_v ∈ italic_V end_POSTSUBSCRIPT italic_l ( italic_v ).

In the constructive distributed LLL, we execute a 𝖫𝖮𝖢𝖠𝖫𝖫𝖮𝖢𝖠𝖫\mathsf{LOCAL}sansserif_LOCAL or 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST algorithm on G𝐺Gitalic_G to compute a feasible assignment φ𝜑\varphiitalic_φ. Afterwards, for each variable x𝒱𝑥𝒱x\in\mathcal{V}italic_x ∈ caligraphic_V, node (x)𝑥\ell(x)roman_ℓ ( italic_x ) has to output φ(x)𝜑𝑥\varphi(x)italic_φ ( italic_x ).

In general, the graph G𝐺Gitalic_G and the dependency graph Hsubscript𝐻H_{\mathcal{L}}italic_H start_POSTSUBSCRIPT caligraphic_L end_POSTSUBSCRIPT do not have to coincide. However, distances between events in Hsubscript𝐻H_{\mathcal{L}}italic_H start_POSTSUBSCRIPT caligraphic_L end_POSTSUBSCRIPT and the corresponding nodes in G𝐺Gitalic_G are in close relation, as formalized by the next definition.

Definition 5.1 (Locality).

A triple (,G,)𝐺(\mathcal{L},G,\ell)( caligraphic_L , italic_G , roman_ℓ ) has locality ν𝜈\nuitalic_ν if distG((),(x))νsubscriptdist𝐺𝑥𝜈\operatorname{dist}_{G}(\ell(\mathcal{E}),\ell(x))\leq\nuroman_dist start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( roman_ℓ ( caligraphic_E ) , roman_ℓ ( italic_x ) ) ≤ italic_ν for all events \mathcal{E}caligraphic_E of \mathcal{L}caligraphic_L and variables xvbl()𝑥vblx\in\textsf{vbl}(\mathcal{E})italic_x ∈ vbl ( caligraphic_E ).

(Partial) Assignments. We use the value bottom\bot for variables that have not been set yet. A partial assignment φ𝜑\varphiitalic_φ of a set of variables 𝒱𝒱\mathcal{V}caligraphic_V is a function with domain 𝒱𝒱\mathcal{V}caligraphic_V satisfying φ(x)𝗋𝖺𝗇𝗀𝖾(x){}𝜑𝑥𝗋𝖺𝗇𝗀𝖾𝑥bottom\varphi(x)\in\mathsf{range}(x)\cup\{\bot\}italic_φ ( italic_x ) ∈ sansserif_range ( italic_x ) ∪ { ⊥ } for all x𝒱𝑥𝒱x\in\mathcal{V}italic_x ∈ caligraphic_V. A partial assignment ψ𝜓\psiitalic_ψ agrees with another (partial) assignment φ𝜑\varphiitalic_φ if ψ(x)=φ(x)𝜓𝑥𝜑𝑥\psi(x)=\varphi(x)italic_ψ ( italic_x ) = italic_φ ( italic_x ) for all xψ1()𝑥superscript𝜓1bottomx\notin\psi^{-1}(\bot)italic_x ∉ italic_ψ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( ⊥ ), i.e., if all proper values assigned by ψ𝜓\psiitalic_ψ match those of φ𝜑\varphiitalic_φ. A retraction ψ𝜓\psiitalic_ψ of a partial assignment φ𝜑\varphiitalic_φ is a partial assignment that agrees with φ𝜑\varphiitalic_φ. For an event \mathcal{E}caligraphic_E and a partial assignment φ𝜑\varphiitalic_φ, we use the notation Pr(φ)Prconditional𝜑\Pr(\mathcal{E}\mid\varphi)roman_Pr ( caligraphic_E ∣ italic_φ ) to mean that the probability is over assignments with which φ𝜑\varphiitalic_φ agrees; in other words, the randomness is only over the variables in φ1()superscript𝜑1bottom\varphi^{-1}(\bot)italic_φ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( ⊥ ).

Simulatable Distributed Lovász Local Lemma (CONGEST)
Definition 5.2 (Simulatability).

We say an LLL (,G,)𝐺(\mathcal{L},G,\ell)( caligraphic_L , italic_G , roman_ℓ ) is simulatable in 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST if each of the following can be done in polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n rounds:

  1. 1.

    Test: Test in parallel which events of \mathcal{L}caligraphic_L hold (without preprocessing).

  2. 2.

    Min-aggregation: Given 1111 bit in each event (variable), each variable (event) can simultaneously find the minimum of the bits of its events (variables).

  3. For the following items, it is sufficient if they hold in the setting that events and variables are given O(loglogn)𝑂𝑛O(\log\log n)italic_O ( roman_log roman_log italic_n )-bit identifiers555In general, for the whole LLL instance and for non-constant distances such identifiers do not exist, but our LLL algorithms only use the primitives in settings where they do exists and are available. (that are unique within distance 4ν4𝜈4\nu4 italic_ν in G𝐺Gitalic_G):

  4. 3.

    Evaluate: Given a partial assignment φ𝜑\varphiitalic_φ, and partial assignments ψ1,,ψtsubscript𝜓1subscript𝜓𝑡\psi_{1},\ldots,\psi_{t}italic_ψ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, t=O(logn)𝑡𝑂𝑛t=O(\log n)italic_t = italic_O ( roman_log italic_n ), in which each variable knows its values (or bottom\bot), each event \mathcal{E}caligraphic_E of \mathcal{L}caligraphic_L can simultaneously decide if Pr(ψi)αPr(φ)Prconditionalsubscript𝜓𝑖𝛼Prconditional𝜑\Pr(\mathcal{E}\mid\psi_{i})\leq\alpha\Pr(\mathcal{E}\mid\varphi)roman_Pr ( caligraphic_E ∣ italic_ψ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ≤ italic_α roman_Pr ( caligraphic_E ∣ italic_φ ) holds, where α𝛼\alphaitalic_α is a parameter known by all nodes of G𝐺Gitalic_G.

  5. 4.

    Min-aggregation: We can compute the following for O(logn)𝑂𝑛O(\log n)italic_O ( roman_log italic_n ) different instances in parallel: Given an O(loglogn)𝑂𝑛O(\log\log n)italic_O ( roman_log roman_log italic_n )-bit string in each event (variable), each variable (event) can simultaneously find the minimum of the strings for its events (for its variables).

Disjoint Variable Set LLLs

In a disjoint variable set LLLs there are two disjoint sets of variables 𝒱1,𝒱2subscript𝒱1subscript𝒱2\mathcal{V}_{1},\mathcal{V}_{2}caligraphic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , caligraphic_V start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT available for each event. In fact, we consider events \mathcal{E}caligraphic_E that can be written as the conjunction of two events 1,2subscript1subscript2\mathcal{E}_{1},\mathcal{E}_{2}caligraphic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , caligraphic_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT where vbl(i)=𝒱ivblsubscript𝑖subscript𝒱𝑖\textsf{vbl}(\mathcal{E}_{i})=\mathcal{V}_{i}vbl ( caligraphic_E start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = caligraphic_V start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and Pr(i)pPrsubscript𝑖𝑝\Pr(\mathcal{E}_{i})\leq proman_Pr ( caligraphic_E start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ≤ italic_p holds for i=1,2𝑖12i=1,2italic_i = 1 , 2. Note, that to avoid \mathcal{E}caligraphic_E it is sufficient to avoid either 1subscript1\mathcal{E}_{1}caligraphic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT or 2subscript2\mathcal{E}_{2}caligraphic_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT.

Theorem 5.3.

There is a randomized 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST algorithm that in polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n rounds w.h.p. solve any disjoint variable set LLL of constant locality ν𝜈\nuitalic_ν with dependency degree dpolylogn𝑑poly𝑛d\leq\operatorname{\text{{\rm poly}}}\log nitalic_d ≤ poly roman_log italic_n and bad event upper bound p𝑝pitalic_p. The algorithm requires p<d(2+cl)(4c+12cΔν)loglogn𝑝superscript𝑑2subscript𝑐𝑙4𝑐12subscript𝑐Δ𝜈𝑛p<d^{-(2+c_{l})-(4c+12c_{\Delta}\nu)\log\log n}italic_p < italic_d start_POSTSUPERSCRIPT - ( 2 + italic_c start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) - ( 4 italic_c + 12 italic_c start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT italic_ν ) roman_log roman_log italic_n end_POSTSUPERSCRIPT, ldcl𝑙superscript𝑑subscript𝑐𝑙l\leq d^{c_{l}}italic_l ≤ italic_d start_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, ΔlogcnΔsuperscript𝑐𝑛\Delta\leq\log^{c}nroman_Δ ≤ roman_log start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT italic_n for constants cl,cΔ1subscript𝑐𝑙subscript𝑐Δ1c_{l},c_{\Delta}\geq 1italic_c start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , italic_c start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT ≥ 1, and that the LLL is simulatable.

Sampling LLLs

In a binary LLL the range of the variables is {𝖻𝗅𝖺𝖼𝗄,𝗐𝗁𝗂𝗍𝖾}𝖻𝗅𝖺𝖼𝗄𝗐𝗁𝗂𝗍𝖾\{\mathsf{black},\mathsf{white}\}{ sansserif_black , sansserif_white }. We view the variables/nodes with black value as sampled. Thus, we also refer to them as sampling LLLs. The risk of a bad event \mathcal{E}caligraphic_E upper bounds the probability of a bad event to hold under a certain type of retractions of an assignment that avoided an associated event superscript\mathcal{E}^{\prime}caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT.

Definition 5.4 (risk).

We say that an (associated) event superscript\mathcal{E}^{\prime}caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT testifies risk x𝑥xitalic_x for some event superscript\mathcal{E}\subseteq\mathcal{E}^{\prime}caligraphic_E ⊆ caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT if

max{Pr(),maxψ𝖱𝖾𝗌𝗉𝖾𝖼𝗍(){Pr(ψ)}}x.Prsuperscriptsubscript𝜓𝖱𝖾𝗌𝗉𝖾𝖼𝗍superscriptPrconditional𝜓𝑥\displaystyle\max\big{\{}\Pr(\mathcal{E}^{\prime}),\max_{\psi\in\mathsf{% Respect}(\mathcal{E}^{\prime})}\{\Pr(\mathcal{E}\mid\psi)\}\big{\}}\leq x~{}.roman_max { roman_Pr ( caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) , roman_max start_POSTSUBSCRIPT italic_ψ ∈ sansserif_Respect ( caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUBSCRIPT { roman_Pr ( caligraphic_E ∣ italic_ψ ) } } ≤ italic_x . (3)

The risk of an event \mathcal{E}caligraphic_E is the smallest risk testified by some event superscript\mathcal{E}^{\prime}\supseteq\mathcal{E}caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ⊇ caligraphic_E .

Here, 𝖱𝖾𝗌𝗉𝖾𝖼𝗍()𝖱𝖾𝗌𝗉𝖾𝖼𝗍superscript\mathsf{Respect}(\mathcal{E}^{\prime})sansserif_Respect ( caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) is the set of retractions of assignments avoiding superscript\mathcal{E}^{\prime}caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, where either (i) no 𝖻𝗅𝖺𝖼𝗄𝖻𝗅𝖺𝖼𝗄\mathsf{black}sansserif_black variables of superscript\mathcal{E}^{\prime}caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT or (ii) all 𝗐𝗁𝗂𝗍𝖾𝗐𝗁𝗂𝗍𝖾\mathsf{white}sansserif_white variables of superscript\mathcal{E}^{\prime}caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT are retracted. In our algorithms we will use several LLLs that have a low risk and hence can be solved with the following theorem.

Theorem 5.5.

There is a randomized 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST algorithm that in polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n rounds w.h.p. solve any LLL of constant locality ν𝜈\nuitalic_ν with dependency degree dpolylogn𝑑poly𝑛d\leq\operatorname{\text{{\rm poly}}}\log nitalic_d ≤ poly roman_log italic_n and risk p𝑝pitalic_p. The algorithm requires p<d(4+cl)(4c+12cν)loglogn𝑝superscript𝑑4subscript𝑐𝑙4𝑐12𝑐𝜈𝑛p<d^{-(4+c_{l})-(4c+12c\nu)\log\log n}italic_p < italic_d start_POSTSUPERSCRIPT - ( 4 + italic_c start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) - ( 4 italic_c + 12 italic_c italic_ν ) roman_log roman_log italic_n end_POSTSUPERSCRIPT, ldcl𝑙superscript𝑑subscript𝑐𝑙l\leq d^{c_{l}}italic_l ≤ italic_d start_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, ΔlogcΔnΔsuperscriptsubscript𝑐Δ𝑛\Delta\leq\log^{c_{\Delta}}nroman_Δ ≤ roman_log start_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_n for constants cl,cΔ1subscript𝑐𝑙subscript𝑐Δ1c_{l},c_{\Delta}\geq 1italic_c start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , italic_c start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT ≥ 1 and that the LLL is simulatable.

Events favor 𝖻𝗅𝖺𝖼𝗄𝖻𝗅𝖺𝖼𝗄\mathsf{black}sansserif_black, or are monotone increasing, if changing any value to 𝖻𝗅𝖺𝖼𝗄𝖻𝗅𝖺𝖼𝗄\mathsf{black}sansserif_black does not decrease the conditional probability of the event, respectively. A typical example of a monotone increasing event is sampling a subset of vertices containing many non-edges in the neighborhood of each node. We use this problem in our procedure to color sparse nodes. A key point is that it is easy to bound the risk of monotone increasing events as shown in the following lemma from [32].

Lemma 5.6.

The risk of a monotone increasing event \mathcal{E}caligraphic_E is Pr()Pr\Pr(\mathcal{E})roman_Pr ( caligraphic_E ) testified by 𝖺𝗌𝗌𝗈𝖼()=𝖺𝗌𝗌𝗈𝖼\mathsf{assoc}(\mathcal{E})=\mathcal{E}sansserif_assoc ( caligraphic_E ) = caligraphic_E.

Last but not least we will sample subsets of nodes satisfying certain degree bounds. The following lemma is helpful to bound the risk of such sampling LLLs.

Lemma 5.7.

Consider a random variable X𝑋Xitalic_X that is a sum of independent binary random variables. For some threshold parameter x>0𝑥0x>0italic_x > 0, let xsubscript𝑥\mathcal{E}_{x}caligraphic_E start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT be the event that X>x𝑋𝑥X>xitalic_X > italic_x holds. Then, the risk of xsubscript𝑥\mathcal{E}_{x}caligraphic_E start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT is at most Pr(x/2)Prsubscript𝑥2\Pr(\mathcal{E}_{\nicefrac{{x}}{{2}}})roman_Pr ( caligraphic_E start_POSTSUBSCRIPT / start_ARG italic_x end_ARG start_ARG 2 end_ARG end_POSTSUBSCRIPT ) testified by x/2subscript𝑥2\mathcal{E}_{\nicefrac{{x}}{{2}}}caligraphic_E start_POSTSUBSCRIPT / start_ARG italic_x end_ARG start_ARG 2 end_ARG end_POSTSUBSCRIPT.

5.2 Generating Unit Slack for Sparse and Ordinary Nodes

The next lemma shows that the nodes in small ordinary cliques are somewhat sparse. As each large AC is a proper clique consisting of nodes with degree ΔΔ\Deltaroman_Δ, we obtain the following.

Observation 5.8 (Small ordinary cliques are sparse).

Any node v𝑣vitalic_v in an ordinary AC C𝐶Citalic_C has at least eC(Δ3eC)subscript𝑒𝐶Δ3subscript𝑒𝐶e_{C}\cdot(\Delta-3e_{C})italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ⋅ ( roman_Δ - 3 italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) non-edges in its neighborhood. In particular, any small node has at least Δ2/(2q(n))superscriptΔ22𝑞𝑛\Delta^{2}/(2q(n))roman_Δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / ( 2 italic_q ( italic_n ) ) non-edges in its neighborhood.

Proof.

Since C𝐶Citalic_C is not easy, each of its Δ+1eCΔ1subscript𝑒𝐶\Delta+1-e_{C}roman_Δ + 1 - italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT nodes have eCsubscript𝑒𝐶e_{C}italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT external neighbors. Since C𝐶Citalic_C is not difficult, it has no intrusive neighbor. Thus, each external neighbor of vC𝑣𝐶v\in Citalic_v ∈ italic_C has at most 2eC2subscript𝑒𝐶2e_{C}2 italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT neighbors in C𝐶Citalic_C, so at least |C|2eCΔ3eC𝐶2subscript𝑒𝐶Δ3subscript𝑒𝐶|C|-2e_{C}\geq\Delta-3e_{C}| italic_C | - 2 italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ≥ roman_Δ - 3 italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT non-neighbors. Hence, the first claim. A small node has eCΔ/q(n)subscript𝑒𝐶Δ𝑞𝑛e_{C}\geq\Delta/q(n)italic_e start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ≥ roman_Δ / italic_q ( italic_n ), implying the second claim. ∎

The task of this subsection is to prove the following lemma.

See 4.3

Proof.

Let U=V𝒪𝑈superscript𝑉𝒪U=V^{*}\cup\mathcal{O}italic_U = italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∪ caligraphic_O and U={vV𝒪sN(v)U}Usuperscript𝑈conditional-set𝑣superscript𝑉subscript𝒪𝑠𝑁𝑣𝑈𝑈U^{\prime}=\{v\in V^{*}\cup\mathcal{O}_{s}\mid N(v)\subseteq U\}\subseteq Uitalic_U start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = { italic_v ∈ italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∪ caligraphic_O start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ∣ italic_N ( italic_v ) ⊆ italic_U } ⊆ italic_U. Note that any node in V𝒪ssuperscript𝑉subscript𝒪𝑠V^{*}\cup\mathcal{O}_{s}italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∪ caligraphic_O start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT with a neighbor wU𝑤𝑈w\notin Uitalic_w ∉ italic_U automatically has unit-slack in G[U]𝐺delimited-[]𝑈G[U]italic_G [ italic_U ] as its neighbor w𝑤witalic_w is stalled to be colored later. Thus we can concentrate on the vertices in Usuperscript𝑈U^{\prime}italic_U start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT.

Each node vUV𝑣superscript𝑈superscript𝑉v\in U^{\prime}\cap V^{*}italic_v ∈ italic_U start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∩ italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is sparse and so has ε2Δ2superscript𝜀2superscriptΔ2\varepsilon^{2}\Delta^{2}italic_ε start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT non-edges in its induced neighborhood, which is within G[U]𝐺delimited-[]𝑈G[U]italic_G [ italic_U ]. Each node in U𝒪ssuperscript𝑈subscript𝒪𝑠U^{\prime}\cap\mathcal{O}_{s}italic_U start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∩ caligraphic_O start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT has at least Δ2/(2q(n))superscriptΔ22𝑞𝑛\Delta^{2}/(2q(n))roman_Δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / ( 2 italic_q ( italic_n ) ) non-edges in its neighborhood in G[U]𝐺delimited-[]𝑈G[U]italic_G [ italic_U ] by 5.8. Let μ=clog4lognlogΔ𝜇𝑐superscript4𝑛Δ\mu=c\log^{4}\log n\cdot\log\Deltaitalic_μ = italic_c roman_log start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT roman_log italic_n ⋅ roman_log roman_Δ.

We first use Theorem 5.5 (twice) to compute two sets SiUsubscript𝑆𝑖𝑈S_{i}\subseteq Uitalic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⊆ italic_U, i=1,2𝑖12i=1,2italic_i = 1 , 2 satisfying the following properties:

  1. 1.

    |SiN(v)|μsubscript𝑆𝑖𝑁𝑣𝜇|S_{i}\cap N(v)|\leq\mu| italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∩ italic_N ( italic_v ) | ≤ italic_μ, for all vV𝒪𝑣superscript𝑉𝒪v\in V^{*}\cup\mathcal{O}italic_v ∈ italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∪ caligraphic_O

  2. 2.

    |SiMC|μsubscript𝑆𝑖subscript𝑀𝐶𝜇|S_{i}\cap M_{C}|\leq\mu| italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∩ italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT | ≤ italic_μ, for all ordinary ACs C𝐶Citalic_C ,

  3. 3.

    Number of non-edges in G[SiN(v)]𝐺delimited-[]subscript𝑆𝑖𝑁𝑣G[S_{i}\cap N(v)]italic_G [ italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∩ italic_N ( italic_v ) ] is Ω(log5lognlog2Δ)Ωsuperscript5𝑛superscript2Δ\Omega(\log^{5}\log n\cdot\log^{2}\Delta)roman_Ω ( roman_log start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT roman_log italic_n ⋅ roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Δ ), for each vU𝑣superscript𝑈v\in U^{\prime}italic_v ∈ italic_U start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT.

In order to construct S1subscript𝑆1S_{1}italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT consider the process that samples each node U𝑈Uitalic_U into S1subscript𝑆1S_{1}italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT with probability p=cΔ1log4lognlogΔ𝑝𝑐superscriptΔ1superscript4𝑛Δp=c\Delta^{-1}\cdot\log^{4}\log n\cdot\log\Deltaitalic_p = italic_c roman_Δ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ⋅ roman_log start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT roman_log italic_n ⋅ roman_log roman_Δ for a suitable constant c𝑐citalic_c. For a suitable constants c1subscript𝑐1c_{1}italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT introduce the following bad events.

  1. 1.

    For all vV𝒪𝑣superscript𝑉𝒪v\in V^{*}\cup\mathcal{O}italic_v ∈ italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∪ caligraphic_O, event vsubscript𝑣\mathcal{E}_{v}caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT holds if |SiN(v)|4μsubscript𝑆𝑖𝑁𝑣4𝜇|S_{i}\cap N(v)|\geq 4\mu| italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∩ italic_N ( italic_v ) | ≥ 4 italic_μ

  2. 2.

    For each ordinary AC C𝐶Citalic_C, event Csubscript𝐶\mathcal{E}_{C}caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT holds if |SiMC|4μsubscript𝑆𝑖subscript𝑀𝐶4𝜇|S_{i}\cap M_{C}|\geq 4\mu| italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∩ italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT | ≥ 4 italic_μ,

  3. 3.

    For each vU𝑣superscript𝑈v\in U^{\prime}italic_v ∈ italic_U start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, event vsubscriptsuperscript𝑣\mathcal{E}^{\prime}_{v}caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT holds if the number of non edges in G[SiN(v)]𝐺delimited-[]subscript𝑆𝑖𝑁𝑣G[S_{i}\cap N(v)]italic_G [ italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∩ italic_N ( italic_v ) ] is less than c1log5lognlog2Δsubscript𝑐1superscript5𝑛superscript2Δc_{1}\cdot\log^{5}\log n\cdot\log^{2}\Deltaitalic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ⋅ roman_log start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT roman_log italic_n ⋅ roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Δ.

Claim 5.9.

The sampling of S1Usubscript𝑆1𝑈S_{1}\subseteq Uitalic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ⊆ italic_U with probability p𝑝pitalic_p and the aforementioned bad events is a simulatable LLL with risk Δc/50loglognsuperscriptΔ𝑐50𝑛\Delta^{-c/50\cdot\log\log n}roman_Δ start_POSTSUPERSCRIPT - italic_c / 50 ⋅ roman_log roman_log italic_n end_POSTSUPERSCRIPT.

Proof.

We first bound the risk of the events and then reason about simulatability.

  • Fix some vV𝒪𝑣superscript𝑉𝒪v\in V^{*}\cup\mathcal{O}italic_v ∈ italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∪ caligraphic_O. The expected number of neighbors in S𝑆Sitalic_S is d(v)p/Δμ𝑑𝑣𝑝Δ𝜇d(v)\cdot p/\Delta\leq\muitalic_d ( italic_v ) ⋅ italic_p / roman_Δ ≤ italic_μ. Hence, Pr(v)exp(2μ/3)Prsubscript𝑣2𝜇3\Pr(\mathcal{E}_{v})\leq\exp\left(-2\mu/3\right)roman_Pr ( caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) ≤ roman_exp ( - 2 italic_μ / 3 ) by Chernoff. Additionally, define an associated event 𝖺𝗌𝗌𝗈𝖼(v)𝖺𝗌𝗌𝗈𝖼subscript𝑣\mathsf{assoc}(\mathcal{E}_{v})sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) as the event that at most 2μ2𝜇2\mu2 italic_μ neighbors are sampled. We have Pr(𝖺𝗌𝗌𝗈𝖼(v,d))exp(μ/3)Pr𝖺𝗌𝗌𝗈𝖼subscript𝑣𝑑𝜇3\Pr(\mathsf{assoc}(\mathcal{E}_{v,d}))\leq\exp\left(-\mu/3\right)roman_Pr ( sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_v , italic_d end_POSTSUBSCRIPT ) ) ≤ roman_exp ( - italic_μ / 3 ). This bounds the risk of vsubscript𝑣\mathcal{E}_{v}caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT to be at most Pr(𝖺𝗌𝗌𝗈𝖼(v))Pr𝖺𝗌𝗌𝗈𝖼subscript𝑣\Pr(\mathsf{assoc}(\mathcal{E}_{v}))roman_Pr ( sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) ) by Lemma 5.7.

  • The proof for bounding the risk of the event Csubscript𝐶\mathcal{E}_{C}caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT for each ordinary clique is identical to the proof for vsubscript𝑣\mathcal{E}_{v}caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT by considering the sampling status of the matching MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT instead of the neighborhood of the respective node.

  • Fix a node vU𝑣superscript𝑈v\in U^{\prime}italic_v ∈ italic_U start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT and let α=m¯(N(v)U)/Δ2𝛼¯𝑚𝑁𝑣𝑈superscriptΔ2\alpha=\bar{m}(N(v)\cap U)/\Delta^{2}italic_α = over¯ start_ARG italic_m end_ARG ( italic_N ( italic_v ) ∩ italic_U ) / roman_Δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT be the fraction of non-edges of node v𝑣vitalic_v in its neighborhood induced by U𝑈Uitalic_U. 5.8 shows αmin{ε2,1/(2q2(n))}=1/(2q2(n))𝛼superscript𝜀212superscript𝑞2𝑛12superscript𝑞2𝑛\alpha\geq\min\{\varepsilon^{2},1/(2q^{2}(n))\}=1/(2q^{2}(n))italic_α ≥ roman_min { italic_ε start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , 1 / ( 2 italic_q start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_n ) ) } = 1 / ( 2 italic_q start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_n ) ), regardless of whether vVv\in V*italic_v ∈ italic_V ∗ or v𝒪s𝑣subscript𝒪𝑠v\in\mathcal{O}_{s}italic_v ∈ caligraphic_O start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT.

    Now, fix the constant c1subscript𝑐1c_{1}italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT such that the event vsubscriptsuperscript𝑣\mathcal{E}^{\prime}_{v}caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT is the event that the number of non-edges in G[N(v)S1]𝐺delimited-[]𝑁𝑣subscript𝑆1G[N(v)\cap S_{1}]italic_G [ italic_N ( italic_v ) ∩ italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ] is less than m¯thres=αμ2/2subscript¯𝑚𝑡𝑟𝑒𝑠𝛼superscript𝜇22\overline{m}_{thres}=\alpha\mu^{2}/2over¯ start_ARG italic_m end_ARG start_POSTSUBSCRIPT italic_t italic_h italic_r italic_e italic_s end_POSTSUBSCRIPT = italic_α italic_μ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 2. Let f𝑓fitalic_f be a random variable for the number of non-edges in the graph induced by X=N(v)S1𝑋𝑁𝑣subscript𝑆1X=N(v)\cap S_{1}italic_X = italic_N ( italic_v ) ∩ italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Apply the non-edge hitting lemma Lemma B.2, with |X|Δ𝑋Δ|X|\leq\Delta| italic_X | ≤ roman_Δ and μ¯αΔ2¯𝜇𝛼superscriptΔ2\overline{\mu}\geq\alpha\Delta^{2}over¯ start_ARG italic_μ end_ARG ≥ italic_α roman_Δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. The lemma shows that the expected number of non-edges is 𝔼[f]p2m¯αμ2𝔼𝑓superscript𝑝2¯𝑚𝛼superscript𝜇2\operatorname{\mathbb{E}}[f]\geq p^{2}\overline{m}\geq\alpha\mu^{2}blackboard_E [ italic_f ] ≥ italic_p start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT over¯ start_ARG italic_m end_ARG ≥ italic_α italic_μ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and that f𝑓fitalic_f is also well concentrated. We obtain Pr(v)=Pr(f𝔼[f]/2)exp(pm¯5|X|)exp(αμ5)Prsubscriptsuperscript𝑣Pr𝑓𝔼𝑓2𝑝¯𝑚5𝑋𝛼𝜇5\Pr(\mathcal{E}^{\prime}_{v})=\Pr(f\leq\operatorname{\mathbb{E}}[f]/2)\leq\exp% \left(-\frac{p\overline{m}}{5|X|}\right)\leq\exp\left(-\frac{\alpha\mu}{5}\right)roman_Pr ( caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) = roman_Pr ( italic_f ≤ blackboard_E [ italic_f ] / 2 ) ≤ roman_exp ( - divide start_ARG italic_p over¯ start_ARG italic_m end_ARG end_ARG start_ARG 5 | italic_X | end_ARG ) ≤ roman_exp ( - divide start_ARG italic_α italic_μ end_ARG start_ARG 5 end_ARG ). vsubscriptsuperscript𝑣\mathcal{E}^{\prime}_{v}caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT is a monotone increasing event. Hence, its risk is at most Pr(v)Prsubscriptsuperscript𝑣\Pr(\mathcal{E}^{\prime}_{v})roman_Pr ( caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) by Lemma 5.6, where the associated event 𝖺𝗌𝗌𝗈𝖼(v)𝖺𝗌𝗌𝗈𝖼subscriptsuperscript𝑣\mathsf{assoc}(\mathcal{E}^{\prime}_{v})sansserif_assoc ( caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) is vsubscriptsuperscript𝑣\mathcal{E}^{\prime}_{v}caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT itself.

In summary the risk is upper bounded by max{exp(2μ/3),exp(αμ/5)}Δc/50loglogn2𝜇3𝛼𝜇5superscriptΔ𝑐50𝑛\max\{\exp\left(-2\mu/3\right),\exp\left(-\alpha\mu/5\right)\}\leq\Delta^{-c/5% 0\cdot\log\log n}roman_max { roman_exp ( - 2 italic_μ / 3 ) , roman_exp ( - italic_α italic_μ / 5 ) } ≤ roman_Δ start_POSTSUPERSCRIPT - italic_c / 50 ⋅ roman_log roman_log italic_n end_POSTSUPERSCRIPT.

The simulatability of the first two types of events (vsubscript𝑣\mathcal{E}_{v}caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT for vU𝑣𝑈v\in Uitalic_v ∈ italic_U and Csubscript𝐶\mathcal{E}_{C}caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT for ordinary cliques C𝐶Citalic_C) is immediate as it only counts the number of immediate neighbors of nodes and cliques, respectively. Here, the leader node (C)subscript𝐶\ell(\mathcal{E}_{C})roman_ℓ ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) can gather full information about the number of nodes in SMC𝑆subscript𝑀𝐶S\cap M_{C}italic_S ∩ italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT in any partial assignment sampling S𝑆Sitalic_S.

The lengthy proof of the simulatability of the event vsubscriptsuperscript𝑣\mathcal{E}^{\prime}_{v}caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT for vU𝑣superscript𝑈v\in U^{\prime}italic_v ∈ italic_U start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is word by word identical to the proof of the simulatability of a similar type of event in [32, Lemma 8.4, arxiv version]. The crucial point is part 3 of the simulatability definition (Definition 5.2) where O(logn)𝑂𝑛O(\log n)italic_O ( roman_log italic_n ) evaluations of conditional probabilities need to be done in parallel in the setting where locally unique IDs are represented with O(loglogn)𝑂𝑛O(\log\log n)italic_O ( roman_log roman_log italic_n ) bits. These small IDs are sufficient for a preprocessing that is done simultaneously for all instances and in which v𝑣vitalic_v learns the whole topology of G[N(v)U]𝐺delimited-[]𝑁𝑣𝑈G[N(v)\cap U]italic_G [ italic_N ( italic_v ) ∩ italic_U ]. Once the topology is available, the sampling status of nodes SN(v)𝑆𝑁𝑣S\cap N(v)italic_S ∩ italic_N ( italic_v ) will reveal the number of non-edges in v𝑣vitalic_v’s sampled neighborhood, showing simulatability. ∎

Due to 5.9, we can apply Theorem 5.5 to solve the LLL in 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST and compute a set S1subscript𝑆1S_{1}italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT with the required properties in polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n rounds, w.h.p. We proceed analogously for S2subscript𝑆2S_{2}italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT but compute it as a subset of US1𝑈subscript𝑆1U\setminus S_{1}italic_U ∖ italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. The remaining steps are identical except that constant c3subscript𝑐3c_{3}italic_c start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT is replaced with a smaller constant as removing the set S1subscript𝑆1S_{1}italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT from U𝑈Uitalic_U may reduce the sparsity of the nodes in Usuperscript𝑈U^{\prime}italic_U start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. Still, the reduction is limited to a constant factor for the following reason: removing at most O(log4lognlogΔ)𝑂superscript4𝑛ΔO(\log^{4}\log n\cdot\log\Delta)italic_O ( roman_log start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT roman_log italic_n ⋅ roman_log roman_Δ ) nodes from the neighborhood of each node reduces the number of non-edges in each neighborhood by at most O(Δlog4lognlogΔ)=O(Δlog5logn)𝑂Δsuperscript4𝑛Δ𝑂Δsuperscript5𝑛O(\Delta\log^{4}\log n\cdot\log\Delta)=O(\Delta\log^{5}\log n)italic_O ( roman_Δ roman_log start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT roman_log italic_n ⋅ roman_log roman_Δ ) = italic_O ( roman_Δ roman_log start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT roman_log italic_n ). Thus a node in U𝒪ssuperscript𝑈subscript𝒪𝑠U^{\prime}\cap\mathcal{O}_{s}italic_U start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∩ caligraphic_O start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT still has Δ2/(2q(n))O(Δlog5logn)Δ2/(4q(n))superscriptΔ22𝑞𝑛𝑂Δsuperscript5𝑛superscriptΔ24𝑞𝑛\Delta^{2}/(2q(n))-O(\Delta\log^{5}\log n)\geq\Delta^{2}/(4q(n))roman_Δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / ( 2 italic_q ( italic_n ) ) - italic_O ( roman_Δ roman_log start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT roman_log italic_n ) ≥ roman_Δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / ( 4 italic_q ( italic_n ) ) non-edges available, where we used that n𝑛nitalic_n is large enough and Δ=ω(q(n)log5logn)Δ𝜔𝑞𝑛superscript5𝑛\Delta=\omega(q(n)\log^{5}\log n)roman_Δ = italic_ω ( italic_q ( italic_n ) roman_log start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT roman_log italic_n ) holds. For nodes in UVsuperscript𝑈superscript𝑉U^{\prime}\cap V^{*}italic_U start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∩ italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, removing the nodes in S1subscript𝑆1S_{1}italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT from U𝑈Uitalic_U also removes less than half of the initially available ε2Δ2superscript𝜀2superscriptΔ2\varepsilon^{2}\Delta^{2}italic_ε start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT non-edges.

With the two sets S1subscript𝑆1S_{1}italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and S2subscript𝑆2S_{2}italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, we apply Lemma B.1 with two disjoint color palettes of size χ=Δ/2𝜒Δ2\chi=\lfloor\Delta/2\rflooritalic_χ = ⌊ roman_Δ / 2 ⌋. The number of non-edges m¯¯𝑚\overline{m}over¯ start_ARG italic_m end_ARG in G[SiN(v)]𝐺delimited-[]subscript𝑆𝑖𝑁𝑣G[S_{i}\cap N(v)]italic_G [ italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∩ italic_N ( italic_v ) ] satisfies m¯/χ=Ω(logΔloglogn)¯𝑚𝜒ΩΔ𝑛\overline{m}/\chi=\Omega(\log\Delta\cdot\log\log n)over¯ start_ARG italic_m end_ARG / italic_χ = roman_Ω ( roman_log roman_Δ ⋅ roman_log roman_log italic_n ) as required. As a result, a subset SS1S2V𝒪𝑆subscript𝑆1subscript𝑆2superscript𝑉𝒪S\subseteq S_{1}\cup S_{2}\subseteq V^{*}\cup\mathcal{O}italic_S ⊆ italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∪ italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⊆ italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∪ caligraphic_O is colored, such that all nodes in V𝒪ssuperscript𝑉subscript𝒪𝑠V^{*}\cup\mathcal{O}_{s}italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∪ caligraphic_O start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT get slack. The second property of this lemma, stating that the number of nodes colored in N(v)𝑁𝑣N(v)italic_N ( italic_v ) of the respective nodes and in MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT, follows from the bound on number of neighbors in N(v)(S1S2)𝑁𝑣subscript𝑆1subscript𝑆2N(v)\cap(S_{1}\cup S_{2})italic_N ( italic_v ) ∩ ( italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∪ italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) and MC(S1S2)subscript𝑀𝐶subscript𝑆1subscript𝑆2M_{C}\cap(S_{1}\cup S_{2})italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ∩ ( italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∪ italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ). The runtime immediately follows from Theorem 5.5 and Lemma B.1. ∎

5.3 Computing the Set Z=Z1Z2𝑍subscript𝑍1subscript𝑍2Z=Z_{1}\cup Z_{2}italic_Z = italic_Z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∪ italic_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT

See 4.5

We compute the sets Z1subscript𝑍1Z_{1}italic_Z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and Z2subscript𝑍2Z_{2}italic_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT by two consecutive LLLs 1subscript1\mathcal{L}_{1}caligraphic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and 2subscript2\mathcal{L}_{2}caligraphic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. In the first LLL, we compute the set Z𝑍Zitalic_Z, which we split into the two sets Z1subscript𝑍1Z_{1}italic_Z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and Z2subscript𝑍2Z_{2}italic_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT in the second LLL.

Definition 5.10 (First sampling LLL).

We define the following sampling LLL 1subscript1\mathcal{L}_{1}caligraphic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Let X={v𝒪l:v is uncolored after Step 1}𝑋conditional-set𝑣subscript𝒪𝑙v is uncolored after Step 1X=\{v\in\mathcal{O}_{l}:\text{$v$ is uncolored after Step~{}1}\}italic_X = { italic_v ∈ caligraphic_O start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT : italic_v is uncolored after Step 1 }.

  • Variables: Sample each node of X𝑋Xitalic_X with probability q=1/30𝑞130q=1/30italic_q = 1 / 30 into Z𝑍Zitalic_Z. Denote Y=XZ𝑌𝑋𝑍Y=X\setminus Zitalic_Y = italic_X ∖ italic_Z.

  • Bad Events:

    1. 1.

      For each v𝒪𝑣𝒪v\in\mathcal{O}italic_v ∈ caligraphic_O, there is a bad event vsubscript𝑣\mathcal{E}_{v}caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT stating that |ZN(v)|>3qΔ𝑍𝑁𝑣3𝑞Δ|Z\cap N(v)|>3q\Delta| italic_Z ∩ italic_N ( italic_v ) | > 3 italic_q roman_Δ.

    2. 2.

      For each important AC C𝐶Citalic_C, define an event Csubscript𝐶\mathcal{E}_{C}caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT that holds if fewer than q2(1q)3Δ/20superscript𝑞2superscript1𝑞3Δ20q^{2}(1-q)^{3}\Delta/20italic_q start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( 1 - italic_q ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT roman_Δ / 20 edges of MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT are useful.

  • Associated Events

    1. 1.

      𝖺𝗌𝗌𝗈𝖼(v)𝖺𝗌𝗌𝗈𝖼subscript𝑣\mathsf{assoc}(\mathcal{E}_{v})sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ): For each v𝒪𝑣𝒪v\in\mathcal{O}italic_v ∈ caligraphic_O, the bad event 𝖺𝗌𝗌𝗈𝖼(v)𝖺𝗌𝗌𝗈𝖼subscript𝑣\mathsf{assoc}(\mathcal{E}_{v})sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) holds if |ZN(v)|>3qΔ/2=Δ/20𝑍𝑁𝑣3𝑞Δ2Δ20|Z\cap N(v)|>3q\Delta/2=\Delta/20| italic_Z ∩ italic_N ( italic_v ) | > 3 italic_q roman_Δ / 2 = roman_Δ / 20 ,

    2. 2.

      𝖺𝗌𝗌𝗈𝖼(C)𝖺𝗌𝗌𝗈𝖼subscript𝐶\mathsf{assoc}(\mathcal{E}_{C})sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ): The event holds if there are fewer than q(1q)Δ/10𝑞1𝑞Δ10q(1-q)\Delta/10italic_q ( 1 - italic_q ) roman_Δ / 10 useful edges or if there are fewer than (1q)2Δ/10superscript1𝑞2Δ10(1-q)^{2}\Delta/10( 1 - italic_q ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Δ / 10 𝗐𝗁𝗂𝗍𝖾𝗐𝗁𝗂𝗍𝖾\mathsf{white}sansserif_white edges in C𝐶Citalic_C.

  • Event/variable assignment \ellroman_ℓ: Each variable and each event vsubscript𝑣\mathcal{E}_{v}caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT, 𝖺𝗌𝗌𝗈𝖼(v)𝖺𝗌𝗌𝗈𝖼subscript𝑣\mathsf{assoc}(\mathcal{E}_{v})sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) are simulated by the corresponding node. The events Csubscript𝐶\mathcal{E}_{C}caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT and 𝖺𝗌𝗌𝗈𝖼(C)𝖺𝗌𝗌𝗈𝖼subscript𝐶\mathsf{assoc}(\mathcal{E}_{C})sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) are simulated by the node of C𝐶Citalic_C with the largest ID.

Note that 𝖺𝗌𝗌𝗈𝖼(C)𝖺𝗌𝗌𝗈𝖼subscript𝐶\mathsf{assoc}(\mathcal{E}_{C})sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) is of different nature from Csubscript𝐶\mathcal{E}_{C}caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT.

Lemma 5.11.

We have the following upper bounds for the probabilities of the respective events.

  1. 1.

    For all v𝒪𝑣𝒪v\in\mathcal{O}italic_v ∈ caligraphic_O: Pr(𝖺𝗌𝗌𝗈𝖼(v))exp(Ω(Δ))Pr𝖺𝗌𝗌𝗈𝖼subscript𝑣ΩΔ\Pr(\mathsf{assoc}(\mathcal{E}_{v}))\leq\exp(-\Omega(\Delta))roman_Pr ( sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) ) ≤ roman_exp ( - roman_Ω ( roman_Δ ) ).

  2. 2.

    For all important ACs C𝐶Citalic_C: Pr(𝖺𝗌𝗌𝗈𝖼(C))exp(Ω(Δ))Pr𝖺𝗌𝗌𝗈𝖼subscript𝐶ΩΔ\Pr(\mathsf{assoc}(\mathcal{E}_{C}))\leq\exp(-\Omega(\Delta))roman_Pr ( sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) ) ≤ roman_exp ( - roman_Ω ( roman_Δ ) ).

Proof.

Throughout the proof we use that q𝑞qitalic_q and 1q1𝑞1-q1 - italic_q are constant.

Bounding Pr(𝖺𝗌𝗌𝗈𝖼(v))Pr𝖺𝗌𝗌𝗈𝖼subscript𝑣\Pr(\mathsf{assoc}(\mathcal{E}_{v}))roman_Pr ( sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) ): As each node joins Z𝑍Zitalic_Z independently with probability q𝑞qitalic_q, we have E[|ZN(v)|]qΔ𝐸delimited-[]𝑍𝑁𝑣𝑞ΔE[|Z\cap N(v)|]\leq q\Deltaitalic_E [ | italic_Z ∩ italic_N ( italic_v ) | ] ≤ italic_q roman_Δ, and the first bound follows from a Chernoff bound.

Bounding Pr(𝖺𝗌𝗌𝗈𝖼(C))Pr𝖺𝗌𝗌𝗈𝖼subscript𝐶\Pr(\mathsf{assoc}(\mathcal{E}_{C}))roman_Pr ( sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) ): Let NCsubscript𝑁𝐶N_{C}italic_N start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT be the arcs of MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT that have both endpoints in 𝒪lsubscript𝒪𝑙\mathcal{O}_{l}caligraphic_O start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT and uncolored after Step 1. All heads of arcs in MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT are already in 𝒪lsubscript𝒪𝑙\mathcal{O}_{l}caligraphic_O start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT, and by the definition of an important AC, at least Δ/12Δ12\Delta/12roman_Δ / 12 arcs in MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT have their tails in 𝒪lsubscript𝒪𝑙\mathcal{O}_{l}caligraphic_O start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT. At most O(log5logn)𝑂superscript5𝑛O(\log^{5}\log n)italic_O ( roman_log start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT roman_log italic_n ) of V(MC)𝑉subscript𝑀𝐶V(M_{C})italic_V ( italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) are already colored. Thus, NCsubscript𝑁𝐶N_{C}italic_N start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT contains at least Δ/12O(log5logn)Δ/20Δ12𝑂superscript5𝑛Δ20\Delta/12-O(\log^{5}\log n)\geq\Delta/20roman_Δ / 12 - italic_O ( roman_log start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT roman_log italic_n ) ≥ roman_Δ / 20 nodes.

Now, observe that the probability for an edge of NCsubscript𝑁𝐶N_{C}italic_N start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT to be useful is q(1q)𝑞1𝑞q(1-q)italic_q ( 1 - italic_q ) and the expected number of useful edges is q(1q)|NC|=q(1q)Δ/15𝑞1𝑞subscript𝑁𝐶𝑞1𝑞Δ15q(1-q)|N_{C}|=q(1-q)\Delta/15italic_q ( 1 - italic_q ) | italic_N start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT | = italic_q ( 1 - italic_q ) roman_Δ / 15. This property is independent for different edges in NCsubscript𝑁𝐶N_{C}italic_N start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT, so the claim regarding the number of useful edges follows from a Chernoff bound. Similarly, the probability for an edge to be 𝗐𝗁𝗂𝗍𝖾𝗐𝗁𝗂𝗍𝖾\mathsf{white}sansserif_white is (1q)2superscript1𝑞2(1-q)^{2}( 1 - italic_q ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, and the expected number of 𝗐𝗁𝗂𝗍𝖾𝗐𝗁𝗂𝗍𝖾\mathsf{white}sansserif_white edges in MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT is (1q)2|NC|=(1q)2Δ/15superscript1𝑞2subscript𝑁𝐶superscript1𝑞2Δ15(1-q)^{2}|N_{C}|=(1-q)^{2}\Delta/15( 1 - italic_q ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | italic_N start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT | = ( 1 - italic_q ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Δ / 15. The claim regarding 𝗐𝗁𝗂𝗍𝖾𝗐𝗁𝗂𝗍𝖾\mathsf{white}sansserif_white edges then follows with a Chernoff bound. ∎

Lemma 5.12.

1subscript1\mathcal{L}_{1}caligraphic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is a sampling LLL with risk exp(Ω(Δ))ΩΔ\exp(-\Omega(\Delta))roman_exp ( - roman_Ω ( roman_Δ ) ) and dependency degree O(Δ2)𝑂superscriptΔ2O(\Delta^{2})italic_O ( roman_Δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ).

Proof.

The probabilities of the associated events 𝖺𝗌𝗌𝗈𝖼(v)𝖺𝗌𝗌𝗈𝖼subscript𝑣\mathsf{assoc}(\mathcal{E}_{v})sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) and 𝖺𝗌𝗌𝗈𝖼(C)𝖺𝗌𝗌𝗈𝖼subscript𝐶\mathsf{assoc}(\mathcal{E}_{C})sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) are at most exp(Ω(Δ))ΩΔ\exp(-\Omega(\Delta))roman_exp ( - roman_Ω ( roman_Δ ) ) by Lemma 5.11.

The dependency degree can be bounded as follows. Each variable of a node stating whether the node is 𝗐𝗁𝗂𝗍𝖾𝗐𝗁𝗂𝗍𝖾\mathsf{white}sansserif_white or 𝖻𝗅𝖺𝖼𝗄𝖻𝗅𝖺𝖼𝗄\mathsf{black}sansserif_black only appears in the events vsubscript𝑣\mathcal{E}_{v}caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT and 𝖺𝗌𝗌𝗈𝖼(v)𝖺𝗌𝗌𝗈𝖼subscript𝑣\mathsf{assoc}(\mathcal{E}_{v})sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) of adjacent nodes and in the events Csubscript𝐶\mathcal{E}_{C}caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT and 𝖺𝗌𝗌𝗈𝖼(C)𝖺𝗌𝗌𝗈𝖼subscript𝐶\mathsf{assoc}(\mathcal{E}_{C})sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) of adjacent ACs, bounding the variable degree by O(Δ)𝑂ΔO(\Delta)italic_O ( roman_Δ ). We have |vbl(V)|Δvblsubscript𝑉Δ|\textsf{vbl}(\mathcal{E}_{V})|\leq\Delta| vbl ( caligraphic_E start_POSTSUBSCRIPT italic_V end_POSTSUBSCRIPT ) | ≤ roman_Δ and each event Csubscript𝐶\mathcal{E}_{C}caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT depends on two variables for each edge in MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT. As |MC|Δsubscript𝑀𝐶Δ|M_{C}|\leq\Delta| italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT | ≤ roman_Δ, we obtain that each event depends on at most O(Δ)𝑂ΔO(\Delta)italic_O ( roman_Δ ) variables and the dependency degree can be upper bounded by O(Δ2)𝑂superscriptΔ2O(\Delta^{2})italic_O ( roman_Δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ).

Via Lemma 5.7 we obtain that 𝖺𝗌𝗌𝗈𝖼(v)𝖺𝗌𝗌𝗈𝖼subscript𝑣\mathsf{assoc}(\mathcal{E}_{v})sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) testifies that vsubscript𝑣\mathcal{E}_{v}caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT has risk exp(Ω(Δ))ΩΔ\exp(-\Omega(\Delta))roman_exp ( - roman_Ω ( roman_Δ ) ).

Next, we fix an important AC C𝐶Citalic_C and reason that 𝖺𝗌𝗌𝗈𝖼(C)𝖺𝗌𝗌𝗈𝖼subscript𝐶\mathsf{assoc}(\mathcal{E}_{C})sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) testifies that Csubscript𝐶\mathcal{E}_{C}caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT has risk exp(Ω(Δ))ΩΔ\exp(-\Omega(\Delta))roman_exp ( - roman_Ω ( roman_Δ ) ). First note that C𝖺𝗌𝗌𝗈𝖼(C)subscript𝐶𝖺𝗌𝗌𝗈𝖼subscript𝐶\mathcal{E}_{C}\subseteq\mathsf{assoc}(\mathcal{E}_{C})caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ⊆ sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ), as required by Definition 5.4. Let ψ𝖱𝖾𝗌𝗉𝖾𝖼𝗍(𝖺𝗌𝗌𝗈𝖼(C))𝜓𝖱𝖾𝗌𝗉𝖾𝖼𝗍𝖺𝗌𝗌𝗈𝖼subscript𝐶\psi\in\mathsf{Respect}(\mathsf{assoc}(\mathcal{E}_{C}))italic_ψ ∈ sansserif_Respect ( sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) ), namely ψ𝜓\psiitalic_ψ is a retraction of an assignment φ𝜑\varphiitalic_φ under which 𝖺𝗌𝗌𝗈𝖼(C)𝖺𝗌𝗌𝗈𝖼subscript𝐶\mathsf{assoc}(\mathcal{E}_{C})sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) is avoided. By the definition of 𝖱𝖾𝗌𝗉𝖾𝖼𝗍(𝖺𝗌𝗌𝗈𝖼(C))𝖱𝖾𝗌𝗉𝖾𝖼𝗍𝖺𝗌𝗌𝗈𝖼subscript𝐶\mathsf{Respect}(\mathsf{assoc}(\mathcal{E}_{C}))sansserif_Respect ( sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) ), the set of retracted variables is in one of the following two cases: 1) The set of retracted variables contains no variables of vbl(C)vblsubscript𝐶\textsf{vbl}(\mathcal{E}_{C})vbl ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) that were 𝖻𝗅𝖺𝖼𝗄𝖻𝗅𝖺𝖼𝗄\mathsf{black}sansserif_black under φ𝜑\varphiitalic_φ, or 2) The set of retracted variables contains all variables of vbl(C)vblsubscript𝐶\textsf{vbl}(\mathcal{E}_{C})vbl ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) that were 𝗐𝗁𝗂𝗍𝖾𝗐𝗁𝗂𝗍𝖾\mathsf{white}sansserif_white under φ𝜑\varphiitalic_φ.

Let us first consider the second case. As 𝖺𝗌𝗌𝗈𝖼(C)𝖺𝗌𝗌𝗈𝖼subscript𝐶\mathsf{assoc}(\mathcal{E}_{C})sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) is avoided under φ𝜑\varphiitalic_φ, under the assignment φ𝜑\varphiitalic_φ at least (1q)2Δ/10superscript1𝑞2Δ10(1-q)^{2}\Delta/10( 1 - italic_q ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Δ / 10 edges of MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT are white. In the second case, all of these obtain fresh randomness, and each of them is useful independently with probability q(1q)𝑞1𝑞q(1-q)italic_q ( 1 - italic_q ). Thus, in expectation, at least q(1q)3Δ/10𝑞superscript1𝑞3Δ10q(1-q)^{3}\Delta/10italic_q ( 1 - italic_q ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT roman_Δ / 10 of them are useful. With a Chernoff bound, we obtain that the probability of Csubscript𝐶\mathcal{E}_{C}caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT to happen in the second case is at most exp(Ω(Δ))ΩΔ\exp(-\Omega(\Delta))roman_exp ( - roman_Ω ( roman_Δ ) ).

Now consider the first case. As 𝖺𝗌𝗌𝗈𝖼(C)𝖺𝗌𝗌𝗈𝖼subscript𝐶\mathsf{assoc}(\mathcal{E}_{C})sansserif_assoc ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) is avoided under φ𝜑\varphiitalic_φ, under the assignment φ𝜑\varphiitalic_φ at least q(1q)Δ/10𝑞1𝑞Δ10q(1-q)\Delta/10italic_q ( 1 - italic_q ) roman_Δ / 10 edges of MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT are useful. Let UC𝑈𝐶U\subseteq Citalic_U ⊆ italic_C be the set of nodes in those useful edges that are contained in C𝐶Citalic_C. Note that all nodes in U𝑈Uitalic_U are 𝗐𝗁𝗂𝗍𝖾𝗐𝗁𝗂𝗍𝖾\mathsf{white}sansserif_white under φ𝜑\varphiitalic_φ. Let U1Usubscript𝑈1𝑈U_{1}\subseteq Uitalic_U start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ⊆ italic_U be the nodes that are also 𝗐𝗁𝗂𝗍𝖾𝗐𝗁𝗂𝗍𝖾\mathsf{white}sansserif_white under ψ𝜓\psiitalic_ψ and let U2Usubscript𝑈2𝑈U_{2}\subseteq Uitalic_U start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⊆ italic_U be the nodes that evaluate to bottom\bot under ψ𝜓\psiitalic_ψ, i.e., got retracted. Nodes in U2subscript𝑈2U_{2}italic_U start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are 𝖻𝗅𝖺𝖼𝗄𝖻𝗅𝖺𝖼𝗄\mathsf{black}sansserif_black/𝗐𝗁𝗂𝗍𝖾𝗐𝗁𝗂𝗍𝖾\mathsf{white}sansserif_white with probability q𝑞qitalic_q and 1q1𝑞1-q1 - italic_q, respectively. Let U2wsubscriptsuperscript𝑈𝑤2U^{w}_{2}italic_U start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT be the random variable describing the number of nodes of U2subscript𝑈2U_{2}italic_U start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT set to 𝗐𝗁𝗂𝗍𝖾𝗐𝗁𝗂𝗍𝖾\mathsf{white}sansserif_white in this process. Let α𝛼\alphaitalic_α be the random variable describing the number of useful edges in MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT after that process. We obtain 𝔼[α]𝔼[|U1|+|U2w|]=|U1|+(1q)|U2||U1|+|U2|/2|U|/2q(1q)Δ/10𝔼𝛼𝔼subscript𝑈1subscriptsuperscript𝑈𝑤2subscript𝑈11𝑞subscript𝑈2subscript𝑈1subscript𝑈22𝑈2𝑞1𝑞Δ10\operatorname{\mathbb{E}}[\alpha]\geq\operatorname{\mathbb{E}}[|U_{1}|+|U^{w}_% {2}|]=|U_{1}|+(1-q)|U_{2}|\geq|U_{1}|+|U_{2}|/2\geq|U|/2\geq q(1-q)\Delta/10blackboard_E [ italic_α ] ≥ blackboard_E [ | italic_U start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | + | italic_U start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | ] = | italic_U start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | + ( 1 - italic_q ) | italic_U start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | ≥ | italic_U start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | + | italic_U start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | / 2 ≥ | italic_U | / 2 ≥ italic_q ( 1 - italic_q ) roman_Δ / 10 , where we used that (1q)1/21𝑞12(1-q)\geq 1/2( 1 - italic_q ) ≥ 1 / 2. The event Csubscript𝐶\mathcal{E}_{C}caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT holds if αq2(1q)3Δ/20𝛼superscript𝑞2superscript1𝑞3Δ20\alpha\leq q^{2}(1-q)^{3}\Delta/20italic_α ≤ italic_q start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( 1 - italic_q ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT roman_Δ / 20, which is smaller than 𝔼[α]/2𝔼𝛼2\operatorname{\mathbb{E}}[\alpha]/2blackboard_E [ italic_α ] / 2. Hence, we obtain that Csubscript𝐶\mathcal{E}_{C}caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT happens with probability at most exp(Ω(Δ))ΩΔ\exp(-\Omega(\Delta))roman_exp ( - roman_Ω ( roman_Δ ) ) by a Chernoff bound. ∎

Lemma 5.13.

1subscript1\mathcal{L}_{1}caligraphic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is simulatable.

Proof.

Each event vsubscript𝑣\mathcal{E}_{v}caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT depends only on variables that are immediately incident to the node v:vbl(v){𝗍𝗋𝗎𝖾,𝖿𝖺𝗅𝗌𝖾}:subscript𝑣maps-tovblsubscript𝑣𝗍𝗋𝗎𝖾𝖿𝖺𝗅𝗌𝖾\mathcal{E}_{v}:\textsf{vbl}(\mathcal{E}_{v})\mapsto\{\mathsf{true},\mathsf{% false}\}caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT : vbl ( caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) ↦ { sansserif_true , sansserif_false } is a function counting the number of nodes that is known to (v)subscript𝑣\ell(\mathcal{E}_{v})roman_ℓ ( caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ). Hence, the simulatability condition holds for vsubscript𝑣\mathcal{E}_{v}caligraphic_E start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT. For Csubscript𝐶\mathcal{E}_{C}caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT all variables are simulated by nodes that are immediately incident to the AC C𝐶Citalic_C and full knowledge about these variables can be relayed to the leader in the AC that simulates event Csubscript𝐶\mathcal{E}_{C}caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT. Again, whether the event Csubscript𝐶\mathcal{E}_{C}caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT holds can be evaluated with the values of the variables and the edges in MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT, also for all conditional probabilities of partial assignments, as (C)subscript𝐶\ell(\mathcal{E}_{C})roman_ℓ ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) has full knowledge of the function C:vbl(C){𝗍𝗋𝗎𝖾,𝖿𝖺𝗅𝗌𝖾}:subscript𝐶maps-tovblsubscript𝐶𝗍𝗋𝗎𝖾𝖿𝖺𝗅𝗌𝖾\mathcal{E}_{C}:\textsf{vbl}(\mathcal{E}_{C})\mapsto\{\mathsf{true},\mathsf{% false}\}caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT : vbl ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) ↦ { sansserif_true , sansserif_false }. ∎

Let x=q2(1q)3Δ/20𝑥superscript𝑞2superscript1𝑞3Δ20x=q^{2}(1-q)^{3}\Delta/20italic_x = italic_q start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( 1 - italic_q ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT roman_Δ / 20 be the threshold of the number of useful edges that are guaranteed in each MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT for each important AC by 1subscript1\mathcal{L}_{1}caligraphic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT (see Definition 5.10). The second LLL is significantly simpler and given by the following definition.

Definition 5.14 (Second sampling LLL).

We define the following LLL 2subscript2\mathcal{L}_{2}caligraphic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. We split Z𝑍Zitalic_Z into two sets Z1subscript𝑍1Z_{1}italic_Z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and Z2subscript𝑍2Z_{2}italic_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT where each node in Z𝑍Zitalic_Z flips an unbiased coin which set to join. There are bad events C,isubscript𝐶𝑖\mathcal{E}_{C,i}caligraphic_E start_POSTSUBSCRIPT italic_C , italic_i end_POSTSUBSCRIPT, i=1,2𝑖12i=1,2italic_i = 1 , 2 and for each important AC C𝐶Citalic_C, that hold if there are fewer than x/3𝑥3x/3italic_x / 3 useful edges in U(C,Zi)𝑈𝐶subscript𝑍𝑖U(C,Z_{i})italic_U ( italic_C , italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), respectively.

This LLL can be solved by a result in [32].

Lemma 5.15.

There is a polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n-round 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST algorithm for 2subscript2\mathcal{L}_{2}caligraphic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT.

Proof.

Form the bipartite graph H=(U,Z,EH)𝐻𝑈𝑍subscript𝐸𝐻H=(U,Z,E_{H})italic_H = ( italic_U , italic_Z , italic_E start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT ) with the nodes of Z𝑍Zitalic_Z on one side and a node uCsubscript𝑢𝐶u_{C}italic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT for each important AC C𝐶Citalic_C on the other side. There is an edge (uC,z)subscript𝑢𝐶𝑧(u_{C},z)( italic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_z ) for each useful arc vzU(C,Z)𝑣𝑧𝑈𝐶𝑍\overrightarrow{vz}\in U(C,Z)over→ start_ARG italic_v italic_z end_ARG ∈ italic_U ( italic_C , italic_Z ). Each node uCsubscript𝑢𝐶u_{C}italic_u start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT has degree at least xq2Δ/30=Ω(Δ)𝑥superscript𝑞2Δ30ΩΔx\geq q^{2}\Delta/30=\Omega(\Delta)italic_x ≥ italic_q start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Δ / 30 = roman_Ω ( roman_Δ ) (by Lemma 4.6), while each node zZ𝑧𝑍z\in Zitalic_z ∈ italic_Z has degree at most Δ/q(n)Δ𝑞𝑛\Delta/q(n)roman_Δ / italic_q ( italic_n ) (as z𝑧zitalic_z is large). Splitting the subset Z𝑍Zitalic_Z into two parts such that each node vU𝑣𝑈v\in Uitalic_v ∈ italic_U has between d(v)/2𝑑𝑣2d(v)/2italic_d ( italic_v ) / 2 and 3d(v)/23𝑑𝑣23d(v)/23 italic_d ( italic_v ) / 2 neighbors into each part is a vertex subset-splitting problem formulated as bounded-risk LLL and solved in Lemma D.11(1) of [32].

We only need to verify that this problem remains simulatable in our embedded setting. The problem is simulatable because the node (C)subscript𝐶\ell(\mathcal{E}_{C})roman_ℓ ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) can obtain full knowledge of any partial assignment of vbl(C)vblsubscript𝐶\textsf{vbl}(\mathcal{E}_{C})vbl ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) and knows the function C:vbl(C){𝗍𝗋𝗎𝖾,𝖿𝖺𝗅𝗌𝖾}:subscript𝐶maps-tovblsubscript𝐶𝗍𝗋𝗎𝖾𝖿𝖺𝗅𝗌𝖾\mathcal{E}_{C}:\textsf{vbl}(\mathcal{E}_{C})\mapsto\{\mathsf{true},\mathsf{% false}\}caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT : vbl ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) ↦ { sansserif_true , sansserif_false }. ∎

Proof of Lemma 4.5.

First, apply Theorem 5.5 in order to solve 1subscript1\mathcal{L}_{1}caligraphic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT in polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n rounds yielding a set Z𝑍Zitalic_Z that avoids all bad events of 1subscript1\mathcal{L}_{1}caligraphic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. The conditions of the theorem are met by Lemmas 5.13 and 5.12 and as Δlog10lognΔsuperscript10𝑛\Delta\geq\log^{10}\log nroman_Δ ≥ roman_log start_POSTSUPERSCRIPT 10 end_POSTSUPERSCRIPT roman_log italic_n implies that the criterion is strong enough. We split the set Z𝑍Zitalic_Z into Z1subscript𝑍1Z_{1}italic_Z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and Z2subscript𝑍2Z_{2}italic_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT by solving 2subscript2\mathcal{L}_{2}caligraphic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, by Lemma 5.15 The requirements of the theorem are satisfied as Δlog10lognΔsuperscript10𝑛\Delta\geq\log^{10}\log nroman_Δ ≥ roman_log start_POSTSUPERSCRIPT 10 end_POSTSUPERSCRIPT roman_log italic_n.

The degree bound immediately follows from the conditions on Z𝑍Zitalic_Z imposed in the neighborhood of each vertex v𝑣vitalic_v by 1subscript1\mathcal{L}_{1}caligraphic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT (note that Z=Z1Z2)Z=Z_{1}\cup Z_{2})italic_Z = italic_Z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∪ italic_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ). The second property follows from the avoided events of 2subscript2\mathcal{L}_{2}caligraphic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT for each important AC. ∎

5.4 Forming Triples

See 4.6

We model the problem of finding zCsubscript𝑧𝐶z_{C}italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT for each important AC C𝐶Citalic_C as a disjoint variable set LLL, where the disjoint sets Z1subscript𝑍1Z_{1}italic_Z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and Z2subscript𝑍2Z_{2}italic_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT give rise to two disjoint sets of variables. Note that the respective nodes xCsubscript𝑥𝐶x_{C}italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT and yCsubscript𝑦𝐶y_{C}italic_y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT will only be computed in the sequel via a deterministic method. Recall, that Z=Z1Z2𝑍subscript𝑍1subscript𝑍2Z=Z_{1}\cup Z_{2}italic_Z = italic_Z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∪ italic_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT.

Definition 5.16.

Define the following disjoint variable set LLL 3subscript3\mathcal{L}_{3}caligraphic_L start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT.

  • Variables: For each important AC C𝐶Citalic_C and each useful arc vzU(C,Z)𝑣𝑧𝑈𝐶𝑍\overrightarrow{vz}\in U(C,Z)over→ start_ARG italic_v italic_z end_ARG ∈ italic_U ( italic_C , italic_Z ), there is a binary random variable xvzsubscript𝑥𝑣𝑧x_{vz}italic_x start_POSTSUBSCRIPT italic_v italic_z end_POSTSUBSCRIPT that assumes 1111 with probability p3=q(n)/Δsubscript𝑝3𝑞𝑛Δp_{3}=q(n)/\Deltaitalic_p start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = italic_q ( italic_n ) / roman_Δ.

  • Events: We call a useful arc vzU(C,Z)𝑣𝑧𝑈𝐶𝑍\overrightarrow{vz}\in U(C,Z)over→ start_ARG italic_v italic_z end_ARG ∈ italic_U ( italic_C , italic_Z ) successful if AC C𝐶Citalic_C activated vz𝑣𝑧\overrightarrow{vz}over→ start_ARG italic_v italic_z end_ARG and no other AC activated an edge wz𝑤𝑧\overrightarrow{wz}over→ start_ARG italic_w italic_z end_ARG (for some w𝑤witalic_w). There is one bad event Csubscript𝐶\mathcal{E}_{C}caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT for each important AC C𝐶Citalic_C that holds if there is no successful edge for C𝐶Citalic_C. We introduce corresponding events C,1subscript𝐶1\mathcal{E}_{C,1}caligraphic_E start_POSTSUBSCRIPT italic_C , 1 end_POSTSUBSCRIPT and C,2subscript𝐶2\mathcal{E}_{C,2}caligraphic_E start_POSTSUBSCRIPT italic_C , 2 end_POSTSUBSCRIPT restricted to arcs in U(C,Zi)𝑈𝐶subscript𝑍𝑖U(C,Z_{i})italic_U ( italic_C , italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), respectively. We have =C,1C,2subscript𝐶1subscript𝐶2\mathcal{E}=\mathcal{E}_{C,1}\cap\mathcal{E}_{C,2}caligraphic_E = caligraphic_E start_POSTSUBSCRIPT italic_C , 1 end_POSTSUBSCRIPT ∩ caligraphic_E start_POSTSUBSCRIPT italic_C , 2 end_POSTSUBSCRIPT.

  • The home node of Csubscript𝐶\mathcal{E}_{C}caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT is (C)=vCsubscript𝐶subscript𝑣𝐶\ell(\mathcal{E}_{C})=v_{C}roman_ℓ ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) = italic_v start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT where vCsubscript𝑣𝐶v_{C}italic_v start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT is the node of C𝐶Citalic_C with largest ID. The home node of xvzsubscript𝑥𝑣𝑧x_{vz}italic_x start_POSTSUBSCRIPT italic_v italic_z end_POSTSUBSCRIPT is (xvz)=zsubscript𝑥𝑣𝑧𝑧\ell(x_{vz})=zroman_ℓ ( italic_x start_POSTSUBSCRIPT italic_v italic_z end_POSTSUBSCRIPT ) = italic_z.

Lemma 5.17.

For each important AC C𝐶Citalic_C and each i=1,2𝑖12i=1,2italic_i = 1 , 2, we have Pr(C,i)2Ω(q(n))Prsubscript𝐶𝑖superscript2Ω𝑞𝑛\Pr(\mathcal{E}_{C,i})\leq 2^{-\Omega(q(n))}roman_Pr ( caligraphic_E start_POSTSUBSCRIPT italic_C , italic_i end_POSTSUBSCRIPT ) ≤ 2 start_POSTSUPERSCRIPT - roman_Ω ( italic_q ( italic_n ) ) end_POSTSUPERSCRIPT and the dependency degree of 3subscript3\mathcal{L}_{3}caligraphic_L start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT is upper bounded by d=O(Δ3)𝑑𝑂superscriptΔ3d=O(\Delta^{3})italic_d = italic_O ( roman_Δ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ).

Proof.

Fix an important AC C𝐶Citalic_C and i1,2𝑖12i\in{1,2}italic_i ∈ 1 , 2. For important arc vzU(C,Zi)𝑣𝑧𝑈𝐶subscript𝑍𝑖\overrightarrow{vz}\in U(C,Z_{i})over→ start_ARG italic_v italic_z end_ARG ∈ italic_U ( italic_C , italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), let Avzsubscript𝐴𝑣𝑧A_{vz}italic_A start_POSTSUBSCRIPT italic_v italic_z end_POSTSUBSCRIPT be the event that vz𝑣𝑧\overrightarrow{vz}over→ start_ARG italic_v italic_z end_ARG is successful. Observe that Avzsubscript𝐴𝑣𝑧A_{vz}italic_A start_POSTSUBSCRIPT italic_v italic_z end_POSTSUBSCRIPT depends only on arcs with z𝑧zitalic_z as tail. It holds if vz𝑣𝑧\overrightarrow{vz}over→ start_ARG italic_v italic_z end_ARG is activated (i.e., has xvz=1subscript𝑥𝑣𝑧1x_{vz}=1italic_x start_POSTSUBSCRIPT italic_v italic_z end_POSTSUBSCRIPT = 1) while the other arcs with z𝑧zitalic_z as tail are not activated. The external degree of each zZ𝒪l𝑧𝑍subscript𝒪𝑙z\in Z\subseteq\mathcal{O}_{l}italic_z ∈ italic_Z ⊆ caligraphic_O start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT is at most Δ/q(n)Δ𝑞𝑛\Delta/q(n)roman_Δ / italic_q ( italic_n ), as it is large, so at most that many useful arcs have z𝑧zitalic_z as tail. Thus, Pr(Avz)Pr(xvz=1)(1p3)Δ/10=p3(1q(n)/Δ)Δ/q(n)p3(1/4)1/100.9p3Prsubscript𝐴𝑣𝑧Prsubscript𝑥𝑣𝑧1superscript1subscript𝑝3Δ10subscript𝑝3superscript1𝑞𝑛ΔΔ𝑞𝑛subscript𝑝3superscript141100.9subscript𝑝3\Pr(A_{vz})\geq\Pr(x_{vz}=1)\cdot(1-p_{3})^{\Delta/10}=p_{3}(1-q(n)/\Delta)^{% \Delta/q(n)}\geq p_{3}(1/4)^{1/10}\geq 0.9p_{3}roman_Pr ( italic_A start_POSTSUBSCRIPT italic_v italic_z end_POSTSUBSCRIPT ) ≥ roman_Pr ( italic_x start_POSTSUBSCRIPT italic_v italic_z end_POSTSUBSCRIPT = 1 ) ⋅ ( 1 - italic_p start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT roman_Δ / 10 end_POSTSUPERSCRIPT = italic_p start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( 1 - italic_q ( italic_n ) / roman_Δ ) start_POSTSUPERSCRIPT roman_Δ / italic_q ( italic_n ) end_POSTSUPERSCRIPT ≥ italic_p start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( 1 / 4 ) start_POSTSUPERSCRIPT 1 / 10 end_POSTSUPERSCRIPT ≥ 0.9 italic_p start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT.

The events Avzsubscript𝐴𝑣𝑧A_{vz}italic_A start_POSTSUBSCRIPT italic_v italic_z end_POSTSUBSCRIPT and Avzsubscript𝐴superscript𝑣superscript𝑧A_{v^{\prime}z^{\prime}}italic_A start_POSTSUBSCRIPT italic_v start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_z start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT are independent, as each they involve disjoint sets of arcs. The bad event C,isubscript𝐶𝑖\mathcal{E}_{C,i}caligraphic_E start_POSTSUBSCRIPT italic_C , italic_i end_POSTSUBSCRIPT holds only when no useful edge in U(C,Zi)𝑈𝐶subscript𝑍𝑖U(C,Z_{i})italic_U ( italic_C , italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) becomes successful, which occurs with probability

Pr(C,i)Prsubscript𝐶𝑖\displaystyle\Pr(\mathcal{E}_{C,i})roman_Pr ( caligraphic_E start_POSTSUBSCRIPT italic_C , italic_i end_POSTSUBSCRIPT ) =vzU(C,Zi)Pr(Avz¯)(10.9p3)|U(C,Zi)|absentsubscriptproduct𝑣𝑧𝑈𝐶subscript𝑍𝑖Pr¯subscript𝐴𝑣𝑧superscript10.9subscript𝑝3𝑈𝐶subscript𝑍𝑖\displaystyle=\prod_{\overrightarrow{vz}\in U(C,Z_{i})}\Pr(\overline{A_{vz}})% \leq(1-0.9p_{3})^{|U(C,Z_{i})|}= ∏ start_POSTSUBSCRIPT over→ start_ARG italic_v italic_z end_ARG ∈ italic_U ( italic_C , italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT roman_Pr ( over¯ start_ARG italic_A start_POSTSUBSCRIPT italic_v italic_z end_POSTSUBSCRIPT end_ARG ) ≤ ( 1 - 0.9 italic_p start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT | italic_U ( italic_C , italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) | end_POSTSUPERSCRIPT
(10.9q(n)/Δ)q2(1q)3Δ/60eΩ(q(n))absentsuperscript10.9𝑞𝑛Δsuperscript𝑞2superscript1𝑞3Δ60superscript𝑒Ω𝑞𝑛\displaystyle\leq(1-0.9q(n)/\Delta)^{q^{2}(1-q)^{3}\Delta/60}\leq e^{-\Omega(q% (n))}≤ ( 1 - 0.9 italic_q ( italic_n ) / roman_Δ ) start_POSTSUPERSCRIPT italic_q start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( 1 - italic_q ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT roman_Δ / 60 end_POSTSUPERSCRIPT ≤ italic_e start_POSTSUPERSCRIPT - roman_Ω ( italic_q ( italic_n ) ) end_POSTSUPERSCRIPT

The dependency degree is upper bounded by O(Δ3)𝑂superscriptΔ3O(\Delta^{3})italic_O ( roman_Δ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) because the events of an AC only share variables with ACs that are within distance 2222 from one of the ΔΔ\Deltaroman_Δ nodes of the AC. ∎

Lemma 5.18.

3subscript3\mathcal{L}_{3}caligraphic_L start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT is simulatable.

Proof.

The respective nodes can check in O(1)𝑂1O(1)italic_O ( 1 ) rounds whether their events hold by an assignment and the aggregation primitives can be implemented efficiently as all variables are in distance at most 2222 from the ACs.

We next reason why we can compute the conditional probabilities of Definition 5.2. Let ψ𝜓\psiitalic_ψ be any partial assignment. To compute the conditional probabilities Pr(Cψ)Prconditionalsubscript𝐶𝜓\Pr(\mathcal{E}_{C}\mid\psi)roman_Pr ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ∣ italic_ψ ), Pr(C,1ψ)Prconditionalsubscript𝐶1𝜓\Pr(\mathcal{E}_{C,1}\mid\psi)roman_Pr ( caligraphic_E start_POSTSUBSCRIPT italic_C , 1 end_POSTSUBSCRIPT ∣ italic_ψ ) and Pr(C,2ψ)Prconditionalsubscript𝐶2𝜓\Pr(\mathcal{E}_{C,2}\mid\psi)roman_Pr ( caligraphic_E start_POSTSUBSCRIPT italic_C , 2 end_POSTSUBSCRIPT ∣ italic_ψ ), the node holding the respective event needs to compute the probability that one of the useful edges becomes successful for C𝐶Citalic_C. The conditional probability is 00 if there already is a useful edge that is successful for C𝐶Citalic_C. For all other useful edges in MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT, the probability of becoming successful is independent as the activation by C𝐶Citalic_C happens independently, and also, all activations from other ACs do influence at most one useful edge in MCsubscript𝑀𝐶M_{C}italic_M start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT. The probability to become successful for a single useful edge (v,z)𝑣𝑧(v,z)( italic_v , italic_z ), vC,zZformulae-sequence𝑣𝐶𝑧𝑍v\in C,z\in Zitalic_v ∈ italic_C , italic_z ∈ italic_Z conditioned on ψ𝜓\psiitalic_ψ can be computed from knowing whether C𝐶Citalic_C activated (v,z)𝑣𝑧(v,z)( italic_v , italic_z ) in ψ𝜓\psiitalic_ψ, whether any other edge (v,z)superscript𝑣𝑧(v^{\prime},z)( italic_v start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_z ) is activated in ψ𝜓\psiitalic_ψ and from the number of useful edges of other ACs with endpoint z𝑧zitalic_z that evaluate to bottom\bot under ψ𝜓\psiitalic_ψ. The nodes (C)=(C,1)=(C,1)subscript𝐶subscript𝐶1subscript𝐶1\ell(\mathcal{E}_{C})=\ell(\mathcal{E}_{C,1})=\ell(\mathcal{E}_{C,1})roman_ℓ ( caligraphic_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) = roman_ℓ ( caligraphic_E start_POSTSUBSCRIPT italic_C , 1 end_POSTSUBSCRIPT ) = roman_ℓ ( caligraphic_E start_POSTSUBSCRIPT italic_C , 1 end_POSTSUBSCRIPT ) can learn all this information in O(1)𝑂1O(1)italic_O ( 1 ) rounds using O(loglogn)𝑂𝑛O(\log\log n)italic_O ( roman_log roman_log italic_n ) bits of communication per edge (here we use that ΔpolylognΔpoly𝑛\Delta\leq\operatorname{\text{{\rm poly}}}\log nroman_Δ ≤ poly roman_log italic_n to communicate the aforementioned number efficiently). This can be performed in parallel for the events of all important ACs. Knowing the probability for each edge to become successful, the respective node can compute the conditional probability for the event. This proof also subsumes that the events can be evaluated efficiently, as we did not require the locally unique IDs from a smaller ID space that are given by Definition 5.2. ∎

Proof of Lemma 4.6.

Fix an important AC C𝐶Citalic_C. First, we use Theorem 5.3 to solve 3subscript3\mathcal{L}_{3}caligraphic_L start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT with the sets Z1subscript𝑍1Z_{1}italic_Z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and Z2subscript𝑍2Z_{2}italic_Z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT given from Lemma 4.5. The algorithm runs in polyloglognpoly𝑛\operatorname{\text{{\rm poly}}}\log\log npoly roman_log roman_log italic_n rounds and works w.h.p. It provides us with a successful edge (yC,zC)subscript𝑦𝐶subscript𝑧𝐶(y_{C},z_{C})( italic_y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) for the AC C𝐶Citalic_C (see Definition 5.16). Next, we show that we can deterministically compute a node xCsubscript𝑥𝐶x_{C}italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT to form the triple of nodes as required for Lemma 4.6 in O(1)𝑂1O(1)italic_O ( 1 ) rounds.

The nodes in C𝐶Citalic_C that cannot be used for xCsubscript𝑥𝐶x_{C}italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT are those that are either: a) neighbors of zCsubscript𝑧𝐶z_{C}italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT, b) already colored, or c) function as zCsubscript𝑧superscript𝐶z_{C^{\prime}}italic_z start_POSTSUBSCRIPT italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT for another important AC C𝐶Citalic_C. As zCsubscript𝑧𝐶z_{C}italic_z start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT is large, it has at most Δ/q(n)Δ𝑞𝑛\Delta/q(n)roman_Δ / italic_q ( italic_n ) neighbors in C𝐶Citalic_C. By Lemma 4.3, at most O(log4lognlogΔ)=O(log5logn)𝑂superscript4𝑛Δ𝑂superscript5𝑛O(\log^{4}\log n\cdot\log\Delta)=O(\log^{5}\log n)italic_O ( roman_log start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT roman_log italic_n ⋅ roman_log roman_Δ ) = italic_O ( roman_log start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT roman_log italic_n ) nodes of C𝐶Citalic_C are already colored.

By Lemma 4.5, we obtain at most |ZC||N(v)Z|Δ/10𝑍𝐶𝑁𝑣𝑍Δ10|Z\cap C|\leq|N(v)\cap Z|\leq\Delta/10| italic_Z ∩ italic_C | ≤ | italic_N ( italic_v ) ∩ italic_Z | ≤ roman_Δ / 10 nodes in C𝐶Citalic_C are candidates for being the outside node in a triple. Hence, at least |C|Δ/q(n)O(log5logn)Δ/10Δ/2𝐶Δ𝑞𝑛𝑂superscript5𝑛Δ10Δ2|C|-\Delta/q(n)-O(\log^{5}\log n)-\Delta/10\geq\Delta/2| italic_C | - roman_Δ / italic_q ( italic_n ) - italic_O ( roman_log start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT roman_log italic_n ) - roman_Δ / 10 ≥ roman_Δ / 2 nodes in C𝐶Citalic_C will do as a xCsubscript𝑥𝐶x_{C}italic_x start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT-node.

The graph induced by {zc:C is an important AC}Z𝒪lconditional-setsubscript𝑧𝑐𝐶 is an important AC𝑍subscript𝒪𝑙\{z_{c}:C\text{ is an important AC}\}\subseteq Z\subseteq\mathcal{O}_{l}{ italic_z start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT : italic_C is an important AC } ⊆ italic_Z ⊆ caligraphic_O start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT has maximum degree Δ/10Δ10\Delta/10roman_Δ / 10 as Lemma 4.5 ensures that |N(v)Z|Δ/10𝑁𝑣𝑍Δ10|N(v)\cap Z|\leq\Delta/10| italic_N ( italic_v ) ∩ italic_Z | ≤ roman_Δ / 10 for all v𝒪l𝑣subscript𝒪𝑙v\in\mathcal{O}_{l}italic_v ∈ caligraphic_O start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT. ∎

References

  • AKM [22] Sepehr Assadi, Pankaj Kumar, and Parth Mittal. Brooks’ theorem in graph streams: a single-pass semi-streaming algorithm for ΔΔ\Deltaroman_Δ-coloring. In Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing, pages 234–247, 2022.
  • Bar [15] L. Barenboim. Deterministic (ΔΔ\Deltaroman_Δ + 1)-coloring in sublinear (in ΔΔ\Deltaroman_Δ) time in static, dynamic and faulty networks. In Proc. 34th ACM Symposium on Principles of Distributed Computing (PODC), pages 345–354, 2015.
  • BBKO [22] Alkida Balliu, Sebastian Brandt, Fabian Kuhn, and Dennis Olivetti. Distributed ΔΔ{\Delta}roman_Δ-coloring plays hide-and-seek. In Proc. 54th ACM Symp. on Theory of Computing (STOC), 2022.
  • BCM+ [21] Alkida Balliu, Keren Censor-Hillel, Yannic Maus, Dennis Olivetti, and Jukka Suomela. Locally checkable labelings with small messages. In Seth Gilbert, editor, 35th International Symposium on Distributed Computing, DISC 2021, October 4-8, 2021, Freiburg, Germany (Virtual Conference), volume 209 of LIPIcs, pages 8:1–8:18. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2021.
  • BE [13] Leonid Barenboim and Michael Elkin. Distributed Graph Coloring: Fundamentals and Recent Developments. Morgan & Claypool Publishers, 2013.
  • BE [19] Étienne Bamas and Louis Esperet. Distributed coloring of graphs with an optimal number of colors. volume 126 of LIPIcs, pages 10:1–10:15. LZI, 2019.
  • BEPS [16] Leonid Barenboim, Michael Elkin, Seth Pettie, and Johannes Schneider. The locality of distributed symmetry breaking. Journal of the ACM, 63(3):20:1–20:45, 2016.
  • BFH+ [16] Sebastian Brandt, Orr Fischer, Juho Hirvonen, Barbara Keller, Tuomo Lempiäinen, Joel Rybicki, Jukka Suomela, and Jara Uitto. A lower bound for the distributed Lovász local lemma. In Proc. 48th ACM Symposium on Theory of Computing (STOC 2016), pages 479–488. ACM, 2016.
  • BKM [20] Philipp Bamberger, Fabian Kuhn, and Yannic Maus. Efficient deterministic distributed coloring with small bandwidth. In PODC ’20: ACM Symposium on Principles of Distributed Computing, Virtual Event, Italy, August 3-7, 2020, pages 243–252, 2020.
  • Bro [41] R. Leonard Brooks. On colouring the nodes of a network. Mathematical Proceedings of the Cambridge Philosophical Society, 37(2):194–197, 1941.
  • CCDM [24] Sam Coy, Artur Czumaj, Peter Davies, and Gopinath Mishra. Parallel derandomization for coloring, 2024. Note: https://arxiv.longhoe.net/abs/2302.04378v1 contains the Delta-coloring algorithm.
  • CHL+ [20] Yi-Jun Chang, Qizheng He, Wenzheng Li, Seth Pettie, and Jara Uitto. Distributed edge coloring and a special case of the constructive Lovász local lemma. ACM Trans. Algorithms, 2020.
  • CLP [18] Yi-Jun Chang, Wenzheng Li, and Seth Pettie. An optimal distributed (ΔΔ\Deltaroman_Δ+1)-coloring algorithm? In Proceedings of the ACM Symposium on Theory of Computing (STOC), pages 445–456, 2018.
  • CM [19] Shiri Chechik and Doron Mukhtar. Optimal distributed coloring algorithms for planar graphs in the LOCAL model. In Timothy M. Chan, editor, Proceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2019, San Diego, California, USA, January 6-9, 2019, pages 787–804. SIAM, 2019.
  • CP [19] Yi-Jun Chang and Seth Pettie. A time hierarchy theorem for the LOCAL model. SIAM J. Comput., 48(1):33–69, 2019.
  • CPS [17] Kai-Min Chung, Seth Pettie, and Hsin-Hao Su. Distributed algorithms for the Lovász local lemma and graph coloring. Distributed Comput., 30(4):261–280, 2017.
  • Dav [23] Peter Davies. Improved distributed algorithms for the Lovász local lemma and edge coloring. In Proceedings of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 4273–4295. SIAM, 2023.
  • EL [74] Paul Erdös and László Lovász. Problems and Results on 3-chromatic Hypergraphs and some Related Questions. Colloquia Mathematica Societatis János Bolyai, pages 609–627, 1974.
  • EPS [15] Michael Elkin, Seth Pettie, and Hsin-Hao Su. (2Δ1Δ1\Delta-1roman_Δ - 1)-edge-coloring is much easier than maximal matching in the distributed setting. In Proceedings of the Twenty-Sixth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2015, San Diego, CA, USA, January 4-6, 2015, pages 355–370, 2015.
  • FG [17] Manuela Fischer and Mohsen Ghaffari. Sublogarithmic Distributed Algorithms for Lovász Local Lemma, and the Complexity Hierarchy. In the Proceedings of the 31st International Symposium on Distributed Computing (DISC), pages 18:1–18:16, 2017.
  • FHK [16] Pierre Fraigniaud, Marc Heinrich, and Adrian Kosowski. Local conflict coloring. In Proceedings of the IEEE Symposium on Foundations of Computer Science (FOCS), pages 625–634, 2016.
  • FHM [23] Manuela Fischer, Magnús M. Halldórsson, and Yannic Maus. Fast distributed Brooks’ theorem. In Proceedings of the SIAM-ACM Symposium on Discrete Algorithms (SODA), pages 2567–2588, 2023.
  • Fis [17] Manuela Fischer. Improved deterministic distributed matching via rounding. In Andréa W. Richa, editor, 31st International Symposium on Distributed Computing, DISC 2017, October 16-20, 2017, Vienna, Austria, volume 91 of LIPIcs, pages 17:1–17:15. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2017.
  • FK [23] Marc Fuchs and Fabian Kuhn. List defective colorings: Distributed algorithms and applications. In Rotem Oshman, editor, 37th International Symposium on Distributed Computing, DISC 2023, October 10-12, 2023, L’Aquila, Italy, volume 281 of LIPIcs, pages 22:1–22:23. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2023.
  • Gha [19] Mohsen Ghaffari. Distributed maximal independent set using small messages. In Proc. 30th Symp. on Discrete Algorithms (SODA), pages 805–820, 2019.
  • GHK [18] Mohsen Ghaffari, David G. Harris, and Fabian Kuhn. On derandomizing local distributed algorithms. In 59th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2018, Paris, France, October 7-9, 2018, pages 662–673, 2018.
  • GHKM [18] Mohsen Ghaffari, Juho Hirvonen, Fabian Kuhn, and Yannic Maus. Improved distributed delta-coloring. In Proceedings of the 2018 ACM Symposium on Principles of Distributed Computing, PODC 2018, Egham, United Kingdom, July 23-27, 2018, pages 427–436, 2018.
  • GK [21] Mohsen Ghaffari and Fabian Kuhn. Deterministic distributed vertex coloring: Simpler, faster, and without network decomposition. In Proceedings of the IEEE Symposium on Foundations of Computer Science (FOCS), pages 1009–1020, 2021.
  • HKMT [21] Magnús M. Halldórsson, Fabian Kuhn, Yannic Maus, and Tigran Tonoyan. Efficient randomized distributed coloring in CONGEST. In Proceedings of the ACM Symposium on Theory of Computing (STOC), pages 1180–1193, 2021. Full version at CoRR abs/2105.04700.
  • HKNT [22] Magnús M. Halldórsson, Fabian Kuhn, Alexandre Nolin, and Tigran Tonoyan. Near-optimal distributed degree+1 coloring. In Stefano Leonardi and Anupam Gupta, editors, STOC ’22: 54th Annual ACM SIGACT Symposium on Theory of Computing, Rome, Italy, June 20 - 24, 2022, pages 450–463. ACM, 2022.
  • HMN [22] Magnús M. Halldórsson, Yannic Maus, and Alexandre Nolin. Fast distributed vertex splitting with applications. In Christian Scheideler, editor, 36th International Symposium on Distributed Computing, DISC 2022, October 25-27, 2022, Augusta, Georgia, USA, volume 246 of LIPIcs, pages 26:1–26:24. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2022.
  • HMP [24] Magnús M. Halldórsson, Yannic Maus, and Saku Peltonen. Distributed Lovász local lemma under bandwidth limitations, 2024.
  • HN [21] Magnús M. Halldórsson and Alexandre Nolin. Superfast coloring in CONGEST via efficient color sampling. In Tomasz Jurdzinski and Stefan Schmid, editors, Structural Information and Communication Complexity - 28th International Colloquium, SIROCCO 2021, Wrocław, Poland, June 28 - July 1, 2021, Proceedings, volume 12810 of Lecture Notes in Computer Science, pages 68–83. Springer, 2021.
  • HNT [22] Magnús M. Halldórsson, Alexandre Nolin, and Tigran Tonoyan. Overcoming congestion in distributed coloring. In Proceedings of the ACM Symposium on Principles of Distributed Computing (PODC), pages 26–36. ACM, 2022.
  • HSS [18] David G. Harris, Johannes Schneider, and Hsin-Hao Su. Distributed (Δ+1Δ1\Delta+1roman_Δ + 1)-coloring in sublogarithmic rounds. Journal of the ACM, 65:19:1–19:21, 2018.
  • Joh [99] Öjvind Johansson. Simple distributed Δ+1Δ1\Delta+1roman_Δ + 1-coloring of graphs. Inf. Process. Lett., 70(5):229–232, 1999.
  • Lin [92] Nati Linial. Locality in distributed graph algorithms. SIAM Journal on Computing, 21(1):193–201, 1992.
  • MPU [23] Yannic Maus, Saku Peltonen, and Jara Uitto. Distributed symmetry breaking on power graphs via sparsification. In Proceedings of the 2023 ACM Symposium on Principles of Distributed Computing, PODC ’23, page 157–167, New York, NY, USA, 2023. Association for Computing Machinery.
  • MT [20] Yannic Maus and Tigran Tonoyan. Local conflict coloring revisited: Linial for lists. In Hagit Attiya, editor, 34th International Symposium on Distributed Computing, DISC 2020, October 12-16, 2020, Virtual Conference, volume 179 of LIPIcs, pages 16:1–16:18. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2020.
  • MU [21] Yannic Maus and Jara Uitto. Efficient CONGEST algorithms for the Lovász local lemma. In Seth Gilbert, editor, Proceedings of the International Symposium on Distributed Computing (DISC), volume 209 of LIPIcs, pages 31:1–31:19. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2021.
  • Pos [19] Luke Postle. Linear-time and efficient distributed algorithms for list coloring graphs on surfaces. In David Zuckerman, editor, 60th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2019, Baltimore, Maryland, USA, November 9-12, 2019, pages 929–941. IEEE Computer Society, 2019.
  • PS [95] Alessandro Panconesi and Aravind Srinivasan. The local nature of ΔΔ\Deltaroman_Δ-coloring and its algorithmic applications. Combinatorica, 15(2):255–280, 1995.
  • RG [20] Václav Rozhoň and Mohsen Ghaffari. Polylogarithmic-time deterministic network decomposition and distributed derandomization. In Proceedings of the ACM Symposium on Theory of Computing (STOC), pages 350–363, 2020.

Appendix A Concentration Bounds

Lemma A.1 (Chernoff bounds).

Let {Xi}i=1rsuperscriptsubscriptsubscript𝑋𝑖𝑖1𝑟\{X_{i}\}_{i=1}^{r}{ italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT be a family of independent binary random variables with Pr[Xi=1]=qiPrsubscript𝑋𝑖1subscript𝑞𝑖\Pr[X_{i}=1]=q_{i}roman_Pr [ italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1 ] = italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, and let X=i=1rXi𝑋superscriptsubscript𝑖1𝑟subscript𝑋𝑖X=\sum_{i=1}^{r}X_{i}italic_X = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. For any δ>0𝛿0\delta>0italic_δ > 0,

Pr(|X𝔼[X]|δ𝔼[X])2exp(min(δ,δ2)𝔼[X]/3).Pr𝑋𝔼delimited-[]𝑋𝛿𝔼delimited-[]𝑋2𝛿superscript𝛿2𝔼delimited-[]𝑋3\Pr\left(|X-\mathbb{E}[X]|\geq\delta\mathbb{E}[X]\right)\leq 2\exp(-\min(% \delta,\delta^{2})\mathbb{E}[X]/3)\ .roman_Pr ( | italic_X - blackboard_E [ italic_X ] | ≥ italic_δ blackboard_E [ italic_X ] ) ≤ 2 roman_exp ( - roman_min ( italic_δ , italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) blackboard_E [ italic_X ] / 3 ) .

Appendix B Further Supplementary Material from [32]

Slack generation with two given sets.

The following lemma shows that one can compute a partial coloring of the nodes in two given sets S1subscript𝑆1S_{1}italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and S2subscript𝑆2S_{2}italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT such that any node v𝑣vitalic_v that has sufficiently many non-edges in G[N(v)Si]𝐺delimited-[]𝑁𝑣subscript𝑆𝑖G[N(v)\cap S_{i}]italic_G [ italic_N ( italic_v ) ∩ italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ], i=1,2𝑖12i=1,2italic_i = 1 , 2 obtain slack. We use it in Section 5.2 and it is proven in [32].

Lemma B.1 ([32]).

Let Δs=O(polyloglogn)subscriptΔ𝑠𝑂poly𝑛\Delta_{s}=O(\operatorname{\text{{\rm poly}}}\log\log n)roman_Δ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = italic_O ( poly roman_log roman_log italic_n ). Let m¯¯𝑚\overline{m}over¯ start_ARG italic_m end_ARG and χ=O(Δ)𝜒𝑂Δ\chi=O(\Delta)italic_χ = italic_O ( roman_Δ ) be positive integers such that m¯/χ=Ω(logΔloglogn)¯𝑚𝜒ΩΔ𝑛\overline{m}/\chi=\Omega(\log\Delta\cdot\log\log n)over¯ start_ARG italic_m end_ARG / italic_χ = roman_Ω ( roman_log roman_Δ ⋅ roman_log roman_log italic_n ) and χcΔs𝜒superscript𝑐subscriptΔ𝑠\chi\geq c^{\prime}\Delta_{s}italic_χ ≥ italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT roman_Δ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT for some constant csuperscript𝑐c^{\prime}italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. Let WV𝑊𝑉W\subseteq Vitalic_W ⊆ italic_V and let S1,S2Vsubscript𝑆1subscript𝑆2𝑉S_{1},S_{2}\subset Vitalic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⊂ italic_V be disjoint sets such that for i=1,2𝑖12i=1,2italic_i = 1 , 2,

  • v(WS1S2):dSi(v)Δs:for-all𝑣𝑊subscript𝑆1subscript𝑆2subscript𝑑subscript𝑆𝑖𝑣subscriptΔ𝑠\forall v\in(W\cup S_{1}\cup S_{2}):d_{S_{i}}(v)\leq\Delta_{s}∀ italic_v ∈ ( italic_W ∪ italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∪ italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) : italic_d start_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_v ) ≤ roman_Δ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT,

  • vWfor-all𝑣𝑊\forall v\in W∀ italic_v ∈ italic_W: the number of non-edges in N(v)Si𝑁𝑣subscript𝑆𝑖N(v)\cap S_{i}italic_N ( italic_v ) ∩ italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is at least m¯¯𝑚\overline{m}over¯ start_ARG italic_m end_ARG

There is a randomized 𝖢𝖮𝖭𝖦𝖤𝖲𝖳𝖢𝖮𝖭𝖦𝖤𝖲𝖳\mathsf{CONGEST}sansserif_CONGEST algorithm that w.h.p. colors a subset of S1S2subscript𝑆1subscript𝑆2S_{1}\cup S_{2}italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∪ italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT using a palette of size 2χ2𝜒2\chi2 italic_χ such that every node in W𝑊Witalic_W has at least e3/cm¯/(50χ)=Ω(m¯/χ)superscript𝑒3superscript𝑐¯𝑚50𝜒Ω¯𝑚𝜒e^{-3/c^{\prime}}\overline{m}/(50\chi)=\Omega(\overline{m}/\chi)italic_e start_POSTSUPERSCRIPT - 3 / italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT over¯ start_ARG italic_m end_ARG / ( 50 italic_χ ) = roman_Ω ( over¯ start_ARG italic_m end_ARG / italic_χ ) same-colored neighbors. Every node in W,S1,S2𝑊subscript𝑆1subscript𝑆2W,S_{1},S_{2}italic_W , italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT has at most 2Δs2subscriptΔ𝑠2\Delta_{s}2 roman_Δ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT of its neighbors colored.

Non-edge hitting lemma.

An expected p2superscript𝑝2p^{2}italic_p start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT-fraction of the non-edges is preserved when sampling nodes into a set with probability p𝑝pitalic_p. The following lemma shows that the probability of deviating from this expectation is small.

Lemma B.2 (Non-edge hitting lemma [32]).

Let G𝐺Gitalic_G be a graph on the vertex set X𝑋Xitalic_X with m¯¯𝑚\overline{m}over¯ start_ARG italic_m end_ARG non-edges. Sample each node of X𝑋Xitalic_X with probability p𝑝pitalic_p into a set S𝑆Sitalic_S and let f𝑓fitalic_f be the random variable describing the number of non-edges in G[S]𝐺delimited-[]𝑆G[S]italic_G [ italic_S ]. Then we have Pr(fp2m¯/2)exp(pm¯/5|X|)Pr𝑓superscript𝑝2¯𝑚2𝑝¯𝑚5𝑋\Pr(f\leq p^{2}\overline{m}/2)\leq\exp\left(-p\overline{m}/5|X|\right)roman_Pr ( italic_f ≤ italic_p start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT over¯ start_ARG italic_m end_ARG / 2 ) ≤ roman_exp ( - italic_p over¯ start_ARG italic_m end_ARG / 5 | italic_X | ).