Linearization of optimal rates for independent zero-error source and channel problems

Nicolas Charpenay,  , Maël Le Treust,  , and Aline Roumy The work of Nicolas Charpenay was conducted during his PhD at IRISA UMR 6074 and Centre Inria de l’Université de Rennes, funded by CDSN ENS Paris-Saclay.This work was presented in part at the IEEE Information Theory Workshop (ITW) 2023 in Saint-Malo, France, [DOI: 10.1109/ITW55543.2023.10161637], and in part at the IEEE International Symposium on Information Theory (ISIT) 2023 in Taipei, Taiwan, [DOI: 10.1109/ISIT54713.2023.10206770].Nicolas Charpenay is with Univ Rennes, CNRS, IRMAR UMR 6625, F-35000 Rennes, France (e-mail: [email protected]).Maël Le Treust is with Univ. Rennes, CNRS, Inria, IRISA UMR 6074, F-35000 Rennes, France (e-mail: [email protected]).Aline Roumy is with Centre Inria de l’Université de Rennes, France (e-mail: [email protected]).
Abstract

Zero-error coding encompasses a variety of source and channel problems where the probability of error must be exactly zero. The zero-error constraint differs from the vanishing-error constraint, the latter only requires the probability of error to go to zero when the block length of the code goes to infinity. Here, many problems change from a statistical nature to a combinatorial one, which is tied to the encoder?s lack of knowledge about what is observed by the decoder. In this paper, we investigate two unsolved zero-error problems: the source coding with side information and the channel coding. We focus our attention on families of independent problems for which the distribution decomposes into a product of distributions, corresponding to solved zero-error problems. A crucial step is the linearization property of the optimal rate, which does not always hold in the zero-error regime, unlike in the vanishing error regime. By generalizing recent results of Wigderson and Zuiddam, and of Schrijver, we derive a condition under which the linearization properties of the complementary graph entropy H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG and of the zero-error capacity C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT for the AND product of graph and for the disjoint union of graphs are all equivalent. This provides new single-letter characterization of H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG and C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, for example when the graph is a product of perfect graphs, which is not perfect in general, and for the class of graphs obtain by the product of a perfect graph G𝐺Gitalic_G with the pentagon graph C5subscript𝐶5C_{5}italic_C start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT. By building on Haemers result, we also show that the linearization of the complementary graph entropy does not hold for the product of the Schläfli graph with its complementary graph.

I Introduction

Transmitting information without any errors has been a concern for Shannon since the beginning of his work. In his seminal paper [1], Shannon proposed a construction for zero-error source coding, a problem soon solved by Huffman in [2]. Shortly after establishing the channel capacity in [1], Shannon turned his attention to channel coding with zero-error in [3], instead of vanishing error. This subtle difference radically changes the nature of the problem, essentially combinatorial rather than probabilistic. The single-letter characterization of the zero-error capacity is a notoriously difficult open problem. For example, the zero-error capacity of the noisy-typewriter channel with 7777 letters is unknown, some lower and upper bounds are stated in [4], [5], and [6]. In fact, the zero-error property only depends on the support of the channel conditional distribution PY|Xsubscript𝑃conditional𝑌𝑋P_{Y|X}italic_P start_POSTSUBSCRIPT italic_Y | italic_X end_POSTSUBSCRIPT but not on the probability values. More precisely, the zero-error property is translated into the characteristic graph that encompasses the problem data in its structure: the vertices are the channel inputs 𝒳𝒳\mathcal{X}caligraphic_X, and two symbols x𝑥xitalic_x and xsuperscript𝑥x^{\prime}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT are adjacent if they are “confusable”, i.e. if they can produce the same channel output y𝑦yitalic_y with positive probability. For sequences of symbols xnsuperscript𝑥𝑛x^{n}italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, the characteristic graph is obtained by taking iteratively the AND product (\wedge) of the graphs. In order to prevent any decoding error, a zero-error codebook must be composed of non-adjacent codewords. Thus, the size of the optimal codebook is given by the size of the maximal independent set, called the independence number. More specifically, the zero-error capacity C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is the asymptotic limit of the independence number of iterated AND products of the characteristic graph. This means that all channel distributions with the same characteristic graph (or equivalently, with the same support) have the same zero-error capacity. Over time, this open question has attracted a lot of attention in Information Theory [7, Chap. 11] and in Combinatorics and Graph Theory, see [8, Chap. 27].

1234567
Figure 1: The characteristic graph C7subscript𝐶7C_{7}italic_C start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT of the noisy-typewriter channel with 7777 letters.

This problem inspired Berge’s notion of perfect graphs [9, pp. 382], for which the zero-error capacity is given by the one-shot independence number [10, Theorem 4.18]. Graphs with odd cycles are also related to Berge’s conjecture [11], later proved in [12] by Chudnovsky et al., namely “a graph G𝐺Gitalic_G is perfect if and only if neither G𝐺Gitalic_G nor its complementary graph G¯¯𝐺\overline{G}over¯ start_ARG italic_G end_ARG, have odd cycles of length 5555 or more.” Since the zero-error capacity of the pentagon graph C5subscript𝐶5C_{5}italic_C start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT has been characterized by Shannon [3] and Lovász [13], as well as the zero-error capacity for perfect graphs, see [10, Theorem 4.18], the graph C7subscript𝐶7C_{7}italic_C start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT depicted in Fig. 1, is the minimal connected graph for which the zero-error capacity is still an open problem.

In the source coding framework, an important unsolved zero-error problem was posed by Witsenhausen in [14] when the decoder has side information. In this problem depicted in Fig. 2, the encoder shares information about a source X𝑋Xitalic_X, exploiting the side-information Y𝑌Yitalic_Y observed by the decoder but not by itself. In the vanishing error regime, Slepian and Wolf in [15] showed that the optimal rate is H(X|Y)𝐻conditional𝑋𝑌H(X|Y)italic_H ( italic_X | italic_Y ), but no single-letter characterization is available in the zero-error regime. As for the channel coding problem, the zero-error property is embedded into the characteristic graph G𝐺Gitalic_G constructed with respect to the conditional distribution PY|Xsubscript𝑃conditional𝑌𝑋P_{Y|X}italic_P start_POSTSUBSCRIPT italic_Y | italic_X end_POSTSUBSCRIPT of the “side-information channel”. The main difference with channel coding problem is that the source distribution PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT is fixed. Witsenhausen showed in [14] that the fixed-length encoding task becomes equivalent to graph coloring. In [16], Alon and Orlitsky considered the variable-length version, determining an asymptotic expression for the optimal rate based on the chromatic entropy. In [17], Koulgi et al. proved that the optimal rate coincides with the complementary graph entropy H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG, defined by Körner and Longo in [18]. These two expressions are asymptotic, as optimal rates are determined by coloring an infinite product of graphs. Single-letter characterizations are known only in the same cases as for zero-error capacity, such as perfect graphs and the pentagon graph C5subscript𝐶5C_{5}italic_C start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT. Instead, the Körner graph entropy [19] provides a single-letter expression for the unrestricted input setting of Alon and Orlitsky in [16], where the zero-error constraint is satisfied even outside the source’s support, providing an upper bound on the optimal rate.

EncoderDecoderYnsuperscript𝑌𝑛Y^{n}italic_Y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPTX^nsuperscript^𝑋𝑛\widehat{X}^{n}over^ start_ARG italic_X end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPTXnsuperscript𝑋𝑛X^{n}italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT\diagupR𝑅Ritalic_R
Figure 2: The source coding problem with decoder side information, called side-information problem.

The difficulty of the zero-error problem comes from the fact that the knowledge of the decoder is not included in the one of the encoder. This is because the side information is available only at the decoder. Indeed, giving the side information also to the encoder allows to implement a zero-error conditional Huffman code [2] of rate H(X|Y)𝐻conditional𝑋𝑌H(X|Y)italic_H ( italic_X | italic_Y ), as in the vanishing error regime [15]. The asymmetry of knowledge poses the same difficulty in zero-error channel coding problem. Note that this difficulty is mitigated when the encoder observes the past channel output symbols, leading to the characterization by Shannon, of the zero-error capacity with feedback in [3].

In this paper we focus our attention on zero-error problems composed of a family of independent problems. First, we consider that the source and side-information decompose into a family of independent random variables (Xa,Ya)a𝒜subscriptsubscript𝑋𝑎subscript𝑌𝑎𝑎𝒜(X_{a},Y_{a})_{a\in\mathcal{A}}( italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT. This induces a characteristic graph with a specific structure given by the AND product of the graphs of each subproblems G=a𝒜Ga𝐺subscript𝑎𝒜subscript𝐺𝑎G=\wedge_{a\in\mathcal{A}}G_{a}italic_G = ∧ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT. Such decomposition has two benefits: it reduces the problem complexity and it provides new cases for which the complementary graph entropy is charaterized. In the vanishing error regime, the optimal rate is equal to the sum of the optimal rates in each subproblem, we say that the optimal rate linearizes. By building on Haemers results for the zero-error capacity of the Schläfli graph in [20], we show that independence alone doesn’t ensure the linearization of the complementary graph entropy, which contradicts a standard property in the vanishing error regime. Thus, showing the linearization of the optimal rates becomes crucial.

Our first contribution is to show that the linearization of the complementary graph entropy holds for the AND product of graphs if and only if it holds for the disjoint union of graphs (square-union\sqcup), also called “sum of graphs” in [21]. Recently, Wigderson and Zuiddam in [22] and Schrijver in [23] show a similar statement for the zero-error capacity: the linearization holds for the AND product if and only if it holds for the disjoint union. We then explore the consequences of these two statements, for which the characteristic graphs are defined similarly. An important difference is about the probability distribution PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT which is specified in the source problem but a priori unspecified in the zero-error channel coding problem.

A natural notion related to both C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG is the zero-error capacity C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) of a graph relative to a distribution PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT introduced by Csiszár and Körner in [24]. By taking the maximum over the distribution PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT, Gargano et al. [25] showed that it is equal to the zero-error capacity

C0(G)=maxPXC(G,PX).subscript𝐶0𝐺subscriptsubscript𝑃𝑋𝐶𝐺subscript𝑃𝑋\displaystyle C_{0}(G)=\max_{P_{X}}C(G,P_{X}).italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) = roman_max start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) . (1)

Moreover, Marton showed in [26] that the complementary graph entropy satisfies

C(G,PX)+H¯(G)=H(X).𝐶𝐺subscript𝑃𝑋¯𝐻𝐺𝐻𝑋\displaystyle C(G,P_{X})+\overline{H}(G)=H(X).italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) + over¯ start_ARG italic_H end_ARG ( italic_G ) = italic_H ( italic_X ) . (2)

Equations (1) and (2) are the analogues of the channel capacity C=maxPXI(X;Y)𝐶subscriptsubscript𝑃𝑋𝐼𝑋𝑌C=\max_{P_{X}}I(X;Y)italic_C = roman_max start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_I ( italic_X ; italic_Y ) and the entropy property I(X;Y)+H(X|Y)=H(X)𝐼𝑋𝑌𝐻conditional𝑋𝑌𝐻𝑋I(X;Y)+H(X|Y)=H(X)italic_I ( italic_X ; italic_Y ) + italic_H ( italic_X | italic_Y ) = italic_H ( italic_X ) in the vanishing error regime. The main contribution of the paper is to show that the linearization properties of C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) and H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG for the AND product and for the disjoint union are all equivalent, provided that the source distribution PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT maximizes C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) when evaluated with respect to the AND product of graphs.

These linearization properties enlarge the class of problems for which C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG have a single-letter characterization. For perfect graphs, we show that the linearizations of C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG always hold for the AND product and for the disjoint union. As a consequence, we determine new single-letter characterizations for the products of perfect graphs that are not perfect in general, and for the product of a perfect graph G𝐺Gitalic_G with the pentagon graph C5subscript𝐶5C_{5}italic_C start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT, by building on the characterization of H¯(GC5)¯𝐻square-union𝐺subscript𝐶5\overline{H}(G\sqcup C_{5})over¯ start_ARG italic_H end_ARG ( italic_G ⊔ italic_C start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ) stated in [21].

A crucial notion is the set of capacity-achieving distributions that contains all the distributions PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT for which C0=C(,PX)subscript𝐶0𝐶subscript𝑃𝑋C_{0}=C(\cdot,P_{X})italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ). We show that the uniform distribution is capacity-achieving when the graph is vertex-transitive, i.e. when all vertices play the same role within the graph. Since, the Schlälfi graph S𝑆Sitalic_S and its complementary graph S¯¯𝑆\overline{S}over¯ start_ARG italic_S end_ARG are vertex-transitive, so as their product SS¯𝑆¯𝑆S\wedge\overline{S}italic_S ∧ over¯ start_ARG italic_S end_ARG, the uniform distributions are capacity achieving for S𝑆Sitalic_S, S¯¯𝑆\overline{S}over¯ start_ARG italic_S end_ARG and SS¯𝑆¯𝑆S\wedge\overline{S}italic_S ∧ over¯ start_ARG italic_S end_ARG. Together with Haemers result [20], this shows a counterexample where linearizations of C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) and H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG for the AND product and the disjoint union of S𝑆Sitalic_S and S¯¯𝑆\overline{S}over¯ start_ARG italic_S end_ARG do not hold.

In Sec. II, we study the linearization of the complementary graph entropy H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG for the source problem with side information. The connection with the linearization of the zero-error capacity C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is investigated in Sec. III. New single-letter characterizations for C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG are provided in Sec. IV, as well as the counter-example of linearization based on the Schläfli graph.

II Zero-Error Source Coding With Decoder Side Information

II-A Problem Statement and Results from the Literature

The zero-error source coding problem with decoder side information is depicted in Fig. 2. It corresponds to a situation in data compression where the decoder has side-information Y𝑌Yitalic_Y about the source X𝑋Xitalic_X that has to be retrieved. This problem was formulated by Slepian and Wolf in [15] in the vanishing error regime and by Witsenhausen in [14] for the zero-error variant. We call this the side-information problem.

More formally, we assume that a sequence i.i.d. random variables (Xn,Yn)superscript𝑋𝑛superscript𝑌𝑛(X^{n},Y^{n})( italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_Y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) of length n𝑛superscriptn\in\mathbb{N}^{\star}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT is drawn according to PX,YΔ(𝒳×𝒴)subscript𝑃𝑋𝑌Δ𝒳𝒴P_{X,Y}\in\Delta(\mathcal{X}\times\mathcal{Y})italic_P start_POSTSUBSCRIPT italic_X , italic_Y end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_X × caligraphic_Y ) where 𝒳𝒳\mathcal{X}caligraphic_X and 𝒴𝒴\mathcal{Y}caligraphic_Y are finite sets. We consider variable-length source coding, which encompasses the special case of fixed-length source coding. An (n,ϕe,ϕd)𝑛subscriptitalic-ϕ𝑒subscriptitalic-ϕ𝑑(n,\phi_{e},\phi_{d})( italic_n , italic_ϕ start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT , italic_ϕ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) variable-length side-information source code for X𝑋Xitalic_X and Y𝑌Yitalic_Y consists of

  • -

    an encoder ϕe:𝒳n{0,1}:subscriptitalic-ϕ𝑒superscript𝒳𝑛superscript01\phi_{e}:\mathcal{X}^{n}\rightarrow\{0,1\}^{*}italic_ϕ start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT : caligraphic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → { 0 , 1 } start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT that assigns to each xnsuperscript𝑥𝑛x^{n}italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT a binary string such that ImϕeImsubscriptitalic-ϕ𝑒\operatorname*{Im}\phi_{e}roman_Im italic_ϕ start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT is prefix-free,

  • -

    a decoder ϕd:𝒴n×{0,1}𝒳n:subscriptitalic-ϕ𝑑superscript𝒴𝑛superscript01superscript𝒳𝑛\phi_{d}:\mathcal{Y}^{n}\times\{0,1\}^{*}\rightarrow\mathcal{X}^{n}italic_ϕ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT : caligraphic_Y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT × { 0 , 1 } start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT → caligraphic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT that assigns an estimate x^nsuperscript^𝑥𝑛\widehat{x}^{n}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT to each pair (yn,ϕe(xn))superscript𝑦𝑛subscriptitalic-ϕ𝑒superscript𝑥𝑛(y^{n},\phi_{e}(x^{n}))( italic_y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_ϕ start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) ).

The rate of the (n,ϕe,ϕd)𝑛subscriptitalic-ϕ𝑒subscriptitalic-ϕ𝑑(n,\phi_{e},\phi_{d})( italic_n , italic_ϕ start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT , italic_ϕ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT )-code is the average length of the codeword per source symbol, i.e. R1n𝔼[ϕe(Xn)]approaches-limit𝑅1𝑛𝔼delimited-[]subscriptitalic-ϕ𝑒superscript𝑋𝑛R\doteq\frac{1}{n}\mathbb{E}[\ell\circ\phi_{e}(X^{n})]italic_R ≐ divide start_ARG 1 end_ARG start_ARG italic_n end_ARG blackboard_E [ roman_ℓ ∘ italic_ϕ start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ( italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) ], and the probability of error is Pe(n)(X^nXn)approaches-limitsuperscriptsubscript𝑃𝑒𝑛superscript^𝑋𝑛superscript𝑋𝑛P_{e}^{(n)}\doteq\mathbb{P}\big{(}\widehat{X}^{n}\neq X^{n}\big{)}italic_P start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT ≐ blackboard_P ( over^ start_ARG italic_X end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ≠ italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ).

Definition 1.

The optimal rate in the vanishing error regime is the minimal rate among all codes that satisfy the ε𝜀\varepsilonitalic_ε-error constraint with ε0𝜀0\varepsilon\rightarrow 0italic_ε → 0:

Rlimε0inf(n,ϕe,ϕd):Pe(n)ε1n𝔼[ϕe(Xn)].approaches-limitsuperscript𝑅subscript𝜀0subscriptinfimum:𝑛subscriptitalic-ϕ𝑒subscriptitalic-ϕ𝑑superscriptsubscript𝑃𝑒𝑛𝜀1𝑛𝔼delimited-[]subscriptitalic-ϕ𝑒superscript𝑋𝑛\displaystyle R^{\star}\doteq\lim_{\displaystyle\varepsilon\rightarrow 0}\quad% \inf\limits_{\displaystyle(n,\phi_{e},\phi_{d}):P_{e}^{(n)}\leq\varepsilon}% \quad\frac{1}{n}\mathbb{E}[\ell\circ\phi_{e}(X^{n})].italic_R start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ≐ roman_lim start_POSTSUBSCRIPT italic_ε → 0 end_POSTSUBSCRIPT roman_inf start_POSTSUBSCRIPT ( italic_n , italic_ϕ start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT , italic_ϕ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) : italic_P start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT ≤ italic_ε end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG blackboard_E [ roman_ℓ ∘ italic_ϕ start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ( italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) ] . (3)

The optimal rate in the zero-error regime is the minimal rate among all coding schemes that satisfy the zero-error constraint:

R0inf(n,ϕe,ϕd):Pe(n)=01n𝔼[ϕe(Xn)].approaches-limitsubscriptsuperscript𝑅0subscriptinfimum:𝑛subscriptitalic-ϕ𝑒subscriptitalic-ϕ𝑑superscriptsubscript𝑃𝑒𝑛01𝑛𝔼delimited-[]subscriptitalic-ϕ𝑒superscript𝑋𝑛\displaystyle R^{\star}_{0}\doteq\inf\limits_{\displaystyle(n,\phi_{e},\phi_{d% }):P_{e}^{(n)}=0}\quad\frac{1}{n}\mathbb{E}[\ell\circ\phi_{e}(X^{n})].italic_R start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≐ roman_inf start_POSTSUBSCRIPT ( italic_n , italic_ϕ start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT , italic_ϕ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) : italic_P start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT = 0 end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG blackboard_E [ roman_ℓ ∘ italic_ϕ start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ( italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) ] . (4)

When the side-information Y𝑌Yitalic_Y is available at both encoder and decoder, the optimal rates in the vanishing and zero-error regimes are equal to H(X|Y)𝐻conditional𝑋𝑌H(X|Y)italic_H ( italic_X | italic_Y ). The zero-error coding construction relies on a conditional Huffman coding [2]. In the side-information problem, the encoder does not observes the side-information Y𝑌Yitalic_Y. This asymmetry of information has a consequence: the optimal rates in the vanishing and zero-error regimes are distinct.

Theorem 1 (from [15, Theorem 2]).

The optimal rate in the vanishing error regime is

R=H(X|Y).superscript𝑅𝐻conditional𝑋𝑌\displaystyle R^{\star}=H(X|Y).italic_R start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT = italic_H ( italic_X | italic_Y ) . (5)

The nature of the problem changes when considering an error probability equal to zero, instead of a vanishing error probability. In the zero-error regime, the characterisation of the optimal rate R0subscriptsuperscript𝑅0R^{\star}_{0}italic_R start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is a notoriously difficult open problem of combinatorial nature. The key features of the side-information problem are captured by the “characteristic graph” introduced by Witsenhausen in [14], which we review below.

Definition 2 (Characteristic graph).

Let 𝒳,𝒴𝒳𝒴\mathcal{X},\mathcal{Y}caligraphic_X , caligraphic_Y be two finite sets and PY|Xsubscript𝑃conditional𝑌𝑋P_{Y|X}italic_P start_POSTSUBSCRIPT italic_Y | italic_X end_POSTSUBSCRIPT be a conditional distribution. The characteristic graph G=(𝒳,)𝐺𝒳G=(\mathcal{X},\mathcal{E})italic_G = ( caligraphic_X , caligraphic_E ) associated to PY|Xsubscript𝑃conditional𝑌𝑋P_{Y|X}italic_P start_POSTSUBSCRIPT italic_Y | italic_X end_POSTSUBSCRIPT is defined by:

  • -

    𝒳𝒳\mathcal{X}caligraphic_X as set of vertices,

  • -

    x,x𝒳𝑥superscript𝑥𝒳x,x^{\prime}\in\mathcal{X}italic_x , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_X are adjacent xx𝑥superscript𝑥xx^{\prime}\in\mathcal{E}italic_x italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_E, if PY|X(y|x)PY|X(y|x)>0subscript𝑃conditional𝑌𝑋conditional𝑦𝑥subscript𝑃conditional𝑌𝑋conditional𝑦superscript𝑥0P_{Y|X}(y|x)P_{Y|X}(y|x^{\prime})>0italic_P start_POSTSUBSCRIPT italic_Y | italic_X end_POSTSUBSCRIPT ( italic_y | italic_x ) italic_P start_POSTSUBSCRIPT italic_Y | italic_X end_POSTSUBSCRIPT ( italic_y | italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) > 0 for some y𝒴𝑦𝒴y\in\mathcal{Y}italic_y ∈ caligraphic_Y.

A characteristic graph is a probabilistic graph G=(𝒳,,PX)𝐺𝒳subscript𝑃𝑋G=(\mathcal{X},\mathcal{E},P_{X})italic_G = ( caligraphic_X , caligraphic_E , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ), when it has the underlying distribution PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT on its vertices.

The meaning of the characteristic graph is that, when the side information y𝑦yitalic_y does not allow to distinguish exactly between the source realizations x𝑥xitalic_x and xsuperscript𝑥x^{\prime}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, then x𝑥xitalic_x and xsuperscript𝑥x^{\prime}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT are adjacent, and must be mapped to different codewords. Therefore, a zero-error encoding is a graph coloring for which adjacent vertices are mapped to different colors.

Definition 3 (Coloring, chromatic number χ𝜒\chiitalic_χ).

Let G=(𝒳,)𝐺𝒳G=(\mathcal{X},\mathcal{E})italic_G = ( caligraphic_X , caligraphic_E ) be a graph. A map** c:𝒳𝒞:𝑐𝒳𝒞c:\mathcal{X}\rightarrow\mathcal{C}italic_c : caligraphic_X → caligraphic_C is a coloring if for all adjacent vertices x𝑥xitalic_x, xsuperscript𝑥x^{\prime}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT with xx𝑥superscript𝑥xx^{\prime}\in\mathcal{E}italic_x italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_E, we have c(x)c(x)𝑐𝑥𝑐superscript𝑥c(x)\neq c(x^{\prime})italic_c ( italic_x ) ≠ italic_c ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ). The chromatic number χ(G)𝜒𝐺\chi(G)italic_χ ( italic_G ) is the smallest |𝒞|𝒞|\mathcal{C}|| caligraphic_C | such that there exists a coloring c:𝒳𝒞:𝑐𝒳𝒞c:\mathcal{X}\rightarrow\mathcal{C}italic_c : caligraphic_X → caligraphic_C of G𝐺Gitalic_G.

For sequences of symbols with underlying distribution PY|Xn(yn|xn)=t=1nPY|X(yt|xt)subscriptsuperscript𝑃𝑛conditional𝑌𝑋conditionalsuperscript𝑦𝑛superscript𝑥𝑛superscriptsubscriptproduct𝑡1𝑛subscript𝑃conditional𝑌𝑋conditionalsubscript𝑦𝑡subscript𝑥𝑡P^{n}_{Y|X}(y^{n}|x^{n})=\prod_{t=1}^{n}P_{Y|X}(y_{t}|x_{t})italic_P start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_Y | italic_X end_POSTSUBSCRIPT ( italic_y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT | italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) = ∏ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_Y | italic_X end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT | italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ), two sequences of source inputs xn,xnsuperscript𝑥𝑛superscript𝑥𝑛x^{n},x^{\prime n}italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_x start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT are adjacent in the graph if PY|Xn(yn|xn)PY|Xn(yn|xn)>0subscriptsuperscript𝑃𝑛conditional𝑌𝑋conditionalsuperscript𝑦𝑛superscript𝑥𝑛subscriptsuperscript𝑃𝑛conditional𝑌𝑋conditionalsuperscript𝑦𝑛superscript𝑥𝑛0P^{n}_{Y|X}(y^{n}|x^{n})P^{n}_{Y|X}(y^{n}|x^{\prime n})>0italic_P start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_Y | italic_X end_POSTSUBSCRIPT ( italic_y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT | italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) italic_P start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_Y | italic_X end_POSTSUBSCRIPT ( italic_y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT | italic_x start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT ) > 0 for some sequence of channel outputs ynsuperscript𝑦𝑛y^{n}italic_y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, i.e. if and only if either xt=xtsubscript𝑥𝑡subscriptsuperscript𝑥𝑡x_{t}=x^{\prime}_{t}italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT or xtxt, 1tnformulae-sequencesubscript𝑥𝑡superscriptsubscript𝑥𝑡for-all1𝑡𝑛x_{t}x_{t}^{\prime}\in\mathcal{E},\forall\ 1\leq t\leq nitalic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_E , ∀ 1 ≤ italic_t ≤ italic_n. This implies that for sequences of symbols, the characteristic graph is built by using the AND product of graphs, denoted by \wedge, and also called “strong product” or “normal product” in [27, 26], and defined below.

Definition 4 (AND product \wedge).

Let G1=(𝒳1,1,PX1)subscript𝐺1subscript𝒳1subscript1subscript𝑃subscript𝑋1G_{1}=(\mathcal{X}_{1},\mathcal{E}_{1},P_{X_{1}})italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ( caligraphic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , caligraphic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ), G2=(𝒳2,2,PX2)subscript𝐺2subscript𝒳2subscript2subscript𝑃subscript𝑋2G_{2}=(\mathcal{X}_{2},\mathcal{E}_{2},P_{X_{2}})italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = ( caligraphic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , caligraphic_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) be two probabilistic graphs, their AND product G1G2subscript𝐺1subscript𝐺2G_{1}\wedge G_{2}italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∧ italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT is a probabilistic graph defined by:

  • -

    𝒳1×𝒳2subscript𝒳1subscript𝒳2\mathcal{X}_{1}\times\mathcal{X}_{2}caligraphic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT × caligraphic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT as set of vertices,

  • -

    (x1x2),(x1x2)subscript𝑥1subscript𝑥2subscriptsuperscript𝑥1subscriptsuperscript𝑥2(x_{1}x_{2}),(x^{\prime}_{1}x^{\prime}_{2})( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) , ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) are adjacent if x1x11subscript𝑥1subscriptsuperscript𝑥1subscript1x_{1}x^{\prime}_{1}\in\mathcal{E}_{1}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∈ caligraphic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and x2x22subscript𝑥2subscriptsuperscript𝑥2subscript2x_{2}x^{\prime}_{2}\in\mathcal{E}_{2}italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ caligraphic_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, with the convention of self-adjacency for all vertices.

  • -

    PX1PX2tensor-productsubscript𝑃subscript𝑋1subscript𝑃subscript𝑋2P_{X_{1}}\otimes P_{X_{2}}italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT as probability distribution on the vertices.

We denote by G1nsuperscriptsubscript𝐺1𝑛G_{1}^{\wedge n}italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT the n𝑛nitalic_n-th AND power: G1n=G1G1superscriptsubscript𝐺1𝑛subscript𝐺1subscript𝐺1G_{1}^{\wedge n}=G_{1}\wedge...\wedge G_{1}italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT = italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∧ … ∧ italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT n𝑛nitalic_n times.

Unlike the vanishing error regime, there is no single-letter characterization of the optimal rate in the zero-error regime. We present two different asymptotic expressions which rely on codebooks composed of codewords that form a coloring of the AND product of the characteristic graph Gnsuperscript𝐺𝑛G^{\wedge n}italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT.

Alon and Orlitsky introduced an asymptotic expression in [16], for the optimal rate in the “restricted inputs” setting. The optimal rate R0superscriptsubscript𝑅0R_{0}^{\star}italic_R start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT relies on the notion of chromatic entropy Hχ(Gn)subscript𝐻𝜒superscript𝐺𝑛H_{\chi}(G^{\wedge n})italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT ), which is the minimal entropy of a coloring of Gnsuperscript𝐺𝑛G^{\wedge n}italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT.

Definition 5 (Chromatic entropy Hχsubscript𝐻𝜒H_{\chi}italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT).

The chromatic entropy of the probabilistic graph G=(𝒳,,PX)𝐺𝒳subscript𝑃𝑋G=(\mathcal{X},\mathcal{E},P_{X})italic_G = ( caligraphic_X , caligraphic_E , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) is defined by

Hχ(G)=inf{H(c(X))|c is a coloring of G}.subscript𝐻𝜒𝐺infimumconditional-set𝐻𝑐𝑋𝑐 is a coloring of 𝐺\displaystyle H_{\chi}(G)=\inf\Big{\{}H(c(X))\Big{|}c\text{ is a coloring of }% G\Big{\}}.italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G ) = roman_inf { italic_H ( italic_c ( italic_X ) ) | italic_c is a coloring of italic_G } . (6)
Theorem 2 (from [16, Lemma 6]).

For all probabilistic graph G=(𝒳,,PX)𝐺𝒳subscript𝑃𝑋G=(\mathcal{X},\mathcal{E},P_{X})italic_G = ( caligraphic_X , caligraphic_E , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ),

R0=limn1nHχ(Gn).subscriptsuperscript𝑅0subscript𝑛1𝑛subscript𝐻𝜒superscript𝐺𝑛\displaystyle R^{\star}_{0}=\lim_{n\rightarrow\infty}\frac{1}{n}H_{\chi}(G^{% \wedge n}).italic_R start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT ) . (7)

Even though there is no single-letter expression for R0subscriptsuperscript𝑅0R^{\star}_{0}italic_R start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, Alon and Orlitsky provided a single-letter upper bound in [16] by adding the constraint called “unrestricted inputs”. This constraint requires the zero-error property to be satisfied even for the sequences of symbols (Xn,Yn)superscript𝑋𝑛superscript𝑌𝑛(X^{n},Y^{n})( italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_Y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) that take values out of the support of PX,Ynsubscriptsuperscript𝑃𝑛𝑋𝑌P^{n}_{X,Y}italic_P start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X , italic_Y end_POSTSUBSCRIPT.

11112222333344445555
Figure 3: The pentagon graphs C5subscript𝐶5C_{5}italic_C start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT with uniform distribution PX=Unif({1,,5})subscript𝑃𝑋Unif15P_{X}=\operatorname*{Unif}\big{(}\{1,...,5\}\big{)}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT = roman_Unif ( { 1 , … , 5 } ) over the vertices.

With high probability, the source sequence Xnsuperscript𝑋𝑛X^{n}italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is typical with respect to PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT. Let Gn[𝒯εn(PX)]superscript𝐺𝑛delimited-[]subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝑋G^{\wedge n}[\mathcal{T}^{n}_{\varepsilon}(P_{X})]italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT [ caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) ] be the subgraph of Gnsuperscript𝐺𝑛G^{\wedge n}italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT induced by the set of typical sequences 𝒯εn(PX)subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝑋\mathcal{T}^{n}_{\varepsilon}(P_{X})caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) with tolerance ε>0𝜀0\varepsilon>0italic_ε > 0, see [7, Definition 2.8]. The zero-error code consists of a coloring of this induced subgraph Gn[𝒯εn(PX)]superscript𝐺𝑛delimited-[]subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝑋G^{\wedge n}[\mathcal{T}^{n}_{\varepsilon}(P_{X})]italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT [ caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) ] with a minimum number of colors χ(Gn[𝒯εn(PX)])𝜒superscript𝐺𝑛delimited-[]subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝑋\chi\big{(}G^{\wedge n}[\mathcal{T}^{n}_{\varepsilon}(P_{X})]\big{)}italic_χ ( italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT [ caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) ] ), where χ𝜒\chiitalic_χ denotes the chromatic number of the graph. The encoder sends the color index to the decoder if Xnsuperscript𝑋𝑛X^{n}italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is typical, otherwise it sends the index of the sequence Xnsuperscript𝑋𝑛X^{n}italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT in 𝒳nsuperscript𝒳𝑛\mathcal{X}^{n}caligraphic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. This coding strategy has a rate upper-bounded by

1n+(Xn𝒯εn(PX))log|𝒳|+1nlogχ(Gn[𝒯εn(PX)]).1𝑛superscript𝑋𝑛subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝑋𝒳1𝑛𝜒superscript𝐺𝑛delimited-[]subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝑋\displaystyle\frac{1}{n}+\mathbb{P}\big{(}X^{n}\notin\mathcal{T}^{n}_{% \varepsilon}(P_{X})\big{)}\log|\mathcal{X}|+\frac{1}{n}\log\chi\big{(}G^{% \wedge n}[\mathcal{T}^{n}_{\varepsilon}(P_{X})]\big{)}.divide start_ARG 1 end_ARG start_ARG italic_n end_ARG + blackboard_P ( italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∉ caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) ) roman_log | caligraphic_X | + divide start_ARG 1 end_ARG start_ARG italic_n end_ARG roman_log italic_χ ( italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT [ caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) ] ) . (8)

The zero-error property is satisfied since the decoder is able to retrieve Xnsuperscript𝑋𝑛X^{n}italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT thanks to Ynsuperscript𝑌𝑛Y^{n}italic_Y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and the color symbol. Koulgi et al. have shown in [17, Theorem 1] that taking the limit when n𝑛nitalic_n goes to infinity and ε𝜀\varepsilonitalic_ε goes to 00 yields the best achievable rate in the zero-error side-information problem. This quantity, introduced by Körner and Longo in [18], is called the complementary graph entropy.

Definition 6.

For all probabilistic graph G=(𝒳,,PX)𝐺𝒳subscript𝑃𝑋G=(\mathcal{X},\mathcal{E},P_{X})italic_G = ( caligraphic_X , caligraphic_E , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ), the complementary graph entropy H¯(G)¯𝐻𝐺\overline{H}(G)over¯ start_ARG italic_H end_ARG ( italic_G ) is defined by:

H¯(G)=limε0lim supn1nlogχ(Gn[𝒯εn(PX)]).¯𝐻𝐺subscript𝜀0subscriptlimit-supremum𝑛1𝑛𝜒superscript𝐺𝑛delimited-[]subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝑋\displaystyle\overline{H}(G)=\lim_{\varepsilon\rightarrow 0}\limsup_{n% \rightarrow\infty}\frac{1}{n}\log\chi\big{(}G^{\wedge n}[\mathcal{T}^{n}_{% \varepsilon}(P_{X})]\big{)}.over¯ start_ARG italic_H end_ARG ( italic_G ) = roman_lim start_POSTSUBSCRIPT italic_ε → 0 end_POSTSUBSCRIPT lim sup start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG roman_log italic_χ ( italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT [ caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) ] ) . (9)
Theorem 3 (from [17, Theorem 1]).

The optimal rate in the zero-error regime writes

R0=H¯(G),subscriptsuperscript𝑅0¯𝐻𝐺\displaystyle R^{\star}_{0}=\overline{H}(G),italic_R start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = over¯ start_ARG italic_H end_ARG ( italic_G ) , (10)

where G𝐺Gitalic_G is the probabilistic graph formed of the characteristic graph associated to the distribution PY|Xsubscript𝑃conditional𝑌𝑋P_{Y|X}italic_P start_POSTSUBSCRIPT italic_Y | italic_X end_POSTSUBSCRIPT, with the underlying distribution PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT on its vertices.

A trivial single-letter upper bound is given by H(X)𝐻𝑋H(X)italic_H ( italic_X ) where the zero-error coding construction relies on Huffman coding and the decoder ignores the side information Y𝑌Yitalic_Y. In fact, this upper bound is tight for a dense subset of distributions in Δ(𝒳×𝒴)Δ𝒳𝒴\Delta(\mathcal{X}\times\mathcal{Y})roman_Δ ( caligraphic_X × caligraphic_Y ).

Proposition 1 (Full support, from [14]).

If the distribution PX,Ysubscript𝑃𝑋𝑌P_{X,Y}italic_P start_POSTSUBSCRIPT italic_X , italic_Y end_POSTSUBSCRIPT has full support, then R0=H(X)subscriptsuperscript𝑅0𝐻𝑋R^{\star}_{0}=H(X)italic_R start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_H ( italic_X ).

Proof.

Since the distribution PX,Ysubscript𝑃𝑋𝑌P_{X,Y}italic_P start_POSTSUBSCRIPT italic_X , italic_Y end_POSTSUBSCRIPT has full support, the characteristic graph G𝐺Gitalic_G is complete, i.e. every pair of symbols x𝒳𝑥𝒳x\in\mathcal{X}italic_x ∈ caligraphic_X, x𝒳superscript𝑥𝒳x^{\prime}\in\mathcal{X}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_X are adjacent in G𝐺Gitalic_G, thus Hχ(G)=H(X)subscript𝐻𝜒𝐺𝐻𝑋H_{\chi}(G)=H(X)italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G ) = italic_H ( italic_X ), which concludes the proof of Prop. 1. ∎

There are a few other cases where the optimal zero-error rate is known such as perfect graphs, or the pentagon C5subscript𝐶5C_{5}italic_C start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT with uniform distribution shown in Fig. 3 where R0=12log2(5)subscriptsuperscript𝑅012subscript25R^{\star}_{0}=\frac{1}{2}\log_{2}(5)italic_R start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 5 ). In general, the single-letter characterization of R0subscriptsuperscript𝑅0R^{\star}_{0}italic_R start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT remains a difficult open question.

II-B Independent Zero-Error Side-Information problems

In order to understand the problem’s difficulty, we examine a specific scenario where the source and side information decompose into independent variables. In the vanishing error regime, independence is a key assumption that induces the linearization of optimal rates, shedding light on practical coding techniques. In the zero-error regime, the independence hypothesis alone is insufficient for linearization of optimal rates. We specify a hypothesis that ensures linearization, enabling us to enlarge the set of problems for which the optimal rate has a single-letter characterization.

EncoderDecoderY1n,,Y|𝒜|nsuperscriptsubscript𝑌1𝑛superscriptsubscript𝑌𝒜𝑛Y_{1}^{n},...,Y_{|\mathcal{A}|}^{n}italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , … , italic_Y start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPTX^1n,,X^|𝒜|nsuperscriptsubscript^𝑋1𝑛superscriptsubscript^𝑋𝒜𝑛\widehat{X}_{1}^{n},...,\widehat{X}_{|\mathcal{A}|}^{n}over^ start_ARG italic_X end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , … , over^ start_ARG italic_X end_ARG start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPTX1n,,X|𝒜|nsuperscriptsubscript𝑋1𝑛superscriptsubscript𝑋𝒜𝑛X_{1}^{n},...,X_{|\mathcal{A}|}^{n}italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT\diagupR𝑅Ritalic_R
Figure 4: Independent side-information problems

More formally, for a finite set 𝒜𝒜\mathcal{A}caligraphic_A, we assume a set of pairs (Xa,Ya)a𝒜subscriptsubscript𝑋𝑎subscript𝑌𝑎𝑎𝒜(X_{a},Y_{a})_{a\in\mathcal{A}}( italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT, referred to as an independent family, that consists of |𝒜|𝒜|\mathcal{A}|| caligraphic_A | pairs with joint distribution that decomposes as a product of distributions. This independent family generates sequences of i.i.d. random variables,

(X1n,Y1n,,X|𝒜|n,Y|𝒜|n)(PX1,Y1PX|𝒜|,Y|𝒜|)n.similar-tosuperscriptsubscript𝑋1𝑛superscriptsubscript𝑌1𝑛superscriptsubscript𝑋𝒜𝑛superscriptsubscript𝑌𝒜𝑛superscripttensor-productsubscript𝑃subscript𝑋1subscript𝑌1subscript𝑃subscript𝑋𝒜subscript𝑌𝒜tensor-productabsent𝑛\displaystyle\big{(}X_{1}^{n},Y_{1}^{n},\ldots,X_{|\mathcal{A}|}^{n},Y_{|% \mathcal{A}|}^{n}\big{)}\sim\Big{(}P_{X_{1},Y_{1}}\otimes\ldots\otimes P_{X_{|% \mathcal{A}|},Y_{|\mathcal{A}|}}\Big{)}^{\otimes n}.( italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_Y start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) ∼ ( italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊗ … ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊗ italic_n end_POSTSUPERSCRIPT . (11)

Independent side-information problems correspond to a side-information problem in which the source and the side information are an independent family, as shown in Fig. 4. In the vanishing error regime, the optimal rate linearizes:

R=H(X1,,X|𝒜||Y1,,Y|𝒜|)=a𝒜H(Xa|Ya)=a𝒜Ra.superscript𝑅𝐻subscript𝑋1conditionalsubscript𝑋𝒜subscript𝑌1subscript𝑌𝒜subscript𝑎𝒜𝐻conditionalsubscript𝑋𝑎subscript𝑌𝑎subscript𝑎𝒜superscriptsubscript𝑅𝑎\displaystyle R^{\star}=H\big{(}X_{1},\ldots,X_{|\mathcal{A}|}\big{|}Y_{1},% \ldots,Y_{|\mathcal{A}|}\big{)}=\sum_{a\in\mathcal{A}}H(X_{a}|Y_{a})=\sum_{a% \in\mathcal{A}}R_{a}^{\star}.italic_R start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT = italic_H ( italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT | italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_Y start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_H ( italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | italic_Y start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT . (12)

This property is fundamental because it means that the construction of an optimal codebook results from the concatenation of the codewords of the optimal codebooks for each subproblem.

But does linearization also hold in the zero-error regime for independent side-information problems? To answer this question, we first derive an asymptotic expression for the optimal zero-error rate. This derivation follows from the fact that the independent family can be characterized by a product of graphs.

Proposition 2.

Let (Xa,Ya)a𝒜subscriptsubscript𝑋𝑎subscript𝑌𝑎𝑎𝒜(X_{a},Y_{a})_{a\in\mathcal{A}}( italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT be an independent family. The optimal rate for the independent zero-error side-information problems is

R0=H¯(a𝒜Ga),subscriptsuperscript𝑅0¯𝐻subscript𝑎𝒜subscript𝐺𝑎\displaystyle R^{\star}_{0}=\overline{H}\left(\bigwedge_{a\in\mathcal{A}}G_{a}% \right),italic_R start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = over¯ start_ARG italic_H end_ARG ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , (13)

where for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A, Gasubscript𝐺𝑎G_{a}italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT is the characteristic graph associated to the conditional distribution PYa|Xasubscript𝑃conditionalsubscript𝑌𝑎subscript𝑋𝑎P_{Y_{a}|X_{a}}italic_P start_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT, with the underlying probability distribution PXasubscript𝑃subscript𝑋𝑎P_{X_{a}}italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT on its vertices.

It is known that the complementary graph entropy H¯()¯𝐻\overline{H}(\cdot)over¯ start_ARG italic_H end_ARG ( ⋅ ) is sublinear with respect to the AND product. Indeed, [21, Theorem 2] states that for all probabilistic graphs G𝐺Gitalic_G and Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT

H¯(GG)¯𝐻𝐺superscript𝐺absent\displaystyle\overline{H}(G\wedge G^{\prime})\leqover¯ start_ARG italic_H end_ARG ( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ≤ H¯(G)+H¯(G).¯𝐻𝐺¯𝐻superscript𝐺\displaystyle\,\overline{H}(G)+\overline{H}(G^{\prime}).over¯ start_ARG italic_H end_ARG ( italic_G ) + over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) . (14)

However, H¯()¯𝐻\overline{H}(\cdot)over¯ start_ARG italic_H end_ARG ( ⋅ ) does not linearize in general. Inspired by Haemers result [20], we show in Theorem 19, that the inequality (14) is strict for the Schläfli graph S𝑆Sitalic_S and its complement S¯¯𝑆\overline{S}over¯ start_ARG italic_S end_ARG. In the following, we study a condition that allows for the linearization of H¯()¯𝐻\overline{H}(\cdot)over¯ start_ARG italic_H end_ARG ( ⋅ ), i.e. where (14) holds with equality. To do this, we introduce the disjoint union of graphs, also called “sum of graphs” in [21].

Definition 7 (Disjoint union of probabilistic graphs square-union\sqcup).

Let 𝒜𝒜\mathcal{A}caligraphic_A be a finite set, let PAΔ(𝒜)subscript𝑃𝐴Δ𝒜P_{A}\in\Delta(\mathcal{A})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ), and let Ga=(𝒳a,a,PXa)subscript𝐺𝑎subscript𝒳𝑎subscript𝑎subscript𝑃subscript𝑋𝑎G_{a}=(\mathcal{X}_{a},\mathcal{E}_{a},P_{X_{a}})italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT = ( caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , caligraphic_E start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) be probabilistic graphs, for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A. The disjoint union with respect to PAsubscript𝑃𝐴P_{A}italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT is a probabilistic graph (𝒳,,PX)𝒳subscript𝑃𝑋(\mathcal{X},\mathcal{E},P_{X})( caligraphic_X , caligraphic_E , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) denoted by a𝒜PAGasubscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a}⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT and defined by:

  • -

    𝒳=a𝒜𝒳a𝒳subscriptsquare-union𝑎𝒜subscript𝒳𝑎\mathcal{X}=\bigsqcup_{a\in\mathcal{A}}\mathcal{X}_{a}caligraphic_X = ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT is the disjoint union of the sets (𝒳a)a𝒜subscriptsubscript𝒳𝑎𝑎𝒜(\mathcal{X}_{a})_{a\in\mathcal{A}}( caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT;

  • -

    For all x,x𝒳𝑥superscript𝑥𝒳x,x^{\prime}\in\mathcal{X}italic_x , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_X, xx𝑥superscript𝑥xx^{\prime}\in\mathcal{E}italic_x italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_E if and only if they both belong to the same 𝒳asubscript𝒳𝑎\mathcal{X}_{a}caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT and xxa𝑥superscript𝑥subscript𝑎xx^{\prime}\in\mathcal{E}_{a}italic_x italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_E start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT;

  • -

    PX=a𝒜PA(a)PXasubscript𝑃𝑋subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎P_{X}=\sum_{a\in\mathcal{A}}P_{A}(a)P_{X_{a}}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT, note that the (PXa)a𝒜subscriptsubscript𝑃subscript𝑋𝑎𝑎𝒜(P_{X_{a}})_{a\in\mathcal{A}}( italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT have disjoint support in 𝒳𝒳\mathcal{X}caligraphic_X.

The disjoint union of graphs without probability distribution has the vertex set and edges defined above, without underlying probability distribution.

An example of an AND product and a disjoint union of probabilistc graphs is shown in Fig. 5. Note that, as with the AND product, the complementary graph entropy H¯()¯𝐻\overline{H}(\cdot)over¯ start_ARG italic_H end_ARG ( ⋅ ) is sublinear with respect to the disjoint union. Indeed, [21, Theorem 2] states that for all probabilistic graphs G𝐺Gitalic_G and Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT and s[0,1]𝑠01s\in[0,1]italic_s ∈ [ 0 , 1 ],

H¯(G(s,1s)G)¯𝐻𝐺𝑠1𝑠square-unionsuperscript𝐺absent\displaystyle\overline{H}(G\overset{(s,1-s)}{\sqcup}G^{\prime})\leqover¯ start_ARG italic_H end_ARG ( italic_G start_OVERACCENT ( italic_s , 1 - italic_s ) end_OVERACCENT start_ARG ⊔ end_ARG italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ≤ sH¯(G)+(1s)H¯(G).𝑠¯𝐻𝐺1𝑠¯𝐻𝐺\displaystyle\,s\overline{H}(G)+(1-s)\overline{H}(G).italic_s over¯ start_ARG italic_H end_ARG ( italic_G ) + ( 1 - italic_s ) over¯ start_ARG italic_H end_ARG ( italic_G ) . (15)
1/4141/41 / 41/2121/21 / 21/4141/41 / 4G1=subscript𝐺1absentG_{1}=italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT =1/3131/31 / 32/3232/32 / 3G2=subscript𝐺2absentG_{2}=italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT =1/6161/61 / 61/121121/121 / 121/3131/31 / 31/6161/61 / 61/6161/61 / 61/121121/121 / 12G1G2=subscript𝐺1subscript𝐺2absentG_{1}\wedge G_{2}=italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∧ italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT =1/161161/161 / 161/8181/81 / 81/161161/161 / 161/4141/41 / 41/2121/21 / 2G1(14,34)G2=subscript𝐺11434square-unionsubscript𝐺2absentG_{1}\!\!\!\overset{\left(\frac{1}{4},\frac{3}{4}\right)}{\sqcup}\!\!\!G_{2}=italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_OVERACCENT ( divide start_ARG 1 end_ARG start_ARG 4 end_ARG , divide start_ARG 3 end_ARG start_ARG 4 end_ARG ) end_OVERACCENT start_ARG ⊔ end_ARG italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT =
Figure 5: An empty graph G1=(N3,(14,12,14))subscript𝐺1subscript𝑁3141214G_{1}=(N_{3},(\frac{1}{4},\frac{1}{2},\frac{1}{4}))italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ( italic_N start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT , ( divide start_ARG 1 end_ARG start_ARG 4 end_ARG , divide start_ARG 1 end_ARG start_ARG 2 end_ARG , divide start_ARG 1 end_ARG start_ARG 4 end_ARG ) ) and a complete graph G2=(K2,(13,23))subscript𝐺2subscript𝐾21323G_{2}=(K_{2},(\frac{1}{3},\frac{2}{3}))italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = ( italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ( divide start_ARG 1 end_ARG start_ARG 3 end_ARG , divide start_ARG 2 end_ARG start_ARG 3 end_ARG ) ), along with their AND product G1G2subscript𝐺1subscript𝐺2G_{1}\wedge G_{2}italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∧ italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and their disjoint union G1G2square-unionsubscript𝐺1subscript𝐺2G_{1}\sqcup G_{2}italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ⊔ italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT with respect to (14,34)1434(\frac{1}{4},\frac{3}{4})( divide start_ARG 1 end_ARG start_ARG 4 end_ARG , divide start_ARG 3 end_ARG start_ARG 4 end_ARG ). The underlying distributions are represented by the numbers on each vertex.

We present our first main result, which is that the linearization of the complementary graph entropy with respect to the AND product holds if and only if the linearization holds with respect to the disjoint union. An important consequence of this linearization property is that the concatenation of optimal codes for the subproblems is optimal.

Theorem 4 (Equivalence of the linearization of \wedge and square-union\sqcup).

Let 𝒜𝒜\mathcal{A}caligraphic_A be a finite set, PAsubscript𝑃𝐴P_{A}italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT a distribution with full-support, and let (Ga)a𝒜=(𝒳a,a,PXa)a𝒜subscriptsubscript𝐺𝑎𝑎𝒜subscriptsubscript𝒳𝑎subscript𝑎subscript𝑃subscript𝑋𝑎𝑎𝒜(G_{a})_{a\in\mathcal{A}}=(\mathcal{X}_{a},\mathcal{E}_{a},P_{X_{a}})_{a\in% \mathcal{A}}( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT = ( caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , caligraphic_E start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT a family of probabilistic graphs. The following equivalence holds:

H¯(a𝒜Ga)=a𝒜H¯(Ga),¯𝐻subscript𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜¯𝐻subscript𝐺𝑎\displaystyle\overline{H}\left(\bigwedge_{a\in\mathcal{A}}G_{a}\right)=\sum_{a% \in\mathcal{A}}\overline{H}(G_{a}),over¯ start_ARG italic_H end_ARG ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , (16)
\displaystyle\Longleftrightarrow\quad H¯(a𝒜PAGa)=a𝒜PA(a)H¯(Ga).¯𝐻subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎¯𝐻subscript𝐺𝑎\displaystyle\overline{H}\left(\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a}\right)% =\sum_{a\in\mathcal{A}}P_{A}(a)\overline{H}(G_{a}).over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) . (17)

We say that the linearization property holds if (16) or (17) are satisfied.

An important consequence of this theorem is that it allows the single-letter characterization of optimal rates for new sources, as discussed in Sec. IV. For example, it characterizes the optimal rate for the product of perfect graphs, which is not perfect in general. This is a consequence of the fact that the disjoint product of perfect graphs is perfect, therefore the optimal rates linearize, and from our theorem, the optimal rate for the AND product of perfect graphs is equal to the sum of the rates of each graph.

Without loss of generality, we consider that 𝒜𝒜\mathcal{A}caligraphic_A is the support of PAsubscript𝑃𝐴P_{A}italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT. We observe that (16) does not depend on the distribution PAsubscript𝑃𝐴P_{A}italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT, therefore if (17) holds for a distribution PAsubscript𝑃𝐴P_{A}italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT with full support, then it holds for all distributions with full support. This remark, along with the continuity of the function PAH¯(PA)P_{A}\mapsto\overline{H}(\overset{P_{A}}{\sqcup}\cdot)italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ↦ over¯ start_ARG italic_H end_ARG ( start_OVERACCENT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_OVERACCENT start_ARG ⊔ end_ARG ⋅ ) stated in Lemma 2 below, allows us to state the following corollary.

Corollary 1.

If the linearization properties (16) or (17) hold for a family of probabilistic graphs (Ga)a𝒜=(𝒳a,a,PXa)a𝒜subscriptsubscript𝐺𝑎𝑎𝒜subscriptsubscript𝒳𝑎subscript𝑎subscript𝑃subscript𝑋𝑎𝑎𝒜(G_{a})_{a\in\mathcal{A}}=(\mathcal{X}_{a},\mathcal{E}_{a},P_{X_{a}})_{a\in% \mathcal{A}}( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT = ( caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , caligraphic_E start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT, then (16) and (17) also hold for any subfamily of probabilistic graphs (Ga~)a~𝒜~subscriptsubscript𝐺~𝑎~𝑎~𝒜(G_{\tilde{a}})_{\tilde{a}\in\widetilde{\mathcal{A}}}( italic_G start_POSTSUBSCRIPT over~ start_ARG italic_a end_ARG end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT over~ start_ARG italic_a end_ARG ∈ over~ start_ARG caligraphic_A end_ARG end_POSTSUBSCRIPT with 𝒜~𝒜~𝒜𝒜\widetilde{\mathcal{A}}\subset\mathcal{A}over~ start_ARG caligraphic_A end_ARG ⊂ caligraphic_A.

II-C Sketch of Proof of Theorem 4

The proof of Theorem 4 is stated in App. A. It relies on Lemma 1 which follows from the definition of H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG that involves the AND power of graph, and from the distributivity of \wedge with respect to square-union\sqcup.

Lemma 1.
H¯(a𝒜Unif(𝒜)Ga)=1|𝒜|H¯(a𝒜Ga).¯𝐻superscriptsubscriptsquare-union𝑎𝒜Unif𝒜subscript𝐺𝑎1𝒜¯𝐻subscript𝑎𝒜subscript𝐺𝑎\displaystyle\overline{H}\left(\bigsqcup_{a\in\mathcal{A}}^{\operatorname*{% Unif}(\mathcal{A})}G_{a}\right)=\frac{1}{|\mathcal{A}|}\overline{H}\left(% \bigwedge_{a\in\mathcal{A}}G_{a}\right).over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_Unif ( caligraphic_A ) end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = divide start_ARG 1 end_ARG start_ARG | caligraphic_A | end_ARG over¯ start_ARG italic_H end_ARG ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) . (18)

According to Lemma 1, the complementary graph entropies of the AND product H¯()\overline{H}(\wedge\>\cdot)over¯ start_ARG italic_H end_ARG ( ∧ ⋅ ) and of the disjoint union of graphs H¯()\overline{H}(\sqcup\>\cdot)over¯ start_ARG italic_H end_ARG ( ⊔ ⋅ ) are proportional when PAsubscript𝑃𝐴P_{A}italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT is the uniform distribution. This shows the equivalence between (17) and (16) when PAsubscript𝑃𝐴P_{A}italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT is uniform. In order to extend this argument to all distributions PAΔ(𝒜)subscript𝑃𝐴Δ𝒜P_{A}\in\Delta(\mathcal{A})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ), we need to show the convexity property of the function

η:PAH¯(a𝒜PAGa).:𝜂maps-tosubscript𝑃𝐴¯𝐻superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎\displaystyle\eta:P_{A}\mapsto\overline{H}\Bigg{(}\bigsqcup_{a\in\mathcal{A}}^% {P_{A}}G_{a}\Bigg{)}.italic_η : italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ↦ over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) . (19)
Lemma 2.

The function η𝜂\etaitalic_η is convex and (logmaxa|𝒳a|)subscript𝑎subscript𝒳𝑎(\log\max_{a}|\mathcal{X}_{a}|)( roman_log roman_max start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | )-Lipschitz.

The proof of Lemma 2 is stated in App. A-B. Assume that the linearization (17) holds for the uniform distribution. The convexity property implies that the function η𝜂\etaitalic_η linearizes for all distributions PAΔ(𝒜)subscript𝑃𝐴Δ𝒜P_{A}\in\Delta(\mathcal{A})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ). This argument, together with the result of Lemma 1, shows the proof of Theorem 4.

An important precision is that the complementary graph entropies of the AND product H¯()\overline{H}(\wedge\>\cdot)over¯ start_ARG italic_H end_ARG ( ∧ ⋅ ) and of the disjoint union of graphs H¯()\overline{H}(\sqcup\>\cdot)over¯ start_ARG italic_H end_ARG ( ⊔ ⋅ ) remains proportional on a dense subset of Δ(𝒜)Δ𝒜\Delta(\mathcal{A})roman_Δ ( caligraphic_A ).

Lemma 3.

Let PAΔk(𝒜)subscript𝑃𝐴subscriptΔ𝑘𝒜P_{A}\in\Delta_{k}(\mathcal{A})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( caligraphic_A ) be a type of a sequence of length k𝑘superscriptk\in\mathbb{N}^{\star}italic_k ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT. We have,

H¯(a𝒜PAGa)=1kH¯(a𝒜GakPA(a)).¯𝐻superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎1𝑘¯𝐻subscript𝑎𝒜superscriptsubscript𝐺𝑎𝑘subscript𝑃𝐴𝑎\overline{H}\left(\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}\right)=\frac{1}{k}% \overline{H}\left(\bigwedge_{a\in\mathcal{A}}G_{a}^{\wedge kP_{A}(a)}\right).over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = divide start_ARG 1 end_ARG start_ARG italic_k end_ARG over¯ start_ARG italic_H end_ARG ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_k italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ) . (20)

The proof of Lemma 3 is stated in App. A-A, note that Lemma 1 is a special case of Lemma 3. The equivalence between (16) and (17) is satisfied for all type PAΔk(𝒜)subscript𝑃𝐴subscriptΔ𝑘𝒜P_{A}\in\Delta_{k}(\mathcal{A})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( caligraphic_A ) of sequence of length k𝑘superscriptk\in\mathbb{N}^{\star}italic_k ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT, moreover the function η𝜂\etaitalic_η is continuous on Δ(𝒜)Δ𝒜\Delta(\mathcal{A})roman_Δ ( caligraphic_A ), so this argument also demonstrates the Theorem 4.

II-D Zero-error source coding with side information at the decoder and partial side information at the encoder

In Sec. II-B, we highlighted the significance of the disjoint union of graphs. Indeed, using Theorem 4, we can show the linearization for the disjoint union in order to conclude on the linearization of the AND product. This is simpler due to the fact that the disjoint union has fewer vertices and edges than the AND product. Yet, the usefulness extends further as the disjoint union corresponds to the problem in Fig. 6, where the encoder has partial information about the decoder’s side information, obtained through the deterministic function g:𝒴𝒜:𝑔𝒴𝒜g:\mathcal{Y}\rightarrow\mathcal{A}italic_g : caligraphic_Y → caligraphic_A. This setting in Fig. 6 is a specific case of Fig. 2, equivalent to a source (X,g(Y))𝑋𝑔𝑌(X,g(Y))( italic_X , italic_g ( italic_Y ) ) that the decoder must retrieve. We refer to this as the partial-side-information problem.

EncoderDecoderYnsuperscript𝑌𝑛Y^{n}italic_Y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPTX^nsuperscript^𝑋𝑛\widehat{X}^{n}over^ start_ARG italic_X end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPTAn=g(Yt)tnsuperscript𝐴𝑛𝑔subscriptsubscript𝑌𝑡𝑡𝑛A^{n}=g(Y_{t})_{t\leq n}italic_A start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT = italic_g ( italic_Y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPTXnsuperscript𝑋𝑛X^{n}italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT\diagupR𝑅Ritalic_R
Figure 6: The partial-side-information problem.
Proposition 3.

When the encoder has partial side information, the optimal rate writes

R0=H¯(a𝒜PAGa),subscriptsuperscript𝑅0¯𝐻subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎\displaystyle R^{\star}_{0}=\overline{H}\left(\bigsqcup^{P_{A}}_{a\in\mathcal{% A}}G_{a}\right),italic_R start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , (21)

where for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A, Gasubscript𝐺𝑎G_{a}italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT is the characteristic graph associated to the conditional distribution PY|X,A=asubscript𝑃conditional𝑌𝑋𝐴𝑎P_{Y|X,A=a}italic_P start_POSTSUBSCRIPT italic_Y | italic_X , italic_A = italic_a end_POSTSUBSCRIPT with the underlying probability distribution PX|A=asubscript𝑃conditional𝑋𝐴𝑎P_{X|A=a}italic_P start_POSTSUBSCRIPT italic_X | italic_A = italic_a end_POSTSUBSCRIPT on its vertices. These conditional distributions and PAsubscript𝑃𝐴P_{A}italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT are obtained from the joint distribution PXY1A=g(Y)subscript𝑃𝑋𝑌subscript1𝐴𝑔𝑌P_{XY}\textrm{\dsrom{1}}_{A=g(Y)}italic_P start_POSTSUBSCRIPT italic_X italic_Y end_POSTSUBSCRIPT 1 start_POSTSUBSCRIPT italic_A = italic_g ( italic_Y ) end_POSTSUBSCRIPT depending on the deterministic function g:𝒴𝒜:𝑔𝒴𝒜g:\mathcal{Y}\rightarrow\mathcal{A}italic_g : caligraphic_Y → caligraphic_A.

Indeed, for each realization of the encoder side information a=g(y)𝑎𝑔𝑦a=g(y)italic_a = italic_g ( italic_y ), we construct a characteristic graph Gasubscript𝐺𝑎G_{a}italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT to model the sub-problem indexed by a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A. Since, both encoder and decoder have access to a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A, the characteristic graph consists in the disjoint union the graphs (Ga)a𝒜subscriptsubscript𝐺𝑎𝑎𝒜(G_{a})_{a\in\mathcal{A}}( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT. Moreover, each Gasubscript𝐺𝑎G_{a}italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT contains all realizations x𝒳𝑥𝒳x\in\mathcal{X}italic_x ∈ caligraphic_X, and there is an edge between two vertices x,x𝑥superscript𝑥x,x^{\prime}italic_x , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT if and only if PY|X,a(y|x,a)PY|X,a(y|x,a)>0subscript𝑃conditional𝑌𝑋𝑎conditional𝑦𝑥𝑎subscript𝑃conditional𝑌𝑋𝑎conditional𝑦superscript𝑥𝑎0P_{Y|X,a}(y|x,a)P_{Y|X,a}(y|x^{\prime},a)>0italic_P start_POSTSUBSCRIPT italic_Y | italic_X , italic_a end_POSTSUBSCRIPT ( italic_y | italic_x , italic_a ) italic_P start_POSTSUBSCRIPT italic_Y | italic_X , italic_a end_POSTSUBSCRIPT ( italic_y | italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_a ) > 0 for some yg1(a)𝑦superscript𝑔1𝑎y\in g^{-1}(a)italic_y ∈ italic_g start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_a ).

If the disjoint union linearizes, it reveals that it is optimal to implement the following coding scheme:

  • For each symbol a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A, select the indices t{1,,n}𝑡1𝑛t\in\{1,\ldots,n\}italic_t ∈ { 1 , … , italic_n } of the sequence ansuperscript𝑎𝑛a^{n}italic_a start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT such that at=asubscript𝑎𝑡𝑎a_{t}=aitalic_a start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_a. Denote by (Xna,Yna)superscript𝑋subscript𝑛𝑎superscript𝑌subscript𝑛𝑎(X^{n_{a}},Y^{n_{a}})( italic_X start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , italic_Y start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) the corresponding subsequences extracted from (Xn,Yn)superscript𝑋𝑛superscript𝑌𝑛(X^{n},Y^{n})( italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_Y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) of length nasubscript𝑛𝑎n_{a}italic_n start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT.

  • For each a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A, use the optimal codebook for the independent sources (Xna,Yna)a𝒜subscriptsuperscript𝑋subscript𝑛𝑎superscript𝑌subscript𝑛𝑎𝑎𝒜(X^{n_{a}},Y^{n_{a}})_{a\in\mathcal{A}}( italic_X start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , italic_Y start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT with distinct length of sequences (na)a𝒜subscriptsubscript𝑛𝑎𝑎𝒜(n_{a})_{a\in\mathcal{A}}( italic_n start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT and concatenate the codewords obtained.

With high probability, the sequence Ansuperscript𝐴𝑛A^{n}italic_A start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT belongs to the set of typical sequences 𝒯εn(PA)subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝐴\mathcal{T}^{n}_{\varepsilon}(P_{A})caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ), therefore the empirical distribution (nan)a𝒜subscriptsubscript𝑛𝑎𝑛𝑎𝒜(\frac{n_{a}}{n})_{a\in\mathcal{A}}( divide start_ARG italic_n start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT converges to PAsubscript𝑃𝐴P_{A}italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT in probability. The coding rate of the above scheme converges to

a𝒜PA(a)H¯(Ga)=H¯(a𝒜PAGa),subscript𝑎𝒜subscript𝑃𝐴𝑎¯𝐻subscript𝐺𝑎¯𝐻subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎\displaystyle\sum_{a\in\mathcal{A}}P_{A}(a)\overline{H}(G_{a})=\overline{H}% \left(\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a}\right),∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , (22)

which is optimal for the partial-side-information problem.

III Zero-Error Channel Coding Problem

Recently, Wigderson and Zuiddam in [22] and Schrijver in [23] establish the equivalence between the linearization of the zero-error channel capacity of the AND product C0()C_{0}(\wedge\>\cdot)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ∧ ⋅ ) and of the disjoint union of graphs C0()C_{0}(\sqcup\>\cdot)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⊔ ⋅ ). The main difference with the side information problem is that the channel input distribution PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT is a priory not specified in the zero-error channel coding problem.

linearization of C0()C_{0}(\wedge\>\cdot)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ∧ ⋅ )linearization of C0()C_{0}(\sqcup\>\cdot)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⊔ ⋅ )linearization of C(,P)C(\wedge\>\cdot,P)italic_C ( ∧ ⋅ , italic_P )linearization of C(,P)C(\sqcup\>\cdot,P)italic_C ( ⊔ ⋅ , italic_P )linearization of H¯()\overline{H}(\wedge\>\cdot)over¯ start_ARG italic_H end_ARG ( ∧ ⋅ )linearization of H¯()\overline{H}(\sqcup\>\cdot)over¯ start_ARG italic_H end_ARG ( ⊔ ⋅ )\Longleftrightarrow\Longleftrightarrow\Longleftrightarrow\Longleftrightarrow\Longleftrightarrow\Longleftrightarrow\Longleftrightarrow
Wigderson and Zuiddam 2023
[22, Theorem 4.1]
Schrijver 2023
[23, Theorem 2]
Marton 1993
[26, Lemma 1]
Marton 1993
[26, Lemma 1]
Theorem 4
Proposition 5
Theorem 12
Theorem 13
Figure 7: Equivalences of linearization properties between the zero-error capacity C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, the zero-error capacity relative to a distribution C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ), and the complementary graph entropy H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG. Our contributions are represented in the dashed rectangles.

In this section, we connect these two zero-error problems by exploring the properties of the zero-error capacity C(G,PX)𝐶𝐺subscript𝑃𝑋C(G,P_{X})italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) of a graph G=(𝒳,)𝐺𝒳G=(\mathcal{X},\mathcal{E})italic_G = ( caligraphic_X , caligraphic_E ) relative to a distribution PXΔ(𝒳)subscript𝑃𝑋Δ𝒳P_{X}\in\Delta(\mathcal{X})italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_X ), introduced by Csiszár and Körner [24]. We show the equivalence of the linearization properties of [22, Theorem 4.1] and [23, Theorem 2], and of the linearization properties of Theorem 4, provided that the source distribution maximizes the zero-error capacity relative to a distribution of the AND product of graphs. The path taken to demonstrate these equivalences is shown in Fig. 7.

III-A Zero-error channel capacity

The channel coding problem in Fig. 8 is introduced in [1] in the vanishing error regime, and in [3] in the zero-error regime. We consider a Discrete Memoryless Channel (DMC) that consists of an input alphabet 𝒳𝒳\mathcal{X}caligraphic_X, a finite output alphabet 𝒴𝒴\mathcal{Y}caligraphic_Y and a conditional distribution (a.k.a. transition probability) PY|Xsubscript𝑃conditional𝑌𝑋P_{Y|X}italic_P start_POSTSUBSCRIPT italic_Y | italic_X end_POSTSUBSCRIPT. A (n,𝒞n,ϕd)𝑛subscript𝒞𝑛subscriptitalic-ϕ𝑑(n,\mathcal{C}_{n},\phi_{d})( italic_n , caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_ϕ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT )-code consists of

  • -

    an encoder that selects uniformly a codeword xnsuperscript𝑥𝑛x^{n}italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT from the codebook 𝒞n𝒳nsubscript𝒞𝑛superscript𝒳𝑛\mathcal{C}_{n}\subseteq\mathcal{X}^{n}caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⊆ caligraphic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, and sends it over the DMC,

  • -

    a decoder ϕdsubscriptitalic-ϕ𝑑\phi_{d}italic_ϕ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT that assigns an estimate x^nsuperscript^𝑥𝑛\widehat{x}^{n}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT to each received ynsuperscript𝑦𝑛y^{n}italic_y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT.

The rate of the (n,𝒞n,ϕd)𝑛subscript𝒞𝑛subscriptitalic-ϕ𝑑(n,\mathcal{C}_{n},\phi_{d})( italic_n , caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_ϕ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT )-code is the average number of messages transmitted per channel use, i.e. 1nlog|𝒞n|1𝑛subscript𝒞𝑛\frac{1}{n}\log|\mathcal{C}_{n}|divide start_ARG 1 end_ARG start_ARG italic_n end_ARG roman_log | caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT |, and the probability of error is Pe(n)(X^nXn)approaches-limitsuperscriptsubscript𝑃𝑒𝑛superscript^𝑋𝑛superscript𝑋𝑛P_{e}^{(n)}\doteq\mathbb{P}\big{(}\widehat{X}^{n}\neq X^{n}\big{)}italic_P start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT ≐ blackboard_P ( over^ start_ARG italic_X end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ≠ italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ).

EncoderChannel PY|Xsubscript𝑃conditional𝑌𝑋P_{Y|X}italic_P start_POSTSUBSCRIPT italic_Y | italic_X end_POSTSUBSCRIPTDecoderX^nsuperscript^𝑋𝑛\widehat{X}^{n}over^ start_ARG italic_X end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPTXn𝒞nsuperscript𝑋𝑛subscript𝒞𝑛X^{n}\in\mathcal{C}_{n}italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPTYnsuperscript𝑌𝑛Y^{n}italic_Y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT
Figure 8: The channel coding problem.
Definition 8.

The channel capacity is the maximal rate among all codes that satisfy the ε𝜀{\varepsilon}italic_ε-error property, with ε0𝜀0\varepsilon\rightarrow 0italic_ε → 0:

Climε0sup(n,𝒞n,ϕd):Pe(n)ε1nlog|𝒞n|.approaches-limitsuperscript𝐶subscript𝜀0subscriptsupremum:𝑛subscript𝒞𝑛subscriptitalic-ϕ𝑑superscriptsubscript𝑃𝑒𝑛𝜀1𝑛subscript𝒞𝑛\displaystyle C^{\star}\doteq\lim_{\displaystyle\varepsilon\rightarrow 0}\quad% \,\sup\limits_{\displaystyle(n,\mathcal{C}_{n},\phi_{d}):P_{e}^{(n)}\leq% \varepsilon}\quad\frac{1}{n}\log|\mathcal{C}_{n}|.italic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ≐ roman_lim start_POSTSUBSCRIPT italic_ε → 0 end_POSTSUBSCRIPT roman_sup start_POSTSUBSCRIPT ( italic_n , caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_ϕ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) : italic_P start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT ≤ italic_ε end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG roman_log | caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | . (23)

The zero-error channel capacity is the maximal rate among all coding schemes that satisfy the zero-error property:

C0sup(n,𝒞n,ϕd):Pe(n)=01nlog|𝒞n|.approaches-limitsuperscriptsubscript𝐶0subscriptsupremum:𝑛subscript𝒞𝑛subscriptitalic-ϕ𝑑superscriptsubscript𝑃𝑒𝑛01𝑛subscript𝒞𝑛\displaystyle C_{0}^{\star}\doteq\sup\limits_{\displaystyle(n,\mathcal{C}_{n},% \phi_{d}):P_{e}^{(n)}=0}\quad\frac{1}{n}\log|\mathcal{C}_{n}|.italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ≐ roman_sup start_POSTSUBSCRIPT ( italic_n , caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_ϕ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) : italic_P start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT = 0 end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG roman_log | caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | . (24)
Theorem 5 (from [1]).

The channel capacity (in the vanishing error regime) is

C=CmaxPXΔ(𝒳)I(X;Y).superscript𝐶𝐶approaches-limitsubscriptsubscript𝑃𝑋Δ𝒳𝐼𝑋𝑌\displaystyle C^{\star}=C\doteq\max_{P_{X}\in\Delta(\mathcal{X})}I(X;Y).italic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT = italic_C ≐ roman_max start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_X ) end_POSTSUBSCRIPT italic_I ( italic_X ; italic_Y ) . (25)

In the zero-error regime, the capacity depends on a characteristic graph and its independence number, defined below.

Definition 9 (Independent subset, independence number α𝛼\alphaitalic_α).

Let G=(𝒳,)𝐺𝒳G=(\mathcal{X},\mathcal{E})italic_G = ( caligraphic_X , caligraphic_E ) be a graph. A subset 𝒮𝒳𝒮𝒳\mathcal{S}\subseteq\mathcal{X}caligraphic_S ⊆ caligraphic_X is independent in G𝐺Gitalic_G if xx𝑥superscript𝑥xx^{\prime}\notin\mathcal{E}italic_x italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∉ caligraphic_E for all x,x𝒮𝑥superscript𝑥𝒮x,x^{\prime}\in\mathcal{S}italic_x , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_S. The independence number is the maximal size of an independent set in G𝐺Gitalic_G, and is denoted by α(G)𝛼𝐺\alpha(G)italic_α ( italic_G ).

Theorem 6 (from [3]).

Let G𝐺Gitalic_G be the characteristic graph corresponding to the DMC (𝒳,PY|X,𝒴)𝒳subscript𝑃conditional𝑌𝑋𝒴(\mathcal{X},P_{Y|X},\mathcal{Y})( caligraphic_X , italic_P start_POSTSUBSCRIPT italic_Y | italic_X end_POSTSUBSCRIPT , caligraphic_Y ). The zero-error channel capacity satisfies

C0=C0(G)limn1nlogα(Gn).superscriptsubscript𝐶0subscript𝐶0𝐺approaches-limitsubscript𝑛1𝑛𝛼superscript𝐺𝑛\displaystyle C_{0}^{\star}=C_{0}(G)\doteq\lim_{n\rightarrow\infty}\frac{1}{n}% \log\alpha(G^{\wedge n}).italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT = italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) ≐ roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG roman_log italic_α ( italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT ) . (26)
Remark 1.

Note that, by convention, we define the zero-error capacity with the logarithm. Another existing convention (for example in [27]) for the zero-error capacity is Θ(G)limnα(Gn)napproaches-limitΘ𝐺subscript𝑛𝑛𝛼superscript𝐺𝑛\Theta(G)\doteq\lim_{n\rightarrow\infty}\sqrt[n]{\alpha(G^{\wedge n})}roman_Θ ( italic_G ) ≐ roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT nth-root start_ARG italic_n end_ARG start_ARG italic_α ( italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT ) end_ARG, which is equivalent in the sense that C0=logΘsubscript𝐶0ΘC_{0}=\log\Thetaitalic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = roman_log roman_Θ.

We present in Sec. IV-A some examples from the literature where C0(G)subscript𝐶0𝐺C_{0}(G)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) is known, in particular when G𝐺Gitalic_G is a perfect graph. The Lovász θ𝜃\thetaitalic_θ function, introduced in [27], is an upper bound on the zero-error capacity. This function is used to show that C0(C5)=12log5subscript𝐶0subscript𝐶5125C_{0}(C_{5})=\frac{1}{2}\log 5italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_C start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ) = divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_log 5, which makes C5subscript𝐶5C_{5}italic_C start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT the minimally non-perfect graph for which C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is known. Further observations on the θ𝜃\thetaitalic_θ function are derived by Sason in [28]. The zero-error capacity of C7subscript𝐶7C_{7}italic_C start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT is still unknown. Several existing lower bounds on C0(C7)subscript𝐶0subscript𝐶7C_{0}(C_{7})italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_C start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT ) were found via computer programming, in particular in [29], [30] and [31].

III-B Independent zero-error channel coding problems

To understand why no single-letter exists for the zero-error channel capacity, we study, similarly to Sec. II-B, the case where the channel transition probability decomposes as a product a𝒜PYa|Xasubscripttensor-product𝑎𝒜subscript𝑃conditionalsubscript𝑌𝑎subscript𝑋𝑎\bigotimes_{a\in\mathcal{A}}P_{Y_{a}|X_{a}}⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT, as depicted in Fig. 9. This is called the independent channel coding problems.

Channel PY1|X1subscript𝑃conditionalsubscript𝑌1subscript𝑋1P_{Y_{1}|X_{1}}italic_P start_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPTChannel PY1|X1subscript𝑃conditionalsubscript𝑌1subscript𝑋1P_{Y_{1}|X_{1}}italic_P start_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT\vdotsChannel PY1|X1subscript𝑃conditionalsubscript𝑌1subscript𝑋1P_{Y_{1}|X_{1}}italic_P start_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPTChannel PY|𝒜||X|𝒜|subscript𝑃conditionalsubscript𝑌𝒜subscript𝑋𝒜P_{Y_{|\mathcal{A}|}|X_{|\mathcal{A}|}}italic_P start_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPTDecoderDecoderDecoderEncoderEncoderEncoder(X^1n,,X^|𝒜|n)superscriptsubscript^𝑋1𝑛superscriptsubscript^𝑋𝒜𝑛(\widehat{X}_{1}^{n},...,\widehat{X}_{|\mathcal{A}|}^{n})( over^ start_ARG italic_X end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , … , over^ start_ARG italic_X end_ARG start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT )X1nsuperscriptsubscript𝑋1𝑛X_{1}^{n}italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPTY1nsuperscriptsubscript𝑌1𝑛Y_{1}^{n}italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPTX|𝒜|nsuperscriptsubscript𝑋𝒜𝑛X_{|\mathcal{A}|}^{n}italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPTY|𝒜|nsuperscriptsubscript𝑌𝒜𝑛Y_{|\mathcal{A}|}^{n}italic_Y start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT
Figure 9: Independent channel coding problems: the information is transmitted via |𝒜|𝒜|\mathcal{A}|| caligraphic_A | parallel channels (PYa|Xa)a𝒜subscriptsubscript𝑃conditionalsubscript𝑌𝑎subscript𝑋𝑎𝑎𝒜(P_{Y_{a}|X_{a}})_{a\in\mathcal{A}}( italic_P start_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT.
Proposition 4 (from [3]).

The zero-error capacity of independent channels (PYa|Xa)a𝒜subscriptsubscript𝑃conditionalsubscript𝑌𝑎subscript𝑋𝑎𝑎𝒜(P_{Y_{a}|X_{a}})_{a\in\mathcal{A}}( italic_P start_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT is given by

C0(a𝒜Ga),subscript𝐶0subscript𝑎𝒜subscript𝐺𝑎\displaystyle C_{0}\left(\bigwedge_{a\in\mathcal{A}}G_{a}\right),italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , (27)

where for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A, Gasubscript𝐺𝑎G_{a}italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT is the characteristic graph associated to the conditional distribution PYa|Xasubscript𝑃conditionalsubscript𝑌𝑎subscript𝑋𝑎P_{Y_{a}|X_{a}}italic_P start_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT.

In the vanishing error regime, the capacity of independent channels linearizes since

C=maxPX1,,X|𝒜|I(X1,,X|𝒜|;Y1,,Y|𝒜|)=a𝒜maxPXaI(Xa;Ya)=a𝒜Ca.𝐶subscriptsubscript𝑃subscript𝑋1subscript𝑋𝒜𝐼subscript𝑋1subscript𝑋𝒜subscript𝑌1subscript𝑌𝒜subscript𝑎𝒜subscriptsubscript𝑃subscript𝑋𝑎𝐼subscript𝑋𝑎subscript𝑌𝑎subscript𝑎𝒜subscript𝐶𝑎\displaystyle C=\max_{P_{X_{1},\ldots,X_{|\mathcal{A}|}}}I\big{(}X_{1},\ldots,% X_{|\mathcal{A}|};Y_{1},\ldots,Y_{|\mathcal{A}|}\big{)}=\sum_{a\in\mathcal{A}}% \max_{P_{X_{a}}}I(X_{a};Y_{a})=\sum_{a\in\mathcal{A}}C_{a}.italic_C = roman_max start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_I ( italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT ; italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_Y start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT roman_max start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_I ( italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ; italic_Y start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT . (28)

Therefore, it is optimal to concatenate the optimal codebooks designed for each channel PYa|Xasubscript𝑃conditionalsubscript𝑌𝑎subscript𝑋𝑎P_{Y_{a}|X_{a}}italic_P start_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT.

In the zero-error regime, the capacity is super-linear as shown by Shannon in [3, Theorem 4],

C0(G)+C0(G)subscript𝐶0𝐺subscript𝐶0superscript𝐺absent\displaystyle C_{0}(G)+C_{0}(G^{\prime})\leqitalic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) + italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ≤ C0(GG).subscript𝐶0𝐺superscript𝐺\displaystyle\,C_{0}(G\wedge G^{\prime}).italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) . (29)

Haemers shows in [20] that the inequality (29) is strict for the product of the Schläfli graph S𝑆Sitalic_S and its complementary graph S¯¯𝑆\overline{S}over¯ start_ARG italic_S end_ARG, as stated in Theorem 7. An explicit construction of the Schläfli graph is provided in [32, Sec. 6.1]. As we will see in Sec. IV-C, Haemers’s result relies on a bound on the zero-error capacity based on the rank of the adjacency matrix of the graph. Refinements of this bound are developed by Bukh and Cox in [33], and by Gao et al. in [34].

Theorem 7 (from [20]).

Let S𝑆Sitalic_S be the Schläfli graph and S¯¯𝑆\overline{S}over¯ start_ARG italic_S end_ARG its complementary graph, then

C0(S)+C0(S¯)<C0(SS¯).subscript𝐶0𝑆subscript𝐶0¯𝑆subscript𝐶0𝑆¯𝑆\displaystyle C_{0}(S)+C_{0}(\overline{S})<C_{0}(S\wedge\overline{S}).italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_S ) + italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( over¯ start_ARG italic_S end_ARG ) < italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_S ∧ over¯ start_ARG italic_S end_ARG ) . (30)

Since the linearization property does not hold in general, Wigderson and Zuiddam in [22] and Schrijver in [23], recently established a condition under which capacity linearization holds.

Theorem 8 (from [22, Theorem 4.1] and [23, Theorem 2]).

For all graphs G,G𝐺superscript𝐺G,G^{\prime}italic_G , italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT,

C0(G)+C0(G)=subscript𝐶0𝐺subscript𝐶0superscript𝐺absent\displaystyle C_{0}(G)+C_{0}(G^{\prime})=italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) + italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = C0(GG),subscript𝐶0𝐺superscript𝐺\displaystyle\,C_{0}(G\wedge G^{\prime}),italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) , (31)
log(2C0(G)+2C0(G))=superscript2subscript𝐶0𝐺superscript2subscript𝐶0superscript𝐺absent\displaystyle\Longleftrightarrow\quad\log\left(2^{C_{0}(G)}+2^{C_{0}(G^{\prime% })}\right)=⟺ roman_log ( 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) end_POSTSUPERSCRIPT + 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ) = C0(GG).subscript𝐶0square-union𝐺superscript𝐺\displaystyle\,C_{0}(G\sqcup G^{\prime}).italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ⊔ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) . (32)

We establish the connection between the linearization properties of Theorem 4 and Theorem 8, by using the zero-error capacity C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) of a graph relative to a distribution PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT, introduced by Csiszár and Körner in [24].

III-C Zero-error capacity C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) of a graph relative to a distribution PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT and equivalence of the linearizations of C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) and H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG

Theorems 4 and 8 both establish the equivalence of linearizations for the AND product and disjoint union of graphs but for different quantities: H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG and C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. We will show the equivalence between these two quantities and then deduce the equivalence between the two linearizations. To do this, we study the zero-error capacity C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) of a graph relative to a distribution PXΔ(𝒳)subscript𝑃𝑋Δ𝒳P_{X}\in\Delta(\mathcal{X})italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_X ).

Definition 10.

A sequence of codes (𝒞n)nsubscriptsubscript𝒞𝑛𝑛(\mathcal{C}_{n})_{n}( caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is said to be typical with respect to PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT, or in short PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT-typical, if

maxxn𝒞nTxnPXn0.subscriptsuperscript𝑥𝑛subscript𝒞𝑛subscriptnormsubscript𝑇superscript𝑥𝑛subscript𝑃𝑋𝑛0\displaystyle\max_{x^{n}\in\mathcal{C}_{n}}\|T_{x^{n}}-P_{X}\|_{\infty}% \underset{n\rightarrow\infty}{\rightarrow}0.roman_max start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG 0 . (33)

The zero-error channel capacity relative to the input distribution PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT is the maximal rate among all PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT-typical sequence of codes that satisfy the zero-error property:

C(PX)sup(n,𝒞n,ϕd):(𝒞n)nPX-typical,Pe(n)=01nlog|𝒞n|.approaches-limitsuperscript𝐶subscript𝑃𝑋subscriptsupremum:𝑛subscript𝒞𝑛subscriptitalic-ϕ𝑑subscriptsubscript𝒞𝑛𝑛subscript𝑃𝑋-typicalsuperscriptsubscript𝑃𝑒𝑛01𝑛subscript𝒞𝑛\displaystyle C^{\star}(P_{X})\doteq\sup\limits_{\displaystyle(n,\mathcal{C}_{% n},\phi_{d}):(\mathcal{C}_{n})_{n}\ P_{X}\text{\emph{-typical}},P_{e}^{(n)}=0}% \quad\frac{1}{n}\log|\mathcal{C}_{n}|.italic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) ≐ roman_sup start_POSTSUBSCRIPT ( italic_n , caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_ϕ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) : ( caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT -typical , italic_P start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT = 0 end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG roman_log | caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | . (34)

Csiszár and Körner [24, Eq. (3.2)] derive an asymptotic expression of this capacity, which we review below.

Lemma 4 (from [24]).

The zero-error capacity of the graph G=(𝒳,)𝐺𝒳G=(\mathcal{X},\mathcal{E})italic_G = ( caligraphic_X , caligraphic_E ) relative to PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT is

C(PX)=C(G,PX)limε0lim supn1nlogα(Gn[𝒯εn(PX)]),superscript𝐶subscript𝑃𝑋𝐶𝐺subscript𝑃𝑋approaches-limitsubscript𝜀0subscriptlimit-supremum𝑛1𝑛𝛼superscript𝐺𝑛delimited-[]subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝑋\displaystyle C^{\star}(P_{X})=C(G,P_{X})\doteq\lim_{\varepsilon\rightarrow 0}% \limsup_{n\rightarrow\infty}\frac{1}{n}\log\alpha\big{(}G^{\wedge n}[\mathcal{% T}^{n}_{\varepsilon}(P_{X})]\big{)},italic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) = italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) ≐ roman_lim start_POSTSUBSCRIPT italic_ε → 0 end_POSTSUBSCRIPT lim sup start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG roman_log italic_α ( italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT [ caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) ] ) , (35)

where Gn[𝒯εn(PX)]superscript𝐺𝑛delimited-[]subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝑋G^{\wedge n}[\mathcal{T}^{n}_{\varepsilon}(P_{X})]italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT [ caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) ] is the subgraph of Gnsuperscript𝐺𝑛G^{\wedge n}italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT induced by the set of typical sequences 𝒯εn(PX)subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝑋\mathcal{T}^{n}_{\varepsilon}(P_{X})caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) with tolerance ε>0𝜀0\varepsilon>0italic_ε > 0.

The superior limit in (35) can be replaced by a regular limit, thanks to the superadditivity of the sequence
(1nlogα(Gn[𝒯εn(PX)]))nsubscript1𝑛𝛼superscript𝐺𝑛delimited-[]subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝑋𝑛superscript\big{(}\frac{1}{n}\log\alpha\big{(}G^{\wedge n}[\mathcal{T}^{n}_{\varepsilon}(% P_{X})]\big{)}\big{)}_{n\in\mathbb{N}^{\star}}( divide start_ARG 1 end_ARG start_ARG italic_n end_ARG roman_log italic_α ( italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT [ caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) ] ) ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT.

In [26, Lemma 1], Marton established the connection between the complementary graph entropy H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG and C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ).

Theorem 9 (from [26, Lemma 1]).

Given the graph G=(𝒳,)𝐺𝒳G=(\mathcal{X},\mathcal{E})italic_G = ( caligraphic_X , caligraphic_E ) with the probability distribution PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT,

C(G,PX)+H¯(G)=H(X).𝐶𝐺subscript𝑃𝑋¯𝐻𝐺𝐻𝑋\displaystyle C(G,P_{X})+\overline{H}(G)=H(X).italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) + over¯ start_ARG italic_H end_ARG ( italic_G ) = italic_H ( italic_X ) . (36)

We can interpret the formula in Theorem 9 in the following way. The quantities H¯(G)¯𝐻𝐺\overline{H}(G)over¯ start_ARG italic_H end_ARG ( italic_G ) and C(G,PX)𝐶𝐺subscript𝑃𝑋C(G,P_{X})italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) are respectively the minimum number of colors, and the maximum size of an independent set. A color class, i.e. vertices of the same color, is an independent subset of vertices: in the case with same-sized color classes we would need nlogα(G)𝑛𝛼𝐺n\log\alpha(G)italic_n roman_log italic_α ( italic_G ) bits to describe the source sequence in its color. Therefore, C(G,PX)𝐶𝐺subscript𝑃𝑋C(G,P_{X})italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) can be seen as the average number of bits needed to describe the index of the source sequence in its color class. These two quantities sum up to H(X)𝐻𝑋H(X)italic_H ( italic_X ), which is the information needed to describe the source sequence with zero-error. Equation (36) can be seen as an analog for zero-error regime of the formula I(X;Y)+H(X|Y)=H(X)𝐼𝑋𝑌𝐻conditional𝑋𝑌𝐻𝑋I(X;Y)+H(X|Y)=H(X)italic_I ( italic_X ; italic_Y ) + italic_H ( italic_X | italic_Y ) = italic_H ( italic_X ).

We establish below the connection between the linearization properties of H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG and of C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ), where the equivalences in (38) and (40) follow from Marton’s formula in Theorem 9, and the equivalence (39) from Theorem 4. The complete proof is in App. B.

Proposition 5.

Let 𝒜𝒜\mathcal{A}caligraphic_A be a finite set, PAsubscript𝑃𝐴P_{A}italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT a distribution with full-support, and let (Ga)a𝒜=(𝒳a,a,PXa)a𝒜subscriptsubscript𝐺𝑎𝑎𝒜subscriptsubscript𝒳𝑎subscript𝑎subscript𝑃subscript𝑋𝑎𝑎𝒜(G_{a})_{a\in\mathcal{A}}=(\mathcal{X}_{a},\mathcal{E}_{a},P_{X_{a}})_{a\in% \mathcal{A}}( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT = ( caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , caligraphic_E start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT a family of probabilistic graphs. The following equivalence holds:

C(a𝒜PAGa,a𝒜PA(a)PXa)=H(PA)+a𝒜PA(a)C(Ga,PXa)𝐶subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎\displaystyle C\left(\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a},\;\sum_{a\in% \mathcal{A}}P_{A}(a)P_{X_{a}}\right)=H(P_{A})+\sum_{a\in\mathcal{A}}P_{A}(a)C(% G_{a},P_{X_{a}})italic_C ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (37)
\displaystyle\Longleftrightarrow\; H¯(a𝒜PAGa)=a𝒜PA(a)H¯(Ga)¯𝐻subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎¯𝐻subscript𝐺𝑎\displaystyle\overline{H}\left(\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a}\right)% =\sum_{a\in\mathcal{A}}P_{A}(a)\overline{H}(G_{a})over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) (38)
\displaystyle\Longleftrightarrow\; H¯(a𝒜Ga)=a𝒜H¯(Ga)¯𝐻subscript𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜¯𝐻subscript𝐺𝑎\displaystyle\overline{H}\left(\bigwedge_{a\in\mathcal{A}}G_{a}\right)=\sum_{a% \in\mathcal{A}}\overline{H}(G_{a})over¯ start_ARG italic_H end_ARG ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) (39)
\displaystyle\Longleftrightarrow\; C(a𝒜Ga,a𝒜PXa)=a𝒜C(Ga,PXa).𝐶subscript𝑎𝒜subscript𝐺𝑎subscripttensor-product𝑎𝒜subscript𝑃subscript𝑋𝑎subscript𝑎𝒜𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎\displaystyle C\left(\bigwedge_{a\in\mathcal{A}}G_{a},\;\bigotimes_{a\in% \mathcal{A}}P_{X_{a}}\right)=\sum_{a\in\mathcal{A}}C(G_{a},P_{X_{a}}).italic_C ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) . (40)

III-D Capacity achieving distributions and equivalence of the linearizations of C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG for the AND product

From the previous equivalences, we present our second main contribution which shows the equivalence between the linearization properties of H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG and C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. A key element is the set of input distributions PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT that achieve the zero-error capacity in the sense that C0(G)=C(G,PX)subscript𝐶0𝐺𝐶𝐺subscript𝑃𝑋C_{0}(G)=C(G,P_{X})italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) = italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ). As in the vanishing error regime, it seems optimal to consider codebooks composed of codewords that are typical with respect to the input distribution PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT that maximizes C(G,PX)𝐶𝐺subscript𝑃𝑋C(G,P_{X})italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ).

Theorem 10 (from [25, Theorem 2]).

For all graph G=(𝒳,)𝐺𝒳G=(\mathcal{X},\mathcal{E})italic_G = ( caligraphic_X , caligraphic_E ),

C0(G)=maxPXΔ(𝒳)C(G,PX).subscript𝐶0𝐺subscriptsubscript𝑃𝑋Δ𝒳𝐶𝐺subscript𝑃𝑋\displaystyle C_{0}(G)=\max_{P_{X}\in\Delta(\mathcal{X})}C(G,P_{X}).italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) = roman_max start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_X ) end_POSTSUBSCRIPT italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) . (41)

The result of [25, Theorem 2], see also [7, Theorem 11.22], is stated for the Sperner capacity of a family of directed graphs, while the statement of [35, Theorem 13.68] is specific to the zero-error capacity of a family of graphs. For the sake of completeness, in App. C we provide a proof for Theorem 10 that does not rely on directed graphs. As a consequence of [26, Lemma 1] and of [25, Theorem 2], the zero-error capacity reformulates

C0(G)=maxPXΔ(𝒳)(H(X)H¯(G)).subscript𝐶0𝐺subscriptsubscript𝑃𝑋Δ𝒳𝐻𝑋¯𝐻𝐺\displaystyle C_{0}(G)=\max_{P_{X}\in\Delta(\mathcal{X})}\Big{(}H(X)-\overline% {H}(G)\Big{)}.italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) = roman_max start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_X ) end_POSTSUBSCRIPT ( italic_H ( italic_X ) - over¯ start_ARG italic_H end_ARG ( italic_G ) ) . (42)

In order to show the equivalence of the linearization properties between C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ), we define the set of capacity-achieving distributions.

Definition 11.

Let G=(𝒳,)𝐺𝒳G=(\mathcal{X},\mathcal{E})italic_G = ( caligraphic_X , caligraphic_E ) be a graph. The set of capacity-achieving distributions of G𝐺Gitalic_G is the subset of Δ(𝒳)Δ𝒳\Delta(\mathcal{X})roman_Δ ( caligraphic_X ) defined by

𝒫(G)argmaxPXΔ(𝒳)C(G,PX).approaches-limitsuperscript𝒫𝐺subscriptsubscript𝑃𝑋Δ𝒳𝐶𝐺subscript𝑃𝑋\displaystyle\mathcal{P}^{\star}(G)\doteq\operatorname*{\arg\!\max}_{P_{X}\in% \Delta(\mathcal{X})}\;C(G,P_{X}).caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G ) ≐ start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_X ) end_POSTSUBSCRIPT italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) . (43)
Proposition 6.

For all graphs G𝐺Gitalic_G, the map** PXC(G,PX)maps-tosubscript𝑃𝑋𝐶𝐺subscript𝑃𝑋P_{X}\mapsto C(G,P_{X})italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ↦ italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) is concave and the set of capacity-achieving distributions 𝒫(G)superscript𝒫𝐺\mathcal{P}^{\star}(G)caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G ) is convex, nonempty.

The proof of Proposition 6 is stated in App. D, and relies on Theorem 10.

The following Theorem is essential for demonstrating the equivalence of the linearizations depicted in Fig. 7. It establishes that if a joint distribution achieves capacity, then the product of its marginals also achieves it.

Theorem 11.

If PX1,,X|𝒜|𝒫(a𝒜Ga)subscript𝑃subscript𝑋1subscript𝑋𝒜superscript𝒫subscript𝑎𝒜subscript𝐺𝑎P_{X_{1},...,X_{|\mathcal{A}|}}\in\mathcal{P}^{\star}(\bigwedge_{a\in\mathcal{% A}}G_{a})italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ), then a𝒜PXa𝒫(a𝒜Ga)subscripttensor-product𝑎𝒜subscript𝑃subscript𝑋𝑎superscript𝒫subscript𝑎𝒜subscript𝐺𝑎\bigotimes_{a\in\mathcal{A}}P_{X_{a}}\in\mathcal{P}^{\star}(\bigwedge_{a\in% \mathcal{A}}G_{a})⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ).

The proof of Theorem 11 is stated in App. E and relies on a codebook shifting argument: given a codebook composed of codewords (x1n,x2n)superscriptsubscript𝑥1𝑛superscriptsubscript𝑥2𝑛(x_{1}^{n},x_{2}^{n})( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) that are typical with respect to the joint distribution PX1,X2subscript𝑃subscript𝑋1subscript𝑋2P_{X_{1},X_{2}}italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, we construct a set of permuted codebooks by applying a cyclic permutation only to the first component x1nsuperscriptsubscript𝑥1𝑛x_{1}^{n}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT of each codeword. We concatenate all the permuted codebooks and we replicate them n𝑛nitalic_n times so that the codewords length is equal to n=n3superscript𝑛superscript𝑛3n^{\prime}=n^{3}italic_n start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT. Then, we remove the codewords (x1n,x2n)superscriptsubscript𝑥1superscript𝑛superscriptsubscript𝑥2superscript𝑛(x_{1}^{n^{\prime}},x_{2}^{n^{\prime}})( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT , italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ) that are not typical with respect to the product of marginal distributions PXPXtensor-productsubscript𝑃𝑋subscript𝑃superscript𝑋P_{X}\otimes P_{X^{\prime}}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT. We show that this construction has the same rate and preserves the zero-error property. However, it modifies the types of the codewords, which become the product of marginals as wished.

We can now establish the equivalence of the linearizations of C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) for the AND product.

Theorem 12.

Let 𝒜𝒜\mathcal{A}caligraphic_A be a finite set, and (Ga)a𝒜=(𝒳a,a)a𝒜subscriptsubscript𝐺𝑎𝑎𝒜subscriptsubscript𝒳𝑎subscript𝑎𝑎𝒜(G_{a})_{a\in\mathcal{A}}=(\mathcal{X}_{a},\mathcal{E}_{a})_{a\in\mathcal{A}}( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT = ( caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , caligraphic_E start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT be a family of graphs. The following equivalence holds:

C0(a𝒜Ga)=a𝒜C0(Ga)subscript𝐶0subscript𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscript𝐶0subscript𝐺𝑎\displaystyle C_{0}\left(\bigwedge_{a\in\mathcal{A}}G_{a}\right)=\sum_{a\in% \mathcal{A}}C_{0}(G_{a})italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) (44)
\displaystyle\Longleftrightarrow\qquad PX1,,X|𝒜|𝒫(a𝒜Ga),C(a𝒜Ga,PX1,,X|𝒜|)=a𝒜C(Ga,PXa).formulae-sequencesubscript𝑃subscript𝑋1subscript𝑋𝒜superscript𝒫subscript𝑎𝒜subscript𝐺𝑎𝐶subscript𝑎𝒜subscript𝐺𝑎subscript𝑃subscript𝑋1subscript𝑋𝒜subscript𝑎𝒜𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎\displaystyle\exists P_{X_{1},...,X_{|\mathcal{A}|}}\in\mathcal{P}^{\star}% \left(\bigwedge_{a\in\mathcal{A}}G_{a}\right)\!,\qquad C\left(\bigwedge_{a\in% \mathcal{A}}G_{a},\;P_{X_{1},...,X_{|\mathcal{A}|}}\right)=\sum_{a\in\mathcal{% A}}C(G_{a},P_{X_{a}}).∃ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , italic_C ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) . (45)

Furthermore, any distribution PX1,,X|𝒜|𝒫(a𝒜Ga)subscript𝑃subscript𝑋1subscript𝑋𝒜superscript𝒫subscript𝑎𝒜subscript𝐺𝑎P_{X_{1},...,X_{|\mathcal{A}|}}\in\mathcal{P}^{\star}\left(\bigwedge_{a\in% \mathcal{A}}G_{a}\right)italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) that satisfies (45) also satisfies PXa𝒫(Ga)subscript𝑃subscript𝑋𝑎superscript𝒫subscript𝐺𝑎P_{X_{a}}\in\mathcal{P}^{\star}(G_{a})italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A.

The proof of Theorem 12 is given in App. G.

III-E Linearization of the sum of independent channels

Similarly to the side-information problem, see Sec. II-D, the disjoint union of graphs introduced in Proposition 5 for zero-error channel coding has an operational interpretation as a sum of channels, as depicted in Fig. 10. At each time step tn𝑡𝑛t\leq nitalic_t ≤ italic_n, the encoder uses one channel atsubscript𝑎𝑡a_{t}italic_a start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT among the |𝒜|𝒜|\mathcal{A}|| caligraphic_A | channels. The decoder observes an output, deduces the chosen channel, and retrieves its input. Since the output alphabets of each individual channel are disjoint, the channel output symbol uniquely identifies the channel that is used.

Channel PY1|X1subscript𝑃conditionalsubscript𝑌1subscript𝑋1P_{Y_{1}|X_{1}}italic_P start_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPTChannel PY1|X1subscript𝑃conditionalsubscript𝑌1subscript𝑋1P_{Y_{1}|X_{1}}italic_P start_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT\vdotsChannel PY1|X1subscript𝑃conditionalsubscript𝑌1subscript𝑋1P_{Y_{1}|X_{1}}italic_P start_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPTChannel PY|𝒜||X|𝒜|subscript𝑃conditionalsubscript𝑌𝒜subscript𝑋𝒜P_{Y_{|\mathcal{A}|}|X_{|\mathcal{A}|}}italic_P start_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPTDecoderDecoderDecoderEncoderEncoderEncoder(X^at)tnsubscriptsubscript^𝑋subscript𝑎𝑡𝑡𝑛(\widehat{X}_{a_{t}})_{t\leq n}( over^ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPTX1subscript𝑋1X_{1}italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPTY1subscript𝑌1Y_{1}italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPTX|𝒜|subscript𝑋𝒜X_{|\mathcal{A}|}italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPTY|𝒜|subscript𝑌𝒜Y_{|\mathcal{A}|}italic_Y start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT
Figure 10: Sum of the |𝒜|𝒜|\mathcal{A}|| caligraphic_A | channels (PYa|Xa)a𝒜subscriptsubscript𝑃conditionalsubscript𝑌𝑎subscript𝑋𝑎𝑎𝒜(P_{Y_{a}|X_{a}})_{a\in\mathcal{A}}( italic_P start_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT: only the channel at𝒜subscript𝑎𝑡𝒜a_{t}\in\mathcal{A}italic_a start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ caligraphic_A is used at instant t{1,,n}𝑡1𝑛t\in\{1,\ldots,n\}italic_t ∈ { 1 , … , italic_n }.

In the vanishing error regime, the linearization of the capacity for the sum of channels holds since

C=log(a𝒜2Ca),𝐶subscript𝑎𝒜superscript2subscript𝐶𝑎\displaystyle C=\log\bigg{(}\sum_{a\in\mathcal{A}}2^{C_{a}}\bigg{)},italic_C = roman_log ( ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) , (46)

where CamaxPXaI(Xa;Ya)approaches-limitsubscript𝐶𝑎subscriptsubscript𝑃subscript𝑋𝑎𝐼subscript𝑋𝑎subscript𝑌𝑎C_{a}\doteq\max_{P_{X_{a}}}I(X_{a};Y_{a})italic_C start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ≐ roman_max start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_I ( italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ; italic_Y start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) is the capacity of the channel PYa|Xasubscript𝑃conditionalsubscript𝑌𝑎subscript𝑋𝑎P_{Y_{a}|X_{a}}italic_P start_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT.

Proposition 7 (from [3]).

The zero-error capacity of the sum of channels is given by

C0(a𝒜Ga).subscript𝐶0subscriptsquare-union𝑎𝒜subscript𝐺𝑎\displaystyle C_{0}\left(\bigsqcup_{a\in\mathcal{A}}G_{a}\right).italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) . (47)

In the zero-error regime, Shannon in [3, Theorem 4] shows that

G,G,log(2C0(G)+2C0(G))for-all𝐺superscript𝐺superscript2subscript𝐶0𝐺superscript2subscript𝐶0superscript𝐺absent\displaystyle\forall G,G^{\prime},\quad\log\left(2^{C_{0}(G)}+2^{C_{0}(G^{% \prime})}\right)\leq∀ italic_G , italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , roman_log ( 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) end_POSTSUPERSCRIPT + 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ) ≤ C0(GG).subscript𝐶0square-union𝐺superscript𝐺\displaystyle\,C_{0}(G\sqcup G^{\prime}).italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ⊔ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) . (48)

For the sum of channels, a natural coding scheme consists in using the optimal codebooks for each channel in a time sharing manner, with respect to the distribution PAsubscript𝑃𝐴P_{A}italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT that maximizes H(PA)+a𝒜PA(a)C0(Ga)𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝐶0subscript𝐺𝑎H(P_{A})+\sum_{a\in\mathcal{A}}P_{A}(a)C_{0}(G_{a})italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ). In other words, with this strategy, communicating over the sum channel is equivalent to sending 2 types of information: one related to identifying the chosen channel, H(PA)𝐻subscript𝑃𝐴H(P_{A})italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ), and the other to the information on this channel, C0(Ga)subscript𝐶0subscript𝐺𝑎C_{0}(G_{a})italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ).

Lemma 5.

The map** PAH(PA)+a𝒜PA(a)C0(Ga)maps-tosubscript𝑃𝐴𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝐶0subscript𝐺𝑎P_{A}\mapsto H(P_{A})+\sum_{a\in\mathcal{A}}P_{A}(a)C_{0}(G_{a})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ↦ italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) has a unique maximum

PA(2C0(Ga)a𝒜2C0(Ga))a𝒜,approaches-limitsubscriptsuperscript𝑃𝐴subscriptsuperscript2subscript𝐶0subscript𝐺𝑎subscriptsuperscript𝑎𝒜superscript2subscript𝐶0subscript𝐺superscript𝑎𝑎𝒜\displaystyle P^{\star}_{A}\doteq\left(\frac{2^{C_{0}(G_{a})}}{\sum_{a^{\prime% }\in\mathcal{A}}2^{C_{0}(G_{a^{\prime}})}}\right)_{a\in\mathcal{A}},italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ≐ ( divide start_ARG 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT , (49)

which gives

H(PA)+a𝒜PA(a)C0(Ga)𝐻subscriptsuperscript𝑃𝐴subscript𝑎𝒜subscriptsuperscript𝑃𝐴𝑎subscript𝐶0subscript𝐺𝑎\displaystyle H(P^{\star}_{A})+\sum_{a\in\mathcal{A}}P^{\star}_{A}(a)C_{0}(G_{% a})italic_H ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) =log(a𝒜2C0(Ga)).absentsubscriptsuperscript𝑎𝒜superscript2subscript𝐶0subscript𝐺superscript𝑎\displaystyle=\log\left(\sum_{a^{\prime}\in\mathcal{A}}2^{C_{0}(G_{a^{\prime}}% )}\right).= roman_log ( ∑ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) . (50)

The proof of Lemma 5 is given in App. F-B and relies on the fact that the function (wa)a𝒜log(a𝒜2wa)maps-tosubscriptsubscript𝑤𝑎𝑎𝒜subscript𝑎𝒜superscript2subscript𝑤𝑎(w_{a})_{a\in\mathcal{A}}\mapsto\log\left(\sum_{a\in\mathcal{A}}2^{w_{a}}\right)( italic_w start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT ↦ roman_log ( ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) is the Legendre-Fenchel conjugate [36] of the entropy function PAH(PA)maps-tosubscript𝑃𝐴𝐻subscript𝑃𝐴P_{A}\mapsto H(P_{A})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ↦ italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ).

We consider the time-sharing strategy between the optimal codebooks along with the distribution PAΔ(𝒜)subscriptsuperscript𝑃𝐴Δ𝒜P^{\star}_{A}\in\Delta(\mathcal{A})italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ) defined in (49). If this strategy is optimal, then

C0(a𝒜Ga)=log(a𝒜2C0(Ga)),subscript𝐶0subscriptsquare-union𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜superscript2subscript𝐶0subscript𝐺𝑎\displaystyle C_{0}\left(\bigsqcup_{a\in\mathcal{A}}G_{a}\right)=\log\left(% \sum_{a\in\mathcal{A}}2^{C_{0}(G_{a})}\right),italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = roman_log ( ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) , (51)

which means that the linearization property holds for the disjoint union of graphs.

Remark 2.

Note that PAΔ(𝒜)subscriptsuperscript𝑃𝐴Δ𝒜P^{\star}_{A}\in\Delta(\mathcal{A})italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ) is full-support: it can be observed PAH(PA)maps-tosubscript𝑃𝐴𝐻subscript𝑃𝐴P_{A}\mapsto H(P_{A})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ↦ italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) has an infinite slope at the frontier of Δ(𝒜)Δ𝒜\Delta(\mathcal{A})roman_Δ ( caligraphic_A ), consequently the maximizer of PAH(PA)+a𝒜PA(a)C0(Ga)maps-tosubscript𝑃𝐴𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝐶0subscript𝐺𝑎P_{A}\mapsto H(P_{A})+\sum_{a\in\mathcal{A}}P_{A}(a)C_{0}(G_{a})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ↦ italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) is always an interior point. In other words, the information carried by the channel index H(PA)𝐻subscript𝑃𝐴H(P_{A})italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) offsets the loss in rate, if the channels with smaller capacities are not chosen too often. Therefore, in the sum of channels setting, always choosing the channel with highest capacity is suboptimal, and never choosing a channel is also suboptimal, even if this channel has zero-error capacity equal to 00.

Similar to Theorem 12, we establish the equivalence between the linearization property between C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) for the disjoint union of a family of graphs (Ga)a𝒜subscriptsubscript𝐺𝑎𝑎𝒜(G_{a})_{a\in\mathcal{A}}( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT.

Theorem 13.

The following equivalence holds

C0(a𝒜Ga)=log(a𝒜2C0(Ga))subscript𝐶0subscriptsquare-union𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜superscript2subscript𝐶0subscript𝐺𝑎\displaystyle C_{0}\left(\bigsqcup_{a\in\mathcal{A}}G_{a}\right)=\log\left(% \sum_{a\in\mathcal{A}}2^{C_{0}(G_{a})}\right)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = roman_log ( ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) (52)
\displaystyle\Longleftrightarrow\qquad PX𝒫(a𝒜Ga),C(a𝒜PAGa,a𝒜PA(a)PXa)=H(PA)+a𝒜PA(a)C(Ga,PXa),formulae-sequencesubscript𝑃𝑋superscript𝒫subscriptsquare-union𝑎𝒜subscript𝐺𝑎𝐶subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎\displaystyle\exists P_{X}\in\mathcal{P}^{\star}\left(\bigsqcup_{a\in\mathcal{% A}}G_{a}\right)\!,\qquad C\left(\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a},\;% \sum_{a\in\mathcal{A}}P_{A}(a)P_{X_{a}}\right)=H(P_{A})+\sum_{a\in\mathcal{A}}% P_{A}(a)C(G_{a},P_{X_{a}}),∃ italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , italic_C ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) , (53)

where PXa=PX|X𝒳asubscript𝑃subscript𝑋𝑎subscript𝑃conditional𝑋𝑋subscript𝒳𝑎P_{X_{a}}=P_{X|X\in\mathcal{X}_{a}}italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_P start_POSTSUBSCRIPT italic_X | italic_X ∈ caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT and PA(a)=PX(𝒳a)subscript𝑃𝐴𝑎subscript𝑃𝑋subscript𝒳𝑎P_{A}(a)=P_{X}(\mathcal{X}_{a})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) = italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A. Furthermore, any a𝒜PA(a)PXasubscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎\sum_{a\in\mathcal{A}}P_{A}(a)P_{X_{a}}∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT that satisfies (53) also satisfies the following for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A:

PA(a)=2C0(Ga)a𝒜2C0(Ga), and PXa𝒫(Ga).formulae-sequencesubscript𝑃𝐴𝑎superscript2subscript𝐶0subscript𝐺𝑎subscriptsuperscript𝑎𝒜superscript2subscript𝐶0subscript𝐺superscript𝑎 and subscript𝑃subscript𝑋𝑎superscript𝒫subscript𝐺𝑎\displaystyle P_{A}(a)=\frac{2^{C_{0}(G_{a})}}{\sum_{a^{\prime}\in\mathcal{A}}% 2^{C_{0}(G_{a^{\prime}})}},\text{ and }P_{X_{a}}\in\mathcal{P}^{\star}(G_{a}).italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) = divide start_ARG 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG , and italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) . (54)

The proof of Theorem 13 is given in App. H. This result relies on Lemma 5 which proves that the distribution PA(a)=2C0(Ga)a𝒜2C0(Ga)subscript𝑃𝐴𝑎superscript2subscript𝐶0subscript𝐺𝑎subscriptsuperscript𝑎𝒜superscript2subscript𝐶0subscript𝐺superscript𝑎P_{A}(a)=\frac{2^{C_{0}(G_{a})}}{\sum_{a^{\prime}\in\mathcal{A}}2^{C_{0}(G_{a^% {\prime}})}}italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) = divide start_ARG 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG maximizes PAH(PA)+a𝒜PA(a)C0(Ga)maps-tosubscript𝑃𝐴𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝐶0subscript𝐺𝑎P_{A}\mapsto H(P_{A})+\sum_{a\in\mathcal{A}}P_{A}(a)C_{0}(G_{a})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ↦ italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ).

Remark 3.

One could think of a possible strategy for proving Theorem 13, which is successively using the equivalences in Theorem 8, Theorem 12, and Proposition 5. However, doing so yields the following statement

C0(a𝒜Ga)=log(a𝒜2C0(Ga))subscript𝐶0subscriptsquare-union𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜superscript2subscript𝐶0subscript𝐺𝑎\displaystyle\textstyle C_{0}\left(\textstyle\bigsqcup_{a\in\mathcal{A}}G_{a}% \right)=\log\left(\textstyle\sum_{a\in\mathcal{A}}2^{C_{0}(G_{a})}\right)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = roman_log ( ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) (55)
\displaystyle\Longleftrightarrow\; PAΔ(𝒜) full-support,PX1,,X|𝒜|𝒫(a𝒜Ga),formulae-sequencesubscript𝑃𝐴Δ𝒜 full-supportsubscript𝑃subscript𝑋1subscript𝑋𝒜superscript𝒫subscript𝑎𝒜subscript𝐺𝑎\displaystyle\exists P_{A}\in\Delta(\mathcal{A})\text{ full-support},\;\exists P% _{X_{1},...,X_{|\mathcal{A}|}}\in\mathcal{P}^{\star}\left(\textstyle\bigwedge_% {a\in\mathcal{A}}G_{a}\right)\!,∃ italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ) full-support , ∃ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , (56)
C(a𝒜PAGa,a𝒜PA(a)PXa)=H(PA)+a𝒜PA(a)C(Ga,PXa),𝐶subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎\displaystyle\textstyle C\left(\textstyle\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_% {a},\;\sum_{a\in\mathcal{A}}P_{A}(a)P_{X_{a}}\right)=H(P_{A})+\sum_{a\in% \mathcal{A}}P_{A}(a)C(G_{a},P_{X_{a}}),italic_C ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ,

where it remains to link the sets of capacity achieving distributions 𝒫(a𝒜Ga)superscript𝒫subscriptsquare-union𝑎𝒜subscript𝐺𝑎\mathcal{P}^{\star}\left(\bigsqcup_{a\in\mathcal{A}}G_{a}\right)caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) and 𝒫(a𝒜Ga)superscript𝒫subscript𝑎𝒜subscript𝐺𝑎\mathcal{P}^{\star}\left(\bigwedge_{a\in\mathcal{A}}G_{a}\right)caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ).

Theorem 13 together with Theorem 12, Theorem 8 and Proposition 5, establish the equivalence of the linearization property between C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) and H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG for the AND product \wedge, and for the disjoint union square-union\sqcup of a family of graphs (Ga)a𝒜subscriptsubscript𝐺𝑎𝑎𝒜(G_{a})_{a\in\mathcal{A}}( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT, as depicted in Fig. 7.

IV Main Example and Counterexamples for the Linearization of Optimal Rates

In this section, we exploit the equivalences in the linearization depicted in Fig. 7, in order to provide single-letter characterization of H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG and C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT for several new classes of graphs.

IV-A Perfect graphs

We show that perfect graphs allow for linearization of C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) and H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG with respect to both square-union\sqcup and \wedge with any underlying probability distribution. Perfect graphs are one of the only known examples of graphs with a single-letter formula for H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG and C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. Theorem. 4 allows us to provide new single-letter characterization for C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) and H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG for products of perfect graphs, which are not perfect in general.

Definition 12 (Graph complement, clique number ω𝜔\omegaitalic_ω).

For all G=(𝒳,)𝐺𝒳G=(\mathcal{X},\mathcal{E})italic_G = ( caligraphic_X , caligraphic_E ), the complementary graph of G𝐺Gitalic_G is defined by G¯(𝒳,c)approaches-limit¯𝐺𝒳superscript𝑐\overline{G}\doteq(\mathcal{X},\mathcal{E}^{c})over¯ start_ARG italic_G end_ARG ≐ ( caligraphic_X , caligraphic_E start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ). The clique number of G𝐺Gitalic_G is defined by ω(G)α(G¯)approaches-limit𝜔𝐺𝛼¯𝐺\omega(G)\doteq\alpha(\overline{G})italic_ω ( italic_G ) ≐ italic_α ( over¯ start_ARG italic_G end_ARG ), where the independence number α𝛼\alphaitalic_α is stated in Definition 9.

Definition 13 (Perfect graph).

A graph G=(𝒳,)𝐺𝒳G=(\mathcal{X},\mathcal{E})italic_G = ( caligraphic_X , caligraphic_E ) is perfect if for all subset of vertices 𝒮𝒳,χ(G[𝒮])=ω(G[𝒮])formulae-sequence𝒮𝒳𝜒𝐺delimited-[]𝒮𝜔𝐺delimited-[]𝒮\mathcal{S}\subseteq\mathcal{X},\;\chi(G[\mathcal{S}])=\omega(G[\mathcal{S}])caligraphic_S ⊆ caligraphic_X , italic_χ ( italic_G [ caligraphic_S ] ) = italic_ω ( italic_G [ caligraphic_S ] ). A probabilistic graph (𝒳,,PX)𝒳subscript𝑃𝑋(\mathcal{X},\mathcal{E},P_{X})( caligraphic_X , caligraphic_E , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) is perfect if (𝒳,)𝒳(\mathcal{X},\mathcal{E})( caligraphic_X , caligraphic_E ) is perfect.

A remarkable property of perfect graphs is their single-letter characterizations for zero-error problems. For example, as stated in Theorem 14, for the side-information problem, the optimal rate H¯(G)¯𝐻𝐺\overline{H}(G)over¯ start_ARG italic_H end_ARG ( italic_G ) equals the Körner graph entropy, defined below and introduced in [19], when G𝐺Gitalic_G is a perfect graph.

Definition 14 (Körner graph entropy Hκsubscript𝐻𝜅H_{\kappa}italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT).

For all G=(𝒳,,PX)𝐺𝒳subscript𝑃𝑋G=(\mathcal{X},\mathcal{E},P_{X})italic_G = ( caligraphic_X , caligraphic_E , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ), let Γ(G)Γ𝐺\Gamma(G)roman_Γ ( italic_G ) be the collection of independent sets of vertices in G𝐺Gitalic_G. The Körner graph entropy of G𝐺Gitalic_G is defined by

Hκ(G)=minXWΓ(G)I(W;X),subscript𝐻𝜅𝐺subscript𝑋𝑊Γ𝐺𝐼𝑊𝑋\displaystyle H_{\kappa}(G)=\min_{X\in W\in\Gamma(G)}I(W;X),italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( italic_G ) = roman_min start_POSTSUBSCRIPT italic_X ∈ italic_W ∈ roman_Γ ( italic_G ) end_POSTSUBSCRIPT italic_I ( italic_W ; italic_X ) , (57)

where the minimum is taken over all distributions PW|Xsubscript𝑃conditional𝑊𝑋P_{W|X}italic_P start_POSTSUBSCRIPT italic_W | italic_X end_POSTSUBSCRIPT with the constraint that the random vertex X𝑋Xitalic_X belongs to the random independent set W𝑊Witalic_W with probability one, i.e. XWΓ(G)𝑋𝑊Γ𝐺X\in W\in\Gamma(G)italic_X ∈ italic_W ∈ roman_Γ ( italic_G ) in (57).

Theorem 14 (from [37, Corollary 12]).

Let G𝐺Gitalic_G be a perfect probabilistic graph, then

H¯(G)=Hκ(G).¯𝐻𝐺subscript𝐻𝜅𝐺\displaystyle\overline{H}(G)=H_{\kappa}(G).over¯ start_ARG italic_H end_ARG ( italic_G ) = italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( italic_G ) . (58)

Similarly, single-letter characterization holds for the zero-error capacity of perfect graphs, as stated below. This is a consequence of a more general result due to Shannon (see [3, Theorem 3]) that states that a graph G𝐺Gitalic_G whose vertex set can be partitioned into α(G)𝛼𝐺\alpha(G)italic_α ( italic_G ) cliques, i.e. complete induced subgraphs, satisfies C0(G)=logα(G)subscript𝐶0𝐺𝛼𝐺C_{0}(G)=\log\alpha(G)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) = roman_log italic_α ( italic_G ). Perfect graphs satisfy this property as their complementary is also perfect, and satisfy χ(G¯)=ω(G¯)=α(G)𝜒¯𝐺𝜔¯𝐺𝛼𝐺\chi(\overline{G})=\omega(\overline{G})=\alpha(G)italic_χ ( over¯ start_ARG italic_G end_ARG ) = italic_ω ( over¯ start_ARG italic_G end_ARG ) = italic_α ( italic_G ), where ω(G¯)𝜔¯𝐺\omega(\overline{G})italic_ω ( over¯ start_ARG italic_G end_ARG ) is the clique number, see [9, pp. 382].

Theorem 15 (from [3, Theorem 3]).

If G𝐺Gitalic_G is a perfect graph, then C0(G)=logα(G)subscript𝐶0𝐺𝛼𝐺C_{0}(G)=\log\alpha(G)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) = roman_log italic_α ( italic_G ).

We now derive single-letter characterizations of C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ), and H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG for graphs where these quantities were previously unknown. These characterizations result from the linearization theorems: [22], [23] for C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, and Theorem 4 for C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) and H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG.

More precisely, consider some perfect graphs. Their disjoint union is perfect, as shown in Lemma 19, and C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT linearizes: since C0(GG)=logα(GG)=log(α(G)+α(G))=log(2C0(G)+2C0(G))subscript𝐶0square-union𝐺superscript𝐺𝛼square-union𝐺superscript𝐺𝛼𝐺𝛼superscript𝐺superscript2subscript𝐶0𝐺superscript2subscript𝐶0superscript𝐺C_{0}(G\sqcup G^{\prime})=\log\alpha(G\sqcup G^{\prime})=\log(\alpha(G)+\alpha% (G^{\prime}))=\log(2^{C_{0}(G)}+2^{C_{0}(G^{\prime})})italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ⊔ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = roman_log italic_α ( italic_G ⊔ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = roman_log ( italic_α ( italic_G ) + italic_α ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ) = roman_log ( 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) end_POSTSUPERSCRIPT + 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ) holds for all perfect graphs G,G𝐺superscript𝐺G,G^{\prime}italic_G , italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. According to the linearization theorems [22], [23], since C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT of the disjoint union linearizes, so does C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT of the AND product: C0(GG)=C0(G)+C0(G)subscript𝐶0𝐺superscript𝐺subscript𝐶0𝐺subscript𝐶0superscript𝐺C_{0}(G\wedge G^{\prime})=C_{0}(G)+C_{0}(G^{\prime})italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) + italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ). This leads to the following proposition.

Proposition 8.

Let G𝐺Gitalic_G and Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT be perfect graphs, then

C0(GG)=log(2C0(G)+2C0(G))=log(α(G)+α(G)),subscript𝐶0square-union𝐺superscript𝐺superscript2subscript𝐶0𝐺superscript2subscript𝐶0superscript𝐺𝛼𝐺𝛼superscript𝐺\displaystyle C_{0}(G\sqcup G^{\prime})=\log\left(2^{C_{0}(G)}+2^{C_{0}(G^{% \prime})}\right)=\log(\alpha(G)+\alpha(G^{\prime})),italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ⊔ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = roman_log ( 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) end_POSTSUPERSCRIPT + 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ) = roman_log ( italic_α ( italic_G ) + italic_α ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ) , (59)
C0(GG)=C0(G)+C0(G)=logα(G)+logα(G).subscript𝐶0𝐺superscript𝐺subscript𝐶0𝐺subscript𝐶0superscript𝐺𝛼𝐺𝛼superscript𝐺\displaystyle C_{0}(G\wedge G^{\prime})=C_{0}(G)+C_{0}(G^{\prime})=\log\alpha(% G)+\log\alpha(G^{\prime}).italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) + italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = roman_log italic_α ( italic_G ) + roman_log italic_α ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) . (60)

According to the previous proposition, C0(GG)subscript𝐶0𝐺superscript𝐺C_{0}(G\wedge G^{\prime})italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) can now be computed for any pair of perfect graphs, as it linearizes. This result was previously unknown, as the AND product of perfect graphs is not necessarily perfect. For instance, cycle graphs C6subscript𝐶6C_{6}italic_C start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT and C8subscript𝐶8C_{8}italic_C start_POSTSUBSCRIPT 8 end_POSTSUBSCRIPT are perfect (due to the strong perfect graph theorem, mentioned below), but their AND product is not (also due to the strong perfect graph theorem); it contains an odd cycle of length 7, illustrated in Fig. 11.

0,00,10,20,30,40,51,01,11,21,31,41,52,02,12,22,32,42,53,03,13,23,33,43,54,04,14,24,34,44,55,05,15,25,35,45,56,06,16,26,36,46,57,07,17,27,37,47,5
Figure 11: A non-perfect AND product of perfect graphs: C6C8subscript𝐶6subscript𝐶8C_{6}\wedge C_{8}italic_C start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT ∧ italic_C start_POSTSUBSCRIPT 8 end_POSTSUBSCRIPT with an induced C7subscript𝐶7C_{7}italic_C start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT.
Theorem 16 (Strong perfect graph theorem, from [38, Theorem 1.2]).

A graph G𝐺Gitalic_G is perfect if and only if neither G𝐺Gitalic_G nor G¯¯𝐺\overline{G}over¯ start_ARG italic_G end_ARG have an induced odd cycle of length at least 5.

Similarly, we show that the linearization property of C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) and H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG holds for perfect graphs and for all underlying probability distributions, and we provide new single-letter expression for H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG, C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) in that case.

Theorem 17.

When (Ga)a𝒜=(𝒳a,a,PXa)a𝒜subscriptsubscript𝐺𝑎𝑎𝒜subscriptsubscript𝒳𝑎subscript𝑎subscript𝑃subscript𝑋𝑎𝑎𝒜(G_{a})_{a\in\mathcal{A}}=(\mathcal{X}_{a},\mathcal{E}_{a},P_{X_{a}})_{a\in% \mathcal{A}}( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT = ( caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , caligraphic_E start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT is a family of perfect probabilistic graphs, we have the following single-letter characterizations:

H¯(a𝒜Ga)¯𝐻subscript𝑎𝒜subscript𝐺𝑎\displaystyle\overline{H}\left(\bigwedge_{a\in\mathcal{A}}G_{a}\right)over¯ start_ARG italic_H end_ARG ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) =a𝒜H¯(Ga)=a𝒜Hκ(Ga),absentsubscript𝑎𝒜¯𝐻subscript𝐺𝑎subscript𝑎𝒜subscript𝐻𝜅subscript𝐺𝑎\displaystyle=\sum_{a\in\mathcal{A}}\overline{H}(G_{a})=\sum_{a\in\mathcal{A}}% H_{\kappa}(G_{a}),= ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , (61)
H¯(a𝒜PAGa)¯𝐻superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎\displaystyle\overline{H}\left(\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}\right)over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) =a𝒜PA(a)H¯(Ga)=a𝒜PA(a)Hκ(Ga),absentsubscript𝑎𝒜subscript𝑃𝐴𝑎¯𝐻subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝐻𝜅subscript𝐺𝑎\displaystyle=\sum_{a\in\mathcal{A}}P_{A}(a)\overline{H}(G_{a})=\sum_{a\in% \mathcal{A}}P_{A}(a)H_{\kappa}(G_{a}),= ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , (62)
C(a𝒜Ga,a𝒜PXa)𝐶subscript𝑎𝒜subscript𝐺𝑎subscripttensor-product𝑎𝒜subscript𝑃subscript𝑋𝑎\displaystyle C\left(\bigwedge_{a\in\mathcal{A}}G_{a},\;\bigotimes_{a\in% \mathcal{A}}P_{X_{a}}\right)italic_C ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) =a𝒜C(Ga,PXa)=a𝒜(H(Xa)Hκ(Ga)),absentsubscript𝑎𝒜𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎subscript𝑎𝒜𝐻subscript𝑋𝑎subscript𝐻𝜅subscript𝐺𝑎\displaystyle=\sum_{a\in\mathcal{A}}C(G_{a},P_{X_{a}})=\sum_{a\in\mathcal{A}}% \big{(}H(X_{a})-H_{\kappa}(G_{a})\big{)},= ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT ( italic_H ( italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) - italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) ) , (63)
C(a𝒜PAGa,a𝒜PA(a)PXa)𝐶superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎\displaystyle C\left(\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a},\;\sum_{a\in% \mathcal{A}}P_{A}(a)P_{X_{a}}\right)italic_C ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) =H(PA)+a𝒜PA(a)C(Ga,PXa)absent𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎\displaystyle=H(P_{A})+\sum_{a\in\mathcal{A}}P_{A}(a)C(G_{a},P_{X_{a}})= italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT )
=H(PA)+a𝒜PA(a)(H(Xa)Hκ(Ga)).absent𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎𝐻subscript𝑋𝑎subscript𝐻𝜅subscript𝐺𝑎\displaystyle=H(P_{A})+\sum_{a\in\mathcal{A}}P_{A}(a)\Big{(}H(X_{a})-H_{\kappa% }(G_{a})\Big{)}.= italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) ( italic_H ( italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) - italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) ) . (64)

The proof of Theorem 17 is given in App. I. An an example, we consider below the AND product of the cycle graphs C6subscript𝐶6C_{6}italic_C start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT and C8subscript𝐶8C_{8}italic_C start_POSTSUBSCRIPT 8 end_POSTSUBSCRIPT.

Corollary 2.

Consider the cycle graphs C6subscript𝐶6C_{6}italic_C start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT and C8subscript𝐶8C_{8}italic_C start_POSTSUBSCRIPT 8 end_POSTSUBSCRIPT and denote by PX6subscript𝑃subscript𝑋6P_{X_{6}}italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT end_POSTSUBSCRIPT and PX8subscript𝑃subscript𝑋8P_{X_{8}}italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 8 end_POSTSUBSCRIPT end_POSTSUBSCRIPT the probability distributions on the vertices. We have

H¯(C6C8)=¯𝐻subscript𝐶6subscript𝐶8absent\displaystyle\overline{H}(C_{6}\wedge C_{8})=over¯ start_ARG italic_H end_ARG ( italic_C start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT ∧ italic_C start_POSTSUBSCRIPT 8 end_POSTSUBSCRIPT ) = Hκ(C6)+Hκ(C8),subscript𝐻𝜅subscript𝐶6subscript𝐻𝜅subscript𝐶8\displaystyle H_{\kappa}(C_{6})+H_{\kappa}(C_{8}),italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( italic_C start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT ) + italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( italic_C start_POSTSUBSCRIPT 8 end_POSTSUBSCRIPT ) , (65)
C(C6C8,PX6PX8)=𝐶subscript𝐶6subscript𝐶8tensor-productsubscript𝑃subscript𝑋6subscript𝑃subscript𝑋8absent\displaystyle C(C_{6}\wedge C_{8},P_{X_{6}}\otimes P_{X_{8}})=italic_C ( italic_C start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT ∧ italic_C start_POSTSUBSCRIPT 8 end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 8 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = H(PX6)Hκ(C6)+H(PX8)Hκ(C8).𝐻subscript𝑃subscript𝑋6subscript𝐻𝜅subscript𝐶6𝐻subscript𝑃subscript𝑋8subscript𝐻𝜅subscript𝐶8\displaystyle H(P_{X_{6}})-H_{\kappa}(C_{6})+H(P_{X_{8}})-H_{\kappa}(C_{8}).italic_H ( italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) - italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( italic_C start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT ) + italic_H ( italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 8 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) - italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( italic_C start_POSTSUBSCRIPT 8 end_POSTSUBSCRIPT ) . (66)

We now explore the combination of a perfect graph with a non-perfect graph. More specifically, we consider the graph C5Gsquare-unionsubscript𝐶5𝐺C_{5}\sqcup Gitalic_C start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ⊔ italic_G where G𝐺Gitalic_G is perfect, for which the linearization of H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG was studied by Tuncel et al. in [21]. The pentagon graph C5subscript𝐶5C_{5}italic_C start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT is not perfect, thereby making any disjoint union or AND product involving it non-perfect. However, Theorem 4 provides a non-perfect example where the linearization property holds, offering a single-letter characterization of H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG for the class of graphs C5Gsubscript𝐶5𝐺C_{5}\wedge Gitalic_C start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ∧ italic_G, where G𝐺Gitalic_G is perfect.

Theorem 18 (from [21, Lemma 3]).

Let s[0,1]𝑠01s\in[0,1]italic_s ∈ [ 0 , 1 ], let G𝐺Gitalic_G be a perfect probabilistic graph, and let G5(C5,Unif({0,,4}))approaches-limitsubscript𝐺5subscript𝐶5Unif04G_{5}\doteq(C_{5},\operatorname*{Unif}(\{0,...,4\}))italic_G start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ≐ ( italic_C start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT , roman_Unif ( { 0 , … , 4 } ) ), we have

H¯(G5(s,1s)G)¯𝐻subscript𝐺5𝑠1𝑠square-union𝐺\displaystyle\overline{H}(G_{5}\overset{(s,1-s)}{\sqcup}G)over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT start_OVERACCENT ( italic_s , 1 - italic_s ) end_OVERACCENT start_ARG ⊔ end_ARG italic_G ) =sH¯(G5)+(1s)H¯(G)absent𝑠¯𝐻subscript𝐺51𝑠¯𝐻𝐺\displaystyle=s\overline{H}(G_{5})+(1-s)\overline{H}(G)= italic_s over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ) + ( 1 - italic_s ) over¯ start_ARG italic_H end_ARG ( italic_G ) (67)
=s2log5+(1s)Hκ(G).absent𝑠251𝑠subscript𝐻𝜅𝐺\displaystyle=\frac{s}{2}\log 5+(1-s)H_{\kappa}(G).= divide start_ARG italic_s end_ARG start_ARG 2 end_ARG roman_log 5 + ( 1 - italic_s ) italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( italic_G ) . (68)
Corollary 3.

For all perfect probabilistic graph G𝐺Gitalic_G,

H¯(GG5)=H¯(G)+H¯(G5)=Hκ(G)+12log5.¯𝐻𝐺subscript𝐺5¯𝐻𝐺¯𝐻subscript𝐺5subscript𝐻𝜅𝐺125\displaystyle\overline{H}(G\wedge G_{5})=\overline{H}(G)+\overline{H}(G_{5})=H% _{\kappa}(G)+\textstyle\frac{1}{2}\log 5.over¯ start_ARG italic_H end_ARG ( italic_G ∧ italic_G start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ) = over¯ start_ARG italic_H end_ARG ( italic_G ) + over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ) = italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( italic_G ) + divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_log 5 . (69)

IV-B Vertex transitive graphs

We study the importance class of vertex-transitive graphs, where all the vertices of the graph play the same “role”, and show that the uniform distribution is capacity achieving for these graphs.

Definition 15 (Vertex-transitive graph).

An automorphism of a graph G=(𝒳,)𝐺𝒳G=(\mathcal{X},\mathcal{E})italic_G = ( caligraphic_X , caligraphic_E ) is a bijection ψ:𝒳𝒳:𝜓𝒳𝒳\psi:\mathcal{X}\rightarrow\mathcal{X}italic_ψ : caligraphic_X → caligraphic_X such that for all v,x𝒳𝑣superscript𝑥𝒳v,x^{\prime}\in\mathcal{X}italic_v , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_X, vx𝑣superscript𝑥vx^{\prime}\in\mathcal{E}italic_v italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_E if and only if ψ(x)ψ(x)𝜓𝑥𝜓superscript𝑥\psi(x)\psi(x^{\prime})\in\mathcal{E}italic_ψ ( italic_x ) italic_ψ ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∈ caligraphic_E. The group of automorphisms of G is denoted by Aut(G)Aut𝐺\operatorname*{Aut}(G)roman_Aut ( italic_G ).

A graph G=(𝒳,)𝐺𝒳G=(\mathcal{X},\mathcal{E})italic_G = ( caligraphic_X , caligraphic_E ) is vertex-transitive if Aut(G)Aut𝐺\operatorname*{Aut}(G)roman_Aut ( italic_G ) acts transitively on its vertices, i.e. for all v,x𝒳𝑣superscript𝑥𝒳v,x^{\prime}\in\mathcal{X}italic_v , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_X, there exists ψAut(G)𝜓Aut𝐺\psi\in\operatorname*{Aut}(G)italic_ψ ∈ roman_Aut ( italic_G ) such that ψ(x)=x𝜓𝑥superscript𝑥\psi(x)=x^{\prime}italic_ψ ( italic_x ) = italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT.

Proposition 9.

If G=(𝒳,)𝐺𝒳G=(\mathcal{X},\mathcal{E})italic_G = ( caligraphic_X , caligraphic_E ) is vertex-transitive, then

Unif(𝒳)𝒫(G).Unif𝒳superscript𝒫𝐺\displaystyle\operatorname*{Unif}(\mathcal{X})\in\mathcal{P}^{\star}(G).roman_Unif ( caligraphic_X ) ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G ) . (70)

The proof of Proposition 9 is given in App. F-A.

Corollary 4.

Let (Ga)a𝒜=(𝒳a,a)a𝒜subscriptsubscript𝐺𝑎𝑎𝒜subscriptsubscript𝒳𝑎subscript𝑎𝑎𝒜(G_{a})_{a\in\mathcal{A}}=(\mathcal{X}_{a},\mathcal{E}_{a})_{a\in\mathcal{A}}( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT = ( caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , caligraphic_E start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT be vertex-transitive graphs, their product is also vertex-transitive and

Unif(a𝒜𝒳a)=a𝒜Unif(𝒳a)𝒫(a𝒜Ga).Unifsubscriptproduct𝑎𝒜subscript𝒳𝑎subscripttensor-product𝑎𝒜Unifsubscript𝒳𝑎superscript𝒫subscript𝑎𝒜subscript𝐺𝑎\displaystyle\operatorname*{Unif}\left(\prod_{a\in\mathcal{A}}\mathcal{X}_{a}% \right)=\bigotimes_{a\in\mathcal{A}}\operatorname*{Unif}(\mathcal{X}_{a})\in% \mathcal{P}^{\star}\left(\bigwedge_{a\in\mathcal{A}}G_{a}\right).roman_Unif ( ∏ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT roman_Unif ( caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) . (71)

IV-C The Schläfli graph

We now study the important case of the Schläfli graph S𝑆Sitalic_S as it offers a counterexample for the linearizations of all quantities studied in this paper. In [20], Haemers showed that the linearization property does not hold for the product of the Schläfli graph S𝑆Sitalic_S with its complement S¯¯𝑆\overline{S}over¯ start_ARG italic_S end_ARG,

C0(S)+C0(S¯)<C0(SS¯).subscript𝐶0𝑆subscript𝐶0¯𝑆subscript𝐶0𝑆¯𝑆\displaystyle C_{0}(S)+C_{0}(\overline{S})<C_{0}(S\wedge\overline{S}).italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_S ) + italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( over¯ start_ARG italic_S end_ARG ) < italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_S ∧ over¯ start_ARG italic_S end_ARG ) . (72)

More specifically, Haemers shows that C0(S)=log2(3)subscript𝐶0𝑆subscript23C_{0}(S)=\log_{2}(3)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_S ) = roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 3 ), C0(S¯)log2(7)subscript𝐶0¯𝑆subscript27C_{0}(\overline{S})\leq\log_{2}(7)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( over¯ start_ARG italic_S end_ARG ) ≤ roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 7 ) and log2(27)C0(SS¯)subscript227subscript𝐶0𝑆¯𝑆\log_{2}(27)\leq C_{0}(S\wedge\overline{S})roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 27 ) ≤ italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_S ∧ over¯ start_ARG italic_S end_ARG ). In this section, we show that a similar conclusion holds for C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) and for H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG.

According to [39, Lemma 3.7], the Schläfli graph S𝑆Sitalic_S and its complement S¯¯𝑆\overline{S}over¯ start_ARG italic_S end_ARG are vertex transitive, as well as their product SS¯𝑆¯𝑆S\wedge\overline{S}italic_S ∧ over¯ start_ARG italic_S end_ARG. By Proposition 9, the uniform distribution is capacity-achieving for S𝑆Sitalic_S, S¯¯𝑆\overline{S}over¯ start_ARG italic_S end_ARG, and SS¯𝑆¯𝑆S\wedge\overline{S}italic_S ∧ over¯ start_ARG italic_S end_ARG.

Corollary 5.

Consider the Schläfli graph S𝑆Sitalic_S and its complement S¯¯𝑆\overline{S}over¯ start_ARG italic_S end_ARG. Then,

C(S,Unif(𝒳S))=C0(S),C(S¯,Unif(𝒳S))=C0(S¯),formulae-sequence𝐶𝑆Unifsubscript𝒳𝑆subscript𝐶0𝑆𝐶¯𝑆Unifsubscript𝒳𝑆subscript𝐶0¯𝑆\displaystyle C(S,\operatorname*{Unif}(\mathcal{X}_{S}))=C_{0}(S),\qquad C(% \overline{S},\operatorname*{Unif}(\mathcal{X}_{S}))=C_{0}(\overline{S}),italic_C ( italic_S , roman_Unif ( caligraphic_X start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ) ) = italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_S ) , italic_C ( over¯ start_ARG italic_S end_ARG , roman_Unif ( caligraphic_X start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ) ) = italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( over¯ start_ARG italic_S end_ARG ) , (73)
C(SS¯,Unif(𝒳S)Unif(𝒳S¯))=C0(SS¯).𝐶𝑆¯𝑆tensor-productUnifsubscript𝒳𝑆Unifsubscript𝒳¯𝑆subscript𝐶0𝑆¯𝑆\displaystyle C(S\wedge\overline{S},\operatorname*{Unif}(\mathcal{X}_{S})% \otimes\operatorname*{Unif}(\mathcal{X}_{\overline{S}}))=C_{0}(S\wedge% \overline{S}).italic_C ( italic_S ∧ over¯ start_ARG italic_S end_ARG , roman_Unif ( caligraphic_X start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ) ⊗ roman_Unif ( caligraphic_X start_POSTSUBSCRIPT over¯ start_ARG italic_S end_ARG end_POSTSUBSCRIPT ) ) = italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_S ∧ over¯ start_ARG italic_S end_ARG ) . (74)

In Theorem 19, we extend Haemers’s results of [20] and we show that the linearization property does not hold for C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) and H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG when the distribution on the vertices is uniform. By using Lemma 1, we also equalize H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG, and similarly C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, for the AND product SS¯𝑆¯𝑆S\wedge\overline{S}italic_S ∧ over¯ start_ARG italic_S end_ARG and for the disjoint union SS¯square-union𝑆¯𝑆S\sqcup\overline{S}italic_S ⊔ over¯ start_ARG italic_S end_ARG, up to a certain constant.

Theorem 19.

Let s(0,1)𝑠01s\in(0,1)italic_s ∈ ( 0 , 1 ), let S𝑆Sitalic_S be the Schläfli graph and S¯¯𝑆\overline{S}over¯ start_ARG italic_S end_ARG its complementary, with uniform distributions on their vertices. Then,

C(SS¯,Unif(𝒳S)Unif(𝒳S¯))>𝐶𝑆¯𝑆tensor-productUnifsubscript𝒳𝑆Unifsubscript𝒳¯𝑆absent\displaystyle C(S\wedge\overline{S},\operatorname*{Unif}(\mathcal{X}_{S})% \otimes\operatorname*{Unif}(\mathcal{X}_{\overline{S}}))>\>italic_C ( italic_S ∧ over¯ start_ARG italic_S end_ARG , roman_Unif ( caligraphic_X start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ) ⊗ roman_Unif ( caligraphic_X start_POSTSUBSCRIPT over¯ start_ARG italic_S end_ARG end_POSTSUBSCRIPT ) ) > C(S,Unif(𝒳S))+C(S¯,Unif(𝒳S¯)),𝐶𝑆Unifsubscript𝒳𝑆𝐶¯𝑆Unifsubscript𝒳¯𝑆\displaystyle C(S,\operatorname*{Unif}(\mathcal{X}_{S}))+C(\overline{S},% \operatorname*{Unif}(\mathcal{X}_{\overline{S}})),italic_C ( italic_S , roman_Unif ( caligraphic_X start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ) ) + italic_C ( over¯ start_ARG italic_S end_ARG , roman_Unif ( caligraphic_X start_POSTSUBSCRIPT over¯ start_ARG italic_S end_ARG end_POSTSUBSCRIPT ) ) , (75)
C(SS¯,sUnif(𝒳S)+(1s)Unif(𝒳S¯))>𝐶square-union𝑆¯𝑆𝑠Unifsubscript𝒳𝑆1𝑠Unifsubscript𝒳¯𝑆absent\displaystyle C(S\sqcup\overline{S},s\operatorname*{Unif}(\mathcal{X}_{S})+(1-% s)\operatorname*{Unif}(\mathcal{X}_{\overline{S}}))>\>italic_C ( italic_S ⊔ over¯ start_ARG italic_S end_ARG , italic_s roman_Unif ( caligraphic_X start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ) + ( 1 - italic_s ) roman_Unif ( caligraphic_X start_POSTSUBSCRIPT over¯ start_ARG italic_S end_ARG end_POSTSUBSCRIPT ) ) > hb(s)+sC(S,Unif(𝒳S))+(1s)C(S¯,Unif(𝒳S¯)),subscript𝑏𝑠𝑠𝐶𝑆Unifsubscript𝒳𝑆1𝑠𝐶¯𝑆Unifsubscript𝒳¯𝑆\displaystyle h_{b}(s)+sC(S,\operatorname*{Unif}(\mathcal{X}_{S}))+(1-s)C(% \overline{S},\operatorname*{Unif}(\mathcal{X}_{\overline{S}})),italic_h start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ( italic_s ) + italic_s italic_C ( italic_S , roman_Unif ( caligraphic_X start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ) ) + ( 1 - italic_s ) italic_C ( over¯ start_ARG italic_S end_ARG , roman_Unif ( caligraphic_X start_POSTSUBSCRIPT over¯ start_ARG italic_S end_ARG end_POSTSUBSCRIPT ) ) , (76)
H¯(SS¯)<¯𝐻𝑆¯𝑆absent\displaystyle\overline{H}(S\wedge\overline{S})<\>over¯ start_ARG italic_H end_ARG ( italic_S ∧ over¯ start_ARG italic_S end_ARG ) < H¯(S)+H¯(S¯),¯𝐻𝑆¯𝐻¯𝑆\displaystyle\overline{H}(S)+\overline{H}(\overline{S}),over¯ start_ARG italic_H end_ARG ( italic_S ) + over¯ start_ARG italic_H end_ARG ( over¯ start_ARG italic_S end_ARG ) , (77)
H¯(S(s,1s)S¯)<¯𝐻𝑆𝑠1𝑠square-union¯𝑆absent\displaystyle\overline{H}(S\overset{(s,1-s)}{\sqcup}\overline{S})<\>over¯ start_ARG italic_H end_ARG ( italic_S start_OVERACCENT ( italic_s , 1 - italic_s ) end_OVERACCENT start_ARG ⊔ end_ARG over¯ start_ARG italic_S end_ARG ) < sH¯(S)+(1s)H¯(S¯);𝑠¯𝐻𝑆1𝑠¯𝐻¯𝑆\displaystyle s\overline{H}(S)+(1-s)\overline{H}(\overline{S});italic_s over¯ start_ARG italic_H end_ARG ( italic_S ) + ( 1 - italic_s ) over¯ start_ARG italic_H end_ARG ( over¯ start_ARG italic_S end_ARG ) ; (78)

where hbsubscript𝑏h_{b}italic_h start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT is the binary entropy. Moreover, we have

H¯(S(12,12)S¯)=¯𝐻𝑆1212square-union¯𝑆absent\displaystyle\overline{H}(S\overset{(\frac{1}{2},\frac{1}{2})}{\sqcup}% \overline{S})=over¯ start_ARG italic_H end_ARG ( italic_S start_OVERACCENT ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG , divide start_ARG 1 end_ARG start_ARG 2 end_ARG ) end_OVERACCENT start_ARG ⊔ end_ARG over¯ start_ARG italic_S end_ARG ) = 12H¯(SS¯),12¯𝐻𝑆¯𝑆\displaystyle\,\frac{1}{2}\overline{H}(S\wedge\overline{S}),divide start_ARG 1 end_ARG start_ARG 2 end_ARG over¯ start_ARG italic_H end_ARG ( italic_S ∧ over¯ start_ARG italic_S end_ARG ) , (79)
C0(SS¯)=subscript𝐶0square-union𝑆¯𝑆absent\displaystyle C_{0}(S\sqcup\overline{S})=italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_S ⊔ over¯ start_ARG italic_S end_ARG ) =  112C0(SS¯).112subscript𝐶0𝑆¯𝑆\displaystyle\,1-\frac{1}{2}C_{0}(S\wedge\overline{S}).1 - divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_S ∧ over¯ start_ARG italic_S end_ARG ) . (80)

We obtain (75) from Theorem 12 and Corollary 5; (76) and (77) come from Proposition 5; and (78) comes from Theorem 4.

Remark 4.

Alon has built in [40] infinite families of graphs that satisfy C0(GG)>log(2C0(G)+2C0(G))subscript𝐶0square-union𝐺superscript𝐺superscript2subscript𝐶0𝐺superscript2subscript𝐶0superscript𝐺C_{0}(G\sqcup G^{\prime})>\log(2^{C_{0}(G)}+2^{C_{0}(G^{\prime})})italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ⊔ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) > roman_log ( 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) end_POSTSUPERSCRIPT + 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ). Similar results as in Theorem 19 can be derived for these graphs, by using their respective capacity-achieving distributions.

V Conclusion

We have shown the equivalences of linearization properties between C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ), and H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG, as depicted in Fig. 7. We proved the equivalence between the suboptimality of separated zero-error coding on independent channels, and the suboptimality of separated compression of independent sources in the zero-error side-information setting, with same characteristic graph and capacity-achieving distribution.

We also state the following open questions:

  • -

    As pointed out in Lemma 11, for all capacity-achieving distribution of a product graph, the product of its marginals is also capacity-achieving. Are these marginals capacity-achieving for the respective graphs in the product, and conversely, if we consider the product of capacity-achieving distributions of graphs, is this distribution capacity-achieving for the product of graphs? In other words,

    𝒫(a𝒜Ga)a𝒜Δ(𝒳a)=?a𝒜𝒫(Ga).superscript𝒫subscript𝑎𝒜subscript𝐺𝑎subscripttensor-product𝑎𝒜Δsubscript𝒳𝑎?subscripttensor-product𝑎𝒜superscript𝒫subscript𝐺𝑎\displaystyle\mathcal{P}^{\star}\left(\bigwedge_{a\in\mathcal{A}}G_{a}\right)% \cap\bigotimes_{a\in\mathcal{A}}\Delta(\mathcal{X}_{a})\overset{?}{=}% \bigotimes_{a\in\mathcal{A}}\mathcal{P}^{\star}(G_{a}).caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) ∩ ⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT roman_Δ ( caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) over? start_ARG = end_ARG ⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) . (81)

    We gave a partial answer in Theorem 12, in the sense that inclusion holds when the linearization of the product holds.

  • -

    We have shown in Theorem 12 and Theorem 13 that the linearization property of C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT holds if and only if the linearization property of C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) holds, where PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT is any capacity-achieving distribution. Can we find graphs such that the linearization property of C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) holds when PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT is capacity-achieving, but does not hold for some PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT that is not capacity-achieving? A negative answer would imply that the linearization property of C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is equivalent to the linearization property of C(,PX)𝐶subscript𝑃𝑋C(\cdot,P_{X})italic_C ( ⋅ , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) and H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG for all PXsubscript𝑃𝑋P_{X}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT, similarly to perfect graphs.

  • -

    Finally, we have seen in Corollary 3 that H¯(GG5)¯𝐻𝐺subscript𝐺5\overline{H}\big{(}G\wedge G_{5}\big{)}over¯ start_ARG italic_H end_ARG ( italic_G ∧ italic_G start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ) with G𝐺Gitalic_G perfect is an example where the linearization property holds. Is the non-linearization property of H¯()\overline{H}(\wedge\>\cdot)over¯ start_ARG italic_H end_ARG ( ∧ ⋅ ) tied to specific non-perfect induced subgraphs in each graph in the product? And if so, can we find a minimal family of these graphs?

Appendix A Proof of Theorem 4

In order to prove Theorem 4, we will need Lemma 1, Lemma 2 and Lemma 6. The proof of Lemma 1 is a direct consequence of Lemma 3, which is proved in App. A-A. Lemma 2 gives regularity properties of PAH¯(a𝒜PAGa)maps-tosubscript𝑃𝐴¯𝐻superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎P_{A}\mapsto\overline{H}\big{(}\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}\big{)}italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ↦ over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) and is proved in App. A-B. Lemma 6 states that if a convex function γ𝛾\gammaitalic_γ of Δ(𝒜)Δ𝒜\Delta(\mathcal{A})roman_Δ ( caligraphic_A ) meets the linear interpolation of (γ(𝟙a))a𝒜subscript𝛾subscript1𝑎𝑎𝒜(\gamma(\mathds{1}_{a}))_{a\in\mathcal{A}}( italic_γ ( blackboard_1 start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT, where (𝟙a)a𝒜subscriptsubscript1𝑎𝑎𝒜(\mathds{1}_{a})_{a\in\mathcal{A}}( blackboard_1 start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT are the extreme points of Δ(𝒜)Δ𝒜\Delta(\mathcal{A})roman_Δ ( caligraphic_A ), then γ𝛾\gammaitalic_γ is linear. The proof of Lemma 6 is given in App. A-C.

Lemma 6.

Let 𝒜𝒜\mathcal{A}caligraphic_A be a finite set, and γ:Δ(𝒜):𝛾Δ𝒜\gamma:\Delta(\mathcal{A})\rightarrow\mathbb{R}italic_γ : roman_Δ ( caligraphic_A ) → blackboard_R be a convex function, and for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A, let 𝟙asubscript1𝑎\mathds{1}_{a}blackboard_1 start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT be the distribution that assigns 1 to the symbol a𝑎aitalic_a and 0 to the others. Then the following holds:

PAint(Δ(𝒜)),γ(PA)=a𝒜PA(a)γ(𝟙a)formulae-sequencesubscript𝑃𝐴intΔ𝒜𝛾subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎𝛾subscript1𝑎\displaystyle\exists P_{A}\in\operatorname*{int}(\Delta(\mathcal{A})),\,\gamma% (P_{A})=\sum_{a\in\mathcal{A}}P_{A}(a)\gamma(\mathds{1}_{a})∃ italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_int ( roman_Δ ( caligraphic_A ) ) , italic_γ ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_γ ( blackboard_1 start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) (82)
\displaystyle\Longleftrightarrow\;\; PAΔ(𝒜),γ(PA)=a𝒜PA(a)γ(𝟙a)formulae-sequencefor-allsubscript𝑃𝐴Δ𝒜𝛾subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎𝛾subscript1𝑎\displaystyle\forall P_{A}\in\Delta(\mathcal{A}),\,\gamma(P_{A})=\sum_{a\in% \mathcal{A}}P_{A}(a)\gamma(\mathds{1}_{a})∀ italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ) , italic_γ ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_γ ( blackboard_1 start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) (83)

where int(Δ(𝒜))intΔ𝒜\operatorname*{int}(\Delta(\mathcal{A}))roman_int ( roman_Δ ( caligraphic_A ) ) is the interior of Δ(𝒜)Δ𝒜\Delta(\mathcal{A})roman_Δ ( caligraphic_A ) (i.e. the full-support distributions on 𝒜𝒜\mathcal{A}caligraphic_A).

Now let us prove Theorem 4:

()(\Longrightarrow)( ⟹ ) Assume that H¯(a𝒜Ga)=a𝒜H¯(Ga)¯𝐻subscript𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜¯𝐻subscript𝐺𝑎\overline{H}\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}\right)=\textstyle% \sum_{a\in\mathcal{A}}\overline{H}(G_{a})over¯ start_ARG italic_H end_ARG ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ).

We can use Lemma 1, which states that H¯(a𝒜Unif(𝒜)Ga)=1|𝒜|H¯(a𝒜Ga)¯𝐻superscriptsubscriptsquare-union𝑎𝒜Unif𝒜subscript𝐺𝑎1𝒜¯𝐻subscript𝑎𝒜subscript𝐺𝑎\overline{H}\big{(}\bigsqcup_{a\in\mathcal{A}}^{\operatorname*{Unif}(\mathcal{% A})}G_{a}\big{)}=\frac{1}{|\mathcal{A}|}\overline{H}\left(\textstyle\bigwedge_% {a\in\mathcal{A}}G_{a}\right)over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_Unif ( caligraphic_A ) end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = divide start_ARG 1 end_ARG start_ARG | caligraphic_A | end_ARG over¯ start_ARG italic_H end_ARG ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ), hence H¯(a𝒜Unif(𝒜)Ga)=a𝒜1|𝒜|H¯(Ga)¯𝐻superscriptsubscriptsquare-union𝑎𝒜Unif𝒜subscript𝐺𝑎subscript𝑎𝒜1𝒜¯𝐻subscript𝐺𝑎\overline{H}\big{(}\bigsqcup_{a\in\mathcal{A}}^{\operatorname*{Unif}(\mathcal{% A})}G_{a}\big{)}=\sum_{a\in\mathcal{A}}\frac{1}{|\mathcal{A}|}\overline{H}(G_{% a})over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_Unif ( caligraphic_A ) end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG | caligraphic_A | end_ARG over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ). Thus, the function η:PAH¯(a𝒜PAGa):𝜂maps-tosubscript𝑃𝐴¯𝐻superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎\eta:P_{A}\mapsto\overline{H}\big{(}\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}% \big{)}italic_η : italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ↦ over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) satisfies (82) with the interior point PA=Unif(𝒜)subscript𝑃𝐴Unif𝒜P_{A}=\operatorname*{Unif}(\mathcal{A})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT = roman_Unif ( caligraphic_A ), and is convex by Lemma 2: by Lemma 6 we have

PAΔ(𝒜),H¯(a𝒜PAGa)=a𝒜PA(a)H¯(Ga).formulae-sequencefor-allsubscript𝑃𝐴Δ𝒜¯𝐻superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎¯𝐻subscript𝐺𝑎\displaystyle\forall P_{A}\in\Delta(\mathcal{A}),\;\overline{H}\big{(}% \textstyle\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}\big{)}=\textstyle\sum_{a\in% \mathcal{A}}P_{A}(a)\overline{H}(G_{a}).∀ italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ) , over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) . (84)

()(\Longleftarrow)( ⟸ ) Conversely, assume (84), then η:PAH¯(a𝒜PAGa):𝜂maps-tosubscript𝑃𝐴¯𝐻superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎\eta:P_{A}\mapsto\overline{H}\big{(}\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}% \big{)}italic_η : italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ↦ over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) is linear. We can use Lemma 1, and we have H¯(a𝒜Ga)=|𝒜|H¯(a𝒜Unif(𝒜)Ga)=a𝒜H¯(Ga)¯𝐻subscript𝑎𝒜subscript𝐺𝑎𝒜¯𝐻superscriptsubscriptsquare-union𝑎𝒜Unif𝒜subscript𝐺𝑎subscript𝑎𝒜¯𝐻subscript𝐺𝑎\overline{H}\big{(}\bigwedge_{a\in\mathcal{A}}G_{a}\big{)}=|\mathcal{A}|% \overline{H}\big{(}\bigsqcup_{a\in\mathcal{A}}^{\operatorname*{Unif}(\mathcal{% A})}G_{a}\big{)}=\sum_{a\in\mathcal{A}}\overline{H}(G_{a})over¯ start_ARG italic_H end_ARG ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = | caligraphic_A | over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_Unif ( caligraphic_A ) end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ).

A-A Proof of Lemma 3

Lemma 3 is the consequence of a more general result, which consider probability distributions PAΔ(𝒜)subscript𝑃𝐴Δ𝒜P_{A}\in\Delta(\mathcal{A})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ) instead of types PAΔk(𝒜)subscript𝑃𝐴subscriptΔ𝑘𝒜P_{A}\in\Delta_{k}(\mathcal{A})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( caligraphic_A ), for some k𝑘superscriptk\in\mathbb{N}^{\star}italic_k ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT.

Lemma 7.

Let PAΔ(𝒜)subscript𝑃𝐴Δ𝒜P_{A}\in\Delta(\mathcal{A})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ) a probability distribution with full-support and let (a¯n)n𝒜subscriptsubscript¯𝑎𝑛𝑛superscriptsuperscript𝒜superscript(\overline{a}_{n})_{n\in\mathbb{N}^{\star}}\in\mathcal{A}^{\mathbb{N}^{\star}}( over¯ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_A start_POSTSUPERSCRIPT blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT be any sequence such that its type Ta¯nPAsubscript𝑇superscript¯𝑎𝑛subscript𝑃𝐴T_{\overline{a}^{n}}\rightarrow P_{A}italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT → italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT when n𝑛n\rightarrow\inftyitalic_n → ∞. Then we have

H¯(a𝒜PAGa)=limn1nHχ(a𝒜GanTa¯n(a)).¯𝐻superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎subscript𝑛1𝑛subscript𝐻𝜒subscript𝑎𝒜superscriptsubscript𝐺𝑎𝑛subscript𝑇superscript¯𝑎𝑛𝑎\overline{H}\left(\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}\right)=\lim_{n% \rightarrow\infty}\frac{1}{n}H_{\chi}\left(\bigwedge_{a\in\mathcal{A}}G_{a}^{% \wedge nT_{\overline{a}^{n}}(a)}\right).over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ) . (85)

The proof of Lemma 7 is stated in App. A-A1. Now let us prove Lemma 3. Let (a¯n)nsubscriptsubscript¯𝑎𝑛𝑛superscript(\overline{a}_{n})_{n\in\mathbb{N}^{\star}}( over¯ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT be a k𝑘kitalic_k-periodic sequence such that Ta¯k=PAsubscript𝑇superscript¯𝑎𝑘subscript𝑃𝐴T_{\overline{a}^{k}}=P_{A}italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT, then Ta¯nk=Ta¯ksubscript𝑇superscript¯𝑎𝑛𝑘subscript𝑇superscript¯𝑎𝑘T_{\overline{a}^{nk}}=T_{\overline{a}^{k}}italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n italic_k end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_POSTSUBSCRIPT for all n𝑛superscriptn\in\mathbb{N}^{\star}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT, and Ta¯nnPAsubscript𝑇superscript¯𝑎𝑛𝑛subscript𝑃𝐴T_{\overline{a}^{n}}\underset{n\rightarrow\infty}{\rightarrow}P_{A}italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT. We can use Lemma 7 and consider every k𝑘kitalic_k-th term in the limit:

H¯(a𝒜PAGa)¯𝐻superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎\displaystyle\overline{H}\Big{(}\textstyle\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G% _{a}\Big{)}over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) =limn1knHχ(a𝒜GaknTa¯kn(a))absentsubscript𝑛1𝑘𝑛subscript𝐻𝜒subscript𝑎𝒜superscriptsubscript𝐺𝑎𝑘𝑛subscript𝑇superscript¯𝑎𝑘𝑛𝑎\displaystyle=\lim\limits_{n\rightarrow\infty}\frac{1}{kn}H_{\chi}\Big{(}% \textstyle\bigwedge_{a\in\mathcal{A}}G_{a}^{\wedge knT_{\overline{a}^{kn}}(a)}% \Big{)}= roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_k italic_n end_ARG italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_k italic_n italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_k italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ) (86)
=limn1knHχ((a𝒜GakTa¯k(a))n)absentsubscript𝑛1𝑘𝑛subscript𝐻𝜒superscriptsubscript𝑎𝒜superscriptsubscript𝐺𝑎𝑘subscript𝑇superscript¯𝑎𝑘𝑎𝑛\displaystyle=\lim\limits_{n\rightarrow\infty}\frac{1}{kn}H_{\chi}\Big{(}\Big{% (}\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}^{\wedge kT_{\overline{a}^{k}}(a)}% \Big{)}^{\wedge n}\Big{)}= roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_k italic_n end_ARG italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_k italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT ) (87)
=1kH¯(a𝒜GakPA(a)).absent1𝑘¯𝐻subscript𝑎𝒜superscriptsubscript𝐺𝑎𝑘subscript𝑃𝐴𝑎\displaystyle=\frac{1}{k}\overline{H}\Big{(}\textstyle\bigwedge_{a\in\mathcal{% A}}G_{a}^{\wedge kP_{A}(a)}\Big{)}.= divide start_ARG 1 end_ARG start_ARG italic_k end_ARG over¯ start_ARG italic_H end_ARG ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_k italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ) . (88)

A-A1 Proof of Lemma 7

We need several lemmas for this result. Lemma 8 establishes the distributivity of \wedge with respect to square-union\sqcup for probabilistic graphs, similarly as in [41] for graphs without underlying distribution. Lemma 9 states that H¯¯𝐻\overline{H}over¯ start_ARG italic_H end_ARG can be computed with subgraphs induced by sets that have an asymptotic probability one, in particular we will use it with typical sets of vertices. Lemma 10 gives the chromatic entropy of a disjoint union of isomorphic probabilistic graphs. The proofs of Lemma 8, Lemma 9 and Lemma 10 are respectively given in App. A-A2, App. A-A3, and Appendix A-A4.

Lemma 8.

Let 𝒜,𝒜\mathcal{A},\mathcal{B}caligraphic_A , caligraphic_B be finite sets, let PAΔ(𝒜)subscript𝑃𝐴Δ𝒜P_{A}\in\Delta(\mathcal{A})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ) and PBΔ()subscript𝑃𝐵ΔP_{B}\in\Delta(\mathcal{B})italic_P start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_B ). For all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A and b𝑏b\in\mathcal{B}italic_b ∈ caligraphic_B, let Ga=(𝒳a,a,PXa)subscript𝐺𝑎subscript𝒳𝑎subscript𝑎subscript𝑃subscript𝑋𝑎G_{a}=(\mathcal{X}_{a},\mathcal{E}_{a},P_{X_{a}})italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT = ( caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , caligraphic_E start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) and Gb=(𝒳b,b,PXb)subscript𝐺𝑏subscript𝒳𝑏subscript𝑏subscript𝑃subscript𝑋𝑏G_{b}=(\mathcal{X}_{b},\mathcal{E}_{b},P_{X_{b}})italic_G start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT = ( caligraphic_X start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT , caligraphic_E start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) be probabilistic graphs. Then

(a𝒜PAGa)(bPBGb)=(a,b)𝒜×PAPBGaGb.superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎superscriptsubscriptsquare-union𝑏subscript𝑃𝐵subscript𝐺𝑏superscriptsubscriptsquare-union𝑎𝑏𝒜subscript𝑃𝐴subscript𝑃𝐵subscript𝐺𝑎subscript𝐺𝑏\displaystyle\left(\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}\right)\wedge\left(% \bigsqcup_{b\in\mathcal{B}}^{P_{B}}G_{b}\right)=\bigsqcup_{(a,b)\in\mathcal{A}% \times\mathcal{B}}^{P_{A}P_{B}}G_{a}\wedge G_{b}.( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) ∧ ( ⨆ start_POSTSUBSCRIPT italic_b ∈ caligraphic_B end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) = ⨆ start_POSTSUBSCRIPT ( italic_a , italic_b ) ∈ caligraphic_A × caligraphic_B end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ∧ italic_G start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT . (89)
Lemma 9.

Let G=(𝒳,,PX)𝐺𝒳subscript𝑃𝑋G=(\mathcal{X},\mathcal{E},P_{X})italic_G = ( caligraphic_X , caligraphic_E , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ), and (𝒮n)nsubscriptsubscript𝒮𝑛𝑛superscript(\mathcal{S}_{n})_{n\in\mathbb{N}^{\star}}( caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT be a sequence of sets such that for all n𝑛superscriptn\in\mathbb{N}^{\star}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT, 𝒮n𝒳nsubscript𝒮𝑛superscript𝒳𝑛\mathcal{S}_{n}\subseteq\mathcal{X}^{n}caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⊆ caligraphic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, and PXn(𝒮n)1subscriptsuperscript𝑃𝑛𝑋subscript𝒮𝑛1P^{n}_{X}(\mathcal{S}_{n})\rightarrow 1italic_P start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) → 1 when n𝑛n\rightarrow\inftyitalic_n → ∞. Then H¯(G)=limn1nHχ(Gn[𝒮n])¯𝐻𝐺subscript𝑛1𝑛subscript𝐻𝜒superscript𝐺𝑛delimited-[]subscript𝒮𝑛\overline{H}(G)=\lim_{n\rightarrow\infty}\frac{1}{n}H_{\chi}\big{(}G^{\wedge n% }[\mathcal{S}_{n}]\big{)}over¯ start_ARG italic_H end_ARG ( italic_G ) = roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT [ caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] ).

Definition 16 (Isomorphic probabilistic graphs).

Let G1=(𝒳1,1,PX1)subscript𝐺1subscript𝒳1subscript1subscript𝑃subscript𝑋1G_{1}=(\mathcal{X}_{1},\mathcal{E}_{1},P_{X_{1}})italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ( caligraphic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , caligraphic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) and G2=(𝒳2,2,PX2)subscript𝐺2subscript𝒳2subscript2subscript𝑃subscript𝑋2G_{2}=(\mathcal{X}_{2},\mathcal{E}_{2},P_{X_{2}})italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = ( caligraphic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , caligraphic_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) be two probabilistic graphs. We say that G1subscript𝐺1G_{1}italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is isomorphic to G2subscript𝐺2G_{2}italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT (denoted by G1G2similar-to-or-equalssubscript𝐺1subscript𝐺2G_{1}\simeq G_{2}italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≃ italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT) if there exists an isomorphism between them, i.e. a bijection ψ:𝒳1𝒳2:𝜓subscript𝒳1subscript𝒳2\psi:\mathcal{X}_{1}\rightarrow\mathcal{X}_{2}italic_ψ : caligraphic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT → caligraphic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT such that:

  • For all x1,x1𝒳1subscript𝑥1superscriptsubscript𝑥1subscript𝒳1x_{1},x_{1}^{\prime}\in\mathcal{X}_{1}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, x1x11ψ(x1)ψ(x1)2subscript𝑥1subscriptsuperscript𝑥1subscript1𝜓subscript𝑥1𝜓subscriptsuperscript𝑥1subscript2x_{1}x^{\prime}_{1}\in\mathcal{E}_{1}\Longleftrightarrow\psi(x_{1})\psi(x^{% \prime}_{1})\in\mathcal{E}_{2}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∈ caligraphic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ⟺ italic_ψ ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) italic_ψ ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ∈ caligraphic_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT,

  • For all x1𝒳1subscript𝑥1subscript𝒳1x_{1}\in\mathcal{X}_{1}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, PX1(x1)=PX2(ψ(x1))subscript𝑃subscript𝑋1subscript𝑥1subscript𝑃subscript𝑋2𝜓subscript𝑥1P_{X_{1}}(x_{1})=P_{X_{2}}\big{(}\psi(x_{1})\big{)}italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) = italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ψ ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ).

Lemma 10.

Let \mathcal{B}caligraphic_B be a finite set, let PBΔ()subscript𝑃𝐵ΔP_{B}\in\Delta(\mathcal{B})italic_P start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_B ) and let (Gb)bsubscriptsubscript𝐺𝑏𝑏(G_{b})_{b\in\mathcal{B}}( italic_G start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_b ∈ caligraphic_B end_POSTSUBSCRIPT be a family of isomorphic probabilistic graphs, then Hχ(bPBGb)=Hχ(Gb)subscript𝐻𝜒superscriptsubscriptsquare-unionsuperscript𝑏subscript𝑃𝐵subscript𝐺superscript𝑏subscript𝐻𝜒subscript𝐺𝑏H_{\chi}\big{(}\bigsqcup_{b^{\prime}\in\mathcal{B}}^{P_{B}}G_{b^{\prime}}\big{% )}=H_{\chi}(G_{b})italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_B end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) = italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) for all b𝑏b\in\mathcal{B}italic_b ∈ caligraphic_B.

Now let us prove Lemma 7. Let PAΔ(𝒜)subscript𝑃𝐴Δ𝒜P_{A}\in\Delta(\mathcal{A})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ), and let G=a𝒜PAGa𝐺subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎G=\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a}italic_G = ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT. Let (a¯n)n𝒜subscriptsubscript¯𝑎𝑛𝑛superscriptsuperscript𝒜superscript(\overline{a}_{n})_{n\in\mathbb{N}^{\star}}\in\mathcal{A}^{\mathbb{N}^{\star}}( over¯ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_A start_POSTSUPERSCRIPT blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT be a sequence such that Ta¯nPAsubscript𝑇superscript¯𝑎𝑛subscript𝑃𝐴T_{\overline{a}^{n}}\rightarrow P_{A}italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT → italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT when n𝑛n\rightarrow\inftyitalic_n → ∞.

Let ε>0𝜀0\varepsilon>0italic_ε > 0, and for all n𝑛superscriptn\in\mathbb{N}^{\star}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT let

𝒯εn(PA){an𝒜n|TanPAε},approaches-limitsubscriptsuperscript𝒯𝑛𝜀subscript𝑃𝐴conditional-setsuperscript𝑎𝑛superscript𝒜𝑛subscriptnormsubscript𝑇superscript𝑎𝑛subscript𝑃𝐴𝜀\displaystyle\mathcal{T}^{n}_{\varepsilon}(P_{A})\doteq\big{\{}a^{n}\in% \mathcal{A}^{n}\>\big{|}\>\|T_{a^{n}}-P_{A}\|_{\infty}\leq\varepsilon\big{\}},caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) ≐ { italic_a start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∈ caligraphic_A start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT | ∥ italic_T start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_ε } , (90)
PnPAnPAn(𝒯εn(PA)),𝒮n,εan𝒯εn(PA)tn𝒳at.formulae-sequenceapproaches-limitsuperscript𝑃𝑛subscriptsuperscript𝑃𝑛𝐴subscriptsuperscript𝑃𝑛𝐴subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝐴approaches-limitsubscript𝒮𝑛𝜀subscriptsquare-unionsuperscript𝑎𝑛subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝐴subscriptproduct𝑡𝑛subscript𝒳subscript𝑎𝑡\displaystyle P^{\prime n}\doteq\frac{P^{n}_{A}}{P^{n}_{A}(\mathcal{T}^{n}_{% \varepsilon}(P_{A}))},\qquad\mathcal{S}_{n,\varepsilon}\doteq\bigsqcup_{a^{n}% \in\mathcal{T}^{n}_{\varepsilon}(P_{A})}\;\prod_{t\leq n}\mathcal{X}_{a_{t}}.italic_P start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT ≐ divide start_ARG italic_P start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_ARG start_ARG italic_P start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) ) end_ARG , caligraphic_S start_POSTSUBSCRIPT italic_n , italic_ε end_POSTSUBSCRIPT ≐ ⨆ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∈ caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT .

Since PXn(𝒮n,ε)1subscriptsuperscript𝑃𝑛𝑋subscript𝒮𝑛𝜀1P^{n}_{X}(\mathcal{S}_{n,\varepsilon})\rightarrow 1italic_P start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_S start_POSTSUBSCRIPT italic_n , italic_ε end_POSTSUBSCRIPT ) → 1 when n𝑛n\rightarrow\inftyitalic_n → ∞, we have by Lemma 9

H¯(G)=limn1nHχ(Gn[𝒮n,ε]),¯𝐻𝐺subscript𝑛1𝑛subscript𝐻𝜒superscript𝐺𝑛delimited-[]subscript𝒮𝑛𝜀\displaystyle\overline{H}(G)=\lim_{n\rightarrow\infty}\frac{1}{n}H_{\chi}\Big{% (}G^{\wedge n}[\mathcal{S}_{n,\varepsilon}]\Big{)},over¯ start_ARG italic_H end_ARG ( italic_G ) = roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT [ caligraphic_S start_POSTSUBSCRIPT italic_n , italic_ε end_POSTSUBSCRIPT ] ) , (91)

Let us study the limit in (91). For all n𝑛nitalic_n large enough, a¯n𝒯εn(PA)superscript¯𝑎𝑛subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝐴\overline{a}^{n}\in\mathcal{T}^{n}_{\varepsilon}(P_{A})over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∈ caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) as Ta¯nPAsubscript𝑇superscript¯𝑎𝑛subscript𝑃𝐴T_{\overline{a}^{n}}\rightarrow P_{A}italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT → italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT. Therefore, for all an𝒯εn(PA)superscript𝑎𝑛subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝐴a^{n}\in\mathcal{T}^{n}_{\varepsilon}(P_{A})italic_a start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∈ caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ), a𝒜superscript𝑎𝒜a^{\prime}\in\mathcal{A}italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A, and n𝑛nitalic_n large enough, we have

|Ta¯n(a)Tan(a)|2ε.subscript𝑇superscript¯𝑎𝑛superscript𝑎subscript𝑇superscript𝑎𝑛superscript𝑎2𝜀\displaystyle\big{|}T_{\overline{a}^{n}}(a^{\prime})-T_{a^{n}}(a^{\prime})\big% {|}\leq 2\varepsilon.| italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - italic_T start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) | ≤ 2 italic_ε . (92)

We have on one hand

Hχ((a𝒜PAGa)n[𝒮n,ε])subscript𝐻𝜒superscriptsubscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎𝑛delimited-[]subscript𝒮𝑛𝜀\displaystyle H_{\chi}\Big{(}\big{(}\textstyle\bigsqcup^{P_{A}}_{a\in\mathcal{% A}}G_{a}\big{)}^{\wedge n}[\mathcal{S}_{n,\varepsilon}]\Big{)}italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT [ caligraphic_S start_POSTSUBSCRIPT italic_n , italic_ε end_POSTSUBSCRIPT ] )
=\displaystyle=\;= Hχ((an𝒜nPAntnGat)[𝒮n,ε])subscript𝐻𝜒superscriptsubscriptsquare-unionsuperscript𝑎𝑛superscript𝒜𝑛superscriptsubscript𝑃𝐴𝑛subscript𝑡𝑛subscript𝐺subscript𝑎𝑡delimited-[]subscript𝒮𝑛𝜀\displaystyle H_{\chi}\left(\left(\textstyle\bigsqcup_{a^{n}\in\mathcal{A}^{n}% }^{P_{A}^{n}}\;\textstyle\bigwedge_{t\leq n}G_{a_{t}}\right)[\mathcal{S}_{n,% \varepsilon}]\right)italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ( ⨆ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∈ caligraphic_A start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ⋀ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) [ caligraphic_S start_POSTSUBSCRIPT italic_n , italic_ε end_POSTSUBSCRIPT ] ) (93)
=\displaystyle=\;= Hχ(an𝒯εn(PA)PntnGat)subscript𝐻𝜒superscriptsubscriptsquare-unionsuperscript𝑎𝑛subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝐴superscript𝑃𝑛subscript𝑡𝑛subscript𝐺subscript𝑎𝑡\displaystyle H_{\chi}\left(\textstyle\bigsqcup_{a^{n}\in\mathcal{T}^{n}_{% \varepsilon}(P_{A})}^{P^{\prime n}}\;\textstyle\bigwedge_{t\leq n}G_{a_{t}}\right)italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∈ caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ⋀ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (94)
=\displaystyle=\;= Hχ(an𝒯εn(PA)Pna𝒜GanTan(a))subscript𝐻𝜒superscriptsubscriptsquare-unionsuperscript𝑎𝑛subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝐴superscript𝑃𝑛subscriptsuperscript𝑎𝒜superscriptsubscript𝐺superscript𝑎𝑛subscript𝑇superscript𝑎𝑛superscript𝑎\displaystyle H_{\chi}\left(\textstyle\bigsqcup_{a^{n}\in\mathcal{T}^{n}_{% \varepsilon}(P_{A})}^{P^{\prime n}}\;\textstyle\bigwedge_{a^{\prime}\in% \mathcal{A}}G_{a^{\prime}}^{\wedge nT_{a^{n}}(a^{\prime})}\right)italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∈ caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ⋀ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n italic_T start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ) (95)
\displaystyle\leq\; Hχ(an𝒯εn(PA)Pna𝒜GanTa¯n(a)+2nε)subscript𝐻𝜒superscriptsubscriptsquare-unionsuperscript𝑎𝑛subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝐴superscript𝑃𝑛subscriptsuperscript𝑎𝒜superscriptsubscript𝐺superscript𝑎𝑛subscript𝑇superscript¯𝑎𝑛superscript𝑎2𝑛𝜀\displaystyle H_{\chi}\left(\textstyle\bigsqcup_{a^{n}\in\mathcal{T}^{n}_{% \varepsilon}(P_{A})}^{P^{\prime n}}\;\textstyle\bigwedge_{a^{\prime}\in% \mathcal{A}}G_{a^{\prime}}^{\wedge nT_{\overline{a}^{n}}(a^{\prime})+\lceil 2n% \varepsilon\rceil}\right)italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∈ caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ⋀ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) + ⌈ 2 italic_n italic_ε ⌉ end_POSTSUPERSCRIPT ) (96)
=\displaystyle=\;= Hχ(a𝒜GanTa¯n(a)+2nε)subscript𝐻𝜒subscriptsuperscript𝑎𝒜superscriptsubscript𝐺superscript𝑎𝑛subscript𝑇superscript¯𝑎𝑛superscript𝑎2𝑛𝜀\displaystyle H_{\chi}\left(\textstyle\bigwedge_{a^{\prime}\in\mathcal{A}}G_{a% ^{\prime}}^{\wedge nT_{\overline{a}^{n}}(a^{\prime})+\lceil 2n\varepsilon% \rceil}\right)italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) + ⌈ 2 italic_n italic_ε ⌉ end_POSTSUPERSCRIPT ) (97)
\displaystyle\leq\; Hχ(a𝒜GanTa¯n(a))+Hχ(a𝒜Ga2nε)subscript𝐻𝜒subscriptsuperscript𝑎𝒜superscriptsubscript𝐺superscript𝑎𝑛subscript𝑇superscript¯𝑎𝑛superscript𝑎subscript𝐻𝜒subscriptsuperscript𝑎𝒜superscriptsubscript𝐺superscript𝑎2𝑛𝜀\displaystyle H_{\chi}\left(\textstyle\bigwedge_{a^{\prime}\in\mathcal{A}}G_{a% ^{\prime}}^{\wedge nT_{\overline{a}^{n}}(a^{\prime})}\right)+H_{\chi}\left(% \textstyle\bigwedge_{a^{\prime}\in\mathcal{A}}G_{a^{\prime}}^{\wedge\lceil 2n% \varepsilon\rceil}\right)italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ) + italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ ⌈ 2 italic_n italic_ε ⌉ end_POSTSUPERSCRIPT ) (98)
\displaystyle\leq\; Hχ(a𝒜GanTa¯n(a))+2nε|𝒜|log|𝒳|;subscript𝐻𝜒subscriptsuperscript𝑎𝒜superscriptsubscript𝐺superscript𝑎𝑛subscript𝑇superscript¯𝑎𝑛superscript𝑎2𝑛𝜀𝒜𝒳\displaystyle H_{\chi}\left(\textstyle\bigwedge_{a^{\prime}\in\mathcal{A}}G_{a% ^{\prime}}^{\wedge nT_{\overline{a}^{n}}(a^{\prime})}\right)+\lceil 2n% \varepsilon\rceil|\mathcal{A}|\log|\mathcal{X}|;italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ) + ⌈ 2 italic_n italic_ε ⌉ | caligraphic_A | roman_log | caligraphic_X | ; (99)

where (93) comes from Lemma 8; (94) comes from the definition of 𝒮n,εsubscript𝒮𝑛𝜀\mathcal{S}_{n,\varepsilon}caligraphic_S start_POSTSUBSCRIPT italic_n , italic_ε end_POSTSUBSCRIPT and Pnsuperscript𝑃𝑛P^{\prime n}italic_P start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT in (90); (95) is a rearrangement of the terms inside the product; (96) comes from (92); (97) follows from Lemma 10, the graphs (a𝒜GanTa¯n(a)+2nε)an𝒯εn(PA)subscriptsubscriptsuperscript𝑎𝒜superscriptsubscript𝐺superscript𝑎𝑛subscript𝑇superscript¯𝑎𝑛superscript𝑎2𝑛𝜀superscript𝑎𝑛subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝐴\big{(}\bigwedge_{a^{\prime}\in\mathcal{A}}G_{a^{\prime}}^{\wedge nT_{% \overline{a}^{n}}(a^{\prime})+\lceil 2n\varepsilon\rceil}\big{)}_{a^{n}\in% \mathcal{T}^{n}_{\varepsilon}(P_{A})}( ⋀ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) + ⌈ 2 italic_n italic_ε ⌉ end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∈ caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT are isomorphic as they do not depend on ansuperscript𝑎𝑛a^{n}italic_a start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT; (98) follows from the subadditivity of Hχsubscript𝐻𝜒H_{\chi}italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT; and (99) is the upper bound on Hχsubscript𝐻𝜒H_{\chi}italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT given by the highest entropy of a coloring.

On the other hand, we obtain with similar arguments

Hχ((a𝒜PAGa)n[𝒮n,ε])subscript𝐻𝜒superscriptsubscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎𝑛delimited-[]subscript𝒮𝑛𝜀\displaystyle H_{\chi}\Big{(}\big{(}\textstyle\bigsqcup^{P_{A}}_{a\in\mathcal{% A}}G_{a}\big{)}^{\wedge n}[\mathcal{S}_{n,\varepsilon}]\Big{)}italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT [ caligraphic_S start_POSTSUBSCRIPT italic_n , italic_ε end_POSTSUBSCRIPT ] )
\displaystyle\geq\> Hχ(a𝒜GanTa¯n(a)2nε)subscript𝐻𝜒subscriptsuperscript𝑎𝒜superscriptsubscript𝐺superscript𝑎𝑛subscript𝑇superscript¯𝑎𝑛superscript𝑎2𝑛𝜀\displaystyle H_{\chi}\left(\textstyle\bigwedge_{a^{\prime}\in\mathcal{A}}G_{a% ^{\prime}}^{\wedge nT_{\overline{a}^{n}}(a^{\prime})-\lceil 2n\varepsilon% \rceil}\right)italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - ⌈ 2 italic_n italic_ε ⌉ end_POSTSUPERSCRIPT ) (100)
\displaystyle\geq\> Hχ(a𝒜GanTa¯n(a))Hχ(a𝒜Ga2nε),subscript𝐻𝜒subscriptsuperscript𝑎𝒜superscriptsubscript𝐺superscript𝑎𝑛subscript𝑇superscript¯𝑎𝑛superscript𝑎subscript𝐻𝜒subscriptsuperscript𝑎𝒜superscriptsubscript𝐺superscript𝑎2𝑛𝜀\displaystyle H_{\chi}\left(\textstyle\bigwedge_{a^{\prime}\in\mathcal{A}}G_{a% ^{\prime}}^{\wedge nT_{\overline{a}^{n}}(a^{\prime})}\right)-H_{\chi}\left(% \textstyle\bigwedge_{a^{\prime}\in\mathcal{A}}G_{a^{\prime}}^{\wedge\lceil 2n% \varepsilon\rceil}\right),italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ) - italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ ⌈ 2 italic_n italic_ε ⌉ end_POSTSUPERSCRIPT ) , (101)
\displaystyle\geq\> Hχ(a𝒜GanTa¯n(a))2nε|𝒜|log|𝒳|.subscript𝐻𝜒subscriptsuperscript𝑎𝒜superscriptsubscript𝐺superscript𝑎𝑛subscript𝑇superscript¯𝑎𝑛superscript𝑎2𝑛𝜀𝒜𝒳\displaystyle H_{\chi}\left(\textstyle\bigwedge_{a^{\prime}\in\mathcal{A}}G_{a% ^{\prime}}^{\wedge nT_{\overline{a}^{n}}(a^{\prime})}\right)-\lceil 2n% \varepsilon\rceil|\mathcal{A}|\log|\mathcal{X}|.italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ) - ⌈ 2 italic_n italic_ε ⌉ | caligraphic_A | roman_log | caligraphic_X | . (102)

Note that (101) also comes from the subadditivity of Hχsubscript𝐻𝜒H_{\chi}italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT, as Hχ(G2)Hχ(G1G2)Hχ(G1)subscript𝐻𝜒subscript𝐺2subscript𝐻𝜒subscript𝐺1subscript𝐺2subscript𝐻𝜒subscript𝐺1H_{\chi}(G_{2})\geq H_{\chi}(G_{1}\wedge G_{2})-H_{\chi}(G_{1})italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ≥ italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∧ italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) for all G1,G2subscript𝐺1subscript𝐺2G_{1},G_{2}italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT.

By combining (99) and (102) we obtain

|limn1nHχ(Gn[𝒮n,ε])limn1nHχ(a𝒜GanTa¯n(a))|subscript𝑛1𝑛subscript𝐻𝜒superscript𝐺𝑛delimited-[]subscript𝒮𝑛𝜀subscript𝑛1𝑛subscript𝐻𝜒subscriptsuperscript𝑎𝒜superscriptsubscript𝐺superscript𝑎𝑛subscript𝑇superscript¯𝑎𝑛superscript𝑎\displaystyle\left|\lim_{n\rightarrow\infty}\frac{1}{n}H_{\chi}(G^{\wedge n}[% \mathcal{S}_{n,\varepsilon}])-\lim_{n\rightarrow\infty}\frac{1}{n}H_{\chi}% \left(\textstyle\bigwedge_{a^{\prime}\in\mathcal{A}}G_{a^{\prime}}^{\wedge nT_% {\overline{a}^{n}}(a^{\prime})}\right)\right|| roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT [ caligraphic_S start_POSTSUBSCRIPT italic_n , italic_ε end_POSTSUBSCRIPT ] ) - roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ) |
2ε|𝒜|log|𝒳|.absent2𝜀𝒜𝒳\displaystyle\leq 2\varepsilon|\mathcal{A}|\log|\mathcal{X}|.≤ 2 italic_ε | caligraphic_A | roman_log | caligraphic_X | . (103)

As this holds for all ε>0𝜀0\varepsilon>0italic_ε > 0, combining (91) and (103) yields the desired result.

A-A2 Proof of Lemma 8

The probabilistic graphs in both sides of (89) have

(a𝒜𝒳a)×(b𝒳b)=(a,b)𝒜×𝒳a×𝒳bsubscriptsquare-union𝑎𝒜subscript𝒳𝑎subscriptsquare-union𝑏subscript𝒳𝑏subscriptsquare-union𝑎𝑏𝒜subscript𝒳𝑎subscript𝒳𝑏\displaystyle\left(\textstyle\bigsqcup_{a\in\mathcal{A}}\mathcal{X}_{a}\right)% \times\left(\textstyle\bigsqcup_{b\in\mathcal{B}}\mathcal{X}_{b}\right)=% \textstyle\bigsqcup_{(a,b)\in\mathcal{A}\times\mathcal{B}}\mathcal{X}_{a}% \times\mathcal{X}_{b}( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) × ( ⨆ start_POSTSUBSCRIPT italic_b ∈ caligraphic_B end_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) = ⨆ start_POSTSUBSCRIPT ( italic_a , italic_b ) ∈ caligraphic_A × caligraphic_B end_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT × caligraphic_X start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT (104)

as set of vertices, with underlying distribution

(a𝒜PA(a)PXa)(bPB(b)PXb)subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎subscript𝑏subscript𝑃𝐵𝑏subscript𝑃subscript𝑋𝑏\displaystyle\left(\textstyle\sum_{a\in\mathcal{A}}P_{A}(a)P_{X_{a}}\right)% \left(\textstyle\sum_{b\in\mathcal{B}}P_{B}(b)P_{X_{b}}\right)( ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ( ∑ start_POSTSUBSCRIPT italic_b ∈ caligraphic_B end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT ( italic_b ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT end_POSTSUBSCRIPT )
=(a,b)𝒜×PA(a)PB(b)PXaPXb.absentsubscript𝑎𝑏𝒜subscript𝑃𝐴𝑎subscript𝑃𝐵𝑏subscript𝑃subscript𝑋𝑎subscript𝑃subscript𝑋𝑏\displaystyle=\textstyle\sum_{(a,b)\in\mathcal{A}\times\mathcal{B}}P_{A}(a)P_{% B}(b)P_{X_{a}}P_{X_{b}}.= ∑ start_POSTSUBSCRIPT ( italic_a , italic_b ) ∈ caligraphic_A × caligraphic_B end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT ( italic_b ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (105)

Now let us show that these two graphs have the same edges. Let (x𝒜,x),(x𝒜,x)(a𝒜𝒳a)×(b𝒳b)subscript𝑥𝒜subscript𝑥subscriptsuperscript𝑥𝒜subscriptsuperscript𝑥subscriptsquare-union𝑎𝒜subscript𝒳𝑎subscriptsquare-union𝑏subscript𝒳𝑏(x_{\mathcal{A}},x_{\mathcal{B}}),(x^{\prime}_{\mathcal{A}},x^{\prime}_{% \mathcal{B}})\in\left(\bigsqcup_{a\in\mathcal{A}}\mathcal{X}_{a}\right)\times% \left(\bigsqcup_{b\in\mathcal{B}}\mathcal{X}_{b}\right)( italic_x start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT caligraphic_B end_POSTSUBSCRIPT ) , ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_B end_POSTSUBSCRIPT ) ∈ ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) × ( ⨆ start_POSTSUBSCRIPT italic_b ∈ caligraphic_B end_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ), let a,a𝒜subscript𝑎subscriptsuperscript𝑎𝒜a_{*},a^{\prime}_{*}\in\mathcal{A}italic_a start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT ∈ caligraphic_A and b,bsubscript𝑏subscriptsuperscript𝑏b_{*},b^{\prime}_{*}\in\mathcal{B}italic_b start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT , italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT ∈ caligraphic_B be the unique indexes such that

(x𝒜,x)𝒳a×𝒳band(x𝒜,x)𝒳a×𝒳b.formulae-sequencesubscript𝑥𝒜subscript𝑥subscript𝒳subscript𝑎subscript𝒳subscript𝑏andsubscriptsuperscript𝑥𝒜subscriptsuperscript𝑥subscript𝒳subscriptsuperscript𝑎subscript𝒳subscriptsuperscript𝑏\displaystyle(x_{\mathcal{A}},x_{\mathcal{B}})\in\mathcal{X}_{a_{*}}\times% \mathcal{X}_{b_{*}}\quad\text{and}\quad(x^{\prime}_{\mathcal{A}},x^{\prime}_{% \mathcal{B}})\in\mathcal{X}_{a^{\prime}_{*}}\times\mathcal{X}_{b^{\prime}_{*}}.( italic_x start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT caligraphic_B end_POSTSUBSCRIPT ) ∈ caligraphic_X start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT end_POSTSUBSCRIPT × caligraphic_X start_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT end_POSTSUBSCRIPT and ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_B end_POSTSUBSCRIPT ) ∈ caligraphic_X start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT end_POSTSUBSCRIPT × caligraphic_X start_POSTSUBSCRIPT italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (106)

We have:

(x𝒜,x),(x𝒜,x) are adjacent in (a𝒜PAGa)(bPBGb)subscript𝑥𝒜subscript𝑥subscriptsuperscript𝑥𝒜subscriptsuperscript𝑥 are adjacent in superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎superscriptsubscriptsquare-union𝑏subscript𝑃𝐵subscript𝐺𝑏\displaystyle\!\!\!(x_{\mathcal{A}},x_{\mathcal{B}}),(x^{\prime}_{\mathcal{A}}% ,x^{\prime}_{\mathcal{B}})\text{ are adjacent in }\left(\textstyle\bigsqcup_{a% \in\mathcal{A}}^{P_{A}}G_{a}\right)\wedge\left(\textstyle\bigsqcup_{b\in% \mathcal{B}}^{P_{B}}G_{b}\right)( italic_x start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT caligraphic_B end_POSTSUBSCRIPT ) , ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_B end_POSTSUBSCRIPT ) are adjacent in ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) ∧ ( ⨆ start_POSTSUBSCRIPT italic_b ∈ caligraphic_B end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) (107)
\displaystyle\Longleftrightarrow x𝒜,x𝒜 adjacent in a𝒜PAGa and x,x adjacent in bPBGbsubscript𝑥𝒜subscriptsuperscript𝑥𝒜 adjacent in superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎 and subscript𝑥subscriptsuperscript𝑥 adjacent in superscriptsubscriptsquare-union𝑏subscript𝑃𝐵subscript𝐺𝑏\displaystyle\;x_{\mathcal{A}},x^{\prime}_{\mathcal{A}}\text{ adjacent in }% \textstyle\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}\text{ and }x_{\mathcal{B}},% x^{\prime}_{\mathcal{B}}\text{ adjacent in }\textstyle\bigsqcup_{b\in\mathcal{% B}}^{P_{B}}G_{b}italic_x start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT adjacent in ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT and italic_x start_POSTSUBSCRIPT caligraphic_B end_POSTSUBSCRIPT , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_B end_POSTSUBSCRIPT adjacent in ⨆ start_POSTSUBSCRIPT italic_b ∈ caligraphic_B end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT (108)
\displaystyle\Longleftrightarrow a=a and x𝒜x𝒜a and b=b and xxbsubscript𝑎subscriptsuperscript𝑎 and subscript𝑥𝒜subscriptsuperscript𝑥𝒜subscriptsubscript𝑎 and subscript𝑏subscriptsuperscript𝑏 and subscript𝑥subscriptsuperscript𝑥subscriptsubscript𝑏\displaystyle\;a_{*}=a^{\prime}_{*}\text{ and }x_{\mathcal{A}}x^{\prime}_{% \mathcal{A}}\in\mathcal{E}_{a_{*}}\text{ and }b_{*}=b^{\prime}_{*}\text{ and }% x_{\mathcal{B}}x^{\prime}_{\mathcal{B}}\in\mathcal{E}_{b_{*}}italic_a start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT = italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT and italic_x start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT ∈ caligraphic_E start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT end_POSTSUBSCRIPT and italic_b start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT = italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT and italic_x start_POSTSUBSCRIPT caligraphic_B end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_B end_POSTSUBSCRIPT ∈ caligraphic_E start_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT end_POSTSUBSCRIPT (109)
\displaystyle\Longleftrightarrow (a,b)=(a,b) and (x𝒜,x),(x𝒜,x) are adjacent in GaGbsubscript𝑎subscript𝑏subscriptsuperscript𝑎subscriptsuperscript𝑏 and subscript𝑥𝒜subscript𝑥subscriptsuperscript𝑥𝒜subscriptsuperscript𝑥 are adjacent in subscript𝐺subscript𝑎subscript𝐺subscript𝑏\displaystyle\;(a_{*},b_{*})=(a^{\prime}_{*},b^{\prime}_{*})\text{ and }(x_{% \mathcal{A}},x_{\mathcal{B}}),(x^{\prime}_{\mathcal{A}},x^{\prime}_{\mathcal{B% }})\text{ are adjacent in }G_{a_{*}}\wedge G_{b_{*}}( italic_a start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT ) = ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT , italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT ) and ( italic_x start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT caligraphic_B end_POSTSUBSCRIPT ) , ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_B end_POSTSUBSCRIPT ) are adjacent in italic_G start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∧ italic_G start_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT end_POSTSUBSCRIPT (110)
\displaystyle\Longleftrightarrow (x𝒜,x),(x𝒜,x) are adjacent in (a,b)𝒜×PAPBGaGb.subscript𝑥𝒜subscript𝑥subscriptsuperscript𝑥𝒜subscriptsuperscript𝑥 are adjacent in superscriptsubscriptsquare-union𝑎𝑏𝒜subscript𝑃𝐴subscript𝑃𝐵subscript𝐺𝑎subscript𝐺𝑏\displaystyle\;(x_{\mathcal{A}},x_{\mathcal{B}}),(x^{\prime}_{\mathcal{A}},x^{% \prime}_{\mathcal{B}})\text{ are adjacent in }\textstyle\bigsqcup_{(a,b)\in% \mathcal{A}\times\mathcal{B}}^{P_{A}P_{B}}G_{a}\wedge G_{b}.( italic_x start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT caligraphic_B end_POSTSUBSCRIPT ) , ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_B end_POSTSUBSCRIPT ) are adjacent in ⨆ start_POSTSUBSCRIPT ( italic_a , italic_b ) ∈ caligraphic_A × caligraphic_B end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ∧ italic_G start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT . (111)

A-A3 Proof of Lemma 9

In order to prove Lemma 9, we need Lemma 11. In Lemma 11 we give upper and lower bounds on the chromatic entropy of an induced subgraph G[𝒮]𝐺delimited-[]𝒮G[\mathcal{S}]italic_G [ caligraphic_S ], using the chromatic entropy of the whole graph G𝐺Gitalic_G and the probability PX(𝒮)subscript𝑃𝑋𝒮P_{X}(\mathcal{S})italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_S ). The core idea is that if PX(𝒮)subscript𝑃𝑋𝒮P_{X}(\mathcal{S})italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_S ) is close to 1111 and Hχ(G)subscript𝐻𝜒𝐺H_{\chi}(G)italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G ) is big, then Hχ(G[𝒮])subscript𝐻𝜒𝐺delimited-[]𝒮H_{\chi}(G[\mathcal{S}])italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G [ caligraphic_S ] ) is close to Hχ(G)subscript𝐻𝜒𝐺H_{\chi}(G)italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G ). The proof of Lemma 11 is given in App. A-A5

Lemma 11.

Let G=(𝒳,,PX)𝐺𝒳subscript𝑃𝑋G=(\mathcal{X},\mathcal{E},P_{X})italic_G = ( caligraphic_X , caligraphic_E , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) and 𝒮𝒳𝒮𝒳\mathcal{S}\subseteq\mathcal{X}caligraphic_S ⊆ caligraphic_X, then

Hχ(G)1(1PX(𝒮))log|𝒳|Hχ(G[𝒮])Hχ(G)PX(𝒮).subscript𝐻𝜒𝐺11subscript𝑃𝑋𝒮𝒳subscript𝐻𝜒𝐺delimited-[]𝒮subscript𝐻𝜒𝐺subscript𝑃𝑋𝒮\displaystyle H_{\chi}(G)-1-(1-P_{X}(\mathcal{S}))\log|\mathcal{X}|\leq H_{% \chi}(G[\mathcal{S}])\leq\frac{H_{\chi}(G)}{P_{X}(\mathcal{S})}.italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G ) - 1 - ( 1 - italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_S ) ) roman_log | caligraphic_X | ≤ italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G [ caligraphic_S ] ) ≤ divide start_ARG italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_S ) end_ARG . (112)
Remark 5.

Hχ(G[𝒮])subscript𝐻𝜒𝐺delimited-[]𝒮H_{\chi}(G[\mathcal{S}])italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G [ caligraphic_S ] ) can be greater than Hχ(G)subscript𝐻𝜒𝐺H_{\chi}(G)italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G ), even if G[𝒮]𝐺delimited-[]𝒮G[\mathcal{S}]italic_G [ caligraphic_S ] has less vertices and inherits the structure of G𝐺Gitalic_G. This stems from the normalized distribution PX/PX(𝒮)subscript𝑃𝑋subscript𝑃𝑋𝒮P_{X}/P_{X}(\mathcal{S})italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT / italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_S ) on the vertices of G[𝒮]𝐺delimited-[]𝒮G[\mathcal{S}]italic_G [ caligraphic_S ] which gives more weight to the vertices in 𝒮𝒮\mathcal{S}caligraphic_S. For example, consider

G=(N5,Unif({1,,5}))(1ε,ε)(K5,Unif({1,,5}));𝐺subscript𝑁5Unif151𝜀𝜀square-unionsubscript𝐾5Unif15\displaystyle G=\big{(}N_{5},\operatorname*{Unif}(\{1,...,5\})\big{)}\overset{% (1-\varepsilon,\varepsilon)}{\sqcup}\big{(}K_{5},\operatorname*{Unif}(\{1,...,% 5\})\big{)};italic_G = ( italic_N start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT , roman_Unif ( { 1 , … , 5 } ) ) start_OVERACCENT ( 1 - italic_ε , italic_ε ) end_OVERACCENT start_ARG ⊔ end_ARG ( italic_K start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT , roman_Unif ( { 1 , … , 5 } ) ) ;

where Knsubscript𝐾𝑛K_{n}italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT (resp. Nnsubscript𝑁𝑛N_{n}italic_N start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT) is the complete (resp. empty) graph with n𝑛nitalic_n vertices, i.e. there is an edge (resp. no edge) between any pair of distinct vertices, and with 𝒮𝒮\mathcal{S}caligraphic_S being the vertices in the connected component K5subscript𝐾5K_{5}italic_K start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT in G𝐺Gitalic_G. Then Hχ(G)=εlog5subscript𝐻𝜒𝐺𝜀5H_{\chi}(G)=\varepsilon\log 5italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G ) = italic_ε roman_log 5 and Hχ(G[𝒮])=log5subscript𝐻𝜒𝐺delimited-[]𝒮5H_{\chi}(G[\mathcal{S}])=\log 5italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G [ caligraphic_S ] ) = roman_log 5.

Now let us prove Lemma 9. By Lemma 11, we have for all n𝑛superscriptn\in\mathbb{N}^{\star}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT:

Hχ(Gn)1(1PXn(𝒮n))log|𝒳|subscript𝐻𝜒superscript𝐺𝑛11subscriptsuperscript𝑃𝑛𝑋subscript𝒮𝑛𝒳\displaystyle H_{\chi}(G^{\wedge n})-1-(1-P^{n}_{X}(\mathcal{S}_{n}))\log|% \mathcal{X}|italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT ) - 1 - ( 1 - italic_P start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ) roman_log | caligraphic_X |
\displaystyle\leq\> Hχ(Gn[𝒮n])Hχ(Gn)PXn(𝒮n).subscript𝐻𝜒superscript𝐺𝑛delimited-[]superscript𝒮𝑛subscript𝐻𝜒superscript𝐺𝑛subscriptsuperscript𝑃𝑛𝑋subscript𝒮𝑛\displaystyle H_{\chi}(G^{\wedge n}[\mathcal{S}^{n}])\leq\frac{H_{\chi}(G^{% \wedge n})}{P^{n}_{X}(\mathcal{S}_{n})}.italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT [ caligraphic_S start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ] ) ≤ divide start_ARG italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_P start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_ARG . (113)

Since PXn(𝒮n)1subscriptsuperscript𝑃𝑛𝑋subscript𝒮𝑛1P^{n}_{X}(\mathcal{S}_{n})\rightarrow 1italic_P start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) → 1, and Hχ(Gn)=nH¯(G)+o(n)subscript𝐻𝜒superscript𝐺𝑛𝑛¯𝐻𝐺𝑜𝑛H_{\chi}(G^{\wedge n})=n\overline{H}(G)+o(n)italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT ) = italic_n over¯ start_ARG italic_H end_ARG ( italic_G ) + italic_o ( italic_n ) when n𝑛n\rightarrow\inftyitalic_n → ∞, the desired results follows immediately by normalization and limit.

A-A4 Proof of Lemma 10

Let (G~i)iNsubscriptsubscript~𝐺𝑖𝑖𝑁(\tilde{G}_{i})_{i\leq N}( over~ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i ≤ italic_N end_POSTSUBSCRIPT be isomorphic probabilistic graphs and G𝐺Gitalic_G such that G=iG~i𝐺subscriptsquare-union𝑖subscript~𝐺𝑖G=\bigsqcup_{i}\tilde{G}_{i}italic_G = ⨆ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT over~ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Let c1:𝒳1𝒞:superscriptsubscript𝑐1subscript𝒳1𝒞c_{1}^{\star}:\mathcal{X}_{1}\rightarrow\mathcal{C}italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT : caligraphic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT → caligraphic_C be the coloring of G~1subscript~𝐺1\tilde{G}_{1}over~ start_ARG italic_G end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT with minimal entropy, and let csuperscript𝑐c^{\star}italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT be the coloring of G𝐺Gitalic_G defined by

c::superscript𝑐absent\displaystyle c^{\star}:\>italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT : 𝒳𝒞𝒳𝒞\displaystyle\mathcal{X}\rightarrow\mathcal{C}caligraphic_X → caligraphic_C (114)
vc1ψix1(x),maps-to𝑣superscriptsubscript𝑐1subscript𝜓subscript𝑖𝑥1𝑥\displaystyle v\mapsto c_{1}^{\star}\circ\psi_{i_{x}\rightarrow 1}(x),italic_v ↦ italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∘ italic_ψ start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT → 1 end_POSTSUBSCRIPT ( italic_x ) , (115)

where ixsubscript𝑖𝑥i_{x}italic_i start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT is the unique integer such that v𝒳ix𝑣subscript𝒳subscript𝑖𝑥v\in\mathcal{X}_{i_{x}}italic_v ∈ caligraphic_X start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_POSTSUBSCRIPT, and ψix1:𝒳ix𝒳1:subscript𝜓subscript𝑖𝑥1subscript𝒳subscript𝑖𝑥subscript𝒳1\psi_{i_{x}\rightarrow 1}:\mathcal{X}_{i_{x}}\rightarrow\mathcal{X}_{1}italic_ψ start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT → 1 end_POSTSUBSCRIPT : caligraphic_X start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_POSTSUBSCRIPT → caligraphic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is an isomorphism between G~ixsubscript~𝐺subscript𝑖𝑥\tilde{G}_{i_{x}}over~ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_POSTSUBSCRIPT and G~1subscript~𝐺1\tilde{G}_{1}over~ start_ARG italic_G end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. In other words csuperscript𝑐c^{\star}italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT applies the same coloring pattern c1subscriptsuperscript𝑐1c^{\star}_{1}italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT on each connected component of G𝐺Gitalic_G. We have

Hχ(G)subscript𝐻𝜒𝐺\displaystyle H_{\chi}(G)italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G ) H(c(X))absent𝐻superscript𝑐𝑋\displaystyle\leq H(c^{\star}(X))≤ italic_H ( italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_X ) ) (116)
=h(jNPiX(j)Pc(Xj))absentsubscript𝑗𝑁subscript𝑃subscript𝑖𝑋𝑗subscript𝑃superscript𝑐subscript𝑋𝑗\displaystyle=h\Big{(}\textstyle\sum_{j\leq N}P_{i_{X}}(j)P_{c^{\star}(X_{j})}% \Big{)}= italic_h ( ∑ start_POSTSUBSCRIPT italic_j ≤ italic_N end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_j ) italic_P start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ) (117)
=h(jNPiX(j)Pc1(X1))absentsubscript𝑗𝑁subscript𝑃subscript𝑖𝑋𝑗subscript𝑃superscriptsubscript𝑐1subscript𝑋1\displaystyle=h\Big{(}\textstyle\sum_{j\leq N}P_{i_{X}}(j)P_{c_{1}^{\star}(X_{% 1})}\Big{)}= italic_h ( ∑ start_POSTSUBSCRIPT italic_j ≤ italic_N end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_j ) italic_P start_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ) (118)
=H(c1(X1))absent𝐻subscriptsuperscript𝑐1subscript𝑋1\displaystyle=H(c^{\star}_{1}(X_{1}))= italic_H ( italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ) (119)
=Hχ(G~1),absentsubscript𝐻𝜒subscript~𝐺1\displaystyle=H_{\chi}(\tilde{G}_{1}),= italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( over~ start_ARG italic_G end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , (120)

where hhitalic_h denotes the entropy of a distribution; (118) comes from the definition of csuperscript𝑐c^{\star}italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT; and (120) comes from the definition of c1superscriptsubscript𝑐1c_{1}^{\star}italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT.

Now let us prove the upper bound on Hχ(G~1)subscript𝐻𝜒subscript~𝐺1H_{\chi}(\tilde{G}_{1})italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( over~ start_ARG italic_G end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ). Let c𝑐citalic_c be a coloring of G𝐺Gitalic_G, and let iargminiH(c(Xi))approaches-limitsuperscript𝑖subscript𝑖𝐻𝑐subscript𝑋𝑖i^{\star}\doteq\operatorname*{\arg\!\min}_{i}H(c(X_{i}))italic_i start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ≐ start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_H ( italic_c ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) (i.e. isuperscript𝑖i^{\star}italic_i start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT is the index of the connected component for which the entropy of the coloring induced by c𝑐citalic_c is minimal). We have

H(c(X))𝐻𝑐𝑋\displaystyle H(c(X))italic_H ( italic_c ( italic_X ) ) =h(jNPiX(j)Pc(Xj))absentsubscript𝑗𝑁subscript𝑃subscript𝑖𝑋𝑗subscript𝑃𝑐subscript𝑋𝑗\displaystyle=h\Big{(}\textstyle\sum_{j\leq N}P_{i_{X}}(j)P_{c(X_{j})}\Big{)}= italic_h ( ∑ start_POSTSUBSCRIPT italic_j ≤ italic_N end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_j ) italic_P start_POSTSUBSCRIPT italic_c ( italic_X start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ) (121)
jNPiX(j)h(Pc(Xj))absentsubscript𝑗𝑁subscript𝑃subscript𝑖𝑋𝑗subscript𝑃𝑐subscript𝑋𝑗\displaystyle\geq\textstyle\sum_{j\leq N}P_{i_{X}}(j)h(P_{c(X_{j})})≥ ∑ start_POSTSUBSCRIPT italic_j ≤ italic_N end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_j ) italic_h ( italic_P start_POSTSUBSCRIPT italic_c ( italic_X start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ) (122)
jNPiX(j)H(c(Xi))absentsubscript𝑗𝑁subscript𝑃subscript𝑖𝑋𝑗𝐻𝑐subscript𝑋superscript𝑖\displaystyle\geq\textstyle\sum_{j\leq N}P_{i_{X}}(j)H(c(X_{i^{\star}}))≥ ∑ start_POSTSUBSCRIPT italic_j ≤ italic_N end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_j ) italic_H ( italic_c ( italic_X start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) ) (123)
Hχ(G~i),absentsubscript𝐻𝜒subscript~𝐺superscript𝑖\displaystyle\geq H_{\chi}(\tilde{G}_{i^{\star}}),≥ italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( over~ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) , (124)
=Hχ(G~1),absentsubscript𝐻𝜒subscript~𝐺1\displaystyle=H_{\chi}(\tilde{G}_{1}),= italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( over~ start_ARG italic_G end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , (125)

where (122) follows from the concavity of hhitalic_h; (123) follows from the definition of isuperscript𝑖i^{\star}italic_i start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT; (124) comes from the fact that c𝑐citalic_c induces a coloring of G~isubscript~𝐺superscript𝑖\tilde{G}_{i^{\star}}over~ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT; (125) comes from the fact that G~1subscript~𝐺1\tilde{G}_{1}over~ start_ARG italic_G end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and G~isubscript~𝐺superscript𝑖\tilde{G}_{i^{\star}}over~ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT are isomorphic. Now, we can combine the bounds (120) and (125): for all coloring c𝑐citalic_c of G𝐺Gitalic_G we have

Hχ(G)Hχ(G~1)H(c(X)),subscript𝐻𝜒𝐺subscript𝐻𝜒subscript~𝐺1𝐻𝑐𝑋\displaystyle H_{\chi}(G)\leq H_{\chi}(\tilde{G}_{1})\leq H(c(X)),italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G ) ≤ italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( over~ start_ARG italic_G end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ≤ italic_H ( italic_c ( italic_X ) ) , (126)

which yields the desired equality when taking the infimum over c𝑐citalic_c.

A-A5 Proof of Lemma 11

Let c:𝒳𝒞:superscript𝑐𝒳𝒞c^{\star}:\mathcal{X}\rightarrow\mathcal{C}italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT : caligraphic_X → caligraphic_C and c𝒮:𝒮𝒞:subscriptsuperscript𝑐𝒮𝒮𝒞c^{\star}_{\mathcal{S}}:\mathcal{S}\rightarrow\mathcal{C}italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT : caligraphic_S → caligraphic_C be the optimal colorings of G𝐺Gitalic_G and G[𝒮]𝐺delimited-[]𝒮G[\mathcal{S}]italic_G [ caligraphic_S ], respectively. Consider the coloring c:𝒳𝒞𝒳:𝑐𝒳square-union𝒞𝒳c:\mathcal{X}\rightarrow\mathcal{C}\sqcup\mathcal{X}italic_c : caligraphic_X → caligraphic_C ⊔ caligraphic_X of G𝐺Gitalic_G defined by c(x)=c𝒮𝑐𝑥subscriptsuperscript𝑐𝒮c(x)=c^{\star}_{\mathcal{S}}italic_c ( italic_x ) = italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT if v𝒮𝑣𝒮v\in\mathcal{S}italic_v ∈ caligraphic_S, c(x)=v𝑐𝑥𝑣c(x)=vitalic_c ( italic_x ) = italic_v otherwise.

(Lower bound) On one hand, we have

Hχ(G)subscript𝐻𝜒𝐺absent\displaystyle H_{\chi}(G)\leqitalic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G ) ≤ H(c(X),𝟙X𝒮)𝐻𝑐𝑋subscript1𝑋𝒮\displaystyle\>H(c(X),\mathds{1}_{X\in\mathcal{S}})italic_H ( italic_c ( italic_X ) , blackboard_1 start_POSTSUBSCRIPT italic_X ∈ caligraphic_S end_POSTSUBSCRIPT ) (127)
=\displaystyle== H(𝟙X𝒮)+PX(𝒮)H(c(X)|X𝒮)𝐻subscript1𝑋𝒮subscript𝑃𝑋𝒮𝐻conditional𝑐𝑋𝑋𝒮\displaystyle\>H(\mathds{1}_{X\in\mathcal{S}})+P_{X}(\mathcal{S})H(c(X)|X\in% \mathcal{S})italic_H ( blackboard_1 start_POSTSUBSCRIPT italic_X ∈ caligraphic_S end_POSTSUBSCRIPT ) + italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_S ) italic_H ( italic_c ( italic_X ) | italic_X ∈ caligraphic_S )
+(1PX(𝒮))H(c(X)|X𝒮)1subscript𝑃𝑋𝒮𝐻conditional𝑐𝑋𝑋𝒮\displaystyle+(1-P_{X}(\mathcal{S}))H(c(X)|X\notin\mathcal{S})+ ( 1 - italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_S ) ) italic_H ( italic_c ( italic_X ) | italic_X ∉ caligraphic_S ) (128)
\displaystyle\leq  1+H(c𝒮(X)|X𝒮)+(1PX(𝒮))log|𝒳|1𝐻conditionalsubscriptsuperscript𝑐𝒮𝑋𝑋𝒮1subscript𝑃𝑋𝒮𝒳\displaystyle\>1+H(c^{\star}_{\mathcal{S}}(X)|X\in\mathcal{S})+(1-P_{X}(% \mathcal{S}))\log|\mathcal{X}|1 + italic_H ( italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT ( italic_X ) | italic_X ∈ caligraphic_S ) + ( 1 - italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_S ) ) roman_log | caligraphic_X | (129)
=\displaystyle== Hχ(G[𝒮])+1+(1PX(𝒮))log|𝒳|;subscript𝐻𝜒𝐺delimited-[]𝒮11subscript𝑃𝑋𝒮𝒳\displaystyle\>H_{\chi}(G[\mathcal{S}])+1+(1-P_{X}(\mathcal{S}))\log|\mathcal{% X}|;italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G [ caligraphic_S ] ) + 1 + ( 1 - italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_S ) ) roman_log | caligraphic_X | ; (130)

where (127) comes from the fact that c𝑐citalic_c is a coloring of G𝐺Gitalic_G; (128) is a decomposition using conditional entropies; (129) comes from the construction of c𝑐citalic_c: c|𝒮=c𝒮evaluated-at𝑐𝒮subscriptsuperscript𝑐𝒮c|_{\mathcal{S}}=c^{\star}_{\mathcal{S}}italic_c | start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT = italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT; (130) follows from the optimality of c𝒮subscriptsuperscript𝑐𝒮c^{\star}_{\mathcal{S}}italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT as a coloring of G[𝒮]𝐺delimited-[]𝒮G[\mathcal{S}]italic_G [ caligraphic_S ].

(Upper bound) On the other hand,

Hχ(G[𝒮])subscript𝐻𝜒𝐺delimited-[]𝒮absent\displaystyle H_{\chi}(G[\mathcal{S}])\leqitalic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G [ caligraphic_S ] ) ≤ H(c(X)|V𝒮)𝐻conditionalsuperscript𝑐𝑋𝑉𝒮\displaystyle\>H(c^{\star}(X)|V\in\mathcal{S})italic_H ( italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_X ) | italic_V ∈ caligraphic_S ) (131)
=\displaystyle== 1PX(𝒮)(H(c(X)|𝟙X𝒮)(1PX(𝒮))H(c(X)|V𝒮))1subscript𝑃𝑋𝒮𝐻conditionalsuperscript𝑐𝑋subscript1𝑋𝒮1subscript𝑃𝑋𝒮𝐻conditionalsuperscript𝑐𝑋𝑉𝒮\displaystyle\,\frac{1}{P_{X}(\mathcal{S})}\Big{(}\!H(c^{\star}(X)|\mathds{1}_% {X\in\mathcal{S}})-(1-P_{X}(\mathcal{S}))H(c^{\star}(X)|V\notin\mathcal{S})\!% \Big{)}divide start_ARG 1 end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_S ) end_ARG ( italic_H ( italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_X ) | blackboard_1 start_POSTSUBSCRIPT italic_X ∈ caligraphic_S end_POSTSUBSCRIPT ) - ( 1 - italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_S ) ) italic_H ( italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_X ) | italic_V ∉ caligraphic_S ) ) (132)
\displaystyle\leq H(c(X))PX(𝒮)=Hχ(G)PX(𝒮)𝐻superscript𝑐𝑋subscript𝑃𝑋𝒮subscript𝐻𝜒𝐺subscript𝑃𝑋𝒮\displaystyle\>\frac{H(c^{\star}(X))}{P_{X}(\mathcal{S})}=\frac{H_{\chi}(G)}{P% _{X}(\mathcal{S})}divide start_ARG italic_H ( italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_X ) ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_S ) end_ARG = divide start_ARG italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_S ) end_ARG (133)

where (131) comes from the fact that csuperscript𝑐c^{\star}italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT induces a coloring of G[𝒮]𝐺delimited-[]𝒮G[\mathcal{S}]italic_G [ caligraphic_S ]; (132) is a decomposition using conditional entropies; (133) results from the elimination of negative terms and the optimality of csuperscript𝑐c^{\star}italic_c start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT.

A-B Proof of Lemma 2

In order to prove Lemma 2 we need Lemma 7, which can be found in App. A-A1; and Lemma 12, which is a generalization for infinite sequences of the following observation: if Ta¯n=PAΔn(𝒜)subscript𝑇superscript¯𝑎𝑛subscript𝑃𝐴subscriptΔ𝑛𝒜T_{\overline{a}^{n}}=P_{A}\in\Delta_{n}(\mathcal{A})italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( caligraphic_A ) satisfies PA=inPA+ninPA′′subscript𝑃𝐴𝑖𝑛subscriptsuperscript𝑃𝐴𝑛𝑖𝑛subscriptsuperscript𝑃′′𝐴P_{A}=\frac{i}{n}P^{\prime}_{A}+\frac{n-i}{n}P^{\prime\prime}_{A}italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT = divide start_ARG italic_i end_ARG start_ARG italic_n end_ARG italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT + divide start_ARG italic_n - italic_i end_ARG start_ARG italic_n end_ARG italic_P start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT with PAΔi(𝒜)subscriptsuperscript𝑃𝐴subscriptΔ𝑖𝒜P^{\prime}_{A}\in\Delta_{i}(\mathcal{A})italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( caligraphic_A ) and PA′′Δni(𝒜)subscriptsuperscript𝑃′′𝐴subscriptΔ𝑛𝑖𝒜P^{\prime\prime}_{A}\in\Delta_{n-i}(\mathcal{A})italic_P start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ start_POSTSUBSCRIPT italic_n - italic_i end_POSTSUBSCRIPT ( caligraphic_A ), then a¯nsuperscript¯𝑎𝑛\overline{a}^{n}over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT can be separated into two subsequences aisuperscript𝑎𝑖a^{\prime i}italic_a start_POSTSUPERSCRIPT ′ italic_i end_POSTSUPERSCRIPT and a′′nisuperscript𝑎′′𝑛𝑖a^{\prime\prime n-i}italic_a start_POSTSUPERSCRIPT ′ ′ italic_n - italic_i end_POSTSUPERSCRIPT such that Tai=PAsubscript𝑇superscript𝑎𝑖subscriptsuperscript𝑃𝐴T_{a^{\prime i}}=P^{\prime}_{A}italic_T start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ italic_i end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT and Ta′′ni=PA′′subscript𝑇superscript𝑎′′𝑛𝑖subscriptsuperscript𝑃′′𝐴T_{a^{\prime\prime n-i}}=P^{\prime\prime}_{A}italic_T start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ ′ italic_n - italic_i end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = italic_P start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT.

Lemma 12 (Type-splitting lemma).

Let (a¯n)n𝒜subscriptsubscript¯𝑎𝑛𝑛superscriptsuperscript𝒜superscript(\overline{a}_{n})_{n\in\mathbb{N}^{\star}}\in\mathcal{A}^{\mathbb{N}^{\star}}( over¯ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_A start_POSTSUPERSCRIPT blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT be a sequence such that Ta¯nPAΔ(𝒜)subscript𝑇superscript¯𝑎𝑛subscript𝑃𝐴Δ𝒜T_{\overline{a}^{n}}\rightarrow P_{A}\in\Delta(\mathcal{A})italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT → italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ) when n𝑛n\rightarrow\inftyitalic_n → ∞, let β(0,1)𝛽01\beta\in(0,1)italic_β ∈ ( 0 , 1 ) and PA,PA′′Δ(𝒜)subscriptsuperscript𝑃𝐴subscriptsuperscript𝑃′′𝐴Δ𝒜P^{\prime}_{A},P^{\prime\prime}_{A}\in\Delta(\mathcal{A})italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT , italic_P start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ) such that

PA=βPA+(1β)PA′′.subscript𝑃𝐴𝛽subscriptsuperscript𝑃𝐴1𝛽subscriptsuperscript𝑃′′𝐴\displaystyle P_{A}=\beta P^{\prime}_{A}+(1-\beta)P^{\prime\prime}_{A}.italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT = italic_β italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT + ( 1 - italic_β ) italic_P start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT . (134)

Then there exists a sequence (bn)n{0,1}subscriptsubscript𝑏𝑛𝑛superscriptsuperscript01superscript(b_{n})_{n\in\mathbb{N}^{\star}}\in\{0,1\}^{\mathbb{N}^{\star}}( italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∈ { 0 , 1 } start_POSTSUPERSCRIPT blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT such that the two extracted sequences a(a¯n)n,bn=0approaches-limitsuperscript𝑎subscriptsubscript¯𝑎𝑛𝑛superscriptsubscript𝑏𝑛0a^{\prime}\doteq(\overline{a}_{n})_{\begin{subarray}{c}n\in\mathbb{N}^{\star},% \\ b_{n}=0\end{subarray}}italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≐ ( over¯ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , end_CELL end_ROW start_ROW start_CELL italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = 0 end_CELL end_ROW end_ARG end_POSTSUBSCRIPT and a′′(a¯n)n,bn=1approaches-limitsuperscript𝑎′′subscriptsubscript¯𝑎𝑛𝑛superscriptsubscript𝑏𝑛1a^{\prime\prime}\doteq(\overline{a}_{n})_{\begin{subarray}{c}n\in\mathbb{N}^{% \star},\\ b_{n}=1\end{subarray}}italic_a start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ≐ ( over¯ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , end_CELL end_ROW start_ROW start_CELL italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = 1 end_CELL end_ROW end_ARG end_POSTSUBSCRIPT satisfy

Tbnn(β,1β),subscript𝑇superscript𝑏𝑛𝑛𝛽1𝛽\displaystyle T_{b^{n}}\underset{n\rightarrow\infty}{\rightarrow}(\beta,1-% \beta),italic_T start_POSTSUBSCRIPT italic_b start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG ( italic_β , 1 - italic_β ) , (135)
TannPA,subscript𝑇superscript𝑎𝑛𝑛subscriptsuperscript𝑃𝐴\displaystyle T_{a^{\prime n}}\underset{n\rightarrow\infty}{\rightarrow}P^{% \prime}_{A},italic_T start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT , Ta′′nnPA′′.subscript𝑇superscript𝑎′′𝑛𝑛subscriptsuperscript𝑃′′𝐴\displaystyle T_{a^{\prime\prime n}}\underset{n\rightarrow\infty}{\rightarrow}% P^{\prime\prime}_{A}.italic_T start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG italic_P start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT . (136)

The proof of Lemma 12 is given in App. A-B1. Now let us prove Lemma 2. We recall the definition of the function

η:PAH¯(a𝒜PAGa).:𝜂maps-tosubscript𝑃𝐴¯𝐻superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎\displaystyle\eta:P_{A}\mapsto\overline{H}\Bigg{(}\bigsqcup_{a\in\mathcal{A}}^% {P_{A}}G_{a}\Bigg{)}.italic_η : italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ↦ over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) . (137)

(η𝜂\etaitalic_η Lipschitz) Let us first prove that the function η𝜂\etaitalic_η is Lipschitz. For all PA,PAΔ(𝒜)subscript𝑃𝐴subscriptsuperscript𝑃𝐴Δ𝒜P_{A},P^{\prime}_{A}\in\Delta(\mathcal{A})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT , italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ) we need to bound the quantity |η(PA)η(PA)|𝜂subscript𝑃𝐴𝜂subscriptsuperscript𝑃𝐴|\eta(P_{A})-\eta(P^{\prime}_{A})|| italic_η ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) - italic_η ( italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) |; by Lemma 7 this is equivalent to bounding

limn1n|Hχ(a𝒜GanTa¯n(a))Hχ(a𝒜GanTa¯n(a))|subscript𝑛1𝑛subscript𝐻𝜒subscript𝑎𝒜superscriptsubscript𝐺𝑎𝑛subscript𝑇superscript¯𝑎𝑛𝑎subscript𝐻𝜒subscript𝑎𝒜superscriptsubscript𝐺𝑎𝑛subscript𝑇superscript¯𝑎𝑛𝑎\displaystyle\lim_{n\rightarrow\infty}\frac{1}{n}\left|H_{\chi}\left(% \textstyle\bigwedge_{a\in\mathcal{A}}G_{a}^{\wedge nT_{\overline{a}^{n}}(a)}% \right)-H_{\chi}\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}^{\wedge nT_{% \overline{a}^{\prime n}}(a)}\right)\right|roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG | italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ) - italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ) | (138)

where (Ta¯n,Ta¯n)(PA,PA)subscript𝑇superscript¯𝑎𝑛subscript𝑇superscript¯𝑎𝑛subscript𝑃𝐴subscriptsuperscript𝑃𝐴(T_{\overline{a}^{n}},T_{\overline{a}^{\prime n}})\rightarrow(P_{A},P^{\prime}% _{A})( italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) → ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT , italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) when n𝑛n\rightarrow\inftyitalic_n → ∞.

Fix n𝑛superscriptn\in\mathbb{N}^{\star}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT, we assume that the quantity inside |||\cdot|| ⋅ | in (138) is positive; the other case can be treated with the same arguments by symmetry of the roles. We have

Hχ(a𝒜GanTa¯n(a))Hχ(a𝒜GanTa¯n(a))subscript𝐻𝜒subscript𝑎𝒜superscriptsubscript𝐺𝑎𝑛subscript𝑇superscript¯𝑎𝑛𝑎subscript𝐻𝜒subscript𝑎𝒜superscriptsubscript𝐺𝑎𝑛subscript𝑇superscript¯𝑎𝑛𝑎\displaystyle H_{\chi}\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}^{\wedge nT% _{\overline{a}^{n}}(a)}\right)-H_{\chi}\left(\textstyle\bigwedge_{a\in\mathcal% {A}}G_{a}^{\wedge nT_{\overline{a}^{\prime n}}(a)}\right)italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ) - italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ) (139)
\displaystyle\leq Hχ(a𝒜GanTa¯n(a))Hχ(a𝒜Ganmin(Ta¯n(a),Ta¯n(a)))subscript𝐻𝜒subscript𝑎𝒜superscriptsubscript𝐺𝑎𝑛subscript𝑇superscript¯𝑎𝑛𝑎subscript𝐻𝜒subscript𝑎𝒜superscriptsubscript𝐺𝑎𝑛subscript𝑇superscript¯𝑎𝑛𝑎subscript𝑇superscript¯𝑎𝑛𝑎\displaystyle\;H_{\chi}\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}^{% \wedge nT_{\overline{a}^{n}}(a)}\right)-H_{\chi}\left(\textstyle\bigwedge_{a% \in\mathcal{A}}G_{a}^{\wedge n\min(T_{\overline{a}^{n}}(a),T_{\overline{a}^{% \prime n}}(a))}\right)italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ) - italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n roman_min ( italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) , italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) ) end_POSTSUPERSCRIPT ) (140)
=\displaystyle== Hχ(a𝒜Ganmin(Ta¯n(a),Ta¯n(a))a𝒜Gan|Ta¯n(a)Ta¯n(a)|+)subscript𝐻𝜒subscript𝑎𝒜superscriptsubscript𝐺𝑎𝑛subscript𝑇superscript¯𝑎𝑛𝑎subscript𝑇superscript¯𝑎𝑛𝑎subscript𝑎𝒜superscriptsubscript𝐺𝑎𝑛subscriptsubscript𝑇superscript¯𝑎𝑛𝑎subscript𝑇superscript¯𝑎𝑛𝑎\displaystyle\;H_{\chi}\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}^{% \wedge n\min(T_{\overline{a}^{n}}(a),T_{\overline{a}^{\prime n}}(a))}% \textstyle\bigwedge_{a\in\mathcal{A}}G_{a}^{\wedge n|T_{\overline{a}^{n}}(a)-T% _{\overline{a}^{\prime n}}(a)|_{+}}\right)italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n roman_min ( italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) , italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) ) end_POSTSUPERSCRIPT ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n | italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) - italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) | start_POSTSUBSCRIPT + end_POSTSUBSCRIPT end_POSTSUPERSCRIPT )
Hχ(a𝒜Ganmin(Ta¯n(a),Ta¯n(a)))subscript𝐻𝜒subscript𝑎𝒜superscriptsubscript𝐺𝑎𝑛subscript𝑇superscript¯𝑎𝑛𝑎subscript𝑇superscript¯𝑎𝑛𝑎\displaystyle-H_{\chi}\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}^{\wedge n% \min(T_{\overline{a}^{n}}(a),T_{\overline{a}^{\prime n}}(a))}\right)- italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n roman_min ( italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) , italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) ) end_POSTSUPERSCRIPT ) (141)
\displaystyle\leq Hχ(a𝒜Gan|Ta¯n(a)Ta¯n(a)|+)subscript𝐻𝜒subscript𝑎𝒜superscriptsubscript𝐺𝑎𝑛subscriptsubscript𝑇superscript¯𝑎𝑛𝑎subscript𝑇superscript¯𝑎𝑛𝑎\displaystyle\;H_{\chi}\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}^{% \wedge n|T_{\overline{a}^{n}}(a)-T_{\overline{a}^{\prime n}}(a)|_{+}}\right)italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n | italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) - italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) | start_POSTSUBSCRIPT + end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) (142)
\displaystyle\leq log(maxa|𝒳a|)a𝒜n|Ta¯n(a)Ta¯n(a)|+subscript𝑎subscript𝒳𝑎subscript𝑎𝒜𝑛subscriptsubscript𝑇superscript¯𝑎𝑛𝑎subscript𝑇superscript¯𝑎𝑛𝑎\displaystyle\;\log\left(\max_{a}|\mathcal{X}_{a}|\right)\textstyle\sum_{a\in% \mathcal{A}}n|T_{\overline{a}^{n}}(a)-T_{\overline{a}^{\prime n}}(a)|_{+}roman_log ( roman_max start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | ) ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_n | italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) - italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) | start_POSTSUBSCRIPT + end_POSTSUBSCRIPT (143)
\displaystyle\leq nlog(maxa|𝒳a|)Ta¯nTa¯n1,𝑛subscript𝑎subscript𝒳𝑎subscriptnormsubscript𝑇superscript¯𝑎𝑛subscript𝑇superscript¯𝑎𝑛1\displaystyle\;n\log\left(\max_{a}|\mathcal{X}_{a}|\right)\|T_{\overline{a}^{n% }}-T_{\overline{a}^{\prime n}}\|_{1},italic_n roman_log ( roman_max start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | ) ∥ italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , (144)

where ||+=max(,0)|\cdot|_{+}=\max(\cdot,0)| ⋅ | start_POSTSUBSCRIPT + end_POSTSUBSCRIPT = roman_max ( ⋅ , 0 ) and Ta¯nTa¯n1=a𝒜|Ta¯n(a)Ta¯n(a)|subscriptnormsubscript𝑇superscript¯𝑎𝑛subscript𝑇superscript¯𝑎𝑛1subscript𝑎𝒜subscript𝑇superscript¯𝑎𝑛𝑎subscript𝑇superscript¯𝑎𝑛𝑎\|T_{\overline{a}^{n}}-T_{\overline{a}^{\prime n}}\|_{1}=\sum_{a\in\mathcal{A}% }|T_{\overline{a}^{n}}(a)-T_{\overline{a}^{\prime n}}(a)|∥ italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT | italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) - italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) |; (140) follows from the removal of terms in the second product, as Hχ(GG)Hχ(G)subscript𝐻𝜒𝐺superscript𝐺subscript𝐻𝜒𝐺H_{\chi}(G\wedge G^{\prime})\geq H_{\chi}(G)italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ≥ italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G ) for all probabilistic graphs G,G𝐺superscript𝐺G,G^{\prime}italic_G , italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT; (141) is an arrangement of the terms in the first product, as min(s,t)+max(st,0)=s𝑠𝑡𝑠𝑡0𝑠\min(s,t)+\max(s-t,0)=sroman_min ( italic_s , italic_t ) + roman_max ( italic_s - italic_t , 0 ) = italic_s for all real numbers s,t𝑠𝑡s,titalic_s , italic_t; (142) comes from the subadditivity of Hχsubscript𝐻𝜒H_{\chi}italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT; (143) follows from Hχ(Ga)logmaxa|𝒳a|subscript𝐻𝜒subscript𝐺𝑎subscriptsuperscript𝑎subscript𝒳superscript𝑎H_{\chi}(G_{a})\leq\log\max_{a^{\prime}}|\mathcal{X}_{a^{\prime}}|italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) ≤ roman_log roman_max start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | caligraphic_X start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A; (144) results from |Ta¯n(a)Ta¯n(a)|+|Ta¯n(a)Ta¯n(a)|subscriptsubscript𝑇superscript¯𝑎𝑛𝑎subscript𝑇superscript¯𝑎𝑛𝑎subscript𝑇superscript¯𝑎𝑛𝑎subscript𝑇superscript¯𝑎𝑛𝑎|T_{\overline{a}^{n}}(a)-T_{\overline{a}^{\prime n}}(a)|_{+}\leq|T_{\overline{% a}^{n}}(a)-T_{\overline{a}^{\prime n}}(a)|| italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) - italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) | start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ≤ | italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) - italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) | for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A.

By normalization and limit, it follows that

|η(PA)η(PA)|𝜂subscript𝑃𝐴𝜂subscriptsuperscript𝑃𝐴\displaystyle|\eta(P_{A})-\eta(P^{\prime}_{A})|| italic_η ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) - italic_η ( italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) | limnlog(maxa|𝒳a|)Ta¯nTa¯n1absentsubscript𝑛subscript𝑎subscript𝒳𝑎subscriptnormsubscript𝑇superscript¯𝑎𝑛subscript𝑇superscript¯𝑎𝑛1\displaystyle\leq\lim_{n\rightarrow\infty}\log\left(\max_{a}|\mathcal{X}_{a}|% \right)\cdot\|T_{\overline{a}^{n}}-T_{\overline{a}^{\prime n}}\|_{1}≤ roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT roman_log ( roman_max start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | ) ⋅ ∥ italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT (145)
=log(maxa|𝒳a|)PAPA1.absentsubscript𝑎subscript𝒳𝑎subscriptnormsubscript𝑃𝐴subscriptsuperscript𝑃𝐴1\displaystyle=\log\left(\max_{a}|\mathcal{X}_{a}|\right)\cdot\|P_{A}-P^{\prime% }_{A}\|_{1}.= roman_log ( roman_max start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | ) ⋅ ∥ italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT - italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT . (146)

Hence η𝜂\etaitalic_η is (logmaxa|𝒳a|)subscript𝑎subscript𝒳𝑎(\log\max_{a}|\mathcal{X}_{a}|)( roman_log roman_max start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT | )-Lipschitz.

(η𝜂\etaitalic_η convex) Let us now prove that η𝜂\etaitalic_η is convex. Let PA,PA′′Δ(𝒜)subscriptsuperscript𝑃𝐴subscriptsuperscript𝑃′′𝐴Δ𝒜P^{\prime}_{A},P^{\prime\prime}_{A}\in\Delta(\mathcal{A})italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT , italic_P start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ), and β(0,1)𝛽01\beta\in(0,1)italic_β ∈ ( 0 , 1 ), we have by Lemma 7

η(βPA+(1β)PA′′)=limn1nHχ(a𝒜GanTa¯n(a)),𝜂𝛽subscriptsuperscript𝑃𝐴1𝛽subscriptsuperscript𝑃′′𝐴subscript𝑛1𝑛subscript𝐻𝜒subscript𝑎𝒜superscriptsubscript𝐺𝑎𝑛subscript𝑇superscript¯𝑎𝑛𝑎\displaystyle\eta\big{(}\beta P^{\prime}_{A}+(1-\beta)P^{\prime\prime}_{A}\big% {)}=\lim_{n\rightarrow\infty}\frac{1}{n}H_{\chi}\left(\textstyle\bigwedge_{a% \in\mathcal{A}}G_{a}^{\wedge nT_{\overline{a}^{n}}(a)}\right),italic_η ( italic_β italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT + ( 1 - italic_β ) italic_P start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) = roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ italic_n italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ) , (147)

where Ta¯nβPA+(1β)PA′′subscript𝑇superscript¯𝑎𝑛𝛽subscriptsuperscript𝑃𝐴1𝛽subscriptsuperscript𝑃′′𝐴T_{\overline{a}^{n}}\rightarrow\beta P^{\prime}_{A}+(1-\beta)P^{\prime\prime}_% {A}italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT → italic_β italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT + ( 1 - italic_β ) italic_P start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT when n𝑛n\rightarrow\inftyitalic_n → ∞. By Lemma 12, there exists (bn)n{0,1}subscriptsubscript𝑏𝑛𝑛superscriptsuperscript01superscript(b_{n})_{n\in\mathbb{N}^{\star}}\in\{0,1\}^{\mathbb{N}^{\star}}( italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∈ { 0 , 1 } start_POSTSUPERSCRIPT blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT such that the decomposition of (a¯n)nsubscriptsubscript¯𝑎𝑛𝑛superscript(\overline{a}_{n})_{n\in\mathbb{N}^{\star}}( over¯ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT into two subsequences a(a¯n)n,bn=0approaches-limitsuperscript𝑎subscriptsubscript¯𝑎𝑛𝑛superscriptsubscript𝑏𝑛0a^{\prime}\doteq(\overline{a}_{n})_{\begin{subarray}{c}n\in\mathbb{N}^{\star},% \\ b_{n}=0\end{subarray}}italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≐ ( over¯ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , end_CELL end_ROW start_ROW start_CELL italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = 0 end_CELL end_ROW end_ARG end_POSTSUBSCRIPT and a′′(a¯n)n,bn=1approaches-limitsuperscript𝑎′′subscriptsubscript¯𝑎𝑛𝑛superscriptsubscript𝑏𝑛1a^{\prime\prime}\doteq(\overline{a}_{n})_{\begin{subarray}{c}n\in\mathbb{N}^{% \star},\\ b_{n}=1\end{subarray}}italic_a start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ≐ ( over¯ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , end_CELL end_ROW start_ROW start_CELL italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = 1 end_CELL end_ROW end_ARG end_POSTSUBSCRIPT satisfies

Tbnn(β,1β),subscript𝑇superscript𝑏𝑛𝑛𝛽1𝛽\displaystyle T_{b^{n}}\underset{n\rightarrow\infty}{\rightarrow}(\beta,1-% \beta),italic_T start_POSTSUBSCRIPT italic_b start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG ( italic_β , 1 - italic_β ) , (148)
TannPA,subscript𝑇superscript𝑎𝑛𝑛subscriptsuperscript𝑃𝐴\displaystyle T_{a^{\prime n}}\underset{n\rightarrow\infty}{\rightarrow}P^{% \prime}_{A},italic_T start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT , Ta′′nnPA′′.subscript𝑇superscript𝑎′′𝑛𝑛subscriptsuperscript𝑃′′𝐴\displaystyle T_{a^{\prime\prime n}}\underset{n\rightarrow\infty}{\rightarrow}% P^{\prime\prime}_{A}.italic_T start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG italic_P start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT . (149)

For all n𝑛superscriptn\in\mathbb{N}^{\star}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT, let Ξ(n)nTbn(0)approaches-limitΞ𝑛𝑛subscript𝑇superscript𝑏𝑛0\Xi(n)\doteq nT_{b^{n}}(0)roman_Ξ ( italic_n ) ≐ italic_n italic_T start_POSTSUBSCRIPT italic_b start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( 0 ), we have

η(βPA+(1β)PA′′)𝜂𝛽subscriptsuperscript𝑃𝐴1𝛽subscriptsuperscript𝑃′′𝐴\displaystyle\eta\big{(}\beta P^{\prime}_{A}+(1-\beta)P^{\prime\prime}_{A}\big% {)}italic_η ( italic_β italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT + ( 1 - italic_β ) italic_P start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) (150)
=\displaystyle== limn1nHχ(a𝒜GaΞ(n)TaΞ(n)(a)+(nΞ(n))Ta′′nΞ(n)(a))subscript𝑛1𝑛subscript𝐻𝜒subscript𝑎𝒜superscriptsubscript𝐺𝑎Ξ𝑛subscript𝑇superscript𝑎Ξ𝑛𝑎𝑛Ξ𝑛subscript𝑇superscript𝑎′′𝑛Ξ𝑛𝑎\displaystyle\;\lim_{n\rightarrow\infty}\frac{1}{n}H_{\chi}\left(\textstyle% \bigwedge_{a\in\mathcal{A}}G_{a}^{\wedge\Xi(n)T_{a^{\prime\Xi(n)}}(a)+(n-\Xi(n% ))T_{a^{\prime\prime n-\Xi(n)}}(a)}\right)roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ roman_Ξ ( italic_n ) italic_T start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ roman_Ξ ( italic_n ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) + ( italic_n - roman_Ξ ( italic_n ) ) italic_T start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ ′ italic_n - roman_Ξ ( italic_n ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ) (151)
\displaystyle\leq limnΞ(n)n1Ξ(n)Hχ(a𝒜GaΞ(n)TaΞ(n)(a))subscript𝑛Ξ𝑛𝑛1Ξ𝑛subscript𝐻𝜒subscript𝑎𝒜superscriptsubscript𝐺𝑎Ξ𝑛subscript𝑇superscript𝑎Ξ𝑛𝑎\displaystyle\;\lim_{n\rightarrow\infty}\frac{\Xi(n)}{n}\frac{1}{\Xi(n)}H_{% \chi}\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}^{\wedge\Xi(n)T_{a^{% \prime\Xi(n)}}(a)}\right)roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG roman_Ξ ( italic_n ) end_ARG start_ARG italic_n end_ARG divide start_ARG 1 end_ARG start_ARG roman_Ξ ( italic_n ) end_ARG italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ roman_Ξ ( italic_n ) italic_T start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ roman_Ξ ( italic_n ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ) (152)
+nΞ(n)n1nΞ(n)Hχ(a𝒜Ga(nΞ(n))Ta′′nΞ(n)(a))𝑛Ξ𝑛𝑛1𝑛Ξ𝑛subscript𝐻𝜒subscript𝑎𝒜superscriptsubscript𝐺𝑎𝑛Ξ𝑛subscript𝑇superscript𝑎′′𝑛Ξ𝑛𝑎\displaystyle+\frac{n-\Xi(n)}{n}\frac{1}{n-\Xi(n)}H_{\chi}\left(\textstyle% \bigwedge_{a\in\mathcal{A}}G_{a}^{\wedge(n-\Xi(n))T_{a^{\prime\prime n-\Xi(n)}% }(a)}\right)+ divide start_ARG italic_n - roman_Ξ ( italic_n ) end_ARG start_ARG italic_n end_ARG divide start_ARG 1 end_ARG start_ARG italic_n - roman_Ξ ( italic_n ) end_ARG italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∧ ( italic_n - roman_Ξ ( italic_n ) ) italic_T start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ ′ italic_n - roman_Ξ ( italic_n ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ) (153)
=\displaystyle== βη(PA)+(1β)η(PA′′);𝛽𝜂subscriptsuperscript𝑃𝐴1𝛽𝜂subscriptsuperscript𝑃′′𝐴\displaystyle\;\beta\eta(P^{\prime}_{A})+(1-\beta)\eta(P^{\prime\prime}_{A});italic_β italic_η ( italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ( 1 - italic_β ) italic_η ( italic_P start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) ; (154)

where (151) comes from (147); (153) follows from the subadditivity of Hχsubscript𝐻𝜒H_{\chi}italic_H start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT; (154) comes from (148), (149) and Lemma 7. Since (154) holds for all PA,PA′′Δ(𝒜)subscriptsuperscript𝑃𝐴subscriptsuperscript𝑃′′𝐴Δ𝒜P^{\prime}_{A},P^{\prime\prime}_{A}\in\Delta(\mathcal{A})italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT , italic_P start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ) and β(0,1)𝛽01\beta\in(0,1)italic_β ∈ ( 0 , 1 ), we have that η𝜂\etaitalic_η is convex.

A-B1 Proof of Lemma 12

Let (a¯n)n𝒜subscriptsubscript¯𝑎𝑛𝑛superscriptsuperscript𝒜superscript(\overline{a}_{n})_{n\in\mathbb{N}^{\star}}\in\mathcal{A}^{\mathbb{N}^{\star}}( over¯ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_A start_POSTSUPERSCRIPT blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT be a sequence such that Ta¯nPA=βPA+(1β)PA′′subscript𝑇superscript¯𝑎𝑛subscript𝑃𝐴𝛽subscriptsuperscript𝑃𝐴1𝛽subscriptsuperscript𝑃′′𝐴T_{\overline{a}^{n}}\rightarrow P_{A}=\beta P^{\prime}_{A}+(1-\beta)P^{\prime% \prime}_{A}italic_T start_POSTSUBSCRIPT over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT → italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT = italic_β italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT + ( 1 - italic_β ) italic_P start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT when n𝑛n\rightarrow\inftyitalic_n → ∞.

Consider a sequence (Bn)nsubscriptsubscript𝐵𝑛𝑛superscript(B_{n})_{n\in\mathbb{N}^{\star}}( italic_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT of independent Bernoulli random variables such that for all n𝑛superscriptn\in\mathbb{N}^{\star}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT,

(Bn=0)=βPA(a¯n)PA(a¯n).subscript𝐵𝑛0𝛽subscriptsuperscript𝑃𝐴subscript¯𝑎𝑛subscript𝑃𝐴subscript¯𝑎𝑛\displaystyle\mathbb{P}(B_{n}=0)=\frac{\beta P^{\prime}_{A}(\overline{a}_{n})}% {P_{A}(\overline{a}_{n})}.blackboard_P ( italic_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = 0 ) = divide start_ARG italic_β italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( over¯ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( over¯ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_ARG . (155)

By the strong law of large numbers,

(TBn,a¯nn(βPA,(1β)PA′′))=1.subscript𝑇superscript𝐵𝑛superscript¯𝑎𝑛𝑛𝛽subscriptsuperscript𝑃𝐴1𝛽subscriptsuperscript𝑃′′𝐴1\displaystyle\mathbb{P}\left(T_{B^{n},\overline{a}^{n}}\underset{n\rightarrow% \infty}{\rightarrow}(\beta P^{\prime}_{A},(1-\beta)P^{\prime\prime}_{A})\right% )=1.blackboard_P ( italic_T start_POSTSUBSCRIPT italic_B start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG ( italic_β italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT , ( 1 - italic_β ) italic_P start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) ) = 1 . (156)

Therefore, there exists at least one realization (bn)nsubscriptsubscript𝑏𝑛𝑛superscript(b_{n})_{n\in\mathbb{N}^{\star}}( italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT of (Bn)nsubscriptsubscript𝐵𝑛𝑛superscript(B_{n})_{n\in\mathbb{N}^{\star}}( italic_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT such that Tbn,a¯nsubscript𝑇superscript𝑏𝑛superscript¯𝑎𝑛T_{b^{n},\overline{a}^{n}}italic_T start_POSTSUBSCRIPT italic_b start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , over¯ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT converges to (βPA,(1β)PA′′)𝛽subscriptsuperscript𝑃𝐴1𝛽subscriptsuperscript𝑃′′𝐴\big{(}\beta P^{\prime}_{A},(1-\beta)P^{\prime\prime}_{A}\big{)}( italic_β italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT , ( 1 - italic_β ) italic_P start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ). The convergences of marginal and conditional types yield

Tbnn(β,1β),subscript𝑇superscript𝑏𝑛𝑛𝛽1𝛽\displaystyle T_{b^{n}}\underset{n\rightarrow\infty}{\rightarrow}(\beta,1-% \beta),italic_T start_POSTSUBSCRIPT italic_b start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG ( italic_β , 1 - italic_β ) , (157)
TannPA,Ta′′nnPA′′,subscript𝑇superscript𝑎𝑛𝑛subscriptsuperscript𝑃𝐴subscript𝑇superscript𝑎′′𝑛𝑛subscriptsuperscript𝑃′′𝐴\displaystyle T_{a^{\prime n}}\underset{n\rightarrow\infty}{\rightarrow}P^{% \prime}_{A},\qquad T_{a^{\prime\prime n}}\underset{n\rightarrow\infty}{% \rightarrow}P^{\prime\prime}_{A},italic_T start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT , italic_T start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG italic_P start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT , (158)

where a(a¯n)n,bn=0approaches-limitsuperscript𝑎subscriptsubscript¯𝑎𝑛𝑛superscriptsubscript𝑏𝑛0a^{\prime}\doteq(\overline{a}_{n})_{\begin{subarray}{c}n\in\mathbb{N}^{\star},% \\ b_{n}=0\end{subarray}}italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≐ ( over¯ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , end_CELL end_ROW start_ROW start_CELL italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = 0 end_CELL end_ROW end_ARG end_POSTSUBSCRIPT and a′′(a¯n)n,bn=1approaches-limitsuperscript𝑎′′subscriptsubscript¯𝑎𝑛𝑛superscriptsubscript𝑏𝑛1a^{\prime\prime}\doteq(\overline{a}_{n})_{\begin{subarray}{c}n\in\mathbb{N}^{% \star},\\ b_{n}=1\end{subarray}}italic_a start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ≐ ( over¯ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , end_CELL end_ROW start_ROW start_CELL italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = 1 end_CELL end_ROW end_ARG end_POSTSUBSCRIPT are the extracted sequences.

A-C Proof of Lemma 6

It can be easily observed that

PAint(Δ(𝒜)),γ(PA)=a𝒜PA(a)γ(𝟙a)formulae-sequencesubscript𝑃𝐴intΔ𝒜𝛾subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎𝛾subscript1𝑎\displaystyle\exists P_{A}\in\operatorname*{int}(\Delta(\mathcal{A})),\,\gamma% (P_{A})=\textstyle\sum_{a\in\mathcal{A}}P_{A}(a)\gamma(\mathds{1}_{a})∃ italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_int ( roman_Δ ( caligraphic_A ) ) , italic_γ ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_γ ( blackboard_1 start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) (159)
\displaystyle\Longleftarrow\;\; PAΔ(𝒜),γ(PA)=a𝒜PA(a)γ(𝟙a).formulae-sequencefor-allsubscript𝑃𝐴Δ𝒜𝛾subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎𝛾subscript1𝑎\displaystyle\forall P_{A}\in\Delta(\mathcal{A}),\,\gamma(P_{A})=\textstyle% \sum_{a\in\mathcal{A}}P_{A}(a)\gamma(\mathds{1}_{a}).∀ italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ) , italic_γ ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_γ ( blackboard_1 start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) . (160)

Now let us prove (159) \Rightarrow (160). Let PAintΔ(𝒜)subscriptsuperscript𝑃𝐴intΔ𝒜P^{\star}_{A}\in\operatorname*{int}\Delta(\mathcal{A})italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_int roman_Δ ( caligraphic_A ) such that γ(PA)=a𝒜PA(a)γ(𝟙a)𝛾subscriptsuperscript𝑃𝐴subscript𝑎𝒜subscriptsuperscript𝑃𝐴𝑎𝛾subscript1𝑎\gamma(P^{\star}_{A})=\sum_{a\in\mathcal{A}}P^{\star}_{A}(a)\gamma(\mathds{1}_% {a})italic_γ ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_γ ( blackboard_1 start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ). Let m:Δ(𝒜):𝑚Δ𝒜m:\Delta(\mathcal{A})\rightarrow\mathbb{R}italic_m : roman_Δ ( caligraphic_A ) → blackboard_R linear such that m(PA)=γ(PA)𝑚subscriptsuperscript𝑃𝐴𝛾subscriptsuperscript𝑃𝐴m(P^{\star}_{A})=\gamma(P^{\star}_{A})italic_m ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) = italic_γ ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) and PAΔ(𝒜),m(PA)γ(PA)formulae-sequencefor-allsubscript𝑃𝐴Δ𝒜𝑚subscript𝑃𝐴𝛾subscript𝑃𝐴\forall P_{A}\in\Delta(\mathcal{A}),\,m(P_{A})\leq\gamma(P_{A})∀ italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ) , italic_m ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) ≤ italic_γ ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ). We have

0=γ(PA)m(PA)=a𝒜PA(a)(γ(𝟙a)m(𝟙a));0𝛾subscriptsuperscript𝑃𝐴𝑚subscriptsuperscript𝑃𝐴subscript𝑎𝒜subscriptsuperscript𝑃𝐴𝑎𝛾subscript1𝑎𝑚subscript1𝑎\displaystyle 0=\gamma(P^{\star}_{A})-m(P^{\star}_{A})=\textstyle\sum_{a\in% \mathcal{A}}P^{\star}_{A}(a)\big{(}\gamma(\mathds{1}_{a})-m(\mathds{1}_{a})% \big{)};0 = italic_γ ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) - italic_m ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) ( italic_γ ( blackboard_1 start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) - italic_m ( blackboard_1 start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) ) ; (161)

and therefore γ(𝟙a)=m(𝟙a)𝛾subscript1𝑎𝑚subscript1𝑎\gamma(\mathds{1}_{a})=m(\mathds{1}_{a})italic_γ ( blackboard_1 start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = italic_m ( blackboard_1 start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A, as γm0𝛾𝑚0\gamma-m\geq 0italic_γ - italic_m ≥ 0 and PA(a)>0subscriptsuperscript𝑃𝐴𝑎0P^{\star}_{A}(a)>0italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) > 0 for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A. For all PAΔ(𝒜)subscript𝑃𝐴Δ𝒜P_{A}\in\Delta(\mathcal{A})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ), we have

f(PA)𝑓subscript𝑃𝐴\displaystyle f(P_{A})italic_f ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) a𝒜PA(a)γ(𝟙a)absentsubscript𝑎𝒜subscript𝑃𝐴𝑎𝛾subscript1𝑎\displaystyle\leq\textstyle\sum_{a\in\mathcal{A}}P_{A}(a)\gamma(\mathds{1}_{a})≤ ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_γ ( blackboard_1 start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) (162)
=a𝒜PA(a)m(𝟙a)=m(PA),absentsubscript𝑎𝒜subscript𝑃𝐴𝑎𝑚subscript1𝑎𝑚subscript𝑃𝐴\displaystyle=\textstyle\sum_{a\in\mathcal{A}}P_{A}(a)m(\mathds{1}_{a})=m(P_{A% }),= ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_m ( blackboard_1 start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = italic_m ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) , (163)

hence γ=m𝛾𝑚\gamma=mitalic_γ = italic_m and γ𝛾\gammaitalic_γ is linear.

Appendix B Proof of Proposition 5

In order to prove Proposition 5 we need Lemma 13, which is a consequence of Marton’s formula in Theorem 9 applied to a disjoint union. The proof of Lemma 13 can be found in App. B-A.

Lemma 13.

Let PAΔ(𝒜)subscript𝑃𝐴Δ𝒜P_{A}\in\Delta(\mathcal{A})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ), then

H¯(a𝒜PAGa)+C(a𝒜PAGa,a𝒜PA(a)PXa)=H(PA)+a𝒜PA(a)H(PXa).¯𝐻subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎𝐶superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎𝐻subscript𝑃subscript𝑋𝑎\displaystyle\overline{H}\left(\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a}\right)% +C\left(\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a},\;\sum_{a\in\mathcal{A}}P_{A}% (a)P_{X_{a}}\right)=H(P_{A})+\sum_{a\in\mathcal{A}}P_{A}(a)H(P_{X_{a}}).over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) + italic_C ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_H ( italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) . (164)

Let us prove Proposition 5. We have on one hand:

H(PA)+a𝒜PA(a)C(Ga,PXa)𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎\displaystyle H(P_{A})+\textstyle\sum_{a\in\mathcal{A}}P_{A}(a)C(G_{a},P_{X_{a% }})italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (165)
=H(PA)a𝒜PA(a)H¯(Ga)+a𝒜PA(a)H(PXa)absent𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎¯𝐻subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎𝐻subscript𝑃subscript𝑋𝑎\displaystyle=H(P_{A})-\textstyle\sum_{a\in\mathcal{A}}P_{A}(a)\overline{H}(G_% {a})+\textstyle\sum_{a\in\mathcal{A}}P_{A}(a)H(P_{X_{a}})= italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) - ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_H ( italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (166)
H(PA)H¯(a𝒜PAGa)+a𝒜PA(a)H(PXa)absent𝐻subscript𝑃𝐴¯𝐻subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎𝐻subscript𝑃subscript𝑋𝑎\displaystyle\leq H(P_{A})-\overline{H}\left(\textstyle\bigsqcup^{P_{A}}_{a\in% \mathcal{A}}G_{a}\right)+\textstyle\sum_{a\in\mathcal{A}}P_{A}(a)H(P_{X_{a}})≤ italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) - over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_H ( italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (167)
=C(a𝒜PAGa,a𝒜PA(a)PXa);absent𝐶superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎\displaystyle=C\left(\textstyle\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a},\;\sum% _{a\in\mathcal{A}}P_{A}(a)P_{X_{a}}\right);= italic_C ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ; (168)

where (166) comes from Theorem 9; (167) follows from (15), see [21, Theorem 2]; and (168) follows from Lemma 13. Therefore,

C(a𝒜PAGa,a𝒜PA(a)PXa)=H(PA)+a𝒜PA(a)C(Ga,PXa)𝐶superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎\displaystyle C\left(\textstyle\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a},\;% \textstyle\sum_{a\in\mathcal{A}}P_{A}(a)P_{X_{a}}\right)=H(P_{A})+\textstyle% \sum_{a\in\mathcal{A}}P_{A}(a)C(G_{a},P_{X_{a}})italic_C ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (169)
\displaystyle\Longleftrightarrow\; H¯(a𝒜PAGa)=a𝒜PA(a)H¯(Ga).¯𝐻subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎¯𝐻subscript𝐺𝑎\displaystyle\overline{H}\left(\textstyle\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_% {a}\right)=\textstyle\sum_{a\in\mathcal{A}}P_{A}(a)\overline{H}(G_{a}).over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) . (170)

On the other hand:

a𝒜C(Ga,PXa)subscript𝑎𝒜𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎\displaystyle\textstyle\sum_{a\in\mathcal{A}}C(G_{a},P_{X_{a}})∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) =a𝒜H¯(Ga)+H(PXa)absentsubscript𝑎𝒜¯𝐻subscript𝐺𝑎𝐻subscript𝑃subscript𝑋𝑎\displaystyle=-\textstyle\sum_{a\in\mathcal{A}}\overline{H}(G_{a})+H(P_{X_{a}})= - ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) + italic_H ( italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (171)
H¯(a𝒜Ga)+a𝒜H(PXa)absent¯𝐻subscript𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜𝐻subscript𝑃subscript𝑋𝑎\displaystyle\leq-\overline{H}\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}% \right)+\textstyle\sum_{a\in\mathcal{A}}H(P_{X_{a}})≤ - over¯ start_ARG italic_H end_ARG ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_H ( italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (172)
=H¯(a𝒜Ga)+H(a𝒜PXa)absent¯𝐻subscript𝑎𝒜subscript𝐺𝑎𝐻subscripttensor-product𝑎𝒜subscript𝑃subscript𝑋𝑎\displaystyle=-\overline{H}\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}% \right)+H\left(\textstyle\bigotimes_{a\in\mathcal{A}}P_{X_{a}}\right)= - over¯ start_ARG italic_H end_ARG ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) + italic_H ( ⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (173)
=C(a𝒜Ga,a𝒜PXa);absent𝐶subscript𝑎𝒜subscript𝐺𝑎subscripttensor-product𝑎𝒜subscript𝑃subscript𝑋𝑎\displaystyle=C\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a},\;\textstyle% \bigotimes_{a\in\mathcal{A}}P_{X_{a}}\right);= italic_C ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ; (174)

where (171) comes from Theorem 9; (172) follows from (14), see [21, Theorem 2]; and (174) also follows from Theorem 9. Therefore,

a𝒜C(Ga,PXa)=C(a𝒜Ga,a𝒜PXa)subscript𝑎𝒜𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎𝐶subscript𝑎𝒜subscript𝐺𝑎subscripttensor-product𝑎𝒜subscript𝑃subscript𝑋𝑎\displaystyle\textstyle\sum_{a\in\mathcal{A}}C(G_{a},P_{X_{a}})=C\left(% \textstyle\bigwedge_{a\in\mathcal{A}}G_{a},\;\bigotimes_{a\in\mathcal{A}}P_{X_% {a}}\right)∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = italic_C ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (175)
\displaystyle\Longleftrightarrow\; H¯(a𝒜Ga)=a𝒜H¯(Ga).¯𝐻subscript𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜¯𝐻subscript𝐺𝑎\displaystyle\overline{H}\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}% \right)=\textstyle\sum_{a\in\mathcal{A}}\overline{H}(G_{a}).over¯ start_ARG italic_H end_ARG ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) . (176)

B-A Proof of Lemma 13

The probabilistic graph a𝒜PAGasubscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a}⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT has a𝒜PA(a)PXasubscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎\sum_{a\in\mathcal{A}}P_{A}(a)P_{X_{a}}∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT as underlying distribution. Let A,V𝐴𝑉A,Vitalic_A , italic_V be two random variables such that A𝐴Aitalic_A is drawn with PAsubscript𝑃𝐴P_{A}italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT, and V𝑉Vitalic_V is drawn with PX|A(|a)PXaP_{X|A}(\cdot|a)\doteq P_{X_{a}}italic_P start_POSTSUBSCRIPT italic_X | italic_A end_POSTSUBSCRIPT ( ⋅ | italic_a ) ≐ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT, so that

PX=a𝒜PA(a)PXa.subscript𝑃𝑋subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎\displaystyle P_{X}=\textstyle\sum_{a\in\mathcal{A}}P_{A}(a)P_{X_{a}}.italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (177)

We have

H¯(a𝒜PAGa)+C(a𝒜PAGa,a𝒜PA(a)PXa)¯𝐻subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎𝐶superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎\displaystyle\overline{H}\left(\textstyle\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_% {a}\right)+C\left(\textstyle\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a},% \textstyle\sum_{a\in\mathcal{A}}P_{A}(a)P_{X_{a}}\right)over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) + italic_C ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (178)
=H(X)absent𝐻𝑋\displaystyle=H(X)= italic_H ( italic_X ) (179)
=H(A,X)absent𝐻𝐴𝑋\displaystyle=H(A,X)= italic_H ( italic_A , italic_X ) (180)
=H(A)+H(X|A)absent𝐻𝐴𝐻conditional𝑋𝐴\displaystyle=H(A)+H(X|A)= italic_H ( italic_A ) + italic_H ( italic_X | italic_A ) (181)
=H(PA)+a𝒜PA(a)H(PXa);absent𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎𝐻subscript𝑃subscript𝑋𝑎\displaystyle=H(P_{A})+\textstyle\sum_{a\in\mathcal{A}}P_{A}(a)H(P_{X_{a}});= italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_H ( italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ; (182)

where (179) comes from Theorem 9 and (177); and (180) comes from the fact that A𝐴Aitalic_A can be written as a function of V𝑉Vitalic_V: by definition, the vertex set of a𝒜PAGasubscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a}⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT writes 𝒳=a𝒜𝒳a𝒳subscriptsquare-union𝑎𝒜subscript𝒳𝑎\mathcal{X}=\bigsqcup_{a\in\mathcal{A}}\mathcal{X}_{a}caligraphic_X = ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT and suppPXa𝒳asuppsubscript𝑃subscript𝑋𝑎subscript𝒳𝑎\operatorname*{supp}P_{X_{a}}\subseteq\mathcal{X}_{a}roman_supp italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊆ caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT, therefore A𝐴Aitalic_A is the unique index such that V𝒳A𝑉subscript𝒳𝐴V\in\mathcal{X}_{A}italic_V ∈ caligraphic_X start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT.

Appendix C Proof of Theorem 10

()(\leq)( ≤ ) By definition of C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and C(G,PX)𝐶𝐺subscript𝑃𝑋C(G,P_{X})italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) we have

supPXΔ(𝒳)C(G,PX)subscriptsupremumsubscript𝑃𝑋Δ𝒳𝐶𝐺subscript𝑃𝑋\displaystyle\sup_{P_{X}\in\Delta(\mathcal{X})}C(G,P_{X})roman_sup start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_X ) end_POSTSUBSCRIPT italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) =supPXΔ(𝒳)limε0lim supn1nlogα(Gn[𝒯εn(PX)])absentsubscriptsupremumsubscript𝑃𝑋Δ𝒳subscript𝜀0subscriptlimit-supremum𝑛1𝑛𝛼superscript𝐺𝑛delimited-[]subscriptsuperscript𝒯𝑛𝜀subscript𝑃𝑋\displaystyle=\sup_{P_{X}\in\Delta(\mathcal{X})}\lim_{\varepsilon\rightarrow 0% }\limsup_{n\rightarrow\infty}\frac{1}{n}\log\alpha\big{(}G^{\wedge n}[\mathcal% {T}^{n}_{\varepsilon}(P_{X})]\big{)}= roman_sup start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_X ) end_POSTSUBSCRIPT roman_lim start_POSTSUBSCRIPT italic_ε → 0 end_POSTSUBSCRIPT lim sup start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG roman_log italic_α ( italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT [ caligraphic_T start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) ] ) (183)
supPXΔ(𝒳)limε0lim supn1nlogα(Gn)absentsubscriptsupremumsubscript𝑃𝑋Δ𝒳subscript𝜀0subscriptlimit-supremum𝑛1𝑛𝛼superscript𝐺𝑛\displaystyle\leq\sup_{P_{X}\in\Delta(\mathcal{X})}\lim_{\varepsilon% \rightarrow 0}\limsup_{n\rightarrow\infty}\frac{1}{n}\log\alpha\big{(}G^{% \wedge n}\big{)}≤ roman_sup start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_X ) end_POSTSUBSCRIPT roman_lim start_POSTSUBSCRIPT italic_ε → 0 end_POSTSUBSCRIPT lim sup start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG roman_log italic_α ( italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT ) (184)
=C0(G).absentsubscript𝐶0𝐺\displaystyle=C_{0}(G).= italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) . (185)

()(\geq)( ≥ ) Let (𝒞n)nsubscriptsubscript𝒞𝑛𝑛superscript(\mathcal{C}_{n})_{n\in\mathbb{N}^{\star}}( caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT be a sequence such that for all n𝑛superscriptn\in\mathbb{N}^{\star}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT, 𝒞nsubscript𝒞𝑛\mathcal{C}_{n}caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is an independent set in Gnsuperscript𝐺𝑛G^{\wedge n}italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT, and

limn1nlog|𝒞n|=C0(G);subscript𝑛1𝑛subscript𝒞𝑛subscript𝐶0𝐺\displaystyle\lim_{n\rightarrow\infty}\frac{1}{n}\log|\mathcal{C}_{n}|=C_{0}(G);roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG roman_log | caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | = italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) ; (186)

the existence of the sequence (𝒞n)nsubscriptsubscript𝒞𝑛𝑛superscript(\mathcal{C}_{n})_{n\in\mathbb{N}^{\star}}( caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT follows from the definition of C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT.

Let (τn)nsubscriptsubscript𝜏𝑛𝑛superscript(\tau_{n})_{n\in\mathbb{N}^{\star}}( italic_τ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT be the sequence defined by: for all n𝑛superscriptn\in\mathbb{N}^{\star}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT,

τn1|𝒞n|xn𝒞nTxn.approaches-limitsubscript𝜏𝑛1subscript𝒞𝑛subscriptsuperscript𝑥𝑛subscript𝒞𝑛subscript𝑇superscript𝑥𝑛\displaystyle\tau_{n}\doteq\frac{1}{|\mathcal{C}_{n}|}\sum_{x^{n}\in\mathcal{C% }_{n}}T_{x^{n}}.italic_τ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≐ divide start_ARG 1 end_ARG start_ARG | caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | end_ARG ∑ start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT . (187)

The terms of the sequence (τn)nsubscriptsubscript𝜏𝑛𝑛superscript(\tau_{n})_{n\in\mathbb{N}^{\star}}( italic_τ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT are in Δ(𝒳)Δ𝒳\Delta(\mathcal{X})roman_Δ ( caligraphic_X ), which is a compact set. Therefore, by Bolzano-Weierstrass theorem, (τn)nsubscriptsubscript𝜏𝑛𝑛superscript(\tau_{n})_{n\in\mathbb{N}^{\star}}( italic_τ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT has a convergent subsequence (τϕ(n))nsubscriptsubscript𝜏italic-ϕ𝑛𝑛superscript(\tau_{\phi(n)})_{n\in\mathbb{N}^{\star}}( italic_τ start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT, where ϕ::italic-ϕsuperscriptsuperscript\phi:\mathbb{N}^{\star}\rightarrow\mathbb{N}^{\star}italic_ϕ : blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT → blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT is strictly increasing. We denote by (𝒞ϕ(n))nsubscriptsubscript𝒞italic-ϕ𝑛𝑛superscript(\mathcal{C}_{\phi(n)})_{n\in\mathbb{N}^{\star}}( caligraphic_C start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT the corresponding subsequence of independent sets, and

PXlimnτϕ(n)Δ(𝒳).approaches-limitsubscriptsuperscript𝑃𝑋subscript𝑛subscript𝜏italic-ϕ𝑛Δ𝒳\displaystyle P^{\star}_{X}\doteq\lim_{n\rightarrow\infty}\tau_{\phi(n)}\in% \Delta(\mathcal{X}).italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ≐ roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_X ) . (188)

By construction, we also have

limnlog|𝒞ϕ(n)|ϕ(n)=C0(G).subscript𝑛subscript𝒞italic-ϕ𝑛italic-ϕ𝑛subscript𝐶0𝐺\displaystyle\lim_{n\rightarrow\infty}\frac{\log|\mathcal{C}_{\phi(n)}|}{\phi(% n)}=C_{0}(G).roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG roman_log | caligraphic_C start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT | end_ARG start_ARG italic_ϕ ( italic_n ) end_ARG = italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) . (189)

Let us build an adequate sequence of codebooks with type converging uniformly to PXsubscriptsuperscript𝑃𝑋P^{\star}_{X}italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT, and with asymptotic rate C0(G)subscript𝐶0𝐺C_{0}(G)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ). For all n𝑛superscriptn\in\mathbb{N}^{\star}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT, let

𝒞nϕ(n)(𝒞ϕ(n))n𝒯εnnϕ(n)(PX),approaches-limitsubscriptsuperscript𝒞𝑛italic-ϕ𝑛superscriptsubscript𝒞italic-ϕ𝑛𝑛superscriptsubscript𝒯subscript𝜀𝑛𝑛italic-ϕ𝑛subscriptsuperscript𝑃𝑋\displaystyle\mathcal{C}^{\star}_{n\phi(n)}\doteq(\mathcal{C}_{\phi(n)})^{n}% \cap\mathcal{T}_{\varepsilon_{n}}^{n\phi(n)}(P^{\star}_{X}),caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUBSCRIPT ≐ ( caligraphic_C start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∩ caligraphic_T start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) , (190)

where εnPXτϕ(n)+1n4approaches-limitsubscript𝜀𝑛subscriptnormsubscriptsuperscript𝑃𝑋subscript𝜏italic-ϕ𝑛14𝑛\varepsilon_{n}\doteq\|P^{\star}_{X}-\tau_{\phi(n)}\|_{\infty}+\frac{1}{\sqrt[% 4]{n}}italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≐ ∥ italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT - italic_τ start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT + divide start_ARG 1 end_ARG start_ARG nth-root start_ARG 4 end_ARG start_ARG italic_n end_ARG end_ARG.

It can be easily observed that εnn0subscript𝜀𝑛𝑛0\varepsilon_{n}\underset{n\rightarrow\infty}{\rightarrow}0italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG 0 and 𝒞nϕ(n)𝒯εnnϕ(n)(PX)subscriptsuperscript𝒞𝑛italic-ϕ𝑛superscriptsubscript𝒯subscript𝜀𝑛𝑛italic-ϕ𝑛subscriptsuperscript𝑃𝑋\mathcal{C}^{\star}_{n\phi(n)}\subseteq\mathcal{T}_{\varepsilon_{n}}^{n\phi(n)% }(P^{\star}_{X})caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUBSCRIPT ⊆ caligraphic_T start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ): by construction we have

maxxnϕ(n)𝒞nϕ(n)Txnϕ(n)PXn0.subscriptsuperscript𝑥𝑛italic-ϕ𝑛subscriptsuperscript𝒞𝑛italic-ϕ𝑛subscriptnormsubscript𝑇superscript𝑥𝑛italic-ϕ𝑛subscriptsuperscript𝑃𝑋𝑛0\displaystyle\max_{x^{n\phi(n)}\in\mathcal{C}^{\star}_{n\phi(n)}}\|T_{x^{n\phi% (n)}}-P^{\star}_{X}\|_{\infty}\underset{n\rightarrow\infty}{\rightarrow}0.roman_max start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG 0 . (191)

Furthermore, for all n𝑛superscriptn\in\mathbb{N}^{\star}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT, 𝒞nϕ(n)subscriptsuperscript𝒞𝑛italic-ϕ𝑛\mathcal{C}^{\star}_{n\phi(n)}caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUBSCRIPT is independent in Gnϕ(n)superscript𝐺𝑛italic-ϕ𝑛G^{\wedge n\phi(n)}italic_G start_POSTSUPERSCRIPT ∧ italic_n italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT, as it is contained in the independent set (𝒞ϕ(n))nsuperscriptsubscript𝒞italic-ϕ𝑛𝑛(\mathcal{C}_{\phi(n)})^{n}( caligraphic_C start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT.

Now let us prove that log|𝒞nϕ(n)|nϕ(n)C0(G)subscriptsuperscript𝒞𝑛italic-ϕ𝑛𝑛italic-ϕ𝑛subscript𝐶0𝐺\frac{\log|\mathcal{C}^{\star}_{n\phi(n)}|}{n\phi(n)}\rightarrow C_{0}(G)divide start_ARG roman_log | caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUBSCRIPT | end_ARG start_ARG italic_n italic_ϕ ( italic_n ) end_ARG → italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) when n𝑛n\rightarrow\inftyitalic_n → ∞. Let us draw a codeword

Cnϕ(n)=(C1ϕ(n),,Cnϕ(n))superscript𝐶𝑛italic-ϕ𝑛subscriptsuperscript𝐶italic-ϕ𝑛1subscriptsuperscript𝐶italic-ϕ𝑛𝑛\displaystyle C^{n\phi(n)}=(C^{\phi(n)}_{1},...,C^{\phi(n)}_{n})italic_C start_POSTSUPERSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT = ( italic_C start_POSTSUPERSCRIPT italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_C start_POSTSUPERSCRIPT italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) (192)

uniformly from (𝒞ϕn)nsuperscriptsubscript𝒞subscriptitalic-ϕ𝑛𝑛(\mathcal{C}_{\phi_{n}})^{n}( caligraphic_C start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, and show that it is in 𝒯εnnϕ(n)(PX)superscriptsubscript𝒯subscript𝜀𝑛𝑛italic-ϕ𝑛subscriptsuperscript𝑃𝑋\mathcal{T}_{\varepsilon_{n}}^{n\phi(n)}(P^{\star}_{X})caligraphic_T start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) with high probability. On one hand, for all tn𝑡𝑛t\leq nitalic_t ≤ italic_n, the average type of Ctϕ(n)subscriptsuperscript𝐶italic-ϕ𝑛𝑡C^{\phi(n)}_{t}italic_C start_POSTSUPERSCRIPT italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT writes

𝔼[TCtϕ(n)]=1|𝒞ϕ(n)|cϕ(n)𝒞ϕ(n)Tcϕ(n)=τϕ(n).𝔼delimited-[]subscript𝑇subscriptsuperscript𝐶italic-ϕ𝑛𝑡1subscript𝒞italic-ϕ𝑛subscriptsuperscript𝑐italic-ϕ𝑛subscript𝒞italic-ϕ𝑛subscript𝑇superscript𝑐italic-ϕ𝑛subscript𝜏italic-ϕ𝑛\displaystyle\mathbb{E}\left[T_{C^{\phi(n)}_{t}}\right]=\frac{1}{|\mathcal{C}_% {\phi(n)}|}\sum_{c^{\phi(n)}\in\mathcal{C}_{\phi(n)}}T_{c^{\phi(n)}}=\tau_{% \phi(n)}.blackboard_E [ italic_T start_POSTSUBSCRIPT italic_C start_POSTSUPERSCRIPT italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] = divide start_ARG 1 end_ARG start_ARG | caligraphic_C start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT | end_ARG ∑ start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = italic_τ start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT . (193)

On the other hand,

|𝒞nϕ(n)||(𝒞ϕ(n))n|superscriptsubscript𝒞𝑛italic-ϕ𝑛superscriptsubscript𝒞italic-ϕ𝑛𝑛\displaystyle\frac{|\mathcal{C}_{n\phi(n)}^{\star}|}{|(\mathcal{C}_{\phi(n)})^% {n}|}divide start_ARG | caligraphic_C start_POSTSUBSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT | end_ARG start_ARG | ( caligraphic_C start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT | end_ARG =|(𝒞ϕ(n))n𝒯εnnϕ(n)(PX)||(𝒞ϕ(n))n|absentsuperscriptsubscript𝒞italic-ϕ𝑛𝑛superscriptsubscript𝒯subscript𝜀𝑛𝑛italic-ϕ𝑛subscriptsuperscript𝑃𝑋superscriptsubscript𝒞italic-ϕ𝑛𝑛\displaystyle=\frac{|(\mathcal{C}_{\phi(n)})^{n}\cap\mathcal{T}_{\varepsilon_{% n}}^{n\phi(n)}(P^{\star}_{X})|}{|(\mathcal{C}_{\phi(n)})^{n}|}= divide start_ARG | ( caligraphic_C start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∩ caligraphic_T start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) | end_ARG start_ARG | ( caligraphic_C start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT | end_ARG (194)
=(Cnϕ(n)𝒯εnnϕ(n)(PX))absentsuperscript𝐶𝑛italic-ϕ𝑛superscriptsubscript𝒯subscript𝜀𝑛𝑛italic-ϕ𝑛subscriptsuperscript𝑃𝑋\displaystyle=\mathbb{P}\big{(}C^{n\phi(n)}\in\mathcal{T}_{\varepsilon_{n}}^{n% \phi(n)}(P^{\star}_{X})\big{)}= blackboard_P ( italic_C start_POSTSUPERSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT ∈ caligraphic_T start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) ) (195)
=(TCnϕ(n)PXεn)absentsubscriptnormsubscript𝑇superscript𝐶𝑛italic-ϕ𝑛subscriptsuperscript𝑃𝑋subscript𝜀𝑛\displaystyle=\mathbb{P}\left(\|T_{C^{n\phi(n)}}-P^{\star}_{X}\|_{\infty}\leq% \varepsilon_{n}\right)= blackboard_P ( ∥ italic_T start_POSTSUBSCRIPT italic_C start_POSTSUPERSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) (196)
(TCnϕ(n)τϕ(n)+τϕ(n)PXεn)absentsubscriptnormsubscript𝑇superscript𝐶𝑛italic-ϕ𝑛subscript𝜏italic-ϕ𝑛subscriptnormsubscript𝜏italic-ϕ𝑛subscriptsuperscript𝑃𝑋subscript𝜀𝑛\displaystyle\geq\mathbb{P}\left(\|T_{C^{n\phi(n)}}-\tau_{\phi(n)}\|_{\infty}+% \|\tau_{\phi(n)}-P^{\star}_{X}\|_{\infty}\leq\varepsilon_{n}\right)≥ blackboard_P ( ∥ italic_T start_POSTSUBSCRIPT italic_C start_POSTSUPERSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_τ start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT + ∥ italic_τ start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT - italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) (197)
=(TCnϕ(n)τϕ(n)n1/4)absentsubscriptnormsubscript𝑇superscript𝐶𝑛italic-ϕ𝑛subscript𝜏italic-ϕ𝑛superscript𝑛14\displaystyle=\mathbb{P}\left(\|T_{C^{n\phi(n)}}-\tau_{\phi(n)}\|_{\infty}\leq n% ^{-1/4}\right)= blackboard_P ( ∥ italic_T start_POSTSUBSCRIPT italic_C start_POSTSUPERSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_τ start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_n start_POSTSUPERSCRIPT - 1 / 4 end_POSTSUPERSCRIPT ) (198)
=(1ntnTCtϕ(n)τϕ(n)n1/4)absentsubscriptnorm1𝑛subscript𝑡𝑛subscript𝑇superscriptsubscript𝐶𝑡italic-ϕ𝑛subscript𝜏italic-ϕ𝑛superscript𝑛14\displaystyle=\mathbb{P}\left(\left\|\textstyle\frac{1}{n}\sum_{t\leq n}T_{C_{% t}^{\phi(n)}}-\tau_{\phi(n)}\right\|_{\infty}\leq n^{-1/4}\right)= blackboard_P ( ∥ divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_τ start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_n start_POSTSUPERSCRIPT - 1 / 4 end_POSTSUPERSCRIPT ) (199)
=1((tnTCtϕ(n))nτϕ(n)>n3/4)absent1subscriptnormsubscript𝑡𝑛subscript𝑇superscriptsubscript𝐶𝑡italic-ϕ𝑛𝑛subscript𝜏italic-ϕ𝑛superscript𝑛34\displaystyle=1-\mathbb{P}\left(\left\|\left(\textstyle\sum_{t\leq n}T_{C_{t}^% {\phi(n)}}\right)-n\tau_{\phi(n)}\right\|_{\infty}>n^{3/4}\right)= 1 - blackboard_P ( ∥ ( ∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) - italic_n italic_τ start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT > italic_n start_POSTSUPERSCRIPT 3 / 4 end_POSTSUPERSCRIPT ) (200)
1x𝒳(|tnTCtϕ(n)(x)nτϕ(n)(x)|>n3/4)absent1subscript𝑥𝒳subscript𝑡𝑛subscript𝑇superscriptsubscript𝐶𝑡italic-ϕ𝑛𝑥𝑛subscript𝜏italic-ϕ𝑛𝑥superscript𝑛34\displaystyle\geq 1-\textstyle\sum_{x\in\mathcal{X}}\mathbb{P}\left(\left|\sum% _{t\leq n}T_{C_{t}^{\phi(n)}}(x)-n\tau_{\phi(n)}(x)\right|>n^{3/4}\right)≥ 1 - ∑ start_POSTSUBSCRIPT italic_x ∈ caligraphic_X end_POSTSUBSCRIPT blackboard_P ( | ∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_x ) - italic_n italic_τ start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT ( italic_x ) | > italic_n start_POSTSUPERSCRIPT 3 / 4 end_POSTSUPERSCRIPT ) (201)
1x𝒳1n3/2𝕍[tnTCtϕ(n)(x)]absent1subscript𝑥𝒳1superscript𝑛32𝕍delimited-[]subscript𝑡𝑛subscript𝑇superscriptsubscript𝐶𝑡italic-ϕ𝑛𝑥\displaystyle\geq\textstyle 1-\sum_{x\in\mathcal{X}}\frac{1}{n^{3/2}}\mathbb{V% }\left[\textstyle\sum_{t\leq n}T_{C_{t}^{\phi(n)}}(x)\right]≥ 1 - ∑ start_POSTSUBSCRIPT italic_x ∈ caligraphic_X end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n start_POSTSUPERSCRIPT 3 / 2 end_POSTSUPERSCRIPT end_ARG blackboard_V [ ∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_x ) ] (202)
1|𝒳|n1/2n1;absent1𝒳superscript𝑛12𝑛1\displaystyle\geq\textstyle 1-\frac{|\mathcal{X}|}{n^{1/2}}\underset{n% \rightarrow\infty}{\rightarrow}1;≥ 1 - divide start_ARG | caligraphic_X | end_ARG start_ARG italic_n start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT end_ARG start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG 1 ; (203)

where (195) and (199) come from the construction of Cnϕ(n)superscript𝐶𝑛italic-ϕ𝑛C^{n\phi(n)}italic_C start_POSTSUPERSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT; (198) comes from the construction of εnsubscript𝜀𝑛\varepsilon_{n}italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT; (201) follows from the union bound; (202) comes from Chebyshev’s inequality and (193); (203) follows from 𝕍[tnTCtϕ(n)(x)]𝕍delimited-[]subscript𝑡𝑛subscript𝑇superscriptsubscript𝐶𝑡italic-ϕ𝑛𝑥\mathbb{V}\left[\textstyle\sum_{t\leq n}T_{C_{t}^{\phi(n)}}(x)\right]blackboard_V [ ∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_x ) ] =tn𝕍[TCtϕ(n)(x)]nabsentsubscript𝑡𝑛𝕍delimited-[]subscript𝑇superscriptsubscript𝐶𝑡italic-ϕ𝑛𝑥𝑛=\sum_{t\leq n}\mathbb{V}\left[T_{C_{t}^{\phi(n)}}(x)\right]\leq n= ∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT blackboard_V [ italic_T start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_x ) ] ≤ italic_n, as the random variables TCtϕ(n)(x)subscript𝑇superscriptsubscript𝐶𝑡italic-ϕ𝑛𝑥T_{C_{t}^{\phi(n)}}(x)italic_T start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_ϕ ( italic_n ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_x ) are iid and takes values in [0,1]01[0,1][ 0 , 1 ]. Hence

limnlog|𝒞nϕ(n)|nϕ(n)=limnlog|(𝒞ϕ(n))n|nϕ(n)=C0(G).subscript𝑛subscriptsuperscript𝒞𝑛italic-ϕ𝑛𝑛italic-ϕ𝑛subscript𝑛superscriptsubscript𝒞italic-ϕ𝑛𝑛𝑛italic-ϕ𝑛subscript𝐶0𝐺\displaystyle\lim_{n\rightarrow\infty}\frac{\log|\mathcal{C}^{\star}_{n\phi(n)% }|}{n\phi(n)}=\lim_{n\rightarrow\infty}\frac{\log|(\mathcal{C}_{\phi(n)})^{n}|% }{n\phi(n)}=C_{0}(G).roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG roman_log | caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUBSCRIPT | end_ARG start_ARG italic_n italic_ϕ ( italic_n ) end_ARG = roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG roman_log | ( caligraphic_C start_POSTSUBSCRIPT italic_ϕ ( italic_n ) end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT | end_ARG start_ARG italic_n italic_ϕ ( italic_n ) end_ARG = italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) . (204)

By combining (191), (204) and Lemma 4, it follows that

C0(G)=limnlog|𝒞nϕ(n)|nϕ(n)C(G,PX).subscript𝐶0𝐺subscript𝑛subscriptsuperscript𝒞𝑛italic-ϕ𝑛𝑛italic-ϕ𝑛𝐶𝐺subscriptsuperscript𝑃𝑋\displaystyle C_{0}(G)=\lim_{n\rightarrow\infty}\frac{\log|\mathcal{C}^{\star}% _{n\phi(n)}|}{n\phi(n)}\leq C(G,P^{\star}_{X}).italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) = roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG roman_log | caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n italic_ϕ ( italic_n ) end_POSTSUBSCRIPT | end_ARG start_ARG italic_n italic_ϕ ( italic_n ) end_ARG ≤ italic_C ( italic_G , italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) . (205)

In conclusion, we have constructed PXΔ(𝒱)subscriptsuperscript𝑃𝑋Δ𝒱P^{\star}_{X}\in\Delta(\mathcal{V})italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_V ) such that

C0(G)=C(G,PX)=maxPXC(G,PX).subscript𝐶0𝐺𝐶𝐺subscriptsuperscript𝑃𝑋subscriptsubscript𝑃𝑋𝐶𝐺subscript𝑃𝑋\displaystyle C_{0}(G)=C(G,P^{\star}_{X})=\max_{P_{X}}C(G,P_{X}).italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) = italic_C ( italic_G , italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) = roman_max start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) . (206)

Appendix D Proof of Proposition 6

Let us show that for all graph G=(𝒳,)𝐺𝒳G=(\mathcal{X},\mathcal{E})italic_G = ( caligraphic_X , caligraphic_E ), the function PXC(G,PX)maps-tosubscript𝑃𝑋𝐶𝐺subscript𝑃𝑋P_{X}\mapsto C(G,P_{X})italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ↦ italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) is concave. Let PX,PXΔ(𝒳)subscript𝑃𝑋subscriptsuperscript𝑃𝑋Δ𝒳P_{X},P^{\prime}_{X}\in\Delta(\mathcal{X})italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT , italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_X ) and β[0,1]𝛽01\beta\in[0,1]italic_β ∈ [ 0 , 1 ]. Let (bn)nsubscriptsubscript𝑏𝑛𝑛(b_{n})_{n\in\mathbb{N}}( italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N end_POSTSUBSCRIPT be a sequence of integers such that bnnnβsubscript𝑏𝑛𝑛𝑛𝛽\frac{b_{n}}{n}\underset{n\rightarrow\infty}{\rightarrow}\betadivide start_ARG italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG italic_β.

By Lemma 4, there exists two sequences (𝒞n)nsubscriptsubscript𝒞𝑛𝑛(\mathcal{C}_{n})_{n\in\mathbb{N}}( caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N end_POSTSUBSCRIPT and (𝒞n)nsubscriptsubscriptsuperscript𝒞𝑛𝑛(\mathcal{C}^{\prime}_{n})_{n\in\mathbb{N}}( caligraphic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N end_POSTSUBSCRIPT that satisfy the following:

n,𝒞n𝒳n and 𝒞n𝒳n are independent in Gn;formulae-sequencefor-all𝑛superscriptsubscript𝒞𝑛superscript𝒳𝑛 and subscriptsuperscript𝒞𝑛superscript𝒳𝑛 are independent in superscript𝐺𝑛\displaystyle\forall n\in\mathbb{N}^{\star},\;\mathcal{C}_{n}\subseteq\mathcal% {X}^{n}\text{ and }\mathcal{C}^{\prime}_{n}\subseteq\mathcal{X}^{n}\text{ are % independent in }G^{\wedge n};∀ italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⊆ caligraphic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and caligraphic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⊆ caligraphic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT are independent in italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT ; (207)

and

log|𝒞n|nnC(G,PX),subscript𝒞𝑛𝑛𝑛𝐶𝐺subscript𝑃𝑋\displaystyle\frac{\log|\mathcal{C}_{n}|}{n}\underset{n\rightarrow\infty}{% \rightarrow}C(G,P_{X}),divide start_ARG roman_log | caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | end_ARG start_ARG italic_n end_ARG start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) , log|𝒞n|nnC(G,PX),subscriptsuperscript𝒞𝑛𝑛𝑛𝐶𝐺subscriptsuperscript𝑃𝑋\displaystyle\frac{\log|\mathcal{C}^{\prime}_{n}|}{n}\underset{n\rightarrow% \infty}{\rightarrow}C(G,P^{\prime}_{X}),divide start_ARG roman_log | caligraphic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | end_ARG start_ARG italic_n end_ARG start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG italic_C ( italic_G , italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) , (208)
maxxn𝒞nTxnPXn0,subscriptsuperscript𝑥𝑛subscript𝒞𝑛subscriptnormsubscript𝑇superscript𝑥𝑛subscript𝑃𝑋𝑛0\displaystyle\max_{x^{n}\in\mathcal{C}_{n}}\|T_{x^{n}}-P_{X}\|_{\infty}% \underset{n\rightarrow\infty}{\rightarrow}0,roman_max start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG 0 , maxxn𝒞nTxnPXn0.subscriptsuperscript𝑥𝑛subscriptsuperscript𝒞𝑛subscriptnormsubscript𝑇superscript𝑥𝑛subscript𝑃𝑋𝑛0\displaystyle\max_{x^{n}\in\mathcal{C}^{\prime}_{n}}\|T_{x^{n}}-P_{X}\|_{% \infty}\underset{n\rightarrow\infty}{\rightarrow}0.roman_max start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG 0 . (209)

Let us build a sequence of codebooks (𝒞n′′)nsubscriptsubscriptsuperscript𝒞′′𝑛𝑛superscript(\mathcal{C}^{\prime\prime}_{n})_{n\in\mathbb{N}^{\star}}( caligraphic_C start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT adapted to the distribution βPX+(1β)PX𝛽subscript𝑃𝑋1𝛽subscriptsuperscript𝑃𝑋\beta P_{X}+(1-\beta)P^{\prime}_{X}italic_β italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT + ( 1 - italic_β ) italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT by using a time-sharing between (𝒞n)nsubscriptsubscript𝒞𝑛𝑛superscript(\mathcal{C}_{n})_{n\in\mathbb{N}^{\star}}( caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT and (𝒞n)nsubscriptsubscriptsuperscript𝒞𝑛𝑛superscript(\mathcal{C}^{\prime}_{n})_{n\in\mathbb{N}^{\star}}( caligraphic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT. For all n𝑛superscriptn\in\mathbb{N}^{\star}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT, let

𝒞n′′𝒞nbn×𝒞nnbn.approaches-limitsubscriptsuperscript𝒞′′𝑛superscriptsubscript𝒞𝑛subscript𝑏𝑛subscriptsuperscript𝒞𝑛subscript𝑏𝑛𝑛\displaystyle\mathcal{C}^{\prime\prime}_{n}\doteq\mathcal{C}_{n}^{b_{n}}\times% \mathcal{C}^{\prime n-b_{n}}_{n}.caligraphic_C start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≐ caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT × caligraphic_C start_POSTSUPERSCRIPT ′ italic_n - italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT . (210)

For all n𝑛superscriptn\in\mathbb{N}^{\star}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT, 𝒞n′′𝒳n2subscriptsuperscript𝒞′′𝑛superscript𝒳superscript𝑛2\mathcal{C}^{\prime\prime}_{n}\subseteq\mathcal{X}^{n^{2}}caligraphic_C start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⊆ caligraphic_X start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT is independent in Gn2superscript𝐺superscript𝑛2G^{\wedge n^{2}}italic_G start_POSTSUPERSCRIPT ∧ italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT as a product of independent sets.

The rate associated to 𝒞n′′subscriptsuperscript𝒞′′𝑛\mathcal{C}^{\prime\prime}_{n}caligraphic_C start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT writes

log|𝒞n′′|n2subscriptsuperscript𝒞′′𝑛superscript𝑛2\displaystyle\frac{\log|\mathcal{C}^{\prime\prime}_{n}|}{n^{2}}divide start_ARG roman_log | caligraphic_C start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | end_ARG start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG =bnlog|𝒞n|+(nbn)log|𝒞n|n2absentsubscript𝑏𝑛subscript𝒞𝑛𝑛subscript𝑏𝑛subscriptsuperscript𝒞𝑛superscript𝑛2\displaystyle=\frac{b_{n}\log|\mathcal{C}_{n}|+(n-b_{n})\log|\mathcal{C}^{% \prime}_{n}|}{n^{2}}= divide start_ARG italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT roman_log | caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | + ( italic_n - italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) roman_log | caligraphic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | end_ARG start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG (211)
=bnnlog|𝒞n|n+nbnnlog|𝒞n|nabsentsubscript𝑏𝑛𝑛subscript𝒞𝑛𝑛𝑛subscript𝑏𝑛𝑛subscriptsuperscript𝒞𝑛𝑛\displaystyle=\frac{b_{n}}{n}\frac{\log|\mathcal{C}_{n}|}{n}+\frac{n-b_{n}}{n}% \frac{\log|\mathcal{C}^{\prime}_{n}|}{n}= divide start_ARG italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG divide start_ARG roman_log | caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | end_ARG start_ARG italic_n end_ARG + divide start_ARG italic_n - italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG divide start_ARG roman_log | caligraphic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | end_ARG start_ARG italic_n end_ARG (212)
nβC(G,PX)+(1β)C(G,PX);𝑛𝛽𝐶𝐺subscript𝑃𝑋1𝛽𝐶𝐺subscriptsuperscript𝑃𝑋\displaystyle\underset{n\rightarrow\infty}{\rightarrow}\beta C(G,P_{X})+(1-% \beta)C(G,P^{\prime}_{X});start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG italic_β italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) + ( 1 - italic_β ) italic_C ( italic_G , italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) ; (213)

and the types of the codewords in 𝒞n′′subscriptsuperscript𝒞′′𝑛\mathcal{C}^{\prime\prime}_{n}caligraphic_C start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT satisfy

maxxn2𝒞n′′Txn2βPX(1β)PXsubscriptsuperscript𝑥superscript𝑛2subscriptsuperscript𝒞′′𝑛subscriptnormsubscript𝑇superscript𝑥superscript𝑛2𝛽subscript𝑃𝑋1𝛽subscriptsuperscript𝑃𝑋\displaystyle\max_{x^{n^{2}}\in\mathcal{C}^{\prime\prime}_{n}}\left\|T_{x^{n^{% 2}}}-\beta P_{X}-(1-\beta)P^{\prime}_{X}\right\|_{\infty}roman_max start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_β italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT - ( 1 - italic_β ) italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT (214)
=\displaystyle=\>= maxxnbn𝒞nmaxxn(nbn)𝒞nnbnn2Txnbn+n(nbn)n2Txn(nbn)βPX(1β)PXsubscriptsuperscript𝑥𝑛subscript𝑏𝑛subscript𝒞𝑛subscriptsuperscript𝑥𝑛𝑛subscript𝑏𝑛subscriptsuperscript𝒞𝑛subscriptnorm𝑛subscript𝑏𝑛superscript𝑛2subscript𝑇superscript𝑥𝑛subscript𝑏𝑛𝑛𝑛subscript𝑏𝑛superscript𝑛2subscript𝑇superscript𝑥𝑛𝑛subscript𝑏𝑛𝛽subscript𝑃𝑋1𝛽subscriptsuperscript𝑃𝑋\displaystyle\max_{x^{nb_{n}}\in\mathcal{C}_{n}}\max_{x^{\prime n(n-b_{n})}\in% \mathcal{C}^{\prime}_{n}}\left\|\frac{nb_{n}}{n^{2}}T_{x^{nb_{n}}}+\frac{n(n-b% _{n})}{n^{2}}T_{x^{\prime n(n-b_{n})}}-\beta P_{X}-(1-\beta)P^{\prime}_{X}% \right\|_{\infty}roman_max start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_max start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ italic_n ( italic_n - italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ divide start_ARG italic_n italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT + divide start_ARG italic_n ( italic_n - italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_ARG start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ italic_n ( italic_n - italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_β italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT - ( 1 - italic_β ) italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT (215)
\displaystyle\leq\> maxxnbn𝒞nbnnTxnbnβPX+maxxn(nbn)𝒞nnbnnTxn(nbn)(1β)PXsubscriptsuperscript𝑥𝑛subscript𝑏𝑛subscript𝒞𝑛subscriptnormsubscript𝑏𝑛𝑛subscript𝑇superscript𝑥𝑛subscript𝑏𝑛𝛽subscript𝑃𝑋subscriptsuperscript𝑥𝑛𝑛subscript𝑏𝑛subscriptsuperscript𝒞𝑛subscriptnorm𝑛subscript𝑏𝑛𝑛subscript𝑇superscript𝑥𝑛𝑛subscript𝑏𝑛1𝛽subscriptsuperscript𝑃𝑋\displaystyle\max_{x^{nb_{n}}\in\mathcal{C}_{n}}\left\|\frac{b_{n}}{n}T_{x^{nb% _{n}}}-\beta P_{X}\right\|_{\infty}+\max_{x^{\prime n(n-b_{n})}\in\mathcal{C}^% {\prime}_{n}}\left\|\frac{n-b_{n}}{n}T_{x^{\prime n(n-b_{n})}}-(1-\beta)P^{% \prime}_{X}\right\|_{\infty}roman_max start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ divide start_ARG italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_β italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT + roman_max start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ italic_n ( italic_n - italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ divide start_ARG italic_n - italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ italic_n ( italic_n - italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - ( 1 - italic_β ) italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT (216)
=\displaystyle=\>= βmaxxnbn𝒞nTxnbnPX+o(1)Txnbn𝛽subscriptsuperscript𝑥𝑛subscript𝑏𝑛subscript𝒞𝑛subscriptnormsubscript𝑇superscript𝑥𝑛subscript𝑏𝑛subscript𝑃𝑋𝑜1subscript𝑇superscript𝑥𝑛subscript𝑏𝑛\displaystyle\beta\max_{x^{nb_{n}}\in\mathcal{C}_{n}}\left\|T_{x^{nb_{n}}}-P_{% X}+o(1)T_{x^{nb_{n}}}\right\|_{\infty}italic_β roman_max start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT + italic_o ( 1 ) italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT
+(1β)maxxn(nbn)𝒞nTxn(nbn)PX+o(1)Txn(nbn)1𝛽subscriptsuperscript𝑥𝑛𝑛subscript𝑏𝑛subscriptsuperscript𝒞𝑛subscriptnormsubscript𝑇superscript𝑥𝑛𝑛subscript𝑏𝑛subscriptsuperscript𝑃𝑋𝑜1subscript𝑇superscript𝑥𝑛𝑛subscript𝑏𝑛\displaystyle+(1-\beta)\max_{x^{\prime n(n-b_{n})}\in\mathcal{C}^{\prime}_{n}}% \left\|T_{x^{\prime n(n-b_{n})}}-P^{\prime}_{X}+o(1)T_{x^{\prime n(n-b_{n})}}% \right\|_{\infty}+ ( 1 - italic_β ) roman_max start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ italic_n ( italic_n - italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ italic_n ( italic_n - italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT + italic_o ( 1 ) italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ italic_n ( italic_n - italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT (217)
\displaystyle\leq\> βmaxxnbn𝒞nTxnbnPX+o(1)Txnbn𝛽subscriptsuperscript𝑥𝑛subscript𝑏𝑛subscript𝒞𝑛subscriptnormsubscript𝑇superscript𝑥𝑛subscript𝑏𝑛subscript𝑃𝑋𝑜1subscriptnormsubscript𝑇superscript𝑥𝑛subscript𝑏𝑛\displaystyle\beta\max_{x^{nb_{n}}\in\mathcal{C}_{n}}\left\|T_{x^{nb_{n}}}-P_{% X}\right\|_{\infty}+o(1)\left\|T_{x^{nb_{n}}}\right\|_{\infty}italic_β roman_max start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT + italic_o ( 1 ) ∥ italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT
+(1β)maxxn(nbn)𝒞nTxn(nbn)PX+o(1)Txn(nbn)1𝛽subscriptsuperscript𝑥𝑛𝑛subscript𝑏𝑛subscriptsuperscript𝒞𝑛subscriptnormsubscript𝑇superscript𝑥𝑛𝑛subscript𝑏𝑛subscriptsuperscript𝑃𝑋𝑜1subscriptnormsubscript𝑇superscript𝑥𝑛𝑛subscript𝑏𝑛\displaystyle+(1-\beta)\max_{x^{\prime n(n-b_{n})}\in\mathcal{C}^{\prime}_{n}}% \left\|T_{x^{\prime n(n-b_{n})}}-P^{\prime}_{X}\right\|_{\infty}+o(1)\left\|T_% {x^{\prime n(n-b_{n})}}\right\|_{\infty}+ ( 1 - italic_β ) roman_max start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ italic_n ( italic_n - italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ italic_n ( italic_n - italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT + italic_o ( 1 ) ∥ italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ italic_n ( italic_n - italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT (218)
n0.𝑛0\displaystyle\underset{n\rightarrow\infty}{\rightarrow}0.start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG 0 . (219)

By Lemma 4, limnlog|𝒞n′′|n2C(G,βPX+(1β)PX)subscript𝑛subscriptsuperscript𝒞′′𝑛superscript𝑛2𝐶𝐺𝛽subscript𝑃𝑋1𝛽subscriptsuperscript𝑃𝑋\lim_{n\rightarrow\infty}\frac{\log|\mathcal{C}^{\prime\prime}_{n}|}{n^{2}}% \leq C(G,\beta P_{X}+(1-\beta)P^{\prime}_{X})roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG roman_log | caligraphic_C start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | end_ARG start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ≤ italic_C ( italic_G , italic_β italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT + ( 1 - italic_β ) italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ), thus

βC(G,PX)+(1β)C(G,PX)C(G,βPX+(1β)PX).𝛽𝐶𝐺subscript𝑃𝑋1𝛽𝐶𝐺subscriptsuperscript𝑃𝑋𝐶𝐺𝛽subscript𝑃𝑋1𝛽subscriptsuperscript𝑃𝑋\displaystyle\beta C(G,P_{X})+(1-\beta)C(G,P^{\prime}_{X})\leq C(G,\beta P_{X}% +(1-\beta)P^{\prime}_{X}).italic_β italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) + ( 1 - italic_β ) italic_C ( italic_G , italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) ≤ italic_C ( italic_G , italic_β italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT + ( 1 - italic_β ) italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) . (220)

The function PXC(G,PX)maps-tosubscript𝑃𝑋𝐶𝐺subscript𝑃𝑋P_{X}\mapsto C(G,P_{X})italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ↦ italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) is concave on the convex compact set Δ(𝒳)Δ𝒳\Delta(\mathcal{X})roman_Δ ( caligraphic_X ), therefore its set of maximizers 𝒫(G)=argmaxPXΔ(𝒳)C(G,PX)superscript𝒫𝐺subscriptsubscript𝑃𝑋Δ𝒳𝐶𝐺subscript𝑃𝑋\mathcal{P}^{\star}(G)=\operatorname*{\arg\!\max}_{P_{X}\in\Delta(\mathcal{X})% }C(G,P_{X})caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G ) = start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_X ) end_POSTSUBSCRIPT italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) is convex. Furthermore, by Theorem 10, the set 𝒫(G)superscript𝒫𝐺\mathcal{P}^{\star}(G)caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G ) is nonempty and satisfies

PX𝒫(G),C(G,PX)=C0(G).formulae-sequencefor-allsubscript𝑃𝑋superscript𝒫𝐺𝐶𝐺subscript𝑃𝑋subscript𝐶0𝐺\displaystyle\forall P_{X}\in\mathcal{P}^{\star}(G),\;C(G,P_{X})=C_{0}(G).∀ italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G ) , italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) = italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) . (221)

Appendix E Proof of Theorem 11

The proof techniques used here are similar as in the proof of Theorem 10 in App. C.

Let us start by showing that Theorem 11 is true when 𝒜𝒜\mathcal{A}caligraphic_A has two elements. Let G=(𝒳,)𝐺𝒳G=(\mathcal{X},\mathcal{E})italic_G = ( caligraphic_X , caligraphic_E ), and G=(𝒳,)superscript𝐺superscript𝒳superscriptG^{\prime}=(\mathcal{X}^{\prime},\mathcal{E^{\prime}})italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = ( caligraphic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , caligraphic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) be two graphs, and let PX,X𝒫(GG)subscript𝑃𝑋superscript𝑋superscript𝒫𝐺superscript𝐺P_{X,X^{\prime}}\in\mathcal{P}^{\star}(G\wedge G^{\prime})italic_P start_POSTSUBSCRIPT italic_X , italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ). We will prove that PXPXtensor-productsubscript𝑃𝑋subscript𝑃superscript𝑋P_{X}\otimes P_{X^{\prime}}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT is also capacity-achieving by building an adequate sequence of codebooks.

For all n𝑛superscriptn\in\mathbb{N}^{\star}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT, let 𝒞n(𝒳×𝒳)nsubscript𝒞𝑛superscript𝒳superscript𝒳𝑛\mathcal{C}_{n}\subseteq(\mathcal{X}\times\mathcal{X}^{\prime})^{n}caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⊆ ( caligraphic_X × caligraphic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT such that 𝒞nsubscript𝒞𝑛\mathcal{C}_{n}caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is an independent set in (GG)nsuperscript𝐺superscript𝐺𝑛(G\wedge G^{\prime})^{\wedge n}( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT, and

1nlog|𝒞n|nC0(GG),1𝑛subscript𝒞𝑛𝑛subscript𝐶0𝐺superscript𝐺\displaystyle\frac{1}{n}\log|\mathcal{C}_{n}|\underset{n\rightarrow\infty}{% \rightarrow}C_{0}(G\wedge G^{\prime}),divide start_ARG 1 end_ARG start_ARG italic_n end_ARG roman_log | caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) , (222)
max(xn,xn)𝒞nTxn,xnPX,Xn0.subscriptsuperscript𝑥𝑛superscript𝑥𝑛subscript𝒞𝑛subscriptnormsubscript𝑇superscript𝑥𝑛superscript𝑥𝑛subscript𝑃𝑋superscript𝑋𝑛0\displaystyle\max_{(x^{n},x^{\prime n})\in\mathcal{C}_{n}}\|T_{x^{n},x^{\prime n% }}-P_{X,X^{\prime}}\|_{\infty}\underset{n\rightarrow\infty}{\rightarrow}0.roman_max start_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_x start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT ) ∈ caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_x start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT italic_X , italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG 0 . (223)

The existence of such a sequence is given by Lemma 4, and Proposition 6. Let

QX,X(n)1|𝒞n|(xn,xn)𝒞nTxn,xn.approaches-limitsubscriptsuperscript𝑄𝑛𝑋superscript𝑋1subscript𝒞𝑛subscriptsuperscript𝑥𝑛superscript𝑥𝑛subscript𝒞𝑛subscript𝑇superscript𝑥𝑛superscript𝑥𝑛\displaystyle Q^{(n)}_{X,X^{\prime}}\doteq\frac{1}{|\mathcal{C}_{n}|}\sum_{(x^% {n},x^{\prime n})\in\mathcal{C}_{n}}T_{x^{n},x^{\prime n}}.italic_Q start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X , italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≐ divide start_ARG 1 end_ARG start_ARG | caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | end_ARG ∑ start_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_x start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT ) ∈ caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_x start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT . (224)

An immediate observation is that

QX,X(n)nPX,Xsubscriptsuperscript𝑄𝑛𝑋superscript𝑋𝑛subscript𝑃𝑋superscript𝑋\displaystyle Q^{(n)}_{X,X^{\prime}}\underset{n\rightarrow\infty}{\rightarrow}% P_{X,X^{\prime}}italic_Q start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X , italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG italic_P start_POSTSUBSCRIPT italic_X , italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT (225)

as a consequence of (223).

Let us build a sequence of codebooks with asymptotic rate C0(GG)subscript𝐶0𝐺superscript𝐺C_{0}(G\wedge G^{\prime})italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ), such that the type of their codewords converge uniformly to PXPXtensor-productsubscript𝑃𝑋subscript𝑃superscript𝑋P_{X}\otimes P_{X^{\prime}}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT:

𝒞n3𝒯εnn3(PXPX)(tn𝒞n(t))n;approaches-limitsubscriptsuperscript𝒞superscript𝑛3subscriptsuperscript𝒯superscript𝑛3subscript𝜀𝑛tensor-productsubscript𝑃𝑋subscript𝑃superscript𝑋superscriptsubscriptproduct𝑡𝑛subscriptsuperscript𝒞𝑡𝑛𝑛\displaystyle\mathcal{C}^{\star}_{n^{3}}\doteq\mathcal{T}^{n^{3}}_{\varepsilon% _{n}}(P_{X}\otimes P_{X^{\prime}})\cap\left(\textstyle\prod_{t\leq n}\mathcal{% C}^{(t)}_{n}\right)^{n};caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≐ caligraphic_T start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) ∩ ( ∏ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT caligraphic_C start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ; (226)

where

εnQX(n)QX(n)PXPX+1n4;approaches-limitsubscript𝜀𝑛subscriptnormtensor-productsubscriptsuperscript𝑄𝑛𝑋subscriptsuperscript𝑄𝑛superscript𝑋tensor-productsubscript𝑃𝑋subscript𝑃superscript𝑋14𝑛\displaystyle\varepsilon_{n}\doteq\|Q^{(n)}_{X}\otimes Q^{(n)}_{X^{\prime}}-P_% {X}\otimes P_{X^{\prime}}\|_{\infty}+\textstyle\frac{1}{\sqrt[4]{n}};italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≐ ∥ italic_Q start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_Q start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT + divide start_ARG 1 end_ARG start_ARG nth-root start_ARG 4 end_ARG start_ARG italic_n end_ARG end_ARG ; (227)

and where for all tn𝑡𝑛t\leq nitalic_t ≤ italic_n, the shifted codebook 𝒞n(t)superscriptsubscript𝒞𝑛𝑡\mathcal{C}_{n}^{(t)}caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT is defined by

𝒞n(t){((xt,xt+1,,xn,x1,,xt1),xn)|(xn,xn)𝒞n}.approaches-limitsubscriptsuperscript𝒞𝑡𝑛conditional-setsubscript𝑥𝑡subscript𝑥𝑡1subscript𝑥𝑛subscript𝑥1subscript𝑥𝑡1superscript𝑥𝑛superscript𝑥𝑛superscript𝑥𝑛subscript𝒞𝑛\displaystyle\mathcal{C}^{(t)}_{n}\doteq\Big{\{}\Big{(}(x_{t},x_{t+1},...,x_{n% },x_{1},...,x_{t-1}),x^{\prime n}\Big{)}\>\Big{|}\>(x^{n},x^{\prime n})\in% \mathcal{C}_{n}\Big{\}}.caligraphic_C start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≐ { ( ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT ) , italic_x start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT ) | ( italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_x start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT ) ∈ caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT } . (228)

By construction, 𝒞n3𝒯εnn3(PXPX)subscriptsuperscript𝒞superscript𝑛3subscriptsuperscript𝒯superscript𝑛3subscript𝜀𝑛tensor-productsubscript𝑃𝑋subscript𝑃superscript𝑋\mathcal{C}^{\star}_{n^{3}}\subseteq\mathcal{T}^{n^{3}}_{\varepsilon_{n}}(P_{X% }\otimes P_{X^{\prime}})caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ⊆ caligraphic_T start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) thanks to (226), and εnn0subscript𝜀𝑛𝑛0\varepsilon_{n}\underset{n\rightarrow\infty}{\rightarrow}0italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG 0 thanks to (227) and (225); therefore we have

maxxn3𝒞nTxn3PXPXn0.subscriptsuperscript𝑥superscript𝑛3subscriptsuperscript𝒞𝑛subscriptnormsubscript𝑇superscript𝑥superscript𝑛3tensor-productsubscript𝑃𝑋subscript𝑃superscript𝑋𝑛0\displaystyle\max_{x^{n^{3}}\in\mathcal{C}^{\star}_{n}}\|T_{x^{n^{3}}}-P_{X}% \otimes P_{X^{\prime}}\|_{\infty}\underset{n\rightarrow\infty}{\rightarrow}0.roman_max start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG 0 . (229)

Furthermore, 𝒞n3subscriptsuperscript𝒞superscript𝑛3\mathcal{C}^{\star}_{n^{3}}caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT is an independent set in (GG)n3superscript𝐺superscript𝐺superscript𝑛3(G\wedge G^{\prime})^{\wedge n^{3}}( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT ∧ italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT, as it is contained in the product independent set (tn𝒞n(t))nsuperscriptsubscriptproduct𝑡𝑛subscriptsuperscript𝒞𝑡𝑛𝑛\left(\textstyle\prod_{t\leq n}\mathcal{C}^{(t)}_{n}\right)^{n}( ∏ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT caligraphic_C start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT; note that this holds because the shifted codebook 𝒞n(t)subscriptsuperscript𝒞𝑡𝑛\mathcal{C}^{(t)}_{n}caligraphic_C start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is an independent set in (GG)nsuperscript𝐺superscript𝐺𝑛(G\wedge G^{\prime})^{\wedge n}( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT for all tn𝑡𝑛t\leq nitalic_t ≤ italic_n.

Now let us prove that log|𝒞n3|n3nC0(GG)subscriptsuperscript𝒞superscript𝑛3superscript𝑛3𝑛subscript𝐶0𝐺superscript𝐺\frac{\log|\mathcal{C}^{\star}_{n^{3}}|}{n^{3}}\underset{n\rightarrow\infty}{% \rightarrow}C_{0}(G\wedge G^{\prime})divide start_ARG roman_log | caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | end_ARG start_ARG italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ). Let us draw a codeword uniformly from (tn𝒞n(t))nsuperscriptsubscriptproduct𝑡𝑛subscriptsuperscript𝒞𝑡𝑛𝑛\left(\textstyle\prod_{t\leq n}\mathcal{C}^{(t)}_{n}\right)^{n}( ∏ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT caligraphic_C start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT:

Cn3(C1n2,,Cnn2),approaches-limitsuperscript𝐶superscript𝑛3subscriptsuperscript𝐶superscript𝑛21subscriptsuperscript𝐶superscript𝑛2𝑛\displaystyle C^{n^{3}}\doteq(C^{n^{2}}_{1},...,C^{n^{2}}_{n}),italic_C start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ≐ ( italic_C start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_C start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) , (230)

where for all tn𝑡𝑛t\leq nitalic_t ≤ italic_n, Ctn2subscriptsuperscript𝐶superscript𝑛2𝑡C^{n^{2}}_{t}italic_C start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is a random n×n𝑛𝑛n\times nitalic_n × italic_n-sequence drawn uniformly from tn𝒞n(t)subscriptproduct𝑡𝑛subscriptsuperscript𝒞𝑡𝑛\textstyle\prod_{t\leq n}\mathcal{C}^{(t)}_{n}∏ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT caligraphic_C start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. We want to prove that Cn3𝒯εnn3(PXPX)superscript𝐶superscript𝑛3subscriptsuperscript𝒯superscript𝑛3subscript𝜀𝑛tensor-productsubscript𝑃𝑋subscript𝑃superscript𝑋C^{n^{3}}\in\mathcal{T}^{n^{3}}_{\varepsilon_{n}}(P_{X}\otimes P_{X^{\prime}})italic_C start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ∈ caligraphic_T start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) with high probability.

On one hand we have to determine the average type of the random variables (Ctn2)tnsubscriptsubscriptsuperscript𝐶superscript𝑛2𝑡𝑡𝑛(C^{n^{2}}_{t})_{t\leq n}( italic_C start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT which are iid copies of Cn2=(C1n,,Cnn)superscript𝐶superscript𝑛2subscriptsuperscript𝐶𝑛1subscriptsuperscript𝐶𝑛𝑛C^{n^{2}}=(C^{n}_{1},...,C^{n}_{n})italic_C start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT = ( italic_C start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_C start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ); where each Ctnsubscriptsuperscript𝐶𝑛𝑡C^{n}_{t}italic_C start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is drawn uniformly from 𝒞n(t)subscriptsuperscript𝒞𝑡𝑛\mathcal{C}^{(t)}_{n}caligraphic_C start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, and the (Ctn)tnsubscriptsubscriptsuperscript𝐶𝑛𝑡𝑡𝑛(C^{n}_{t})_{t\leq n}( italic_C start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT are mutually independent.

𝔼[TCtn2]𝔼delimited-[]subscript𝑇subscriptsuperscript𝐶superscript𝑛2𝑡\displaystyle\mathbb{E}\left[T_{C^{n^{2}}_{t}}\right]blackboard_E [ italic_T start_POSTSUBSCRIPT italic_C start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] =1ntn𝔼[TCtn]absent1𝑛subscript𝑡𝑛𝔼delimited-[]subscript𝑇subscriptsuperscript𝐶𝑛𝑡\displaystyle=\frac{1}{n}\sum_{t\leq n}\mathbb{E}\left[T_{C^{n}_{t}}\right]= divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT blackboard_E [ italic_T start_POSTSUBSCRIPT italic_C start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] (231)
=1ntn1|𝒞n(t)|(xn,xn)𝒞n(t)Txn,xnabsent1𝑛subscript𝑡𝑛1subscriptsuperscript𝒞𝑡𝑛subscriptsuperscript𝑥𝑛superscript𝑥𝑛subscriptsuperscript𝒞𝑡𝑛subscript𝑇superscript𝑥𝑛superscript𝑥𝑛\displaystyle=\frac{1}{n}\sum_{t\leq n}\frac{1}{|\mathcal{C}^{(t)}_{n}|}\sum_{% (x^{n},x^{\prime n})\in\mathcal{C}^{(t)}_{n}}T_{x^{n},x^{\prime n}}= divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG | caligraphic_C start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | end_ARG ∑ start_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_x start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT ) ∈ caligraphic_C start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_x start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT (232)
=1ntn1|𝒞n|(xn,xn)𝒞nTσt(xn),xnabsent1𝑛subscript𝑡𝑛1subscript𝒞𝑛subscriptsuperscript𝑥𝑛superscript𝑥𝑛subscript𝒞𝑛subscript𝑇subscript𝜎𝑡superscript𝑥𝑛superscript𝑥𝑛\displaystyle=\frac{1}{n}\sum_{t\leq n}\frac{1}{|\mathcal{C}_{n}|}\sum_{(x^{n}% ,x^{\prime n})\in\mathcal{C}_{n}}T_{\sigma_{t}(x^{n}),x^{\prime n}}= divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG | caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | end_ARG ∑ start_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_x start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT ) ∈ caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) , italic_x start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT (233)
=1|𝒞n|(xn,xn)𝒞n1ntnTσt(xn),xnabsent1subscript𝒞𝑛subscriptsuperscript𝑥𝑛superscript𝑥𝑛subscript𝒞𝑛1𝑛subscript𝑡𝑛subscript𝑇subscript𝜎𝑡superscript𝑥𝑛superscript𝑥𝑛\displaystyle=\frac{1}{|\mathcal{C}_{n}|}\sum_{(x^{n},x^{\prime n})\in\mathcal% {C}_{n}}\frac{1}{n}\sum_{t\leq n}T_{\sigma_{t}(x^{n}),x^{\prime n}}= divide start_ARG 1 end_ARG start_ARG | caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | end_ARG ∑ start_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_x start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT ) ∈ caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) , italic_x start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT (234)
=1|𝒞n|(xn,xn)𝒞nTxnTxnabsent1subscript𝒞𝑛subscriptsuperscript𝑥𝑛superscript𝑥𝑛subscript𝒞𝑛tensor-productsubscript𝑇superscript𝑥𝑛subscript𝑇superscript𝑥𝑛\displaystyle=\frac{1}{|\mathcal{C}_{n}|}\sum_{(x^{n},x^{\prime n})\in\mathcal% {C}_{n}}T_{x^{n}}\otimes T_{x^{\prime n}}= divide start_ARG 1 end_ARG start_ARG | caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | end_ARG ∑ start_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_x start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT ) ∈ caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ⊗ italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT (235)
=QX(n)QX(n),absenttensor-productsubscriptsuperscript𝑄𝑛𝑋subscriptsuperscript𝑄𝑛superscript𝑋\displaystyle=Q^{(n)}_{X}\otimes Q^{(n)}_{X^{\prime}},= italic_Q start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_Q start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , (236)

where σt(xn)=(xt,xt+1,,xn,x1,,xn1)subscript𝜎𝑡superscript𝑥𝑛subscript𝑥𝑡subscript𝑥𝑡1subscript𝑥𝑛subscript𝑥1subscript𝑥𝑛1\sigma_{t}(x^{n})=(x_{t},x_{t+1},...,x_{n},x_{1},...,x_{n-1})italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) = ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_n - 1 end_POSTSUBSCRIPT ); (233) comes from the construction of 𝒞n(t)subscriptsuperscript𝒞𝑡𝑛\mathcal{C}^{(t)}_{n}caligraphic_C start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT in (228); and (235) comes from the following observation:

tnTσt(xn),xn=tnsnTxs+t,xs=sntnTxs+t,xssubscript𝑡𝑛subscript𝑇subscript𝜎𝑡superscript𝑥𝑛superscript𝑥𝑛subscript𝑡𝑛subscript𝑠𝑛subscript𝑇subscript𝑥𝑠𝑡subscriptsuperscript𝑥𝑠subscript𝑠𝑛subscript𝑡𝑛subscript𝑇subscript𝑥𝑠𝑡subscriptsuperscript𝑥𝑠\displaystyle\sum_{t\leq n}T_{\sigma_{t}(x^{n}),x^{\prime n}}=\sum_{t\leq n}% \sum_{s\leq n}T_{x_{s+t},x^{\prime}_{s}}=\sum_{s\leq n}\sum_{t\leq n}T_{x_{s+t% },x^{\prime}_{s}}∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) , italic_x start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_s ≤ italic_n end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_s + italic_t end_POSTSUBSCRIPT , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_s ≤ italic_n end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_s + italic_t end_POSTSUBSCRIPT , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT (237)
=snTxn,(xs,,xs)=snTxnTxs=TxnTxn,absentsubscript𝑠𝑛subscript𝑇superscript𝑥𝑛subscriptsuperscript𝑥𝑠subscriptsuperscript𝑥𝑠subscript𝑠𝑛tensor-productsubscript𝑇superscript𝑥𝑛subscript𝑇subscriptsuperscript𝑥𝑠tensor-productsubscript𝑇superscript𝑥𝑛subscript𝑇superscript𝑥𝑛\displaystyle=\sum_{s\leq n}T_{x^{n},(x^{\prime}_{s},...,x^{\prime}_{s})}=\sum% _{s\leq n}T_{x^{n}}\otimes T_{x^{\prime}_{s}}=T_{x^{n}}\otimes T_{x^{\prime n}},= ∑ start_POSTSUBSCRIPT italic_s ≤ italic_n end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT , … , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_s ≤ italic_n end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ⊗ italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ⊗ italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , (238)

where the index s+t𝑠𝑡s+titalic_s + italic_t is taken modulo n𝑛nitalic_n.

On the other hand we have

|𝒞n3||(tn𝒞n(t))n|subscriptsuperscript𝒞superscript𝑛3superscriptsubscriptproduct𝑡𝑛subscriptsuperscript𝒞𝑡𝑛𝑛\displaystyle\frac{|\mathcal{C}^{\star}_{n^{3}}|}{\left|\left(\textstyle\prod_% {t\leq n}\mathcal{C}^{(t)}_{n}\right)^{n}\right|}divide start_ARG | caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | end_ARG start_ARG | ( ∏ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT caligraphic_C start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT | end_ARG (239)
=|𝒯εnn3(PXPX)(tn𝒞n(t))n||(tn𝒞n(t))n|absentsubscriptsuperscript𝒯superscript𝑛3subscript𝜀𝑛tensor-productsubscript𝑃𝑋subscript𝑃superscript𝑋superscriptsubscriptproduct𝑡𝑛subscriptsuperscript𝒞𝑡𝑛𝑛superscriptsubscriptproduct𝑡𝑛subscriptsuperscript𝒞𝑡𝑛𝑛\displaystyle=\frac{\left|\mathcal{T}^{n^{3}}_{\varepsilon_{n}}(P_{X}\otimes P% _{X^{\prime}})\cap\left(\textstyle\prod_{t\leq n}\mathcal{C}^{(t)}_{n}\right)^% {n}\right|}{\left|\left(\textstyle\prod_{t\leq n}\mathcal{C}^{(t)}_{n}\right)^% {n}\right|}= divide start_ARG | caligraphic_T start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) ∩ ( ∏ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT caligraphic_C start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT | end_ARG start_ARG | ( ∏ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT caligraphic_C start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT | end_ARG (240)
=(Cn3𝒯εnn3(PXPX))absentsuperscript𝐶superscript𝑛3subscriptsuperscript𝒯superscript𝑛3subscript𝜀𝑛tensor-productsubscript𝑃𝑋subscript𝑃superscript𝑋\displaystyle=\mathbb{P}\left(C^{n^{3}}\in\mathcal{T}^{n^{3}}_{\varepsilon_{n}% }(P_{X}\otimes P_{X^{\prime}})\right)= blackboard_P ( italic_C start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ∈ caligraphic_T start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) ) (241)
=(1ntnTCtn2PXPXεn)absentsubscriptnorm1𝑛subscript𝑡𝑛subscript𝑇superscriptsubscript𝐶𝑡superscript𝑛2tensor-productsubscript𝑃𝑋subscript𝑃superscript𝑋subscript𝜀𝑛\displaystyle=\mathbb{P}\left(\left\|\textstyle\frac{1}{n}\sum_{t\leq n}T_{C_{% t}^{n^{2}}}-P_{X}\otimes P_{X^{\prime}}\right\|_{\infty}\leq\varepsilon_{n}\right)= blackboard_P ( ∥ divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) (242)
(1ntnTCtn2QX(n)QX(n)+QX(n)QX(n)PXPXεn)absentsubscriptnorm1𝑛subscript𝑡𝑛subscript𝑇superscriptsubscript𝐶𝑡superscript𝑛2tensor-productsubscriptsuperscript𝑄𝑛𝑋subscriptsuperscript𝑄𝑛superscript𝑋subscriptnormtensor-productsubscriptsuperscript𝑄𝑛𝑋subscriptsuperscript𝑄𝑛superscript𝑋tensor-productsubscript𝑃𝑋subscript𝑃superscript𝑋subscript𝜀𝑛\displaystyle\geq\mathbb{P}\left(\left\|\textstyle\frac{1}{n}\sum_{t\leq n}T_{% C_{t}^{n^{2}}}-Q^{(n)}_{X}\otimes Q^{(n)}_{X^{\prime}}\right\|_{\infty}+\left% \|Q^{(n)}_{X}\otimes Q^{(n)}_{X^{\prime}}-P_{X}\otimes P_{X^{\prime}}\right\|_% {\infty}\leq\varepsilon_{n}\right)≥ blackboard_P ( ∥ divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_Q start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_Q start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT + ∥ italic_Q start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_Q start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) (243)
=(tnTCtn2nQX(n)QX(n)n3/4)absentsubscriptnormsubscript𝑡𝑛subscript𝑇superscriptsubscript𝐶𝑡superscript𝑛2tensor-product𝑛subscriptsuperscript𝑄𝑛𝑋subscriptsuperscript𝑄𝑛superscript𝑋superscript𝑛34\displaystyle=\mathbb{P}\left(\left\|\textstyle\sum_{t\leq n}T_{C_{t}^{n^{2}}}% -nQ^{(n)}_{X}\otimes Q^{(n)}_{X^{\prime}}\right\|_{\infty}\leq n^{3/4}\right)= blackboard_P ( ∥ ∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_n italic_Q start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_Q start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_n start_POSTSUPERSCRIPT 3 / 4 end_POSTSUPERSCRIPT ) (244)
1(x,x)𝒳×𝒳(|tnTCtn2(x,x)nQX(n)QX(n)(x,x)|>n3/4)absent1subscript𝑥superscript𝑥𝒳superscript𝒳subscript𝑡𝑛subscript𝑇superscriptsubscript𝐶𝑡superscript𝑛2𝑥superscript𝑥tensor-product𝑛subscriptsuperscript𝑄𝑛𝑋subscriptsuperscript𝑄𝑛superscript𝑋𝑥superscript𝑥superscript𝑛34\displaystyle\geq 1-\textstyle\sum_{(x,x^{\prime})\in\mathcal{X}\times\mathcal% {X}^{\prime}}\mathbb{P}\left(\left|\textstyle\sum_{t\leq n}T_{C_{t}^{n^{2}}}(x% ,x^{\prime})-nQ^{(n)}_{X}\otimes Q^{(n)}_{X^{\prime}}(x,x^{\prime})\right|>n^{% 3/4}\right)≥ 1 - ∑ start_POSTSUBSCRIPT ( italic_x , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∈ caligraphic_X × caligraphic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT blackboard_P ( | ∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_x , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - italic_n italic_Q start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_Q start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_x , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) | > italic_n start_POSTSUPERSCRIPT 3 / 4 end_POSTSUPERSCRIPT ) (245)
1(x,x)𝒳×𝒳1n3/2𝕏[tnTCtn2(x,x)]absent1subscript𝑥superscript𝑥𝒳superscript𝒳1superscript𝑛32𝕏delimited-[]subscript𝑡𝑛subscript𝑇superscriptsubscript𝐶𝑡superscript𝑛2𝑥superscript𝑥\displaystyle\geq 1-\textstyle\sum_{(x,x^{\prime})\in\mathcal{X}\times\mathcal% {X}^{\prime}}\frac{1}{n^{3/2}}\mathbb{X}\left[\textstyle\sum_{t\leq n}T_{C_{t}% ^{n^{2}}}(x,x^{\prime})\right]≥ 1 - ∑ start_POSTSUBSCRIPT ( italic_x , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∈ caligraphic_X × caligraphic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n start_POSTSUPERSCRIPT 3 / 2 end_POSTSUPERSCRIPT end_ARG blackboard_X [ ∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_x , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ] (246)
1|𝒳||𝒳|n1/2n1;absent1𝒳superscript𝒳superscript𝑛12𝑛1\displaystyle\geq 1-\textstyle\frac{|\mathcal{X}||\mathcal{X}^{\prime}|}{n^{1/% 2}}\underset{n\rightarrow\infty}{\rightarrow}1;≥ 1 - divide start_ARG | caligraphic_X | | caligraphic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | end_ARG start_ARG italic_n start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT end_ARG start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG 1 ; (247)

where (241) and (242) come from the construction of Cn3superscript𝐶superscript𝑛3C^{n^{3}}italic_C start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT; (244) comes from the construction of εnsubscript𝜀𝑛\varepsilon_{n}italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT; (245) follows from the union bound; (246) comes from Chebyshex’s inequality and (236); and (247) comes from the fact that 𝕍[tnTCtn2(x,x)]=tn𝕍[TCtn2(x,x)]n𝕍delimited-[]subscript𝑡𝑛subscript𝑇superscriptsubscript𝐶𝑡superscript𝑛2𝑥superscript𝑥subscript𝑡𝑛𝕍delimited-[]subscript𝑇superscriptsubscript𝐶𝑡superscript𝑛2𝑥superscript𝑥𝑛\mathbb{V}\left[\sum_{t\leq n}T_{C_{t}^{n^{2}}}(x,x^{\prime})\right]=\sum_{t% \leq n}\mathbb{V}\left[T_{C_{t}^{n^{2}}}(x,x^{\prime})\right]\leq nblackboard_V [ ∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_x , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ] = ∑ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT blackboard_V [ italic_T start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_x , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ] ≤ italic_n, as the random variables TCtn2(x,x)subscript𝑇superscriptsubscript𝐶𝑡superscript𝑛2𝑥superscript𝑥T_{C_{t}^{n^{2}}}(x,x^{\prime})italic_T start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_x , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) are iid and takes values in [0,1]01[0,1][ 0 , 1 ]. Hence

limnlog|𝒞n3|n3=limnlog|(tn𝒞n(t))n|n3=limnlog|𝒞n|n=C0(GG);subscript𝑛subscriptsuperscript𝒞superscript𝑛3superscript𝑛3subscript𝑛superscriptsubscriptproduct𝑡𝑛subscriptsuperscript𝒞𝑡𝑛𝑛superscript𝑛3subscript𝑛subscript𝒞𝑛𝑛subscript𝐶0𝐺superscript𝐺\displaystyle\lim_{n\rightarrow\infty}\frac{\log|\mathcal{C}^{\star}_{n^{3}}|}% {n^{3}}=\lim_{n\rightarrow\infty}\frac{\log\left|\left(\textstyle\prod_{t\leq n% }\mathcal{C}^{(t)}_{n}\right)^{n}\right|}{n^{3}}=\lim_{n\rightarrow\infty}% \frac{\log|\mathcal{C}_{n}|}{n}=C_{0}(G\wedge G^{\prime});roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG roman_log | caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | end_ARG start_ARG italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG = roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG roman_log | ( ∏ start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT caligraphic_C start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT | end_ARG start_ARG italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG = roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG roman_log | caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | end_ARG start_ARG italic_n end_ARG = italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ; (248)

where the second equality holds as the shifted codebooks (𝒞n(t))tnsubscriptsubscriptsuperscript𝒞𝑡𝑛𝑡𝑛(\mathcal{C}^{(t)}_{n})_{t\leq n}( caligraphic_C start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t ≤ italic_n end_POSTSUBSCRIPT all have cardinality |𝒞n|subscript𝒞𝑛|\mathcal{C}_{n}|| caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT |.

Thus, by combining (248), Lemma 4, and Proposition 6 we obtain

C0(GG)=limnlog|𝒞n3|n3C(GG,PXPX)C0(GG),subscript𝐶0𝐺superscript𝐺subscript𝑛subscriptsuperscript𝒞superscript𝑛3superscript𝑛3𝐶𝐺superscript𝐺tensor-productsubscript𝑃𝑋subscript𝑃superscript𝑋subscript𝐶0𝐺superscript𝐺\displaystyle C_{0}(G\wedge G^{\prime})=\lim_{n\rightarrow\infty}\frac{\log|% \mathcal{C}^{\star}_{n^{3}}|}{n^{3}}\leq C(G\wedge G^{\prime},P_{X}\otimes P_{% X^{\prime}})\leq C_{0}(G\wedge G^{\prime}),italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG roman_log | caligraphic_C start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | end_ARG start_ARG italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG ≤ italic_C ( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) ≤ italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) , (249)

hence PXPX𝒫(GG)tensor-productsubscript𝑃𝑋subscript𝑃superscript𝑋superscript𝒫𝐺superscript𝐺P_{X}\otimes P_{X^{\prime}}\in\mathcal{P}^{\star}(G\wedge G^{\prime})italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ).

Therefore, Theorem 11 is proved when 𝒜𝒜\mathcal{A}caligraphic_A has two elements:

PX,X𝒫(GG)PXPX𝒫(GG).subscript𝑃𝑋superscript𝑋superscript𝒫𝐺superscript𝐺tensor-productsubscript𝑃𝑋subscript𝑃superscript𝑋superscript𝒫𝐺superscript𝐺\displaystyle P_{X,X^{\prime}}\in\mathcal{P}^{\star}(G\wedge G^{\prime})% \Longrightarrow P_{X}\otimes P_{X^{\prime}}\in\mathcal{P}^{\star}(G\wedge G^{% \prime}).italic_P start_POSTSUBSCRIPT italic_X , italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ⟹ italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G ∧ italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) . (250)

Now let us consider the case where 𝒜𝒜\mathcal{A}caligraphic_A has a cardinality greater than 2. Let PX1,,X𝒜𝒫(a𝒜Ga)subscript𝑃subscript𝑋1subscript𝑋𝒜superscript𝒫subscript𝑎𝒜subscript𝐺𝑎P_{X_{1},...,X_{\mathcal{A}}}\in\mathcal{P}^{\star}(\bigwedge_{a\in\mathcal{A}% }G_{a})italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ). By considering the product graphs

a𝒜Ga=(1i<iGi)(ii|𝒜|Gi);subscript𝑎𝒜subscript𝐺𝑎subscript1𝑖superscript𝑖subscript𝐺𝑖subscriptsuperscript𝑖𝑖𝒜subscript𝐺𝑖\displaystyle\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}=\Big{(}\bigwedge_{1% \leq i<i^{\star}}G_{i}\Big{)}\wedge\Big{(}\bigwedge_{i^{\star}\leq i\leq|% \mathcal{A}|}G_{i}\Big{)};⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT = ( ⋀ start_POSTSUBSCRIPT 1 ≤ italic_i < italic_i start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∧ ( ⋀ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ≤ italic_i ≤ | caligraphic_A | end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ; (251)

for all i|𝒜|superscript𝑖𝒜i^{\star}\leq|\mathcal{A}|italic_i start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ≤ | caligraphic_A |, and applying (250) successively, we obtain

PX1,,X𝒜𝒫(a𝒜Ga)subscript𝑃subscript𝑋1subscript𝑋𝒜superscript𝒫subscript𝑎𝒜subscript𝐺𝑎\displaystyle P_{X_{1},...,X_{\mathcal{A}}}\in\mathcal{P}^{\star}\left(% \textstyle\bigwedge_{a\in\mathcal{A}}G_{a}\right)italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) PX1PX2,,X|𝒜|𝒫(a𝒜Ga)absenttensor-productsubscript𝑃subscript𝑋1subscript𝑃subscript𝑋2subscript𝑋𝒜superscript𝒫subscript𝑎𝒜subscript𝐺𝑎\displaystyle\Longrightarrow P_{X_{1}}\otimes P_{X_{2},...,X_{|\mathcal{A}|}}% \in\mathcal{P}^{\star}\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}\right)⟹ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) (252)
(PX1PX2)PX3,,X|𝒜|𝒫(a𝒜Ga)absenttensor-producttensor-productsubscript𝑃subscript𝑋1subscript𝑃subscript𝑋2subscript𝑃subscript𝑋3subscript𝑋𝒜superscript𝒫subscript𝑎𝒜subscript𝐺𝑎\displaystyle\Longrightarrow(P_{X_{1}}\otimes P_{X_{2}})\otimes P_{X_{3},...,X% _{|\mathcal{A}|}}\in\mathcal{P}^{\star}\left(\textstyle\bigwedge_{a\in\mathcal% {A}}G_{a}\right)⟹ ( italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ⊗ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) (253)
a𝒜PXa𝒫(a𝒜Ga).absentsubscripttensor-product𝑎𝒜subscript𝑃subscript𝑋𝑎superscript𝒫subscript𝑎𝒜subscript𝐺𝑎\displaystyle\Longrightarrow\textstyle\bigotimes_{a\in\mathcal{A}}P_{X_{a}}\in% \mathcal{P}^{\star}\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}\right).⟹ ⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) . (254)

Appendix F Results on capacity-achieving distributions

F-A Proof of Proposition 9

Let G𝐺Gitalic_G be a vertex-transitive graph, and let PX𝒫(G)subscript𝑃𝑋superscript𝒫𝐺P_{X}\in\mathcal{P}^{\star}(G)italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G ). Let ψAut(G)𝜓Aut𝐺\psi\in\operatorname*{Aut}(G)italic_ψ ∈ roman_Aut ( italic_G ), we first prove that Pψ(X)𝒫(G)subscript𝑃𝜓𝑋superscript𝒫𝐺P_{\psi(X)}\in\mathcal{P}^{\star}(G)italic_P start_POSTSUBSCRIPT italic_ψ ( italic_X ) end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G ), then we will conclude by using the convexity of 𝒫(G)superscript𝒫𝐺\mathcal{P}^{\star}(G)caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G ).

Let (𝒞n)nsubscriptsubscript𝒞𝑛𝑛superscript(\mathcal{C}_{n})_{n\in\mathbb{N}^{\star}}( caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT be a sequence such that

n,𝒞n𝒳n is an independent set in Gn,formulae-sequencefor-all𝑛superscriptsubscript𝒞𝑛superscript𝒳𝑛 is an independent set in superscript𝐺𝑛\displaystyle\forall n\in\mathbb{N}^{\star},\;\mathcal{C}_{n}\subseteq\mathcal% {X}^{n}\text{ is an independent set in }G^{\wedge n},∀ italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⊆ caligraphic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is an independent set in italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT , (255)
maxxn𝒞nTxnPXn0,subscriptsuperscript𝑥𝑛subscript𝒞𝑛subscriptnormsubscript𝑇superscript𝑥𝑛subscript𝑃𝑋𝑛0\displaystyle\max_{x^{n}\in\mathcal{C}_{n}}\|T_{x^{n}}-P_{X}\|_{\infty}% \underset{n\rightarrow\infty}{\rightarrow}0,roman_max start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG 0 , (256)
log|𝒞n|nnC(G,PX)=C0(G).subscript𝒞𝑛𝑛𝑛𝐶𝐺subscript𝑃𝑋subscript𝐶0𝐺\displaystyle\frac{\log|\mathcal{C}_{n}|}{n}\underset{n\rightarrow\infty}{% \rightarrow}C(G,P_{X})=C_{0}(G).divide start_ARG roman_log | caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | end_ARG start_ARG italic_n end_ARG start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG italic_C ( italic_G , italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) = italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) . (257)

The existence of such a sequence is given by Lemma 4. Note that the last equality in (257) comes from the assumption PX𝒫(G)subscript𝑃𝑋superscript𝒫𝐺P_{X}\in\mathcal{P}^{\star}(G)italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G ).

Now, for all n𝑛superscriptn\in\mathbb{N}^{\star}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT the codebook

ψ(𝒞n){(ψ(x1),,ψ(xn))|xn𝒞n}approaches-limit𝜓subscript𝒞𝑛conditional-set𝜓subscript𝑥1𝜓subscript𝑥𝑛superscript𝑥𝑛subscript𝒞𝑛\displaystyle\psi(\mathcal{C}_{n})\doteq\{(\psi(x_{1}),...,\psi(x_{n}))\>|\>x^% {n}\in\mathcal{C}_{n}\}italic_ψ ( caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ≐ { ( italic_ψ ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , italic_ψ ( italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ) | italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∈ caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT } (258)

is also independent in Gnsuperscript𝐺𝑛G^{\wedge n}italic_G start_POSTSUPERSCRIPT ∧ italic_n end_POSTSUPERSCRIPT, as ψ𝜓\psiitalic_ψ is a graph automorphism and therefore preserves adjacencies. We have by construction

maxxnψ(𝒞n)TxnPψ(X)n0.subscriptsuperscript𝑥𝑛𝜓subscript𝒞𝑛subscriptnormsubscript𝑇superscript𝑥𝑛subscript𝑃𝜓𝑋𝑛0\displaystyle\max_{x^{n}\in\psi(\mathcal{C}_{n})}\|T_{x^{n}}-P_{\psi(X)}\|_{% \infty}\underset{n\rightarrow\infty}{\rightarrow}0.roman_max start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∈ italic_ψ ( caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ∥ italic_T start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT italic_ψ ( italic_X ) end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG 0 . (259)

Furthermore, since ψ𝜓\psiitalic_ψ is a bijection we have |ψ(𝒞n)|=|𝒞n|𝜓subscript𝒞𝑛subscript𝒞𝑛|\psi(\mathcal{C}_{n})|=|\mathcal{C}_{n}|| italic_ψ ( caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) | = | caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | for all n𝑛superscriptn\in\mathbb{N}^{\star}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT, thus

log|ψ(𝒞n)|n=log|𝒞n|nnC0(G).𝜓subscript𝒞𝑛𝑛subscript𝒞𝑛𝑛𝑛subscript𝐶0𝐺\displaystyle\frac{\log|\psi(\mathcal{C}_{n})|}{n}=\frac{\log|\mathcal{C}_{n}|% }{n}\underset{n\rightarrow\infty}{\rightarrow}C_{0}(G).divide start_ARG roman_log | italic_ψ ( caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) | end_ARG start_ARG italic_n end_ARG = divide start_ARG roman_log | caligraphic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | end_ARG start_ARG italic_n end_ARG start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARG → end_ARG italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G ) . (260)

Hence

Pψ(X)𝒫(G).subscript𝑃𝜓𝑋superscript𝒫𝐺\displaystyle P_{\psi(X)}\in\mathcal{P}^{\star}(G).italic_P start_POSTSUBSCRIPT italic_ψ ( italic_X ) end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G ) . (261)

Now, for all v,x𝒳𝑣superscript𝑥𝒳v,x^{\prime}\in\mathcal{X}italic_v , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_X, denote by 𝒮xvAut(G)subscript𝒮superscript𝑥𝑣Aut𝐺\mathcal{S}_{x^{\prime}\rightarrow v}\subseteq\operatorname*{Aut}(G)caligraphic_S start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT → italic_v end_POSTSUBSCRIPT ⊆ roman_Aut ( italic_G ) the set of automorphisms that map xsuperscript𝑥x^{\prime}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT to v𝑣vitalic_v; note that this set is nonempty thanks to the vertex-transitivity of G𝐺Gitalic_G. We have for all v𝒳𝑣𝒳v\in\mathcal{X}italic_v ∈ caligraphic_X

Aut(G)=x𝒳𝒮xv.Aut𝐺subscriptsquare-unionsuperscript𝑥𝒳subscript𝒮superscript𝑥𝑣\displaystyle\operatorname*{Aut}(G)=\textstyle\bigsqcup_{x^{\prime}\in\mathcal% {X}}\mathcal{S}_{x^{\prime}\rightarrow v}.roman_Aut ( italic_G ) = ⨆ start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_X end_POSTSUBSCRIPT caligraphic_S start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT → italic_v end_POSTSUBSCRIPT . (262)

Furthermore, for all v𝒳𝑣𝒳v\in\mathcal{X}italic_v ∈ caligraphic_X, all the sets (𝒮xv)xvsubscriptsubscript𝒮superscript𝑥𝑣superscript𝑥𝑣(\mathcal{S}_{x^{\prime}\rightarrow v})_{x^{\prime}\rightarrow v}( caligraphic_S start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT → italic_v end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT → italic_v end_POSTSUBSCRIPT have the same cardinality: for all x,x′′𝒳superscript𝑥superscript𝑥′′𝒳x^{\prime},x^{\prime\prime}\in\mathcal{X}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_x start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ∈ caligraphic_X,

𝒮x′′vψ1𝒮xv,subscript𝒮superscript𝑥′′𝑣subscript𝜓1subscript𝒮superscript𝑥𝑣\displaystyle\mathcal{S}_{x^{\prime\prime}\rightarrow v}\circ\psi_{1}\subseteq% \mathcal{S}_{x^{\prime}\rightarrow v},caligraphic_S start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT → italic_v end_POSTSUBSCRIPT ∘ italic_ψ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ⊆ caligraphic_S start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT → italic_v end_POSTSUBSCRIPT , (263)

where ψ1𝒮xx′′subscript𝜓1subscript𝒮superscript𝑥superscript𝑥′′\psi_{1}\in\mathcal{S}_{x^{\prime}\rightarrow x^{\prime\prime}}italic_ψ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∈ caligraphic_S start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT → italic_x start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT. It follows that for all v,x𝒳𝑣superscript𝑥𝒳v,x^{\prime}\in\mathcal{X}italic_v , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_X,

|𝒮xv|=|Aut(G)||𝒳|.subscript𝒮superscript𝑥𝑣Aut𝐺𝒳\displaystyle|\mathcal{S}_{x^{\prime}\rightarrow v}|=\frac{|\operatorname*{Aut% }(G)|}{|\mathcal{X}|}.| caligraphic_S start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT → italic_v end_POSTSUBSCRIPT | = divide start_ARG | roman_Aut ( italic_G ) | end_ARG start_ARG | caligraphic_X | end_ARG . (264)

Therefore, for all v𝒳𝑣𝒳v\in\mathcal{X}italic_v ∈ caligraphic_X we have

1|Aut(G)|ψAut(G)Pψ(X)𝒫(G)\displaystyle\frac{1}{|\operatorname*{Aut}(G)|}\sum_{\psi\in\operatorname*{Aut% }(G)}P_{\psi(X)}\qquad\in\mathcal{P}^{\star}(G)divide start_ARG 1 end_ARG start_ARG | roman_Aut ( italic_G ) | end_ARG ∑ start_POSTSUBSCRIPT italic_ψ ∈ roman_Aut ( italic_G ) end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_ψ ( italic_X ) end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G ) (265)
=\displaystyle=\;= (1|Aut(G)|ψAut(G)PX(ψ1(x)))x𝒳subscript1Aut𝐺subscript𝜓Aut𝐺subscript𝑃𝑋superscript𝜓1𝑥𝑥𝒳\displaystyle\bigg{(}\frac{1}{|\operatorname*{Aut}(G)|}\sum_{\psi\in% \operatorname*{Aut}(G)}P_{X}(\psi^{-1}(x))\bigg{)}_{x\in\mathcal{X}}( divide start_ARG 1 end_ARG start_ARG | roman_Aut ( italic_G ) | end_ARG ∑ start_POSTSUBSCRIPT italic_ψ ∈ roman_Aut ( italic_G ) end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( italic_ψ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_x ) ) ) start_POSTSUBSCRIPT italic_x ∈ caligraphic_X end_POSTSUBSCRIPT (266)
=\displaystyle=\;= (1|Aut(G)|x𝒳|𝒮xv|PX(x))x𝒳subscript1Aut𝐺subscriptsuperscript𝑥𝒳subscript𝒮superscript𝑥𝑣subscript𝑃𝑋superscript𝑥𝑥𝒳\displaystyle\bigg{(}\frac{1}{|\operatorname*{Aut}(G)|}\sum_{x^{\prime}\in% \mathcal{X}}|\mathcal{S}_{x^{\prime}\rightarrow v}|P_{X}(x^{\prime})\bigg{)}_{% x\in\mathcal{X}}( divide start_ARG 1 end_ARG start_ARG | roman_Aut ( italic_G ) | end_ARG ∑ start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_X end_POSTSUBSCRIPT | caligraphic_S start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT → italic_v end_POSTSUBSCRIPT | italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ) start_POSTSUBSCRIPT italic_x ∈ caligraphic_X end_POSTSUBSCRIPT (267)
=\displaystyle=\;= (1|Aut(G)|x𝒳|Aut(G)||𝒳|PX(x))x𝒳subscript1Aut𝐺subscriptsuperscript𝑥𝒳Aut𝐺𝒳subscript𝑃𝑋superscript𝑥𝑥𝒳\displaystyle\bigg{(}\frac{1}{|\operatorname*{Aut}(G)|}\sum_{x^{\prime}\in% \mathcal{X}}\frac{|\operatorname*{Aut}(G)|}{|\mathcal{X}|}P_{X}(x^{\prime})% \bigg{)}_{x\in\mathcal{X}}( divide start_ARG 1 end_ARG start_ARG | roman_Aut ( italic_G ) | end_ARG ∑ start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_X end_POSTSUBSCRIPT divide start_ARG | roman_Aut ( italic_G ) | end_ARG start_ARG | caligraphic_X | end_ARG italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ) start_POSTSUBSCRIPT italic_x ∈ caligraphic_X end_POSTSUBSCRIPT (268)
=\displaystyle=\;= Unif(𝒳);Unif𝒳\displaystyle\operatorname*{Unif}(\mathcal{X});roman_Unif ( caligraphic_X ) ; (269)

where (265) comes from the convexity of 𝒫(G)superscript𝒫𝐺\mathcal{P}^{\star}(G)caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G ) given by Proposition 6 and (261); (267) comes from (262); and (268) comes from (264).

F-B Proof of Lemma 5

Let (wa)a𝒜|𝒜|subscriptsubscript𝑤𝑎𝑎𝒜superscript𝒜(w_{a})_{a\in\mathcal{A}}\in\mathbb{R}^{|\mathcal{A}|}( italic_w start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT | caligraphic_A | end_POSTSUPERSCRIPT, and maximize

ζ:PAH(PA)+a𝒜PA(a)wa.:𝜁maps-tosubscript𝑃𝐴𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑤𝑎\displaystyle\zeta:P_{A}\mapsto H(P_{A})+\sum_{a\in\mathcal{A}}P_{A}(a)w_{a}.italic_ζ : italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ↦ italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_w start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT . (270)

It can be easily observed that ζ𝜁\zetaitalic_ζ is strictly concave, hence the existence and uniqueness of the maximum. We have

ζ(PA)=(logPA(a)1ln2+wa)a𝒜,𝜁subscript𝑃𝐴subscriptsubscript𝑃𝐴𝑎12subscript𝑤𝑎𝑎𝒜\displaystyle\nabla\zeta(P_{A})=\left(-\log P_{A}(a)-\frac{1}{\ln 2}+w_{a}% \right)_{a\in\mathcal{A}},∇ italic_ζ ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) = ( - roman_log italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) - divide start_ARG 1 end_ARG start_ARG roman_ln 2 end_ARG + italic_w start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT , (271)

hence

ζ(PA)Δ(𝒜)perpendicular-to𝜁subscript𝑃𝐴Δ𝒜\displaystyle\nabla\zeta(P_{A})\perp\Delta(\mathcal{A})∇ italic_ζ ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) ⟂ roman_Δ ( caligraphic_A ) C,ζ(PA)=(C,,C)absentformulae-sequence𝐶𝜁subscript𝑃𝐴𝐶𝐶\displaystyle\Longleftrightarrow\exists C\in\mathbb{R},\,\nabla\zeta(P_{A})=(C% ,...,C)⟺ ∃ italic_C ∈ blackboard_R , ∇ italic_ζ ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) = ( italic_C , … , italic_C ) (272)
C,(logPA(a)+wa)a𝒜=(C,,C)absentformulae-sequencesuperscript𝐶subscriptsubscript𝑃𝐴𝑎subscript𝑤𝑎𝑎𝒜superscript𝐶superscript𝐶\displaystyle\Longleftrightarrow\exists C^{\prime}\in\mathbb{R},\,(-\log P_{A}% (a)+w_{a})_{a\in\mathcal{A}}=(C^{\prime},...,C^{\prime})⟺ ∃ italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ blackboard_R , ( - roman_log italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) + italic_w start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT = ( italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , … , italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) (273)
C,PA=2C(2wa)a𝒜absentformulae-sequencesuperscript𝐶subscript𝑃𝐴superscript2superscript𝐶subscriptsuperscript2subscript𝑤𝑎𝑎𝒜\displaystyle\Longleftrightarrow\exists C^{\prime}\in\mathbb{R},\,P_{A}=2^{-C^% {\prime}}\left(2^{w_{a}}\right)_{a\in\mathcal{A}}⟺ ∃ italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ blackboard_R , italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT = 2 start_POSTSUPERSCRIPT - italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ( 2 start_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT (274)

The value of Csuperscript𝐶C^{\prime}italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT can be deduced from the fact that PAsubscript𝑃𝐴P_{A}italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT is a probability distribution: 2Csuperscript2superscript𝐶2^{C^{\prime}}2 start_POSTSUPERSCRIPT italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT is the normalization constant a𝒜2wasubscriptsuperscript𝑎𝒜superscript2subscript𝑤superscript𝑎\sum_{a^{\prime}\in\mathcal{A}}2^{w_{a^{\prime}}}∑ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. Hence the maximum of ζ𝜁\zetaitalic_ζ writes

PA=(2waa𝒜2wa)a𝒜;subscriptsuperscript𝑃𝐴subscriptsuperscript2subscript𝑤𝑎subscriptsuperscript𝑎𝒜superscript2subscript𝑤superscript𝑎𝑎𝒜\displaystyle P^{\star}_{A}=\left(\frac{2^{w_{a}}}{\sum_{a^{\prime}\in\mathcal% {A}}2^{w_{a^{\prime}}}}\right)_{a\in\mathcal{A}};italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT = ( divide start_ARG 2 start_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT ; (275)

and we have

ζ(PA)𝜁subscriptsuperscript𝑃𝐴\displaystyle\zeta(P^{\star}_{A})italic_ζ ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) =a𝒜PA(a)(log(a𝒜2wa2wa)+wa)absentsubscript𝑎𝒜subscriptsuperscript𝑃𝐴𝑎subscriptsuperscript𝑎𝒜superscript2subscript𝑤superscript𝑎superscript2subscript𝑤𝑎subscript𝑤𝑎\displaystyle=\sum_{a\in\mathcal{A}}\ P^{\star}_{A}(a)\left(\log\left(\frac{% \sum_{a^{\prime}\in\mathcal{A}}2^{w_{a^{\prime}}}}{2^{w_{a}}}\right)+w_{a}\right)= ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) ( roman_log ( divide start_ARG ∑ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG 2 start_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG ) + italic_w start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) (276)
=log(a𝒵2wa).absentsubscriptsuperscript𝑎𝒵superscript2subscript𝑤superscript𝑎\displaystyle=\log\left(\sum_{a^{\prime}\in\mathcal{Z}}2^{w_{a^{\prime}}}% \right).= roman_log ( ∑ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_Z end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) . (277)

Appendix G Proof of Theorem 12

We prove Theorem 12 in two steps, which are Lemma 14 and Lemma 15. The proofs are respectively given in App. G-A and G-B.

Lemma 14.
C0(a𝒜Ga)=a𝒜C0(Ga)subscript𝐶0subscript𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscript𝐶0subscript𝐺𝑎\displaystyle C_{0}\left(\bigwedge_{a\in\mathcal{A}}G_{a}\right)=\sum_{a\in% \mathcal{A}}C_{0}(G_{a})italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) (278)
\displaystyle\Longrightarrow\; (PXa)a𝒜a𝒜𝒫(Ga),{a𝒜PXa𝒫(a𝒜Ga) and C(a𝒜Ga,a𝒜PXa)=a𝒜C(Ga,PXa).for-allsubscriptsubscriptsuperscript𝑃subscript𝑋𝑎𝑎𝒜subscriptproduct𝑎𝒜superscript𝒫subscript𝐺𝑎casessubscripttensor-product𝑎𝒜subscriptsuperscript𝑃subscript𝑋𝑎superscript𝒫subscript𝑎𝒜subscript𝐺𝑎 and otherwise𝐶subscript𝑎𝒜subscript𝐺𝑎subscripttensor-product𝑎𝒜subscriptsuperscript𝑃subscript𝑋𝑎subscript𝑎𝒜𝐶subscript𝐺𝑎subscriptsuperscript𝑃subscript𝑋𝑎otherwise\displaystyle\forall(P^{\star}_{X_{a}})_{a\in\mathcal{A}}\in\prod_{a\in% \mathcal{A}}\mathcal{P}^{\star}(G_{a}),\begin{cases}\bigotimes_{a\in\mathcal{A% }}P^{\star}_{X_{a}}\in\mathcal{P}^{\star}\left(\bigwedge_{a\in\mathcal{A}}G_{a% }\right)\text{ and }\\ C\left(\bigwedge_{a\in\mathcal{A}}G_{a},\;\bigotimes_{a\in\mathcal{A}}P^{\star% }_{X_{a}}\right)=\sum_{a\in\mathcal{A}}C(G_{a},P^{\star}_{X_{a}}).\end{cases}∀ ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT ∈ ∏ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , { start_ROW start_CELL ⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) and end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL italic_C ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) . end_CELL start_CELL end_CELL end_ROW (279)
Lemma 15.

For all PX1,,X|𝒜|𝒫(a𝒜Ga)subscript𝑃subscript𝑋1subscript𝑋𝒜superscript𝒫subscript𝑎𝒜subscript𝐺𝑎P_{X_{1},...,X_{|\mathcal{A}|}}\in\mathcal{P}^{\star}\left(\bigwedge_{a\in% \mathcal{A}}G_{a}\right)italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ), the following holds

C(a𝒜Ga,PX1,,X|𝒜|)=a𝒜C(Ga,PXa)𝐶subscript𝑎𝒜subscript𝐺𝑎subscript𝑃subscript𝑋1subscript𝑋𝒜subscript𝑎𝒜𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎\displaystyle C\left(\bigwedge_{a\in\mathcal{A}}G_{a},\;P_{X_{1},...,X_{|% \mathcal{A}|}}\right)=\sum_{a\in\mathcal{A}}C(G_{a},P_{X_{a}})italic_C ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (280)
\displaystyle\Longrightarrow\; C0(a𝒜Ga)=a𝒜C0(Ga) and a𝒜,PXa𝒫(Ga).formulae-sequencesubscript𝐶0subscript𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscript𝐶0subscript𝐺𝑎 and for-all𝑎𝒜subscript𝑃subscript𝑋𝑎superscript𝒫subscript𝐺𝑎\displaystyle C_{0}\left(\bigwedge_{a\in\mathcal{A}}G_{a}\right)=\sum_{a\in% \mathcal{A}}C_{0}(G_{a})\text{ and }\forall a\in\mathcal{A},\;P_{X_{a}}\in% \mathcal{P}^{\star}(G_{a}).italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) and ∀ italic_a ∈ caligraphic_A , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) . (281)

Let us prove Theorem 12. We consider a family of distributions PXa𝒫(Ga)subscript𝑃subscript𝑋𝑎superscript𝒫subscript𝐺𝑎P_{X_{a}}\in\mathcal{P}^{\star}(G_{a})italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) with a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A. By Lemma 14, we have

C0(a𝒜Ga)=a𝒜C0(Ga)subscript𝐶0subscript𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscript𝐶0subscript𝐺𝑎\displaystyle C_{0}\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}\right)=% \textstyle\sum_{a\in\mathcal{A}}C_{0}(G_{a})italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) (282)
\displaystyle\Longrightarrow\; a𝒜PXa𝒫(a𝒜Ga) and C(a𝒜Ga,a𝒜PXa)=a𝒜C(Ga,PXa)subscripttensor-product𝑎𝒜subscriptsuperscript𝑃subscript𝑋𝑎superscript𝒫subscript𝑎𝒜subscript𝐺𝑎 and 𝐶subscript𝑎𝒜subscript𝐺𝑎subscripttensor-product𝑎𝒜subscriptsuperscript𝑃subscript𝑋𝑎subscript𝑎𝒜𝐶subscript𝐺𝑎subscriptsuperscript𝑃subscript𝑋𝑎\displaystyle\textstyle\bigotimes_{a\in\mathcal{A}}P^{\star}_{X_{a}}\in% \mathcal{P}^{\star}\left(\bigwedge_{a\in\mathcal{A}}G_{a}\right)\text{ and }C% \left(\bigwedge_{a\in\mathcal{A}}G_{a},\;\bigotimes_{a\in\mathcal{A}}P^{\star}% _{X_{a}}\right)=\sum_{a\in\mathcal{A}}C(G_{a},P^{\star}_{X_{a}})⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) and italic_C ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (283)
\displaystyle\Longrightarrow\; PX1,,X|𝒜|𝒫(a𝒜Ga),C(a𝒜Ga,PX1,,X|𝒜|)=a𝒜C(Ga,PXa).formulae-sequencesubscript𝑃subscript𝑋1subscript𝑋𝒜superscript𝒫subscript𝑎𝒜subscript𝐺𝑎𝐶subscript𝑎𝒜subscript𝐺𝑎subscript𝑃subscript𝑋1subscript𝑋𝒜subscript𝑎𝒜𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎\displaystyle\exists P_{X_{1},...,X_{|\mathcal{A}|}}\in\mathcal{P}^{\star}% \left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}\right)\!,\;\;C\left(% \textstyle\bigwedge_{a\in\mathcal{A}}G_{a},\;P_{X_{1},...,X_{|\mathcal{A}|}}% \right)=\textstyle\sum_{a\in\mathcal{A}}C(G_{a},P_{X_{a}}).∃ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , italic_C ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) . (284)

Conversely, by Lemma 15 we have

PX1,,X|𝒜|𝒫(a𝒜Ga),C(a𝒜Ga,PX1,,X|𝒜|)=a𝒜C(Ga,PXa)formulae-sequencesubscript𝑃subscript𝑋1subscript𝑋𝒜superscript𝒫subscript𝑎𝒜subscript𝐺𝑎𝐶subscript𝑎𝒜subscript𝐺𝑎subscript𝑃subscript𝑋1subscript𝑋𝒜subscript𝑎𝒜𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎\displaystyle\exists P_{X_{1},...,X_{|\mathcal{A}|}}\in\mathcal{P}^{\star}% \left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}\right)\!,\;\;C\left(% \textstyle\bigwedge_{a\in\mathcal{A}}G_{a},\;P_{X_{1},...,X_{|\mathcal{A}|}}% \right)=\textstyle\sum_{a\in\mathcal{A}}C(G_{a},P_{X_{a}})∃ italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , italic_C ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (285)
\displaystyle\Longrightarrow\; C0(a𝒜Ga)=a𝒜C0(Ga).subscript𝐶0subscript𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscript𝐶0subscript𝐺𝑎\displaystyle C_{0}\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}\right)=% \textstyle\sum_{a\in\mathcal{A}}C_{0}(G_{a}).italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) . (286)

Moreover, all distribution PX1,,X|𝒜|subscript𝑃subscript𝑋1subscript𝑋𝒜P_{X_{1},...,X_{|\mathcal{A}|}}italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT that satisfies (285), also satisfies PXa𝒫(Ga)subscript𝑃subscript𝑋𝑎superscript𝒫subscript𝐺𝑎P_{X_{a}}\in\mathcal{P}^{\star}(G_{a})italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ), for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A.

G-A Proof of Lemma 14

For all family of graphs (Ga)a𝒜subscriptsubscript𝐺𝑎𝑎𝒜(G_{a})_{a\in\mathcal{A}}( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT, and family of distributions PXa𝒫(Ga)subscript𝑃subscript𝑋𝑎superscript𝒫subscript𝐺𝑎P_{X_{a}}\in\mathcal{P}^{\star}(G_{a})italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) with a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A, we have

C0(a𝒜Ga)subscript𝐶0subscript𝑎𝒜subscript𝐺𝑎\displaystyle C_{0}\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}\right)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) =maxPX1,,X|𝒜|C(a𝒜Ga,PX1,,X|𝒜|)absentsubscriptsubscript𝑃subscript𝑋1subscript𝑋𝒜𝐶subscript𝑎𝒜subscript𝐺𝑎subscript𝑃subscript𝑋1subscript𝑋𝒜\displaystyle=\max_{P_{X_{1},...,X_{|\mathcal{A}|}}}\;C\left(\textstyle% \bigwedge_{a\in\mathcal{A}}G_{a},\;P_{X_{1},...,X_{|\mathcal{A}|}}\right)= roman_max start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_C ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (287)
C(a𝒜Ga,a𝒜PXa)absent𝐶subscript𝑎𝒜subscript𝐺𝑎subscripttensor-product𝑎𝒜subscriptsuperscript𝑃subscript𝑋𝑎\displaystyle\geq C\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a},\;% \bigotimes_{a\in\mathcal{A}}P^{\star}_{X_{a}}\right)≥ italic_C ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (288)
a𝒜C(Ga,PXa)absentsubscript𝑎𝒜𝐶subscript𝐺𝑎subscriptsuperscript𝑃subscript𝑋𝑎\displaystyle\geq\textstyle\sum_{a\in\mathcal{A}}C(G_{a},P^{\star}_{X_{a}})≥ ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (289)
=a𝒜C0(Ga);absentsubscript𝑎𝒜subscript𝐶0subscript𝐺𝑎\displaystyle=\textstyle\sum_{a\in\mathcal{A}}C_{0}(G_{a});= ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) ; (290)

where (287) comes from Theorem 10; (289) comes from Proposition 5; and (290) follows from the hypothesis PXa𝒫(Ga)subscript𝑃subscript𝑋𝑎superscript𝒫subscript𝐺𝑎P_{X_{a}}\in\mathcal{P}^{\star}(G_{a})italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) with a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A.

Now assume that a𝒜C0(Ga)=C0(a𝒜Ga)subscript𝑎𝒜subscript𝐶0subscript𝐺𝑎subscript𝐶0subscript𝑎𝒜subscript𝐺𝑎\sum_{a\in\mathcal{A}}C_{0}(G_{a})=C_{0}\left(\bigwedge_{a\in\mathcal{A}}G_{a}\right)∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ), then equality holds between the left-hand side of (287) and the term in (290). Therefore, we have

C0(a𝒜Ga)=C(a𝒜Ga,a𝒜PXa), hence a𝒜PXa𝒫(a𝒜Ga);formulae-sequencesubscript𝐶0subscript𝑎𝒜subscript𝐺𝑎𝐶subscript𝑎𝒜subscript𝐺𝑎subscripttensor-product𝑎𝒜subscriptsuperscript𝑃subscript𝑋𝑎 hence subscripttensor-product𝑎𝒜subscriptsuperscript𝑃subscript𝑋𝑎superscript𝒫subscript𝑎𝒜subscript𝐺𝑎\displaystyle C_{0}\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}\right)=C% \left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a},\;\bigotimes_{a\in\mathcal{A}% }P^{\star}_{X_{a}}\right),\text{ hence }\textstyle\bigotimes_{a\in\mathcal{A}}% P^{\star}_{X_{a}}\in\mathcal{P}^{\star}\left(\textstyle\bigwedge_{a\in\mathcal% {A}}G_{a}\right);italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = italic_C ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) , hence ⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) ; (291)
C(a𝒜Ga,a𝒜PXa)=a𝒜C(Ga,PXa).𝐶subscript𝑎𝒜subscript𝐺𝑎subscripttensor-product𝑎𝒜subscriptsuperscript𝑃subscript𝑋𝑎subscript𝑎𝒜𝐶subscript𝐺𝑎subscriptsuperscript𝑃subscript𝑋𝑎\displaystyle C\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a},\;\bigotimes_{% a\in\mathcal{A}}P^{\star}_{X_{a}}\right)=\textstyle\sum_{a\in\mathcal{A}}C(G_{% a},P^{\star}_{X_{a}}).italic_C ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) . (292)

G-B Proof of Lemma 15

Let PX1,,X|𝒜|𝒫(a𝒜Ga)subscript𝑃subscript𝑋1subscript𝑋𝒜superscript𝒫subscript𝑎𝒜subscript𝐺𝑎P_{X_{1},...,X_{|\mathcal{A}|}}\in\mathcal{P}^{\star}(\bigwedge_{a\in\mathcal{% A}}G_{a})italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ), and let PXa𝒫(Ga)subscriptsuperscript𝑃subscript𝑋𝑎superscript𝒫subscript𝐺𝑎P^{\star}_{X_{a}}\in\mathcal{P}^{\star}(G_{a})italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ), for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A. The following holds

C0(a𝒜Ga)subscript𝐶0subscript𝑎𝒜subscript𝐺𝑎\displaystyle C_{0}\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}\right)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) =C(a𝒜Ga,PX1,,X|𝒜|)absent𝐶subscript𝑎𝒜subscript𝐺𝑎subscript𝑃subscript𝑋1subscript𝑋𝒜\displaystyle=C\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a},\;P_{X_{1},...% ,X_{|\mathcal{A}|}}\right)= italic_C ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (293)
C(a𝒜Ga,a𝒜PXa)absent𝐶subscript𝑎𝒜subscript𝐺𝑎subscripttensor-product𝑎𝒜subscriptsuperscript𝑃subscript𝑋𝑎\displaystyle\geq C\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a},\;% \textstyle\bigotimes_{a\in\mathcal{A}}P^{\star}_{X_{a}}\right)≥ italic_C ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ⨂ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (294)
a𝒜C(Ga,PXa)absentsubscript𝑎𝒜𝐶subscript𝐺𝑎subscriptsuperscript𝑃subscript𝑋𝑎\displaystyle\geq\textstyle\sum_{a\in\mathcal{A}}C(G_{a},P^{\star}_{X_{a}})≥ ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (295)
a𝒜C(Ga,PXa);absentsubscript𝑎𝒜𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎\displaystyle\geq\textstyle\sum_{a\in\mathcal{A}}C(G_{a},P_{X_{a}});≥ ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ; (296)

where (294) comes from the hypothesis PX1,,X|𝒜|𝒫(a𝒜Ga)subscript𝑃subscript𝑋1subscript𝑋𝒜superscript𝒫subscript𝑎𝒜subscript𝐺𝑎P_{X_{1},...,X_{|\mathcal{A}|}}\in\mathcal{P}^{\star}(\bigwedge_{a\in\mathcal{% A}}G_{a})italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ); (295) comes from Proposition 5; and (296) comes from the hypothesis PXa𝒫(Ga)subscriptsuperscript𝑃subscript𝑋𝑎superscript𝒫subscript𝐺𝑎P^{\star}_{X_{a}}\in\mathcal{P}^{\star}(G_{a})italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ), for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A.

Now assume that

C(a𝒜Ga,PX1,,X|𝒜|)=a𝒜C(Ga,PXa).𝐶subscript𝑎𝒜subscript𝐺𝑎subscript𝑃subscript𝑋1subscript𝑋𝒜subscript𝑎𝒜𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎\displaystyle C\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a},\;P_{X_{1},...% ,X_{|\mathcal{A}|}}\right)=\textstyle\sum_{a\in\mathcal{A}}C(G_{a},P_{X_{a}}).italic_C ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) . (297)

Then equality holds in between the right-hand side of (293) and the term in (296). In particular, we have for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A

C(Ga,PXa)=C(Ga,PXa),𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎𝐶subscript𝐺𝑎subscriptsuperscript𝑃subscript𝑋𝑎\displaystyle C(G_{a},P_{X_{a}})=C(G_{a},P^{\star}_{X_{a}}),italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) , (298)

which implies that PXasubscript𝑃subscript𝑋𝑎P_{X_{a}}italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT also maximizes C(Ga,)𝐶subscript𝐺𝑎C(G_{a},\cdot)italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ⋅ ) for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A:

a𝒜,PXa𝒫(Ga).formulae-sequencefor-all𝑎𝒜subscript𝑃subscript𝑋𝑎superscript𝒫subscript𝐺𝑎\displaystyle\forall a\in\mathcal{A},\;P_{X_{a}}\in\mathcal{P}^{\star}(G_{a}).∀ italic_a ∈ caligraphic_A , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) . (299)

Furthermore,

C0(a𝒜Ga)subscript𝐶0subscript𝑎𝒜subscript𝐺𝑎\displaystyle C_{0}\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a}\right)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) =C(a𝒜Ga,PX1,,X|𝒜|)absent𝐶subscript𝑎𝒜subscript𝐺𝑎subscript𝑃subscript𝑋1subscript𝑋𝒜\displaystyle=C\left(\textstyle\bigwedge_{a\in\mathcal{A}}G_{a},\;P_{X_{1},...% ,X_{|\mathcal{A}|}}\right)= italic_C ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT | caligraphic_A | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (300)
=a𝒜C(Ga,PXa)absentsubscript𝑎𝒜𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎\displaystyle=\textstyle\sum_{a\in\mathcal{A}}C(G_{a},P_{X_{a}})= ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (301)
=a𝒜C0(Ga);absentsubscript𝑎𝒜subscript𝐶0subscript𝐺𝑎\displaystyle=\textstyle\sum_{a\in\mathcal{A}}C_{0}(G_{a});= ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) ; (302)

where (301) is a consequence of the equality in equations (293)-(296), and (302) comes from (299).

Appendix H Proof of Theorem 13

The techniques used in this proof are the same as in the proof of Theorem 12. We prove Theorem 13 in two steps, which are Lemma 16 and Lemma 17; their proofs are respectively given in App. H-A and H-B.

Lemma 16.

Let

PA(2C0(Ga)a𝒜2C0(Ga))a𝒜,approaches-limitsubscriptsuperscript𝑃𝐴subscriptsuperscript2subscript𝐶0subscript𝐺𝑎subscriptsuperscript𝑎𝒜superscript2subscript𝐶0subscript𝐺superscript𝑎𝑎𝒜\displaystyle P^{\star}_{A}\doteq\left(\frac{2^{C_{0}(G_{a})}}{\sum_{a^{\prime% }\in\mathcal{A}}2^{C_{0}(G_{a^{\prime}})}}\right)_{a\in\mathcal{A}},italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ≐ ( divide start_ARG 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT , (303)

we have

C0(a𝒜Ga)=log(a𝒜2C0(Ga))subscript𝐶0subscriptsquare-union𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜superscript2subscript𝐶0subscript𝐺𝑎\displaystyle C_{0}\left(\bigsqcup_{a\in\mathcal{A}}G_{a}\right)=\log\left(% \sum_{a\in\mathcal{A}}2^{C_{0}(G_{a})}\right)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = roman_log ( ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT )
\displaystyle\Longrightarrow\; (PXa)a𝒜a𝒜𝒫(Ga),a𝒜PA(a)PXa𝒫(a𝒜Ga) andformulae-sequencefor-allsubscriptsubscriptsuperscript𝑃subscript𝑋𝑎𝑎𝒜subscriptproduct𝑎𝒜superscript𝒫subscript𝐺𝑎subscript𝑎𝒜subscriptsuperscript𝑃𝐴𝑎subscriptsuperscript𝑃subscript𝑋𝑎superscript𝒫subscriptsquare-union𝑎𝒜subscript𝐺𝑎 and\displaystyle\forall(P^{\star}_{X_{a}})_{a\in\mathcal{A}}\in\prod_{a\in% \mathcal{A}}\mathcal{P}^{\star}(G_{a}),\;\sum_{a\in\mathcal{A}}P^{\star}_{A}(a% )P^{\star}_{X_{a}}\in\mathcal{P}^{\star}\left(\bigsqcup_{a\in\mathcal{A}}G_{a}% \right)\text{ and }∀ ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT ∈ ∏ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) and
C(a𝒜PAGa,a𝒜PA(a)PXa)=H(PA)+a𝒜PA(a)C(Ga,PXa),𝐶subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscriptsuperscript𝑃𝐴𝑎subscriptsuperscript𝑃subscript𝑋𝑎𝐻subscriptsuperscript𝑃𝐴subscript𝑎𝒜subscriptsuperscript𝑃𝐴𝑎𝐶subscript𝐺𝑎subscriptsuperscript𝑃subscript𝑋𝑎\displaystyle C\left(\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a},\;\sum_{a\in% \mathcal{A}}P^{\star}_{A}(a)P^{\star}_{X_{a}}\right)=H(P^{\star}_{A})+\sum_{a% \in\mathcal{A}}P^{\star}_{A}(a)C(G_{a},P^{\star}_{X_{a}}),italic_C ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = italic_H ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) , (304)
Lemma 17.

Let

PA(2C0(Ga)a𝒜2C0(Ga))a𝒜,approaches-limitsubscriptsuperscript𝑃𝐴subscriptsuperscript2subscript𝐶0subscript𝐺𝑎subscriptsuperscript𝑎𝒜superscript2subscript𝐶0subscript𝐺superscript𝑎𝑎𝒜\displaystyle P^{\star}_{A}\doteq\left(\frac{2^{C_{0}(G_{a})}}{\sum_{a^{\prime% }\in\mathcal{A}}2^{C_{0}(G_{a^{\prime}})}}\right)_{a\in\mathcal{A}},italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ≐ ( divide start_ARG 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT , (305)

for all a𝒜PA(a)PXa𝒫(a𝒜Ga)subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎superscript𝒫subscriptsquare-union𝑎𝒜subscript𝐺𝑎\sum_{a\in\mathcal{A}}P_{A}(a)P_{X_{a}}\in\mathcal{P}^{\star}\left(\bigsqcup_{% a\in\mathcal{A}}G_{a}\right)∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) the following holds

C(a𝒜PAGa,a𝒜PA(a)PXa)=H(PA)+a𝒜PA(a)C(Ga,PXa)𝐶subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎\displaystyle C\left(\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a},\;\sum_{a\in% \mathcal{A}}P_{A}(a)P_{X_{a}}\right)=H(P_{A})+\sum_{a\in\mathcal{A}}P_{A}(a)C(% G_{a},P_{X_{a}})italic_C ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT )
\displaystyle\Longrightarrow\; C0(a𝒜Ga)=log(a𝒜2C0(Ga)),(PXa)a𝒜a𝒜𝒫(Ga), and PA=PA.formulae-sequencesubscript𝐶0subscriptsquare-union𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜superscript2subscript𝐶0subscript𝐺𝑎formulae-sequencesubscriptsubscript𝑃subscript𝑋𝑎𝑎𝒜subscriptproduct𝑎𝒜superscript𝒫subscript𝐺𝑎 and subscript𝑃𝐴subscriptsuperscript𝑃𝐴\displaystyle C_{0}\left(\bigsqcup_{a\in\mathcal{A}}G_{a}\right)=\log\left(% \sum_{a\in\mathcal{A}}2^{C_{0}(G_{a})}\right)\!,\,(P_{X_{a}})_{a\in\mathcal{A}% }\in\prod_{a\in\mathcal{A}}\mathcal{P}^{\star}(G_{a}),\text{ and }P_{A}=P^{% \star}_{A}.italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = roman_log ( ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) , ( italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT ∈ ∏ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , and italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT = italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT . (306)

Now let us prove Theorem 13. Let (PXa)a𝒜a𝒜𝒫(Ga)subscriptsubscriptsuperscript𝑃subscript𝑋𝑎𝑎𝒜subscriptproduct𝑎𝒜superscript𝒫subscript𝐺𝑎(P^{\star}_{X_{a}})_{a\in\mathcal{A}}\in\prod_{a\in\mathcal{A}}\mathcal{P}^{% \star}(G_{a})( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT ∈ ∏ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ), we have by Lemma 16

C0(a𝒜Ga)=log(a𝒜2C0(Ga))subscript𝐶0subscriptsquare-union𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜superscript2subscript𝐶0subscript𝐺𝑎\displaystyle C_{0}\left(\textstyle\bigsqcup_{a\in\mathcal{A}}G_{a}\right)=% \log\left(\textstyle\sum_{a\in\mathcal{A}}2^{C_{0}(G_{a})}\right)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = roman_log ( ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) (307)
\displaystyle\Longrightarrow\; a𝒜PA(a)PXa𝒫(a𝒜Ga) andsubscript𝑎𝒜subscriptsuperscript𝑃𝐴𝑎subscriptsuperscript𝑃subscript𝑋𝑎superscript𝒫subscriptsquare-union𝑎𝒜subscript𝐺𝑎 and\displaystyle\textstyle\sum_{a\in\mathcal{A}}P^{\star}_{A}(a)P^{\star}_{X_{a}}% \in\mathcal{P}^{\star}\left(\textstyle\bigsqcup_{a\in\mathcal{A}}G_{a}\right)% \text{ and }∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) and
C(a𝒜PAGa,a𝒜PA(a)PXa)=H(PA)+a𝒜PA(a)C(Ga,PXa),𝐶subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscriptsuperscript𝑃𝐴𝑎subscriptsuperscript𝑃subscript𝑋𝑎𝐻subscriptsuperscript𝑃𝐴subscript𝑎𝒜subscriptsuperscript𝑃𝐴𝑎𝐶subscript𝐺𝑎subscriptsuperscript𝑃subscript𝑋𝑎\displaystyle C\left(\textstyle\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a},\;% \textstyle\sum_{a\in\mathcal{A}}P^{\star}_{A}(a)P^{\star}_{X_{a}}\right)=H(P^{% \star}_{A})+\textstyle\sum_{a\in\mathcal{A}}P^{\star}_{A}(a)C(G_{a},P^{\star}_% {X_{a}}),italic_C ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = italic_H ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) , (308)
\displaystyle\Longrightarrow\; PX𝒫(a𝒜Ga),subscript𝑃𝑋superscript𝒫subscriptsquare-union𝑎𝒜subscript𝐺𝑎\displaystyle\exists P_{X}\in\mathcal{P}^{\star}\left(\textstyle\bigsqcup_{a% \in\mathcal{A}}G_{a}\right)\!,\,∃ italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) ,
C(a𝒜PAGa,a𝒜PA(a)PXa)=H(PA)+a𝒜PA(a)C(Ga,PXa),𝐶subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscriptsuperscript𝑃𝐴𝑎subscriptsuperscript𝑃subscript𝑋𝑎𝐻subscriptsuperscript𝑃𝐴subscript𝑎𝒜subscriptsuperscript𝑃𝐴𝑎𝐶subscript𝐺𝑎subscriptsuperscript𝑃subscript𝑋𝑎\displaystyle C\left(\textstyle\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a},\;% \textstyle\sum_{a\in\mathcal{A}}P^{\star}_{A}(a)P^{\star}_{X_{a}}\right)=H(P^{% \star}_{A})+\textstyle\sum_{a\in\mathcal{A}}P^{\star}_{A}(a)C(G_{a},P^{\star}_% {X_{a}}),italic_C ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = italic_H ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ,

where PXa=PX|X𝒳asubscriptsuperscript𝑃subscript𝑋𝑎subscript𝑃conditional𝑋𝑋subscript𝒳𝑎P^{\star}_{X_{a}}=P_{X|X\in\mathcal{X}_{a}}italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_P start_POSTSUBSCRIPT italic_X | italic_X ∈ caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT and PA(a)=PX(𝒳a)subscriptsuperscript𝑃𝐴𝑎subscript𝑃𝑋subscript𝒳𝑎P^{\star}_{A}(a)=P_{X}(\mathcal{X}_{a})italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) = italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A.

Conversely, by Lemma 17 we have

PX𝒫(a𝒜Ga),subscript𝑃𝑋superscript𝒫subscriptsquare-union𝑎𝒜subscript𝐺𝑎\displaystyle\exists P_{X}\in\mathcal{P}^{\star}\left(\textstyle\bigsqcup_{a% \in\mathcal{A}}G_{a}\right)\!,∃ italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , (309)
C(a𝒜PAGa,a𝒜PA(a)PXa)=H(PA)+a𝒜PA(a)C(Ga,PXa)𝐶subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎\displaystyle C\left(\textstyle\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a},\;% \textstyle\sum_{a\in\mathcal{A}}P_{A}(a)P_{X_{a}}\right)=H(P_{A})+\textstyle% \sum_{a\in\mathcal{A}}P_{A}(a)C(G_{a},P_{X_{a}})italic_C ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT )
\displaystyle\Longrightarrow\; C0(a𝒜Ga)=log(a𝒜2C0(Ga)),subscript𝐶0subscriptsquare-union𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜superscript2subscript𝐶0subscript𝐺𝑎\displaystyle C_{0}\left(\textstyle\bigsqcup_{a\in\mathcal{A}}G_{a}\right)=% \log\left(\textstyle\sum_{a\in\mathcal{A}}2^{C_{0}(G_{a})}\right),italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = roman_log ( ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) , (310)

and any PX=a𝒜PA(a)PXasubscript𝑃𝑋subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎P_{X}=\sum_{a\in\mathcal{A}}P_{A}(a)P_{X_{a}}italic_P start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT that satisfies (309) also satisfies

(PXa)a𝒜a𝒜𝒫(Ga), and PA=(2C0(Ga)a𝒜2C0(Ga))a𝒜.formulae-sequencesubscriptsubscript𝑃subscript𝑋𝑎𝑎𝒜subscriptproduct𝑎𝒜superscript𝒫subscript𝐺𝑎 and subscript𝑃𝐴subscriptsuperscript2subscript𝐶0subscript𝐺𝑎subscriptsuperscript𝑎𝒜superscript2subscript𝐶0subscript𝐺superscript𝑎𝑎𝒜\displaystyle(P_{X_{a}})_{a\in\mathcal{A}}\in\prod_{a\in\mathcal{A}}\mathcal{P% }^{\star}(G_{a}),\text{ and }P_{A}=\left(\frac{2^{C_{0}(G_{a})}}{\sum_{a^{% \prime}\in\mathcal{A}}2^{C_{0}(G_{a^{\prime}})}}\right)_{a\in\mathcal{A}}.( italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT ∈ ∏ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , and italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT = ( divide start_ARG 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT . (311)

H-A Proof of Lemma 16

Now let us prove Lemma 16. Let

a𝒜PA(a)PXa𝒫(a𝒜Ga),subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎superscript𝒫subscriptsquare-union𝑎𝒜subscript𝐺𝑎\displaystyle\textstyle\sum_{a\in\mathcal{A}}P_{A}(a)P_{X_{a}}\in\mathcal{P}^{% \star}\left(\textstyle\bigsqcup_{a\in\mathcal{A}}G_{a}\right),∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , (312)
(PXa)a𝒜a𝒜𝒫(Ga),subscriptsubscriptsuperscript𝑃subscript𝑋𝑎𝑎𝒜subscriptproduct𝑎𝒜superscript𝒫subscript𝐺𝑎\displaystyle(P^{\star}_{X_{a}})_{a\in\mathcal{A}}\in\textstyle\prod_{a\in% \mathcal{A}}\mathcal{P}^{\star}(G_{a}),( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT ∈ ∏ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , (313)
PA(2C0(Ga)a𝒜2C0(Ga))a𝒜.approaches-limitsubscriptsuperscript𝑃𝐴subscriptsuperscript2subscript𝐶0subscript𝐺𝑎subscriptsuperscript𝑎𝒜superscript2subscript𝐶0subscript𝐺superscript𝑎𝑎𝒜\displaystyle P^{\star}_{A}\doteq\left(\frac{2^{C_{0}(G_{a})}}{\sum_{a^{\prime% }\in\mathcal{A}}2^{C_{0}(G_{a^{\prime}})}}\right)_{a\in\mathcal{A}}.italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ≐ ( divide start_ARG 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT . (314)

We have

C0(a𝒜Ga)subscript𝐶0subscriptsquare-union𝑎𝒜subscript𝐺𝑎\displaystyle C_{0}\left(\textstyle\bigsqcup_{a\in\mathcal{A}}G_{a}\right)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) =C(a𝒜PAGa,a𝒜PA(a)PXa)absent𝐶subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎\displaystyle=C\left(\textstyle\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a},\;\sum% _{a\in\mathcal{A}}P_{A}(a)P_{X_{a}}\right)= italic_C ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (315)
C(a𝒜PAGa,a𝒜PA(a)PXa)absent𝐶subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscriptsuperscript𝑃𝐴𝑎subscriptsuperscript𝑃subscript𝑋𝑎\displaystyle\geq C\left(\textstyle\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a},\;% \sum_{a\in\mathcal{A}}P^{\star}_{A}(a)P^{\star}_{X_{a}}\right)≥ italic_C ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (316)
H(PA)+a𝒜PA(a)C(Ga,PXa)absent𝐻subscriptsuperscript𝑃𝐴subscript𝑎𝒜subscriptsuperscript𝑃𝐴𝑎𝐶subscript𝐺𝑎subscriptsuperscript𝑃subscript𝑋𝑎\displaystyle\geq H(P^{\star}_{A})+\textstyle\sum_{a\in\mathcal{A}}P^{\star}_{% A}(a)C(G_{a},P^{\star}_{X_{a}})≥ italic_H ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (317)
=H(PA)+a𝒜PA(a)C0(Ga)absent𝐻subscriptsuperscript𝑃𝐴subscript𝑎𝒜subscriptsuperscript𝑃𝐴𝑎subscript𝐶0subscript𝐺𝑎\displaystyle=H(P^{\star}_{A})+\textstyle\sum_{a\in\mathcal{A}}P^{\star}_{A}(a% )C_{0}(G_{a})= italic_H ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) (318)
=log(a𝒜2C0(Ga));absentsubscript𝑎𝒜superscript2subscript𝐶0subscript𝐺𝑎\displaystyle=\log\left(\textstyle\sum_{a\in\mathcal{A}}2^{C_{0}(G_{a})}\right);= roman_log ( ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) ; (319)

where (315) and (316) come from (312) and Proposition 6; (317) comes from Proposition 5; (318) comes from (313) and Proposition 6; and (319) comes from (314) and Lemma 5.

Assume that C0(a𝒜Ga)=log(a𝒜2C0(Ga))subscript𝐶0subscriptsquare-union𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜superscript2subscript𝐶0subscript𝐺𝑎C_{0}\left(\textstyle\bigsqcup_{a\in\mathcal{A}}G_{a}\right)=\log\left(% \textstyle\sum_{a\in\mathcal{A}}2^{C_{0}(G_{a})}\right)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = roman_log ( ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ), then equality holds in (315) to (319), therefore the following holds:

C0(a𝒜Ga)=log(a𝒜2C0(Ga))subscript𝐶0subscriptsquare-union𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜superscript2subscript𝐶0subscript𝐺𝑎\displaystyle C_{0}\left(\textstyle\bigsqcup_{a\in\mathcal{A}}G_{a}\right)=% \log\left(\textstyle\sum_{a\in\mathcal{A}}2^{C_{0}(G_{a})}\right)italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = roman_log ( ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT )
\displaystyle\Longrightarrow\; (PXa)a𝒜a𝒜𝒫(Ga),a𝒜PA(a)PXa𝒫(a𝒜Ga) andformulae-sequencefor-allsubscriptsubscriptsuperscript𝑃subscript𝑋𝑎𝑎𝒜subscriptproduct𝑎𝒜superscript𝒫subscript𝐺𝑎subscript𝑎𝒜subscriptsuperscript𝑃𝐴𝑎subscriptsuperscript𝑃subscript𝑋𝑎superscript𝒫subscriptsquare-union𝑎𝒜subscript𝐺𝑎 and\displaystyle\forall(P^{\star}_{X_{a}})_{a\in\mathcal{A}}\in\textstyle\prod_{a% \in\mathcal{A}}\mathcal{P}^{\star}(G_{a}),\;\textstyle\sum_{a\in\mathcal{A}}P^% {\star}_{A}(a)P^{\star}_{X_{a}}\in\mathcal{P}^{\star}\left(\textstyle\bigsqcup% _{a\in\mathcal{A}}G_{a}\right)\text{ and }∀ ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT ∈ ∏ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) and
C(a𝒜PAGa,a𝒜PA(a)PXa)=H(PA)+a𝒜PA(a)C(Ga,PXa).𝐶subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscriptsuperscript𝑃𝐴𝑎subscriptsuperscript𝑃subscript𝑋𝑎𝐻subscriptsuperscript𝑃𝐴subscript𝑎𝒜subscriptsuperscript𝑃𝐴𝑎𝐶subscript𝐺𝑎subscriptsuperscript𝑃subscript𝑋𝑎\displaystyle C\left(\textstyle\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a},\;\sum% _{a\in\mathcal{A}}P^{\star}_{A}(a)P^{\star}_{X_{a}}\right)=H(P^{\star}_{A})+% \textstyle\sum_{a\in\mathcal{A}}P^{\star}_{A}(a)C(G_{a},P^{\star}_{X_{a}}).italic_C ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = italic_H ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) . (320)

H-B Proof of Lemma 17

Let

a𝒜PA(a)PXa𝒫(a𝒜Ga),subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎superscript𝒫subscriptsquare-union𝑎𝒜subscript𝐺𝑎\displaystyle\textstyle\sum_{a\in\mathcal{A}}P_{A}(a)P_{X_{a}}\in\mathcal{P}^{% \star}\left(\textstyle\bigsqcup_{a\in\mathcal{A}}G_{a}\right),∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , (321)
(PXa)a𝒜a𝒜𝒫(Ga),subscriptsubscriptsuperscript𝑃subscript𝑋𝑎𝑎𝒜subscriptproduct𝑎𝒜superscript𝒫subscript𝐺𝑎\displaystyle(P^{\star}_{X_{a}})_{a\in\mathcal{A}}\in\textstyle\prod_{a\in% \mathcal{A}}\mathcal{P}^{\star}(G_{a}),( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT ∈ ∏ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , (322)
PA(2C0(Ga)a𝒜2C0(Ga))a𝒜.approaches-limitsubscriptsuperscript𝑃𝐴subscriptsuperscript2subscript𝐶0subscript𝐺𝑎subscriptsuperscript𝑎𝒜superscript2subscript𝐶0subscript𝐺superscript𝑎𝑎𝒜\displaystyle P^{\star}_{A}\doteq\left(\frac{2^{C_{0}(G_{a})}}{\sum_{a^{\prime% }\in\mathcal{A}}2^{C_{0}(G_{a^{\prime}})}}\right)_{a\in\mathcal{A}}.italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ≐ ( divide start_ARG 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT . (323)

We have

C(a𝒜PAGa,a𝒜PA(a)PXa)𝐶subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎\displaystyle C\left(\textstyle\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a},\;\sum% _{a\in\mathcal{A}}P_{A}(a)P_{X_{a}}\right)italic_C ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) =C0(a𝒜Ga)absentsubscript𝐶0subscriptsquare-union𝑎𝒜subscript𝐺𝑎\displaystyle=C_{0}\left(\textstyle\bigsqcup_{a\in\mathcal{A}}G_{a}\right)= italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) (324)
log(a𝒜2C0(Ga))absentsubscript𝑎𝒜superscript2subscript𝐶0subscript𝐺𝑎\displaystyle\geq\log\left(\textstyle\sum_{a\in\mathcal{A}}2^{C_{0}(G_{a})}\right)≥ roman_log ( ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) (325)
=H(PA)+a𝒜PA(a)C0(Ga)absent𝐻subscriptsuperscript𝑃𝐴subscript𝑎𝒜subscriptsuperscript𝑃𝐴𝑎subscript𝐶0subscript𝐺𝑎\displaystyle=H(P^{\star}_{A})+\textstyle\sum_{a\in\mathcal{A}}P^{\star}_{A}(a% )C_{0}(G_{a})= italic_H ( italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) (326)
H(PA)+a𝒜PA(a)C0(Ga)absent𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝐶0subscript𝐺𝑎\displaystyle\geq H(P_{A})+\textstyle\sum_{a\in\mathcal{A}}P_{A}(a)C_{0}(G_{a})≥ italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) (327)
=H(PA)+a𝒜PA(a)C(Ga,PXa)absent𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎𝐶subscript𝐺𝑎subscriptsuperscript𝑃subscript𝑋𝑎\displaystyle=H(P_{A})+\textstyle\sum_{a\in\mathcal{A}}P_{A}(a)C(G_{a},P^{% \star}_{X_{a}})= italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (328)
H(PA)+a𝒜PA(a)C(Ga,PXa);absent𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎\displaystyle\geq H(P_{A})+\textstyle\sum_{a\in\mathcal{A}}P_{A}(a)C(G_{a},P_{% X_{a}});≥ italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ; (329)

where (324) comes from (321) and Proposition 6; (325) comes from (29), see [3, Theorem 4]; (326) and (327) come from (323) and Lemma 5, which can be found in App. H-A; (328) and (329) come from (322) and Proposition 6.

Assume that C(a𝒜PAGa,a𝒜PA(a)PXa)=H(PA)+a𝒜PA(a)C(Ga,PXa)𝐶subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎C\left(\textstyle\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_{a},\;\sum_{a\in\mathcal% {A}}P_{A}(a)P_{X_{a}}\right)=H(P_{A})+\textstyle\sum_{a\in\mathcal{A}}P_{A}(a)% C(G_{a},P_{X_{a}})italic_C ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ), then equality holds in (324) to (329). In particular PA=PAsubscript𝑃𝐴subscriptsuperscript𝑃𝐴P_{A}=P^{\star}_{A}italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT = italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT as a consequence of the equality between (326) and (327); and (PXa)a𝒜a𝒜𝒫(Ga)subscriptsubscript𝑃subscript𝑋𝑎𝑎𝒜subscriptproduct𝑎𝒜superscript𝒫subscript𝐺𝑎(P_{X_{a}})_{a\in\mathcal{A}}\in\prod_{a\in\mathcal{A}}\mathcal{P}^{\star}(G_{% a})( italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT ∈ ∏ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) as a consequence of the equality between (328) and (329). Thus, for all a𝒜PA(a)PXa𝒫(a𝒜Ga)subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎superscript𝒫subscriptsquare-union𝑎𝒜subscript𝐺𝑎\sum_{a\in\mathcal{A}}P_{A}(a)P_{X_{a}}\in\mathcal{P}^{\star}\left(\textstyle% \bigsqcup_{a\in\mathcal{A}}G_{a}\right)∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) the following holds:

C(a𝒜PAGa,a𝒜PA(a)PXa)=H(PA)+a𝒜PA(a)C(Ga,PXa)𝐶subscriptsuperscriptsquare-unionsubscript𝑃𝐴𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝑃subscript𝑋𝑎𝐻subscript𝑃𝐴subscript𝑎𝒜subscript𝑃𝐴𝑎𝐶subscript𝐺𝑎subscript𝑃subscript𝑋𝑎\displaystyle\textstyle C\left(\textstyle\bigsqcup^{P_{A}}_{a\in\mathcal{A}}G_% {a},\;\sum_{a\in\mathcal{A}}P_{A}(a)P_{X_{a}}\right)=H(P_{A})+\textstyle\sum_{% a\in\mathcal{A}}P_{A}(a)C(G_{a},P_{X_{a}})italic_C ( ⨆ start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = italic_H ( italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_C ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT )
\displaystyle\Longrightarrow\; C0(a𝒜Ga)=log(a𝒜2C0(Ga)),(PXa)a𝒜a𝒜𝒫(Ga), and PA=PA.formulae-sequencesubscript𝐶0subscriptsquare-union𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜superscript2subscript𝐶0subscript𝐺𝑎formulae-sequencesubscriptsubscript𝑃subscript𝑋𝑎𝑎𝒜subscriptproduct𝑎𝒜superscript𝒫subscript𝐺𝑎 and subscript𝑃𝐴subscriptsuperscript𝑃𝐴\displaystyle C_{0}\left(\textstyle\bigsqcup_{a\in\mathcal{A}}G_{a}\right)=% \log\left(\textstyle\sum_{a\in\mathcal{A}}2^{C_{0}(G_{a})}\right),\;(P_{X_{a}}% )_{a\in\mathcal{A}}\in\prod_{a\in\mathcal{A}}\mathcal{P}^{\star}(G_{a}),\text{% and }P_{A}=P^{\star}_{A}.italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = roman_log ( ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) , ( italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT ∈ ∏ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT caligraphic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) , and italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT = italic_P start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT . (330)

Appendix I Proof of Theorem 17

Lemma 18 comes from [35, Corollary 1], and states that the function PAHκ(a𝒜PAGa)maps-tosubscript𝑃𝐴subscript𝐻𝜅superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎P_{A}\mapsto H_{\kappa}\big{(}\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}\big{)}italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ↦ italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ), defined analogously to PAH¯(a𝒜PAGa)maps-tosubscript𝑃𝐴¯𝐻superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎P_{A}\mapsto\overline{H}\big{(}\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}\big{)}italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ↦ over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ), is always linear. The proof of Lemma 19 is given in App. I-A.

Lemma 18 (from [42, Corollary 3.4]).

For all probabilistic graphs (Ga)a𝒜subscriptsubscript𝐺𝑎𝑎𝒜(G_{a})_{a\in\mathcal{A}}( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT and PAΔ(𝒜)subscript𝑃𝐴Δ𝒜P_{A}\in\Delta(\mathcal{A})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ), we have Hκ(a𝒜PAGa)=a𝒜PA(a)Hκ(Ga)subscript𝐻𝜅superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝐻𝜅subscript𝐺𝑎H_{\kappa}\big{(}\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}\big{)}=\sum_{a\in% \mathcal{A}}P_{A}(a)H_{\kappa}(G_{a})italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ).

Lemma 19.

The probabilistic graph a𝒜PAGasuperscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT is perfect if and only if Gasubscript𝐺𝑎G_{a}italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT is perfect for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A.

Now let us prove Theorem 17.

For all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A, let Ga=(𝒳a,a,PXa)subscript𝐺𝑎subscript𝒳𝑎subscript𝑎subscript𝑃subscript𝑋𝑎G_{a}=(\mathcal{X}_{a},\mathcal{E}_{a},P_{X_{a}})italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT = ( caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , caligraphic_E start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) be a perfect probabilistic graph. By Lemma 19, a𝒜PAGasuperscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT is also perfect; and we have H¯(a𝒜PAGa)=Hκ(a𝒜PAGa)¯𝐻superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎subscript𝐻𝜅superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎\overline{H}\big{(}\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}\big{)}=H_{\kappa}% \big{(}\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}\big{)}over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) by Theorem 14. We also have Hκ(a𝒜PAGa)=a𝒜PA(a)Hκ(Ga)=a𝒜PA(a)H¯(Ga)subscript𝐻𝜅superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎subscript𝐻𝜅subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎¯𝐻subscript𝐺𝑎H_{\kappa}\big{(}\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}\big{)}=\sum_{a\in% \mathcal{A}}P_{A}(a)H_{\kappa}(G_{a})=\sum_{a\in\mathcal{A}}P_{A}(a)\overline{% H}(G_{a})italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) by Lemma 18 and Theorem 14 used on the perfect graphs (Ga)a𝒜subscriptsubscript𝐺𝑎𝑎𝒜(G_{a})_{a\in\mathcal{A}}( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT. Thus

H¯(a𝒜PAGa)=a𝒜PA(a)H¯(Ga).¯𝐻superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎subscript𝑎𝒜subscript𝑃𝐴𝑎¯𝐻subscript𝐺𝑎\displaystyle\overline{H}\big{(}\textstyle\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G% _{a}\big{)}=\textstyle\sum_{a\in\mathcal{A}}P_{A}(a)\overline{H}(G_{a}).over¯ start_ARG italic_H end_ARG ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ( italic_a ) over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) . (331)

By Theorem 4, it follows that H¯(a𝒜Ga)=a𝒜H¯(Ga)=a𝒜Hκ(Ga)¯𝐻subscript𝑎𝒜subscript𝐺𝑎subscript𝑎𝒜¯𝐻subscript𝐺𝑎subscript𝑎𝒜subscript𝐻𝜅subscript𝐺𝑎\overline{H}\left(\bigwedge_{a\in\mathcal{A}}G_{a}\right)=\sum_{a\in\mathcal{A% }}\overline{H}(G_{a})=\sum_{a\in\mathcal{A}}H_{\kappa}(G_{a})over¯ start_ARG italic_H end_ARG ( ⋀ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT over¯ start_ARG italic_H end_ARG ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ), where the last equality comes from Theorem 14.

I-A Proof of Lemma 19

()(\Longrightarrow)( ⟹ ) Let G=a𝒜PAGa𝐺superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎G=\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}italic_G = ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT be a perfect probabilistic graph. Let a𝒜superscript𝑎𝒜a^{\prime}\in\mathcal{A}italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_A and 𝒮a𝒳asubscript𝒮superscript𝑎subscript𝒳superscript𝑎\mathcal{S}_{a^{\prime}}\subset\mathcal{X}_{a^{\prime}}caligraphic_S start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ⊂ caligraphic_X start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT. We have χ((a𝒜PAGa)[𝒮a])=ω((a𝒜PAGa)[𝒮a])𝜒superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎delimited-[]subscript𝒮superscript𝑎𝜔superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎delimited-[]subscript𝒮superscript𝑎\chi\big{(}\big{(}\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}\big{)}[\mathcal{S}_% {a^{\prime}}]\big{)}=\omega\big{(}\big{(}\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_% {a}\big{)}[\mathcal{S}_{a^{\prime}}]\big{)}italic_χ ( ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) [ caligraphic_S start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ] ) = italic_ω ( ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) [ caligraphic_S start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ] ) since G𝐺Gitalic_G is perfect, and therefore χ(Ga[𝒮a])=ω(Ga[𝒮a])𝜒subscript𝐺superscript𝑎delimited-[]subscript𝒮superscript𝑎𝜔subscript𝐺superscript𝑎delimited-[]subscript𝒮superscript𝑎\chi(G_{a^{\prime}}[\mathcal{S}_{a^{\prime}}])=\omega(G_{a^{\prime}}[\mathcal{% S}_{a^{\prime}}])italic_χ ( italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT [ caligraphic_S start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ] ) = italic_ω ( italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT [ caligraphic_S start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ] ), as (a𝒜PAGa)[𝒮a]=Ga[𝒮a]superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎delimited-[]subscript𝒮superscript𝑎subscript𝐺superscript𝑎delimited-[]subscript𝒮superscript𝑎\big{(}\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}\big{)}[\mathcal{S}_{a^{\prime}% }]=G_{a^{\prime}}[\mathcal{S}_{a^{\prime}}]( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) [ caligraphic_S start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ] = italic_G start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT [ caligraphic_S start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ]. Thus all the graphs (Ga)a𝒜subscriptsubscript𝐺𝑎𝑎𝒜(G_{a})_{a\in\mathcal{A}}( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT are perfect.

()(\Longleftarrow)( ⟸ ) Conversely, assume that for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A, Ga=(𝒳a,a,PXa)subscript𝐺𝑎subscript𝒳𝑎subscript𝑎subscript𝑃subscript𝑋𝑎G_{a}=(\mathcal{X}_{a},\mathcal{E}_{a},P_{X_{a}})italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT = ( caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , caligraphic_E start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) is perfect. Then for all 𝒮a𝒜𝒳a𝒮subscriptsquare-union𝑎𝒜subscript𝒳𝑎\mathcal{S}\subset\bigsqcup_{a\in\mathcal{A}}\mathcal{X}_{a}caligraphic_S ⊂ ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT, 𝒮𝒮\mathcal{S}caligraphic_S can be written as a𝒜𝒮asubscriptsquare-union𝑎𝒜subscript𝒮𝑎\bigsqcup_{a\in\mathcal{A}}\mathcal{S}_{a}⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT caligraphic_S start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT where 𝒮a𝒳asubscript𝒮𝑎subscript𝒳𝑎\mathcal{S}_{a}\subset\mathcal{X}_{a}caligraphic_S start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ⊂ caligraphic_X start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT for all a𝒜𝑎𝒜a\in\mathcal{A}italic_a ∈ caligraphic_A, and we have for all PAΔ(𝒜)subscript𝑃𝐴Δ𝒜P_{A}\in\Delta(\mathcal{A})italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ∈ roman_Δ ( caligraphic_A ):

χ((a𝒜PAGa)[𝒮])𝜒superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎delimited-[]𝒮\displaystyle\chi\left(\left(\textstyle\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a% }\right)[\mathcal{S}]\right)italic_χ ( ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) [ caligraphic_S ] ) =χ(a𝒜PAGa[𝒮a])absent𝜒superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎delimited-[]subscript𝒮𝑎\displaystyle=\chi\left(\textstyle\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}[% \mathcal{S}_{a}]\right)= italic_χ ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT [ caligraphic_S start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ] ) (332)
=maxa𝒜χ(Ga[𝒮a])absentsubscript𝑎𝒜𝜒subscript𝐺𝑎delimited-[]subscript𝒮𝑎\displaystyle=\max_{a\in\mathcal{A}}\chi\left(G_{a}[\mathcal{S}_{a}]\right)= roman_max start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_χ ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT [ caligraphic_S start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ] ) (333)
=maxa𝒜ω(Ga[𝒮a]),absentsubscript𝑎𝒜𝜔subscript𝐺𝑎delimited-[]subscript𝒮𝑎\displaystyle=\max_{a\in\mathcal{A}}\omega\left(G_{a}[\mathcal{S}_{a}]\right),= roman_max start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_ω ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT [ caligraphic_S start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ] ) , (334)

and similarly, ω((a𝒜PAGa)[𝒮])=maxa𝒜ω(Ga[𝒮a])𝜔superscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎delimited-[]𝒮subscript𝑎𝒜𝜔subscript𝐺𝑎delimited-[]subscript𝒮𝑎\omega\big{(}\big{(}\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}\big{)}[\mathcal{S% }]\big{)}=\max_{a\in\mathcal{A}}\omega\left(G_{a}[\mathcal{S}_{a}]\right)italic_ω ( ( ⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) [ caligraphic_S ] ) = roman_max start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT italic_ω ( italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT [ caligraphic_S start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ] ). Hence a𝒜PAGasuperscriptsubscriptsquare-union𝑎𝒜subscript𝑃𝐴subscript𝐺𝑎\bigsqcup_{a\in\mathcal{A}}^{P_{A}}G_{a}⨆ start_POSTSUBSCRIPT italic_a ∈ caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT is also perfect.

References

  • [1] C. E. Shannon, “A mathematical theory of communication,” The Bell system technical journal, vol. 27, no. 3, pp. 379–423, 1948.
  • [2] D. A. Huffman, “A method for the construction of minimum-redundancy codes,” Proceedings of the IRE, vol. 40, no. 9, pp. 1098–1101, 1952.
  • [3] C. Shannon, “The zero error capacity of a noisy channel,” IRE Transactions on Information Theory, vol. 2, no. 3, pp. 8–19, 1956.
  • [4] A. Vesel and J. Žerovnik, “Improved lower bound on the Shannon capacity of C7subscript𝐶7C_{7}italic_C start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT,” Information Processing Letters, vol. 81, no. 5, pp. 277–282, 2002.
  • [5] B. Codenotti, I. Gerace, and G. Resta, “Some remarks on the Shannon capacity of odd cycles,” Ars Combinatoria, vol. 66, pp. 243–258, 2003.
  • [6] S. C. Polak and A. Schrijver, “New lower bound on the Shannon capacity of C7subscript𝐶7C_{7}italic_C start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT from circular graphs,” Information Processing Letters, vol. 143, pp. 37–40, 2019.
  • [7] I. Csiszár and J. Körner, Information theory: coding theorems for discrete memoryless systems.   Cambridge University Press, 2011.
  • [8] S. Klavzar, R. Hammack, and W. Imrich, “Handbook of graph products,” 2011.
  • [9] C. Berge, Graphs and Hypergraphs, ser. North-Holland mathematical library.   Amsterdam, 1973.
  • [10] M. Grötschel, L. Lovász, and A. Schrijver, “Polynomial algorithms for perfect graphs,” Ann. Discrete Math, vol. 21, pp. 325–356, 1984.
  • [11] C. Berge, “Farbung von graphen, deren samtliche bzw. deren ungerade kreise starr sind,” Wissenschaftliche Zeitschrift, 1961.
  • [12] M. Chudnovsky, N. Robertson, P. D. Seymour, and R. Thomas, “Progress on perfect graphs,” Mathematical Programming, vol. 97, no. 1-2, pp. 405–422, 2003.
  • [13] L. Lovász, “On the Shannon capacity of a graph,” IEEE Transactions on Information Theory, vol. 25, no. 1, pp. 1–7, 1979.
  • [14] H. Witsenhausen, “The zero-error side information problem and chromatic numbers (corresp.),” IEEE Transactions on Information Theory, vol. 22, no. 5, pp. 592–593, 1976.
  • [15] D. Slepian and J. Wolf, “Noiseless coding of correlated information sources,” IEEE Transactions on information Theory, vol. 19, no. 4, pp. 471–480, 1973.
  • [16] N. Alon and A. Orlitsky, “Source coding and graph entropies,” IEEE Transactions on Information Theory, vol. 42, no. 5, pp. 1329–1339, 1996.
  • [17] P. Koulgi, E. Tuncel, S. L. Regunathan, and K. Rose, “On zero-error source coding with decoder side information,” IEEE Transactions on Information Theory, vol. 49, no. 1, pp. 99–111, 2003.
  • [18] J. Körner and G. Longo, “Two-step encoding for finite sources,” IEEE Transactions on Information Theory, vol. 19, no. 6, pp. 778–782, 1973.
  • [19] J. Körner, “Coding of an information source having ambiguous alphabet and the entropy of graphs,” Transactions of the 6th Prague Conference on Information Theory, pp. 411—425, 1973.
  • [20] W. Haemers, “On some problems of Lovász concerning the Shannon capacity of a graph,” IEEE Transactions on Information Theory, vol. 25, no. 2, pp. 231–232, 1979.
  • [21] E. Tuncel, J. Nayak, P. Koulgi, and K. Rose, “On complementary graph entropy,” IEEE transactions on information theory, vol. 55, no. 6, pp. 2537–2546, 2009.
  • [22] A. Wigderson and J. Zuiddam, Asymptotic spectra: Theory, applications and extensions.   manuscript, 2023.
  • [23] A. Schrijver, “On the Shannon capacity of sums and products of graphs,” Indagationes Mathematicae, vol. 34, no. 1, pp. 37–41, 2023.
  • [24] I. Csiszár and J. Körner, “On the capacity of the arbitrarily varying channel for maximum probability of error,” Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete, vol. 57, no. 1, pp. 87–101, 1981.
  • [25] L. Gargano, J. Körner, and U. Vaccaro, “Capacities: from information theory to extremal set theory,” Journal of Combinatorial Theory, Series A, vol. 68, no. 2, pp. 296–316, 1994.
  • [26] K. Marton, “On the Shannon capacity of probabilistic graphs,” Journal of Combinatorial Theory, Series B, vol. 57, no. 2, pp. 183–195, 1993.
  • [27] L. Lovász, “On the Shannon capacity of a graph,” IEEE Transactions on Information theory, vol. 25, no. 1, pp. 1–7, 1979.
  • [28] I. Sason, “Observations on the Lovász θ𝜃\thetaitalic_θ-function, graph capacity, eigenvalues, and strong products,” Entropy, vol. 25, no. 1, p. 104, 2023.
  • [29] A. Vesel and J. Žerovnik, “Improved lower bound on the Shannon capacity of C7subscript𝐶7C_{7}italic_C start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT,” Information processing letters, vol. 81, no. 5, pp. 277–282, 2002.
  • [30] K. A. Mathew and P. R. Östergård, “New lower bounds for the Shannon capacity of odd cycles,” Designs, Codes and Cryptography, vol. 84, pp. 13–22, 2017.
  • [31] S. C. Polak and A. Schrijver, “New lower bound on the Shannon capacity of c7 from circular graphs,” Information Processing Letters, vol. 143, pp. 37–40, 2019.
  • [32] M. Chudnovsky and P. D. Seymour, “The structure of claw-free graphs.” in BCC, 2005, pp. 153–171.
  • [33] B. Bukh and C. Cox, “On a fractional version of Haemers’ bound,” IEEE Transactions on Information Theory, vol. 65, no. 6, pp. 3340–3348, 2018.
  • [34] L. Gao, S. Gribling, and Y. Li, “On a tracial version of Haemers bound,” IEEE Transactions on Information Theory, 2022.
  • [35] G. Simonyi, “Perfect graphs and graph entropy. An updated survey,” in Perfect Graphs, B. A. Reed and J. L. R. Alfonsin, Eds.   John Wiley & Sons, 2001, ch. 13, pp. 293–328.
  • [36] H. Touchette, “Legendre-Fenchel transforms in a nutshell,” URL http://www. maths. qmul. ac. uk/~ ht/archive/lfth2. pdf, 2005.
  • [37] I. Csiszár, J. Körner, L. Lovász, K. Marton, and G. Simonyi, “Entropy splitting for antiblocking corners and perfect graphs,” Combinatorica, vol. 10, no. 1, pp. 27–40, 1990.
  • [38] M. Chudnovsky, N. Robertson, P. Seymour, and R. Thomas, “The strong perfect graph theorem,” Annals of mathematics, pp. 51–229, 2006.
  • [39] P. J. Cameron, “6-transitive graphs,” Journal of Combinatorial Theory, Series B, vol. 28, no. 2, pp. 168–179, 1980.
  • [40] N. Alon, “The Shannon capacity of a union,” Combinatorica, vol. 18, no. 3, pp. 301–310, 1998.
  • [41] J. Zuiddam et al., Algebraic complexity, asymptotic spectra and entanglement polytopes.   Institute for Logic, Language and Computation, 2018.
  • [42] G. Simonyi, “Graph entropy: a survey,” Combinatorial Optimization, vol. 20, pp. 399–441, 1995.