Competitive Analysis of Arbitrary Varying Channels

Michael Langberg Department of Electrical Engineering
University at Buffalo
Buffalo, NY, USA
Email: [email protected]
โ€ƒโ€ƒ Oron Sabag School of Computer Science and Engineering
The Hebrew University of Jerusalem
Jerusalem, Israel
Email: [email protected]
Abstract

Arbitrary varying channels (AVC) are used to model communication settings in which a channel state may vary arbitrarily over time. Their primary objective is to circumvent statistical assumptions on channel variation. Traditional studies on AVCs optimize rate subject to the worst-case state sequence. While this approach is resilient to channel variations, it may result in low rates for state sequences that are associated with relatively good channels. This paper addresses the analysis of AVCs through the lens of competitive analysis, where solution quality is measured with respect to the optimal solution had the state sequence been known in advance. Our main result demonstrates that codes constructed by a single input distribution do not achieve optimal competitive performance over AVCs. This stands in contrast to the single-letter capacity formulae for AVCs, and it indicates, in our setting, that even though the encoder cannot predict the subsequent channel states, it benefits from varying its input distribution as time proceeds.

I Introduction

The arbitrarily varying channel (AVC) model, introduced by Blackwell, Breiman, and Thomasian [1], captures communication over a collection of memoryless channels, ๐’ฒ={Wsโข(y|x)}sโˆˆ๐’ฎ๐’ฒsubscriptsubscript๐‘Š๐‘ conditional๐‘ฆ๐‘ฅ๐‘ ๐’ฎ\operatorname{\mathcal{W}}=\{W_{s}(y|x)\}_{s\in{\mathcal{S}}}caligraphic_W = { italic_W start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_y | italic_x ) } start_POSTSUBSCRIPT italic_s โˆˆ caligraphic_S end_POSTSUBSCRIPT, where in each time instance i๐‘–iitalic_i the channel state that determines the channel in use may be arbitrarily chosen. The AVC model is broad in nature, and in its full generality can capture the setting in which the state sequence may depend on the transmitted codeword or may have a cost/type constraint. As such, the AVC model captures both adversarial and random noise models. The majority of previous studies on AVCs address the case in which the state does not depend on the transmitted codeword, e.g., [2, 3, 4, 5]. Traditionally, success criteria for communication in the context of AVCs require the design of a single coding scheme that allows communication at a fixed rate no matter which state sequence is realized; this approach forces code rates to be matched to the worst-case channel conditions. The fixed-rate setting is complemented by variable-rate performance criteria that no longer guarantee the delivery of a fixed rate under all channel state sequences; instead, they allow the rate to vary with the channel states in operation. Rateless codes, introduced in [6, 7, 8], achieve variable-rate coding by allowing the effective blocklength to vary with the channel state.

In this work, we consider rateless codes for AVCs. In particular, the message length is fixed in advance and the communication length is a random variable that depends on channel outputs, i.e., the block length is determined by a stop** time based on the channel output filtration. Rateless codes for the AVC model have seen a number of studies over the last decade. The majority of studies involve feedback[9, 10, 11, 12, 13, 14, 15, 16, 17, 18] and are less relevant to the work at hand. In the context of rateless codes for AVCs without feedback, prior works include [19, 20, 21] that study coding solutions and effective rate in the setting in which the decoder has full or partial access to state information. Beyond the assumptions on decoder state information (which is also central to our study as well), the major difference between the works above and the work at hand lies in the quality measures of the solutions suggested - the former, for a fixed input distribution, seek decoding rules that minimize the expected decoding time given the state sequence at hand, while the work at hand seeks coding solutions with a competitive quality guarantee.

Namely, in this work, we study rateless coding technologies for communication over AVCs through the lens of competitive analysis. In competitive analysis, one compares the achievable rates of solutions in which state sequence is not available to the encoder and decoder with those in which state information is known in advance to all parties. The objective is to design communication schemes that achieve rates that are close to that achievable when the state sequence is known in advance. Common metrics to compare these two rates include the competitive-ratio that measures the ratio between the (expected) rates achievable in the case of limited state knowledge and that with full state knowledge, and regret that measures the difference between the former and latter rates described above. The design of communication schemes with a competitive ratio approaching 1 (or with regret approaching 0) guarantees that even in the face of uncertainty, the quality of communication matches the best possible under the given conditions. A competitive ratio ฮฑ๐›ผ\alphaitalic_ฮฑ that is bounded away from 1111 acts as a quality measure for the communication scheme at hand, guaranteeing that no matter what state sequence is realized, the achievable rate is guaranteed to be within an ฮฑ๐›ผ\alphaitalic_ฮฑ multiplicative ratio of the best possible.

The remainder of the paper is structured as follows. In Sectionย II we present our model and problem definition. We then formulate the main question addressed in this work in Section III. Our results are stated in Section IV.

II Model and problem definition

Let ๐’ณ๐’ณ\operatorname{\mathcal{X}}caligraphic_X, ๐’ด๐’ด\operatorname{\mathcal{Y}}caligraphic_Y, and ๐’ฎ๐’ฎ\operatorname{\mathcal{S}}caligraphic_S denote finite alphabets of the channel input, output and state, respectively. Consider a message M๐‘€Mitalic_M uniformly distributed over โ„ณโ‰œ[1:2k]\mathcal{M}\triangleq[1:2^{k}]caligraphic_M โ‰œ [ 1 : 2 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ] and communication over channels taken from a family of discrete memoryless channels ๐’ฒ={Wsโข(y|x)}sโˆˆ๐’ฎ๐’ฒsubscriptsubscript๐‘Š๐‘ conditional๐‘ฆ๐‘ฅ๐‘ ๐’ฎ\mathcal{W}=\{W_{s}(y|x)\}_{s\in\mathcal{S}}caligraphic_W = { italic_W start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_y | italic_x ) } start_POSTSUBSCRIPT italic_s โˆˆ caligraphic_S end_POSTSUBSCRIPT. The channel at time i๐‘–iitalic_i is determined by the state sisubscript๐‘ ๐‘–s_{i}italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT as Wsiโข(y|x)subscript๐‘Šsubscript๐‘ ๐‘–conditional๐‘ฆ๐‘ฅW_{s_{i}}(y|x)italic_W start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_y | italic_x ). The state sequence does not depend on the message and should be viewed as a deterministic sequence that is chosen arbitrarily.

The message length k๐‘˜kitalic_k is not parameterized with a blocklength or rate. Instead, we consider rateless communication where the fixed-size message should be decoded with the least number of channel uses. That is, the decoder observes the stream of channel outputs and decides at each time if it wants to proceed with communication or to abort it and to decode the message. We proceed to formally define rateless codes.

โˆ™โˆ™\bulletโˆ™ Rateless codes over AVCs: For a fixed message length k๐‘˜kitalic_k, a rateless code ๐’žkโ‰œ(E,{Di}i=1โˆž,{Hi}i=1โˆž)โ‰œsubscript๐’ž๐‘˜๐ธsuperscriptsubscriptsubscript๐ท๐‘–๐‘–1superscriptsubscriptsubscript๐ป๐‘–๐‘–1\mathcal{C}_{k}\triangleq(E,\{D_{i}\}_{i=1}^{\infty},\{H_{i}\}_{i=1}^{\infty})caligraphic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT โ‰œ ( italic_E , { italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT , { italic_H start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT ) is defined by three deterministic map**s. The encoder is defined by the map** E:โ„ณโ†’๐’ณโˆž:๐ธโ†’โ„ณsuperscript๐’ณE:\mathcal{M}\to\mathcal{X}^{\infty}italic_E : caligraphic_M โ†’ caligraphic_X start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT. The second map** is a sequence of decoder-decision functions Hi:๐’ดiโ†’{0,1}:subscript๐ป๐‘–โ†’superscript๐’ด๐‘–01H_{i}:\mathcal{Y}^{i}\to\{0,1\}italic_H start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : caligraphic_Y start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT โ†’ { 0 , 1 } for iโ‰ฅ1๐‘–1i\geq 1italic_i โ‰ฅ 1; if the decoder decodes at time i๐‘–iitalic_i, it sets Hiโข(Yi)=1subscript๐ป๐‘–superscript๐‘Œ๐‘–1H_{i}(Y^{i})=1italic_H start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_Y start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ) = 1 and then the message is decoded with the last map** Di:๐’ดiโ†’โ„ณ:subscript๐ท๐‘–โ†’superscript๐’ด๐‘–โ„ณD_{i}:\mathcal{Y}^{i}\to\mathcal{M}italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : caligraphic_Y start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT โ†’ caligraphic_M. In all time instances prior to the decoding, the decoder simply sets Hi=0subscript๐ป๐‘–0H_{i}=0italic_H start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 0.

The stop** time is defined as ฯ„k=miniโก{i:Hiโข(Yi)=1}subscript๐œ๐‘˜subscript๐‘–:๐‘–subscript๐ป๐‘–superscript๐‘Œ๐‘–1\tau_{k}=\min_{i}\{i:H_{i}(Y^{i})=1\}italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = roman_min start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT { italic_i : italic_H start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_Y start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ) = 1 }. For a given code and a state sequence ๐ฌโˆˆ๐’ฎโˆž๐ฌsuperscript๐’ฎ\operatorname*{\bf s}\in\mathcal{S}^{\infty}bold_s โˆˆ caligraphic_S start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT, the average probability of error is Peโข(๐ฌ)โ‰œPrโก(Mโ‰ Dฯ„kโข(Yฯ„k)โˆฃ๐ฌ)โ‰œsubscript๐‘ƒ๐‘’๐ฌPr๐‘€conditionalsubscript๐ทsubscript๐œ๐‘˜superscript๐‘Œsubscript๐œ๐‘˜๐ฌP_{e}(\operatorname*{\bf s})\triangleq\Pr(M\neq D_{\tau_{k}}(Y^{\tau_{k}})\mid% \operatorname*{\bf s})italic_P start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ( bold_s ) โ‰œ roman_Pr ( italic_M โ‰  italic_D start_POSTSUBSCRIPT italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_Y start_POSTSUPERSCRIPT italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) โˆฃ bold_s ), and Pe=max๐ฌโกPeโข(๐ฌ)subscript๐‘ƒ๐‘’subscript๐ฌsubscript๐‘ƒ๐‘’๐ฌP_{e}=\max_{\operatorname*{\bf s}}P_{e}(\operatorname*{\bf s})italic_P start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT = roman_max start_POSTSUBSCRIPT bold_s end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ( bold_s ) denotes the maximal error among all state sequences. The expected decoding time for ๐ฌ๐ฌ\operatorname*{\bf s}bold_s isย ฯ„kโข(๐ฌ)=๐”ผโก[ฯ„kโˆฃ๐ฌ]subscript๐œ๐‘˜๐ฌ๐”ผconditionalsubscript๐œ๐‘˜๐ฌ\tau_{k}(\operatorname*{\bf s})=\operatorname{\mathbb{E}}[\tau_{k}\mid% \operatorname*{\bf s}]italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) = blackboard_E [ italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT โˆฃ bold_s ] (here, expectation is taken over the channel and message). For rateless codes, the effective rate is typically measured as kฯ„kโข(๐ฌ)๐‘˜subscript๐œ๐‘˜๐ฌ\frac{k}{\tau_{k}(\operatorname*{\bf s})}divide start_ARG italic_k end_ARG start_ARG italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG, e.g, [22].

โˆ™โˆ™\bulletโˆ™ Competitive analysis: To define our competitive metrics, as our baseline, we consider codes where the encoder and the decoder map**s have access to the state sequence ๐ฌโˆˆ๐’ฎโˆž๐ฌsuperscript๐’ฎ\operatorname*{\bf s}\in\operatorname{\mathcal{S}}^{\infty}bold_s โˆˆ caligraphic_S start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT. Specifically, it is a rateless code ๐’žksubscript๐’ž๐‘˜\mathcal{C}_{k}caligraphic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT as above, but the map**s (E,{Di},{Hi})๐ธsubscript๐ท๐‘–subscript๐ป๐‘–(E,\{D_{i}\},\{H_{i}\})( italic_E , { italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } , { italic_H start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } ) depend on the state sequence. We denote such codes with ๐’žkโˆ—subscriptsuperscript๐’žโˆ—๐‘˜\mathcal{C}^{\ast}_{k}caligraphic_C start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT.

The main idea is to compare the stop** time ฯ„kโข(๐ฌ)subscript๐œ๐‘˜๐ฌ\tau_{k}(\operatorname*{\bf s})italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) of a code ๐’žksubscript๐’ž๐‘˜{\cal C}_{k}caligraphic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT with the stop** time ฯ„k,ฯตโˆ—โข(๐ฌ)superscriptsubscript๐œ๐‘˜italic-ฯต๐ฌ\tau_{k,\epsilon}^{*}(\operatorname*{\bf s})italic_ฯ„ start_POSTSUBSCRIPT italic_k , italic_ฯต end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT ( bold_s ) of an optimal, clairvoyant, scheme ๐’žkโˆ—superscriptsubscript๐’ž๐‘˜{\cal C}_{k}^{*}caligraphic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT specifically designed for ๐ฌ๐ฌ\operatorname*{\bf s}bold_s. Setting ฯ„k,ฯตโˆ—โข(๐ฌ)=inf๐’žkโˆ—:Peโ‰คฯตฯ„kโข(๐ฌ)superscriptsubscript๐œ๐‘˜italic-ฯต๐ฌsubscriptinfimum:subscriptsuperscript๐’ž๐‘˜subscript๐‘ƒ๐‘’italic-ฯตsubscript๐œ๐‘˜๐ฌ\tau_{k,\epsilon}^{*}(\operatorname*{\bf s})=\inf\limits_{\mathcal{C}^{*}_{k}:% P_{e}\leq\epsilon}\tau_{k}(\operatorname*{\bf s})italic_ฯ„ start_POSTSUBSCRIPT italic_k , italic_ฯต end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT ( bold_s ) = roman_inf start_POSTSUBSCRIPT caligraphic_C start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT : italic_P start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT โ‰ค italic_ฯต end_POSTSUBSCRIPT italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ), the competitive ratio is defined as

๐‚๐‘(k,ฯต)๐‚๐‘๐‘˜italic-ฯต\displaystyle\operatorname*{\mathbf{CR}}(k,\epsilon)bold_CR ( italic_k , italic_ฯต ) =sup๐’žk:Peโ‰คฯตinf๐ฌโˆˆ๐’ฎโˆžฯ„k,ฯตโˆ—โข(๐ฌ)ฯ„kโข(๐ฌ).absentsubscriptsupremum:subscript๐’ž๐‘˜subscript๐‘ƒ๐‘’italic-ฯตsubscriptinfimum๐ฌsuperscript๐’ฎsubscriptsuperscript๐œโˆ—๐‘˜italic-ฯต๐ฌsubscript๐œ๐‘˜๐ฌ\displaystyle=\sup_{\mathcal{C}_{k}:P_{e}\leq\epsilon}\inf_{\operatorname*{\bf s% }\in\mathcal{S}^{\infty}}\frac{\tau^{\ast}_{k,\epsilon}(\operatorname*{\bf s})% }{\tau_{k}(\operatorname*{\bf s})}.\vspace{-3mm}= roman_sup start_POSTSUBSCRIPT caligraphic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT : italic_P start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT โ‰ค italic_ฯต end_POSTSUBSCRIPT roman_inf start_POSTSUBSCRIPT bold_s โˆˆ caligraphic_S start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT end_POSTSUBSCRIPT divide start_ARG italic_ฯ„ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_ฯต end_POSTSUBSCRIPT ( bold_s ) end_ARG start_ARG italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG . (1)

The competitive ratio guarantees the largest multiplicative measure of quality with respect to the optimal code. That is, for any ๐ฌ๐ฌ\operatorname*{\bf s}bold_s, whether this sequence implies low or high ฯ„k,ฯตโˆ—โข(๐ฌ)subscriptsuperscript๐œ๐‘˜italic-ฯต๐ฌ\tau^{*}_{k,\epsilon}(\operatorname*{\bf s})italic_ฯ„ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_ฯต end_POSTSUBSCRIPT ( bold_s ), the resulted stop** time of the oblivious code (that is not designed with the knowledge of ๐ฌ๐ฌ\operatorname*{\bf s}bold_s) should be the closest possible to the optimal stop** time.

For some applications, e.g., delay-sensitive systems [23, 24, 25, 26, 27, 28, 29], one may be interested in additive bounds on the normalized delay or the (effective) rate when compared to an optimal code given state knowledge, defined here as the regret:

๐‘๐ž๐ ๐ซ๐ž๐ญ(k,ฯต)๐‘๐ž๐ ๐ซ๐ž๐ญ๐‘˜italic-ฯต\displaystyle\operatorname*{\mathbf{Regret}}(k,\epsilon)bold_Regret ( italic_k , italic_ฯต ) =inf๐’žk:Peโ‰คฯตsupsโˆˆ๐’ฎโˆž(kฯ„k,ฯตโˆ—โข(๐ฌ)โˆ’kฯ„kโข(๐ฌ)).absentsubscriptinfimum:subscript๐’ž๐‘˜subscript๐‘ƒ๐‘’italic-ฯตsubscriptsupremum๐‘ superscript๐’ฎ๐‘˜subscriptsuperscript๐œโˆ—๐‘˜italic-ฯต๐ฌ๐‘˜subscript๐œ๐‘˜๐ฌ\displaystyle=\inf_{\mathcal{C}_{k}:P_{e}\leq\epsilon}\sup_{s\in\mathcal{S}^{% \infty}}\left(\frac{k}{\tau^{\ast}_{k,\epsilon}(\operatorname*{\bf s})}-\frac{% k}{\tau_{k}(\operatorname*{\bf s})}\right).= roman_inf start_POSTSUBSCRIPT caligraphic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT : italic_P start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT โ‰ค italic_ฯต end_POSTSUBSCRIPT roman_sup start_POSTSUBSCRIPT italic_s โˆˆ caligraphic_S start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( divide start_ARG italic_k end_ARG start_ARG italic_ฯ„ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_ฯต end_POSTSUBSCRIPT ( bold_s ) end_ARG - divide start_ARG italic_k end_ARG start_ARG italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG ) . (2)

A family of channels ๐’ฒ={Ws}sโˆˆ๐’ฎ๐’ฒsubscriptsubscript๐‘Š๐‘ ๐‘ ๐’ฎ\mathcal{W}=\{W_{s}\}_{s\in\mathcal{S}}caligraphic_W = { italic_W start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_s โˆˆ caligraphic_S end_POSTSUBSCRIPT is said to be ฮฑโข(k,ฯต)๐›ผ๐‘˜italic-ฯต\alpha(k,\epsilon)italic_ฮฑ ( italic_k , italic_ฯต )-competitive if ๐‚๐‘(k,ฯต)โ‰ฅฮฑโข(k,ฯต)๐‚๐‘๐‘˜italic-ฯต๐›ผ๐‘˜italic-ฯต\operatorname*{\mathbf{CR}}(k,\epsilon)\geq\alpha(k,\epsilon)bold_CR ( italic_k , italic_ฯต ) โ‰ฅ italic_ฮฑ ( italic_k , italic_ฯต ). Similarly for the regret objective, ๐’ฒ๐’ฒ\mathcal{W}caligraphic_W allows a regret of ฯโข(k,ฯต)๐œŒ๐‘˜italic-ฯต\rho(k,\epsilon)italic_ฯ ( italic_k , italic_ฯต ) if ๐‘๐ž๐ ๐ซ๐ž๐ญ(k,ฯต)โ‰คฯโข(k,ฯต)๐‘๐ž๐ ๐ซ๐ž๐ญ๐‘˜italic-ฯต๐œŒ๐‘˜italic-ฯต\operatorname*{\mathbf{Regret}}(k,\epsilon)\leq\rho(k,\epsilon)bold_Regret ( italic_k , italic_ฯต ) โ‰ค italic_ฯ ( italic_k , italic_ฯต ).

In the asymptotic setting, which is the focus of this work, a family of channels ๐’ฒ={Ws}sโˆˆ๐’ฎ๐’ฒsubscriptsubscript๐‘Š๐‘ ๐‘ ๐’ฎ\mathcal{W}=\{W_{s}\}_{s\in\mathcal{S}}caligraphic_W = { italic_W start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_s โˆˆ caligraphic_S end_POSTSUBSCRIPT is said to be ฮฑ๐›ผ\alphaitalic_ฮฑ-competitive if there exists a sequence ฯตkโ†’0โ†’subscriptitalic-ฯต๐‘˜0\epsilon_{k}\to 0italic_ฯต start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT โ†’ 0 such that lim supkโ†’โˆž๐‚๐‘(k,ฯตk)โ‰ฅฮฑ.subscriptlimit-supremumโ†’๐‘˜๐‚๐‘๐‘˜subscriptitalic-ฯต๐‘˜๐›ผ\limsup_{k\to\infty}\operatorname*{\mathbf{CR}}(k,\epsilon_{k})\geq\alpha.lim sup start_POSTSUBSCRIPT italic_k โ†’ โˆž end_POSTSUBSCRIPT bold_CR ( italic_k , italic_ฯต start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) โ‰ฅ italic_ฮฑ . The supremum over all such ฮฑ๐›ผ\alphaitalic_ฮฑ is defined as the optimal competitive ratio (also referred to as the competitive AVC capacity) and is denoted by ๐‚๐‘๐‚๐‘\operatorname*{\mathbf{CR}}bold_CR. Namely,

๐‚๐‘๐‚๐‘\displaystyle\operatorname*{\mathbf{CR}}bold_CR โ‰œlim supkโ†’โˆž,ฯตkโ†’0๐‚๐‘(k,ฯตk).โ‰œabsentsubscriptlimit-supremumformulae-sequenceโ†’๐‘˜โ†’subscriptitalic-ฯต๐‘˜0๐‚๐‘๐‘˜subscriptitalic-ฯต๐‘˜\displaystyle\triangleq\limsup_{\begin{subarray}{c}k\to\infty,\epsilon_{k}\to 0% \end{subarray}}\operatorname*{\mathbf{CR}}(k,\epsilon_{k}).โ‰œ lim sup start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_k โ†’ โˆž , italic_ฯต start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT โ†’ 0 end_CELL end_ROW end_ARG end_POSTSUBSCRIPT bold_CR ( italic_k , italic_ฯต start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) . (3)

Similarly, a regret of ฯ๐œŒ\rhoitalic_ฯ is achievable if there exists ฯตkโ†’0โ†’subscriptitalic-ฯต๐‘˜0\epsilon_{k}\to 0italic_ฯต start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT โ†’ 0 such that lim infkโ†’โˆž๐‘๐ž๐ ๐ซ๐ž๐ญ(k,ฯตk)โ‰คฯsubscriptlimit-infimumโ†’๐‘˜๐‘๐ž๐ ๐ซ๐ž๐ญ๐‘˜subscriptitalic-ฯต๐‘˜๐œŒ\liminf_{k\to\infty}\operatorname*{\mathbf{Regret}}(k,\epsilon_{k})\leq\rholim inf start_POSTSUBSCRIPT italic_k โ†’ โˆž end_POSTSUBSCRIPT bold_Regret ( italic_k , italic_ฯต start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) โ‰ค italic_ฯ, and the infimum over such ฯ๐œŒ\rhoitalic_ฯ is denoted byย ๐‘๐ž๐ ๐ซ๐ž๐ญโ‰œlim infkโ†’โˆžฯตkโ†’0๐‘๐ž๐ ๐ซ๐ž๐ญ(k,ฯตk).โ‰œ๐‘๐ž๐ ๐ซ๐ž๐ญsubscriptlimit-infimumโ†’๐‘˜โ†’subscriptitalic-ฯต๐‘˜0๐‘๐ž๐ ๐ซ๐ž๐ญ๐‘˜subscriptitalic-ฯต๐‘˜\operatorname*{\mathbf{Regret}}\triangleq\liminf_{\begin{subarray}{c}k\to% \infty\\ \epsilon_{k}\to 0\end{subarray}}\operatorname*{\mathbf{Regret}}(k,\epsilon_{k}).bold_Regret โ‰œ lim inf start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_k โ†’ โˆž end_CELL end_ROW start_ROW start_CELL italic_ฯต start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT โ†’ 0 end_CELL end_ROW end_ARG end_POSTSUBSCRIPT bold_Regret ( italic_k , italic_ฯต start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) . In the remainder of this work, we focus on the ๐‚๐‘๐‚๐‘\operatorname*{\mathbf{CR}}bold_CR quality measure; however, the results presented here apply also to ๐‘๐ž๐ ๐ซ๐ž๐ญ๐‘๐ž๐ ๐ซ๐ž๐ญ\operatorname*{\mathbf{Regret}}bold_Regret.

III Main question

The main objective towards the design of practical codes that achieve the optimal competitive ratio is a single-letter characterization of ๐‚๐‘๐‚๐‘\operatorname*{\mathbf{CR}}bold_CR. Single-letter characterizations provide efficient means to determine fundamental quality limits and, more importantly, they often provide a simple structure for optimal code design. In a previous work of the authors, [30], a single-letter characterization for the competitive analysis of the compound channel (in which the state sequence is unknown but does not change over time) was presented. In this work, in which we address AVCs, we do not derive a single-letter characterization for ๐‚๐‘๐‚๐‘\operatorname*{\mathbf{CR}}bold_CR; however, we make significant steps towards understanding this ultimate goal.

In this work, we seek to understand the structure of optimal codes for competitive metrics. Specifically, we ask if, similar to classical results in the context of communication, coding schemes designed and governed by a single optimizing distribution are optimal in the competitive setting as well. We note that, in the traditional study of AVCs, the capacity (up to symmetrizability) is characterized as maxpโข(x)โกminpโข(s)โกIโข(X;Y)subscript๐‘๐‘ฅsubscript๐‘๐‘ ๐ผ๐‘‹๐‘Œ\max_{p(x)}\min_{p(s)}I(X;Y)roman_max start_POSTSUBSCRIPT italic_p ( italic_x ) end_POSTSUBSCRIPT roman_min start_POSTSUBSCRIPT italic_p ( italic_s ) end_POSTSUBSCRIPT italic_I ( italic_X ; italic_Y ), e.g., [31], by a single input distribution. Here (X,Y)๐‘‹๐‘Œ(X,Y)( italic_X , italic_Y ) are distributed according to โˆ‘spโข(x)โขpโข(s)โขWsโข(y|x)subscript๐‘ ๐‘๐‘ฅ๐‘๐‘ subscript๐‘Š๐‘ conditional๐‘ฆ๐‘ฅ\sum_{s}p(x)p(s)W_{s}(y|x)โˆ‘ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_p ( italic_x ) italic_p ( italic_s ) italic_W start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_y | italic_x ). The same holds for the rateless study of such AVCs when one wishes to minimize the worst-case decoding time, with [19] and without [32] decoder state information.

For the compound setting, [30] show that single-distribution codes are suboptimal in the context of competitive analysis. Namely, for competitive quality measures in the compound setting, the encoder, even without additional feedback knowledge, may be required to modify encoding statistics as time goes by. This follows, roughly stated, since a competitive encoder at any given time must optimize performance over a monotonely shrinking set of channel states. Initially, the encoder must act with all potential states in mind. However, as time goes by, certain channel statistics, if realized, would have already allowed successful decoding, and thus no longer need to be considered in the encoderโ€™s optimization. In this work, we ask whether this sub-optimality applies to the competitive analysis of the time-varying AVC model as well. In AVCs, the encoder can no longer rule out future channel states based on prior assumptions. Why then should an AVC encoder change its encoding statistics as time goes by? Rather surprisingly, we show that, even without additional feedback knowledge, the optimal AVC encoder may be required to modify its statistics as time goes by. In what follows, we introduce additional notation that allows us to formally state and then prove our main result.

We address the number of input distributions needed in code design to achieve an optimal competitive ratio. Formally, a uniform message M๐‘€Mitalic_M and an encoder map** E๐ธEitalic_E induce a distribution on the infinite ensemble of channel inputs ๐’ซโก(๐’ณโˆž)๐’ซsuperscript๐’ณ\operatorname{\mathcal{P}}(\operatorname{\mathcal{X}}^{\infty})caligraphic_P ( caligraphic_X start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT ). Computing the marginal distribution at each time step, we obtain a product distribution p(x1)p(x2)โ‹ฏโˆˆ๐’ซ(๐’ณ)โˆžp(x_{1})p(x_{2})\cdots\in\operatorname{\mathcal{P}}(\operatorname{\mathcal{X}}% )^{\infty}italic_p ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) italic_p ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) โ‹ฏ โˆˆ caligraphic_P ( caligraphic_X ) start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT. The product distribution does not characterize the code at hand (as different inputs may depend on each other) - but will suffice for our purposes. We ask how many different distributions in ๐’ซโก(๐’ณ)๐’ซ๐’ณ\operatorname{\mathcal{P}}(\operatorname{\mathcal{X}})caligraphic_P ( caligraphic_X ) are needed to achieve an optimal competitive ratio.

We define product distributions that alternate at โ„“โˆ’1โ„“1\ell-1roman_โ„“ - 1 points as follows:

Definition 1

The set ๐’ซโ„“โŠ‚๐’ซ(๐’ณ)โˆž\operatorname{\mathcal{P}}_{\ell}\subset\operatorname{\mathcal{P}}(% \operatorname{\mathcal{X}})^{\infty}caligraphic_P start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT โŠ‚ caligraphic_P ( caligraphic_X ) start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT includes all product distributions ๐ฉโˆˆ๐’ซ(๐’ณ)โˆž\operatorname*{\bf p}\in\operatorname{\mathcal{P}}(\operatorname{\mathcal{X}})% ^{\infty}bold_p โˆˆ caligraphic_P ( caligraphic_X ) start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT with ๐ฉ=p1n1,p2n2,โ€ฆ,pโ„“โˆ’1nโ„“โˆ’1,pโ„“โˆž๐ฉsuperscriptsubscript๐‘1subscript๐‘›1superscriptsubscript๐‘2subscript๐‘›2โ€ฆsuperscriptsubscript๐‘โ„“1subscript๐‘›โ„“1superscriptsubscript๐‘โ„“\operatorname*{\bf p}=p_{1}^{n_{1}},p_{2}^{n_{2}},\dots,p_{\ell-1}^{n_{\ell-1}% },p_{\ell}^{\infty}bold_p = italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , โ€ฆ , italic_p start_POSTSUBSCRIPT roman_โ„“ - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT roman_โ„“ - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , italic_p start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT where {p1,โ€ฆ,pโ„“}โŠ‚๐’ซโก(๐’ณ)subscript๐‘1โ€ฆsubscript๐‘โ„“๐’ซ๐’ณ\{p_{1},\dots,p_{\ell}\}\subset\operatorname{\mathcal{P}}(\operatorname{% \mathcal{X}}){ italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , โ€ฆ , italic_p start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT } โŠ‚ caligraphic_P ( caligraphic_X ) and n1,โ€ฆ,nโ„“โˆ’1subscript๐‘›1โ€ฆsubscript๐‘›โ„“1n_{1},\dots,n_{\ell-1}italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , โ€ฆ , italic_n start_POSTSUBSCRIPT roman_โ„“ - 1 end_POSTSUBSCRIPT are non-negative integers.

For example, if โ„“=1โ„“1\ell=1roman_โ„“ = 1, the channel input has the same marginal distribution on Xisubscript๐‘‹๐‘–X_{i}italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for all iโ‰ฅ1๐‘–1i\geq 1italic_i โ‰ฅ 1. Complementing Definitionย 1, the competitive ratio ๐‚๐‘โ„“subscript๐‚๐‘โ„“\operatorname*{\mathbf{CR}}_{\ell}bold_CR start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT is defined similarly to ๐‚๐‘๐‚๐‘\operatorname*{\mathbf{CR}}bold_CR but when the codes in use are restricted to have product distributions close to ๐’ซโ„“subscript๐’ซโ„“\operatorname{\mathcal{P}}_{\ell}caligraphic_P start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT (slackness is added to allow the slight variability implied by random code design). Specifically, for ฮด>0๐›ฟ0\delta>0italic_ฮด > 0 and a given k๐‘˜kitalic_k, a codebook E:[2k]โ†’๐’ณโˆž:๐ธโ†’delimited-[]superscript2๐‘˜superscript๐’ณE:[2^{k}]\rightarrow\operatorname{\mathcal{X}}^{\infty}italic_E : [ 2 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ] โ†’ caligraphic_X start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT is said to be ฮด๐›ฟ\deltaitalic_ฮด-close to ๐ฉ=p1,p2,โ€ฆ๐ฉsubscript๐‘1subscript๐‘2โ€ฆ\operatorname*{\bf p}=p_{1},p_{2},\dotsbold_p = italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , โ€ฆ if the marginal distributions {qi}i=1โˆžsuperscriptsubscriptsubscript๐‘ž๐‘–๐‘–1\{q_{i}\}_{i=1}^{\infty}{ italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT corresponding to the i๐‘–iitalic_iโ€™th codeword entry satisfy for every integer n๐‘›nitalic_n that โˆ‘i=1n๐šƒ๐š…โข(pi,qi)โ‰คฮดโขnsuperscriptsubscript๐‘–1๐‘›๐šƒ๐š…subscript๐‘๐‘–subscript๐‘ž๐‘–๐›ฟ๐‘›\sum_{i=1}^{n}{\tt TV}(p_{i},q_{i})\leq\delta nโˆ‘ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT typewriter_TV ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) โ‰ค italic_ฮด italic_n. Here, ๐šƒ๐š…๐šƒ๐š…{\tt TV}typewriter_TV is the total variation distance between distributions. A codebook E:[2k]โ†’๐’ณโˆž:๐ธโ†’delimited-[]superscript2๐‘˜superscript๐’ณE:[2^{k}]\rightarrow\operatorname{\mathcal{X}}^{\infty}italic_E : [ 2 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ] โ†’ caligraphic_X start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT is said to be ฮด๐›ฟ\deltaitalic_ฮด-close to ๐’ซโ„“subscript๐’ซโ„“\operatorname{\mathcal{P}}_{\ell}caligraphic_P start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT if it is ฮด๐›ฟ\deltaitalic_ฮด-close to some ๐ฉโˆˆ๐’ซโ„“๐ฉsubscript๐’ซโ„“\operatorname*{\bf p}\in\operatorname{\mathcal{P}}_{\ell}bold_p โˆˆ caligraphic_P start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT. Let ๐‚๐‘โ„“subscript๐‚๐‘โ„“\operatorname*{\mathbf{CR}}_{\ell}bold_CR start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT be defined similarly to ๐‚๐‘๐‚๐‘\operatorname*{\mathbf{CR}}bold_CR when restricted to codes with product distributions ฮด๐›ฟ\deltaitalic_ฮด-close to ๐’ซโ„“subscript๐’ซโ„“\operatorname{\mathcal{P}}_{\ell}caligraphic_P start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT for ฮดโ†’0โ†’๐›ฟ0\delta\rightarrow 0italic_ฮด โ†’ 0. We give a formal definition of ๐‚๐‘โ„“subscript๐‚๐‘โ„“\operatorname*{\mathbf{CR}}_{\ell}bold_CR start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT in Appendixย A. We now have that

๐‚๐‘1โ‰ค๐‚๐‘2โ‰คโ‹ฏโ‰ค๐‚๐‘.subscript๐‚๐‘1subscript๐‚๐‘2โ‹ฏ๐‚๐‘\displaystyle{\operatorname*{\mathbf{CR}}}_{1}\leq{\operatorname*{\mathbf{CR}}% }_{2}\leq\cdots\leq{\operatorname*{\mathbf{CR}}}.bold_CR start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT โ‰ค bold_CR start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT โ‰ค โ‹ฏ โ‰ค bold_CR . (4)

As stated earlier, traditional results on communication over memoryless channels and AVCs show that there exist optimal encoders (with corresponding product distributions) that lie in ๐’ซ1subscript๐’ซ1\mathcal{P}_{1}caligraphic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. For the compound channel, [30] show, on the one hand, that codes in ๐’ซ1subscript๐’ซ1\mathcal{P}_{1}caligraphic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT are sub-optimal for competitive measures, but, on the other, that codes in ๐’ซ|๐’ฎ|subscript๐’ซ๐’ฎ\operatorname{\mathcal{P}}_{|\mathcal{S}|}caligraphic_P start_POSTSUBSCRIPT | caligraphic_S | end_POSTSUBSCRIPT are sufficient for competitive optimality. The latter, for finite |๐’ฎ|๐’ฎ|\mathcal{S}|| caligraphic_S |, enables a single-letter characterization of ๐‚๐‘๐‚๐‘\operatorname*{\mathbf{CR}}bold_CR in the compound setting (as an optimization over |๐’ฎ|๐’ฎ|\mathcal{S}|| caligraphic_S | input distributions).

The work at hand asks whether ๐’ซ1subscript๐’ซ1\operatorname{\mathcal{P}}_{1}caligraphic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT suffices to achieve optimal competitive measures on AVCs. We show through an example that single-distribution codes do not suffice to achieve the competitive capacity ๐‚๐‘๐‚๐‘\operatorname*{\mathbf{CR}}bold_CR for AVCs. That is, ๐‚๐‘>๐‚๐‘1๐‚๐‘subscript๐‚๐‘1\operatorname*{\mathbf{CR}}>\operatorname*{\mathbf{CR}}_{1}bold_CR > bold_CR start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Even given the worst-case flavor of the AVC model, namely, given the fact that at every time step any channel can come next, the fact that for certain channel sequences, decoding can be done before others, leads to the realization that the encoder may benefit from different behaviors as time passes.

IV Main Result

In this section, we present our main result and the main steps to establish it.

Theorem 1

Single input distributions do not achieve the optimal competitive ratio in AVCs. That is, ๐‚๐‘1<๐‚๐‘subscript๐‚๐‘1๐‚๐‘\operatorname*{\mathbf{CR}}_{1}<\operatorname*{\mathbf{CR}}bold_CR start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < bold_CR.

The assertion implies that there exists AVCs for which any single-letter characterization of the optimal competitive ratio should include at least two channel input distributions. Whether there is an upper bound to the number of input distributions sufficient for optimal competitive codes, i.e., whether there exists โ„“<โˆžโ„“\ell<\inftyroman_โ„“ < โˆž such that ๐‚๐‘โ„“=๐‚๐‘subscript๐‚๐‘โ„“๐‚๐‘{\operatorname*{\mathbf{CR}}}_{\ell}=\operatorname*{\mathbf{CR}}bold_CR start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT = bold_CR, is left open in this work.

To prove Theorem 1, we show for a particular channel family the following chain of inequalities.

1/313\displaystyle 1/31 / 3 =(a)๐‚๐‘1<11/24โ‰ค(b)๐‚๐‘2โ‰ค๐‚๐‘โ‰ค(c)1/2.superscript๐‘Žabsentsubscript๐‚๐‘11124superscript๐‘subscript๐‚๐‘2๐‚๐‘superscript๐‘12\displaystyle\stackrel{{\scriptstyle(a)}}{{=}}{\operatorname*{\mathbf{CR}}}_{1% }<11/24\stackrel{{\scriptstyle(b)}}{{\leq}}{\operatorname*{\mathbf{CR}}}_{2}% \leq\operatorname*{\mathbf{CR}}\stackrel{{\scriptstyle(c)}}{{\leq}}1/2.start_RELOP SUPERSCRIPTOP start_ARG = end_ARG start_ARG ( italic_a ) end_ARG end_RELOP bold_CR start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < 11 / 24 start_RELOP SUPERSCRIPTOP start_ARG โ‰ค end_ARG start_ARG ( italic_b ) end_ARG end_RELOP bold_CR start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT โ‰ค bold_CR start_RELOP SUPERSCRIPTOP start_ARG โ‰ค end_ARG start_ARG ( italic_c ) end_ARG end_RELOP 1 / 2 . (5)

To prove Theorem 1, it is sufficient to prove steps (a)โˆ’(b)๐‘Ž๐‘(a)-(b)( italic_a ) - ( italic_b ) in (5). These steps are proved in Section V. Furthermore, we prove step (c)๐‘(c)( italic_c ) since this bound is quite close to the lower bound of step (b)๐‘(b)( italic_b ), and its proof method may be of independent interest.

The channel family we study to establish (5) is depicted in Fig. 1, and is discussed in detail in Section V. In the particular example we study, the channel state can be determined by the channel output. Thus, in our achievability schemes for Theoremย 1, we consider rateless codes for the AVC setting in which the decoder has full state information (DSI). Consequently, we employ Theoremย 2 stated below in the proof of Theoremย 1. While the proof of Theoremย 2 may follow from various modifications in prior works that study rateless AVC achievability in the presence of DSI [19, 20, 21], as our model is slightly different, a self-contained proof of Theoremย 2 appears in Appendixย B. In what follows, we consider codes ๐’žk๐™ณ๐š‚๐™ธ=(E,{Di}i=1โˆž,{Hi}i=1โˆž)subscriptsuperscript๐’ž๐™ณ๐š‚๐™ธ๐‘˜๐ธsuperscriptsubscriptsubscript๐ท๐‘–๐‘–1superscriptsubscriptsubscript๐ป๐‘–๐‘–1\mathcal{C}^{\tt DSI}_{k}=(E,\{D_{i}\}_{i=1}^{\infty},\{H_{i}\}_{i=1}^{\infty})caligraphic_C start_POSTSUPERSCRIPT typewriter_DSI end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = ( italic_E , { italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT , { italic_H start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT ) with decoder state information (DSI). That is, the decoder map**s depend causally on the channel outputs and states.

Theorem 2

Let ฮด>0๐›ฟ0\delta>0italic_ฮด > 0. Let k๐‘˜kitalic_k be sufficiently large. Let โ„“<โˆžโ„“\ell<\inftyroman_โ„“ < โˆž. Let ๐ฉ=p1n1,p2n2,โ€ฆ,pโ„“โˆ’1nโ„“โˆ’1,pโ„“โˆžโˆˆ๐’ซโ„“formulae-sequence๐ฉsuperscriptsubscript๐‘1subscript๐‘›1superscriptsubscript๐‘2subscript๐‘›2โ€ฆsuperscriptsubscript๐‘โ„“1subscript๐‘›โ„“1superscriptsubscript๐‘โ„“subscript๐’ซโ„“\operatorname*{\bf p}=p_{1}^{n_{1}},p_{2}^{n_{2}},\dots,p_{\ell-1}^{n_{\ell-1}% },p_{\ell}^{\infty}\in\operatorname{\mathcal{P}}_{\ell}bold_p = italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , โ€ฆ , italic_p start_POSTSUBSCRIPT roman_โ„“ - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT roman_โ„“ - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , italic_p start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT โˆˆ caligraphic_P start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT. Let ๐’ฒ={Ws}๐’ฒsubscript๐‘Š๐‘ \operatorname{\mathcal{W}}=\{W_{s}\}caligraphic_W = { italic_W start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT } be a channel family for which mini,sโกIsโข(X,Y)>0subscript๐‘–๐‘ subscript๐ผ๐‘ ๐‘‹๐‘Œ0\min_{i,s}I_{s}(X,Y)>0roman_min start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_X , italic_Y ) > 0, where for state s๐‘ sitalic_s and index i๐‘–iitalic_i, (Xi,Y)subscript๐‘‹๐‘–๐‘Œ(X_{i},Y)( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y ) is distributed according to piโข(x)โขWsโข(y|x)subscript๐‘๐‘–๐‘ฅsubscript๐‘Š๐‘ conditional๐‘ฆ๐‘ฅp_{i}(x)W_{s}(y|x)italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x ) italic_W start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_y | italic_x ). For ๐ฌโˆˆ๐’ฎโˆž๐ฌsuperscript๐’ฎ\operatorname*{\bf s}\in\operatorname{\mathcal{S}}^{\infty}bold_s โˆˆ caligraphic_S start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT let

ฯ„k(๐ฉ)โข(๐ฌ)=argโกminฯ„โก{โˆ‘t<ฯ„Istโข(Xt,Y)โ‰ฅk}.subscriptsuperscript๐œ๐ฉ๐‘˜๐ฌsubscript๐œsubscript๐‘ก๐œsubscript๐ผsubscript๐‘ ๐‘กsubscript๐‘‹๐‘ก๐‘Œ๐‘˜\tau^{({\bf p})}_{k}({\bf s})=\arg\min_{\tau}{\{\sum_{t<\tau}I_{s_{t}}(X_{t},Y% )\geq k\}}.italic_ฯ„ start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) = roman_arg roman_min start_POSTSUBSCRIPT italic_ฯ„ end_POSTSUBSCRIPT { โˆ‘ start_POSTSUBSCRIPT italic_t < italic_ฯ„ end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_Y ) โ‰ฅ italic_k } .

Then, there exists a code ๐’žk๐™ณ๐š‚๐™ธ=(E,{Di}i=1โˆž,{Hi}i=1โˆž)subscriptsuperscript๐’ž๐™ณ๐š‚๐™ธ๐‘˜๐ธsuperscriptsubscriptsubscript๐ท๐‘–๐‘–1superscriptsubscriptsubscript๐ป๐‘–๐‘–1\mathcal{C}^{\tt DSI}_{k}=(E,\{D_{i}\}_{i=1}^{\infty},\{H_{i}\}_{i=1}^{\infty})caligraphic_C start_POSTSUPERSCRIPT typewriter_DSI end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = ( italic_E , { italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT , { italic_H start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT ) that is ฮด๐›ฟ\deltaitalic_ฮด-close to ๐ฉ๐ฉ\operatorname*{\bf p}bold_p with decoding error probability ฯตkโ‰ค1ksubscriptitalic-ฯต๐‘˜1๐‘˜\epsilon_{k}\leq\frac{1}{k}italic_ฯต start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT โ‰ค divide start_ARG 1 end_ARG start_ARG italic_k end_ARG and competitive ratio at least

(1โˆ’ฮด)โขinf๐ฌโˆˆ๐’ฎโˆž(ฯ„kโˆ—โข(๐ฌ)ฯ„k(๐ฉ)โข(๐ฌ)),1๐›ฟsubscriptinfimum๐ฌsuperscript๐’ฎsubscriptsuperscript๐œ๐‘˜๐ฌsubscriptsuperscript๐œ๐ฉ๐‘˜๐ฌ(1-\delta)\inf_{{\bf s}\in\mathcal{S}^{\infty}}\left(\frac{\tau^{*}_{k}({\bf s% })}{\tau^{({\bf p})}_{k}({\bf s})}\right),( 1 - italic_ฮด ) roman_inf start_POSTSUBSCRIPT bold_s โˆˆ caligraphic_S start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( divide start_ARG italic_ฯ„ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG start_ARG italic_ฯ„ start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG ) ,

where ฯ„kโˆ—โข(๐ฌ)=argโกminฯ„โก{โˆ‘t<ฯ„Cโข(st)โ‰ฅk}superscriptsubscript๐œ๐‘˜๐ฌsubscript๐œsubscript๐‘ก๐œ๐ถsubscript๐‘ ๐‘ก๐‘˜\tau_{k}^{*}(\operatorname*{\bf s})=\arg\min_{\tau}\{\sum_{t<\tau}C(s_{t})\geq k\}italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT ( bold_s ) = roman_arg roman_min start_POSTSUBSCRIPT italic_ฯ„ end_POSTSUBSCRIPT { โˆ‘ start_POSTSUBSCRIPT italic_t < italic_ฯ„ end_POSTSUBSCRIPT italic_C ( italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) โ‰ฅ italic_k } with Cโข(s)=maxpโข(x)โกIsโข(X;Y)๐ถ๐‘ subscript๐‘๐‘ฅsubscript๐ผ๐‘ ๐‘‹๐‘ŒC(s)=\max_{p(x)}I_{s}(X;Y)italic_C ( italic_s ) = roman_max start_POSTSUBSCRIPT italic_p ( italic_x ) end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_X ; italic_Y ).

The intuition behind Theorem 2 is that a decoder with state information can successfully decode the message once the accumulated mutual information between the channel input and output surpasses the message entropy. The optimal stop** time, ฯ„kโˆ—โข(๐ฌ)superscriptsubscript๐œ๐‘˜๐ฌ\tau_{k}^{*}(\operatorname*{\bf s})italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT ( bold_s ), follows in a similar way when the encoder can use the distribution achieving capacity for each state stsubscript๐‘ ๐‘กs_{t}italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT.

V Proof of Theoremย 1

In this section, we present a family of channels such that the best competitive ratio with a single input distribution is ๐‚๐‘1=1/3subscript๐‚๐‘113\operatorname*{\mathbf{CR}}_{1}=1/3bold_CR start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 1 / 3, where one can obtain ๐‚๐‘2โ‰ฅ11/24โ‰ˆ0.46subscript๐‚๐‘211240.46\operatorname*{\mathbf{CR}}_{2}\geq 11/24\approx 0.46bold_CR start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT โ‰ฅ 11 / 24 โ‰ˆ 0.46 if two input distributions are allowed. This establishes the proof of Theorem 1. The proof of step (c)๐‘(c)( italic_c ) in (5), ๐‚๐‘โ‰ค1/2๐‚๐‘12\operatorname*{\mathbf{CR}}\leq 1/2bold_CR โ‰ค 1 / 2, appears in the appendix.

The channel family consists of two channels and is depicted in Figureย 1. Let ๐’ณ={1,2,3,4}๐’ณ1234\mathcal{X}=\{1,2,3,4\}caligraphic_X = { 1 , 2 , 3 , 4 } and ๐’ด={1,2,3,4,โŸ‚}๐’ด1234perpendicular-to\mathcal{Y}=\{1,2,3,4,\perp\}caligraphic_Y = { 1 , 2 , 3 , 4 , โŸ‚ }. In the first channel, W1subscript๐‘Š1W_{1}italic_W start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, if xโˆˆ{3,4}๐‘ฅ34x\in\{3,4\}italic_x โˆˆ { 3 , 4 } the output is y=x๐‘ฆ๐‘ฅy=xitalic_y = italic_x, and if xโˆˆ{1,2}๐‘ฅ12x\in\{1,2\}italic_x โˆˆ { 1 , 2 } the output y๐‘ฆyitalic_y has a uniform distribution on {3,4}34\{3,4\}{ 3 , 4 }.

In the second channel, W2subscript๐‘Š2W_{2}italic_W start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, if xโˆˆ{1,2}๐‘ฅ12x\in\{1,2\}italic_x โˆˆ { 1 , 2 } the output y๐‘ฆyitalic_y is y=x๐‘ฆ๐‘ฅy=xitalic_y = italic_x with probability 0.50.50.50.5 and y=โŸ‚๐‘ฆperpendicular-toy=\perpitalic_y = โŸ‚ with probability 0.50.50.50.5. For xโˆˆ{3,4}๐‘ฅ34x\in\{3,4\}italic_x โˆˆ { 3 , 4 }, channel W2subscript๐‘Š2W_{2}italic_W start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT sets y๐‘ฆyitalic_y to be uniform in {1,2}12\{1,2\}{ 1 , 2 } with probability 0.50.50.50.5 and y=โŸ‚๐‘ฆperpendicular-toy=\perpitalic_y = โŸ‚ with probability 0.50.50.50.5. The channel W2subscript๐‘Š2W_{2}italic_W start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT is represented as the concatenation of two channels, a first channel followed by an erasure channel (EC). We will use the fact that the channel capacities are C1=1subscript๐ถ11C_{1}=1italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 1 and C2=0.5subscript๐ถ20.5C_{2}=0.5italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 0.5, respectively.

Refer to caption
Figure 1: The channels W1subscript๐‘Š1W_{1}italic_W start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and W2subscript๐‘Š2W_{2}italic_W start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT (top). R1โข(p)subscript๐‘…1๐‘R_{1}(p)italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_p ) vs. R2โข(p)subscript๐‘…2๐‘R_{2}(p)italic_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_p ) (bottom).

The example is a modified version of the โ€œbilingual speakerโ€ example from [33, 34], and was also studied in our prior work on the competitive analysis of compound channels [30]. Note that the encoder is uncertain about the state sequence, but the decoder can deduce the channel state from the channel output since yโˆˆ{3,4}๐‘ฆ34y\in\{3,4\}italic_y โˆˆ { 3 , 4 } if and only if the first channel is used. Thus, we can utilize Theorem 2 that addresses code analysis in the setting of DSI.

For code design, by symmetry, it is sufficient to consider for pโˆˆ[0,1]๐‘01p\in[0,1]italic_p โˆˆ [ 0 , 1 ] input distributions X๐‘‹Xitalic_X that satisfy

Prโก(X=1)Pr๐‘‹1\displaystyle\Pr(X=1)roman_Pr ( italic_X = 1 ) =Prโก(X=2)=p2โขabsentPr๐‘‹2๐‘2absent\displaystyle=\Pr(X=2)=\frac{p}{2}\operatorname*{}= roman_Pr ( italic_X = 2 ) = divide start_ARG italic_p end_ARG start_ARG 2 end_ARG start_OPERATOR end_OPERATOR
Prโก(X=3)Pr๐‘‹3\displaystyle\Pr(X=3)roman_Pr ( italic_X = 3 ) =Prโก(X=4)=1โˆ’p2.absentPr๐‘‹41๐‘2\displaystyle=\Pr(X=4)=\frac{1-p}{2}.= roman_Pr ( italic_X = 4 ) = divide start_ARG 1 - italic_p end_ARG start_ARG 2 end_ARG . (6)

The achievable rates for W1subscript๐‘Š1W_{1}italic_W start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and W2subscript๐‘Š2W_{2}italic_W start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT for single-distribution codes governed by such X๐‘‹Xitalic_X equal R1โข(p)=1โˆ’psubscript๐‘…1๐‘1๐‘R_{1}(p)=1-pitalic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_p ) = 1 - italic_p and R2โข(p)=p2subscript๐‘…2๐‘๐‘2R_{2}(p)=\frac{p}{2}italic_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_p ) = divide start_ARG italic_p end_ARG start_ARG 2 end_ARG as depicted in Figureย 1.

By Theoremย 2, there exists a communication scheme that allows successful decoding once the cumulative mutual information exceeds the message length. Roughly speaking, decoding is successful after

ฯ„k(๐ฉ)โข(๐ฌ)=argโกminฯ„โก{โˆ‘t<ฯ„Istโข(Xt,Y)โ‰ฅk},subscriptsuperscript๐œ๐ฉ๐‘˜๐ฌsubscript๐œsubscript๐‘ก๐œsubscript๐ผsubscript๐‘ ๐‘กsubscript๐‘‹๐‘ก๐‘Œ๐‘˜\displaystyle\tau^{({\bf p})}_{k}({\bf s})=\arg\min_{\tau}{\{\sum_{t<\tau}I_{s% _{t}}(X_{t},Y)\geq k\}},italic_ฯ„ start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) = roman_arg roman_min start_POSTSUBSCRIPT italic_ฯ„ end_POSTSUBSCRIPT { โˆ‘ start_POSTSUBSCRIPT italic_t < italic_ฯ„ end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_Y ) โ‰ฅ italic_k } , (7)

where Xtsubscript๐‘‹๐‘กX_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT corresponds to the distribution of the input at time t๐‘กtitalic_t and can be parameterized by pโˆˆ[0,1]๐‘01p\in[0,1]italic_p โˆˆ [ 0 , 1 ]. We will also utilize the optimal decoding time ฯ„kโˆ—โข(๐ฌ)superscriptsubscript๐œ๐‘˜๐ฌ\tau_{k}^{*}(\operatorname*{\bf s})italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT ( bold_s ) defined in Theorem 2.

V-A Proof of ๐‚๐‘1=1/3subscript๐‚๐‘113\operatorname*{\mathbf{CR}}_{1}=1/3bold_CR start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 1 / 3:

A lower bound is achieved by setting the channel input distribution in (V) with p=2/3๐‘23p=2/3italic_p = 2 / 3. This implies that the mutual information under both channels is I1โข(X;Y)=I2โข(X;Y)=1/3subscript๐ผ1๐‘‹๐‘Œsubscript๐ผ2๐‘‹๐‘Œ13I_{1}(X;Y)=I_{2}(X;Y)=1/3italic_I start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_X ; italic_Y ) = italic_I start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_X ; italic_Y ) = 1 / 3 and therefore ฯ„k(๐ฉ)โข(๐ฌ)=3โขksuperscriptsubscript๐œ๐‘˜๐ฉ๐ฌ3๐‘˜\tau_{k}^{({\bf p})}(\operatorname*{\bf s})=3kitalic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT ( bold_s ) = 3 italic_k for all ๐ฌ๐ฌ\operatorname*{\bf s}bold_s. The competitive ratio is thus bounded, for any ฮด>0๐›ฟ0\delta>0italic_ฮด > 0, by ๐‚๐‘โ‰ฅ(1โˆ’ฮด)โขlimkโ†’โˆžinf๐ฌโˆˆ๐’ฎโˆžฯ„kโˆ—โข(๐ฌ)3โขkโ‰ฅ1โˆ’ฮด3๐‚๐‘1๐›ฟsubscriptโ†’๐‘˜subscriptinfimum๐ฌsuperscript๐’ฎsuperscriptsubscript๐œ๐‘˜๐ฌ3๐‘˜1๐›ฟ3\operatorname*{\mathbf{CR}}\geq(1-\delta)\lim_{k\to\infty}\inf_{\operatorname*% {\bf s}\in\mathcal{S}^{\infty}}\frac{\tau_{k}^{*}(\operatorname*{\bf s})}{3k}% \geq\frac{1-\delta}{3}bold_CR โ‰ฅ ( 1 - italic_ฮด ) roman_lim start_POSTSUBSCRIPT italic_k โ†’ โˆž end_POSTSUBSCRIPT roman_inf start_POSTSUBSCRIPT bold_s โˆˆ caligraphic_S start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT end_POSTSUBSCRIPT divide start_ARG italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT ( bold_s ) end_ARG start_ARG 3 italic_k end_ARG โ‰ฅ divide start_ARG 1 - italic_ฮด end_ARG start_ARG 3 end_ARG where the equality follows by setting the prefix of ๐ฌ๐ฌ\operatorname*{\bf s}bold_s to 1ksuperscript1๐‘˜1^{k}1 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT since it minimizes the optimal decoding time.

To prove the upper bound on ๐‚๐‘1subscript๐‚๐‘1{\operatorname*{\mathbf{CR}}}_{1}bold_CR start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, it suffices to consider the inequality

limkโ†’โˆžmax๐ฉโˆˆ๐’ซ1โกmin๐ฌโˆˆ๐’ฎโˆžโก(ฯ„kโˆ—โข(๐ฌ)ฯ„k(๐ฉ)โข(๐ฌ))โ‰คlimkโ†’โˆžmax๐ฉโˆˆ๐’ซ1โกmin๐ฌโˆˆ๐’ฎ^โˆžโก(ฯ„kโˆ—โข(๐ฌ)ฯ„k(๐ฉ)โข(๐ฌ)),subscriptโ†’๐‘˜subscript๐ฉsubscript๐’ซ1subscript๐ฌsuperscript๐’ฎsubscriptsuperscript๐œ๐‘˜๐ฌsubscriptsuperscript๐œ๐ฉ๐‘˜๐ฌsubscriptโ†’๐‘˜subscript๐ฉsubscript๐’ซ1subscript๐ฌsuperscript^๐’ฎsubscriptsuperscript๐œ๐‘˜๐ฌsubscriptsuperscript๐œ๐ฉ๐‘˜๐ฌ\displaystyle\lim_{k\to\infty}\max_{\operatorname*{\bf p}\in\mathcal{P}_{1}}% \min_{\operatorname*{\bf s}\in\mathcal{S}^{\infty}}\left(\frac{\tau^{*}_{k}({% \bf s})}{\tau^{({\bf p})}_{k}({\bf s})}\right)\leq\lim_{k\to\infty}\max_{% \operatorname*{\bf p}\in\mathcal{P}_{1}}\min_{\operatorname*{\bf s}\in\widehat% {\mathcal{S}}^{\infty}}\left(\frac{\tau^{*}_{k}({\bf s})}{\tau^{({\bf p})}_{k}% ({\bf s})}\right),roman_lim start_POSTSUBSCRIPT italic_k โ†’ โˆž end_POSTSUBSCRIPT roman_max start_POSTSUBSCRIPT bold_p โˆˆ caligraphic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_min start_POSTSUBSCRIPT bold_s โˆˆ caligraphic_S start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( divide start_ARG italic_ฯ„ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG start_ARG italic_ฯ„ start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG ) โ‰ค roman_lim start_POSTSUBSCRIPT italic_k โ†’ โˆž end_POSTSUBSCRIPT roman_max start_POSTSUBSCRIPT bold_p โˆˆ caligraphic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_min start_POSTSUBSCRIPT bold_s โˆˆ over^ start_ARG caligraphic_S end_ARG start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( divide start_ARG italic_ฯ„ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG start_ARG italic_ฯ„ start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG ) , (8)

for ๐’ฎ^โˆžโŠ†๐’ฎโˆžsuperscript^๐’ฎsuperscript๐’ฎ\widehat{\mathcal{S}}^{\infty}\subseteq\mathcal{S}^{\infty}over^ start_ARG caligraphic_S end_ARG start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT โŠ† caligraphic_S start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT of our choice. Bounding ๐‚๐‘1subscript๐‚๐‘1\operatorname*{\mathbf{CR}}_{1}bold_CR start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT by the left expression in (8) roughly follows from standard capacity bounds, e.g., [30, Section VI]. For the right expression, we analyze the setting ๐’ฎ^โˆž={2โˆž,1kโข2โˆž}superscript^๐’ฎsuperscript2superscript1๐‘˜superscript2\widehat{\mathcal{S}}^{\infty}=\{2^{\infty},1^{k}2^{\infty}\}over^ start_ARG caligraphic_S end_ARG start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT = { 2 start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT , 1 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT 2 start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT }.

For the first sequence ๐ฌ1=2โˆžsubscript๐ฌ1superscript2\operatorname*{\bf s}_{1}=2^{\infty}bold_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 2 start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT, the optimal decoding time is ฯ„kโˆ—โข(๐ฌ1)=2โขksuperscriptsubscript๐œ๐‘˜subscript๐ฌ12๐‘˜\tau_{k}^{*}(\operatorname*{\bf s}_{1})=2kitalic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT ( bold_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) = 2 italic_k. For this sequence, utilizing a code with a single distribution pโ‰ 0๐‘0p\neq 0italic_p โ‰  0, we have ฯ„k(๐ฉ)โข(๐ฌ1)=2โขk/psubscriptsuperscript๐œ๐ฉ๐‘˜subscript๐ฌ12๐‘˜๐‘\tau^{(\operatorname*{\bf p})}_{k}(\operatorname*{\bf s}_{1})=2k/pitalic_ฯ„ start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) = 2 italic_k / italic_p (We can assume pโ‰ 0๐‘0p\neq 0italic_p โ‰  0 since otherwise the decoding time for ๐ฌ1subscript๐ฌ1\operatorname*{\bf s}_{1}bold_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is infinite). For the second sequence, ๐ฌ2=1kโข2โˆžsubscript๐ฌ2superscript1๐‘˜superscript2\operatorname*{\bf s}_{2}=1^{k}2^{\infty}bold_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 1 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT 2 start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT, we have ฯ„kโˆ—โข(๐ฌ2)=ksuperscriptsubscript๐œ๐‘˜subscript๐ฌ2๐‘˜\tau_{k}^{*}(\operatorname*{\bf s}_{2})=kitalic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT ( bold_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) = italic_k and, for pโ‰ 0๐‘0p\neq 0italic_p โ‰  0, ฯ„k(๐ฉ)โข(๐ฌ2)=3โขksubscriptsuperscript๐œ๐ฉ๐‘˜subscript๐ฌ23๐‘˜\tau^{(\operatorname*{\bf p})}_{k}(\operatorname*{\bf s}_{2})=3kitalic_ฯ„ start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) = 3 italic_k. Combining the two ratios that correspond to the different sequences and taking limits over k๐‘˜kitalic_k we obtain

๐‚๐‘1subscript๐‚๐‘1\displaystyle{\operatorname*{\mathbf{CR}}}_{1}bold_CR start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT โ‰คmaxpโˆˆ(0,1]โกminโก{p,1/3}=1/3.absentsubscript๐‘01๐‘1313\displaystyle\leq\max_{p\in(0,1]}\min\{p,1/3\}=1/3.โ‰ค roman_max start_POSTSUBSCRIPT italic_p โˆˆ ( 0 , 1 ] end_POSTSUBSCRIPT roman_min { italic_p , 1 / 3 } = 1 / 3 . (9)

V-B Proof of ๐‚๐‘2โ‰ฅ11/24subscript๐‚๐‘21124\operatorname*{\mathbf{CR}}_{2}\geq 11/24bold_CR start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT โ‰ฅ 11 / 24:

The prove the assertion, we present a family of codes with two input distributions and analyze their performance. We consider codes that utilize p1โ‰ค2/3subscript๐‘123p_{1}\leq 2/3italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT โ‰ค 2 / 3 until time tโขk๐‘ก๐‘˜tkitalic_t italic_k and p2=2/3subscript๐‘223p_{2}=2/3italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 2 / 3 afterwards. Without loss of optimality, we consider 1โ‰คtโ‰ค21๐‘ก21\leq t\leq 21 โ‰ค italic_t โ‰ค 2 since the minimal and maximal optimal stop** times are within this range. The optimal parameters will be shown to be p1=10/33subscript๐‘11033p_{1}=10/33italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 10 / 33 and t=3/2๐‘ก32t=3/2italic_t = 3 / 2.

For any state sequence ๐ฌโˆˆ๐’ฎโˆž๐ฌsuperscript๐’ฎ\operatorname*{\bf s}\in\mathcal{S}^{\infty}bold_s โˆˆ caligraphic_S start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT, let its optimal decoding time be ฯ„kโˆ—โข(๐ฌ)=rโขksuperscriptsubscript๐œ๐‘˜๐ฌ๐‘Ÿ๐‘˜\tau_{k}^{*}(\operatorname*{\bf s})=rkitalic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT ( bold_s ) = italic_r italic_k where rโˆˆ[1,2]๐‘Ÿ12r\in[1,2]italic_r โˆˆ [ 1 , 2 ] and depends on the underlying state sequence.111With some abuse of notation, throughout the analysis, we omit sub-linear terms of k๐‘˜kitalic_k that do not affect the competitive ratio analysis. Let ฮฑ๐›ผ\alphaitalic_ฮฑ be the fraction of 1โ€ฒโขssuperscript1โ€ฒ๐‘ 1^{\prime}s1 start_POSTSUPERSCRIPT โ€ฒ end_POSTSUPERSCRIPT italic_s in ๐ฌrโขksuperscript๐ฌ๐‘Ÿ๐‘˜\operatorname*{\bf s}^{rk}bold_s start_POSTSUPERSCRIPT italic_r italic_k end_POSTSUPERSCRIPT, and (1โˆ’ฮฑ)1๐›ผ(1-\alpha)( 1 - italic_ฮฑ ) be the fraction of 2โ€™s in ๐ฌrโขksuperscript๐ฌ๐‘Ÿ๐‘˜\operatorname*{\bf s}^{rk}bold_s start_POSTSUPERSCRIPT italic_r italic_k end_POSTSUPERSCRIPT. By the accumulation of mutual information, it holds that k=ฮฑโขrโขkโขC1+(1โˆ’ฮฑ)โขrโขkโขC2๐‘˜๐›ผ๐‘Ÿ๐‘˜subscript๐ถ11๐›ผ๐‘Ÿ๐‘˜subscript๐ถ2k=\alpha rkC_{1}+(1-\alpha)rkC_{2}italic_k = italic_ฮฑ italic_r italic_k italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + ( 1 - italic_ฮฑ ) italic_r italic_k italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT where C1=1,C2=0.5formulae-sequencesubscript๐ถ11subscript๐ถ20.5C_{1}=1,C_{2}=0.5italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 1 , italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 0.5 are the capacities of the channels, respectively; or in other words ฮฑโขr=2โˆ’r๐›ผ๐‘Ÿ2๐‘Ÿ\alpha r=2-ritalic_ฮฑ italic_r = 2 - italic_r. Let ฮฒโ‰ฅ1๐›ฝ1\beta\geq 1italic_ฮฒ โ‰ฅ 1 be a constant that is independent of ๐ฌ๐ฌ\operatorname*{\bf s}bold_s and satisfies ฯ„kโข(๐ฌ)โ‰ฅฮฒโขrโขksubscript๐œ๐‘˜๐ฌ๐›ฝ๐‘Ÿ๐‘˜\tau_{k}(\operatorname*{\bf s})\geq\beta rkitalic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) โ‰ฅ italic_ฮฒ italic_r italic_k for all ๐ฌ๐ฌ\operatorname*{\bf s}bold_s. Our objective is to find the least ฮฒ๐›ฝ\betaitalic_ฮฒ that satisfies the inequality for all ๐ฌ๐ฌ\operatorname*{\bf s}bold_s since it directly implies ๐‚๐‘2โ‰ฅ1ฮฒsubscript๐‚๐‘21๐›ฝ\operatorname*{\mathbf{CR}}_{2}\geq\frac{1}{\beta}bold_CR start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT โ‰ฅ divide start_ARG 1 end_ARG start_ARG italic_ฮฒ end_ARG.

We proceed to the analysis broken to sub-cases based on the value of rโˆˆ[1,2]๐‘Ÿ12r\in[1,2]italic_r โˆˆ [ 1 , 2 ].

Case 1: rโˆˆ[1,t]๐‘Ÿ1๐‘กr\in[1,t]italic_r โˆˆ [ 1 , italic_t ]. The range of r๐‘Ÿritalic_r implies the ordering 1โ‰คrโ‰คtโ‰คฮฒโขr1๐‘Ÿ๐‘ก๐›ฝ๐‘Ÿ1\leq r\leq t\leq\beta r1 โ‰ค italic_r โ‰ค italic_t โ‰ค italic_ฮฒ italic_r.222The case ฮฒโขrโ‰คt๐›ฝ๐‘Ÿ๐‘ก\beta r\leq titalic_ฮฒ italic_r โ‰ค italic_t is not possible; this can only occur if tโ‰ฅฮฒโ‰ฅ1๐‘ก๐›ฝ1t\geq\beta\geq 1italic_t โ‰ฅ italic_ฮฒ โ‰ฅ 1, but this contradicts our assumption that tโ‰ค2๐‘ก2t\leq 2italic_t โ‰ค 2 combined with our upper bound ๐‚๐‘โ‰ค1/2๐‚๐‘12\operatorname*{\mathbf{CR}}\leq 1/2bold_CR โ‰ค 1 / 2. That is, the alternating distribution point tโขk๐‘ก๐‘˜tkitalic_t italic_k lies between the stop** time and the optimal stop** time.

Let ๐ฌrโขksubscript๐ฌ๐‘Ÿ๐‘˜\operatorname*{\bf s}_{rk}bold_s start_POSTSUBSCRIPT italic_r italic_k end_POSTSUBSCRIPT be the prefix of ๐ฌ๐ฌ\operatorname*{\bf s}bold_s up to length rโขk๐‘Ÿ๐‘˜rkitalic_r italic_k. By our assumption that p1โ‰ค2/3subscript๐‘123p_{1}\leq 2/3italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT โ‰ค 2 / 3, we can consider ๐ฌrโขk2โˆžsubscript๐ฌ๐‘Ÿ๐‘˜superscript2\operatorname*{\bf s}_{rk}2^{\infty}bold_s start_POSTSUBSCRIPT italic_r italic_k end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT instead of ๐ฌ๐ฌ\operatorname*{\bf s}bold_s since this sequence has the largest stop** time. We have by the accumulation of the mutual information in (7)

k๐‘˜\displaystyle kitalic_k โ‰ฅ(1โˆ’ฮฑ)โขrโขkโขp12+ฮฑโขrโขkโข(1โˆ’p1)+(tโขkโˆ’rโขk)โขp12+ฯ„kโข(๐ฌ)โˆ’tโขk3โขabsent1๐›ผ๐‘Ÿ๐‘˜subscript๐‘12๐›ผ๐‘Ÿ๐‘˜1subscript๐‘1๐‘ก๐‘˜๐‘Ÿ๐‘˜subscript๐‘12subscript๐œ๐‘˜๐ฌ๐‘ก๐‘˜3absent\displaystyle\geq(1-\alpha)rk\frac{p_{1}}{2}+\alpha rk(1-p_{1})+(tk-rk)\frac{p% _{1}}{2}+\frac{\tau_{k}(\operatorname*{\bf s})-tk}{3}\operatorname*{}โ‰ฅ ( 1 - italic_ฮฑ ) italic_r italic_k divide start_ARG italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG + italic_ฮฑ italic_r italic_k ( 1 - italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) + ( italic_t italic_k - italic_r italic_k ) divide start_ARG italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG + divide start_ARG italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) - italic_t italic_k end_ARG start_ARG 3 end_ARG start_OPERATOR end_OPERATOR
โ‰ฅ(1โˆ’ฮฑ)โขrโขkโขp12+ฮฑโขrโขkโข(1โˆ’p1)+(tโขkโˆ’rโขk)โขp12+ฮฒโขrโขkโˆ’tโขk3,absent1๐›ผ๐‘Ÿ๐‘˜subscript๐‘12๐›ผ๐‘Ÿ๐‘˜1subscript๐‘1๐‘ก๐‘˜๐‘Ÿ๐‘˜subscript๐‘12๐›ฝ๐‘Ÿ๐‘˜๐‘ก๐‘˜3\displaystyle\geq(1-\alpha)rk\frac{p_{1}}{2}+\alpha rk(1-p_{1})+(tk-rk)\frac{p% _{1}}{2}+\frac{\beta rk-tk}{3},\operatorname*{}โ‰ฅ ( 1 - italic_ฮฑ ) italic_r italic_k divide start_ARG italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG + italic_ฮฑ italic_r italic_k ( 1 - italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) + ( italic_t italic_k - italic_r italic_k ) divide start_ARG italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG + divide start_ARG italic_ฮฒ italic_r italic_k - italic_t italic_k end_ARG start_ARG 3 end_ARG , start_OPERATOR end_OPERATOR

which implies ฮฒโ‰ค18โขp1+6โขr+2โขtโˆ’9โขp1โขrโˆ’3โขp1โขtโˆ’62โขr๐›ฝ18subscript๐‘16๐‘Ÿ2๐‘ก9subscript๐‘1๐‘Ÿ3subscript๐‘1๐‘ก62๐‘Ÿ\beta\leq\frac{18p_{1}+6r+2t-9p_{1}r-3p_{1}t-6}{2r}italic_ฮฒ โ‰ค divide start_ARG 18 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 6 italic_r + 2 italic_t - 9 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_r - 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_t - 6 end_ARG start_ARG 2 italic_r end_ARG. This inequality holds for all state sequences such that their corresponding r๐‘Ÿritalic_r satisfies rโˆˆ[1,t]๐‘Ÿ1๐‘กr\in[1,t]italic_r โˆˆ [ 1 , italic_t ], so we proceed to maximize the upper bound by maximizing it over r๐‘Ÿritalic_r.

The bound is increasing (in r๐‘Ÿritalic_r) if โˆ’18โขp1+2โขtโˆ’3โขp1โขtโˆ’6โ‰ฅ018subscript๐‘12๐‘ก3subscript๐‘1๐‘ก60-18p_{1}+2t-3p_{1}t-6\geq 0- 18 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 2 italic_t - 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_t - 6 โ‰ฅ 0 and is decreasing otherwise. That means that if the condition is satisfied, the bound is maximizes at rโˆ—=tsuperscript๐‘Ÿ๐‘กr^{*}=titalic_r start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT = italic_t resulting in ฮฒโ‰ค9โขp1+4โขtโˆ’6โขp1โขtโˆ’3t๐›ฝ9subscript๐‘14๐‘ก6subscript๐‘1๐‘ก3๐‘ก\beta\leq\frac{9p_{1}+4t-6p_{1}t-3}{t}italic_ฮฒ โ‰ค divide start_ARG 9 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 4 italic_t - 6 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_t - 3 end_ARG start_ARG italic_t end_ARG. If the condition is not satisfied, the upper bound is maximized at rโˆ—=1superscript๐‘Ÿ1r^{*}=1italic_r start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT = 1 resulting in ฮฒโ‰ค9โขp1+2โขtโˆ’3โขp1โขt2๐›ฝ9subscript๐‘12๐‘ก3subscript๐‘1๐‘ก2\beta\leq\frac{9p_{1}+2t-3p_{1}t}{2}italic_ฮฒ โ‰ค divide start_ARG 9 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 2 italic_t - 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_t end_ARG start_ARG 2 end_ARG. It is easy to verify that the condition is not satisfied for tโˆˆ[1,2]๐‘ก12t\in[1,2]italic_t โˆˆ [ 1 , 2 ], and so we will use the second bound.

Case 2: rโˆˆ[t,t+22]๐‘Ÿ๐‘ก๐‘ก22r\in[t,\frac{t+2}{2}]italic_r โˆˆ [ italic_t , divide start_ARG italic_t + 2 end_ARG start_ARG 2 end_ARG ]. The range of r๐‘Ÿritalic_r implies that the number of 2โ€™s can be contained within [0,tโขk]0๐‘ก๐‘˜[0,tk][ 0 , italic_t italic_k ]. That is, rโ‰คt+22โ‡”(1โˆ’ฮฑ)โขrโขkโ‰คtโขkiff๐‘Ÿ๐‘ก221๐›ผ๐‘Ÿ๐‘˜๐‘ก๐‘˜r\leq\frac{t+2}{2}\iff(1-\alpha)rk\leq tkitalic_r โ‰ค divide start_ARG italic_t + 2 end_ARG start_ARG 2 end_ARG โ‡” ( 1 - italic_ฮฑ ) italic_r italic_k โ‰ค italic_t italic_k. This now implies that the worst sequence (for this case) is the one starting with 2222โ€™s, which in turn implies that:

11\displaystyle 11 โ‰ฅ(1โˆ’ฮฑ)โขrโขp12+(tโˆ’(1โˆ’ฮฑ)โขr)โข(1โˆ’p1)+ฮฒโขrโˆ’t3,absent1๐›ผ๐‘Ÿsubscript๐‘12๐‘ก1๐›ผ๐‘Ÿ1subscript๐‘1๐›ฝ๐‘Ÿ๐‘ก3\displaystyle\geq(1-\alpha)r\frac{p_{1}}{2}+(t-(1-\alpha)r)(1-p_{1})+\frac{% \beta r-t}{3},โ‰ฅ ( 1 - italic_ฮฑ ) italic_r divide start_ARG italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG + ( italic_t - ( 1 - italic_ฮฑ ) italic_r ) ( 1 - italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) + divide start_ARG italic_ฮฒ italic_r - italic_t end_ARG start_ARG 3 end_ARG , (10)

which implies ฮฒโ‰ค9โขp1+6โขrโˆ’2โขtโˆ’9โขp1โขr+3โขp1โขtโˆ’3r๐›ฝ9subscript๐‘16๐‘Ÿ2๐‘ก9subscript๐‘1๐‘Ÿ3subscript๐‘1๐‘ก3๐‘Ÿ\beta\leq\frac{9p_{1}+6r-2t-9p_{1}r+3p_{1}t-3}{r}italic_ฮฒ โ‰ค divide start_ARG 9 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 6 italic_r - 2 italic_t - 9 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_r + 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_t - 3 end_ARG start_ARG italic_r end_ARG.

The bound is increasing if t+(t+3)โข(1โˆ’3โขp1)โ‰ฅ0๐‘ก๐‘ก313subscript๐‘10t+(t+3)(1-3p_{1})\geq 0italic_t + ( italic_t + 3 ) ( 1 - 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) โ‰ฅ 0 implying that rโˆ—=tsuperscript๐‘Ÿ๐‘กr^{*}=titalic_r start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT = italic_t and ฮฒโ‰ค9โขp1+4โขtโˆ’6โขp1โขtโˆ’3t๐›ฝ9subscript๐‘14๐‘ก6subscript๐‘1๐‘ก3๐‘ก\beta\leq\frac{9p_{1}+4t-6p_{1}t-3}{t}italic_ฮฒ โ‰ค divide start_ARG 9 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 4 italic_t - 6 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_t - 3 end_ARG start_ARG italic_t end_ARG, and if the condition is not satisfied, rโˆ—=t+22superscript๐‘Ÿ๐‘ก22r^{*}=\frac{t+2}{2}italic_r start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT = divide start_ARG italic_t + 2 end_ARG start_ARG 2 end_ARG is the maximizer and we have ฮฒโ‰ค2โขtโˆ’3โขp1โขt+6t+2๐›ฝ2๐‘ก3subscript๐‘1๐‘ก6๐‘ก2\beta\leq\frac{2t-3p_{1}t+6}{t+2}italic_ฮฒ โ‰ค divide start_ARG 2 italic_t - 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_t + 6 end_ARG start_ARG italic_t + 2 end_ARG.

Case 3: rโˆˆ[t+22,2]๐‘Ÿ๐‘ก222r\in[\frac{t+2}{2},2]italic_r โˆˆ [ divide start_ARG italic_t + 2 end_ARG start_ARG 2 end_ARG , 2 ]. In this range, we have (1โˆ’ฮฑ)โขrโ‰ฅt1๐›ผ๐‘Ÿ๐‘ก(1-\alpha)r\geq t( 1 - italic_ฮฑ ) italic_r โ‰ฅ italic_t so that, in the worst state-sequence for this case, the number of 2โ€™s occupies [0,t]0๐‘ก[0,t][ 0 , italic_t ]:

11\displaystyle 11 โ‰ฅtโขp12+ฮฒโขrโˆ’t3.absent๐‘กsubscript๐‘12๐›ฝ๐‘Ÿ๐‘ก3\displaystyle\geq t\frac{p_{1}}{2}+\frac{\beta r-t}{3}.โ‰ฅ italic_t divide start_ARG italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG + divide start_ARG italic_ฮฒ italic_r - italic_t end_ARG start_ARG 3 end_ARG . (11)

This implies ฮฒโ‰ค2โขtโˆ’3โขp1โขt+62โขr๐›ฝ2๐‘ก3subscript๐‘1๐‘ก62๐‘Ÿ\beta\leq\frac{2t-3p_{1}t+6}{2r}italic_ฮฒ โ‰ค divide start_ARG 2 italic_t - 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_t + 6 end_ARG start_ARG 2 italic_r end_ARG and is minimized at rโˆ—=t+22superscript๐‘Ÿ๐‘ก22r^{*}=\frac{t+2}{2}italic_r start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT = divide start_ARG italic_t + 2 end_ARG start_ARG 2 end_ARG so that ฮฒโ‰ค2โขtโˆ’3โขp1โขt+6t+2๐›ฝ2๐‘ก3subscript๐‘1๐‘ก6๐‘ก2\beta\leq\frac{2t-3p_{1}t+6}{t+2}italic_ฮฒ โ‰ค divide start_ARG 2 italic_t - 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_t + 6 end_ARG start_ARG italic_t + 2 end_ARG.

Summary: We need to combine the different cases and choose the largest upper bound among these. If the condition of Case 2222 is satisfied, we obtain

ฮฒโ‰คmintโˆˆ[1,2],p1โ‰ค2/3โกmax๐›ฝsubscriptformulae-sequence๐‘ก12subscript๐‘123\displaystyle\beta\leq\min_{t\in[1,2],p_{1}\leq 2/3}\maxitalic_ฮฒ โ‰ค roman_min start_POSTSUBSCRIPT italic_t โˆˆ [ 1 , 2 ] , italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT โ‰ค 2 / 3 end_POSTSUBSCRIPT roman_max {9โขp1+2โขtโˆ’3โขp1โขt2,9โขp1+4โขtโˆ’6โขp1โขtโˆ’3t,\displaystyle\left\{\frac{9p_{1}+2t-3p_{1}t}{2},\frac{9p_{1}+4t-6p_{1}t-3}{t},% \right.\operatorname*{}{ divide start_ARG 9 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 2 italic_t - 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_t end_ARG start_ARG 2 end_ARG , divide start_ARG 9 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 4 italic_t - 6 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_t - 3 end_ARG start_ARG italic_t end_ARG , start_OPERATOR end_OPERATOR
2โขtโˆ’3โขp1โขt+6t+2}.\displaystyle\ \ \ \ \ \ \left.\frac{2t-3p_{1}t+6}{t+2}\right\}.divide start_ARG 2 italic_t - 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_t + 6 end_ARG start_ARG italic_t + 2 end_ARG } . (12)

The second term is dominated by the maximum between the first and third terms for all p,t๐‘๐‘กp,titalic_p , italic_t in the range. By comparing the first and third terms we obtain p1โˆ—=2โขt2โˆ’123โขt2โˆ’9โขtโˆ’18superscriptsubscript๐‘12superscript๐‘ก2123superscript๐‘ก29๐‘ก18p_{1}^{*}=\frac{2t^{2}-12}{3t^{2}-9t-18}italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT = divide start_ARG 2 italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 12 end_ARG start_ARG 3 italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 9 italic_t - 18 end_ARG giving

ฮฒ๐›ฝ\displaystyle\betaitalic_ฮฒ โ‰คmintโˆˆ[1,2]โก18โˆ’t2+3โขt+6.formulae-sequenceabsentsubscript๐‘ก1218superscript๐‘ก23๐‘ก6absent\displaystyle\leq\min_{t\in[1,2]}\frac{18}{-t^{2}+3t+6}.\operatorname*{}โ‰ค roman_min start_POSTSUBSCRIPT italic_t โˆˆ [ 1 , 2 ] end_POSTSUBSCRIPT divide start_ARG 18 end_ARG start_ARG - italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 3 italic_t + 6 end_ARG . start_OPERATOR end_OPERATOR

The upper bound is minimized at t=3/2๐‘ก32t=3/2italic_t = 3 / 2, implying in turn that p1โˆ—=10/33superscriptsubscript๐‘11033p_{1}^{*}=10/33italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT = 10 / 33. To conclude, the optimized bound is ฮฒโ‰ค2411๐›ฝ2411\beta\leq\frac{24}{11}italic_ฮฒ โ‰ค divide start_ARG 24 end_ARG start_ARG 11 end_ARG and proves ๐‚๐‘2โ‰ฅ1124โ‰ˆ0.458subscript๐‚๐‘211240.458\operatorname*{\mathbf{CR}}_{2}\geq\frac{11}{24}\approx 0.458bold_CR start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT โ‰ฅ divide start_ARG 11 end_ARG start_ARG 24 end_ARG โ‰ˆ 0.458. Note that if the condition of Case 2222 is not satisfied, we necessarily have a greater bound since we have the same optimization as before but the minimum over t๐‘กtitalic_t is constrained.

VI Conclusions

In this work, we study the competitive analysis of AVCs in the rateless setting. Unlike traditional solutions for AVCs, we find that codes using a single input distribution fall short of achieving optimal competitive performance. This emphasizes the necessity of encoding technologies that adapt the input distribution over time, even in the absence of feedback or any knowledge about subsequent channel states. A single-letter expression for the optimal competitive ratio is left open in this work. In particular, an upper bound to the number of input distributions sufficient for optimal competitive codes is subject to future studies.

References

  • [1] D.ย Blackwell, L.ย Breiman, and A.ย J. Thomasian. The capacities of certain channel classes under random coding. The Annals of Mathematical Statistics, pages 558โ€“567, 1960.
  • [2] R.ย Ahlswede. Elimination of correlation in random codes for arbitrarily varying channels. Z. Wahrsch. Verw. Gebiete, 33:159โ€“175, 1978.
  • [3] I.ย Csiszรกr and P.ย Narayan. Arbitrarily varying channels with constrained inputs and states. IEEE Transactions on Information Theory, 34(1):27โ€“34, 1988.
  • [4] I.ย Csiszรกr and P.ย Narayan. Capacity of the Gaussian arbitrarily varying channel. IEEE Transactions on Information Theory, 37(1):18โ€“26, January 1991.
  • [5] I.ย Csiszรกr and P.ย Narayan. The capacity of the arbitrarily varying channel revisited : Positivity, constraints. IEEE Transactions on Information Theory, 34(2):181โ€“193, 1988.
  • [6] M.ย Luby. LT codes. In Proceedings of the 43rd Annual IEEE Symposium on Foundations of Computer Science, pages 271โ€“280, 2002.
  • [7] D.ย J.ย C. MacKay. Fountain codes. IEE Proceedings-Communications, 152(6):1062โ€“1068, 2005.
  • [8] A.ย Shokrollahi. Raptor codes. IEEE Transactions on Information Theory, 52(6):2551โ€“2567, 2006.
  • [9] O.ย Shayevitz and M.ย Feder. Communicating using feedback over a binary channel with arbitrary noise sequence. In IEEE International Symposium on Information Theory (ISIT), pages 1516โ€“1520, 2005.
  • [10] K.ย Eswaran, A.ย D. Sarwate, A.ย Sahai, and M.ย Gastpar. Using zero-rate feedback on binary additive channels with individual noise sequences. In IEEE International Symposium on Information Theory (ISIT), pages 1431โ€“1435, 2007.
  • [11] K.ย Eswaran, A.ย D Sarwate, A.ย Sahai, and M.ย C. Gastpar. Zero-rate feedback can achieve the empirical capacity. IEEE Transactions on Information Theory, 56(1):25โ€“39, 2009.
  • [12] O.ย Shayevitz and M.ย Feder. Achieving the empirical capacity using feedback: Memoryless additive models. IEEE Transactions on Information Theory, 55(3):1269โ€“1295, 2009.
  • [13] K.ย Woyach, K.ย Harrison, G.ย Ranade, and A.ย Sahai. Comments on unknown channels. In Information Theory Workshop, pages 172โ€“176. IEEE, 2012.
  • [14] Y.ย Lomnitz and M.ย Feder. Communication over individual channels. IEEE Transactions on Information Theory, 57(11):7333โ€“7358, 2011.
  • [15] Y.ย Lomnitz and M.ย Feder. Communication over individual channelsโ€“a general framework. arXiv preprint, arXiv:1203.1406, 2012.
  • [16] N.ย Blits. Rateless codes for finite message set. M.Sc. dissertation, Tel-Aviv University, 2012.
  • [17] Y.ย Lomnitz and M.ย Feder. Universal communication over arbitrarily varying channels. IEEE Transactions on Information Theory, 59(6):3720โ€“3752, 2013.
  • [18] P.ย Joshi, A.ย Purkayastha, Y.ย Zhang, A.ย J. Budkuley, and S.ย Jaggi. On the Capacity of Additive AVCs with Feedback. In IEEE International Symposium on Information Theory (ISIT), pages 504โ€“509, 2022.
  • [19] S.ย C. Draper, F.ย R. Kschischang, and F.ย Brendan. Rateless coding for arbitrary channel mixtures with decoder channel state information. IEEE Transactions on Information Theory, 55(9):4119โ€“4133, 2009.
  • [20] A.ย D. Sarwate. Robust and adaptive communication under uncertain interference. PhD thesis, University of California, Berkeley, 2008.
  • [21] A.ย D. Sarwate and M.ย Gastpar. Rateless codes for AVC models. IEEE Transactions on Information Theory, 56(7):3105โ€“3114, 2010.
  • [22] M.ย V. Burnashev. Data transmission over a discrete channel with feedback. Random transmission time. Probl. Inf. Transm., 12(4):250โ€“โ€“265, 1976.
  • [23] Y.ย Hu, Y.ย Zhu, M.ย C. Gursoy, and A.ย Schmeink. SWIPT-enabled relaying in IoT networks operating with finite blocklength codes. IEEE Journal on Selected Areas in Communications, 37(1):74โ€“88, 2018.
  • [24] Y.ย Hu, Y.ย Li, M.ย Gursoy, S.ย Velipasalar, and A.ย Schmeink. Throughput analysis of low-latency IoT systems with QoS constraints and finite blocklength codes. IEEE Transactions on Vehicular Technology, 69(3):3093โ€“3104, 2020.
  • [25] F.ย Ghanami, G.ย A. Hodtani, B.ย Vucetic, and M.ย Shirvanimoghaddam. Performance analysis and optimization of NOMA with HARQ for short packet communications in massive IoT. IEEE Internet of Things Journal, 8(6):4736โ€“4748, 2020.
  • [26] N.ย Agrawal, A.ย Bansal, K.ย Singh, C.P. Li, and S.ย Mumtaz. Finite block length analysis of RIS-assisted UAV-based multiuser IoT communication system with non-linear EH. IEEE Transactions on Communications, 70(5):3542โ€“3557, 2022.
  • [27] P.ย Popovski, J.ย J. Nielsen, C.ย Stefanovic, E.ย Deย Carvalho, E.ย Strom, K.ย F. Trillingsgaard, A.ย S. Bana, D.ย M. Kim, R.ย Kotaba, J.ย Park, and R.ย B. Sorensen. Wireless access for ultra-reliable low-latency communication: Principles and building blocks. IEEE Network, 32(2):16โ€“23, 2018.
  • [28] C.ย She, C.ย Pan, T.ย Q. Duong, T.ย Q.ย S. Quek, R.ย Schober, M.ย Simsek, and P.ย Zhu. Guest Editorial xURLLC in 6G: Next Generation Ultra-Reliable and Low-Latency Communications. IEEE Journal on Selected Areas in Communications, 41(7):1963โ€“1968, 2023.
  • [29] N.ย H. Mahmood, I.ย Atzeni, E.ย A. Jorswieck, and O.ย L.ย A. Lรณpez. Ultra-Reliable Low-Latency Communications: Foundations, Enablers, System Design, and Evolution Towards 6G. Foundations and Trendsยฎ in Communications and Information Theory, 20(5-6):512โ€“747, 2023.
  • [30] M.ย Langberg and O.ย Sabag. Competitive channel-capacity. IEEE Transactions on Information Theory, 2024.
  • [31] A.ย Lapidoth and P.ย Narayan. Reliable communication under channel uncertainty. IEEE Transactions on Information Theory, 44(10):2148โ€“2177, 1998.
  • [32] O.ย Kosut and J.ย Kliewer. Finite blocklength and dispersion bounds for the arbitrarily-varying channel. In IEEE International Symposium on Information Theory (ISIT), pages 2007โ€“2011, 2018.
  • [33] N.ย Shulman and M.ย Feder. Static broadcasting. In IEEE International Symposium on Information Theory, pageย 23, 2000.
  • [34] N.ย Shulman. Communication over an unknown channel via common broadcasting. Ph.D. dissertation, Tel Aviv University, 2003.
  • [35] C.ย Shannon. A mathematical theory of communication. Bell System Technical Journal, 27(3):379โ€“423, 623โ€“656, July 1948.
  • [36] I.ย Csiszรกr and J.ย Korner. Information Theory: Coding Theorems for Discrete Memoryless Systems, 2nd edition. Akademiai Kiado, New York, NY, 1997.
  • [37] S.ย Z. Stambler. Shannon theorems for a full class of channels with state known at the output. Problems of Information Transmission, 11(4):3โ€“12, (In Russian). 1975.
  • [38] R.ย W. Yeung. Information theory and network coding. Springer Science & Business Media, 2008.

Appendix A Definition of ๐‚๐‘โ„“subscript๐‚๐‘โ„“\operatorname*{\mathbf{CR}}_{\ell}bold_CR start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT

Let ฮด>0๐›ฟ0\delta>0italic_ฮด > 0. Let โ„“โ„“\ellroman_โ„“ be a positive integer. Let ๐’žk,โ„“(ฮด)subscriptsuperscript๐’ž๐›ฟ๐‘˜โ„“{\mathcal{C}}^{(\delta)}_{k,\ell}caligraphic_C start_POSTSUPERSCRIPT ( italic_ฮด ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , roman_โ„“ end_POSTSUBSCRIPT be the set of all rateless codes with message size k๐‘˜kitalic_k that correspond to product distributions ๐ฉ๐ฉ{\bf p}bold_p that are ฮด๐›ฟ\deltaitalic_ฮด close to ๐’ซโ„“subscript๐’ซโ„“{\mathcal{P}}_{\ell}caligraphic_P start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT. Let,

๐‚๐‘(ฮด)(k,ฯต)superscript๐‚๐‘๐›ฟ๐‘˜italic-ฯต\displaystyle{\operatorname*{\mathbf{CR}}}^{(\delta)}(k,\epsilon)bold_CR start_POSTSUPERSCRIPT ( italic_ฮด ) end_POSTSUPERSCRIPT ( italic_k , italic_ฯต ) =sup๐’žkโˆˆ๐’žk,โ„“(ฮด):Peโ‰คฯตinf๐ฌโˆˆ๐’ฎโˆžฯ„k,ฯตโˆ—โข(๐ฌ)ฯ„kโข(๐ฌ).absentsubscriptsupremum:subscript๐’ž๐‘˜subscriptsuperscript๐’ž๐›ฟ๐‘˜โ„“subscript๐‘ƒ๐‘’italic-ฯตsubscriptinfimum๐ฌsuperscript๐’ฎsubscriptsuperscript๐œโˆ—๐‘˜italic-ฯต๐ฌsubscript๐œ๐‘˜๐ฌ\displaystyle=\sup_{\mathcal{C}_{k}\in{\cal C}^{(\delta)}_{k,\ell}:P_{e}\leq% \epsilon}\inf_{\operatorname*{\bf s}\in\mathcal{S}^{\infty}}\frac{\tau^{\ast}_% {k,\epsilon}(\operatorname*{\bf s})}{\tau_{k}(\operatorname*{\bf s})}.\vspace{% -3mm}= roman_sup start_POSTSUBSCRIPT caligraphic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT โˆˆ caligraphic_C start_POSTSUPERSCRIPT ( italic_ฮด ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , roman_โ„“ end_POSTSUBSCRIPT : italic_P start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT โ‰ค italic_ฯต end_POSTSUBSCRIPT roman_inf start_POSTSUBSCRIPT bold_s โˆˆ caligraphic_S start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT end_POSTSUBSCRIPT divide start_ARG italic_ฯ„ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_ฯต end_POSTSUBSCRIPT ( bold_s ) end_ARG start_ARG italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG . (13)

Let,

๐‚๐‘โ„“(ฮด)subscriptsuperscript๐‚๐‘๐›ฟโ„“\displaystyle{\operatorname*{\mathbf{CR}}}^{(\delta)}_{\ell}bold_CR start_POSTSUPERSCRIPT ( italic_ฮด ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT =lim supkโ†’โˆž,ฯตkโ†’0๐‚๐‘(ฮด)(k,ฯตk).absentsubscriptlimit-supremumformulae-sequenceโ†’๐‘˜โ†’subscriptitalic-ฯต๐‘˜0superscript๐‚๐‘๐›ฟ๐‘˜subscriptitalic-ฯต๐‘˜\displaystyle=\limsup_{\begin{subarray}{c}k\to\infty,\epsilon_{k}\to 0\end{% subarray}}{\operatorname*{\mathbf{CR}}}^{(\delta)}(k,\epsilon_{k}).= lim sup start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_k โ†’ โˆž , italic_ฯต start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT โ†’ 0 end_CELL end_ROW end_ARG end_POSTSUBSCRIPT bold_CR start_POSTSUPERSCRIPT ( italic_ฮด ) end_POSTSUPERSCRIPT ( italic_k , italic_ฯต start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) . (14)

Finally, let

๐‚๐‘โ„“subscript๐‚๐‘โ„“\displaystyle{\operatorname*{\mathbf{CR}}}_{\ell}bold_CR start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT =lim infฮดโ†’0๐‚๐‘โ„“(ฮด).absentsubscriptlimit-infimumโ†’๐›ฟ0subscriptsuperscript๐‚๐‘๐›ฟโ„“\displaystyle=\liminf_{\begin{subarray}{c}\delta\to 0\end{subarray}}{% \operatorname*{\mathbf{CR}}}^{(\delta)}_{\ell}.= lim inf start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_ฮด โ†’ 0 end_CELL end_ROW end_ARG end_POSTSUBSCRIPT bold_CR start_POSTSUPERSCRIPT ( italic_ฮด ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT . (15)

Appendix B Achievability (Proof of Theoremย 2)

Let ฮด>0๐›ฟ0\delta>0italic_ฮด > 0. Let k๐‘˜kitalic_k be sufficiently large. Consider any ๐ฉ=p1n1,p2n2,โ€ฆ,pโ„“โˆ’1nโ„“โˆ’1,pโ„“โˆžโˆˆ๐’ซโˆžformulae-sequence๐ฉsuperscriptsubscript๐‘1subscript๐‘›1superscriptsubscript๐‘2subscript๐‘›2โ€ฆsuperscriptsubscript๐‘โ„“1subscript๐‘›โ„“1superscriptsubscript๐‘โ„“superscript๐’ซ{\bf p}=p_{1}^{n_{1}},p_{2}^{n_{2}},\dots,p_{\ell-1}^{n_{\ell-1}},p_{\ell}^{% \infty}\in\operatorname{\mathcal{P}}^{\infty}bold_p = italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , โ€ฆ , italic_p start_POSTSUBSCRIPT roman_โ„“ - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT roman_โ„“ - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , italic_p start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT โˆˆ caligraphic_P start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT in ๐’ซโ„“subscript๐’ซโ„“\operatorname{\mathcal{P}}_{\ell}caligraphic_P start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT. Here, n1,โ€ฆ,nโ„“โˆ’1subscript๐‘›1โ€ฆsubscript๐‘›โ„“1n_{1},\dots,n_{\ell-1}italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , โ€ฆ , italic_n start_POSTSUBSCRIPT roman_โ„“ - 1 end_POSTSUBSCRIPT may depend on k๐‘˜kitalic_k. In what follows, we show that a rateless code designed at random, i.e., in which the i๐‘–iitalic_iโ€™th codewords entries are drawn independently from the i๐‘–iitalic_iโ€™th entry in ๐ฉ๐ฉ\operatorname*{\bf p}bold_p, has (with high probability) competitive ratio at least

(1โˆ’ฮด)โขinf๐ฌโˆˆ๐’ฎโˆž(ฯ„kโˆ—โข(๐ฌ)ฯ„k(๐ฉ)โข(๐ฌ)).1๐›ฟsubscriptinfimum๐ฌsuperscript๐’ฎsubscriptsuperscript๐œ๐‘˜๐ฌsubscriptsuperscript๐œ๐ฉ๐‘˜๐ฌ(1-\delta)\inf_{\operatorname*{\bf s}\in{\mathcal{S}^{\infty}}}\left(\frac{% \tau^{*}_{k}({\bf s})}{\tau^{({\bf p})}_{k}({\bf s})}\right).( 1 - italic_ฮด ) roman_inf start_POSTSUBSCRIPT bold_s โˆˆ caligraphic_S start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( divide start_ARG italic_ฯ„ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG start_ARG italic_ฯ„ start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG ) .

B-A Overview

Before diving into the technical proof, we give a rough overview. In general, in the setting of DSI for finite ๐’ฎ๐’ฎ\operatorname{\mathcal{S}}caligraphic_S, traditional analysis of random code design for memoryless channels [35] shows for any given fixed ๐ฌ๐ฌ{\bf s}bold_s that with overwhelming probability 1โˆ’ฮต๐šŒ๐š˜๐š๐šŽ1subscript๐œ€๐šŒ๐š˜๐š๐šŽ1-\varepsilon_{\tt code}1 - italic_ฮต start_POSTSUBSCRIPT typewriter_code end_POSTSUBSCRIPT over code construction, the resulting code guarantees a vanishing (in k๐‘˜kitalic_k) decoding error once the decoder waits for an appropriate amount of time. The main question at hand is the decay of ฮต๐šŒ๐š˜๐š๐šŽsubscript๐œ€๐šŒ๐š˜๐š๐šŽ\varepsilon_{\tt code}italic_ฮต start_POSTSUBSCRIPT typewriter_code end_POSTSUBSCRIPT as k๐‘˜kitalic_k grows. For several applications, a polynomial or exponential decay of ฮต๐šŒ๐š˜๐š๐šŽsubscript๐œ€๐šŒ๐š˜๐š๐šŽ\varepsilon_{\tt code}italic_ฮต start_POSTSUBSCRIPT typewriter_code end_POSTSUBSCRIPT in k๐‘˜kitalic_k is sufficient. However, in our case the code should have a vanishing decoding error, no matter which ๐ฌ๐ฌ{\operatorname*{\bf s}}bold_s is in use. As the number of possible states is exponential, we will require ฮต๐šŒ๐š˜๐š๐šŽsubscript๐œ€๐šŒ๐š˜๐š๐šŽ\varepsilon_{\tt code}italic_ฮต start_POSTSUBSCRIPT typewriter_code end_POSTSUBSCRIPT to have double-exponential dependence on k๐‘˜kitalic_k. Analysis using double-exponential (or super-exponential) concentration on code design is rather common in the AVC literature. Once established in our setting, combining a few additional ideas with a union bound over all possible ๐ฌ๐ฌ{\operatorname*{\bf s}}bold_s suffices to conclude that a single code achieves the expression for all states sequence. We proceed to establish the stated concentration on ฮต๐šŒ๐š˜๐š๐šŽsubscript๐œ€๐šŒ๐š˜๐š๐šŽ\varepsilon_{\tt code}italic_ฮต start_POSTSUBSCRIPT typewriter_code end_POSTSUBSCRIPT.

Let ๐ฌโˆˆ๐’ฎโˆž๐ฌsuperscript๐’ฎ\operatorname*{\bf s}\in\operatorname{\mathcal{S}}^{\infty}bold_s โˆˆ caligraphic_S start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT be a fixed state vector. We divide ๐ฌ๐ฌ\operatorname*{\bf s}bold_s into consecutive chunks according to the chunks of ๐ฉ๐ฉ\operatorname*{\bf p}bold_p such that ๐ฌ=๐ฌ1,๐ฌ2,โ€ฆ,๐ฌโ„“,๐ฌโ„“+1๐ฌsubscript๐ฌ1subscript๐ฌ2โ€ฆsubscript๐ฌโ„“subscript๐ฌโ„“1\operatorname*{\bf s}=\operatorname*{\bf s}_{1},\operatorname*{\bf s}_{2},% \dots,\operatorname*{\bf s}_{\ell},\operatorname*{\bf s}_{\ell+1}bold_s = bold_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , โ€ฆ , bold_s start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT , bold_s start_POSTSUBSCRIPT roman_โ„“ + 1 end_POSTSUBSCRIPT where for iโˆˆ[โ„“]๐‘–delimited-[]โ„“i\in[\ell]italic_i โˆˆ [ roman_โ„“ ] the length of ๐ฌisubscript๐ฌ๐‘–\operatorname*{\bf s}_{i}bold_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is nisubscript๐‘›๐‘–n_{i}italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Here n1,โ€ฆ,nโ„“โˆ’1subscript๐‘›1โ€ฆsubscript๐‘›โ„“1n_{1},\dots,n_{\ell-1}italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , โ€ฆ , italic_n start_POSTSUBSCRIPT roman_โ„“ - 1 end_POSTSUBSCRIPT are defined by ๐ฉ๐ฉ\operatorname*{\bf p}bold_p, and nโ„“subscript๐‘›โ„“n_{\ell}italic_n start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT will be defined shortly. Let ๐ฉ=๐ฉ1,๐ฉ2,โ€ฆ,๐ฉโ„“,๐ฉโ„“+1๐ฉsubscript๐ฉ1subscript๐ฉ2โ€ฆsubscript๐ฉโ„“subscript๐ฉโ„“1\operatorname*{\bf p}=\operatorname*{\bf p}_{1},\operatorname*{\bf p}_{2},% \dots,\operatorname*{\bf p}_{\ell},\operatorname*{\bf p}_{\ell+1}bold_p = bold_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , โ€ฆ , bold_p start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT , bold_p start_POSTSUBSCRIPT roman_โ„“ + 1 end_POSTSUBSCRIPT be the corresponding chunk decomposition of ๐ฉ๐ฉ\operatorname*{\bf p}bold_p in which, for iโˆˆ[โ„“]๐‘–delimited-[]โ„“i\in[\ell]italic_i โˆˆ [ roman_โ„“ ], ๐ฉi=pinisubscript๐ฉ๐‘–superscriptsubscript๐‘๐‘–subscript๐‘›๐‘–\operatorname*{\bf p}_{i}=p_{i}^{n_{i}}bold_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, and ๐ฉโ„“+1=pโ„“โˆžsubscript๐ฉโ„“1superscriptsubscript๐‘โ„“\operatorname*{\bf p}_{\ell+1}=p_{\ell}^{\infty}bold_p start_POSTSUBSCRIPT roman_โ„“ + 1 end_POSTSUBSCRIPT = italic_p start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT. Let

ฯ„โข(๐ฌ)=(1+ฮด)โขฯ„k(๐ฉ)โข(๐ฌ)=(1+ฮด)โขargโกminฯ„โก{โˆ‘j<ฯ„Isjโข(Xj,Yj)โ‰ฅk},๐œ๐ฌ1๐›ฟsubscriptsuperscript๐œ๐ฉ๐‘˜๐ฌ1๐›ฟsubscript๐œsubscript๐‘—๐œsubscript๐ผsubscript๐‘ ๐‘—subscript๐‘‹๐‘—subscript๐‘Œ๐‘—๐‘˜\tau({\bf s})=(1+\delta)\tau^{({\bf p})}_{k}({\bf s})=(1+\delta)\arg\min_{\tau% }{\{\sum_{j<\tau}I_{{s_{j}}}(X_{j},Y_{j})\geq k\}},italic_ฯ„ ( bold_s ) = ( 1 + italic_ฮด ) italic_ฯ„ start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) = ( 1 + italic_ฮด ) roman_arg roman_min start_POSTSUBSCRIPT italic_ฯ„ end_POSTSUBSCRIPT { โˆ‘ start_POSTSUBSCRIPT italic_j < italic_ฯ„ end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) โ‰ฅ italic_k } ,

where Xjsubscript๐‘‹๐‘—X_{j}italic_X start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is distributed according to the j๐‘—jitalic_jโ€™th entry of ๐ฉ๐ฉ\operatorname*{\bf p}bold_p, and Yjsubscript๐‘Œ๐‘—Y_{j}italic_Y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is the outcome of Wsj(โ‹…|Xj)W_{s_{j}}(\cdot|X_{j})italic_W start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( โ‹… | italic_X start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) where sjsubscript๐‘ ๐‘—s_{j}italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is the j๐‘—jitalic_jโ€™th entry of ๐ฌ๐ฌ\operatorname*{\bf s}bold_s. Assume that ฯ„โข(๐ฌ)>โˆ‘i=1โ„“โˆ’1ni๐œ๐ฌsuperscriptsubscript๐‘–1โ„“1subscript๐‘›๐‘–\tau(\operatorname*{\bf s})>\sum_{i=1}^{\ell-1}n_{i}italic_ฯ„ ( bold_s ) > โˆ‘ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_โ„“ - 1 end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and set nโ„“=ฯ„โข(๐ฌ)โˆ’โˆ‘i=1โ„“โˆ’1nisubscript๐‘›โ„“๐œ๐ฌsuperscriptsubscript๐‘–1โ„“1subscript๐‘›๐‘–n_{\ell}=\tau(\operatorname*{\bf s})-\sum_{i=1}^{\ell-1}n_{i}italic_n start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT = italic_ฯ„ ( bold_s ) - โˆ‘ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_โ„“ - 1 end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Namely, ฯ„โข(๐ฌ)=โˆ‘i=1โ„“ni๐œ๐ฌsuperscriptsubscript๐‘–1โ„“subscript๐‘›๐‘–\tau(\operatorname*{\bf s})=\sum_{i=1}^{\ell}n_{i}italic_ฯ„ ( bold_s ) = โˆ‘ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_โ„“ end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. We remove this assumption later at the end of the proof. We employ the method of types in our decoding according to the definitions in [36]. For iโˆˆ[โ„“]๐‘–delimited-[]โ„“i\in[\ell]italic_i โˆˆ [ roman_โ„“ ], let q๐ฌiโˆˆ๐’ซโก(๐’ฎ)subscript๐‘žsubscript๐ฌ๐‘–๐’ซ๐’ฎq_{\operatorname*{\bf s}_{i}}\in\operatorname{\mathcal{P}}(\operatorname{% \mathcal{S}})italic_q start_POSTSUBSCRIPT bold_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT โˆˆ caligraphic_P ( caligraphic_S ) be the type of ๐ฌisubscript๐ฌ๐‘–\operatorname*{\bf s}_{i}bold_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. In what follows we assume a function f๐‘“fitalic_f such that, for iโˆˆ[โ„“]๐‘–delimited-[]โ„“i\in[\ell]italic_i โˆˆ [ roman_โ„“ ], we have niโ‰ฅfโข(ฮด)โขฯ„โข(๐ฌ)subscript๐‘›๐‘–๐‘“๐›ฟ๐œ๐ฌn_{i}\geq f(\delta)\tau(\operatorname*{\bf s})italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT โ‰ฅ italic_f ( italic_ฮด ) italic_ฯ„ ( bold_s ) and q๐ฌiโข(s)โ‰ฅfโข(ฮด)subscript๐‘žsubscript๐ฌ๐‘–๐‘ ๐‘“๐›ฟq_{\operatorname*{\bf s}_{i}}(s)\geq f(\delta)italic_q start_POSTSUBSCRIPT bold_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_s ) โ‰ฅ italic_f ( italic_ฮด ) for any sโˆˆ๐’ฎ๐‘ ๐’ฎs\in\operatorname{\mathcal{S}}italic_s โˆˆ caligraphic_S. This assumption is also removed at the end of the proof.

To simplify our presentation, for iโˆˆ[โ„“]๐‘–delimited-[]โ„“i\in[\ell]italic_i โˆˆ [ roman_โ„“ ] and sโˆˆ๐’ฎ๐‘ ๐’ฎs\in\operatorname{\mathcal{S}}italic_s โˆˆ caligraphic_S, we further partition each chunk ๐ฌisubscript๐ฌ๐‘–\operatorname*{\bf s}_{i}bold_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT to sub-chunks ๐ฌi,ssubscript๐ฌ๐‘–๐‘ \operatorname*{\bf s}_{i,s}bold_s start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT corresponding to entries in which ๐ฌisubscript๐ฌ๐‘–\operatorname*{\bf s}_{i}bold_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT equals s๐‘ sitalic_s. We assume without loss of generality that, for each sโˆˆ๐’ฎ๐‘ ๐’ฎs\in\operatorname{\mathcal{S}}italic_s โˆˆ caligraphic_S, the sub-chunk ๐ฌi,ssubscript๐ฌ๐‘–๐‘ \operatorname*{\bf s}_{i,s}bold_s start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT consists of consecutive entries of ๐ฌisubscript๐ฌ๐‘–\operatorname*{\bf s}_{i}bold_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. This follows from the fact that the decoder, knowing ๐ฌisubscript๐ฌ๐‘–\operatorname*{\bf s}_{i}bold_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, can reorder the received information accordingly and from the fact that the random encoding rule is consistent over chunks. Thus, we have, for iโˆˆ[โ„“]๐‘–delimited-[]โ„“i\in[\ell]italic_i โˆˆ [ roman_โ„“ ] and ๐’ฎ={s1,โ€ฆ,s|๐’ฎ|}๐’ฎsubscript๐‘ 1โ€ฆsubscript๐‘ ๐’ฎ\operatorname{\mathcal{S}}=\{s_{1},\dots,s_{|\mathcal{S}|}\}caligraphic_S = { italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , โ€ฆ , italic_s start_POSTSUBSCRIPT | caligraphic_S | end_POSTSUBSCRIPT }, that ๐ฌi=๐ฌi,s1,๐ฌi,s2,โ€ฆ,๐ฌi,s|๐’ฎ|subscript๐ฌ๐‘–subscript๐ฌ๐‘–subscript๐‘ 1subscript๐ฌ๐‘–subscript๐‘ 2โ€ฆsubscript๐ฌ๐‘–subscript๐‘ ๐’ฎ\operatorname*{\bf s}_{i}=\operatorname*{\bf s}_{i,s_{1}},\operatorname*{\bf s% }_{i,s_{2}},\dots,\operatorname*{\bf s}_{i,s_{|\mathcal{S}|}}bold_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = bold_s start_POSTSUBSCRIPT italic_i , italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_s start_POSTSUBSCRIPT italic_i , italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , โ€ฆ , bold_s start_POSTSUBSCRIPT italic_i , italic_s start_POSTSUBSCRIPT | caligraphic_S | end_POSTSUBSCRIPT end_POSTSUBSCRIPT.

Let ๐—=X1,X2,โ€ฆ๐—subscript๐‘‹1subscript๐‘‹2โ€ฆ\operatorname{\bf X}=X_{1},X_{2},\dotsbold_X = italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , โ€ฆ be the vector of random variables corresponding to ๐ฉ๐ฉ{\bf p}bold_p in which, for any positive integer j๐‘—jitalic_j, Xjsubscript๐‘‹๐‘—X_{j}italic_X start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is distributed according to the j๐‘—jitalic_jโ€™th entry in ๐ฉ๐ฉ\operatorname*{\bf p}bold_p. We divide ๐—๐—\operatorname{\bf X}bold_X into chunks as well corresponding to the decomposition of ๐ฉ๐ฉ\operatorname*{\bf p}bold_p and ๐ฌ๐ฌ\operatorname*{\bf s}bold_s. Namely, let ๐—=๐—1,๐—2,โ€ฆ,๐—โ„“,๐—โ„“+1๐—subscript๐—1subscript๐—2โ€ฆsubscript๐—โ„“subscript๐—โ„“1\operatorname{\bf X}=\operatorname{\bf X}_{1},\operatorname{\bf X}_{2},\dots,% \operatorname{\bf X}_{\ell},\operatorname{\bf X}_{\ell+1}bold_X = bold_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , โ€ฆ , bold_X start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT , bold_X start_POSTSUBSCRIPT roman_โ„“ + 1 end_POSTSUBSCRIPT where, for iโˆˆ[โ„“]๐‘–delimited-[]โ„“i\in[\ell]italic_i โˆˆ [ roman_โ„“ ], the length of ๐—isubscript๐—๐‘–\operatorname{\bf X}_{i}bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is nisubscript๐‘›๐‘–n_{i}italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Let ๐’ž๐’ž{\cal C}caligraphic_C be a random code as discussed above. Namely, for each message mโˆˆ[2k]๐‘šdelimited-[]superscript2๐‘˜m\in[2^{k}]italic_m โˆˆ [ 2 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ], the codeword corresponding to m๐‘šmitalic_m is independently distributed according to ๐—๐—\operatorname{\bf X}bold_X and will be denoted by ๐ฑโก(m)=๐ฑ1โก(m),๐ฑ2โก(m),โ€ฆ,๐ฑโ„“โก(m),๐ฑโ„“+1โก(m)๐ฑ๐‘šsubscript๐ฑ1๐‘šsubscript๐ฑ2๐‘šโ€ฆsubscript๐ฑโ„“๐‘šsubscript๐ฑโ„“1๐‘š\operatorname{\bf x}(m)=\operatorname{\bf x}_{1}(m),\operatorname{\bf x}_{2}(m% ),\dots,\operatorname{\bf x}_{\ell}(m),\operatorname{\bf x}_{\ell+1}(m)bold_x ( italic_m ) = bold_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_m ) , bold_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_m ) , โ€ฆ , bold_x start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT ( italic_m ) , bold_x start_POSTSUBSCRIPT roman_โ„“ + 1 end_POSTSUBSCRIPT ( italic_m ), where, for iโˆˆ[โ„“]๐‘–delimited-[]โ„“i\in[\ell]italic_i โˆˆ [ roman_โ„“ ], the length of ๐ฑiโก(m)subscript๐ฑ๐‘–๐‘š\operatorname{\bf x}_{i}(m)bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_m ) is nisubscript๐‘›๐‘–n_{i}italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and each entry in ๐ฑiโก(m)subscript๐ฑ๐‘–๐‘š\operatorname{\bf x}_{i}(m)bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_m ) is distributed according to pisubscript๐‘๐‘–p_{i}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Moreover, for iโˆˆ[โ„“]๐‘–delimited-[]โ„“i\in[\ell]italic_i โˆˆ [ roman_โ„“ ], we divide ๐ฑiโก(m)subscript๐ฑ๐‘–๐‘š\operatorname{\bf x}_{i}(m)bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_m ) into sub-chunks according to those of ๐ฌisubscript๐ฌ๐‘–\operatorname*{\bf s}_{i}bold_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Namely, ๐ฑiโก(m)=๐ฑi,s1โก(m),๐ฑi,s2โก(m),โ€ฆ,๐ฑi,s|๐’ฎ|โก(m)subscript๐ฑ๐‘–๐‘šsubscript๐ฑ๐‘–subscript๐‘ 1๐‘šsubscript๐ฑ๐‘–subscript๐‘ 2๐‘šโ€ฆsubscript๐ฑ๐‘–subscript๐‘ ๐’ฎ๐‘š\operatorname{\bf x}_{i}(m)=\operatorname{\bf x}_{i,s_{1}}(m),\operatorname{% \bf x}_{i,s_{2}}(m),\dots,\operatorname{\bf x}_{i,s_{|\mathcal{S}|}}(m)bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_m ) = bold_x start_POSTSUBSCRIPT italic_i , italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_m ) , bold_x start_POSTSUBSCRIPT italic_i , italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_m ) , โ€ฆ , bold_x start_POSTSUBSCRIPT italic_i , italic_s start_POSTSUBSCRIPT | caligraphic_S | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_m ) where, for sโˆˆS๐‘ ๐‘†s\in Sitalic_s โˆˆ italic_S, the entries of ๐ฑi,sโก(m)subscript๐ฑ๐‘–๐‘ ๐‘š\operatorname{\bf x}_{i,s}(m)bold_x start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT ( italic_m ) correspond to those in ๐ฌi,ssubscript๐ฌ๐‘–๐‘ \operatorname*{\bf s}_{i,s}bold_s start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT. Finally, let ๐ฒ=๐ฒ1,๐ฒ2,โ€ฆ,๐ฒโ„“,๐ฒโ„“+1๐ฒsubscript๐ฒ1subscript๐ฒ2โ€ฆsubscript๐ฒโ„“subscript๐ฒโ„“1\operatorname{\bf y}=\operatorname{\bf y}_{1},\operatorname{\bf y}_{2},\dots,% \operatorname{\bf y}_{\ell},\operatorname{\bf y}_{\ell+1}bold_y = bold_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , โ€ฆ , bold_y start_POSTSUBSCRIPT roman_โ„“ end_POSTSUBSCRIPT , bold_y start_POSTSUBSCRIPT roman_โ„“ + 1 end_POSTSUBSCRIPT be the channel output at the receiver, and, for iโˆˆ[โ„“]๐‘–delimited-[]โ„“i\in[\ell]italic_i โˆˆ [ roman_โ„“ ], let ๐ฒi=๐ฒi,s1,๐ฒi,s2,โ€ฆ,๐ฒi,s|๐’ฎ|subscript๐ฒ๐‘–subscript๐ฒ๐‘–subscript๐‘ 1subscript๐ฒ๐‘–subscript๐‘ 2โ€ฆsubscript๐ฒ๐‘–subscript๐‘ ๐’ฎ\operatorname{\bf y}_{i}=\operatorname{\bf y}_{i,s_{1}},\operatorname{\bf y}_{% i,s_{2}},\dots,\operatorname{\bf y}_{i,s_{|\mathcal{S}|}}bold_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = bold_y start_POSTSUBSCRIPT italic_i , italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_y start_POSTSUBSCRIPT italic_i , italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , โ€ฆ , bold_y start_POSTSUBSCRIPT italic_i , italic_s start_POSTSUBSCRIPT | caligraphic_S | end_POSTSUBSCRIPT end_POSTSUBSCRIPT be the chunk and sub-chunk decomposition respectively, both according to the chunk and sub-chunks of ๐ฑโก(m)๐ฑ๐‘š\operatorname{\bf x}(m)bold_x ( italic_m ) and ๐ฌ๐ฌ\operatorname*{\bf s}bold_s. It now holds, for iโˆˆ[โ„“]๐‘–delimited-[]โ„“i\in[\ell]italic_i โˆˆ [ roman_โ„“ ], sโˆˆ๐’ฎ๐‘ ๐’ฎs\in\operatorname{\mathcal{S}}italic_s โˆˆ caligraphic_S, and mโˆˆ[2k]๐‘šdelimited-[]superscript2๐‘˜m\in[2^{k}]italic_m โˆˆ [ 2 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ] that each entry of ๐ฑi,ssubscript๐ฑ๐‘–๐‘ \operatorname{\bf x}_{i,s}bold_x start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT is independently distributed according to pisubscript๐‘๐‘–p_{i}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and that ๐ฒi,ssubscript๐ฒ๐‘–๐‘ \operatorname{\bf y}_{i,s}bold_y start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT is distributed according to Wsni,s(โ‹…|๐ฑi,s)W_{s}^{n_{i,s}}(\cdot|\operatorname{\bf x}_{i,s})italic_W start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( โ‹… | bold_x start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT ). Here, ni,ssubscript๐‘›๐‘–๐‘ n_{i,s}italic_n start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT is the length of the sub-chunk ๐ฒi,ssubscript๐ฒ๐‘–๐‘ \operatorname{\bf y}_{i,s}bold_y start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT. Note that, by our assumption on the type of ๐ฌisubscript๐ฌ๐‘–\operatorname*{\bf s}_{i}bold_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and on the chunk size nisubscript๐‘›๐‘–n_{i}italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, it holds that ni,sโ‰ฅf2โข(ฮด)โขฯ„โข(๐ฌ)subscript๐‘›๐‘–๐‘ superscript๐‘“2๐›ฟ๐œ๐ฌn_{i,s}\geq f^{2}(\delta)\tau(\operatorname*{\bf s})italic_n start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT โ‰ฅ italic_f start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_ฮด ) italic_ฯ„ ( bold_s ). Namely, for the sub-chunk corresponding to (i,s)๐‘–๐‘ (i,s)( italic_i , italic_s ) our analysis reduces to the performance of the fixed memoryless channel Wssubscript๐‘Š๐‘ W_{s}italic_W start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT of blocklength ni,ssubscript๐‘›๐‘–๐‘ n_{i,s}italic_n start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT in the presence of a code generated randomly according to the fixed distribution pisubscript๐‘๐‘–p_{i}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Let Imin=mini,sโกIsโข(X,Y)subscript๐ผsubscript๐‘–๐‘ subscript๐ผ๐‘ ๐‘‹๐‘ŒI_{\min}=\min_{i,s}I_{s}(X,Y)italic_I start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = roman_min start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_X , italic_Y ) and Imax=maxi,sโกIsโข(X,Y)subscript๐ผsubscript๐‘–๐‘ subscript๐ผ๐‘ ๐‘‹๐‘ŒI_{\max}=\max_{i,s}I_{s}(X,Y)italic_I start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT = roman_max start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_X , italic_Y ) for (X,Y)๐‘‹๐‘Œ(X,Y)( italic_X , italic_Y ) distributed according to piโข(x)โขWsโข(y|x)subscript๐‘๐‘–๐‘ฅsubscript๐‘Š๐‘ conditional๐‘ฆ๐‘ฅp_{i}(x)W_{s}(y|x)italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x ) italic_W start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_y | italic_x ). Recall from the theorem statement that Imin>0subscript๐ผ0I_{\min}>0italic_I start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT > 0. Notice, by our definition of ฯ„k(๐ฉ)โข(๐ฌ)subscriptsuperscript๐œ๐ฉ๐‘˜๐ฌ\tau^{({\bf p})}_{k}({\bf s})italic_ฯ„ start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ), Iminsubscript๐ผI_{\min}italic_I start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT and Imaxsubscript๐ผI_{\max}italic_I start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT, that kIminโ‰ฅฯ„k(๐ฉ)โข(๐ฌ)โ‰ฅkImax๐‘˜subscript๐ผsubscriptsuperscript๐œ๐ฉ๐‘˜๐ฌ๐‘˜subscript๐ผ\frac{k}{I_{\min}}\geq\tau^{({\bf p})}_{k}({\bf s})\geq\frac{k}{I_{\max}}divide start_ARG italic_k end_ARG start_ARG italic_I start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_ARG โ‰ฅ italic_ฯ„ start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) โ‰ฅ divide start_ARG italic_k end_ARG start_ARG italic_I start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_ARG. We are now ready to define our encoder and decoder.

Encoding: Let ๐’ž๐’ž{\cal C}caligraphic_C with encoder E:[2k]โ†’๐’ณโˆž:๐ธโ†’delimited-[]superscript2๐‘˜superscript๐’ณE:[2^{k}]\rightarrow\operatorname{\mathcal{X}}^{\infty}italic_E : [ 2 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ] โ†’ caligraphic_X start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT be the randomly constructed code as defined above. The encoder picks a uniform mโˆˆ[2k]๐‘šdelimited-[]superscript2๐‘˜m\in[2^{k}]italic_m โˆˆ [ 2 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ] and transmits ๐ฑโก(m)๐ฑ๐‘š\operatorname{\bf x}(m)bold_x ( italic_m ). We show below, for sufficiently large k๐‘˜kitalic_k, that with high probability the resulting code has marginals that are ฮด๐›ฟ\deltaitalic_ฮด-close to ๐ฉ๐ฉ{\bf p}bold_p. Let pisuperscript๐‘๐‘–p^{i}italic_p start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT be the i๐‘–iitalic_iโ€™th entry of ๐ฉ๐ฉ{\bf p}bold_p (we use a superscript here to distinguish with pisubscript๐‘๐‘–p_{i}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT which governs the i๐‘–iitalic_iโ€™th chunk of ๐ฉ๐ฉ{\bf p}bold_p), and let qisuperscript๐‘ž๐‘–q^{i}italic_q start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT be the marginal distribution of the i๐‘–iitalic_iโ€™th entry of the code E๐ธEitalic_E. Using Sanovโ€™s theorem and Pinskerโ€™s inequality, for any entry i๐‘–iitalic_i in the codebook it holds with probability at most 2โˆ’ฮฉโข(2kโขฮด2)superscript2ฮฉsuperscript2๐‘˜superscript๐›ฟ22^{-\Omega{(2^{k}}\delta^{2})}2 start_POSTSUPERSCRIPT - roman_ฮฉ ( 2 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_ฮด start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT that ๐šƒ๐š…โข(pi,qi)>ฮด/2๐šƒ๐š…superscript๐‘๐‘–superscript๐‘ž๐‘–๐›ฟ2{\tt TV}(p^{i},q^{i})>\delta/2typewriter_TV ( italic_p start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT , italic_q start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ) > italic_ฮด / 2. Thus, using standard concentration bounds and the independence between entries of ๐’ž๐’ž{\cal C}caligraphic_C, it is not hard to verify that โˆ‘n=1โˆžPrโก[โˆ‘i=1n๐šƒ๐š…โข(pi,qi)>ฮดโขn]โ‰ค2โˆ’ฮฉโข(k)superscriptsubscript๐‘›1Prsuperscriptsubscript๐‘–1๐‘›๐šƒ๐š…superscript๐‘๐‘–superscript๐‘ž๐‘–๐›ฟ๐‘›superscript2ฮฉ๐‘˜\sum_{n=1}^{\infty}\Pr[\sum_{i=1}^{n}{\tt TV}(p^{i},q^{i})>\delta n]\leq 2^{-% \Omega(k)}โˆ‘ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT roman_Pr [ โˆ‘ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT typewriter_TV ( italic_p start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT , italic_q start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ) > italic_ฮด italic_n ] โ‰ค 2 start_POSTSUPERSCRIPT - roman_ฮฉ ( italic_k ) end_POSTSUPERSCRIPT. In both bounds above, we use k๐‘˜kitalic_k sufficiently large. This implies that, for sufficiently large k๐‘˜kitalic_k, with probability at least 1โˆ’2โˆ’ฮฉโข(k)1superscript2ฮฉ๐‘˜1-2^{-\Omega(k)}1 - 2 start_POSTSUPERSCRIPT - roman_ฮฉ ( italic_k ) end_POSTSUPERSCRIPT over code-design, the resulting code is has corresponding marginal distributions that are ฮด๐›ฟ\deltaitalic_ฮด-close to ๐ฉ๐ฉ{\bf p}bold_p.

Decoding: Let ๐ฒ๐ฒ\operatorname{\bf y}bold_y be the received transmission and ๐ฌ๐ฌ\operatorname*{\bf s}bold_s the state sequence (that is available to the decoder). Decoding follows standard typicality decoding outlined in the context of AVCโ€™s with DSI in, e.g., [37] and Exercise 12.16(b) of [36]. Let gโข(ฮด)๐‘”๐›ฟg(\delta)italic_g ( italic_ฮด ) be a function of ฮด๐›ฟ\deltaitalic_ฮด to be defined later. Let ฯ„โข(๐ฌ)=โˆ‘i=1โ„“ni=โˆ‘i,sni,s๐œ๐ฌsuperscriptsubscript๐‘–1โ„“subscript๐‘›๐‘–subscript๐‘–๐‘ subscript๐‘›๐‘–๐‘ \tau(\operatorname*{\bf s})=\sum_{i=1}^{\ell}n_{i}=\sum_{i,s}n_{i,s}italic_ฯ„ ( bold_s ) = โˆ‘ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_โ„“ end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = โˆ‘ start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT. The decoder iterates over m^^๐‘š\hat{m}over^ start_ARG italic_m end_ARG between 1111 and 2ksuperscript2๐‘˜2^{k}2 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT, and decodes ๐ฒฯ„superscript๐ฒ๐œ\operatorname{\bf y}^{\tau}bold_y start_POSTSUPERSCRIPT italic_ฯ„ end_POSTSUPERSCRIPT to the first m^^๐‘š\hat{m}over^ start_ARG italic_m end_ARG such that for all iโˆˆ[โ„“]๐‘–delimited-[]โ„“i\in[\ell]italic_i โˆˆ [ roman_โ„“ ] and sโˆˆ๐’ฎ๐‘ ๐’ฎs\in\operatorname{\mathcal{S}}italic_s โˆˆ caligraphic_S the joint type q๐ฑi,sโก(m^),๐ฒi,sโˆˆ๐’ซโก(๐’ณ)ร—๐’ซโก(๐’ด)subscript๐‘žsubscript๐ฑ๐‘–๐‘ ^๐‘šsubscript๐ฒ๐‘–๐‘ ๐’ซ๐’ณ๐’ซ๐’ดq_{\operatorname{\bf x}_{i,s}(\hat{m}),\operatorname{\bf y}_{i,s}}\in% \operatorname{\mathcal{P}}(\operatorname{\mathcal{X}})\times\operatorname{% \mathcal{P}}(\operatorname{\mathcal{Y}})italic_q start_POSTSUBSCRIPT bold_x start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT ( over^ start_ARG italic_m end_ARG ) , bold_y start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT โˆˆ caligraphic_P ( caligraphic_X ) ร— caligraphic_P ( caligraphic_Y ) of the pair (๐ฑi,sโก(m^),๐ฒi,s)subscript๐ฑ๐‘–๐‘ ^๐‘šsubscript๐ฒ๐‘–๐‘ (\operatorname{\bf x}_{i,s}(\hat{m}),\operatorname{\bf y}_{i,s})( bold_x start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT ( over^ start_ARG italic_m end_ARG ) , bold_y start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT ) is of โ„“โˆžsubscriptโ„“\ell_{\infty}roman_โ„“ start_POSTSUBSCRIPT โˆž end_POSTSUBSCRIPT distance at most gโข(ฮด)๐‘”๐›ฟg(\delta)italic_g ( italic_ฮด ) from the distribution qi,sโข(x,y)=piโข(x)โขWsโข(y|x)subscript๐‘ž๐‘–๐‘ ๐‘ฅ๐‘ฆsubscript๐‘๐‘–๐‘ฅsubscript๐‘Š๐‘ conditional๐‘ฆ๐‘ฅq_{i,s}(x,y)=p_{i}(x)W_{s}(y|x)italic_q start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT ( italic_x , italic_y ) = italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x ) italic_W start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_y | italic_x ) over ๐’ณร—๐’ด๐’ณ๐’ด\operatorname{\mathcal{X}}\times\operatorname{\mathcal{Y}}caligraphic_X ร— caligraphic_Y. If no message m๐‘šmitalic_m passes the test above, the decoder decodes arbitrarily to m=1๐‘š1m=1italic_m = 1.

We now prove the following concentration on ๐’ž๐’ž{\cal C}caligraphic_C for decoding with average error ฮด๐›ฟ\deltaitalic_ฮด.

Claim 1

Let ฮด>0๐›ฟ0\delta>0italic_ฮด > 0. Let iโˆˆ[โ„“]๐‘–delimited-[]โ„“i\in[\ell]italic_i โˆˆ [ roman_โ„“ ] and sโˆˆ๐’ฎ๐‘ ๐’ฎs\in\operatorname{\mathcal{S}}italic_s โˆˆ caligraphic_S. With probability 2โˆ’ฮฉโข(2k/k)superscript2ฮฉsuperscript2๐‘˜๐‘˜2^{-\Omega(2^{k}/k)}2 start_POSTSUPERSCRIPT - roman_ฮฉ ( 2 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT / italic_k ) end_POSTSUPERSCRIPT over code design, it holds that

PrmsubscriptPr๐‘š\displaystyle\Pr_{m}roman_Pr start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT [โˆƒ(i,s)โˆˆ[โ„“]ร—๐’ฎs.t.โˆฅq๐ฑi,sโก(m),๐ฒi,sโˆ’qi,sโˆฅโˆž>g(ฮด),\displaystyle[\exists(i,s)\in[\ell]\times\operatorname{\mathcal{S}}\ \text{s.t% .}\ \|q_{\operatorname{\bf x}_{i,s}(m),\operatorname{\bf y}_{i,s}}-q_{i,s}\|_{% \infty}>g(\delta),[ โˆƒ ( italic_i , italic_s ) โˆˆ [ roman_โ„“ ] ร— caligraphic_S s.t. โˆฅ italic_q start_POSTSUBSCRIPT bold_x start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT ( italic_m ) , bold_y start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_q start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT โˆฅ start_POSTSUBSCRIPT โˆž end_POSTSUBSCRIPT > italic_g ( italic_ฮด ) ,
orโˆƒmโ€ฒ<m,โˆ€(i,s),โˆฅq๐ฑi,sโก(mโ€ฒ),๐ฒi,sโˆ’qi,sโˆฅโˆžโ‰คg(ฮด)]โ‰ค1k\displaystyle\ \ \ \ \text{or}\ \ \exists m^{\prime}<m,\forall(i,s),\|q_{% \operatorname{\bf x}_{i,s}(m^{\prime}),\operatorname{\bf y}_{i,s}}-q_{i,s}\|_{% \infty}\leq g(\delta)]\leq\frac{1}{k}or โˆƒ italic_m start_POSTSUPERSCRIPT โ€ฒ end_POSTSUPERSCRIPT < italic_m , โˆ€ ( italic_i , italic_s ) , โˆฅ italic_q start_POSTSUBSCRIPT bold_x start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT ( italic_m start_POSTSUPERSCRIPT โ€ฒ end_POSTSUPERSCRIPT ) , bold_y start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_q start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT โˆฅ start_POSTSUBSCRIPT โˆž end_POSTSUBSCRIPT โ‰ค italic_g ( italic_ฮด ) ] โ‰ค divide start_ARG 1 end_ARG start_ARG italic_k end_ARG
Proof:

For mโˆˆ[2k]๐‘šdelimited-[]superscript2๐‘˜m\in[2^{k}]italic_m โˆˆ [ 2 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ], let Zโข(m)๐‘๐‘šZ(m)italic_Z ( italic_m ) be the indicator of the error event that either โˆƒ(i,s)๐‘–๐‘ \exists(i,s)โˆƒ ( italic_i , italic_s ) such that โ€–q๐ฑi,sโก(m),๐ฒi,sโˆ’qi,sโ€–โˆž>gโข(ฮด)subscriptnormsubscript๐‘žsubscript๐ฑ๐‘–๐‘ ๐‘šsubscript๐ฒ๐‘–๐‘ subscript๐‘ž๐‘–๐‘ ๐‘”๐›ฟ\|q_{\operatorname{\bf x}_{i,s}(m),\operatorname{\bf y}_{i,s}}-q_{i,s}\|_{% \infty}>g(\delta)โˆฅ italic_q start_POSTSUBSCRIPT bold_x start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT ( italic_m ) , bold_y start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_q start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT โˆฅ start_POSTSUBSCRIPT โˆž end_POSTSUBSCRIPT > italic_g ( italic_ฮด ) or โˆƒmโ€ฒ<msuperscript๐‘šโ€ฒ๐‘š\exists m^{\prime}<mโˆƒ italic_m start_POSTSUPERSCRIPT โ€ฒ end_POSTSUPERSCRIPT < italic_m such that โˆ€(i,s),โ€–q๐ฑi,sโก(mโ€ฒ),๐ฒi,sโˆ’qi,sโ€–โˆžโ‰คgโข(ฮด)for-all๐‘–๐‘ subscriptnormsubscript๐‘žsubscript๐ฑ๐‘–๐‘ superscript๐‘šโ€ฒsubscript๐ฒ๐‘–๐‘ subscript๐‘ž๐‘–๐‘ ๐‘”๐›ฟ\forall(i,s),\|q_{\operatorname{\bf x}_{i,s}(m^{\prime}),\operatorname{\bf y}_% {i,s}}-q_{i,s}\|_{\infty}\leq g(\delta)โˆ€ ( italic_i , italic_s ) , โˆฅ italic_q start_POSTSUBSCRIPT bold_x start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT ( italic_m start_POSTSUPERSCRIPT โ€ฒ end_POSTSUPERSCRIPT ) , bold_y start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_q start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT โˆฅ start_POSTSUBSCRIPT โˆž end_POSTSUBSCRIPT โ‰ค italic_g ( italic_ฮด ). Let ๐’ž(mโˆ’1)superscript๐’ž๐‘š1{\cal C}^{(m-1)}caligraphic_C start_POSTSUPERSCRIPT ( italic_m - 1 ) end_POSTSUPERSCRIPT be any fixed values for the codewords ๐ฑโก(1),โ€ฆ,๐ฑโก(mโˆ’1)๐ฑ1โ€ฆ๐ฑ๐‘š1\operatorname{\bf x}(1),\dots,\operatorname{\bf x}(m-1)bold_x ( 1 ) , โ€ฆ , bold_x ( italic_m - 1 ). We first analyze the expected value of Zโข(m)๐‘๐‘šZ(m)italic_Z ( italic_m ) conditioned on ๐’ž(mโˆ’1)superscript๐’ž๐‘š1{\cal C}^{(m-1)}caligraphic_C start_POSTSUPERSCRIPT ( italic_m - 1 ) end_POSTSUPERSCRIPT.

Standard analysis, appearing for example in Chapter 7 of [38], implies the existence of g^โข(ฮด)^๐‘”๐›ฟ\hat{g}(\delta)over^ start_ARG italic_g end_ARG ( italic_ฮด ) that tends to zero when gโข(ฮด)๐‘”๐›ฟg(\delta)italic_g ( italic_ฮด ) tends to zero, such that for m๐‘šmitalic_m and any (i,s)๐‘–๐‘ (i,s)( italic_i , italic_s ),

Pr๐’žโก[โ€–q๐ฑi,sโก(m),๐ฒi,sโˆ’qi,sโ€–โˆž>gโข(ฮด)โˆฃ๐’ž(mโˆ’1)]โ‰ค2โˆ’ni,sโขg^โข(ฮด).subscriptPr๐’žsubscriptnormsubscript๐‘žsubscript๐ฑ๐‘–๐‘ ๐‘šsubscript๐ฒ๐‘–๐‘ subscript๐‘ž๐‘–๐‘ conditional๐‘”๐›ฟsuperscript๐’ž๐‘š1superscript2subscript๐‘›๐‘–๐‘ ^๐‘”๐›ฟ\Pr_{\mathcal{C}}[\|q_{\operatorname{\bf x}_{i,s}(m),\operatorname{\bf y}_{i,s% }}-q_{i,s}\|_{\infty}>g(\delta)\mid{\cal C}^{(m-1)}]\leq 2^{-n_{i,s}\hat{g}(% \delta)}.roman_Pr start_POSTSUBSCRIPT caligraphic_C end_POSTSUBSCRIPT [ โˆฅ italic_q start_POSTSUBSCRIPT bold_x start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT ( italic_m ) , bold_y start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_q start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT โˆฅ start_POSTSUBSCRIPT โˆž end_POSTSUBSCRIPT > italic_g ( italic_ฮด ) โˆฃ caligraphic_C start_POSTSUPERSCRIPT ( italic_m - 1 ) end_POSTSUPERSCRIPT ] โ‰ค 2 start_POSTSUPERSCRIPT - italic_n start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT over^ start_ARG italic_g end_ARG ( italic_ฮด ) end_POSTSUPERSCRIPT .

and for any mโ€ฒ<msuperscript๐‘šโ€ฒ๐‘šm^{\prime}<mitalic_m start_POSTSUPERSCRIPT โ€ฒ end_POSTSUPERSCRIPT < italic_m and any (i,s)๐‘–๐‘ (i,s)( italic_i , italic_s ),

Pr๐’žโก[โ€–q๐ฑi,sโก(mโ€ฒ),๐ฒi,sโˆ’qi,sโ€–โˆžโ‰คgโข(ฮด)โˆฃ๐’ž(mโˆ’1)]โ‰ค2โˆ’ni,sโข(Iโข(Xi;Ys)โˆ’g^โข(ฮด)).subscriptPr๐’žsubscriptnormsubscript๐‘žsubscript๐ฑ๐‘–๐‘ superscript๐‘šโ€ฒsubscript๐ฒ๐‘–๐‘ subscript๐‘ž๐‘–๐‘ conditional๐‘”๐›ฟsuperscript๐’ž๐‘š1superscript2subscript๐‘›๐‘–๐‘ ๐ผsubscript๐‘‹๐‘–subscript๐‘Œ๐‘ ^๐‘”๐›ฟ\Pr_{\cal C}[\|q_{\operatorname{\bf x}_{i,s}(m^{\prime}),\operatorname{\bf y}_% {i,s}}-q_{i,s}\|_{\infty}\leq g(\delta)\mid{\cal C}^{(m-1)}]\leq 2^{-n_{i,s}(I% (X_{i};Y_{s})-\hat{g}(\delta))}.roman_Pr start_POSTSUBSCRIPT caligraphic_C end_POSTSUBSCRIPT [ โˆฅ italic_q start_POSTSUBSCRIPT bold_x start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT ( italic_m start_POSTSUPERSCRIPT โ€ฒ end_POSTSUPERSCRIPT ) , bold_y start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_q start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT โˆฅ start_POSTSUBSCRIPT โˆž end_POSTSUBSCRIPT โ‰ค italic_g ( italic_ฮด ) โˆฃ caligraphic_C start_POSTSUPERSCRIPT ( italic_m - 1 ) end_POSTSUPERSCRIPT ] โ‰ค 2 start_POSTSUPERSCRIPT - italic_n start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT ( italic_I ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_Y start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) - over^ start_ARG italic_g end_ARG ( italic_ฮด ) ) end_POSTSUPERSCRIPT .

Here (Xi,Ys)subscript๐‘‹๐‘–subscript๐‘Œ๐‘ (X_{i},Y_{s})( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) are distributed according to piโข(x)โขWsโข(y|x)subscript๐‘๐‘–๐‘ฅsubscript๐‘Š๐‘ conditional๐‘ฆ๐‘ฅp_{i}(x)W_{s}(y|x)italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x ) italic_W start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_y | italic_x ). As the events in every sub-chunk are independent, we have for any mโ€ฒ<msuperscript๐‘šโ€ฒ๐‘šm^{\prime}<mitalic_m start_POSTSUPERSCRIPT โ€ฒ end_POSTSUPERSCRIPT < italic_m that,

Pr๐’ž[โˆ€(i,s),โˆฅq๐ฑi,sโก(mโ€ฒ),๐ฒi,sโˆ’\displaystyle\Pr_{\cal C}[\forall(i,s),\|q_{\operatorname{\bf x}_{i,s}(m^{% \prime}),\operatorname{\bf y}_{i,s}}-roman_Pr start_POSTSUBSCRIPT caligraphic_C end_POSTSUBSCRIPT [ โˆ€ ( italic_i , italic_s ) , โˆฅ italic_q start_POSTSUBSCRIPT bold_x start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT ( italic_m start_POSTSUPERSCRIPT โ€ฒ end_POSTSUPERSCRIPT ) , bold_y start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT - qi,sโˆฅโˆžโ‰คg(ฮด)โˆฃ๐’ž(mโˆ’1)]\displaystyle q_{i,s}\|_{\infty}\leq g(\delta)\mid{\cal C}^{(m-1)}]italic_q start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT โˆฅ start_POSTSUBSCRIPT โˆž end_POSTSUBSCRIPT โ‰ค italic_g ( italic_ฮด ) โˆฃ caligraphic_C start_POSTSUPERSCRIPT ( italic_m - 1 ) end_POSTSUPERSCRIPT ]
โ‰ค2โˆ’โˆ‘i,sni,sโข(Iโข(Xi;Ys)โˆ’g^โข(ฮด)).absentsuperscript2subscript๐‘–๐‘ subscript๐‘›๐‘–๐‘ ๐ผsubscript๐‘‹๐‘–subscript๐‘Œ๐‘ ^๐‘”๐›ฟ\displaystyle\leq 2^{-\sum_{i,s}n_{i,s}(I(X_{i};Y_{s})-\hat{g}(\delta))}.โ‰ค 2 start_POSTSUPERSCRIPT - โˆ‘ start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT ( italic_I ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_Y start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) - over^ start_ARG italic_g end_ARG ( italic_ฮด ) ) end_POSTSUPERSCRIPT .

Thus, by the union bound (over mโ€ฒ<msuperscript๐‘šโ€ฒ๐‘šm^{\prime}<mitalic_m start_POSTSUPERSCRIPT โ€ฒ end_POSTSUPERSCRIPT < italic_m and more),

Pr๐’ž[\displaystyle\Pr_{\cal C}[roman_Pr start_POSTSUBSCRIPT caligraphic_C end_POSTSUBSCRIPT [ Z(m)=1โˆฃ๐’ž(mโˆ’1)]\displaystyle Z(m)=1\mid{\cal C}^{(m-1)}]italic_Z ( italic_m ) = 1 โˆฃ caligraphic_C start_POSTSUPERSCRIPT ( italic_m - 1 ) end_POSTSUPERSCRIPT ]
โ‰ค2kโข2โˆ’โˆ‘i,sni,sโข(Iโข(Xi;Ys)โˆ’g^โข(ฮด))+โˆ‘i,s2โˆ’g^โข(ฮด)โขmini,sโกni,s.absentsuperscript2๐‘˜superscript2subscript๐‘–๐‘ subscript๐‘›๐‘–๐‘ ๐ผsubscript๐‘‹๐‘–subscript๐‘Œ๐‘ ^๐‘”๐›ฟsubscript๐‘–๐‘ superscript2^๐‘”๐›ฟsubscript๐‘–๐‘ subscript๐‘›๐‘–๐‘ \displaystyle\leq 2^{k}2^{-\sum_{i,s}n_{i,s}(I(X_{i};Y_{s})-\hat{g}(\delta))}+% \sum_{i,s}2^{-\hat{g}(\delta)\min_{i,s}n_{i,s}}.โ‰ค 2 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT 2 start_POSTSUPERSCRIPT - โˆ‘ start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT ( italic_I ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_Y start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) - over^ start_ARG italic_g end_ARG ( italic_ฮด ) ) end_POSTSUPERSCRIPT + โˆ‘ start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT - over^ start_ARG italic_g end_ARG ( italic_ฮด ) roman_min start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT end_POSTSUPERSCRIPT .

As, ฯ„โข(๐ฌ)=โˆ‘i,sni,s=(1+ฮด)โขฯ„k(๐ฉ)โข(๐ฌ)๐œ๐ฌsubscript๐‘–๐‘ subscript๐‘›๐‘–๐‘ 1๐›ฟsuperscriptsubscript๐œ๐‘˜๐ฉ๐ฌ\tau(\operatorname*{\bf s})=\sum_{i,s}n_{i,s}=(1+\delta)\tau_{{k}}^{(% \operatorname*{\bf p})}(\operatorname*{\bf s})italic_ฯ„ ( bold_s ) = โˆ‘ start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT = ( 1 + italic_ฮด ) italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT ( bold_s ) it holds that โˆ‘i,sni,sโขIโข(Xi;Ys)โ‰ฅk+ฮดโขฯ„k(๐ฉ)โข(๐ฌ)โขIminโ‰ฅkโข(1+ฮดโขIminImax)subscript๐‘–๐‘ subscript๐‘›๐‘–๐‘ ๐ผsubscript๐‘‹๐‘–subscript๐‘Œ๐‘ ๐‘˜๐›ฟsuperscriptsubscript๐œ๐‘˜๐ฉ๐ฌsubscript๐ผ๐‘˜1๐›ฟsubscript๐ผsubscript๐ผ\sum_{i,s}n_{i,s}I(X_{i};Y_{s})\geq k+\delta\tau_{{k}}^{(\operatorname*{\bf p}% )}(\operatorname*{\bf s})I_{\min}\geq k\left(1+\frac{\delta I_{\min}}{I_{\max}% }\right)โˆ‘ start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT italic_I ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_Y start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) โ‰ฅ italic_k + italic_ฮด italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT ( bold_s ) italic_I start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT โ‰ฅ italic_k ( 1 + divide start_ARG italic_ฮด italic_I start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_ARG start_ARG italic_I start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_ARG ). We now have that

E๐’ž[\displaystyle E_{{\cal C}}[italic_E start_POSTSUBSCRIPT caligraphic_C end_POSTSUBSCRIPT [ Z(m)โˆฃ๐’ž(mโˆ’1)]=Pr๐’ž[Z(m)=1โˆฃ๐’ž(mโˆ’1)]\displaystyle Z(m)\mid{\cal C}^{(m-1)}]=\Pr_{{\cal C}}[Z(m)=1\mid{\cal C}^{(m-% 1)}]italic_Z ( italic_m ) โˆฃ caligraphic_C start_POSTSUPERSCRIPT ( italic_m - 1 ) end_POSTSUPERSCRIPT ] = roman_Pr start_POSTSUBSCRIPT caligraphic_C end_POSTSUBSCRIPT [ italic_Z ( italic_m ) = 1 โˆฃ caligraphic_C start_POSTSUPERSCRIPT ( italic_m - 1 ) end_POSTSUPERSCRIPT ]
โ‰ค2kโข2โˆ’kโˆ’kโขฮดโขIminImax+g^โข(ฮด)โขฯ„โข(๐ฌ)+โˆ‘i,s2โˆ’g^โข(ฮด)โขmini,sโกni,sabsentsuperscript2๐‘˜superscript2๐‘˜๐‘˜๐›ฟsubscript๐ผsubscript๐ผ^๐‘”๐›ฟ๐œ๐ฌsubscript๐‘–๐‘ superscript2^๐‘”๐›ฟsubscript๐‘–๐‘ subscript๐‘›๐‘–๐‘ \displaystyle\leq 2^{k}2^{-k-k\frac{\delta I_{\min}}{I_{\max}}+\hat{g}(\delta)% \tau(\operatorname*{\bf s})}+\sum_{i,s}2^{-\hat{g}(\delta)\min_{i,s}n_{i,s}}โ‰ค 2 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT 2 start_POSTSUPERSCRIPT - italic_k - italic_k divide start_ARG italic_ฮด italic_I start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_ARG start_ARG italic_I start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_ARG + over^ start_ARG italic_g end_ARG ( italic_ฮด ) italic_ฯ„ ( bold_s ) end_POSTSUPERSCRIPT + โˆ‘ start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT - over^ start_ARG italic_g end_ARG ( italic_ฮด ) roman_min start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT end_POSTSUPERSCRIPT
โ‰ค2โˆ’kโข(ฮดโขIminImaxโˆ’g^โข(ฮด)Imin)+โ„“โข|๐’ฎ|โข2โˆ’kโขg^โข(ฮด)โขf2โข(ฮด)Imaxโ‰ค12โขk,absentsuperscript2๐‘˜๐›ฟsubscript๐ผsubscript๐ผ^๐‘”๐›ฟsubscript๐ผโ„“๐’ฎsuperscript2๐‘˜^๐‘”๐›ฟsuperscript๐‘“2๐›ฟsubscript๐ผ12๐‘˜\displaystyle\leq 2^{-k\left(\frac{\delta I_{\min}}{I_{\max}}-\frac{\hat{g}(% \delta)}{I_{\min}}\right)}+\ell|\mathcal{S}|2^{-k\frac{\hat{g}(\delta)f^{2}(% \delta)}{I_{\max}}}\leq\frac{1}{2k},โ‰ค 2 start_POSTSUPERSCRIPT - italic_k ( divide start_ARG italic_ฮด italic_I start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_ARG start_ARG italic_I start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_ARG - divide start_ARG over^ start_ARG italic_g end_ARG ( italic_ฮด ) end_ARG start_ARG italic_I start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_ARG ) end_POSTSUPERSCRIPT + roman_โ„“ | caligraphic_S | 2 start_POSTSUPERSCRIPT - italic_k divide start_ARG over^ start_ARG italic_g end_ARG ( italic_ฮด ) italic_f start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_ฮด ) end_ARG start_ARG italic_I start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_ARG end_POSTSUPERSCRIPT โ‰ค divide start_ARG 1 end_ARG start_ARG 2 italic_k end_ARG ,

for ฮด๐›ฟ\deltaitalic_ฮด and thus gโข(ฮด)๐‘”๐›ฟg(\delta)italic_g ( italic_ฮด ) sufficiently small such that g^โข(ฮด)โ‰คฮดโขImin2โขImax^๐‘”๐›ฟ๐›ฟsubscript๐ผ2subscript๐ผ\hat{g}(\delta)\leq\frac{\delta I_{\min}}{2I_{\max}}over^ start_ARG italic_g end_ARG ( italic_ฮด ) โ‰ค divide start_ARG italic_ฮด italic_I start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_I start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_ARG and k๐‘˜kitalic_k sufficiently large.

Let Z=โˆ‘m=12kZโข(m)๐‘superscriptsubscript๐‘š1superscript2๐‘˜๐‘๐‘šZ=\sum_{m=1}^{2^{k}}Z(m)italic_Z = โˆ‘ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT italic_Z ( italic_m ). To conclude the claim assertion, using the Chernoff-type Lemma A.1 of [5] and noting that Zโข(m)๐‘๐‘šZ(m)italic_Z ( italic_m ) depends only on ๐ฑ1,โ€ฆ,๐ฑmsubscript๐ฑ1โ€ฆsubscript๐ฑ๐‘š\operatorname{\bf x}_{1},\dots,\operatorname{\bf x}_{m}bold_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , โ€ฆ , bold_x start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT, we have

Pr๐’žโก[Zโ‰ฅ2kk]โ‰ค2โˆ’ฮฉโข(2k/k).subscriptPr๐’ž๐‘superscript2๐‘˜๐‘˜superscript2ฮฉsuperscript2๐‘˜๐‘˜\Pr_{\cal C}\left[Z\geq\frac{2^{k}}{k}\right]\leq 2^{-\Omega(2^{k}/k)}.roman_Pr start_POSTSUBSCRIPT caligraphic_C end_POSTSUBSCRIPT [ italic_Z โ‰ฅ divide start_ARG 2 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_k end_ARG ] โ‰ค 2 start_POSTSUPERSCRIPT - roman_ฮฉ ( 2 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT / italic_k ) end_POSTSUPERSCRIPT .

โˆŽ

Consider any ๐ฌโˆˆ๐’ฎโˆž๐ฌsuperscript๐’ฎ\operatorname*{\bf s}\in\operatorname{\mathcal{S}}^{\infty}bold_s โˆˆ caligraphic_S start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT. Let ฯ„max=max๐ฌโกฯ„โข(๐ฌ)=(1โˆ’ฮด)โขkIminsubscript๐œsubscript๐ฌ๐œ๐ฌ1๐›ฟ๐‘˜subscript๐ผ\tau_{\max}=\max_{\operatorname*{\bf s}}\tau(\operatorname*{\bf s})=\frac{(1-% \delta)k}{I_{\min}}italic_ฯ„ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT = roman_max start_POSTSUBSCRIPT bold_s end_POSTSUBSCRIPT italic_ฯ„ ( bold_s ) = divide start_ARG ( 1 - italic_ฮด ) italic_k end_ARG start_ARG italic_I start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_ARG. Thus, using a union bound over all ๐ฌโˆˆ๐’ฎโˆž๐ฌsuperscript๐’ฎ\operatorname*{\bf s}\in\operatorname{\mathcal{S}}^{\infty}bold_s โˆˆ caligraphic_S start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT of length at most ฯ„maxsubscript๐œ\tau_{\max}italic_ฯ„ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT, Claimย 1 implies the existence of a code ๐’ž๐’ž{\cal C}caligraphic_C that is ฮด๐›ฟ\deltaitalic_ฮด-close to ๐ฉ๐ฉ{\bf p}bold_p and a DSI-decoder that for any ๐ฌ๐ฌ\operatorname*{\bf s}bold_s decodes at time ฯ„โข(๐ฌ)๐œ๐ฌ\tau(\operatorname*{\bf s})italic_ฯ„ ( bold_s ) with average error at most 1/k1๐‘˜1/k1 / italic_k. We conclude that the competitive ratio obtained by the suggested scheme is

inf๐ฌฯ„kโˆ—โข(๐ฌ)ฯ„โข(๐ฌ)=inf๐ฌฯ„kโˆ—โข(๐ฌ)(1+ฮด)โขฯ„k(๐ฉ)โข(๐ฌ)โ‰ฅ(1โˆ’ฮด)โขinf๐ฌ(ฯ„kโˆ—โข(๐ฌ)ฯ„k(๐ฉ)โข(๐ฌ)).subscriptinfimum๐ฌsubscriptsuperscript๐œ๐‘˜๐ฌ๐œ๐ฌsubscriptinfimum๐ฌsubscriptsuperscript๐œ๐‘˜๐ฌ1๐›ฟsuperscriptsubscript๐œ๐‘˜๐ฉ๐ฌ1๐›ฟsubscriptinfimum๐ฌsubscriptsuperscript๐œ๐‘˜๐ฌsubscriptsuperscript๐œ๐ฉ๐‘˜๐ฌ\displaystyle\inf_{\operatorname*{\bf s}}\frac{\tau^{*}_{k}({\operatorname*{% \bf s}})}{\tau(\operatorname*{\bf s})}=\inf_{\operatorname*{\bf s}}\frac{\tau^% {*}_{k}{(\operatorname*{\bf s})}}{(1+\delta)\tau_{{k}}^{(\operatorname*{\bf p}% )}(\operatorname*{\bf s})}\geq(1-\delta)\inf_{\operatorname*{\bf s}}\left(% \frac{\tau^{*}_{k}({\bf s})}{\tau^{({\bf p})}_{k}({\bf s})}\right).roman_inf start_POSTSUBSCRIPT bold_s end_POSTSUBSCRIPT divide start_ARG italic_ฯ„ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG start_ARG italic_ฯ„ ( bold_s ) end_ARG = roman_inf start_POSTSUBSCRIPT bold_s end_POSTSUBSCRIPT divide start_ARG italic_ฯ„ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG start_ARG ( 1 + italic_ฮด ) italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT ( bold_s ) end_ARG โ‰ฅ ( 1 - italic_ฮด ) roman_inf start_POSTSUBSCRIPT bold_s end_POSTSUBSCRIPT ( divide start_ARG italic_ฯ„ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG start_ARG italic_ฯ„ start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG ) . (16)

To conclude our proof, we revisit the functions f๐‘“fitalic_f, g๐‘”gitalic_g and g^^๐‘”\hat{g}over^ start_ARG italic_g end_ARG alongside our assumptions. Throughout, we fix ฮด๐›ฟ\deltaitalic_ฮด to be a sufficiently small constant and k๐‘˜kitalic_k asymptotically large. We chose fโข(ฮด)โ†’0โ†’๐‘“๐›ฟ0f(\delta)\rightarrow 0italic_f ( italic_ฮด ) โ†’ 0 as ฮดโ†’0โ†’๐›ฟ0\delta\rightarrow 0italic_ฮด โ†’ 0. We chose gโข(ฮด)โ†’0โ†’๐‘”๐›ฟ0g(\delta)\rightarrow 0italic_g ( italic_ฮด ) โ†’ 0 as ฮดโ†’0โ†’๐›ฟ0\delta\rightarrow 0italic_ฮด โ†’ 0 to be a sufficiently small function of ฮด๐›ฟ\deltaitalic_ฮด to satisfy the requirement g^โข(ฮด)โ‰คฮดโขImin2โขImax^๐‘”๐›ฟ๐›ฟsubscript๐ผ2subscript๐ผ\hat{g}(\delta)\leq\frac{\delta I_{\min}}{2I_{\max}}over^ start_ARG italic_g end_ARG ( italic_ฮด ) โ‰ค divide start_ARG italic_ฮด italic_I start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_I start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_ARG stated above. If we remove the assumptions that for all iโˆˆ[โ„“]๐‘–delimited-[]โ„“i\in[\ell]italic_i โˆˆ [ roman_โ„“ ], niโ‰ฅfโข(ฮด)โขฯ„โข(๐ฌ)subscript๐‘›๐‘–๐‘“๐›ฟ๐œ๐ฌn_{i}\geq f(\delta)\tau(\operatorname*{\bf s})italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT โ‰ฅ italic_f ( italic_ฮด ) italic_ฯ„ ( bold_s ) and for all sโˆˆ๐’ฎ๐‘ ๐’ฎs\in\operatorname{\mathcal{S}}italic_s โˆˆ caligraphic_S, q๐ฌiโข(s)โ‰ฅfโข(ฮด)subscript๐‘žsubscript๐ฌ๐‘–๐‘ ๐‘“๐›ฟq_{{\bf s}_{i}}(s)\geq f(\delta)italic_q start_POSTSUBSCRIPT bold_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_s ) โ‰ฅ italic_f ( italic_ฮด ) then the decoder can neglect the entries corresponding to bad pairs (i,s)๐‘–๐‘ (i,s)( italic_i , italic_s ) that violate (one of) the assumptions. The cumulative mutual information โˆ‘i,sni,sโขIโข(Xi,Ys)subscript๐‘–๐‘ subscript๐‘›๐‘–๐‘ ๐ผsubscript๐‘‹๐‘–subscript๐‘Œ๐‘ \sum_{i,s}n_{i,s}I(X_{i},Y_{s})โˆ‘ start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT italic_I ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) (summed over bad pairs) lost at the decoder is bounded by ฯ„โข(๐ฌ)โขImaxโข(โ„“โขfโข(ฮด)+|๐’ฎ|โขfโข(ฮด))๐œ๐ฌsubscript๐ผโ„“๐‘“๐›ฟ๐’ฎ๐‘“๐›ฟ\tau(\operatorname*{\bf s})I_{\max}(\ell f(\delta)+|\mathcal{S}|f(\delta))italic_ฯ„ ( bold_s ) italic_I start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( roman_โ„“ italic_f ( italic_ฮด ) + | caligraphic_S | italic_f ( italic_ฮด ) ) which for sufficiently small ฮด๐›ฟ\deltaitalic_ฮด and suitable fโข(ฮด)๐‘“๐›ฟf(\delta)italic_f ( italic_ฮด ) leaves the remaining cumulative mutual information โˆ‘i,sni,sโขIโข(Xi,Ys)subscript๐‘–๐‘ subscript๐‘›๐‘–๐‘ ๐ผsubscript๐‘‹๐‘–subscript๐‘Œ๐‘ \sum_{i,s}n_{i,s}I(X_{i},Y_{s})โˆ‘ start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT italic_I ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) (summed over good pairs) at the decoder to be at least kโข(1+ฮดโขImin2โขImax)๐‘˜1๐›ฟsubscript๐ผ2subscript๐ผk\left(1+\frac{\delta I_{\min}}{2I_{\max}}\right)italic_k ( 1 + divide start_ARG italic_ฮด italic_I start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_I start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_ARG ) which replaces โˆ‘i,sni,sโขIโข(Xi,Ys)โ‰ฅkโข(1+ฮดโขIminImax)subscript๐‘–๐‘ subscript๐‘›๐‘–๐‘ ๐ผsubscript๐‘‹๐‘–subscript๐‘Œ๐‘ ๐‘˜1๐›ฟsubscript๐ผsubscript๐ผ\sum_{i,s}n_{i,s}I(X_{i},Y_{s})\geq k\left(1+\frac{\delta I_{\min}}{I_{\max}}\right)โˆ‘ start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_i , italic_s end_POSTSUBSCRIPT italic_I ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) โ‰ฅ italic_k ( 1 + divide start_ARG italic_ฮด italic_I start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_ARG start_ARG italic_I start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_ARG ) in the previous presented proof. If the assumption that ฯ„โข(๐ฌ)>โˆ‘i=1โ„“โˆ’1ni๐œ๐ฌsuperscriptsubscript๐‘–1โ„“1subscript๐‘›๐‘–\tau(\operatorname*{\bf s})>\sum_{i=1}^{\ell-1}n_{i}italic_ฯ„ ( bold_s ) > โˆ‘ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_โ„“ - 1 end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT does not hold, and instead it holds that โˆ‘i=1โ„“โˆ—โˆ’1ni<ฯ„โข(๐ฌ)โ‰คโˆ‘i=1โ„“โˆ—nisuperscriptsubscript๐‘–1superscriptโ„“1subscript๐‘›๐‘–๐œ๐ฌsuperscriptsubscript๐‘–1superscriptโ„“subscript๐‘›๐‘–\sum_{i=1}^{\ell^{*}-1}n_{i}<\tau(\operatorname*{\bf s})\leq\sum_{i=1}^{\ell^{% *}}n_{i}โˆ‘ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_โ„“ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT < italic_ฯ„ ( bold_s ) โ‰ค โˆ‘ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_โ„“ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT then one need only consider the prefix of ๐ฉ๐ฉ\operatorname*{\bf p}bold_p consisting of the first โ„“โˆ—superscriptโ„“\ell^{*}roman_โ„“ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT chunks and replace โ„“โ„“\ellroman_โ„“ in the analysis by โ„“โˆ—superscriptโ„“\ell^{*}roman_โ„“ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT.

Appendix C Proof of ๐‚๐‘โ‰ค1/2๐‚๐‘12\operatorname*{\mathbf{CR}}\leq 1/2bold_CR โ‰ค 1 / 2

In this section, we prove that for the family of channels in Fig.ย 1 we have ๐‚๐‘โ‰ค1/2๐‚๐‘12\operatorname*{\mathbf{CR}}\leq 1/2bold_CR โ‰ค 1 / 2. The main idea is to identify from the code analysis of our lower bound a collection ๐’ฎ^โˆžโŠ†๐’ฎโˆžsuperscript^๐’ฎsuperscript๐’ฎ\widehat{\mathcal{S}}^{\infty}\subseteq\mathcal{S}^{\infty}over^ start_ARG caligraphic_S end_ARG start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT โŠ† caligraphic_S start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT of state sequences that constrain the optimization defining the competitive ratio ๐‚๐‘๐‚๐‘\operatorname*{\mathbf{CR}}bold_CR. Namely,

๐‚๐‘๐‚๐‘\displaystyle{\operatorname*{\mathbf{CR}}}bold_CR โ‰คlimkโ†’โˆžmax๐ฉโˆˆ๐’ซโˆžโกmin๐ฌโˆˆ๐’ฎโˆžโก(ฯ„kโˆ—โข(๐ฌ)ฯ„k(๐ฉ)โข(๐ฌ))โขabsentsubscriptโ†’๐‘˜subscript๐ฉsuperscript๐’ซsubscript๐ฌsuperscript๐’ฎsubscriptsuperscript๐œ๐‘˜๐ฌsubscriptsuperscript๐œ๐ฉ๐‘˜๐ฌabsent\displaystyle\leq\lim_{k\to\infty}\max_{\operatorname*{\bf p}\in\mathcal{P}^{% \infty}}\min_{\operatorname*{\bf s}\in\mathcal{S}^{\infty}}\left(\frac{\tau^{*% }_{k}({\bf s})}{\tau^{({\bf p})}_{k}({\bf s})}\right)\operatorname*{}โ‰ค roman_lim start_POSTSUBSCRIPT italic_k โ†’ โˆž end_POSTSUBSCRIPT roman_max start_POSTSUBSCRIPT bold_p โˆˆ caligraphic_P start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT end_POSTSUBSCRIPT roman_min start_POSTSUBSCRIPT bold_s โˆˆ caligraphic_S start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( divide start_ARG italic_ฯ„ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG start_ARG italic_ฯ„ start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG ) start_OPERATOR end_OPERATOR
โ‰คlimkโ†’โˆžmax๐ฉโˆˆ๐’ซโˆžโกmin๐ฌโˆˆ๐’ฎ^โˆžโก(ฯ„kโˆ—โข(๐ฌ)ฯ„k(๐ฉ)โข(๐ฌ)).absentsubscriptโ†’๐‘˜subscript๐ฉsuperscript๐’ซsubscript๐ฌsuperscript^๐’ฎsubscriptsuperscript๐œ๐‘˜๐ฌsubscriptsuperscript๐œ๐ฉ๐‘˜๐ฌ\displaystyle\leq\lim_{k\to\infty}\max_{\operatorname*{\bf p}\in\mathcal{P}^{% \infty}}\min_{\operatorname*{\bf s}\in\widehat{\mathcal{S}}^{\infty}}\left(% \frac{\tau^{*}_{k}({\bf s})}{\tau^{({\bf p})}_{k}({\bf s})}\right).โ‰ค roman_lim start_POSTSUBSCRIPT italic_k โ†’ โˆž end_POSTSUBSCRIPT roman_max start_POSTSUBSCRIPT bold_p โˆˆ caligraphic_P start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT end_POSTSUBSCRIPT roman_min start_POSTSUBSCRIPT bold_s โˆˆ over^ start_ARG caligraphic_S end_ARG start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( divide start_ARG italic_ฯ„ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG start_ARG italic_ฯ„ start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG ) . (17)

Bounding ๐‚๐‘๐‚๐‘\operatorname*{\mathbf{CR}}bold_CR by the top expression in (C) follows from standard capacity bounds, e.g., [30, Section VI]. The upper bound in the bottom inequality of (C) holds for any subset ๐’ฎ^โˆžsuperscript^๐’ฎ\widehat{\mathcal{S}}^{\infty}over^ start_ARG caligraphic_S end_ARG start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT of state sequences. Here, we consider ๐’ฎ^โˆžsuperscript^๐’ฎ\widehat{\mathcal{S}}^{\infty}over^ start_ARG caligraphic_S end_ARG start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT consisting of two sets. The first is ๐’ฎ^1={1kโขsโˆž}subscript^๐’ฎ1superscript1๐‘˜superscript๐‘ \widehat{\mathcal{S}}_{1}=\{1^{k}s^{\infty}\}over^ start_ARG caligraphic_S end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = { 1 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_s start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT } corresponding to the two sequences whose prefix of length k๐‘˜kitalic_k is 1ksuperscript1๐‘˜1^{k}1 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT, followed by a constant-state sequence sโˆžsuperscript๐‘ s^{\infty}italic_s start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT with sโˆˆ{1,2}๐‘ 12s\in\{1,2\}italic_s โˆˆ { 1 , 2 }. The second set is defined as ๐’ฎ^2={2kโข๐ฌ1:23โขk/4sโˆž}subscript^๐’ฎ2superscript2๐‘˜superscriptsubscript๐ฌ:123๐‘˜4superscript๐‘ \widehat{\mathcal{S}}_{2}=\{2^{k}\operatorname*{\bf s}_{1:2}^{3k/4}s^{\infty}\}over^ start_ARG caligraphic_S end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = { 2 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT bold_s start_POSTSUBSCRIPT 1 : 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 italic_k / 4 end_POSTSUPERSCRIPT italic_s start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT } where ๐ฌ1:23โขk/4superscriptsubscript๐ฌ:123๐‘˜4\operatorname*{\bf s}_{1:2}^{3k/4}bold_s start_POSTSUBSCRIPT 1 : 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 italic_k / 4 end_POSTSUPERSCRIPT is any sequence of length 3โขk/43๐‘˜43k/43 italic_k / 4 whose number of states of type 1111 and 2222 satisfy a ratio of [1:2]delimited-[]:12[1:2][ 1 : 2 ], e.g., 1k/4โข2k/2โˆˆ๐ฌ1:23โขk/4superscript1๐‘˜4superscript2๐‘˜2superscriptsubscript๐ฌ:123๐‘˜41^{k/4}2^{k/2}\in\operatorname*{\bf s}_{1:2}^{3k/4}1 start_POSTSUPERSCRIPT italic_k / 4 end_POSTSUPERSCRIPT 2 start_POSTSUPERSCRIPT italic_k / 2 end_POSTSUPERSCRIPT โˆˆ bold_s start_POSTSUBSCRIPT 1 : 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 italic_k / 4 end_POSTSUPERSCRIPT. Simply put, we fix the type of the sequence in this interval.

The particular choice of these sets is based on their optimal decoding times. We have ฯ„kโˆ—โข(๐ฌ)=ksubscriptsuperscript๐œ๐‘˜๐ฌ๐‘˜\tau^{*}_{k}(\operatorname*{\bf s})=kitalic_ฯ„ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) = italic_k for all ๐ฌโˆˆ๐’ฎ^1๐ฌsubscript^๐’ฎ1\operatorname*{\bf s}\in\widehat{\mathcal{S}}_{1}bold_s โˆˆ over^ start_ARG caligraphic_S end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT since their prefix is 1ksuperscript1๐‘˜1^{k}1 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT. For the second set, we have ฯ„kโˆ—โข(๐ฌ)=7โขk/4subscriptsuperscript๐œ๐‘˜๐ฌ7๐‘˜4\tau^{*}_{k}(\operatorname*{\bf s})=7k/4italic_ฯ„ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) = 7 italic_k / 4 for all ๐ฌโˆˆ๐’ฎ^2๐ฌsubscript^๐’ฎ2\operatorname*{\bf s}\in\widehat{\mathcal{S}}_{2}bold_s โˆˆ over^ start_ARG caligraphic_S end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT since the location of 1111โ€™s in the interval [k,7โขk/4]๐‘˜7๐‘˜4[k,7k/4][ italic_k , 7 italic_k / 4 ] has no impact on the optimal decoding time.

The remainder of the proof consists of two main steps. The first step is to show that the optimization of input distributions in (C) can be limited to input distributions that are constant within the intervals (0,k],(k,7โขk/4],(7โขk/4,โˆž)0๐‘˜๐‘˜7๐‘˜47๐‘˜4(0,k],(k,7k/4],(7k/4,\infty)( 0 , italic_k ] , ( italic_k , 7 italic_k / 4 ] , ( 7 italic_k / 4 , โˆž ). This step will follow from the structure of the chosen state-sequence sets ๐’ฎ^1subscript^๐’ฎ1\widehat{\mathcal{S}}_{1}over^ start_ARG caligraphic_S end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and ๐’ฎ^2subscript^๐’ฎ2\widehat{\mathcal{S}}_{2}over^ start_ARG caligraphic_S end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. Then, to complete the proof of the upper bound, in the second step we compute the optimization in (C) under the restricted domain of input distributions.

Our first claim is that during the time interval [1,k]1๐‘˜[1,k][ 1 , italic_k ] the optimizing input distribution is fixed and need not change. This follows from the fact that no matter which state sequence is realized, the optimal stop** time is at least k๐‘˜kitalic_k. Thus, by the concavity of mutual information, any collection of time-varying input distributions in this interval is sub-optimal. The fixed distribution in this interval is denoted by p1โˆˆ๐’ซโก(๐’ณ)subscript๐‘1๐’ซ๐’ณp_{1}\in\operatorname{\mathcal{P}}(\operatorname{\mathcal{X}})italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT โˆˆ caligraphic_P ( caligraphic_X ).

For the second interval, [k,7โขk/4]๐‘˜7๐‘˜4[k,7k/4][ italic_k , 7 italic_k / 4 ], we also claim that a fixed input distribution optimizes the competitive ratio. The argument follows from symmetry: the type of the state sequence is fixed, thus we can take an expected value over the state sequence to upper bound the minimal cumulative mutual information, i.e.,

min๐ฌโˆˆ๐ฌ1:23โขk/4โขโˆ‘i=13โขk/4Isiโข(Xi;Yi)=min๐ฌโˆˆ๐ฌ1:23โขk/4โขโˆ‘i=13โขk/4โˆ‘s=1,2๐Ÿ™โข{si=s}โขIsโข(Xi;Yi)โขsubscript๐ฌsuperscriptsubscript๐ฌ:123๐‘˜4superscriptsubscript๐‘–13๐‘˜4subscript๐ผsubscript๐‘ ๐‘–subscript๐‘‹๐‘–subscript๐‘Œ๐‘–subscript๐ฌsuperscriptsubscript๐ฌ:123๐‘˜4superscriptsubscript๐‘–13๐‘˜4subscript๐‘ 121subscript๐‘ ๐‘–๐‘ subscript๐ผ๐‘ subscript๐‘‹๐‘–subscript๐‘Œ๐‘–absent\displaystyle\min_{\operatorname*{\bf s}\in\operatorname*{\bf s}_{1:2}^{3k/4}}% \sum_{i=1}^{3k/4}I_{s_{i}}(X_{i};Y_{i})=\min_{\operatorname*{\bf s}\in% \operatorname*{\bf s}_{1:2}^{3k/4}}\sum_{i=1}^{3k/4}\sum_{s=1,2}\mathbb{1}\{s_% {i}=s\}I_{s}(X_{i};Y_{i})\operatorname*{}roman_min start_POSTSUBSCRIPT bold_s โˆˆ bold_s start_POSTSUBSCRIPT 1 : 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 italic_k / 4 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT โˆ‘ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 italic_k / 4 end_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = roman_min start_POSTSUBSCRIPT bold_s โˆˆ bold_s start_POSTSUBSCRIPT 1 : 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 italic_k / 4 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT โˆ‘ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 italic_k / 4 end_POSTSUPERSCRIPT โˆ‘ start_POSTSUBSCRIPT italic_s = 1 , 2 end_POSTSUBSCRIPT blackboard_1 { italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_s } italic_I start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_OPERATOR end_OPERATOR
โ‰ค(a)๐”ผS3โขk/4โข[โˆ‘i=13โขk/4โˆ‘s=1,2๐Ÿ™โข{Si=s}โขIsโข(Xi;Yi)]โขsuperscript๐‘Žabsentsubscript๐”ผsuperscript๐‘†3๐‘˜4delimited-[]superscriptsubscript๐‘–13๐‘˜4subscript๐‘ 121subscript๐‘†๐‘–๐‘ subscript๐ผ๐‘ subscript๐‘‹๐‘–subscript๐‘Œ๐‘–absent\displaystyle\stackrel{{\scriptstyle(a)}}{{\leq}}\mathbb{E}_{S^{3k/4}}\left[% \sum_{i=1}^{3k/4}\sum_{s=1,2}\mathbb{1}\{S_{i}=s\}I_{s}(X_{i};Y_{i})\right]% \operatorname*{}start_RELOP SUPERSCRIPTOP start_ARG โ‰ค end_ARG start_ARG ( italic_a ) end_ARG end_RELOP blackboard_E start_POSTSUBSCRIPT italic_S start_POSTSUPERSCRIPT 3 italic_k / 4 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT [ โˆ‘ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 italic_k / 4 end_POSTSUPERSCRIPT โˆ‘ start_POSTSUBSCRIPT italic_s = 1 , 2 end_POSTSUBSCRIPT blackboard_1 { italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_s } italic_I start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ] start_OPERATOR end_OPERATOR
=(b)3โขk4โข๐”ผS3โขk/4,Tโข[โˆ‘s=1,2๐Ÿ™โข{ST=s}โขIsโข(XT;YT|T)],superscript๐‘absent3๐‘˜4subscript๐”ผsuperscript๐‘†3๐‘˜4๐‘‡delimited-[]subscript๐‘ 121subscript๐‘†๐‘‡๐‘ subscript๐ผ๐‘ subscript๐‘‹๐‘‡conditionalsubscript๐‘Œ๐‘‡๐‘‡\displaystyle\stackrel{{\scriptstyle(b)}}{{=}}\frac{3k}{4}\mathbb{E}_{S^{3k/4}% ,T}\left[\sum_{s=1,2}\mathbb{1}\{S_{T}=s\}I_{s}(X_{T};Y_{T}|T)\right],start_RELOP SUPERSCRIPTOP start_ARG = end_ARG start_ARG ( italic_b ) end_ARG end_RELOP divide start_ARG 3 italic_k end_ARG start_ARG 4 end_ARG blackboard_E start_POSTSUBSCRIPT italic_S start_POSTSUPERSCRIPT 3 italic_k / 4 end_POSTSUPERSCRIPT , italic_T end_POSTSUBSCRIPT [ โˆ‘ start_POSTSUBSCRIPT italic_s = 1 , 2 end_POSTSUBSCRIPT blackboard_1 { italic_S start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT = italic_s } italic_I start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ; italic_Y start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT | italic_T ) ] , (18)

where in step (a)๐‘Ž(a)( italic_a ) we bound the minimum sequence by a uniform distribution over a uniform random sequence S3โขk/4superscript๐‘†3๐‘˜4S^{3k/4}italic_S start_POSTSUPERSCRIPT 3 italic_k / 4 end_POSTSUPERSCRIPT of the corresponding fixed type. In step (b)๐‘(b)( italic_b ), we define a uniform random variable TโˆผU(1:3k/4)T\sim\text{U}(1:3k/4)italic_T โˆผ U ( 1 : 3 italic_k / 4 ), independent of S3โขk/4superscript๐‘†3๐‘˜4S^{3k/4}italic_S start_POSTSUPERSCRIPT 3 italic_k / 4 end_POSTSUPERSCRIPT. The latter expectation converges for large k๐‘˜kitalic_k to Iโข(X;Y|S,T)๐ผ๐‘‹conditional๐‘Œ๐‘†๐‘‡I(X;Y|S,T)italic_I ( italic_X ; italic_Y | italic_S , italic_T ) with SโˆผBernโข(1/3)similar-to๐‘†Bern13S\sim\text{Bern}(1/3)italic_S โˆผ Bern ( 1 / 3 ) and (T,S,X,Y)๐‘‡๐‘†๐‘‹๐‘Œ(T,S,X,Y)( italic_T , italic_S , italic_X , italic_Y ) distributed according to pโข(t)โขpโข(s)โขpโข(x|t)โขpโข(y|x,s)๐‘๐‘ก๐‘๐‘ ๐‘conditional๐‘ฅ๐‘ก๐‘conditional๐‘ฆ๐‘ฅ๐‘ p(t)p(s)p(x|t)p(y|x,s)italic_p ( italic_t ) italic_p ( italic_s ) italic_p ( italic_x | italic_t ) italic_p ( italic_y | italic_x , italic_s ). We note that Iโข(X;Y|S,T)โ‰คIโข(X;Y|S)๐ผ๐‘‹conditional๐‘Œ๐‘†๐‘‡๐ผ๐‘‹conditional๐‘Œ๐‘†I(X;Y|S,T)\leq I(X;Y|S)italic_I ( italic_X ; italic_Y | italic_S , italic_T ) โ‰ค italic_I ( italic_X ; italic_Y | italic_S ) where the joint distribution on the right hand side is pโข(s)โขpโข(x)โขpโข(y|x,s)๐‘๐‘ ๐‘๐‘ฅ๐‘conditional๐‘ฆ๐‘ฅ๐‘ p(s)p(x)p(y|x,s)italic_p ( italic_s ) italic_p ( italic_x ) italic_p ( italic_y | italic_x , italic_s ) with pโข(x)=โˆ‘tpโข(t)โขpโข(x|t)๐‘๐‘ฅsubscript๐‘ก๐‘๐‘ก๐‘conditional๐‘ฅ๐‘กp(x)=\sum_{t}p(t)p(x|t)italic_p ( italic_x ) = โˆ‘ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_p ( italic_t ) italic_p ( italic_x | italic_t ). The inequality follows from the Markov chain Tโˆ’(X,S)โˆ’Y๐‘‡๐‘‹๐‘†๐‘ŒT-(X,S)-Yitalic_T - ( italic_X , italic_S ) - italic_Y. To conclude, an i.i.d. distribution of channel inputs will achieve the upper bound for the cumulative mutual information for large k๐‘˜kitalic_k. A similar result holds in the interval [k,7โขk/4]๐‘˜7๐‘˜4[k,7k/4][ italic_k , 7 italic_k / 4 ] for ๐’ฎ^1subscript^๐’ฎ1\widehat{\mathcal{S}}_{1}over^ start_ARG caligraphic_S end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT as well since the state is constant during this interval. In particular, this follows from the concavity of mutual information.

Finally, we note that since for any ๐ฌ๐ฌ\operatorname*{\bf s}bold_s, ฯ„kโˆ—โข(๐ฌ)โ‰ค7โขk/4subscriptsuperscript๐œ๐‘˜๐ฌ7๐‘˜4\tau^{*}_{k}(\operatorname*{\bf s})\leq 7k/4italic_ฯ„ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) โ‰ค 7 italic_k / 4, and since our sets of sequences are symmetric after time 7โขk/47๐‘˜47k/47 italic_k / 4, the best rate after time 7โขk/47๐‘˜47k/47 italic_k / 4 is achieved when the rates of both channels are equal, i.e., the best rate is 1313\frac{1}{3}divide start_ARG 1 end_ARG start_ARG 3 end_ARG. This completes the first step of the proof, to show that the optimal input distributions can be restricted to be constant in the specified intervals.

We proceed with the second step of our proof in which we compute the stop** times induced by each set of sequences. From the stop** times we obtain values for competitive ratios of the form ฯ„kโˆ—โข(๐ฌ)ฯ„k(๐ฉ)โข(๐ฌ)subscriptsuperscript๐œ๐‘˜๐ฌsubscriptsuperscript๐œ๐ฉ๐‘˜๐ฌ\frac{\tau^{*}_{k}({\bf s})}{\tau^{({\bf p})}_{k}({\bf s})}divide start_ARG italic_ฯ„ start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG start_ARG italic_ฯ„ start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) end_ARG; then, optimizing over these ratios according to (C) will result in our upper bound on the competitive ratio.

For sequences in ๐ฌโˆˆ๐’ฎ^2๐ฌsubscript^๐’ฎ2\operatorname*{\bf s}\in\widehat{\mathcal{S}}_{2}bold_s โˆˆ over^ start_ARG caligraphic_S end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, we have

k๐‘˜\displaystyle kitalic_k =kโขp12+k4โข(1โˆ’p2)+k2โขp22+13โข(ฯ„k(๐ฉ)โข(๐ฌ)โˆ’74)absent๐‘˜subscript๐‘12๐‘˜41subscript๐‘2๐‘˜2subscript๐‘2213superscriptsubscript๐œ๐‘˜๐ฉ๐ฌ74\displaystyle=k\frac{p_{1}}{2}+\frac{k}{4}(1-p_{2})+\frac{k}{2}\frac{p_{2}}{2}% +\frac{1}{3}\left(\tau_{k}^{({\bf p})}(\operatorname*{\bf s})-\frac{7}{4}\right)= italic_k divide start_ARG italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG + divide start_ARG italic_k end_ARG start_ARG 4 end_ARG ( 1 - italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) + divide start_ARG italic_k end_ARG start_ARG 2 end_ARG divide start_ARG italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG + divide start_ARG 1 end_ARG start_ARG 3 end_ARG ( italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT ( bold_s ) - divide start_ARG 7 end_ARG start_ARG 4 end_ARG ) (19)
=kโขp12+k4+13โข(ฯ„k(๐ฉ)โข(๐ฌ)โˆ’7โขk4).absent๐‘˜subscript๐‘12๐‘˜413superscriptsubscript๐œ๐‘˜๐ฉ๐ฌ7๐‘˜4\displaystyle=k\frac{p_{1}}{2}+\frac{k}{4}+\frac{1}{3}\left(\tau_{k}^{({\bf p}% )}(\operatorname*{\bf s})-\frac{7k}{4}\right).= italic_k divide start_ARG italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG + divide start_ARG italic_k end_ARG start_ARG 4 end_ARG + divide start_ARG 1 end_ARG start_ARG 3 end_ARG ( italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT ( bold_s ) - divide start_ARG 7 italic_k end_ARG start_ARG 4 end_ARG ) . (20)

Note that the stop** time is independent of p2subscript๐‘2p_{2}italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. This is due to our choice of the ratio [1:2]delimited-[]:12[1:2][ 1 : 2 ] in the definition of ๐’ฎ^2subscript^๐’ฎ2\widehat{\mathcal{S}}_{2}over^ start_ARG caligraphic_S end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. We obtain that the stop** time is ฯ„k(๐ฉ)โข(๐ฌ)=4โขkโˆ’3โขkโขp12superscriptsubscript๐œ๐‘˜๐ฉ๐ฌ4๐‘˜3๐‘˜subscript๐‘12\tau_{k}^{({\bf p})}(\operatorname*{\bf s})=4k-\frac{3kp_{1}}{2}italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT ( bold_s ) = 4 italic_k - divide start_ARG 3 italic_k italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG and that the competitive ratio for this set is ฯ„kโˆ—โข(๐ฌ)ฯ„k(๐ฉ)โข(๐ฌ)=7/44โขkโˆ’3โขkโขp12superscriptsubscript๐œ๐‘˜โˆ—๐ฌsuperscriptsubscript๐œ๐‘˜๐ฉ๐ฌ744๐‘˜3๐‘˜subscript๐‘12\frac{\tau_{k}^{\ast}(\operatorname*{\bf s})}{\tau_{k}^{({\bf p})}(% \operatorname*{\bf s})}=\frac{7/4}{4k-\frac{3kp_{1}}{2}}divide start_ARG italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT ( bold_s ) end_ARG start_ARG italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT ( bold_s ) end_ARG = divide start_ARG 7 / 4 end_ARG start_ARG 4 italic_k - divide start_ARG 3 italic_k italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG end_ARG.

For the set ๐ฌโˆˆ๐’ฎ^1๐ฌsubscript^๐’ฎ1\operatorname*{\bf s}\in\widehat{\mathcal{S}}_{1}bold_s โˆˆ over^ start_ARG caligraphic_S end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, we know that the stop** time is greater than k๐‘˜kitalic_k but we need to consider different cases depending on whether the stop** time is smaller or greater than 7โขk/47๐‘˜47k/47 italic_k / 4. Recall that there are two sequences in ๐’ฎ^1=1kโขsโˆžsubscript^๐’ฎ1superscript1๐‘˜superscript๐‘ \widehat{\mathcal{S}}_{1}=1^{k}s^{\infty}over^ start_ARG caligraphic_S end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 1 start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_s start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT and therefore we have four cases. We start by computing the stop** time of any sequence in ๐’ฎ^1subscript^๐’ฎ1\widehat{\mathcal{S}}_{1}over^ start_ARG caligraphic_S end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT such that ฯ„k(๐ฉ)โข(๐ฌ)โ‰ฅ7โขk4superscriptsubscript๐œ๐‘˜๐ฉ๐ฌ7๐‘˜4\tau_{k}^{({\bf p})}(\operatorname*{\bf s})\geq\frac{7k}{4}italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT ( bold_s ) โ‰ฅ divide start_ARG 7 italic_k end_ARG start_ARG 4 end_ARG:

k๐‘˜\displaystyle kitalic_k =(1โˆ’p1)โขk+3โขk4โขRโข(s,p2)+13โข(ฯ„k(๐ฉ)โข(๐ฌ)โˆ’7โขk4),absent1subscript๐‘1๐‘˜3๐‘˜4๐‘…๐‘ subscript๐‘213superscriptsubscript๐œ๐‘˜๐ฉ๐ฌ7๐‘˜4\displaystyle=(1-p_{1})k+\frac{3k}{4}R(s,p_{2})+\frac{1}{3}\left(\tau_{k}^{({% \bf p})}(\operatorname*{\bf s})-\frac{7k}{4}\right),= ( 1 - italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) italic_k + divide start_ARG 3 italic_k end_ARG start_ARG 4 end_ARG italic_R ( italic_s , italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) + divide start_ARG 1 end_ARG start_ARG 3 end_ARG ( italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT ( bold_s ) - divide start_ARG 7 italic_k end_ARG start_ARG 4 end_ARG ) , (21)

where Rโข(1,p2)=1โˆ’p2๐‘…1subscript๐‘21subscript๐‘2R(1,p_{2})=1-p_{2}italic_R ( 1 , italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) = 1 - italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and Rโข(2,p2)=p2/2๐‘…2subscript๐‘2subscript๐‘22R(2,p_{2})=p_{2}/2italic_R ( 2 , italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) = italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT / 2 depending on whether the sequence in ๐’ฎ^1subscript^๐’ฎ1\widehat{\mathcal{S}}_{1}over^ start_ARG caligraphic_S end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ends with ๐ฌโˆž=1โˆžsuperscript๐ฌsuperscript1\operatorname*{\bf s}^{\infty}=1^{\infty}bold_s start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT = 1 start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT or ๐ฌโˆž=2โˆžsuperscript๐ฌsuperscript2\operatorname*{\bf s}^{\infty}=2^{\infty}bold_s start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT = 2 start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT. Simplifying the equation, we obtain that the stop** time is ฯ„k(๐ฉ)โข(๐ฌ)=7โขk4+3โขkโขp1โˆ’9โขk4โขRโข(s,p2)superscriptsubscript๐œ๐‘˜๐ฉ๐ฌ7๐‘˜43๐‘˜subscript๐‘19๐‘˜4๐‘…๐‘ subscript๐‘2\tau_{k}^{({\bf p})}(\operatorname*{\bf s})=\frac{7k}{4}+3kp_{1}-\frac{9k}{4}R% (s,p_{2})italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT ( bold_s ) = divide start_ARG 7 italic_k end_ARG start_ARG 4 end_ARG + 3 italic_k italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - divide start_ARG 9 italic_k end_ARG start_ARG 4 end_ARG italic_R ( italic_s , italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) subject to input distributions that satisfy 3โขp1โˆ’94โขRโข(s,p2)โ‰ฅ03subscript๐‘194๐‘…๐‘ subscript๐‘203p_{1}-\frac{9}{4}R(s,p_{2})\geq 03 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - divide start_ARG 9 end_ARG start_ARG 4 end_ARG italic_R ( italic_s , italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) โ‰ฅ 0 (or equivalently, p1Rโข(s,p2)โ‰ฅ34subscript๐‘1๐‘…๐‘ subscript๐‘234\frac{p_{1}}{R(s,p_{2})}\geq\frac{3}{4}divide start_ARG italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_R ( italic_s , italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG โ‰ฅ divide start_ARG 3 end_ARG start_ARG 4 end_ARG). Moreover, the competitive ratio is 174+3โขp1โˆ’94โขRโข(s,p2)1743subscript๐‘194๐‘…๐‘ subscript๐‘2\frac{1}{\frac{7}{4}+3p_{1}-\frac{9}{4}R(s,p_{2})}divide start_ARG 1 end_ARG start_ARG divide start_ARG 7 end_ARG start_ARG 4 end_ARG + 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - divide start_ARG 9 end_ARG start_ARG 4 end_ARG italic_R ( italic_s , italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG.

We next consider the stop** time of sequences in ๐’ฎ^1subscript^๐’ฎ1\widehat{\mathcal{S}}_{1}over^ start_ARG caligraphic_S end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT with ฯ„k(๐ฉ)โข(๐ฌ)โ‰ค7โขk4superscriptsubscript๐œ๐‘˜๐ฉ๐ฌ7๐‘˜4\tau_{k}^{({\bf p})}(\operatorname*{\bf s})\leq\frac{7k}{4}italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT ( bold_s ) โ‰ค divide start_ARG 7 italic_k end_ARG start_ARG 4 end_ARG:

k๐‘˜\displaystyle kitalic_k =(1โˆ’p1)โขk+Rโข(s,p2)โข(ฯ„k(๐ฉ)โข(๐ฌ)โˆ’k),absent1subscript๐‘1๐‘˜๐‘…๐‘ subscript๐‘2superscriptsubscript๐œ๐‘˜๐ฉ๐ฌ๐‘˜\displaystyle=(1-p_{1})k+R(s,p_{2})\left(\tau_{k}^{({\bf p})}(\operatorname*{% \bf s})-k\right),= ( 1 - italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) italic_k + italic_R ( italic_s , italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ( italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT ( bold_s ) - italic_k ) , (22)

which provides with the stop** time ฯ„k(๐ฉ)โข(๐ฌ)=k+p1โขkRโข(s,p2)superscriptsubscript๐œ๐‘˜๐ฉ๐ฌ๐‘˜subscript๐‘1๐‘˜๐‘…๐‘ subscript๐‘2\tau_{k}^{({\bf p})}(\operatorname*{\bf s})=k+\frac{p_{1}k}{R(s,p_{2})}italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( bold_p ) end_POSTSUPERSCRIPT ( bold_s ) = italic_k + divide start_ARG italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_k end_ARG start_ARG italic_R ( italic_s , italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG, subject to input distributions that satisfy p1Rโข(s,p2)โ‰ค34subscript๐‘1๐‘…๐‘ subscript๐‘234\frac{p_{1}}{R(s,p_{2})}\leq\frac{3}{4}divide start_ARG italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_R ( italic_s , italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG โ‰ค divide start_ARG 3 end_ARG start_ARG 4 end_ARG.

To determine the competitive ratio for ๐’ฎ^1subscript^๐’ฎ1\widehat{\mathcal{S}}_{1}over^ start_ARG caligraphic_S end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, we analyze four different cases.

  1. 1.

    Case A: ฯ„kโข(๐ฌ)โ‰ฅ7/4subscript๐œ๐‘˜๐ฌ74\tau_{k}(\operatorname*{\bf s})\geq 7/4italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) โ‰ฅ 7 / 4 for both sequences in ๐’ฎ^1subscript^๐’ฎ1\widehat{\mathcal{S}}_{1}over^ start_ARG caligraphic_S end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT.

  2. 2.

    Case B: For ๐ฌโˆž=1โˆžsuperscript๐ฌsuperscript1\operatorname*{\bf s}^{\infty}=1^{\infty}bold_s start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT = 1 start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT, we have ฯ„kโข(๐ฌ)โ‰ฅ7/4subscript๐œ๐‘˜๐ฌ74\tau_{k}(\operatorname*{\bf s})\geq 7/4italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) โ‰ฅ 7 / 4, while for ๐ฌโˆž=2โˆžsuperscript๐ฌsuperscript2\operatorname*{\bf s}^{\infty}=2^{\infty}bold_s start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT = 2 start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT we have ฯ„kโข(๐ฌ)โ‰ค7/4subscript๐œ๐‘˜๐ฌ74\tau_{k}(\operatorname*{\bf s})\leq 7/4italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) โ‰ค 7 / 4.

  3. 3.

    Case C: ฯ„kโข(๐ฌ)โ‰ค7/4subscript๐œ๐‘˜๐ฌ74\tau_{k}(\operatorname*{\bf s})\leq 7/4italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) โ‰ค 7 / 4 for both sequences in ๐’ฎ^1subscript^๐’ฎ1\widehat{\mathcal{S}}_{1}over^ start_ARG caligraphic_S end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT.

  4. 4.

    Case D: For ๐ฌโˆž=1โˆžsuperscript๐ฌsuperscript1\operatorname*{\bf s}^{\infty}=1^{\infty}bold_s start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT = 1 start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT, we have ฯ„kโข(๐ฌ)โ‰ค7/4subscript๐œ๐‘˜๐ฌ74\tau_{k}(\operatorname*{\bf s})\leq 7/4italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) โ‰ค 7 / 4, while for ๐ฌโˆž=2โˆžsuperscript๐ฌsuperscript2\operatorname*{\bf s}^{\infty}=2^{\infty}bold_s start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT = 2 start_POSTSUPERSCRIPT โˆž end_POSTSUPERSCRIPT we have ฯ„kโข(๐ฌ)โ‰ฅ7/4subscript๐œ๐‘˜๐ฌ74\tau_{k}(\operatorname*{\bf s})\geq 7/4italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) โ‰ฅ 7 / 4.

One can derive the conditions on p1,p2subscript๐‘1subscript๐‘2p_{1},p_{2}italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT that specify each of the above cases. For example, Case A is valid only if {3โขp1โˆ’94โข(1โˆ’p2)โ‰ฅ0}โˆฉ{3โขp1โˆ’94โขp22โ‰ฅ0}3subscript๐‘1941subscript๐‘203subscript๐‘194subscript๐‘220\{3p_{1}-\frac{9}{4}(1-p_{2})\geq 0\}\cap\{3p_{1}-\frac{9}{4}\frac{p_{2}}{2}% \geq 0\}{ 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - divide start_ARG 9 end_ARG start_ARG 4 end_ARG ( 1 - italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) โ‰ฅ 0 } โˆฉ { 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - divide start_ARG 9 end_ARG start_ARG 4 end_ARG divide start_ARG italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG โ‰ฅ 0 }. We omit the derivation of the other cases and proceed to directly compute an upper bound on the competitive ratio conditioned on each case. The upper bound on the overall competitive ratio is the largest upper bound among the different cases and will be shown to be upper bounded by 0.50.50.50.5.

For Case A, we have that the optimized competitive ratio is

maxp1โ‰ฅ1/4โกmax1โˆ’4โขp13โ‰คp2โ‰ค8โขp13subscriptsubscript๐‘114subscript14subscript๐‘13subscript๐‘28subscript๐‘13\displaystyle\max_{p_{1}\geq 1/4}\max_{1-\frac{4p_{1}}{3}\leq p_{2}\leq\frac{8% p_{1}}{3}}roman_max start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT โ‰ฅ 1 / 4 end_POSTSUBSCRIPT roman_max start_POSTSUBSCRIPT 1 - divide start_ARG 4 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 3 end_ARG โ‰ค italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT โ‰ค divide start_ARG 8 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 3 end_ARG end_POSTSUBSCRIPT
minโก{174+3โขp1โˆ’94โขminsโก{Rโข(s,p2)},744โˆ’3โขp1/2}1743subscript๐‘194subscript๐‘ ๐‘…๐‘ subscript๐‘27443subscript๐‘12\displaystyle\min\left\{\frac{1}{\frac{7}{4}+3p_{1}-\frac{9}{4}\min_{s}\{R(s,p% _{2})\}},\frac{\frac{7}{4}}{4-3p_{1}/2}\right\}roman_min { divide start_ARG 1 end_ARG start_ARG divide start_ARG 7 end_ARG start_ARG 4 end_ARG + 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - divide start_ARG 9 end_ARG start_ARG 4 end_ARG roman_min start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT { italic_R ( italic_s , italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) } end_ARG , divide start_ARG divide start_ARG 7 end_ARG start_ARG 4 end_ARG end_ARG start_ARG 4 - 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / 2 end_ARG }
=maxp1โกminโก{11+3โขp1,744โˆ’3โขp1/2}โขabsentsubscriptsubscript๐‘1113subscript๐‘17443subscript๐‘12absent\displaystyle=\max_{p_{1}}\min\left\{\frac{1}{1+3p_{1}},\frac{\frac{7}{4}}{4-3% p_{1}/2}\right\}\operatorname*{}= roman_max start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_min { divide start_ARG 1 end_ARG start_ARG 1 + 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG , divide start_ARG divide start_ARG 7 end_ARG start_ARG 4 end_ARG end_ARG start_ARG 4 - 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / 2 end_ARG } start_OPERATOR end_OPERATOR
=1/2,absent12\displaystyle=1/2,= 1 / 2 , (23)

where the first equality follows by the fact that the second term does not depend on p2subscript๐‘2p_{2}italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT; thus, optimizing over the first expression yields p2โˆ—=2/3subscriptsuperscript๐‘223p^{*}_{2}=2/3italic_p start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 2 / 3. The second equality is obtained by comparing the two terms (which are equal when p1โˆ—=1/3superscriptsubscript๐‘113p_{1}^{*}=1/3italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT = 1 / 3). Indeed, the optimal point here lies in the feasible region so we have equalities in both steps. We now turn to Cases B-D, and show, as well, that the induced upper bound is less or equal than 1/2121/21 / 2.

For Case B, we have ฯ„kโข(๐ฌ)โ‰ฅ74subscript๐œ๐‘˜๐ฌ74\tau_{k}(\operatorname*{\bf s})\geq\frac{7}{4}italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) โ‰ฅ divide start_ARG 7 end_ARG start_ARG 4 end_ARG for s=1๐‘ 1s=1italic_s = 1 but ฯ„kโข(๐ฌ)โ‰ค74subscript๐œ๐‘˜๐ฌ74\tau_{k}(\operatorname*{\bf s})\leq\frac{7}{4}italic_ฯ„ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_s ) โ‰ค divide start_ARG 7 end_ARG start_ARG 4 end_ARG for s=2๐‘ 2s=2italic_s = 2. Combining these competitive ratios with S^2subscript^๐‘†2\widehat{S}_{2}over^ start_ARG italic_S end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT yields

maxp1โ‰ค38โกmaxmaxโก{1โˆ’4โขp13,8โขp13}โ‰คp2subscriptsubscript๐‘138subscript14subscript๐‘138subscript๐‘13subscript๐‘2\displaystyle\max_{p_{1}\leq\frac{3}{8}}\max_{\max\{1-\frac{4p_{1}}{3},\frac{8% p_{1}}{3}\}\leq p_{2}}roman_max start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT โ‰ค divide start_ARG 3 end_ARG start_ARG 8 end_ARG end_POSTSUBSCRIPT roman_max start_POSTSUBSCRIPT roman_max { 1 - divide start_ARG 4 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 3 end_ARG , divide start_ARG 8 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 3 end_ARG } โ‰ค italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT
minโก{174+3โขp1โˆ’94โข(1โˆ’p2),11+2โขp1p2,744โˆ’3โขp1/2}1743subscript๐‘1941subscript๐‘2112subscript๐‘1subscript๐‘27443subscript๐‘12\displaystyle\min\left\{\frac{1}{\frac{7}{4}+3p_{1}-\frac{9}{4}(1-p_{2})},% \frac{1}{1+\frac{2p_{1}}{p_{2}}},\frac{\frac{7}{4}}{4-3p_{1}/2}\right\}roman_min { divide start_ARG 1 end_ARG start_ARG divide start_ARG 7 end_ARG start_ARG 4 end_ARG + 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - divide start_ARG 9 end_ARG start_ARG 4 end_ARG ( 1 - italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG , divide start_ARG 1 end_ARG start_ARG 1 + divide start_ARG 2 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG end_ARG , divide start_ARG divide start_ARG 7 end_ARG start_ARG 4 end_ARG end_ARG start_ARG 4 - 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / 2 end_ARG }
โ‰ค(a)maxp1โ‰ค38โกminโก{11+3โขp1,744โˆ’3โขp1/2}superscript๐‘Žabsentsubscriptsubscript๐‘138113subscript๐‘17443subscript๐‘12\displaystyle\stackrel{{\scriptstyle(a)}}{{\leq}}\max_{p_{1}\leq\frac{3}{8}}% \min\left\{\frac{1}{1+3p_{1}},\frac{\frac{7}{4}}{4-3p_{1}/2}\right\}start_RELOP SUPERSCRIPTOP start_ARG โ‰ค end_ARG start_ARG ( italic_a ) end_ARG end_RELOP roman_max start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT โ‰ค divide start_ARG 3 end_ARG start_ARG 8 end_ARG end_POSTSUBSCRIPT roman_min { divide start_ARG 1 end_ARG start_ARG 1 + 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG , divide start_ARG divide start_ARG 7 end_ARG start_ARG 4 end_ARG end_ARG start_ARG 4 - 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / 2 end_ARG }
โ‰ค(b)0.5,superscript๐‘absent0.5\displaystyle\stackrel{{\scriptstyle(b)}}{{\leq}}0.5,start_RELOP SUPERSCRIPTOP start_ARG โ‰ค end_ARG start_ARG ( italic_b ) end_ARG end_RELOP 0.5 , (24)

where (a)๐‘Ž(a)( italic_a ) follows by comparing the two terms that depend on p2subscript๐‘2p_{2}italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. These two terms are equal when p2=2/3subscript๐‘223p_{2}=2/3italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 2 / 3 or โˆ’4โขp1/34subscript๐‘13-4p_{1}/3- 4 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / 3, and it is clear that p2=โˆ’4โขp1/3subscript๐‘24subscript๐‘13p_{2}=-4p_{1}/3italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = - 4 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / 3 is not feasible so we choose p2=2/3subscript๐‘223p_{2}=2/3italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 2 / 3. Note that (a)๐‘Ž(a)( italic_a ) is an upper bound (and not equality) since p2=2/3subscript๐‘223p_{2}=2/3italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 2 / 3 may not be feasible depending on the value of p1subscript๐‘1p_{1}italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Step (b)๐‘(b)( italic_b ) follows from (C) by ignoring the constraint on p1subscript๐‘1p_{1}italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT.

For Case C, we have

maxp1โ‰ค14โกmax8โขp13โ‰คp2โ‰ค1โˆ’4โขp13โกminโก{11+p1Rโข(s,p2),744โˆ’3โขp1/2}subscriptsubscript๐‘114subscript8subscript๐‘13subscript๐‘214subscript๐‘1311subscript๐‘1๐‘…๐‘ subscript๐‘27443subscript๐‘12\displaystyle\max_{p_{1}\leq\frac{1}{4}}\max_{\frac{8p_{1}}{3}\leq p_{2}\leq 1% -\frac{4p_{1}}{3}}\min\left\{\frac{1}{1+\frac{p_{1}}{R(s,p_{2})}},\frac{\frac{% 7}{4}}{4-3p_{1}/2}\right\}roman_max start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT โ‰ค divide start_ARG 1 end_ARG start_ARG 4 end_ARG end_POSTSUBSCRIPT roman_max start_POSTSUBSCRIPT divide start_ARG 8 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 3 end_ARG โ‰ค italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT โ‰ค 1 - divide start_ARG 4 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 3 end_ARG end_POSTSUBSCRIPT roman_min { divide start_ARG 1 end_ARG start_ARG 1 + divide start_ARG italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_R ( italic_s , italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG end_ARG , divide start_ARG divide start_ARG 7 end_ARG start_ARG 4 end_ARG end_ARG start_ARG 4 - 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / 2 end_ARG }
โ‰ค0.5.absent0.5\displaystyle\leq 0.5.โ‰ค 0.5 . (25)

The upper bound is simple since p2=2/3subscript๐‘223p_{2}=2/3italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 2 / 3 that equates both terms depending on p2subscript๐‘2p_{2}italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT lies in the feasible region. We then maximize over p1โˆˆ[0,1]subscript๐‘101p_{1}\in[0,1]italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT โˆˆ [ 0 , 1 ] that gives 1/2121/21 / 2 as in Case A.

For last case, Case D, the optimized competitive ratio is

maxp1โ‰ฅ34โกmaxp2โ‰คminโก{1โˆ’4โขp13,8โขp13}subscriptsubscript๐‘134subscriptsubscript๐‘214subscript๐‘138subscript๐‘13\displaystyle\max_{p_{1}\geq\frac{3}{4}}\max_{p_{2}\leq\min\{1-\frac{4p_{1}}{3% },\frac{8p_{1}}{3}\}}roman_max start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT โ‰ฅ divide start_ARG 3 end_ARG start_ARG 4 end_ARG end_POSTSUBSCRIPT roman_max start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT โ‰ค roman_min { 1 - divide start_ARG 4 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 3 end_ARG , divide start_ARG 8 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 3 end_ARG } end_POSTSUBSCRIPT
minโก{174+3โขp1โˆ’98โขp2,11+p11โˆ’p2,744โˆ’3โขp1/2},1743subscript๐‘198subscript๐‘211subscript๐‘11subscript๐‘27443subscript๐‘12\displaystyle\min\left\{\frac{1}{\frac{7}{4}+3p_{1}-\frac{9}{8}p_{2}},\frac{1}% {1+\frac{p_{1}}{1-p_{2}}},\frac{\frac{7}{4}}{4-3p_{1}/2}\right\},roman_min { divide start_ARG 1 end_ARG start_ARG divide start_ARG 7 end_ARG start_ARG 4 end_ARG + 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - divide start_ARG 9 end_ARG start_ARG 8 end_ARG italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG , divide start_ARG 1 end_ARG start_ARG 1 + divide start_ARG italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG end_ARG , divide start_ARG divide start_ARG 7 end_ARG start_ARG 4 end_ARG end_ARG start_ARG 4 - 3 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / 2 end_ARG } , (26)

and comparing the terms depending on p2subscript๐‘2p_{2}italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT provides p2โˆ—โˆˆ{2/3,1+8โขp1/3}subscriptsuperscript๐‘22318subscript๐‘13p^{*}_{2}\in\{2/3,1+8p_{1}/3\}italic_p start_POSTSUPERSCRIPT โˆ— end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT โˆˆ { 2 / 3 , 1 + 8 italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / 3 }. The only feasible point is p2=2/3subscript๐‘223p_{2}=2/3italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 2 / 3 which simplifies the optimization to be as in Case A. We note that the optimal unconstrained solution does not lie in the feasible region and therefore 0.50.50.50.5 is a strict upper bound.

Combining the different cases, we conclude that the competitive ratio is upper bounded by the maximum among these bounds and is thus equal to 0.50.50.50.5 as asserted.