Coded Kalman Filtering over MIMO Gaussian Channels with Feedback

Barron Han, Victoria Kostina, Babak Hassibi Department of Electrical Engineering
Caltech
Pasadena, CA, USA
Email: {bshan, vkostina, hassibi}@caltech.edu
   Oron Sabag School of Computer Science and Engineering
The Hebrew University of Jerusalem
Jerusalem, Israel
Email: [email protected]
Abstract

We consider the problem of remotely stabilizing a linear dynamical system. In this setting, a sensor co-located with the system communicates the system’s state to a controller over a noisy communication channel with feedback. The objective of the controller (decoder) is to use the channel outputs to estimate the vector state with finite zero-delay mean squared error (MSE) at the infinite horizon. It has been shown in [1] that for a vector Gauss-Markov source and either a single-input multiple-output (SIMO) or a multiple-input single-output (MISO) channel, linear codes require the minimum capacity to achieve finite MSE. This paper considers the more general problem of linear zero-delay joint-source channel coding (JSCC) of a vector-valued source over a multiple-input multiple-output (MIMO) Gaussian channel with feedback. We study sufficient and necessary conditions for linear codes to achieve finite MSE. For sufficiency, we introduce a coding scheme where each unstable source mode is allocated to a single channel for estimation. Our proof for the necessity of this scheme relies on a matrix-algebraic conjecture that we prove to be true if either the source or channel is scalar. We show that linear codes achieve finite MSE for a scalar source over a MIMO channel if and only if the best scalar sub-channel can achieve finite MSE. Finally, we provide a new counter-example demonstrating that linear codes are generally sub-optimal for coding over MIMO channels.

I Introduction

Controlling an unstable plant over a noisy communication channel is a hurdle for emerging technologies such as autonomous vehicles, Internet of Things devices, and remote surgery systems. This problem setting deviates from Shannon’s communication problem in two ways that make it more challenging [2]. First, in the control setting, the data to be transmitted correspond to physical measurements and arrive in a streaming fashion instead of being made available in its entirety before transmission. Thus, we must design causal encoders and decoders for this task. Second, typical control systems are unstable, and their stabilization requires near-instantaneous and accurate estimates of the plant’s state to produce effective control actions. Consequently, codes must be low-delay yet highly reliable to perform control tasks over communication channels. We employ a class of low-delay joint-source channel codes to address the two objectives.

In a seminal paper, Sahai and Mitter [3] proved that Shannon’s channel capacity is an insufficient characterization of channel quality when the goal is to stabilize a system over a channel. They introduced the notion of anytime capacity, which is in general upper-bounded by channel capacity, as an alternative measure. While the anytime capacity of a channel provides a useful converse on the channel quality required to stabilize a specific system, achievability schemes are generally open. A class of tree codes such as those studied by Schulman [4] achieve error probabilities that decay exponentially with the delay since a source symbol was emitted. While tree codes exist for a large class of discrete channels, they are only known to be efficiently decodable in limited settings [5].

A noiseless feedback channel connecting the decoder back to the encoder does not improve the Shannon capacity of the channel [6], but feedback can significantly simplify code design and improve the reliability-delay trade-offs for communication [7, 8]. Noiseless feedback channels are reasonable assumptions when the receiver has access to more power than the transmitter, which is often the case in control systems since the controller must provide essentially noiseless control inputs. Coding for bit-streaming sources over discrete channels with feedback, which is relevant to control systems where the state has been quantized for digital transmission, has been studied in [9, 5, 10]. This paper considers a setting where measurement and coding are analog operations applied in discrete time.

Consider the problem of estimating a vector-valued plant, modeled as a Gauss-Markov source, over a multiple-input multiple-output (MIMO) additive white Gaussian noise (AWGN) channel with feedback. The causal rate-distortion function [11] provides a lower bound to the channel capacity necessary for causally estimating the source subject to a given distortion over this channel [12]. The causal rate-distortion functions for both scalar and vector Gauss-Markov sources have been studied in [11] and [13] respectively. The lower bound to channel capacity provided by the causal rate-distortion function is known to be tight only when the source is matched to the channel at hand [14]. For example, a scalar Gauss-Markov source is matched to the scalar AWGN channel [11, 3].

When the criterion is finite MSE, a known converse result is that the Shannon capacity should be greater than the sum of logs of unstable eigenvalues of the source [15, Thm. 4.1]. In the case of a scalar source and a scalar channel, this bound is tight and can be achieved by a linear innovations’ encoder [16, 3, 17, 18]. For a vector source and parallel Gaussian channels with independent power constraints, [19] also proposes a periodic linear scheme leading to sufficient conditions for achieving finite MSE. The case of a vector source with scalar channel was studied in [1], which showed that the Shannon capacity remains a necessary and sufficient measure even though the source and channel dimensions are not matched.

This paper considers the general case of vector source and MIMO channels. We first focus on the fundamental limits of achieving finite MSE using linear time-invariant codes. The innovations’ encoder that generates channel inputs as a function of the source estimation error (at the decoder) is optimal for this general problem [1]. The sequential encoder’s structure implies that the optimal decoder is a Kalman filter and its MSE can be analyzed with linear estimation theory.

Our first result is a sufficient condition to achieve finite MSE by partitioning the vector source to different sub-channels. The analysis is carried out by showing an equivalence between achieving finite MSE and the existence of a stabilizing solution to a DARE equation. The sufficient condition (achievability) is then shown to be necessary for two cases including the scenario of a scalar and a MIMO channel. In particular, it is shown that allocating the entire power to the best sub-channel is optimal, while typical water-filling solutions that distribute the power among the sub-channels are sub-optimal. Indeed, this example reveals that the Shannon channel capacity is not the figure of merit if the objective is finite MSE with linear codes. Motivated by this result, we define the linear stabilizing capacity (LSC) as an optimization problem, which is in general a lower bound to the channel capacity. Finite MSE is achievable using linear codes if and only if there exists a feasible solution. The optimization of the LSC is non-convex, but we are able to utilize it to show that linear codes are not optimal by comparing the LSC with rates that can be achieved using non-linear Shannon-Kotel’nikov map**s for a specific source-channel pair. The general case of our problem remains open but, based on numerical observation, we conjecture that the partitioning property is necessary. The equivalence to the DARE feasibility allows us to extract an algebraic condition that if it holds true then partitioning schemes achieve the fundamental limits.

The paper is organized as follows. Section II specifies the source and channel models and defines zero-delay joint source-channel codes with an MSE performance criterion. Section III presents an optimal linear code structure and applies it to the MIMO channel setting. It also defines the linear stabilizing capacity. Section IV presents our main contributions on the sufficient and necessary conditions for finite estimation error of a vector source over a MIMO Gaussian channel using linear codes and demonstrates that, in general, linear coding is not optimal.

II Problem Setup

II-A Notation

We denote by {𝐗t}t=0Tsuperscriptsubscriptsubscript𝐗𝑡𝑡0𝑇\{\mathbf{X}_{t}\}_{t=0}^{T}{ bold_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT a discrete time random process and we denote the vector Xt{x0,x1,,xt}superscript𝑋𝑡subscript𝑥0subscript𝑥1subscript𝑥𝑡X^{t}\triangleq\{x_{0},x_{1},\ldots,x_{t}\}italic_X start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ≜ { italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT }. We write 𝐗𝒩(μ,Σ)similar-to𝐗𝒩𝜇Σ\mathbf{X}\sim\mathcal{N}(\mu,\Sigma)bold_X ∼ caligraphic_N ( italic_μ , roman_Σ ) to say that the random vector 𝐗𝐗\mathbf{X}bold_X has a Gaussian distribution with mean 𝔼[𝐗]=μ𝔼delimited-[]𝐗𝜇\mathbb{E}[\mathbf{X}]=\mublackboard_E [ bold_X ] = italic_μ and covariance matrix Cov[𝐗]=ΣCovdelimited-[]𝐗Σ\mathrm{Cov}[\mathbf{X}]=\Sigmaroman_Cov [ bold_X ] = roman_Σ. Matrices and vectors are denoted with uppercase letters, while scalars are denoted with lowercase mathematical font. Sets are denoted using the calligraphic font.

II-B System Model

The setup is depicted in Figure 1. We define its main components: a MIMO AWGN channel, a Gauss-Markov streaming source, and a zero-delay code.

Definition 1 (MIMO AWGN Channel)

The channel accepts a vector input 𝐗tnsubscript𝐗𝑡superscript𝑛\mathbf{X}_{t}\in\mathbb{R}^{n}bold_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and produces a vector output 𝐘tmsubscript𝐘𝑡superscript𝑚\mathbf{Y}_{t}\in\mathbb{R}^{m}bold_Y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT,

𝐘t=H𝐗t+𝐙t,t1.formulae-sequencesubscript𝐘𝑡𝐻subscript𝐗𝑡subscript𝐙𝑡𝑡1\mathbf{Y}_{t}=H\mathbf{X}_{t}+\mathbf{Z}_{t},\ t\geq 1.bold_Y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_H bold_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + bold_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ≥ 1 . (1)

Hm×n𝐻superscript𝑚𝑛H\in\mathbb{R}^{m\times n}italic_H ∈ blackboard_R start_POSTSUPERSCRIPT italic_m × italic_n end_POSTSUPERSCRIPT is a deterministic, fixed channel gain matrix and the noise is 𝐙ti.i.d.𝒩(0,R).\mathbf{Z}_{t}\stackrel{{\scriptstyle i.i.d.}}{{\sim}}\mathcal{N}(0,R).bold_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_RELOP SUPERSCRIPTOP start_ARG ∼ end_ARG start_ARG italic_i . italic_i . italic_d . end_ARG end_RELOP caligraphic_N ( 0 , italic_R ) .

We consider a channel with a diagonal channel matrix, H=diag{h1,,hn}𝐻diagsubscript1subscript𝑛H=\mathrm{diag}\{h_{1},\ldots,h_{n}\}italic_H = roman_diag { italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT }, and identity noise covariance, 𝐙t𝒩(0,I)similar-tosubscript𝐙𝑡𝒩0𝐼\mathbf{Z}_{t}\sim\mathcal{N}(0,I)bold_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∼ caligraphic_N ( 0 , italic_I ). This is without loss of generality as any channel can be diagonalized to this form [20, Thm. 9.1].

The streaming source in Figure 1 is a Gauss-Markov source.

Definition 2 (Gauss-Markov source)

The Gauss-Markov source evolves according to the linear dynamical system:

𝐒t+1=A𝐒t+𝐖t,subscript𝐒𝑡1𝐴subscript𝐒𝑡subscript𝐖𝑡\mathbf{S}_{t+1}=A\mathbf{S}_{t}+\mathbf{W}_{t},bold_S start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT = italic_A bold_S start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + bold_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , (2)

where Ak×k𝐴superscript𝑘𝑘A\in\mathbb{R}^{k\times k}italic_A ∈ blackboard_R start_POSTSUPERSCRIPT italic_k × italic_k end_POSTSUPERSCRIPT, 𝐖ti.i.d.𝒩(0,Q)\mathbf{W}_{t}\stackrel{{\scriptstyle i.i.d.}}{{\sim}}\mathcal{N}(0,Q)bold_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_RELOP SUPERSCRIPTOP start_ARG ∼ end_ARG start_ARG italic_i . italic_i . italic_d . end_ARG end_RELOP caligraphic_N ( 0 , italic_Q ) and the initial state is 𝐒0𝒩(0,Q)similar-tosubscript𝐒0𝒩0𝑄\mathbf{S}_{0}\sim\mathcal{N}(0,Q)bold_S start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∼ caligraphic_N ( 0 , italic_Q ).

Without loss of generality, we assume that the matrix A𝐴Aitalic_A can be written as

A=[As00Au],𝐴delimited-[]subscript𝐴𝑠00subscript𝐴𝑢A=\left[\begin{array}[]{cc}A_{s}&0\\ 0&A_{u}\end{array}\right],italic_A = [ start_ARRAY start_ROW start_CELL italic_A start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL italic_A start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] , (3)

where Assubscript𝐴𝑠A_{s}italic_A start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT is stable (eigenvalues on and inside unit circle), Ausubscript𝐴𝑢A_{u}italic_A start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT is strictly unstable (eigenvalues outside unit circle) and both As,Ausubscript𝐴𝑠subscript𝐴𝑢A_{s},A_{u}italic_A start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT , italic_A start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT are in Jordan form.

We make the following assumptions about our system:

Assumption 1

The pair (A,Q)𝐴𝑄(A,Q)( italic_A , italic_Q ) is controllable.

Assumption 2

The source is strictly unstable, so As=0subscript𝐴𝑠0A_{s}=0italic_A start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = 0, and Ausubscript𝐴𝑢A_{u}italic_A start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT in (3) has distinct eigenvalues.

Assumption 1 guarantees that the error covariance is positive definite [21, Apx. C]. Assumption 2 yields a cleaner analysis since we can take A𝐴Aitalic_A to be diagonal without loss of generality. Our results hold in the general case where Ausubscript𝐴𝑢A_{u}italic_A start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT is in Jordan block form. Assuming that the source is unstable does not lose generality since the stable part of the source can have a finite estimation error even if no communication is allowed. We limit the source to being strictly unstable so that there are no eigenvalues on the unit circle and a Lyapunov equation of the form X=AXAT+W𝑋𝐴𝑋superscript𝐴𝑇𝑊X=AXA^{T}+Witalic_X = italic_A italic_X italic_A start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT + italic_W has a unique solution [21, Lem. D.1.1]. This assumption is common in classical linear estimation theory [21, Apx. C].

Definition 3 (A zero-delay joint-source channel feedback code)

The feedback code for the source-channel pair in Definitions 1 and 2 consists of the following:

  1. 1.

    An encoder that at time t𝑡titalic_t has access to 𝐒tsuperscript𝐒𝑡\mathbf{S}^{t}bold_S start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT and 𝐘t1superscript𝐘𝑡1\mathbf{Y}^{t-1}bold_Y start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT and generates

    𝐗t=ft(𝐒t,𝐘t1),t1formulae-sequencesubscript𝐗𝑡subscript𝑓𝑡superscript𝐒𝑡superscript𝐘𝑡1𝑡1\mathbf{X}_{t}=f_{t}(\mathbf{S}^{t},\mathbf{Y}^{t-1}),\ t\geq 1bold_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_S start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT , bold_Y start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT ) , italic_t ≥ 1 (4)

    where ft:𝒮t×𝒴t1n:subscript𝑓𝑡maps-tosuperscript𝒮𝑡superscript𝒴𝑡1superscript𝑛f_{t}\colon\mathcal{S}^{t}\times\mathcal{Y}^{t-1}\mapsto\mathbb{R}^{n}italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT : caligraphic_S start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT × caligraphic_Y start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT ↦ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and 𝒮=k,𝒴=mformulae-sequence𝒮superscript𝑘𝒴superscript𝑚\mathcal{S}=\mathbb{R}^{k},\mathcal{Y}=\mathbb{R}^{m}caligraphic_S = blackboard_R start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT , caligraphic_Y = blackboard_R start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT. The channel inputs must satisfy an average power constraint,

    1Tt=1T1𝔼[𝐗tT𝐗t]p,1𝑇superscriptsubscript𝑡1𝑇1𝔼delimited-[]superscriptsubscript𝐗𝑡𝑇subscript𝐗𝑡𝑝\frac{1}{T}\sum_{t=1}^{T-1}\mathbb{E}[\mathbf{X}_{t}^{T}\mathbf{X}_{t}]\leq p,divide start_ARG 1 end_ARG start_ARG italic_T end_ARG ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T - 1 end_POSTSUPERSCRIPT blackboard_E [ bold_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ] ≤ italic_p , (5)

    over the time horizon T𝑇Titalic_T.

  2. 2.

    A decoder that at time t𝑡titalic_t predicts the next source state,

    𝐒^t=gt(𝐘t1).subscript^𝐒𝑡subscript𝑔𝑡superscript𝐘𝑡1\hat{\mathbf{S}}_{t}=g_{t}(\mathbf{Y}^{t-1}).over^ start_ARG bold_S end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_Y start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT ) . (6)

For a given code in Definition 3, we denote the predicted error covariance

PtCov(𝐒t𝐒^t).subscript𝑃𝑡Covsubscript𝐒𝑡subscript^𝐒𝑡P_{t}\triangleq\mathrm{Cov}(\mathbf{S}_{t}-\hat{\mathbf{S}}_{t}).italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ≜ roman_Cov ( bold_S start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - over^ start_ARG bold_S end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) . (7)

Note that for an encoder {ft}t=0superscriptsubscriptsubscript𝑓𝑡𝑡0\{f_{t}\}_{t=0}^{\infty}{ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT, the decoder

𝐒^t=𝔼[𝐒t+1|𝐘t]subscript^𝐒𝑡𝔼delimited-[]conditionalsubscript𝐒𝑡1superscript𝐘𝑡\hat{\mathbf{S}}_{t}=\mathbb{E}[\mathbf{S}_{t+1}|\mathbf{Y}^{t}]over^ start_ARG bold_S end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = blackboard_E [ bold_S start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT | bold_Y start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ] (8)

minimizes the MSE

DtTr(Cov(𝐒t𝐒^t)).subscript𝐷𝑡TrCovsubscript𝐒𝑡subscript^𝐒𝑡D_{t}\triangleq\mathrm{Tr}\left(\mathrm{Cov}(\mathbf{S}_{t}-\hat{\mathbf{S}}_{% t})\right).italic_D start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ≜ roman_Tr ( roman_Cov ( bold_S start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - over^ start_ARG bold_S end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ) . (9)

We denote the asymptotic MSE

DlimsuptDt.𝐷subscriptlimsup𝑡subscript𝐷𝑡D\triangleq\text{limsup}_{t\to\infty}D_{t}.italic_D ≜ limsup start_POSTSUBSCRIPT italic_t → ∞ end_POSTSUBSCRIPT italic_D start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT . (10)

In this paper, we study conditions for there to exist a linear encoder {ft}t=0superscriptsubscriptsubscript𝑓𝑡𝑡0\{f_{t}\}_{t=0}^{\infty}{ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT such that D<𝐷D<\inftyitalic_D < ∞.

This setting is relevant to the control problem where the source in (2) is modified to include a control input Utsubscript𝑈𝑡U_{t}italic_U start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT,

St+1=ASt+BUt+Wtsubscript𝑆𝑡1𝐴subscript𝑆𝑡𝐵subscript𝑈𝑡subscript𝑊𝑡S_{t+1}=AS_{t}+BU_{t}+W_{t}italic_S start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT = italic_A italic_S start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + italic_B italic_U start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT (11)

where B𝐵Bitalic_B is a constant matrix, and the decoder decides Utsubscript𝑈𝑡U_{t}italic_U start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT. Provided that Utsuperscript𝑈𝑡U^{t}italic_U start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT are available at the encoder at time t𝑡titalic_t, the classical result [22] of certainty equivalence holds, implying the system in (11) is stabilizable if and only if the controller (decoder) can estimate the source with finite MSE, D<𝐷D<\inftyitalic_D < ∞ (10).

Refer to caption
Figure 1: A MIMO AWGN channel, described in Definition 1, with a noiseless feedback link is shown. The Gauss-Markov source, described in Definition 2, produces information at every time, which is encoded and passed through the channel. The decoder seeks to estimate the source at the next time given all channel outputs. We display the optimal encoding structure of Lemma 1.

III A Linear Code

In this section, we present the optimal linear code and discuss the performance of linear codes in Definition 3.

III-A Optimal Linear Code Structure

A general linear encoder has the form

𝐗t=Ξt(𝐒t)+Ψt(𝐘t1)+𝐌t,subscript𝐗𝑡subscriptΞ𝑡superscript𝐒𝑡subscriptΨ𝑡superscript𝐘𝑡1subscript𝐌𝑡\mathbf{X}_{t}=\Xi_{t}(\mathbf{S}^{t})+\Psi_{t}(\mathbf{Y}^{t-1})+\mathbf{M}_{% t},bold_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = roman_Ξ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_S start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) + roman_Ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_Y start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT ) + bold_M start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , (12)

where 𝐌tsubscript𝐌𝑡\mathbf{M}_{t}bold_M start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is a Gaussian random variable that is independent of (𝐒t,𝐘t1)superscript𝐒𝑡superscript𝐘𝑡1(\mathbf{S}^{t},\mathbf{Y}^{t-1})( bold_S start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT , bold_Y start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT ). While this encoder involves all past states and channel outputs from feedback, the next result establishes a simplified optimal code structure involving only the recent state estimate error.

Lemma 1 (Innovations encoder)

The optimal linear encoder can be written as

𝐗t=Γ~tPt1(𝐒t𝐒^t)+𝐌t,subscript𝐗𝑡subscript~Γ𝑡superscriptsubscript𝑃𝑡1subscript𝐒𝑡subscript^𝐒𝑡subscript𝐌𝑡\mathbf{X}_{t}=\tilde{\Gamma}_{t}P_{t}^{-1}(\mathbf{S}_{t}-\hat{\mathbf{S}}_{t% })+\mathbf{M}_{t},bold_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = over~ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( bold_S start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - over^ start_ARG bold_S end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) + bold_M start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , (13)

where 𝐌t𝒩(0,Ωt)similar-tosubscript𝐌𝑡𝒩0subscriptΩ𝑡\mathbf{M}_{t}\sim\mathcal{N}(0,\Omega_{t})bold_M start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∼ caligraphic_N ( 0 , roman_Ω start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) is independent of 𝐒t𝐒^tsubscript𝐒𝑡subscript^𝐒𝑡\mathbf{S}_{t}-\hat{\mathbf{S}}_{t}bold_S start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - over^ start_ARG bold_S end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, and 𝐒^tsubscript^𝐒𝑡\hat{\mathbf{S}}_{t}over^ start_ARG bold_S end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the optimal decoder’s estimate (8), given recursively by

𝐒^t=A𝐒^t1+Kt(𝐘t1Γ~tH𝐒^t1),subscript^𝐒𝑡𝐴subscript^𝐒𝑡1subscriptK𝑡subscript𝐘𝑡1subscript~Γ𝑡𝐻subscript^𝐒𝑡1\hat{\mathbf{S}}_{t}=A\hat{\mathbf{S}}_{t-1}+\mathrm{K}_{t}(\mathbf{Y}_{t-1}-% \tilde{\Gamma}_{t}H\hat{\mathbf{S}}_{t-1}),over^ start_ARG bold_S end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_A over^ start_ARG bold_S end_ARG start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT + roman_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_Y start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT - over~ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_H over^ start_ARG bold_S end_ARG start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT ) , (14)

where Kt=AΓ~tTHT(HΓ~tPt1Γ~tTHT+I)1subscript𝐾𝑡𝐴superscriptsubscript~Γ𝑡𝑇superscript𝐻𝑇superscript𝐻subscript~Γ𝑡superscriptsubscript𝑃𝑡1superscriptsubscript~Γ𝑡𝑇superscript𝐻𝑇𝐼1K_{t}=A\tilde{\Gamma}_{t}^{T}H^{T}\left(H\tilde{\Gamma}_{t}P_{t}^{-1}\tilde{% \Gamma}_{t}^{T}H^{T}+I\right)^{-1}italic_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_A over~ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_H over~ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT + italic_I ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, 𝐒^0=0subscript^𝐒00\hat{\mathbf{S}}_{0}=0over^ start_ARG bold_S end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0, and the error covariance Pt=Cov(𝐒t𝐒^t)subscript𝑃𝑡Covsubscript𝐒𝑡subscript^𝐒𝑡P_{t}=\mathrm{Cov}(\mathbf{S}_{t}-\hat{\mathbf{S}}_{t})italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = roman_Cov ( bold_S start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - over^ start_ARG bold_S end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ). Encoder parameters Γ~t,Ωtsubscript~Γ𝑡subscriptΩ𝑡\tilde{\Gamma}_{t},\Omega_{t}over~ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , roman_Ω start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT must satisfy the power constraint (5)

1Tt=1T1Tr(Γ~tPt1Γ~tT+Ωt)p.1𝑇superscriptsubscript𝑡1𝑇1Trsubscript~Γ𝑡superscriptsubscript𝑃𝑡1superscriptsubscript~Γ𝑡𝑇subscriptΩ𝑡𝑝\frac{1}{T}\sum_{t=1}^{T-1}\mathrm{Tr}(\tilde{\Gamma}_{t}P_{t}^{-1}\tilde{% \Gamma}_{t}^{T}+\Omega_{t})\leq p.divide start_ARG 1 end_ARG start_ARG italic_T end_ARG ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T - 1 end_POSTSUPERSCRIPT roman_Tr ( over~ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT + roman_Ω start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ≤ italic_p . (15)

Intuitively, the encoder can communicate what is currently unknown to the decoder with minimal power by transmitting the innovation 𝐒t𝐒^tsubscript𝐒𝑡subscript^𝐒𝑡\mathbf{S}_{t}-\hat{\mathbf{S}}_{t}bold_S start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - over^ start_ARG bold_S end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT. The decoder’s prediction, 𝐒^tsubscript^𝐒𝑡\hat{\mathbf{S}}_{t}over^ start_ARG bold_S end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, is computed using channel feedback. Compared to [1, Lem. 1], Lemma 1 above introduces an independent additive term 𝐌tsubscript𝐌𝑡\mathbf{M}_{t}bold_M start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT in (13). Its covariance ΩtsubscriptΩ𝑡\Omega_{t}roman_Ω start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT can be chosen at will to allow more freedom in selecting the channel input distribution.

We make the following assumption on our encoder, which simplifies our analysis of the infinite horizon estimation error.

Assumption 3

The encoder in Definition 3 is time-invariant, meaning that Γ~t=Γ~,Ωt=Ω,tformulae-sequencesubscript~Γ𝑡~ΓsubscriptΩ𝑡Ωfor-all𝑡\tilde{\Gamma}_{t}=\tilde{\Gamma},\Omega_{t}=\Omega,\forall tover~ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = over~ start_ARG roman_Γ end_ARG , roman_Ω start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = roman_Ω , ∀ italic_t in (13).

The encoding structure in Lemma 1 reveals a state space model, defined by (13) and (1), that admits a Kalman filter solution (14). Consequently, the Riccati recursions in Lemma 2, stated next, give the estimation error at the infinite horizon. This lemma generalizes Lemma 2 in [1].

Lemma 2 (Riccati recursions and convergent behavior)

The prediction error covariance, Ptsubscript𝑃𝑡P_{t}italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, defined in (7), of the optimal linear code introduced in Lemma 1 evolves according to a Riccati recursion that either diverges or converges to the stabilizing solution P𝑃Pitalic_P of the DARE [21, Sec. E.4]: {dmath} P = APA^T + Q - A~Γ^T H^T
 (I+H (~ΓP^-1 ~Γ^T + Ω) H^T )^-1H~ΓA^T, where asymptotically, the power constraint (15) becomes

Tr(Γ~P1Γ~T+Ω)p.Tr~Γsuperscript𝑃1superscript~Γ𝑇Ω𝑝\mathrm{Tr}(\tilde{\Gamma}P^{-1}\tilde{\Gamma}^{T}+\Omega)\leq p.roman_Tr ( over~ start_ARG roman_Γ end_ARG italic_P start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT + roman_Ω ) ≤ italic_p . (16)

The asymptotic MSE is computed as

D=Tr(P),𝐷Tr𝑃D=\mathrm{Tr}(P),italic_D = roman_Tr ( italic_P ) , (17)

where P𝑃Pitalic_P is the stabilizing solution of (2).

III-B The Linear Stabilizing Capacity

We express the channel input covariance (16) as

Π=Γ~P1Γ~T+Ω,Π~Γsuperscript𝑃1superscript~Γ𝑇Ω\Pi=\tilde{\Gamma}P^{-1}\tilde{\Gamma}^{T}+\Omega,roman_Π = over~ start_ARG roman_Γ end_ARG italic_P start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT + roman_Ω , (18)

and define a new measure of channel capacity, the linear stabilizing capacity (LSC), below.

Definition 4 (Linear stabilizing capacity)

The linear stabilizing capacity of a MIMO AWGN Channel (Def. 1) for a Gauss-Markov source (Def. 2) is

LSC(p)=supΠ0,P0,Ω,Γ~:Tr(Π)p,Ω0,(2),(18)12logdet(I+HΠH)LSC𝑝subscriptsupremum:formulae-sequencesucceeds-or-equalsΠ0succeeds-or-equals𝑃0Ω~Γabsentformulae-sequenceTrΠ𝑝succeeds-or-equalsΩ0italic-(2italic-)italic-(18italic-)12𝐼𝐻Π𝐻\displaystyle\mathrm{LSC}(p)=\sup_{\begin{subarray}{c}\Pi\succeq 0,\ P\succeq 0% ,\ \Omega,\ \tilde{\Gamma}\colon\\ \mathrm{Tr}(\Pi)\leq p,\ \Omega\succeq 0,\ \eqref{DARE},\ \eqref{Pi_input_def}% \end{subarray}}\ \frac{1}{2}\log\det(I+H\Pi H)roman_LSC ( italic_p ) = roman_sup start_POSTSUBSCRIPT start_ARG start_ROW start_CELL roman_Π ⪰ 0 , italic_P ⪰ 0 , roman_Ω , over~ start_ARG roman_Γ end_ARG : end_CELL end_ROW start_ROW start_CELL roman_Tr ( roman_Π ) ≤ italic_p , roman_Ω ⪰ 0 , italic_( italic_) , italic_( italic_) end_CELL end_ROW end_ARG end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_log roman_det ( italic_I + italic_H roman_Π italic_H ) (19)

Finite MSE is achievable under power constraint, p𝑝pitalic_p (16), with linear encoders if and only if the constraint set in (19) is feasible. In Section IV, we will characterize this feasibility condition in terms of the source and channel parameters.

In general,

LSC(p)C(p),LSC𝑝𝐶𝑝\mathrm{LSC}(p)\leq C(p),roman_LSC ( italic_p ) ≤ italic_C ( italic_p ) , (20)

where C(p)𝐶𝑝C(p)italic_C ( italic_p ) is the Shannon capacity, expressed as

C(p)=supΠ0:Tr(Π)p12logdet(I+HΠH).𝐶𝑝subscriptsupremum:succeeds-or-equalsΠ0absentTrΠ𝑝12𝐼𝐻Π𝐻\displaystyle C(p)=\sup_{\begin{subarray}{c}\Pi\succeq 0\colon\\ \mathrm{Tr}(\Pi)\leq p\end{subarray}}\ \frac{1}{2}\log\det(I+H\Pi H).italic_C ( italic_p ) = roman_sup start_POSTSUBSCRIPT start_ARG start_ROW start_CELL roman_Π ⪰ 0 : end_CELL end_ROW start_ROW start_CELL roman_Tr ( roman_Π ) ≤ italic_p end_CELL end_ROW end_ARG end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_log roman_det ( italic_I + italic_H roman_Π italic_H ) . (21)

The ΠΠ\Piroman_Π that solves (21) is known as the water-filling solution [20, Th. 9.1]. The linear stabilizing capacity (19) adds the additional constraints (2), (16) and (18), so a solution ΠΠ\Piroman_Π of (19) is generally sub-optimal in (21).

IV Main Results

We present sufficient and necessary conditions for finite MSE achievable with linear encoders in the transmission of a vector-valued source over an arbitrary rank channel. We also investigate whether linear encoders are optimal for MIMO channels.

IV-A Linear Coding for MIMO Channels

First, we present a sufficient condition for linear codes to achieve finite MSE.

Theorem 1 (MIMO channels - sufficiency)

In zero-delay JSCC (Def. 3) of a k𝑘kitalic_k-dimensional Gauss-Markov source (Def. 2) for transmission over an n𝑛nitalic_n-input MIMO AWGN channel (Def. 1) with power constraint p𝑝pitalic_p, finite asymptotic error, D<𝐷D<\inftyitalic_D < ∞ (10), is achievable if there exists an n𝑛nitalic_n-set partition, {𝒮i}i=1nsuperscriptsubscriptsubscript𝒮𝑖𝑖1𝑛\{\mathcal{S}_{i}\}_{i=1}^{n}{ caligraphic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, of {1,,k}1𝑘\{1,\ldots,k\}{ 1 , … , italic_k }, such that

j𝒮ilog|λj|<Ci(πi),i=1,,nformulae-sequencesubscript𝑗subscript𝒮𝑖subscript𝜆𝑗subscript𝐶𝑖subscript𝜋𝑖for-all𝑖1𝑛\sum_{j\in\mathcal{S}_{i}}\log|\lambda_{j}|<C_{i}(\pi_{i}),\ \forall i=1,% \ldots,n∑ start_POSTSUBSCRIPT italic_j ∈ caligraphic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_log | italic_λ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | < italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , ∀ italic_i = 1 , … , italic_n (22)

where λjsubscript𝜆𝑗\lambda_{j}italic_λ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT are the eigenvalues of A𝐴Aitalic_A (Def. 2), Ci(πi)=12log(1+hi2πi)subscript𝐶𝑖subscript𝜋𝑖121superscriptsubscript𝑖2subscript𝜋𝑖C_{i}(\pi_{i})=\frac{1}{2}\log(1+h_{i}^{2}\pi_{i})italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_log ( 1 + italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) is the Shannon capacity of the i𝑖iitalic_ith channel with power πi0subscript𝜋𝑖0\pi_{i}\geq 0italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ 0 and

i=1nπi=p.superscriptsubscript𝑖1𝑛subscript𝜋𝑖𝑝\sum_{i=1}^{n}\pi_{i}=p.∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_p . (23)

Each set, 𝒮isubscript𝒮𝑖\mathcal{S}_{i}caligraphic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, of the partition {𝒮i}i=1nsuperscriptsubscriptsubscript𝒮𝑖𝑖1𝑛\{\mathcal{S}_{i}\}_{i=1}^{n}{ caligraphic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT contains the indices of the unstable modes assigned to channel i𝑖iitalic_i. Thus, Theorem 1 allocates each unstable mode of the source to a single channel output and defines a power allocation over the channels that allows each channel to stabilize its assigned modes independently. See Figure 2 for an example. We show in Theorem 2 that this partitioning property is necessary if either the source or the channel is scalar.

Theorem 2 (Scalar source or scalar channel)

If either a𝑎a\in\mathbb{R}italic_a ∈ blackboard_R or hh\in\mathbb{R}italic_h ∈ blackboard_R, finite asymptotic error, D<𝐷D<\inftyitalic_D < ∞ (10), is achievable by linear codes if and only if there exists a partition {𝒮i}i=1nsuperscriptsubscriptsubscript𝒮𝑖𝑖1𝑛\{\mathcal{S}_{i}\}_{i=1}^{n}{ caligraphic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and power allocation vector π𝜋\piitalic_π satisfying (22), (23) in Theorem 1.

Case I (Scalar channel): The vector-source and scalar-channel scenario was shown in [1]. In this setting hh\in\mathbb{R}italic_h ∈ blackboard_R, so the conditions of Theorem 1 reduce to 𝒮1={1,,k}subscript𝒮11𝑘\mathcal{S}_{1}=\{1,\ldots,k\}caligraphic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = { 1 , … , italic_k }, π=p𝜋𝑝\pi=pitalic_π = italic_p. We can explicitly compute

LSC(p)=C(p)=12log(1+h2p)LSC𝑝𝐶𝑝121superscript2𝑝\mathrm{LSC}(p)=C(p)=\frac{1}{2}\log(1+h^{2}p)roman_LSC ( italic_p ) = italic_C ( italic_p ) = divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_log ( 1 + italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p ) (24)

Case II (Scalar source): In the scalar source and MIMO channel setting, a𝑎a\in\mathbb{R}italic_a ∈ blackboard_R. The sets {𝒮i}i=1nsuperscriptsubscriptsubscript𝒮𝑖𝑖1𝑛\{\mathcal{S}_{i}\}_{i=1}^{n}{ caligraphic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT as defined in Theorem 1 satisfy 𝒮i={1}subscript𝒮𝑖1\mathcal{S}_{i}=\{1\}caligraphic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { 1 } and 𝒮l=subscript𝒮𝑙\mathcal{S}_{l}=\emptysetcaligraphic_S start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT = ∅ for all li𝑙𝑖l\neq iitalic_l ≠ italic_i. From (22), finite MSE is achievable by linear encoders if and only if

log|a|<maxiCi(p).𝑎subscript𝑖subscript𝐶𝑖𝑝\log|a|<\max_{i}C_{i}(p).roman_log | italic_a | < roman_max start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p ) . (25)

Generally, for scalar sources and MIMO channels, LSC(p)<C(p)LSC𝑝𝐶𝑝\mathrm{LSC}(p)<C(p)roman_LSC ( italic_p ) < italic_C ( italic_p ), which we show in Theorem 4.

Finally, particularizing Theorem 2 to scalar Gauss-Markov sources that are transmitted over a scalar AWGN channel recovers the classical result from [11] that the finite MSE is achievable if and only if log|a|<C(p),𝑎𝐶𝑝\log|a|<C(p),roman_log | italic_a | < italic_C ( italic_p ) , where LSC(p)=C(p)=12log(1+h2p)LSC𝑝𝐶𝑝121superscript2𝑝\mathrm{LSC}(p)=C(p)=\frac{1}{2}\log(1+h^{2}p)roman_LSC ( italic_p ) = italic_C ( italic_p ) = divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_log ( 1 + italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p ) [16, 3, 17].

It is unknown whether the partitioning property of Theorem 1 is necessary for vector sources and MIMO channels. Towards this goal, we pose the following matrix-algebraic conjecture, which, if true, implies the necessity of the partitioning property in Theorem 1.

Conjecture 1 (Lyapunov positivity)

Let J𝐽Jitalic_J be the unique positive solution to the Lyapunov equation {dmath} J = B J B - Γ^T (I + H ΠH)^-1 Γ+ B Γ^T ΓB where Jk×k𝐽superscript𝑘𝑘J\in\mathbb{R}^{k\times k}italic_J ∈ blackboard_R start_POSTSUPERSCRIPT italic_k × italic_k end_POSTSUPERSCRIPT, Γn×kΓsuperscript𝑛𝑘\Gamma\in\mathbb{R}^{n\times k}roman_Γ ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_k end_POSTSUPERSCRIPT, B=diag(b1,,bk),bi<1iformulae-sequence𝐵diagsubscript𝑏1subscript𝑏𝑘subscript𝑏𝑖1for-all𝑖B=\mathrm{diag}(b_{1},\ldots,b_{k}),b_{i}<1\ \forall iitalic_B = roman_diag ( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_b start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) , italic_b start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT < 1 ∀ italic_i, and H=diag(h1,,hn)𝐻diagsubscript1subscript𝑛H=\mathrm{diag}(h_{1},\ldots,h_{n})italic_H = roman_diag ( italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ). There exists an optimal solution ΓsuperscriptΓ\Gamma^{*}roman_Γ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT to

infΠ0,J0,Γ:(1),Π is diagonalTr(Π)subscriptinfimum:formulae-sequencesucceeds-or-equalsΠ0succeeds𝐽0Γabsentitalic-(1italic-)Π is diagonalTrΠ\inf_{\begin{subarray}{c}\Pi\succeq 0,J\succ 0,\Gamma\colon\\ \eqref{Jlyap_cond},\ \Pi\text{ is diagonal}\end{subarray}}\mathrm{Tr}(\Pi)roman_inf start_POSTSUBSCRIPT start_ARG start_ROW start_CELL roman_Π ⪰ 0 , italic_J ≻ 0 , roman_Γ : end_CELL end_ROW start_ROW start_CELL italic_( italic_) , roman_Π is diagonal end_CELL end_ROW end_ARG end_POSTSUBSCRIPT roman_Tr ( roman_Π ) (26)

that has exactly one non-zero entry per column.

We observe that Conjecture 1 holds numerically for all systems we have simulated with n,k{2,3}𝑛𝑘23n,k\in\{2,3\}italic_n , italic_k ∈ { 2 , 3 }.

Theorem 3 (Vector source and MIMO channel - necessity)

Assume Conjecture 1 holds. In linear zero-delay JSCC (Def. 3) of a vector Gauss-Markov source (Def. 2) and MIMO AWGN channel (Def. 1), finite asymptotic error, D<𝐷D<\inftyitalic_D < ∞ (10), is achievable if and only if there exists a partition {𝒮i}i=1nsuperscriptsubscriptsubscript𝒮𝑖𝑖1𝑛\{\mathcal{S}_{i}\}_{i=1}^{n}{ caligraphic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and power allocation vector π𝜋\piitalic_π satisfying (22), (23) in Theorem 1.

If Theorem 3 holds, the necessary and sufficient conditions coincide and (22) reduces to

i=1nlog|λi|LSC(p)=supπ:πi0(22),(23)i=1nCi(πi).superscriptsubscript𝑖1𝑛subscript𝜆𝑖LSC𝑝subscriptsupremum:𝜋subscript𝜋𝑖0italic-(22italic-)italic-(23italic-)superscriptsubscript𝑖1𝑛subscript𝐶𝑖subscript𝜋𝑖\sum_{i=1}^{n}\ \log|\lambda_{i}|\ \leq\mathrm{LSC}(p)=\sup_{\begin{subarray}{% c}\pi\colon\ \pi_{i}\geq 0\\ \eqref{mimo_cond_part},\ \eqref{powerc}\end{subarray}}\ \sum_{i=1}^{n}C_{i}(% \pi_{i}).∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT roman_log | italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | ≤ roman_LSC ( italic_p ) = roman_sup start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_π : italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ 0 end_CELL end_ROW start_ROW start_CELL italic_( italic_) , italic_( italic_) end_CELL end_ROW end_ARG end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) . (27)

1

Figure 2: Example assignment of source components into the channels and associated power allocations for each channel as described in Theorem 1. A source can only be allocated to a single channel, and every source must be allocated to a channel. Consequently, the sets {𝒮i}i=1nsuperscriptsubscriptsubscript𝒮𝑖𝑖1𝑛\{\mathcal{S}_{i}\}_{i=1}^{n}{ caligraphic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT form a partition of {1,,k}1𝑘\{1,\ldots,k\}{ 1 , … , italic_k }. The power allocations must satisfy (23) where p𝑝pitalic_p is the total power constraint.

IV-B Linear Codes are NOT Optimal for MIMO Channels

The optimality of linear encoding, sometimes referred to as uncoded transmission, has been studied in [14]. In this section, we establish that linear encoders are generally sub-optimal for achieving finite MSE over MIMO channels and present a sufficient optimality condition for linear encoders.

Theorem 4

(Linear coding is sub-optimal for MIMO channels) Linear codes are not optimal for the zero-delay JSCC problem of a Gauss-Markov source over a MIMO AWGN channel (Def. 1 - 3). Equivalently, there exist source-channel pairs (A,H,p)𝐴𝐻𝑝(A,H,p)( italic_A , italic_H , italic_p ) (Def. 1, Def. 2, (5)) such that all linear codes result in D=𝐷D=\inftyitalic_D = ∞, while there exist non-linear codes that can achieve D<𝐷D<\inftyitalic_D < ∞.

Proof:

We show this via a counterexample. Consider the setting of a scalar Gauss-Markov source and an arbitrary rank AWGN channel studied in Theorem 2. Let H=I2𝐻subscript𝐼2H=I_{2}italic_H = italic_I start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and p1much-greater-than𝑝1p\gg 1italic_p ≫ 1, which correspond to the high SNR regime given two identical channels. Shannon-Kotel’nikov map**s as studied in [17, Thm. 7.1] can achieve finite estimation error if

|a|<C(p)o(1)𝑎𝐶𝑝𝑜1|a|<C(p)-o(1)| italic_a | < italic_C ( italic_p ) - italic_o ( 1 ) (28)

as p𝑝p\to\inftyitalic_p → ∞, while by Theorem 2, a linear encoder achieves finite estimation error only if

|a|<12C(p).𝑎12𝐶𝑝|a|<\frac{1}{2}\ C(p).| italic_a | < divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_C ( italic_p ) . (29)

This follows from maxiCi(p)=12log(1+p)subscript𝑖subscript𝐶𝑖𝑝121𝑝\max_{i}C_{i}(p)=\frac{1}{2}\log(1+p)roman_max start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p ) = divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_log ( 1 + italic_p ) in (25) while C(p)=212log(1+p2)𝐶𝑝2121𝑝2C(p)=2\cdot\frac{1}{2}\log(1+\frac{p}{2})italic_C ( italic_p ) = 2 ⋅ divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_log ( 1 + divide start_ARG italic_p end_ARG start_ARG 2 end_ARG ) since the water-filling solution places power p2𝑝2\frac{p}{2}divide start_ARG italic_p end_ARG start_ARG 2 end_ARG in each of the parallel channels. Taking the limit p𝑝p\to\inftyitalic_p → ∞, maxiCi(p)=12C(p).subscript𝑖subscript𝐶𝑖𝑝12𝐶𝑝\max_{i}C_{i}(p)=\frac{1}{2}\ C(p).roman_max start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p ) = divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_C ( italic_p ) .

Next, we will show a sufficient condition for linear codes to achieve finite MSE with minimum power. To do so, we introduce source channel matching in the stability sense.

Definition 5

(Source-channel matching in the stability sense) The source (Def. 2) and the channel (Def. 1) are matched in the stability sense if

i=1nlog|λi|=LSC(p)=C(p),superscriptsubscript𝑖1𝑛subscript𝜆𝑖LSC𝑝𝐶𝑝\sum_{i=1}^{n}\log\ |\lambda_{i}|\ =\mathrm{LSC}(p)=C(p),∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT roman_log | italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | = roman_LSC ( italic_p ) = italic_C ( italic_p ) , (30)

where LSC(p)LSC𝑝\mathrm{LSC}(p)roman_LSC ( italic_p ) is the linear stabilizing capacity (Def. 4) and C(p)𝐶𝑝C(p)italic_C ( italic_p ) is given by (21).

If a source (Def. 2) and channel (Def. 1) are matched in the stability sense, linear codes require the minimum power to stabilize the system. Source-channel matching in Definition 5 holds if and only if ΠΠ\Piroman_Π (19) coincides with the waterfilling solution to C(p)𝐶𝑝C(p)italic_C ( italic_p ) (21). In that case, condition (22) in Theorem 1 reduces to the converse result from [15, Thm. 4.1] stating that finite MSE is achievable only if

i=1klog(|λi|)<C(p).superscriptsubscript𝑖1𝑘subscript𝜆𝑖𝐶𝑝\sum_{i=1}^{k}\log\left(|\lambda_{i}|\right)<C(p).∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT roman_log ( | italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | ) < italic_C ( italic_p ) . (31)

Consequently, source-channel matching ensures linear codes perform as well as the best nonlinear codes.

V Conclusion

This paper contributes to a complete characterization of the performance of linear codes in zero-delay joint-source channel coding with feedback. Theorem 1 showed sufficiency conditions for finite MSE using linear codes for the general case. We conjecture that these conditions are necessary and prove in Theorem 2 that our conjecture holds if either the source or channel is scalar. Finally, we propose a notion of source-channel matching in the stability sense in Definition 5 and demonstrate that linear encoders are generally not optimal for MIMO channels in Theorem 4, motivating further research into non-linear codes. Linear codes remain viable in practice due to their low complexity. An exciting direction for future work is to explore the entire distortion-capacity tradeoff for linear coding of Gauss-Markov sources over AWGN Channels. While converse bounds have been analyzed in [12], achievability bounds are poorly understood for such problems.

Acknowledgment

This work was supported in part by the NSF under grants CCF-1751356 and CCF-1956386, and the Israel Science Foundation (ISF) under grant 1096/23.

References

  • [1] B. Han, O. Sabag, V. Kostina, and B. Hassibi, “Coded Kalman filtering over Gaussian channels with feedback,” in 2023 59th Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp. 1–8, Sep. 2023.
  • [2] C. E. Shannon, “A mathematical theory of communication,” The Bell System Technical Journal, vol. 27, pp. 379–423, Oct. 1948.
  • [3] A. Sahai and S. Mitter, “The necessity and sufficiency of anytime capacity for stabilization of a linear system over a noisy communication link—part i: Scalar systems,” IEEE Transactions on Information Theory, vol. 52, pp. 3369–3395, July 2006.
  • [4] L. Schulman, “Coding for interactive communication,” IEEE Transactions on Information Theory, vol. 42, no. 6, pp. 1745–1756, 1996.
  • [5] R. T. Sukhavasi and B. Hassibi, “Linear time-invariant anytime codes for control over noisy channels,” IEEE Transactions on Automatic Control, vol. 61, pp. 3826–3841, Feb. 2016.
  • [6] C. E. Shannon, “The zero error capacity of a noisy channel,” IRE Transactions on Information Theory, vol. 2, pp. 8–19, Sep. 1956.
  • [7] Y. Polyanskiy, H. V. Poor, and S. Verdu, “Feedback in the non-asymptotic regime,” IEEE Transactions on Information Theory, vol. 57, no. 8, pp. 4903–4925, Aug. 2011.
  • [8] M. V. Burnashev, “Data transmission over a discrete channel with feedback,” Problemy Peredaci Informacii, vol. 12, no. 4, pp. 10–30, Oct. 1976.
  • [9] N. Guo and V. Kostina, “Reliability function for streaming over a DMC with feedback,” IEEE Transactions on Information Theory, vol. 69, pp. 2165–2192, Nov. 2023.
  • [10] O. Sabag, P. Tian, V. Kostina, and B. Hassibi, “Reducing the LQG cost with minimal communication,” IEEE Transactions on Automatic Control, vol. 68, no. 9, pp. 5258–5270, 2023.
  • [11] A. Gorbunov and M. S. Pinsker, “Prognostic epsilon entropy of a Gaussian message and a Gaussian source,” Problemy Peredachi Informatsii, vol. 10, pp. 5–25, Aug. 1974.
  • [12] V. Kostina and B. Hassibi, “Rate-cost tradeoffs in control,” IEEE Transactions on Automatic Control, vol. 64, pp. 4525–4540, Apr. 2019.
  • [13] T. Tanaka, K. K. Kim, P. A. Parrilo, and S. K. Mitter, “Semidefinite programming approach to gaussian sequential rate-distortion trade-offs,” IEEE Transactions on Automatic Control, vol. 62, no. 4, pp. 1896–1910, 2017.
  • [14] M. Gastpar, B. Rimoldi, and M. Vetterli, “To code, or not to code: lossy source-channel communication revisited,” IEEE Transactions on Information Theory, vol. 49, no. 5, pp. 1147–1158, 2003.
  • [15] S. Yüksel, “Characterization of information channels for asymptotic mean stationarity and stochastic stability of nonstationary/unstable linear systems,” IEEE Transactions on Information Theory, vol. 58, no. 10, pp. 6332–6354, Oct. 2012.
  • [16] S. Tatikonda, A. Sahai, and S. Mitter, “Stochastic linear control over a communication channel,” IEEE Transactions on Automatic Control, vol. 49, pp. 1549–1561, Sep. 2004.
  • [17] A. Khina, E. R. Gårding, G. M. Pettersson, V. Kostina, and B. Hassibi, “Control over Gaussian channels with and without source–channel separation,” IEEE Transactions on Automatic Control, vol. 64, pp. 3690–3705, Apr. 2019.
  • [18] N. Elia, “When Bode meets Shannon: control-oriented feedback communication schemes,” IEEE Transactions on Automatic Control, vol. 49, pp. 1477–1488, Sep. 2004.
  • [19] A. A. Zaidi, S. Yüksel, T. J. Oechtering, and M. Skoglund, “On the tightness of linear policies for stabilization of linear systems over Gaussian networks,” Systems and Control Letters, vol. 88, pp. 32–38, Sep. 2016.
  • [20] A. E. Gamal and Y. Kim, Network Information Theory. Cambridge University Press, 2011.
  • [21] T. Kailath, A. Sayed, and B. Hassibi, Linear Estimation. Prentice-Hall information and system sciences series, Prentice Hall, 2000.
  • [22] K. Åström, Introduction to stochastic control theory, vol. 70 of Mathematics in science and engineering. United States: Academic Press, 1970.

Appendix A Proof of Theorem 1 and Conjecture 3

We start with the DARE (2) and power constraint (16) and show that a sequence of statements are equivalent.

First, we apply the transformation

Γ~P1=Γ~Γsuperscript𝑃1Γ\tilde{\Gamma}P^{-1}=\Gammaover~ start_ARG roman_Γ end_ARG italic_P start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = roman_Γ (32)

to obtain a standard DARE.

Statement 1. A finite MSE is achievable if and only if there exists a P0,Γ,Ω0formulae-sequencesucceeds-or-equals𝑃0Γsucceeds-or-equalsΩ0P\succeq 0,\Gamma,\Omega\succeq 0italic_P ⪰ 0 , roman_Γ , roman_Ω ⪰ 0 satisfying (2) and (16).

P=APAT+QAPΓTHT(I+H(ΓPΓT+Ω)HT)1HΓPAT𝑃𝐴𝑃superscript𝐴𝑇𝑄𝐴𝑃superscriptΓ𝑇superscript𝐻𝑇superscript𝐼𝐻Γ𝑃superscriptΓ𝑇Ωsuperscript𝐻𝑇1𝐻Γ𝑃superscript𝐴𝑇\displaystyle\begin{split}&P=APA^{T}+Q-AP\Gamma^{T}H^{T}\\ \qquad&\ \ \ \left(I+H(\Gamma P\Gamma^{T}+\Omega)H^{T}\right)^{-1}H\Gamma PA^{% T}\\ \end{split}start_ROW start_CELL end_CELL start_CELL italic_P = italic_A italic_P italic_A start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT + italic_Q - italic_A italic_P roman_Γ start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ( italic_I + italic_H ( roman_Γ italic_P roman_Γ start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT + roman_Ω ) italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_H roman_Γ italic_P italic_A start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT end_CELL end_ROW (33)
Tr(ΓPΓT+Ω)p.TrΓ𝑃superscriptΓ𝑇Ω𝑝\displaystyle\mathrm{Tr}(\Gamma P\Gamma^{T}+\Omega)\leq p.roman_Tr ( roman_Γ italic_P roman_Γ start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT + roman_Ω ) ≤ italic_p . (34)

We will define ΠΓPΓT+ΩΠΓ𝑃superscriptΓ𝑇Ω\Pi\triangleq\Gamma P\Gamma^{T}+\Omegaroman_Π ≜ roman_Γ italic_P roman_Γ start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT + roman_Ω as a redundant intermediate variable, resulting in Statement 2.

Statement 2. A finite MSE is achievable if and only if there exists a P0,Γ,Ω0,Π0formulae-sequencesucceeds-or-equals𝑃0Γformulae-sequencesucceeds-or-equalsΩ0succeeds-or-equalsΠ0P\succeq 0,\Gamma,\Omega\succeq 0,\Pi\succeq 0italic_P ⪰ 0 , roman_Γ , roman_Ω ⪰ 0 , roman_Π ⪰ 0 satisfying

P=APAT+QAPΓTHT(I+HΠHT)1HΓPAT𝑃𝐴𝑃superscript𝐴𝑇𝑄𝐴𝑃superscriptΓ𝑇superscript𝐻𝑇superscript𝐼𝐻Πsuperscript𝐻𝑇1𝐻Γ𝑃superscript𝐴𝑇\displaystyle\begin{split}&P=APA^{T}+Q\\ \qquad&\ \ \ -AP\Gamma^{T}H^{T}\left(I+H\Pi H^{T}\right)^{-1}H\Gamma PA^{T}\\ \end{split}start_ROW start_CELL end_CELL start_CELL italic_P = italic_A italic_P italic_A start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT + italic_Q end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL - italic_A italic_P roman_Γ start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_I + italic_H roman_Π italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_H roman_Γ italic_P italic_A start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT end_CELL end_ROW (35)
Π=ΓPΓT+ΩΠΓ𝑃superscriptΓ𝑇Ω\displaystyle\Pi=\Gamma P\Gamma^{T}+\Omegaroman_Π = roman_Γ italic_P roman_Γ start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT + roman_Ω (36)
Tr(Π)p.TrΠ𝑝\displaystyle\mathrm{Tr}(\Pi)\leq p.roman_Tr ( roman_Π ) ≤ italic_p . (37)

We equivalently write (36) as ΠΓPΓTsucceeds-or-equalsΠΓ𝑃superscriptΓ𝑇\Pi\succeq\Gamma P\Gamma^{T}roman_Π ⪰ roman_Γ italic_P roman_Γ start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT. Note also that any Ω0Ω0\Omega\neq 0roman_Ω ≠ 0 hardens the condition of Statement 2 since P+Psucceeds-or-equalssuperscript𝑃𝑃P^{+}\succeq Pitalic_P start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ⪰ italic_P where P+superscript𝑃P^{+}italic_P start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT is the unique positive solution of {dmath} P^+ = AP^+A^T + Q - A P^+ Γ^T H^T
(R+H (ΓP^+ Γ^T + Ω) H^T )^-1HΓP^+ A^T and P𝑃Pitalic_P is the unique positive solution of {dmath} P = APA^T + Q
- A P Γ^T H^T (I+H (ΓP Γ^T) H^T )^-1HΓP A^T Here, we use P+superscript𝑃P^{+}italic_P start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT to denote the solution to (2) when Ω0Ω0\Omega\neq 0roman_Ω ≠ 0 and P𝑃Pitalic_P to denote the solution when Ω=0Ω0\Omega=0roman_Ω = 0. However, moving forward we will use P𝑃Pitalic_P to denote the solution of (A).

Statement 3. A finite MSE is achievable if and only if there exists a P0,Γ,Π0formulae-sequencesucceeds-or-equals𝑃0Γsucceeds-or-equalsΠ0P\succeq 0,\Gamma,\Pi\succeq 0italic_P ⪰ 0 , roman_Γ , roman_Π ⪰ 0 satisfying

P=APAT+QAPΓTHT(I+HΠHT)1HΓPAT𝑃𝐴𝑃superscript𝐴𝑇𝑄𝐴𝑃superscriptΓ𝑇superscript𝐻𝑇superscript𝐼𝐻Πsuperscript𝐻𝑇1𝐻Γ𝑃superscript𝐴𝑇\displaystyle P=APA^{T}+Q-AP\Gamma^{T}H^{T}\left(I+H\Pi H^{T}\right)^{-1}H% \Gamma PA^{T}italic_P = italic_A italic_P italic_A start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT + italic_Q - italic_A italic_P roman_Γ start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_I + italic_H roman_Π italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_H roman_Γ italic_P italic_A start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT (38)
ΠΓPΓTsucceeds-or-equalsΠΓ𝑃superscriptΓ𝑇\displaystyle\Pi\succeq\Gamma P\Gamma^{T}roman_Π ⪰ roman_Γ italic_P roman_Γ start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT (39)
Tr(Π)p.TrΠ𝑝\displaystyle\mathrm{Tr}(\Pi)\leq p.roman_Tr ( roman_Π ) ≤ italic_p . (40)

We reapply the transformation (32).

Statement 4. A finite MSE is achievable if and only if there exists a P0,Γ~,Π0formulae-sequencesucceeds-or-equals𝑃0~Γsucceeds-or-equalsΠ0P\succeq 0,\tilde{\Gamma},\Pi\succeq 0italic_P ⪰ 0 , over~ start_ARG roman_Γ end_ARG , roman_Π ⪰ 0 satisfying

P=APAT+QAΓ~THT(I+HΠHT)1HΓ~AT𝑃𝐴𝑃superscript𝐴𝑇𝑄𝐴superscript~Γ𝑇superscript𝐻𝑇superscript𝐼𝐻Πsuperscript𝐻𝑇1𝐻~Γsuperscript𝐴𝑇\displaystyle P=APA^{T}+Q-A\tilde{\Gamma}^{T}H^{T}\left(I+H\Pi H^{T}\right)^{-% 1}H\tilde{\Gamma}A^{T}italic_P = italic_A italic_P italic_A start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT + italic_Q - italic_A over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_I + italic_H roman_Π italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_H over~ start_ARG roman_Γ end_ARG italic_A start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT (41)
ΠΓ~P1Γ~Tsucceeds-or-equalsΠ~Γsuperscript𝑃1superscript~Γ𝑇\displaystyle\Pi\succeq\tilde{\Gamma}P^{-1}\tilde{\Gamma}^{T}roman_Π ⪰ over~ start_ARG roman_Γ end_ARG italic_P start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT (42)
Tr(Π)p.TrΠ𝑝\displaystyle\mathrm{Tr}(\Pi)\leq p.roman_Tr ( roman_Π ) ≤ italic_p . (43)

Finally, we add two redundant constraints. First,

J=PΓ~TΠ1Γ~0𝐽𝑃superscript~Γ𝑇superscriptΠ1~Γsucceeds-or-equals0J=P-\tilde{\Gamma}^{T}\Pi^{-1}\tilde{\Gamma}\succeq 0italic_J = italic_P - over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG ⪰ 0

which holds iff (42) holds by the Schur complement lemma. Then, by substituting J𝐽Jitalic_J into (41), we obtain a Lyapunov for J𝐽Jitalic_J, {dmath} AJA^T - J + A Γ^T Π^-1 ΓA^T + Q - A Γ^T H^T (I + H ΠH^T)^-1 H ΓA^T - Γ^T Π^-1 Γ= 0. With these redundant conditions, we obtain the statement:

Statement 5. A finite MSE is achievable if and only if there exists a J0,P0,Γ~,Π0formulae-sequencesucceeds-or-equals𝐽0formulae-sequencesucceeds-or-equals𝑃0~Γsucceeds-or-equalsΠ0J\succeq 0,P\succeq 0,\tilde{\Gamma},\Pi\succeq 0italic_J ⪰ 0 , italic_P ⪰ 0 , over~ start_ARG roman_Γ end_ARG , roman_Π ⪰ 0 satisfying

AJATJ+AΓTΠ1ΓAT+QAΓTHT(I+HΠHT)1HΓATΓTΠ1Γ=0𝐴𝐽superscript𝐴𝑇𝐽𝐴superscriptΓ𝑇superscriptΠ1Γsuperscript𝐴𝑇𝑄𝐴superscriptΓ𝑇superscript𝐻𝑇superscript𝐼𝐻Πsuperscript𝐻𝑇1𝐻Γsuperscript𝐴𝑇superscriptΓ𝑇superscriptΠ1Γ0\displaystyle\begin{split}&AJA^{T}-J+A\Gamma^{T}\Pi^{-1}\Gamma A^{T}+Q\\ &\qquad\ \ -A\Gamma^{T}H^{T}(I+H\Pi H^{T})^{-1}H\Gamma A^{T}\\ &\qquad\ \ -\Gamma^{T}\Pi^{-1}\Gamma=0\\ \end{split}start_ROW start_CELL end_CELL start_CELL italic_A italic_J italic_A start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT - italic_J + italic_A roman_Γ start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_Γ italic_A start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT + italic_Q end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL - italic_A roman_Γ start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_I + italic_H roman_Π italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_H roman_Γ italic_A start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL - roman_Γ start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_Γ = 0 end_CELL end_ROW (44)
J=PΓ~TΠ1Γ~𝐽𝑃superscript~Γ𝑇superscriptΠ1~Γ\displaystyle J=P-\tilde{\Gamma}^{T}\Pi^{-1}\tilde{\Gamma}italic_J = italic_P - over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG (45)
P=APAT+QAΓ~THT(I+HΠHT)1HΓ~AT𝑃𝐴𝑃superscript𝐴𝑇𝑄𝐴superscript~Γ𝑇superscript𝐻𝑇superscript𝐼𝐻Πsuperscript𝐻𝑇1𝐻~Γsuperscript𝐴𝑇\displaystyle P=APA^{T}+Q-A\tilde{\Gamma}^{T}H^{T}\left(I+H\Pi H^{T}\right)^{-% 1}H\tilde{\Gamma}A^{T}italic_P = italic_A italic_P italic_A start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT + italic_Q - italic_A over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_I + italic_H roman_Π italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_H over~ start_ARG roman_Γ end_ARG italic_A start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT (46)
ΠΓ~P1Γ~Tsucceeds-or-equalsΠ~Γsuperscript𝑃1superscript~Γ𝑇\displaystyle\Pi\succeq\tilde{\Gamma}P^{-1}\tilde{\Gamma}^{T}roman_Π ⪰ over~ start_ARG roman_Γ end_ARG italic_P start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT (47)
Tr(Π)p.TrΠ𝑝\displaystyle\mathrm{Tr}(\Pi)\leq p.roman_Tr ( roman_Π ) ≤ italic_p . (48)

As in Statement 3, (47) should be satisfied with equality. We will focus our analysis on Statement 5.

Left and right multiplying (44) by A1superscript𝐴1A^{-1}italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT and ATsuperscript𝐴𝑇A^{-T}italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT to obtain a stable Lyapunov equation, we have {dmath} A^-1 J A^-T - J - ~Γ^T Π^-1 ~Γ- A^-1QA^-T + ~Γ^T H^T (I+H ΠH^T)^-1H ~Γ+ A^-1 ~Γ^T Π^-1 ~ΓA^-T = 0 By linearity, we can separate J=J^+J~𝐽^𝐽~𝐽J=\hat{J}+\tilde{J}italic_J = over^ start_ARG italic_J end_ARG + over~ start_ARG italic_J end_ARG where

J^^𝐽\displaystyle\hat{J}over^ start_ARG italic_J end_ARG =A1JATA1QATabsentsuperscript𝐴1𝐽superscript𝐴𝑇superscript𝐴1𝑄superscript𝐴𝑇\displaystyle=A^{-1}JA^{-T}-A^{-1}QA^{-T}= italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_J italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT - italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_Q italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT (49)
J~=A1JATΓ~TΠ1Γ~+Γ~THT(I+HΠHT)1HΓ~+A1Γ~TΠ1Γ~AT~𝐽superscript𝐴1𝐽superscript𝐴𝑇superscript~Γ𝑇superscriptΠ1~Γsuperscript~Γ𝑇superscript𝐻𝑇superscript𝐼𝐻Πsuperscript𝐻𝑇1𝐻~Γsuperscript𝐴1superscript~Γ𝑇superscriptΠ1~Γsuperscript𝐴𝑇\displaystyle\begin{split}\tilde{J}&=A^{-1}JA^{-T}-\tilde{\Gamma}^{T}\Pi^{-1}% \tilde{\Gamma}\\ &\qquad+\tilde{\Gamma}^{T}H^{T}(I+H\Pi H^{T})^{-1}H\tilde{\Gamma}\\ &\qquad+A^{-1}\tilde{\Gamma}^{T}\Pi^{-1}\tilde{\Gamma}A^{-T}\end{split}start_ROW start_CELL over~ start_ARG italic_J end_ARG end_CELL start_CELL = italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_J italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT - over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_I + italic_H roman_Π italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_H over~ start_ARG roman_Γ end_ARG end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT end_CELL end_ROW (50)

The purpose of this step is to separate the terms involving Γ~~Γ\tilde{\Gamma}over~ start_ARG roman_Γ end_ARG so that Γ~~Γ\tilde{\Gamma}over~ start_ARG roman_Γ end_ARG only affects J~~𝐽\tilde{J}over~ start_ARG italic_J end_ARG. If there exists a Γ~~Γ\tilde{\Gamma}over~ start_ARG roman_Γ end_ARG that makes J~~𝐽\tilde{J}over~ start_ARG italic_J end_ARG strictly positive, we can arbitrarily scale Γ~~Γ\tilde{\Gamma}over~ start_ARG roman_Γ end_ARG so that J~~𝐽\tilde{J}over~ start_ARG italic_J end_ARG is arbitrarily positive. By making J~~𝐽\tilde{J}over~ start_ARG italic_J end_ARG arbitrarily positive, J0succeeds-or-equals𝐽0J\succeq 0italic_J ⪰ 0, so we will limit our investigation to the positivity of J~~𝐽\tilde{J}over~ start_ARG italic_J end_ARG.

Let Γ¯isubscript¯Γ𝑖\overline{\Gamma}_{i}over¯ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT denote the i𝑖iitalic_ith row of Γ¯¯Γ\overline{\Gamma}over¯ start_ARG roman_Γ end_ARG. We now have the following chain of equalities

J~=A1J~ATΓ~TΠ1Γ~+Γ~THT(I+HΠHT)1HΓ~+A1Γ~TΠ1Γ~AT~𝐽superscript𝐴1~𝐽superscript𝐴𝑇superscript~Γ𝑇superscriptΠ1~Γsuperscript~Γ𝑇superscript𝐻𝑇superscript𝐼𝐻Πsuperscript𝐻𝑇1𝐻~Γsuperscript𝐴1superscript~Γ𝑇superscriptΠ1~Γsuperscript𝐴𝑇\displaystyle\begin{split}\tilde{J}&=A^{-1}\tilde{J}A^{-T}-\tilde{\Gamma}^{T}% \Pi^{-1}\tilde{\Gamma}\\ \qquad&\ \ \ +\tilde{\Gamma}^{T}H^{T}(I+H\Pi H^{T})^{-1}H\tilde{\Gamma}\\ \qquad&\ \ \ +A^{-1}\tilde{\Gamma}^{T}\Pi^{-1}\tilde{\Gamma}A^{-T}\\ \end{split}start_ROW start_CELL over~ start_ARG italic_J end_ARG end_CELL start_CELL = italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG italic_J end_ARG italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT - over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_I + italic_H roman_Π italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_H over~ start_ARG roman_Γ end_ARG end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT end_CELL end_ROW (51)
=A1J~ATΓ~T(Π+ΠHTHΠ)1Γ~+A1Γ~TΠ1Γ~ATabsentsuperscript𝐴1~𝐽superscript𝐴𝑇superscript~Γ𝑇superscriptΠΠsuperscript𝐻𝑇𝐻Π1~Γsuperscript𝐴1superscript~Γ𝑇superscriptΠ1~Γsuperscript𝐴𝑇\displaystyle\begin{split}&=A^{-1}\tilde{J}A^{-T}-\tilde{\Gamma}^{T}(\Pi+\Pi H% ^{T}H\Pi)^{-1}\tilde{\Gamma}\\ \qquad&\ \ \ +A^{-1}\tilde{\Gamma}^{T}\Pi^{-1}\tilde{\Gamma}A^{-T}\\ \end{split}start_ROW start_CELL end_CELL start_CELL = italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG italic_J end_ARG italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT - over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( roman_Π + roman_Π italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H roman_Π ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT end_CELL end_ROW (52)
=A1J~ATΓ~TΠ12(I+Π1/2HTHΠ1/2)1Π12Γ~+A1Γ~TΠ1Γ~ATabsentsuperscript𝐴1~𝐽superscript𝐴𝑇superscript~Γ𝑇superscriptΠ12superscript𝐼superscriptΠ12superscript𝐻𝑇𝐻superscriptΠ121superscriptΠ12~Γsuperscript𝐴1superscript~Γ𝑇superscriptΠ1~Γsuperscript𝐴𝑇\displaystyle\begin{split}&=A^{-1}\tilde{J}A^{-T}\\ \qquad&\ \ \ -\tilde{\Gamma}^{T}\Pi^{-\frac{1}{2}}(I+\Pi^{1/2}H^{T}H\Pi^{1/2})% ^{-1}\Pi^{-\frac{1}{2}}\tilde{\Gamma}\\ \qquad&\ \ \ +A^{-1}\tilde{\Gamma}^{T}\Pi^{-1}\tilde{\Gamma}A^{-T}\\ \end{split}start_ROW start_CELL end_CELL start_CELL = italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG italic_J end_ARG italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL - over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ( italic_I + roman_Π start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H roman_Π start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT end_CELL end_ROW (53)
=A1J~ATΓ~TΠ12U(I+Λ)1UΠ12Γ~+A1Γ~TΠ1Γ~ATabsentsuperscript𝐴1~𝐽superscript𝐴𝑇superscript~Γ𝑇superscriptΠ12superscript𝑈superscript𝐼Λ1𝑈superscriptΠ12~Γsuperscript𝐴1superscript~Γ𝑇superscriptΠ1~Γsuperscript𝐴𝑇\displaystyle\begin{split}&=A^{-1}\tilde{J}A^{-T}\\ \qquad&\ \ \ -\tilde{\Gamma}^{T}\Pi^{-\frac{1}{2}}U^{*}(I+\Lambda)^{-1}U\Pi^{-% \frac{1}{2}}\tilde{\Gamma}\\ \qquad&\ \ \ +A^{-1}\tilde{\Gamma}^{T}\Pi^{-1}\tilde{\Gamma}A^{-T}\\ \end{split}start_ROW start_CELL end_CELL start_CELL = italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG italic_J end_ARG italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL - over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_U start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( italic_I + roman_Λ ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_U roman_Π start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT end_CELL end_ROW (54)
=A1J~ATΓ¯T(I+Λ)1Γ¯+A1Γ¯TΓ¯ATabsentsuperscript𝐴1~𝐽superscript𝐴𝑇superscript¯Γ𝑇superscript𝐼Λ1¯Γsuperscript𝐴1superscript¯Γ𝑇¯Γsuperscript𝐴𝑇\displaystyle\begin{split}&=A^{-1}\tilde{J}A^{-T}-\overline{\Gamma}^{T}(I+% \Lambda)^{-1}\overline{\Gamma}\\ \qquad&\ \ \ +A^{-1}\overline{\Gamma}^{T}\overline{\Gamma}A^{-T}\\ \end{split}start_ROW start_CELL end_CELL start_CELL = italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG italic_J end_ARG italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT - over¯ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_I + roman_Λ ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over¯ start_ARG roman_Γ end_ARG end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over¯ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over¯ start_ARG roman_Γ end_ARG italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT end_CELL end_ROW (55)
=A1J~AT+i(11+ΛiΓi¯TΓi¯+A1Γi¯TΓi¯AT),absentlimit-fromsuperscript𝐴1~𝐽superscript𝐴𝑇subscript𝑖11subscriptΛ𝑖superscript¯subscriptΓ𝑖𝑇¯subscriptΓ𝑖superscript𝐴1superscript¯subscriptΓ𝑖𝑇¯subscriptΓ𝑖superscript𝐴𝑇\displaystyle\begin{split}&=A^{-1}\tilde{J}A^{-T}+\\ \qquad&\ \ \ \sum_{i}\left(-\frac{1}{1+\Lambda_{i}}\overline{\Gamma_{i}}^{T}% \overline{\Gamma_{i}}+A^{-1}\overline{\Gamma_{i}}^{T}\overline{\Gamma_{i}}A^{-% T}\right),\end{split}start_ROW start_CELL end_CELL start_CELL = italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG italic_J end_ARG italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT + end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( - divide start_ARG 1 end_ARG start_ARG 1 + roman_Λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG over¯ start_ARG roman_Γ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over¯ start_ARG roman_Γ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG + italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over¯ start_ARG roman_Γ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over¯ start_ARG roman_Γ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT ) , end_CELL end_ROW (56)

where (52) follows from the Matrix Inversion lemma, (54) follows from the singular value decomposition of

Π1/2HTHΠ1/2=UΛUsuperscriptΠ12superscript𝐻𝑇𝐻superscriptΠ12𝑈Λsuperscript𝑈\Pi^{1/2}H^{T}H\Pi^{1/2}=U\Lambda U^{*}roman_Π start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H roman_Π start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT = italic_U roman_Λ italic_U start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT (57)

where U𝑈Uitalic_U is unitary and ΛΛ\Lambdaroman_Λ is diagonal, and (55) follows from the notation

Γ¯UΠ12Γ~.¯Γ𝑈superscriptΠ12~Γ\overline{\Gamma}\triangleq U\Pi^{-\frac{1}{2}}\tilde{\Gamma}.over¯ start_ARG roman_Γ end_ARG ≜ italic_U roman_Π start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG . (58)

Let Jisubscript𝐽𝑖J_{i}italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT be the unique solution to the Lyapunov equation

J~i=A1J~iATΓ¯iTΓ¯i1+Λi+A1Γ¯iTΓ¯iATsubscript~𝐽𝑖superscript𝐴1subscript~𝐽𝑖superscript𝐴𝑇superscriptsubscript¯Γ𝑖𝑇subscript¯Γ𝑖1subscriptΛ𝑖superscript𝐴1superscriptsubscript¯Γ𝑖𝑇subscript¯Γ𝑖superscript𝐴𝑇\tilde{J}_{i}=A^{-1}\tilde{J}_{i}A^{-T}-\frac{\bar{\Gamma}_{i}^{T}\bar{\Gamma}% _{i}}{1+\Lambda_{i}}+A^{-1}\bar{\Gamma}_{i}^{T}\bar{\Gamma}_{i}A^{-T}over~ start_ARG italic_J end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG italic_J end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT - divide start_ARG over¯ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over¯ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG 1 + roman_Λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG + italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over¯ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over¯ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT (59)

then it follows that the unique solution to J𝐽Jitalic_J is given by

J~=Ji.~𝐽subscript𝐽𝑖\tilde{J}=\sum J_{i}.over~ start_ARG italic_J end_ARG = ∑ italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT . (60)

From (57), Λi=σi(Π12HTHΠ12)subscriptΛ𝑖subscript𝜎𝑖superscriptΠ12superscript𝐻𝑇𝐻superscriptΠ12\Lambda_{i}=\sigma_{i}\left(\Pi^{\frac{1}{2}}H^{T}H\Pi^{\frac{1}{2}}\right)roman_Λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( roman_Π start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H roman_Π start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ), where σi()subscript𝜎𝑖\sigma_{i}(\cdot)italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( ⋅ ) indicates the i𝑖iitalic_ith singular value in order.

To connect the definition of Γ¯¯Γ\bar{\Gamma}over¯ start_ARG roman_Γ end_ARG to the condition of Theorem 1, we have the following definition:

Let 𝒮i{1,,k} be the set of indices where Γ¯i is non-zero.Let subscript𝒮𝑖1𝑘 be the set of indices where subscript¯Γ𝑖 is non-zero.\displaystyle\begin{split}\textit{Let }\mathcal{S}_{i}\subset\{1,\ldots,k\}% \textit{ be the set of indices where }\\ \qquad\bar{\Gamma}_{i}\textit{ is non-zero.}\end{split}start_ROW start_CELL Let caligraphic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⊂ { 1 , … , italic_k } be the set of indices where end_CELL end_ROW start_ROW start_CELL over¯ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is non-zero. end_CELL end_ROW (61)

A-A Proof of sufficiency

Leveraging the result for vector sources and scalar channels, see Appendix B, J~i0succeedssubscript~𝐽𝑖0\tilde{J}_{i}\succ 0over~ start_ARG italic_J end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≻ 0 if and only if

j𝒮ilog|aj|<12log(1+hi2πi).subscript𝑗subscript𝒮𝑖subscript𝑎𝑗121superscriptsubscript𝑖2subscript𝜋𝑖\sum_{j\in\mathcal{S}_{i}}\log|a_{j}|<\frac{1}{2}\log(1+h_{i}^{2}\pi_{i}).∑ start_POSTSUBSCRIPT italic_j ∈ caligraphic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_log | italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | < divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_log ( 1 + italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) . (62)

The sufficiency of the theorem follows by letting ΠΠ\Piroman_Π be diagonal, in which case Λi=πihi2subscriptΛ𝑖subscript𝜋𝑖superscriptsubscript𝑖2\Lambda_{i}=\pi_{i}h_{i}^{2}roman_Λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Let J~i𝒮superscriptsubscript~𝐽𝑖𝒮\tilde{J}_{i}^{\mathcal{S}}over~ start_ARG italic_J end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT be the submatrix of J~isubscript~𝐽𝑖\tilde{J}_{i}over~ start_ARG italic_J end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT formed by selecting the column and row indices that are members of 𝒮isubscript𝒮𝑖\mathcal{S}_{i}caligraphic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, the support of the i𝑖iitalic_ith row of Γ¯¯Γ\bar{\Gamma}over¯ start_ARG roman_Γ end_ARG. Note that J~i𝒮superscriptsubscript~𝐽𝑖𝒮\tilde{J}_{i}^{\mathcal{S}}over~ start_ARG italic_J end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT will be strictly positive by the MISO theorem and zero elsewhere. Since the union of supports of the rows of Γ¯¯Γ\bar{\Gamma}over¯ start_ARG roman_Γ end_ARG include every possible index, J~=iJ~i0~𝐽subscript𝑖subscript~𝐽𝑖succeeds0\tilde{J}=\sum_{i}\tilde{J}_{i}\succ 0over~ start_ARG italic_J end_ARG = ∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT over~ start_ARG italic_J end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≻ 0 and can be made arbitrarily positive by scaling Γ¯¯Γ\bar{\Gamma}over¯ start_ARG roman_Γ end_ARG. Such a ΠΠ\Piroman_Π, Γ~=Π1/2Γ¯,J=J~+J^,P=J+Γ~TΠ1Γ~formulae-sequence~ΓsuperscriptΠ12¯Γformulae-sequence𝐽~𝐽^𝐽𝑃𝐽superscript~Γ𝑇superscriptΠ1~Γ\tilde{\Gamma}=\Pi^{1/2}\bar{\Gamma},J=\tilde{J}+\hat{J},P=J+\tilde{\Gamma}^{T% }\Pi^{-1}\tilde{\Gamma}over~ start_ARG roman_Γ end_ARG = roman_Π start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT over¯ start_ARG roman_Γ end_ARG , italic_J = over~ start_ARG italic_J end_ARG + over^ start_ARG italic_J end_ARG , italic_P = italic_J + over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG is sufficient for Statement 5.

A-B Proof of necessity

First, we show that a diagonal ΠΠ\Piroman_Π is necessary.

Lemma 3

If J~~𝐽\tilde{J}over~ start_ARG italic_J end_ARG can be made arbitrarily positive by some Γ,ΠΓΠ\Gamma,\Piroman_Γ , roman_Π, it can also be made arbitrarily positive by Γ~,Π~~Γ~Π\tilde{\Gamma},\tilde{\Pi}over~ start_ARG roman_Γ end_ARG , over~ start_ARG roman_Π end_ARG where Π~~Π\tilde{\Pi}over~ start_ARG roman_Π end_ARG is diagonal with Tr(Π~)Tr(Π)Tr~ΠTrΠ\mathrm{Tr}(\tilde{\Pi})\leq\mathrm{Tr}(\Pi)roman_Tr ( over~ start_ARG roman_Π end_ARG ) ≤ roman_Tr ( roman_Π ).

Proof:

Fix a ΛΛ\Lambdaroman_Λ in (59). We will show that given an arbitrary Γ,ΠΓΠ\Gamma,\Piroman_Γ , roman_Π satisfying the power constraint, the same ΛΛ\Lambdaroman_Λ is achievable by a diagonal Π~~Π\tilde{\Pi}over~ start_ARG roman_Π end_ARG and associated Γ~~Γ\tilde{\Gamma}over~ start_ARG roman_Γ end_ARG with less power.

Let

DHHT,𝐷𝐻superscript𝐻𝑇D\triangleq HH^{T},italic_D ≜ italic_H italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , (63)

where by assumption, h1h2hnsubscript1subscript2subscript𝑛h_{1}\geq h_{2}\geq\ldots\geq h_{n}italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≥ italic_h start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≥ … ≥ italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. From (57),

Λ=UΠ1/2HTHΠ1/2U.Λsuperscript𝑈superscriptΠ12superscript𝐻𝑇𝐻superscriptΠ12𝑈\Lambda=U^{*}\Pi^{1/2}H^{T}H\Pi^{1/2}U.roman_Λ = italic_U start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H roman_Π start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT italic_U . (64)

This implies

Π=UΛ12VD1VΛ12UΠsuperscript𝑈superscriptΛ12superscript𝑉superscript𝐷1𝑉superscriptΛ12𝑈\Pi=U^{*}\Lambda^{\frac{1}{2}}V^{*}D^{-1}V\Lambda^{\frac{1}{2}}Uroman_Π = italic_U start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_Λ start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_V roman_Λ start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_U (65)

where V𝑉Vitalic_V is unitary. For a diagonal Π~~Π\tilde{\Pi}over~ start_ARG roman_Π end_ARG,

Λ=Π~1/2HTHΠ~1/2=Π~HHTΛsuperscript~Π12superscript𝐻𝑇𝐻superscript~Π12~Π𝐻superscript𝐻𝑇\Lambda=\tilde{\Pi}^{1/2}H^{T}H\tilde{\Pi}^{1/2}=\tilde{\Pi}HH^{T}roman_Λ = over~ start_ARG roman_Π end_ARG start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H over~ start_ARG roman_Π end_ARG start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT = over~ start_ARG roman_Π end_ARG italic_H italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT (66)

Then by isolating Π~~Π\tilde{\Pi}over~ start_ARG roman_Π end_ARG and applying the trace to both sides,

Tr(Π~)=Tr(ΛD1),Tr~ΠTrΛsuperscript𝐷1\mathrm{Tr}(\tilde{\Pi})=\mathrm{Tr}(\Lambda D^{-1}),roman_Tr ( over~ start_ARG roman_Π end_ARG ) = roman_Tr ( roman_Λ italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) , (67)

and

Tr(Π)=Tr(UΛ12VD1VΛ12U)=Tr(ΛVD1V).TrΠTrsuperscript𝑈superscriptΛ12superscript𝑉superscript𝐷1𝑉superscriptΛ12𝑈TrΛsuperscript𝑉superscript𝐷1𝑉\mathrm{Tr}(\Pi)=\mathrm{Tr}(U^{*}\Lambda^{\frac{1}{2}}V^{*}D^{-1}V\Lambda^{% \frac{1}{2}}U)=\mathrm{Tr}(\Lambda V^{*}D^{-1}V).roman_Tr ( roman_Π ) = roman_Tr ( italic_U start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_Λ start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_V roman_Λ start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_U ) = roman_Tr ( roman_Λ italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_V ) . (68)

Note that the entries of ΛΛ\Lambdaroman_Λ are in descending order and the entries of D1superscript𝐷1D^{-1}italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT are listed in ascending order by assumption that h1h2hnsubscript1subscript2subscript𝑛h_{1}\geq h_{2}\geq\ldots\geq h_{n}italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≥ italic_h start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≥ … ≥ italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT.

We apply Ruhe’s Trace Inequality, which states that if A,B𝐴𝐵A,Bitalic_A , italic_B are PSD matrices, with eigenvalues a1a2ansubscript𝑎1subscript𝑎2subscript𝑎𝑛a_{1}\geq a_{2}\geq\ldots a_{n}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≥ italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≥ … italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT and b1b2bnsubscript𝑏1subscript𝑏2subscript𝑏𝑛b_{1}\geq b_{2}\geq\ldots b_{n}italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≥ italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≥ … italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT,

i=1naibni+1Tr(AB).superscriptsubscript𝑖1𝑛subscript𝑎𝑖subscript𝑏𝑛𝑖1Tr𝐴𝐵\sum_{i=1}^{n}a_{i}b_{n-i+1}\leq\mathrm{Tr}(AB).∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_n - italic_i + 1 end_POSTSUBSCRIPT ≤ roman_Tr ( italic_A italic_B ) . (69)

Here, A=Λ𝐴ΛA=\Lambdaitalic_A = roman_Λ, B=VD1V𝐵superscript𝑉superscript𝐷1𝑉B=V^{*}D^{-1}Vitalic_B = italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_V, and Tr(Π~)Tr~Π\mathrm{Tr}(\tilde{\Pi})roman_Tr ( over~ start_ARG roman_Π end_ARG ) achieves the lower bound with equality since ΛΛ\Lambdaroman_Λ and D1superscript𝐷1D^{-1}italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT are both diagonal with opposing ordering as in (69). Tr(Π)TrΠ\mathrm{Tr}(\Pi)roman_Tr ( roman_Π ) has a sandwiched unitary V𝑉Vitalic_V in (68). Thus,

Tr(Π~)Tr(Π)Tr~ΠTrΠ\mathrm{Tr}(\tilde{\Pi})\leq\mathrm{Tr}(\Pi)roman_Tr ( over~ start_ARG roman_Π end_ARG ) ≤ roman_Tr ( roman_Π )

as desired. ∎

The above lemma shows that we can take ΠΠ\Piroman_Π to be diagonal without loss of generality since that is what minimizes the power. Then,

Λi=πiHi2.subscriptΛ𝑖subscript𝜋𝑖superscriptsubscript𝐻𝑖2\Lambda_{i}=\pi_{i}H_{i}^{2}.roman_Λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT .

and

J~i=A1J~iATΓ¯iTΓ¯i1+hi2πi+A1Γ¯iTΓ¯iATsubscript~𝐽𝑖superscript𝐴1subscript~𝐽𝑖superscript𝐴𝑇superscriptsubscript¯Γ𝑖𝑇subscript¯Γ𝑖1superscriptsubscript𝑖2subscript𝜋𝑖superscript𝐴1superscriptsubscript¯Γ𝑖𝑇subscript¯Γ𝑖superscript𝐴𝑇\tilde{J}_{i}=A^{-1}\tilde{J}_{i}A^{-T}-\frac{\bar{\Gamma}_{i}^{T}\bar{\Gamma}% _{i}}{1+h_{i}^{2}\pi_{i}}+A^{-1}\bar{\Gamma}_{i}^{T}\bar{\Gamma}_{i}A^{-T}over~ start_ARG italic_J end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG italic_J end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT - divide start_ARG over¯ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over¯ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG 1 + italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG + italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over¯ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over¯ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT (70)

To show the partitioning property of Theorem 1, we need to show that the sets {𝒮i}i=1nsuperscriptsubscriptsubscript𝒮𝑖𝑖1𝑛\{\mathcal{S}_{i}\}_{i=1}^{n}{ caligraphic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT cover {1,,k}1𝑘\{1,\ldots,k\}{ 1 , … , italic_k } and are disjoint.

We show the necessity of i=1k𝒮i={1,,k}superscriptsubscript𝑖1𝑘subscript𝒮𝑖1𝑘\bigcup_{i=1}^{k}\mathcal{S}_{i}=\{1,\ldots,k\}⋃ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT caligraphic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { 1 , … , italic_k }. For J~~𝐽\tilde{J}over~ start_ARG italic_J end_ARG to be arbitrarily positive, ΓΓ\Gammaroman_Γ must excite all directions of J~~𝐽\tilde{J}over~ start_ARG italic_J end_ARG. To see this, suppose an index j𝑗jitalic_j exists such that Γ¯ij=0subscript¯Γ𝑖𝑗0\bar{\Gamma}_{ij}=0over¯ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = 0 for all i𝑖iitalic_i in (58). Then, ejTJ~ej=0superscriptsubscript𝑒𝑗𝑇~𝐽subscript𝑒𝑗0e_{j}^{T}\tilde{J}e_{j}=0italic_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over~ start_ARG italic_J end_ARG italic_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0, where ejsubscript𝑒𝑗e_{j}italic_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is the j𝑗jitalic_jth standard basis vector, and J~~𝐽\tilde{J}over~ start_ARG italic_J end_ARG cannot be positive definite.

What remains to be shown is the necessity of the disjointness of the sets {𝒮i}i=1nsuperscriptsubscriptsubscript𝒮𝑖𝑖1𝑛\{\mathcal{S}_{i}\}_{i=1}^{n}{ caligraphic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. By showing Conjecture 1, J~~𝐽\tilde{J}over~ start_ARG italic_J end_ARG (50) can be made arbitrarily positive by scaling Γ~~Γ\tilde{\Gamma}over~ start_ARG roman_Γ end_ARG of the form described in Conjecture 1. The sets {𝒮i}i=1nsuperscriptsubscriptsubscript𝒮𝑖𝑖1𝑛\{\mathcal{S}_{i}\}_{i=1}^{n}{ caligraphic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT are then disjoint.

We can consider the equivalent (redundant) constraints on P𝑃Pitalic_P in Statement 5. In our construction of ΠΠ\Piroman_Π in (47), we demonstrated that slack in ΠΓ~P1Γ~succeeds-or-equalsΠ~Γsuperscript𝑃1~Γ\Pi\succeq\tilde{\Gamma}P^{-1}\tilde{\Gamma}roman_Π ⪰ over~ start_ARG roman_Γ end_ARG italic_P start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG only hardens the conditions, so we can always take Π=Γ~P1Γ~Π~Γsuperscript𝑃1~Γ\Pi=\tilde{\Gamma}P^{-1}\tilde{\Gamma}roman_Π = over~ start_ARG roman_Γ end_ARG italic_P start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG. In Lemma 3, we showed that the optimal ΠΠ\Piroman_Π is diagonal. Consequently, Γ~TP1Γ~superscript~Γ𝑇superscript𝑃1~Γ\tilde{\Gamma}^{T}P^{-1}\tilde{\Gamma}over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG can also be considered to be diagonal. We now ask what this diagonality constraint imposes on the structure of Γ~~Γ\tilde{\Gamma}over~ start_ARG roman_Γ end_ARG.

Recall our original DARE:

P=APA+QAPΓTH(I+HΓPΓTH)1HΓPAT𝑃𝐴𝑃𝐴𝑄𝐴𝑃superscriptΓ𝑇𝐻superscript𝐼𝐻Γ𝑃superscriptΓ𝑇𝐻1𝐻Γ𝑃superscript𝐴𝑇P=APA+Q-AP\Gamma^{T}H(I+H\Gamma P\Gamma^{T}H)^{-1}H\Gamma PA^{T}italic_P = italic_A italic_P italic_A + italic_Q - italic_A italic_P roman_Γ start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H ( italic_I + italic_H roman_Γ italic_P roman_Γ start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_H roman_Γ italic_P italic_A start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT

where Γ=Γ~P1Γ~Γsuperscript𝑃1\Gamma=\tilde{\Gamma}P^{-1}roman_Γ = over~ start_ARG roman_Γ end_ARG italic_P start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT.

Under the condition ΓPΓTΓ𝑃superscriptΓ𝑇\Gamma P\Gamma^{T}roman_Γ italic_P roman_Γ start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT is diagonal, the measurement covariance I+HΓPΓTH𝐼𝐻Γ𝑃superscriptΓ𝑇𝐻I+H\Gamma P\Gamma^{T}Hitalic_I + italic_H roman_Γ italic_P roman_Γ start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H is diagonal as well since H𝐻Hitalic_H is diagonal. We conjecture that the diagonality of ΓPΓTΓ𝑃superscriptΓ𝑇\Gamma P\Gamma^{T}roman_Γ italic_P roman_Γ start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT imposes a structure on Γ~~Γ\tilde{\Gamma}over~ start_ARG roman_Γ end_ARG so that 𝒮isubscript𝒮𝑖\mathcal{S}_{i}caligraphic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is consistent with the condition of Theorem 1. This means that Γ~~Γ\tilde{\Gamma}over~ start_ARG roman_Γ end_ARG has only a single non-zero entry in each column. Equivalently, each mode is assigned to a single channel. If this matrix-algebraic fact on the structure of Γ~~Γ\tilde{\Gamma}over~ start_ARG roman_Γ end_ARG holds, Conjecture 3 holds as well.

This concludes the proof.

Appendix B Proof of Lemma 3 - Vector Source over Scalar Channel

This proof was initially presented in [1], but we give it here for completeness.

The channel is scalar, so J~=J~1~𝐽subscript~𝐽1\tilde{J}=\tilde{J}_{1}over~ start_ARG italic_J end_ARG = over~ start_ARG italic_J end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT in 70. Further, we can always set π1=psubscript𝜋1𝑝\pi_{1}=pitalic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_p since the optimal encoder uses all the available power. From (70), we have

J~=A1J~ATΓ¯TΓ¯i1+h2p+A1Γ¯TΓ¯AT~𝐽superscript𝐴1~𝐽superscript𝐴𝑇superscript¯Γ𝑇subscript¯Γ𝑖1superscript2𝑝superscript𝐴1superscript¯Γ𝑇¯Γsuperscript𝐴𝑇\tilde{J}=A^{-1}\tilde{J}A^{-T}-\frac{\bar{\Gamma}^{T}\bar{\Gamma}_{i}}{1+h^{2% }p}+A^{-1}\bar{\Gamma}^{T}\bar{\Gamma}A^{-T}over~ start_ARG italic_J end_ARG = italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG italic_J end_ARG italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT - divide start_ARG over¯ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over¯ start_ARG roman_Γ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG 1 + italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p end_ARG + italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over¯ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over¯ start_ARG roman_Γ end_ARG italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT (71)

Defining DΓdiag(Γ¯)subscript𝐷Γdiag¯ΓD_{\Gamma}\triangleq\mbox{diag}(\bar{\Gamma})italic_D start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT ≜ diag ( over¯ start_ARG roman_Γ end_ARG ), i.e., the diagonal matrix whose components are the elements of the vector Γ¯Tsuperscript¯Γ𝑇\bar{\Gamma}^{T}over¯ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT, we may now write

A1Γ¯T=DΓaandΓ¯T=DΓ1,superscript𝐴1superscript¯Γ𝑇subscript𝐷Γ𝑎andsuperscript¯Γ𝑇subscript𝐷Γ1A^{-1}\bar{\Gamma}^{T}=D_{\Gamma}a~{}~{}~{}\mbox{and}~{}~{}~{}\bar{\Gamma}^{T}% =D_{\Gamma}1,italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over¯ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT = italic_D start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT italic_a and over¯ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT = italic_D start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT 1 ,

where a𝑎aitalic_a is the vector of the diagonal elements of A1superscript𝐴1A^{-1}italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT and 1111 is the all-one vector. (71) becomes {dmath} ~J = A^-1~J A^-T + D_Γ(aa^T-11T1 + h2p )D_Γ. Let M𝑀Mitalic_M be the solution to the Lyapunov equation

M=A1MAT+11T.𝑀superscript𝐴1𝑀superscript𝐴𝑇superscript11𝑇M=A^{-1}MA^{-T}+11^{T}.italic_M = italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_M italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT + 11 start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT . (72)

Let (Au,1)subscript𝐴𝑢1(A_{u},1)( italic_A start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , 1 ) be controllable, which holds by Assumption 2. Then by the Lyapunov stability theorem, M0succeeds𝑀0M\succ 0italic_M ≻ 0. We now claim that

J~=DΓ(h2p1+h2pM11T)DΓ.~𝐽subscript𝐷Γsuperscript2𝑝1superscript2𝑝𝑀superscript11𝑇subscript𝐷Γ{\tilde{J}}=D_{\Gamma}\left(\frac{h^{2}p}{1+h^{2}p}M-11^{T}\right)D_{\Gamma}.over~ start_ARG italic_J end_ARG = italic_D start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT ( divide start_ARG italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p end_ARG start_ARG 1 + italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p end_ARG italic_M - 11 start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) italic_D start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT . (73)

This can be verified by plugging (73) into (B). It follows that J~0succeeds~𝐽0{\tilde{J}}\succ 0over~ start_ARG italic_J end_ARG ≻ 0 if and only if h2p1+h2pM11T>0superscript2𝑝1superscript2𝑝𝑀superscript11𝑇0\frac{h^{2}p}{1+h^{2}p}M-11^{T}>0divide start_ARG italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p end_ARG start_ARG 1 + italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p end_ARG italic_M - 11 start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT > 0. But the latter is equivalent to

[M11Th2p1+h2pM11T]0,succeedsdelimited-[]𝑀1superscript1𝑇superscript2𝑝1superscript2𝑝𝑀superscript11𝑇0\left[\begin{array}[]{cc}M&1\\ 1^{T}&\frac{h^{2}p}{1+h^{2}p}M-11^{T}\end{array}\right]\succ 0,[ start_ARRAY start_ROW start_CELL italic_M end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT end_CELL start_CELL divide start_ARG italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p end_ARG start_ARG 1 + italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p end_ARG italic_M - 11 start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT end_CELL end_ROW end_ARRAY ] ≻ 0 , (74)

or

h2p1+h2pM11T>1TM11.superscript2𝑝1superscript2𝑝𝑀superscript11𝑇superscript1𝑇superscript𝑀11\frac{h^{2}p}{1+h^{2}p}M-11^{T}>1^{T}M^{-1}1.divide start_ARG italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p end_ARG start_ARG 1 + italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p end_ARG italic_M - 11 start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT > 1 start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_M start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT 1 . (75)

Assume M𝑀Mitalic_M satisfies the Lyapunov equation (72). Then

1TM11=1|det(A)|2.superscript1𝑇superscript𝑀111superscriptdet𝐴21^{T}M^{-1}1=1-\left|\mbox{det}(A)\right|^{-2}.1 start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_M start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT 1 = 1 - | det ( italic_A ) | start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT . (76)

This follows since A1MAT=M11Tsuperscript𝐴1𝑀superscript𝐴𝑇𝑀superscript11𝑇A^{-1}MA^{-T}=M-11^{T}italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_M italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT = italic_M - 11 start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT from (72). On the one hand

detA1MAT=|detA|2detMdetsuperscript𝐴1𝑀superscript𝐴𝑇superscriptdet𝐴2det𝑀\mbox{det}A^{-1}MA^{-T}=\left|\mbox{det}A\right|^{-2}\mbox{det}{M}det italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_M italic_A start_POSTSUPERSCRIPT - italic_T end_POSTSUPERSCRIPT = | det italic_A | start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT det italic_M (77)

and on the other {dmath} det(M-11^T) = det(I-11^TM^-1)  det M = (1-1^TM^-11)  detM, which yields the desired result.

Then, (75) and (76) imply that J~0succeeds~𝐽0\tilde{J}\succ 0over~ start_ARG italic_J end_ARG ≻ 0 if and only if

h2p1+h2p>1|detA|2,superscript2𝑝1superscript2𝑝1superscriptdet𝐴2\frac{h^{2}p}{1+h^{2}p}>1-\left|\mbox{det}A\right|^{-2},divide start_ARG italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p end_ARG start_ARG 1 + italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p end_ARG > 1 - | det italic_A | start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT , (78)

or equivalently

1+h2p>|detAu|2,1superscript2𝑝superscriptdetsubscript𝐴𝑢21+h^{2}p>\left|\mbox{det}A_{u}\right|^{2},1 + italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p > | det italic_A start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (79)

which is the capacity condition we are seeking.

Note that when this capacity condition holds, h2p1+h2pM11T0succeedssuperscript2𝑝1superscript2𝑝𝑀superscript11𝑇0\frac{h^{2}p}{1+h^{2}p}M-11^{T}\succ 0divide start_ARG italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p end_ARG start_ARG 1 + italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p end_ARG italic_M - 11 start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ≻ 0 and therefore J~0succeeds~𝐽0{\tilde{J}}\succ 0over~ start_ARG italic_J end_ARG ≻ 0 in (73). We can arbitrarily scale Γ¯¯Γ\bar{\Gamma}over¯ start_ARG roman_Γ end_ARG and therefore DΓsubscript𝐷ΓD_{\Gamma}italic_D start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT to make J~~𝐽\tilde{J}over~ start_ARG italic_J end_ARG arbitrarily positive. This demonstrates both sufficiency and necessity.

Appendix C Proof of Lemma 3 - Scalar Source over MIMO Channel

We also consider a scalar source and arbitrary rank channel in Lemma 3. We can solve (46) explicitly as

P=qa2Γ~THT(I+HΠHT)1HΓ~1a2.𝑃𝑞superscript𝑎2superscript~Γ𝑇superscript𝐻𝑇superscript𝐼𝐻Πsuperscript𝐻𝑇1𝐻~Γ1superscript𝑎2P=\frac{q-a^{2}\tilde{\Gamma}^{T}H^{T}(I+H\Pi H^{T})^{-1}H\tilde{\Gamma}}{1-a^% {2}}.italic_P = divide start_ARG italic_q - italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT over~ start_ARG roman_Γ end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_I + italic_H roman_Π italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_H over~ start_ARG roman_Γ end_ARG end_ARG start_ARG 1 - italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG . (80)

Plugging this into (45) we obtain {dmath} J = -qa2-1 + ~Γ^T ( a2a2-1 H^T (I + H ΠH^T)^-1 H - Π^-1 ) ~Γ⪰0 Finite estimation error can be achieved iff there exists a Γ~~Γ\tilde{\Gamma}over~ start_ARG roman_Γ end_ARG and Π0Π0\Pi\geq 0roman_Π ≥ 0, Tr(Π)pTrΠ𝑝\mathrm{Tr}(\Pi)\leq proman_Tr ( roman_Π ) ≤ italic_p such that J0𝐽0J\geq 0italic_J ≥ 0. The above statement is equivalent to the statement: finite error is not achievable iff for all Γ~,Π~ΓΠ\tilde{\Gamma},\Piover~ start_ARG roman_Γ end_ARG , roman_Π such that Π0Π0\Pi\geq 0roman_Π ≥ 0, Tr(Π)pTrΠ𝑝\mathrm{Tr}(\Pi)\leq proman_Tr ( roman_Π ) ≤ italic_p, we have J<0𝐽0J<0italic_J < 0. This is what we set out to show.

Let

Oa2a21HT(I+HΠHT)1HΠ1.𝑂superscript𝑎2superscript𝑎21superscript𝐻𝑇superscript𝐼𝐻Πsuperscript𝐻𝑇1𝐻superscriptΠ1O\triangleq\frac{a^{2}}{a^{2}-1}H^{T}(I+H\Pi H^{T})^{-1}H-\Pi^{-1}.italic_O ≜ divide start_ARG italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 1 end_ARG italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_I + italic_H roman_Π italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_H - roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT . (81)

Note that J<0𝐽0J<0italic_J < 0 for all Γ~~Γ\tilde{\Gamma}over~ start_ARG roman_Γ end_ARG if and only if O0precedes-or-equals𝑂0O\preceq 0italic_O ⪯ 0.

O=Π1a2a21HT(I+HΠHT)1H𝑂superscriptΠ1superscript𝑎2superscript𝑎21superscript𝐻𝑇superscript𝐼𝐻Πsuperscript𝐻𝑇1𝐻\displaystyle-O=\Pi^{-1}-\frac{a^{2}}{a^{2}-1}H^{T}(I+H\Pi H^{T})^{-1}H- italic_O = roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - divide start_ARG italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 1 end_ARG italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_I + italic_H roman_Π italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_H 0absent0\displaystyle\geq 0≥ 0 (82)

The inequality is equivalent to

(Π1a2a21HTa2a21HI+HΠHT)0matrixsuperscriptΠ1superscript𝑎2superscript𝑎21superscript𝐻𝑇superscript𝑎2superscript𝑎21𝐻𝐼𝐻Πsuperscript𝐻𝑇0\begin{pmatrix}\Pi^{-1}&\sqrt{\frac{a^{2}}{a^{2}-1}}H^{T}\\ \sqrt{\frac{a^{2}}{a^{2}-1}}H&I+H\Pi H^{T}\end{pmatrix}\geq 0( start_ARG start_ROW start_CELL roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_CELL start_CELL square-root start_ARG divide start_ARG italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 1 end_ARG end_ARG italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL square-root start_ARG divide start_ARG italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 1 end_ARG end_ARG italic_H end_CELL start_CELL italic_I + italic_H roman_Π italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT end_CELL end_ROW end_ARG ) ≥ 0 (83)

which is also equivalent to

I+HΠHTa2a21HΠHT𝐼𝐻Πsuperscript𝐻𝑇superscript𝑎2superscript𝑎21𝐻Πsuperscript𝐻𝑇\displaystyle I+H\Pi H^{T}-\frac{a^{2}}{a^{2}-1}H\Pi H^{T}italic_I + italic_H roman_Π italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT - divide start_ARG italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 1 end_ARG italic_H roman_Π italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT 0absent0\displaystyle\geq 0≥ 0 (84)
I1a21HΠHT𝐼1superscript𝑎21𝐻Πsuperscript𝐻𝑇\displaystyle I-\frac{1}{a^{2}-1}H\Pi H^{T}italic_I - divide start_ARG 1 end_ARG start_ARG italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 1 end_ARG italic_H roman_Π italic_H start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT 0absent0\displaystyle\geq 0≥ 0 (85)

Recall that H𝐻Hitalic_H is diagonal. Let hmaxsubscript𝑚𝑎𝑥h_{max}italic_h start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT be the entry of H𝐻Hitalic_H with the greatest magnitude. Note that ΠΠ\Piroman_Π has the best chance of overcoming (85) by placing all power on the best channel. Thus, if this alignment of ΠΠ\Piroman_Π does not violate (85), the inequality holds for any ΠΠ\Piroman_Π. Then, the final inequality holds for any Π0,Tr(Π)pformulae-sequenceΠ0TrΠ𝑝\Pi\geq 0,\mathrm{Tr}(\Pi)\leq proman_Π ≥ 0 , roman_Tr ( roman_Π ) ≤ italic_p if and only if

a21+hmax2psuperscript𝑎21superscriptsubscript𝑚𝑎𝑥2𝑝a^{2}\geq 1+h_{max}^{2}pitalic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≥ 1 + italic_h start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p (86)

or

loga12log(1+hmax2p)𝑎121superscriptsubscript𝑚𝑎𝑥2𝑝\log a\geq\frac{1}{2}\log(1+h_{max}^{2}p)roman_log italic_a ≥ divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_log ( 1 + italic_h start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p ) (87)

This is a necessary and sufficient condition for D<𝐷D<\inftyitalic_D < ∞ to be unachievable. Taking the inverse of the above, we obtain the desired condition.