License: arXiv.org perpetual non-exclusive license
arXiv:2310.07987v2 [cs.NI] 12 Jan 2024

Semantic-Forward Relaying: A Novel Framework Towards
6G Cooperative Communications

Wensheng Lin, , Yuna Yan, Lixin Li, , Zhu Han,  and Tad Matsumoto This letter has been accepted for publication in IEEE Communications Letters with DOI: 10.1109/LCOMM.2024.3352916. Corresponding author: Lixin Li. W. Lin, Y. Yan and L. Li are with the School of Electronics and Information, Northwestern Polytechnical University, Xi’an, Shaanxi 710129, China (e-mail: [email protected]; [email protected]; [email protected]). Z. Han is with the Department of Electrical and Computer Engineering at the University of Houston, Houston, TX 77004 USA, and also with the Department of Computer Science and Engineering, Kyung Hee University, Seoul, South Korea, 446-701 (e-mail: [email protected]). T. Matsumoto is an Invited Professor at IMT-Atlantique, France. He is also Professor Emeritus of Japan Advanced Institute of Science Technology, Ishikawa 923-1292, Japan, and University of Oulu, Finland (e-mail: [email protected]).
Abstract

This letter proposes a novel relaying framework, semantic-forward (SF), for cooperative communications towards the sixth-generation (6G) wireless networks. The SF relay extracts and transmits the semantic features, which reduces forwarding payload, and also improves the network robustness against intra-link errors. Based on the theoretical basis for cooperative communications with side information and the turbo principle, we design a joint source-channel coding algorithm to iteratively exchange the extrinsic information for enhancing the decoding gains at the destination. Surprisingly, simulation results indicate that even in bad channel conditions, SF relaying can still effectively improve the recovered information quality.

Index Terms:
Semantic-forward, cooperative communications, semantic communications, relaying systems, side information.

I Introduction

Cooperative communications are acknowledged schemes to improve the transmission quality. One of the most important categories of cooperative communications is in the form of relaying. Although relaying requires extra energy and time slot, it is an effective solution when the path-loss of the direct link is very large. On the other hand, with the research trend towards the sixth-generation (6G) wireless networks [1], various transmission technologies have been invented, among which semantic communications [2] are considered to have a great potential in media transmissions. The semantic encoder extracts the semantic features for transmissions [3], while the semantic decoder works in a similar way to the generative artificial intelligence (AI) [4]. Inspired by the principle of semantic communications, this letter proposes a novel framework, semantic-forward (SF) relaying, for cooperative communications.

There have been already diverse relaying schemes in the literature [5]. One simple relaying scheme is amplify-and-forward (AF), in which the relay directly amplifies the signals received from the source and then forwards to the destination. In 1979, Cover and El Gamal [6] established the fundamental theorems of relaying systems, and proposed the decode-and-forward (DF) and compress-and-forward (CF) schemes. In the DF scheme, the relay decodes the received signals at the first step, and then the recovered information sequence is forwarded or discarded, respectively, depending on the recovery is error-free or not. In the CF scheme, the relay quantizes and compresses its received signals into the relay information to be transmitted to the destination. Beyond DF, lossy-forward (LF) [7] was proposed to overcome the drawback of DF, where the communication resources are completely wasted once errors occur in the relay information. In the LF scheme, the relay always forwards the relay information to the destination regardless of whether or not intra-link error is detected at the relay. At the destination, a joint decoder recovers the source information with the help of the relayed information, based on the principle of correlated sources transmission.

Nevertheless, the previous relaying schemes are designed for general types of information, which do not exploit the features of information to improve the information efficiency. By adopting the semantic communications, the system adaptively exploits diverse types of information. Hence, we aims at designing a relaying systems where the relay forwards the semantic information to the destination, i.e., SF relaying, so that the destination can utilize the semantic information to help recovering the source information.

The terminology of SF has been used for the first time in [8], up to the authors’ maximum knowledge, where the source transmits semantic information to the relay, and the relay translates and forwards the processed semantic information to the destination. However, there is no direct source-destination link in [8].

Different from the relay-assisted semantic communications in [8], this work proposes a semantic-assisted relaying system. In our proposed SF relaying system, the relay reconstructs the information received from the source at the first step. Then, in spite of whether or not the reconstruction error-free, the relay extracts the semantic information and sends it to the destination. The semantic coding achieves robustness of the relaying system against errors, and hence can reduce the payload in the relay-destination (R-D) link. At the destination, a joint decoder performs iterative decoding utilizing the Turbo principle [9] that exchanges the extrinsic information between the lossy information of the source-destination (S-D) link and the semantic information of the R-D link. In this way, the SF relaying can help for the lossless recovery of the original information at the destinations, even in bad channel conditions. With the pre-trained semantic encoder/decoder, the SF relaying can reduce the payload of the R-D link in practical systems. The contributions of this letter are summarized as follows:

  • We propose a novel relaying framework, i.e., SF relaying, which adopts semantic communications at the relay and the destination to reduce the payload of the R-D link.

  • We design a joint source-channel coding algorithm for SF relaying systems, where the destination can losslessly recover the source information with the assistance of the semantic information received from the relay.

  • We conduct a series of simulations with image transmissions to evaluate the performance of SF relaying. The simulation results demonstrate that SF relaying systems can exploit the semantic information to reduce the Euclidean distance (ED) and improve the image quality.

Notation. Capital letters X,Y,V,U𝑋𝑌𝑉𝑈X,Y,V,Uitalic_X , italic_Y , italic_V , italic_U denote the random variables for constructing information sequences. M𝑀Mitalic_M represents the codeword satisfying the link rate constraint R𝑅Ritalic_R.

II The Principle of Semantic-Forward Relaying

This section explains the principle of SF relaying, and then analyzes the rate constraints on the rates for losslessly recovering the source information.

II-A Framework Structure

Refer to caption
Figure 1: The principle of SF Relaying.

As illustrated in Fig. 1, the SF relaying system contains one source, one relay and one destination. The source is nothing specific than a common source in conventional relaying systems. It broadcasts the original information to the relay and the destination at the first time slot. For semantic communications, the media type of the original information could be image, audio, video, text, etc., and combinations of multimedia.

After receiving the signals from the source, the relay reconstructs the source information as high fidelity as possible. However, due to the unavoidable channel fading, the reconstructed information may not be lossless. It is noticed that in semantic communications, the transmitted information is the extracted features of the original information. This indicates that semantic coding is robust against the noise to a certain extent. Hence, the relay utilizes a semantic encoder to extract the semantic information based on a common knowledge base shared with the destination. Then, the semantic information is sent to the destination at the second slot.

Once the destination receives all the signals sent from the source and the relay, it initiates the process for reconstructing the original information by a joint decoder with the assistance of the semantic information, based on the information theoretic principle of cooperative communications with correlated sources. If the channel conditions of the three links can satisfy the lossless reconstruction requirements at the destination, the destination can recover the original information loslessly.

Remark 1: The source can broadcast the semantic information instead of the original information. However, the recovery at the destination is not easy to be lossless in practical, because it equivalently requires the semantic decoder to losslessly recover the original information by only utilizing the semantic encoded information.

Remark 2: In general, the semantic communications aims to transmit much less payload than the original information, while maintaining a relatively high quality of the recovered information. Thus, given the total transmit energy constant, the transmit energy per bit of the R-D link can be higher than the S-D link.

II-B Theoretical System Model

For theoretical analysis, we can rely on the system model illustrated in Fig. 1 in either symbol-wise or bit-wise. We assume the system to be bit-wise hereafter for simplicity without loss of generality. Xnsuperscript𝑋𝑛X^{n}italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT denotes the original information sequence, with n𝑛nitalic_n being the information sequence length. Ynsuperscript𝑌𝑛Y^{n}italic_Y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is the lossy information sequence recovered at the relay. Hence, Y𝑌Yitalic_Y can be represented by Y=XE𝑌direct-sum𝑋𝐸Y=X\oplus Eitalic_Y = italic_X ⊕ italic_E, where EBern(ρ)similar-to𝐸Bern𝜌E\sim\mathrm{Bern}(\rho)italic_E ∼ roman_Bern ( italic_ρ ) is the corrupting error with ρ𝜌\rhoitalic_ρ being the crossover probability between X𝑋Xitalic_X and Y𝑌Yitalic_Y. Due to the rate constraint R1subscript𝑅1R_{1}italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT supported by the channel capacity on the S-D link, Xnsuperscript𝑋𝑛X^{n}italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT may not be losslessly transmitted to the destination. Therefore, the transmission on the S-D link can be equivalently regarded as the lossy compression from Xnsuperscript𝑋𝑛X^{n}italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT to codeword M1subscript𝑀1M_{1}italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT by encoder (ENC) 1. Similarly, Ynsuperscript𝑌𝑛Y^{n}italic_Y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is also equivalently encoded into codeword M2subscript𝑀2M_{2}italic_M start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT by ENC 2 to satisfy rate constraint R2subscript𝑅2R_{2}italic_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. Vnsuperscript𝑉𝑛V^{n}italic_V start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT represents the side information provided by the semantic knowledge, which is commonly shared by the relay and the destination. Unsuperscript𝑈𝑛U^{n}italic_U start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT stands for the lossy compressed version of Ynsuperscript𝑌𝑛Y^{n}italic_Y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT reconstructed from codeword M2subscript𝑀2M_{2}italic_M start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. Finally, the joint decoder (DEC) reconstructs X^nsuperscript^𝑋𝑛\hat{X}^{n}over^ start_ARG italic_X end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT based on codewords M1subscript𝑀1M_{1}italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and M2subscript𝑀2M_{2}italic_M start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT with the assistance of the side information Vnsuperscript𝑉𝑛V^{n}italic_V start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. In this letter, we aim at the lossless recovery of Xnsuperscript𝑋𝑛X^{n}italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, while the recovery X^nsuperscript^𝑋𝑛\hat{X}^{n}over^ start_ARG italic_X end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT can be either lossless or lossy in general systems.

II-C Achievable Rate Analysis

Form the Shannon theory, the rate constraint on the source-relay (S-R) link is

R0I(X;Y).subscript𝑅0𝐼𝑋𝑌\displaystyle R_{0}\geq I(X;Y).italic_R start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≥ italic_I ( italic_X ; italic_Y ) . (1)

For the rate constraints on the S-D and R-D links, we need to consider from the viewpoint of the joint DEC for losslessly reconstructing X^nsuperscript^𝑋𝑛\hat{X}^{n}over^ start_ARG italic_X end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. First, if without the side information Vnsuperscript𝑉𝑛V^{n}italic_V start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, the coding rates need to satisfy the conditions for lossless source coding with a helper [10, Theorem 10.2], as

R1subscript𝑅1\displaystyle R_{1}italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT H(X|U),absent𝐻conditional𝑋𝑈\displaystyle\geq H(X|U),≥ italic_H ( italic_X | italic_U ) , (2)
R2subscript𝑅2\displaystyle R_{2}italic_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT I(Y;U).absent𝐼𝑌𝑈\displaystyle\geq I(Y;U).≥ italic_I ( italic_Y ; italic_U ) . (3)

Then, if the side information is available at both ENC 2 and joint DEC, the R-D link will be reduced by the conditional version of the mutual information without side information [10, Eq. (11.2)], i.e.,

R2subscript𝑅2\displaystyle R_{2}italic_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT I(Y;U|V).absent𝐼𝑌conditional𝑈𝑉\displaystyle\geq I(Y;U|V).≥ italic_I ( italic_Y ; italic_U | italic_V ) . (4)

Furthermore, considering that the side information is available at joint DEC but unavailable at ENC 1, the structure is a Wyner-Ziv problem [11] because the side information provided by the semantic knowledge is noncausal. Hence, the rate is reduced by condition on V𝑉Vitalic_V, as

R1subscript𝑅1\displaystyle R_{1}italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT H(X|U,V).absent𝐻conditional𝑋𝑈𝑉\displaystyle\geq H(X|U,V).≥ italic_H ( italic_X | italic_U , italic_V ) . (5)

In summary, the achievable rate region for lossless transmissions with SF relaying is the combination of (1), (4) and (5). The difference between SF and other relaying systems can be observed from (4) and (5). By introducing a common knowledge base, the required rates in the S-D and R-D links can both be further decreased by the side information V𝑉Vitalic_V. If the Shannon rate limit is found to be lower than the link payload, we update the knowledge base to further reduce the rate requirements.

Besides the conventional information theory, we can also apply the metrics of semantic information theory [12], such as semantic entropy and semantic distortion, to evaluate the semantic performance.

Note that when designing the optimal encoder and decoder, the distributions of the variables need to be known, which is internally estimated by the deep neural network (DNN).

III Joint Source-Channel Coding Design

Refer to caption
Figure 2: The system structure of joint source-channel coding for SF relaying.

Fig. 2 illustrates the general system structure of joint source-channel coding for SF relaying. In practical systems, the quantization process may be needed if the media is transmitted in using symbols with finite alphabet.

Before the channel coding, the media information is quantized and represented by binary sequence with length n𝑛nitalic_n in bits, as stated before. After the channel decoding, the media information is recovered from the received binary sequence. Therefore, we focus on bit-wise joint decoding algorithms in this letter, and the design of symbol-wise joint decoding algorithms is left as the future work.

Hereafter, we utilize images as the examples for introducing the joint source-channel coding design of the media transmission with SF relaying. One image pixel in each color channel is represented by 8 bits. For conciseness, the quantization process is omitted in description while implicitly used for converting the image pixel between analogue values and binary sequence.

At the source, information sequence Xnsuperscript𝑋𝑛X^{n}italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is channel encoded by channel ENC 1, and then broadcast to the relay and the destination. Then, the relay receives and decodes the source signals by channel DEC 1 to recover the source information sequence. Due to the possible intra-link errors, the recovered sequence Ynsuperscript𝑌𝑛Y^{n}italic_Y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT may not be exactly the same as Xnsuperscript𝑋𝑛X^{n}italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. Nevertheless, the relay continues encoding and transmitting the relay information to the destination. The image feature is extracted by the semantic ENC, and further encoded by channel ENC 2 before transmitted to the destination.

At the destination, the received signals at the first slot are first decoded by channel DEC 1 to output the a posteriori LLR (LLR1psuperscriptsubscriptLLR1𝑝\mathrm{LLR}_{1}^{p}roman_LLR start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT) of Xnsuperscript𝑋𝑛X^{n}italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. The received signals of the second slot are successively decoded by channel DEC 2 and the semantic DEC to output the a posteriori LLR (LLR2psuperscriptsubscriptLLR2𝑝\mathrm{LLR}_{2}^{p}roman_LLR start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT) of Ynsuperscript𝑌𝑛Y^{n}italic_Y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT.

Then, the extrinsic LLR (LLRiesuperscriptsubscriptLLR𝑖𝑒\mathrm{LLR}_{i}^{e}roman_LLR start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT) representing the decoding gain is calculated by LLRie=LLRipLLRia, for i{1,2},formulae-sequencesuperscriptsubscriptLLR𝑖𝑒superscriptsubscriptLLR𝑖𝑝superscriptsubscriptLLR𝑖𝑎 for 𝑖12\mathrm{LLR}_{i}^{e}=\mathrm{LLR}_{i}^{p}-\mathrm{LLR}_{i}^{a},\textrm{ for }i% \in\{1,2\},roman_LLR start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT = roman_LLR start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT - roman_LLR start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a end_POSTSUPERSCRIPT , for italic_i ∈ { 1 , 2 } , where LLRiasuperscriptsubscriptLLR𝑖𝑎\mathrm{LLR}_{i}^{a}roman_LLR start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a end_POSTSUPERSCRIPT is the a priori LLR, the initial value of which is all set at 00. The extrinsic LLRs are then exchanged by the extrinsic information exchanger (Ex-Info-Ex). Based on the correlation model of correlated sources [13], the extrinsic LLRs of Xnsuperscript𝑋𝑛X^{n}italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and Ynsuperscript𝑌𝑛Y^{n}italic_Y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT are exchanged by the LLR updating function fc()subscript𝑓𝑐f_{c}(\cdot)italic_f start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT ( ⋅ ) [14, Eq. (10)] to update the a priori LLR.

In the next round of decoding iteration, the updated LLR1asuperscriptsubscriptLLR1𝑎\mathrm{LLR}_{1}^{a}roman_LLR start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a end_POSTSUPERSCRIPT is directly utilized as the a priori information for channel DEC 1. However, the updated LLR2asuperscriptsubscriptLLR2𝑎\mathrm{LLR}_{2}^{a}roman_LLR start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a end_POSTSUPERSCRIPT needs to be further encoded as the a priori LLR (LLR2ssuperscriptsubscriptLLR2𝑠\mathrm{LLR}_{2}^{s}roman_LLR start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT) of the semantic information, and then input into channel DEC 2.

Finally, if the iteration round reaches the maximum times, the latest LLR1psuperscriptsubscriptLLR1𝑝\mathrm{LLR}_{1}^{p}roman_LLR start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT is utilized to reconstruct X^nsuperscript^𝑋𝑛\hat{X}^{n}over^ start_ARG italic_X end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT by hard decision.

At the cost for reducing the link payload, SF requires relatively higher computational complexity than other relaying schemes, due mainly to the semantic encoder/decoder in the relay and the destination. This algorithm presented in this letter can be further extended to the system with multiple relays. The example Python codes of the proposed SF relaying system are provided in [15].

IV Performance Evaluation

IV-A Semantic Encoder/Decoder Structure

Since the objective of this letter is to evaluate the performance gain of SF relaying, we design the structure of semantic encoder/decoder based on convolution and transpose convolution for simplicity.

TABLE I: Neural Network Structure of the Semantic ENC/DEC
Layer type Parameters Output shape
ENC Input None 3×96×96396963\times 96\times 963 × 96 × 96
Conv2D Filters: 16, Size: (2,2), Stride: 1 16×95×9516959516\times 95\times 9516 × 95 × 95
Conv2D Filters: 16, Size: (3,3), Stride: 2 16×47×4716474716\times 47\times 4716 × 47 × 47
Conv2D Filters: 16, Size: (3,3), Stride: 2 16×23×2316232316\times 23\times 2316 × 23 × 23
DEC Input None 16×23×2316232316\times 23\times 2316 × 23 × 23
TranConv2D Filters: 16, Size: (3,3), Stride: 2 16×47×4716474716\times 47\times 4716 × 47 × 47
TranConv2D Filters: 16, Size: (3,3), Stride: 2 16×95×9516959516\times 95\times 9516 × 95 × 95
TranConv2D Filters: 3, Size: (2,2), Stride: 1 3×96×96396963\times 96\times 963 × 96 × 96

In general, the design of neural network structures is affected by datasets. The simulation images are randomly selected from the CIFAR-10 dataset [16]. Thus, the neural network structures of the semantic ENC/DEC are designed as shown in Table I, where the padding is set at 00 and omitted for all convolutional layers, and transpose convolutional layers.

For the semantic ENC, the input is a three-channel image with its width and height both being 96969696 pixels. The 3×96×96396963\times 96\times 963 × 96 × 96 pixel values are input into a two-dimensional (2D) convolutional layer (Conv2D) with 16161616 filters, (2,2)22(2,2)( 2 , 2 )-sized kernels, and a stride of 1111, to extract the image features into 16161616 channels. Subsequently, the features of 16161616 channels are successively compressed by two concatenated Conv2Ds with 16161616 filters, (2,2)22(2,2)( 2 , 2 )-sized kernels, and a stride of 2222. Therefore, the final output of the semantic ENC is the features of 16161616 channels with the size in each channel being 23×23232323\times 2323 × 23. It is easy to calculate the compression rate of the semantic ENC as (16×23×23)/(3×96×96)0.306162323396960.306(16\times 23\times 23)/(3\times 96\times 96)\approx 0.306( 16 × 23 × 23 ) / ( 3 × 96 × 96 ) ≈ 0.306.

The semantic decoding process is the inverse of the semantic encoding process. Therefore, for the semantic DEC, the input of the (16×23×23)162323(16\times 23\times 23)( 16 × 23 × 23 )-sized features is decompressed by two concatenated 2D transpose convolutional layers (TranConv2Ds) with 16161616 filters, (3,3)33(3,3)( 3 , 3 )-sized kernels, and a stride of 2222. Finally, the 96×96969696\times 9696 × 96 image pixels in the 3 channels are reconstructed by TranConv2D with 3333 filters, (2,2)22(2,2)( 2 , 2 )-sized kernels, and a stride of 1111.

IV-B Simulation Settings

In the simulations, we assume line-of-sight (LoS) components dominate the channels for simplicity. Compared to the source, we assume the relay is closer to the destination, and hence the signal-to-noise ratio (SNR) γ2subscript𝛾2\gamma_{2}italic_γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT in the R-D link is relatively higher than the SNR γ1subscript𝛾1\gamma_{1}italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT in the S-D link. We simply set γ2=20subscript𝛾220\gamma_{2}=20italic_γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 20 dB which is sufficiently large for losslessly transmitting the semantic information, while γ1subscript𝛾1\gamma_{1}italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT varies from 55-5- 5 dB to 9999 dB. Additive white Gaussian noise (AWGN) channels are adopted for the S-D and R-D links. For the S-R link, we utilize the crossover probability ρ𝜌\rhoitalic_ρ between the quantized bits Xnsuperscript𝑋𝑛X^{n}italic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and Ynsuperscript𝑌𝑛Y^{n}italic_Y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT to represent the channel conditions for simplicity.

The low-density parity-check (LDPC) codes [17] are utilized as the channel codes, with the codeword length being set at 900900900900. There are 3333 bits in the same parity-check equation, and each bit is associated with 2222 parity-check equations. To satisfy the LDPC codeword length, the information bits are divide into groups, and the last group is padded by zero bits. The maximum number of the local decoding iteration for LDPC codes, and that of the global decoding iteration are set at 1111 and 7777, respectively. Before the first global iteration, we need one round of initial LDPC decoding for obtaining non-zero LLRs, and hence the ensemble number of LDPC decoding iterations is equal to 8888.

The loss function for training the semantic neural network is defined as Loss=min[αMSE(𝑿,𝑿^)+βCE(𝒁,𝒁^)],𝐿𝑜𝑠𝑠𝛼subscriptMSE𝑿bold-^𝑿𝛽subscriptCE𝒁bold-^𝒁Loss=\min[\alpha\mathcal{F}_{\mathrm{MSE}}(\boldsymbol{X},\boldsymbol{\hat{X}}% )+\beta\mathcal{F}_{\mathrm{CE}}(\boldsymbol{Z},\boldsymbol{\hat{Z}})],italic_L italic_o italic_s italic_s = roman_min [ italic_α caligraphic_F start_POSTSUBSCRIPT roman_MSE end_POSTSUBSCRIPT ( bold_italic_X , overbold_^ start_ARG bold_italic_X end_ARG ) + italic_β caligraphic_F start_POSTSUBSCRIPT roman_CE end_POSTSUBSCRIPT ( bold_italic_Z , overbold_^ start_ARG bold_italic_Z end_ARG ) ] , where 𝑿𝑿\boldsymbol{X}bold_italic_X and 𝑿^bold-^𝑿\boldsymbol{\hat{X}}overbold_^ start_ARG bold_italic_X end_ARG are the input and output image matrix, respectively. 𝒁𝒁\boldsymbol{Z}bold_italic_Z and 𝒁^bold-^𝒁\boldsymbol{\hat{Z}}overbold_^ start_ARG bold_italic_Z end_ARG are matrices for distinguishing different categories with the same batch size as 𝑿𝑿\boldsymbol{X}bold_italic_X and 𝑿^bold-^𝑿\boldsymbol{\hat{X}}overbold_^ start_ARG bold_italic_X end_ARG. We use MSEsubscriptMSE\mathcal{F}_{\mathrm{MSE}}caligraphic_F start_POSTSUBSCRIPT roman_MSE end_POSTSUBSCRIPT and CEsubscriptCE\mathcal{F}_{\mathrm{CE}}caligraphic_F start_POSTSUBSCRIPT roman_CE end_POSTSUBSCRIPT to evaluate mean square error and cross entropy, respectively. The hyper-parameters α𝛼\alphaitalic_α and β𝛽\betaitalic_β are used to balance the proportion of MSEsubscriptMSE\mathcal{F}_{\mathrm{MSE}}caligraphic_F start_POSTSUBSCRIPT roman_MSE end_POSTSUBSCRIPT and CEsubscriptCE\mathcal{F}_{\mathrm{CE}}caligraphic_F start_POSTSUBSCRIPT roman_CE end_POSTSUBSCRIPT. The initial parameters of the semantic neural network are set as follows. The hyper-parameters α𝛼\alphaitalic_α and β𝛽\betaitalic_β are set at 1.51.51.51.5 and 0.560.560.560.56, respectively. The Adam optimizer [18] is adopted with a learning rate of 0.0010.0010.0010.001. Moreover, we employ a batch size of 64646464 and 200200200200 epochs for training.

IV-C Simulation Results

Refer to caption
Figure 3: Euclidean distance with diverse ρ𝜌\rhoitalic_ρ.

To quantify the similarity between the original and reconstructed images, we use the ED as a reliable metric. Due to the page limit, the comparison with other relaying systems is left as the future work. Fig. 3 plots the ED curves for diverse ρ𝜌\rhoitalic_ρ. Obviously, the joint decoding with SF relaying always outperforms the independent decoding, which verifies the effectiveness of SF. Another observation that justifies our intuition is that, the ED reduces as the quality of the relay information increases, i.e., smaller ρ𝜌\rhoitalic_ρ. Notice that the performance gain decreases as the SNR of the S-D link increases. This is because the LDPC codes have a greater capability for increasing coding gains and hence the independent LDPC decoding has already corrected most of the errors, when the S-D link SNR becomes larger. In particular, the ED of the independent decoding decreases to 00 when γ18subscript𝛾18\gamma_{1}\geq 8italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≥ 8 dB.

Refer to caption
(a) Example with ρ=0𝜌0\rho=0italic_ρ = 0.
Refer to caption
(b) γ1=5subscript𝛾15\gamma_{1}=-5italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = - 5 dB, ρ=0.1𝜌0.1\rho=0.1italic_ρ = 0.1.
Refer to caption
(c) γ1=5subscript𝛾15\gamma_{1}=-5italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = - 5 dB, ρ=0.35𝜌0.35\rho=0.35italic_ρ = 0.35.
Refer to caption
(d) γ1=0subscript𝛾10\gamma_{1}=0italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 0 dB, ρ=0.35𝜌0.35\rho=0.35italic_ρ = 0.35.
Figure 4: Comparison of the reconstructed image qualities.

We utilize the original and error-free semantic images shown in Fig. 4(a) as an example for displaying the reconstructed images. Figs. 4(b)-4(d) depicts the reconstructed images. Clearly, the image quality increases as the round of global iterations (Iter) increases. It is found that no more image quality improvement can be achieved when Iter changes from 4444 to 7777, and hence Itr=7absent7=7= 7 is enough in this system setup.

In Fig. 4(b), it is clearly found that the images of joint (Jnt) decoding have a higher visual quality than the images of independent (Ind) decoding. The images of semantic (Sem) decoding also becomes clearer by exchanging the extrinsic information obtained via the S-D link.

Fig. 4(c) shows the results in very bad channel conditions. Surprisingly, although the initial semantic image is unresolvable, it exhibits a relatively clear shape of the object after joint decoding. Due to the exchange of the extrinsic information, the image quality of joint decoding also improves dramatically.

By comparing Figs. 4(c) and 4(d), we can conclude that the image quality is higher in better channel conditions. Moreover, the semantic image is recovered very similar to the case with ρ=0𝜌0\rho=0italic_ρ = 0 shown in Fig. 4(a).

V Conclusion

We have proposed a novel concept of SF relaying, which is suitable for 6G media transmissions and adaptively accommodates various types of information. The principle of SF relaying has been explained in detail, and its achievable rate constraints for lossless recovery at the destination have been analyzed. In addition, we have designed a joint source-channel coding scheme for SF relaying, and further implemented the SF relaying technique in image transmission simulations. The simulation results indicate that the SF relaying system can adequately eliminate the effect of intra-link errors by utilizing the semantic decoder, reduce the payload in the R-D link, and achieve lossless transmissions in the worse channel conditions.

References

  • [1] C.-X. Wang, X. You, X. Gao, X. Zhu, Z. Li, C. Zhang, H. Wang, Y. Huang, Y. Chen, H. Haas, J. S. Thompson, E. G. Larsson, M. D. Renzo, W. Tong, P. Zhu, X. Shen, H. V. Poor, and L. Hanzo, “On the road to 6G: Visions, requirements, key technologies, and testbeds,” IEEE Communications Surveys & Tutorials, vol. 25, no. 2, pp. 905–974, Second quarter 2023.
  • [2] W. Yang, H. Du, Z. Q. Liew, W. Y. B. Lim, Z. Xiong, D. Niyato, X. Chi, X. Shen, and C. Miao, “Semantic communications for future internet: Fundamentals, applications, and challenges,” IEEE Communications Surveys & Tutorials, vol. 25, no. 1, pp. 213–250, First quarter 2023.
  • [3] H. Zhang, S. Shao, M. Tao, X. Bi, and K. B. Letaief, “Deep learning-enabled semantic communication systems with task-unaware transmitter and dynamic data,” IEEE Journal on Selected Areas in Communications, vol. 41, no. 1, pp. 170–185, Jan. 2023.
  • [4] H. Feng, Y. Yang, and Z. Han, “SCAI: Scalable AI generative content for enhanced semantic communication,” in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seoul, Korea, Apr. 2024, invited.
  • [5] G. Kramer, M. Gastpar, and P. Gupta, “Cooperative strategies and capacity theorems for relay networks,” IEEE Transactions on Information Theory, vol. 51, no. 9, pp. 3037–3063, Sep. 2005.
  • [6] T. Cover and A. El Gamal, “Capacity theorems for the relay channel,” IEEE Transactions on Information Theory, vol. 25, no. 5, pp. 572–584, Sep. 1979.
  • [7] W. Lin, S. Qian, and T. Matsumoto, “Lossy-forward relaying for lossy communications: Rate-distortion and outage probability analyses,” IEEE Transactions on Wireless Communications, vol. 18, no. 8, pp. 3974–3986, Aug. 2019.
  • [8] X. Luo, B. Yin, Z. Chen, B. Xia, and J. Wang, “Autoencoder-based semantic communication systems with relay channels,” in IEEE International Conference on Communications (ICC) Workshops, Seoul, Republic of Korea, May 2022, pp. 711–716.
  • [9] C. Berrou and A. Glavieux, “Near optimum error correcting coding and decoding: Turbo-codes,” IEEE Transactions on Communications, vol. 44, no. 10, pp. 1261–1271, Oct. 1996.
  • [10] A. El Gamal and Y.-H. Kim, Network information theory.   Cambridge University Press, 2011.
  • [11] A. Wyner and J. Ziv, “The rate-distortion function for source coding with side information at the decoder,” IEEE Transactions on information Theory, vol. 22, no. 1, pp. 1–10, Jan. 1976.
  • [12] G. Xin, P. Fan, and K. B. Letaief, “Semantic information theory: Recent advances and future challenges,” Preprints, October 2023. [Online]. Available: https://doi.org/10.20944/preprints202310.1208.v1
  • [13] J. Garcia-Frias and Y. Zhao, “Near-Shannon/Slepian-Wolf performance for unknown correlated sources over AWGN channels,” IEEE Transactions on Communications, vol. 53, no. 4, pp. 555–559, Apr. 2005.
  • [14] X. Zhou, X. He, K. Anwar, and T. Matsumoto, “GREAT-CEO: larGe scale distRibuted dEcision mAking Technique for wireless Chief Executive Officer problems,” IEICE Transactions on Communications, vol. 95, no. 12, pp. 3654–3662, Dec. 2012.
  • [15] “Semantic-forward relaying,” GitHub, Oct. 2023. [Online]. Available: https://github.com/linwest/Semantic_Forward
  • [16] A. Krizhevsky, “Learning multiple layers of features from tiny images,” Toronto, ON, Canada, 2009. [Online]. Available: https://www.cs.toronto.edu/~kriz/learning-features-2009-TR.pdf
  • [17] R. Gallager, “Low-density parity-check codes,” IRE Transactions on Information Theory, vol. 8, no. 1, pp. 21–28, Jan. 1962.
  • [18] D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” arXiv preprint arXiv:1412.6980, 2014.