Toward Low-latency Iterative Decoding of QLDPC Codes Under Circuit-Level Noise

Authors: Anqi Gong, Sebastian Cammerer, Joseph M. Renes

Abstract: We introduce a sliding window decoder based on belief propagation (BP) with guided decimation for the purposes of decoding quantum low-density parity-check codes in the presence of circuit-level noise. Windowed decoding keeps the decoding complexity reasonable when, as is typically the case, repeated rounds of syndrome extraction are required to decode. Within each window, we employ several rounds of BP with decimation of the variable node that we expect to be the most likely to flip in each round. Furthermore, we employ ensemble decoding to keep both decimation options (guesses) open in a small number of chosen rounds. We term the resulting decoder BP with guided decimation guessing (GDG). Applied to bivariate bicycle codes, GDG achieves a logical error rate similar to that of BP with an additional OSD post-processing stage (BP+OSD) and combination-sweep of order 10. For a window size of three syndrome cycles, a multi-threaded CPU implementation of GDG achieves a worst-case decoding latency of 3 ms per window for the [[144,12,12]] code.

Submitted 27 March, 2024; originally announced March 2024.

Comments: 8+4 pages, 7 figures. The source code for the simulations in this work is available online at http://github.com/gongaa/SlidingWindowDecoder

arXiv:2312.02601 [pdf, other]

A Neural Receiver for 5G NR Multi-user MIMO

Authors: Sebastian Cammerer, Fayçal Aït Aoudia, Jakob Hoydis, Andreas Oeldemann, Andreas Roessler, Timo Mayer, Alexander Keller

Abstract: We introduce a neural network (NN)-based multiuser multiple-input multiple-output (MU-MIMO) receiver with 5G New Radio (5G NR) physical uplink shared channel (PUSCH) compatibility. The NN architecture is based on convolution layers to exploit the time and frequency correlation of the channel and a graph neural network (GNN) to handle multiple users. The proposed architecture adapts to an arbitrary number of sub-carriers and supports a varying number of multiple-input multiple-output (MIMO) layers and users without the need for any retraining. The receiver operates on an entire 5G NR slot, i.e., processes the entire received orthogonal frequency division multiplexing (OFDM) time-frequency resource grid by jointly performing channel estimation, equalization, and demapping. The proposed architecture operates less than 1 dB away from a baseline using linear minimum mean square error (LMMSE) channel estimation with K-best detection but benefits from a significantly lower computational complexity. We show the importance of a carefully designed training process such that the trained receiver is universal for a wide range of different unseen channel conditions. Finally, we demonstrate the results of a hardware-in-the-loop verification based on 3GPP compliant conformance test scenarios.

Submitted 5 December, 2023; originally announced December 2023.

Comments: 6 pages, 6 figures, presented at IEEE Globecom 2023

arXiv:2311.18558 [pdf, other]

Learning Radio Environments by Differentiable Ray Tracing

Authors: Jakob Hoydis, Fayçal Aït Aoudia, Sebastian Cammerer, Florian Euchner, Merlin Nimier-David, Stephan ten Brink, Alexander Keller

Abstract: Ray tracing (RT) is instrumental in 6G research in order to generate spatially-consistent and environment-specific channel impulse responses (CIRs). While acquiring accurate scene geometries is now relatively straightforward, determining material characteristics requires precise calibration using channel measurements. We therefore introduce a novel gradient-based calibration method, complemented by differentiable parametrizations of material properties, scattering and antenna patterns. Our method seamlessly integrates with differentiable ray tracers that enable the computation of derivatives of CIRs with respect to these parameters. Essentially, we approach field computation as a large computational graph wherein parameters are trainable akin to weights of a neural network (NN). We have validated our method using both synthetic data and real-world indoor channel measurements, employing a distributed multiple-input multiple-output (MIMO) channel sounder.

Submitted 30 November, 2023; originally announced November 2023.

Comments: 11 pages, 9 figures

arXiv:2303.11103 [pdf, other]

Sionna RT: Differentiable Ray Tracing for Radio Propagation Modeling

Authors: Jakob Hoydis, Fayçal Aït Aoudia, Sebastian Cammerer, Merlin Nimier-David, Nikolaus Binder, Guillermo Marcus, Alexander Keller

Abstract: Sionna is a GPU-accelerated open-source library for link-level simulations based on TensorFlow. Since release v0.14 it integrates a differentiable ray tracer (RT) for the simulation of radio wave propagation. This unique feature allows for the computation of gradients of the channel impulse response and other related quantities with respect to many system and environment parameters, such as material properties, antenna patterns, array geometries, as well as transmitter and receiver orientations and positions. In this paper, we outline the key components of Sionna RT and showcase example applications such as learning radio materials and optimizing transmitter orientations by gradient descent. While classic ray tracing is a crucial tool for 6G research topics like reconfigurable intelligent surfaces, integrated sensing and communications, as well as user localization, differentiable ray tracing is a key enabler for many novel and exciting research directions, for example, digital twins.

Submitted 19 July, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

Comments: 5 pages, 5 figures, update reflects new features of Sionna RT introduced in release v0.15

arXiv:2207.14742 [pdf, other]

Graph Neural Networks for Channel Decoding

Authors: Sebastian Cammerer, Jakob Hoydis, Fayçal Aït Aoudia, Alexander Keller

Abstract: In this work, we propose a fully differentiable graph neural network (GNN)-based architecture for channel decoding and showcase a competitive decoding performance for various coding schemes, such as low-density parity-check (LDPC) and BCH codes. The idea is to let a neural network (NN) learn a generalized message passing algorithm over a given graph that represents the forward error correction (FEC) code structure by replacing node and edge message updates with trainable functions. Contrary to many other deep learning-based decoding approaches, the proposed solution enjoys scalability to arbitrary block lengths and the training is not limited by the curse of dimensionality. We benchmark our proposed decoder against the state of the art in conventional channel decoding as well as against recent deep learning-based results. For the (63,45) BCH code, our solution outperforms weighted belief propagation (BP) decoding by approximately 0.4 dB with significantly fewer decoding iterations, and even for 5G NR LDPC codes, we observe a competitive performance when compared to conventional BP decoding. For the BCH codes, the resulting GNN decoder can be fully parametrized with only 9640 weights.

Submitted 12 October, 2022; v1 submitted 29 July, 2022; originally announced July 2022.

Comments: Source code is available online https://github.com/NVlabs/gnn-decoder

arXiv:2207.05699 [pdf, other]

Learning Joint Detection, Equalization and Decoding for Short-Packet Communications

Authors: Sebastian Dörner, Jannis Clausius, Sebastian Cammerer, Stephan ten Brink

Abstract: We propose and practically demonstrate a joint detection and decoding scheme for short-packet wireless communications in scenarios that require first detecting the presence of a message before actually decoding it. For this, we extend the recently proposed serial Turbo-autoencoder neural network (NN) architecture and train it to find short messages that can be, all "at once", detected, synchronized, equalized and decoded when sent over an unsynchronized channel with memory. The conceptual advantage of the proposed system stems from a holistic message structure with superimposed pilots for joint detection and decoding without relying on a dedicated preamble. This results not only in higher spectral efficiency, but also translates into the possibility of shorter messages compared to using a dedicated preamble. We compare the detection error rate (DER), bit error rate (BER) and block error rate (BLER) performance of the proposed system with a hand-crafted state-of-the-art conventional baseline, and our simulations show a significant advantage of the proposed autoencoder-based system over the conventional baseline in every scenario up to messages conveying k = 96 information bits. Finally, we practically evaluate and confirm the improved performance of the proposed system over-the-air (OTA) using a software-defined radio (SDR)-based measurement testbed.

Submitted 12 July, 2022; originally announced July 2022.

Comments: Submitted to IEEE TCOM

arXiv:2205.10805 [pdf, other]

Deep Learning-Based Synchronization for Uplink NB-IoT

Authors: Fayçal Aït Aoudia, Jakob Hoydis, Sebastian Cammerer, Matthijs Van Keirsbilck, Alexander Keller

Abstract: We propose a neural network (NN)-based algorithm for device detection and time of arrival (ToA) and carrier frequency offset (CFO) estimation for the narrowband physical random-access channel (NPRACH) of narrowband internet of things (NB-IoT). The introduced NN architecture leverages residual convolutional networks as well as knowledge of the preamble structure of the 5G New Radio (5G NR) specifications. Benchmarking on a 3rd Generation Partnership Project (3GPP) urban microcell (UMi) channel model with random drops of users against a state-of-the-art baseline shows that the proposed method enables up to 8 dB gains in false negative rate (FNR) as well as significant gains in false positive rate (FPR) and ToA and CFO estimation accuracy. Moreover, our simulations indicate that the proposed algorithm enables gains over a wide range of channel conditions, CFOs, and transmission probabilities. The introduced synchronization method operates at the base station (BS) and, therefore, introduces no additional complexity on the user devices. It could lead to an extension of battery lifetime by reducing the preamble length or the transmit power. Our code is available at: https://github.com/NVlabs/nprach_synch/.

Submitted 29 July, 2022; v1 submitted 22 May, 2022; originally announced May 2022.

Comments: Our code is available at: https://github.com/NVlabs/nprach_synch/

arXiv:2203.13571 [pdf, other]

Adaptive Neural Network-based OFDM Receivers

Authors: Moritz Benedikt Fischer, Sebastian Dörner, Sebastian Cammerer, Takayuki Shimizu, Hongsheng Lu, Stephan ten Brink

Abstract: We propose and examine the idea of continuously adapting state-of-the-art neural network (NN)-based orthogonal frequency division multiplex (OFDM) receivers to current channel conditions. This online adaptation via retraining is mainly motivated by two reasons: First, receiver design typically focuses on the universal optimal performance for a wide range of possible channel realizations. However, in actual applications and within short time intervals, only a subset of these channel parameters is likely to occur, as macro parameters, e.g., the maximum channel delay, can be assumed to be static. Second, in-the-field alterations like temporal interferences or other conditions outside the originally intended specifications can occur in a practical (real-world) transmission. While conventional (filter-based) systems would require reconfiguration or additional signal processing to cope with these unforeseen conditions, NN-based receivers can learn to mitigate previously unseen effects even after their deployment. For this, we showcase on-the-fly adaptation to current channel conditions and temporal alterations solely based on recovered labels from an outer forward error correction (FEC) code without any additional piloting overhead. To underline the flexibility of the proposed adaptive training, we showcase substantial gains for scenarios with static channel macro parameters, for out-of-specification usage and for interference compensation.

Submitted 21 July, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

Comments: Submitted to SPAWC 2022

arXiv:2203.11854 [pdf, other]

Sionna: An Open-Source Library for Next-Generation Physical Layer Research

Authors: Jakob Hoydis, Sebastian Cammerer, Fayçal Ait Aoudia, Avinash Vem, Nikolaus Binder, Guillermo Marcus, Alexander Keller

Abstract: Sionna is a GPU-accelerated open-source library for link-level simulations based on TensorFlow. It enables the rapid prototyping of complex communication system architectures and provides native support for the integration of neural networks. Sionna implements a wide breadth of carefully tested state-of-the-art algorithms that can be used for benchmarking and end-to-end performance evaluation. This allows researchers to focus on their research, making it more impactful and reproducible, while saving time implementing components outside their area of expertise. This white paper provides a brief introduction to Sionna, explains its design principles and features, as well as future extensions, such as integrated ray tracing and custom CUDA kernels. We believe that Sionna is a valuable tool for research on next-generation communication systems, such as 6G, and we welcome contributions from our community.

Submitted 20 March, 2023; v1 submitted 22 March, 2022; originally announced March 2022.

Comments: 5 pages, 1 figure, 4 code listings

arXiv:2107.12613 [pdf, ps, other]

Iterative Reed-Muller Decoding

Authors: Marvin Geiselhart, Ahmed Elkelesh, Moustafa Ebada, Sebastian Cammerer, Stephan ten Brink

Abstract: Reed-Muller (RM) codes are known for their good maximum likelihood (ML) performance in the short block-length regime. Despite being one of the oldest classes of channel codes, finding a low complexity soft-input decoding scheme is still an open problem. In this work, we present a belief propagation (BP) decoding architecture for RM codes based on their rich automorphism group. The decoding algorithm can be seen as a generalization of multiple-bases belief propagation (MBBP) using polar BP as constituent decoders. We provide extensive error-rate performance simulations and compare our results to existing decoding schemes. We report a near-ML performance for the RM(3,7)-code (e.g., 0.05 dB away from the ML bound at BLER of $10^{-4}$) at a competitive computational cost. To the best of our knowledge, our proposed decoder achieves the best performance of all iterative RM decoders presented thus far.

Submitted 27 July, 2021; originally announced July 2021.

Comments: 5 pages, accepted for publication at the International Symposium on Topics in Coding 2021 (ISTC), Sep. 2021. Short version of arXiv:2012.07635

arXiv:2104.14234 [pdf, other]

Serial vs. Parallel Turbo-Autoencoders and Accelerated Training for Learned Channel Codes

Authors: Jannis Clausius, Sebastian Dörner, Sebastian Cammerer, Stephan ten Brink

Abstract: Attracted by its scalability towards practical codeword lengths, we revisit the idea of Turbo-autoencoders for end-to-end learning of PHY-Layer communications. For this, we study the existing concepts of Turbo-autoencoders from the literature and compare the concept with state-of-the-art classical coding schemes. We propose a new component-wise training algorithm based on the idea of Gaussian a priori distributions that reduces the overall training time by almost an order of magnitude. Further, we propose a new serial architecture inspired by classical serially concatenated Turbo code structures and show that a carefully optimized interface between the two component autoencoders is required. To the best of our knowledge, these serial Turbo autoencoder structures are the best known neural network-based learned sequences that can be trained from scratch without any required expert knowledge in the domain of channel codes.

Submitted 22 July, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

Comments: Submitted to ISTC 2021

arXiv:2102.03163 [pdf, other]

Wiener Filter versus Recurrent Neural Network-based 2D-Channel Estimation for V2X Communications

Authors: Moritz Benedikt Fischer, Sebastian Dörner, Sebastian Cammerer, Takayuki Shimizu, Bin Cheng, Hongsheng Lu, Stephan ten Brink

Abstract: We compare the potential of neural network (NN)-based channel estimation with classical linear minimum mean square error (LMMSE)-based estimators, also known as Wiener filtering. For this, we propose a low-complexity recurrent neural network (RNN)-based estimator that allows channel equalization of a sequence of channel observations based on independent time- and frequency-domain long short-term memory (LSTM) cells. Motivated by Vehicle-to-Everything (V2X) applications, we simulate time- and frequency-selective channels with orthogonal frequency division multiplex (OFDM) and extend our channel models in such a way that a continuous degradation from line-of-sight (LoS) to non-line-of-sight (NLoS) conditions can be emulated. It turns out that the NN-based system can not only compete with the LMMSE equalizer, but can also be trained with respect to resilience against system parameter mismatch. We thereby showcase the conceptual simplicity of such a data-driven system design, as this not only enables more robustness against, e.g., signal-to-noise-ratio (SNR) or Doppler spread estimation mismatches, but also allows the same equalizer to be used over a wider range of input parameters without the need to rebuild (or re-estimate) the filter coefficients. Particular attention has been paid to ensure compatibility with the existing IEEE 802.11p piloting scheme for V2X communications. Finally, feeding the payload data symbols as additional equalizer input unleashes further performance gains. We show significant gains over the conventional LMMSE equalization for highly dynamic channel conditions if such a data-augmented equalization scheme is used.

Submitted 21 May, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

Comments: Accepted at 32nd IEEE Intelligent Vehicles Symposium

arXiv:2101.09679 [pdf, ps, other]

On the Automorphism Group of Polar Codes

Authors: Marvin Geiselhart, Ahmed Elkelesh, Moustafa Ebada, Sebastian Cammerer, Stephan ten Brink

Abstract: The automorphism group of a code is the set of permutations of the codeword symbols that map the whole code onto itself. For polar codes, only a part of the automorphism group was known, namely the lower-triangular affine group (LTA), which is solely based upon the partial order of the code's synthetic channels. Depending on the design, however, polar codes can have a richer set of automorphisms. In this paper, we extend the LTA to a larger subgroup of the general affine group (GA), namely the block lower-triangular affine group (BLTA) and show that it is contained in the automorphism group of polar codes. Furthermore, we provide a low complexity algorithm for finding this group for a given information/frozen set and determining its size. Most importantly, we apply these findings in automorphism-based decoding of polar codes and report a comparable error-rate performance to that of successive cancellation list (SCL) decoding with significantly lower complexity.

Submitted 11 May, 2021; v1 submitted 24 January, 2021; originally announced January 2021.

Comments: 7 Pages, extended version of ISIT 2021 submission

arXiv:2012.07635 [pdf, other]

Automorphism Ensemble Decoding of Reed-Muller Codes

Authors: Marvin Geiselhart, Ahmed Elkelesh, Moustafa Ebada, Sebastian Cammerer, Stephan ten Brink

Abstract: Reed-Muller (RM) codes are known for their good maximum likelihood (ML) performance in the short block-length regime. Despite being one of the oldest classes of channel codes, finding a low complexity soft-input decoding scheme is still an open problem. In this work, we present a versatile decoding architecture for RM codes based on their rich automorphism group. The decoding algorithm can be seen as a generalization of multiple-bases belief propagation (MBBP) and may use any polar or RM decoder as constituent decoders. We provide extensive error-rate performance simulations for successive cancellation (SC)-, SC-list (SCL)- and belief propagation (BP)-based constituent decoders. We furthermore compare our results to existing decoding schemes and report a near-ML performance for the RM(3,7)-code (e.g., 0.04 dB away from the ML bound at BLER of $10^{-3}$) at a competitive computational cost. Moreover, we provide some insights into the automorphism subgroups of RM codes and SC decoding and, thereby, prove the theoretical limitations of this method with respect to polar codes.

Submitted 16 July, 2021; v1 submitted 14 December, 2020; originally announced December 2020.

Comments: Accepted for Publication in IEEE Transactions on Communications

arXiv:2012.01075 [pdf, other]

Iterative Detection and Decoding of Finite-Length Polar Codes in Gaussian Multiple Access Channels

Authors: Moustafa Ebada, Sebastian Cammerer, Ahmed Elkelesh, Marvin Geiselhart, Stephan ten Brink

Abstract: We consider the usage of finite-length polar codes for the Gaussian multiple access channel (GMAC) with a finite number of users. Based on the interleave-division multiple access (IDMA) concept, we implement an iterative detection and decoding non-orthogonal multiple access (NOMA) receiver that benefits from a low complexity, while scaling (almost) linearly with the number of active users. We further show the conceptual simplicity of the belief propagation (BP)-based decoder in a step-by-step illustration of its construction. Beyond its conceptual simplicity, this approach benefits from an improved performance when compared to some recent work tackling the same problem, namely the setup of finite-length forward error-correction (FEC) codes for a finite number of users. We consider the 5th generation mobile communication (5G) polar code with a block length $N = 512$ applied to both a two-user and a four-user GMAC scenario with a sum-rate of $R_{sum} = 0.5$ and $R_{sum} = 1$, respectively. Simulation results show that a BP-based soft interference cancellation (SoIC) receiver outperforms a joint successive cancellation (JSC) scheme. Finally, we investigate the effect of a concatenated repetition code which suggests that alternative polar code design rules are required in multi-user scenarios.

Submitted 3 December, 2020; v1 submitted 2 December, 2020; originally announced December 2020.

Comments: Asilomar 2020

arXiv:2006.15027 [pdf, other]

Deep-learning Autoencoder for Coherent and Nonlinear Optical Communication

Authors: Tim Uhlemann, Sebastian Cammerer, Alexander Span, Sebastian Dörner, Stephan ten Brink

Abstract: Motivated by the recent success of end-to-end training of communications in the wireless domain, we strive to adapt the end-to-end-learning idea from the wireless case (i.e., linear) to coherent optical fiber links (i.e., nonlinear). Although, at first glance, it sounds like a straightforward extension, it turns out that several pitfalls exist - in terms of theory but also in terms of practical implementation. This paper analyzes the potential and limitations of an autoencoder for the optical fiber under the influence of Kerr-nonlinearity and chromatic dispersion. As there is no exact capacity limit known and, hence, no analytical perfect system solution available, we place great value on the interpretability of what the autoencoder learns. Therefore, we design its architecture to be as close as possible to the structure of a classic communication system, knowing that this may limit its degrees of freedom and, thus, its performance. Nevertheless, we were able to achieve an unexpectedly high gain in terms of spectral efficiency compared to a conventional reference system.

Submitted 29 June, 2020; v1 submitted 26 June, 2020; originally announced June 2020.

Comments: Accepted (21.02.2020) for presentation at the 21st IEEE/ITG-Symposium on Photonic Networks, Leipzig, Germany, 13-14.05.2020

arXiv:2003.02744 [pdf, other]

WGAN-based Autoencoder Training Over-the-air

Authors: Sebastian Dörner, Marcus Henninger, Sebastian Cammerer, Stephan ten Brink

Abstract: The practical realization of end-to-end training of communication systems is fundamentally limited by the accessibility of the channel gradient. To overcome this major burden, the idea of generative adversarial networks (GANs) that learn to mimic the actual channel behavior has been recently proposed in the literature. Contrary to handcrafted classical channel modeling, which can never fully capture the real world, GANs promise, in principle, the ability to learn any physical impairment, enabled by the data-driven learning algorithm. In this work, we verify the concept of GAN-based autoencoder training in actual over-the-air (OTA) measurements. To improve training stability, we first extend the concept to conditional Wasserstein GANs and embed it into a state-of-the-art autoencoder-architecture, including bitwise estimates and an outer channel code. Further, in the same framework, we compare three existing training approaches: model-based pre-training with receiver finetuning, reinforcement learning (RL) and GAN-based channel modeling. For this, we show advantages and limitations of GAN-based end-to-end training. In particular, for non-linear effects, it turns out that learning the whole exploration space becomes prohibitively complex. Finally, we show that the training strategy benefits from a simpler (training) data acquisition when compared to RL-based training, which requires continuous transmitter weight updates. This becomes an important practical bottleneck due to limited bandwidth and latency between transmitter and training algorithm that may even operate at physically different locations.

Submitted 5 March, 2020; originally announced March 2020.

Comments: Submitted to IEEE SPAWC

arXiv:2001.05303 [pdf, other]

CRC-Aided Belief Propagation List Decoding of Polar Codes

Authors: Marvin Geiselhart, Ahmed Elkelesh, Moustafa Ebada, Sebastian Cammerer, Stephan ten Brink

Abstract: Although iterative decoding of polar codes has recently made huge progress based on the idea of permuted factor graphs, it still suffers from a non-negligible performance degradation when compared to state-of-the-art CRC-aided successive cancellation list (CA-SCL) decoding. In this work, we show that iterative decoding of polar codes based on the belief propagation list (BPL) algorithm can approach the error-rate performance of CA-SCL decoding and, thus, can be efficiently used for decoding the standardized 5G polar codes. Rather than only utilizing the cyclic redundancy check (CRC) as a stopping condition (i.e., for error-detection), we also aim to benefit from the error-correction capabilities of the outer CRC code. For this, we develop two distinct soft-decision CRC decoding algorithms: a Bahl-Cocke-Jelinek-Raviv (BCJR)-based approach and a sum product algorithm (SPA)-based approach. Further, an optimized selection of permuted factor graphs is analyzed and shown to reduce the decoding complexity significantly. Finally, we benchmark the proposed CRC-aided belief propagation list (CA-BPL) to state-of-the-art 5G polar codes under CA-SCL decoding and, thereby, showcase an error-rate performance not just close to the CA-SCL but also close to the maximum likelihood (ML) bound as estimated by ordered statistic decoding (OSD).

Submitted 17 January, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

Comments: Submitted to IEEE for possible publication

arXiv:1911.13055 [pdf, other]

Trainable Communication Systems: Concepts and Prototype

Authors: Sebastian Cammerer, Fayçal Ait Aoudia, Sebastian Dörner, Maximilian Stark, Jakob Hoydis, Stephan ten Brink

Abstract: We consider a trainable point-to-point communication system, where both transmitter and receiver are implemented as neural networks (NNs), and demonstrate that training on the bit-wise mutual information (BMI) allows seamless integration with practical bit-metric decoding (BMD) receivers, as well as joint optimization of constellation shaping and labeling. Moreover, we present a fully differentiable neural iterative demapping and decoding (IDD) structure which achieves significant gains on additive white Gaussian noise (AWGN) channels using a standard 802.11n low-density parity-check (LDPC) code. The strength of this approach is that it can be applied to arbitrary channels without any modifications. Going one step further, we show that careful code design can lead to further performance improvements. Lastly, we show the viability of the proposed system through implementation on software-defined radios (SDRs) and training of the end-to-end system on the actual wireless channel. Experimental results reveal that the proposed method enables significant gains compared to conventional techniques.

Submitted 5 June, 2020; v1 submitted 29 November, 2019; originally announced November 2019.

Comments: submitted to IEEE TCOM

arXiv:1909.12035 [pdf, ps, other]

Deep Learning-based Polar Code Design

Authors: Moustafa Ebada, Sebastian Cammerer, Ahmed Elkelesh, Stephan ten Brink

Abstract: In this work, we introduce a deep learning-based polar code construction algorithm. The core idea is to represent the information/frozen bit indices of a polar code as a binary vector which can be interpreted as trainable weights of a neural network (NN). For this, we demonstrate how this binary vector can be relaxed to a soft-valued vector, facilitating the learning process through gradient descent and enabling an efficient code construction. We further show how different polar code design constraints (e.g., code rate) can be taken into account by means of careful binary-to-soft and soft-to-binary conversions, along with rate-adjustment after each learning iteration. Besides its conceptual simplicity, this approach benefits from having the "decoder-in-the-loop", i.e., the nature of the decoder is inherently taken into consideration while learning (designing) the polar code. We show results for belief propagation (BP) decoding over both AWGN and Rayleigh fading channels with considerable performance gains over state-of-the-art construction schemes.

Submitted 27 September, 2019; v1 submitted 26 September, 2019; originally announced September 2019.

Comments: Allerton2019

arXiv:1905.11858 [pdf, other]

Towards Practical Indoor Positioning Based on Massive MIMO Systems

Authors: Mark Widmaier, Maximilian Arnold, Sebastian Dörner, Sebastian Cammerer, Stephan ten Brink

Abstract: We showcase the practicability of an indoor positioning system (IPS) solely based on Neural Networks (NNs) and the channel state information (CSI) of a (Massive) multiple-input multiple-output (MIMO) communication system, i.e., only built on the basis of data that already exists in today's systems. As such, our IPS promises good accuracy without the need for any additional protocol/signaling overhead for the user localization task. In particular, we propose a tailored NN structure with an additional phase branch as feature extractor and (compared to previous results) a significantly reduced number of trainable parameters, leading to a minimization of the amount of required training data. We provide actual measurements for indoor scenarios with up to 64 antennas covering a large area of 80 m². In the second part, several robustness investigations for real measurements are conducted, i.e., once trained, we analyze the recall accuracy over a time period of several days. Further, we analyze the impact of pedestrians walking in between the measurements and show that finetuning and pre-training of the NN helps to mitigate effects of hardware drifts and alterations in the propagation environment over time. This reduces the number of required training samples at equal precision and, thereby, decreases the effort of the costly training data acquisition.

Submitted 28 May, 2019; originally announced May 2019.

Comments: Submitted to VTC2019 Fall

arXiv:1905.09983 [pdf, ps, other]

On Recurrent Neural Networks for Sequence-based Processing in Communications

Authors: Daniel Tandler, Sebastian Dörner, Sebastian Cammerer, Stephan ten Brink

Abstract: In this work, we analyze the capabilities and practical limitations of neural networks (NNs) for sequence-based signal processing, which can be seen as an omnipresent property in almost any modern communication system. In particular, we train multiple state-of-the-art recurrent neural network (RNN) structures to learn how to decode convolutional codes, allowing a clear benchmarking with the corresponding maximum likelihood (ML) Viterbi decoder. We examine the decoding performance for various kinds of NN architectures, beginning with classical types like feedforward layers and gated recurrent unit (GRU)-layers, up to more recently introduced architectures such as temporal convolutional networks (TCNs) and differentiable neural computers (DNCs) with external memory. As a key limitation, it turns out that the training complexity increases exponentially with the length of the encoding memory $ν$ and, thus, practically limits the achievable bit error rate (BER) performance. To overcome this limitation, we introduce a new training method by gradually increasing the number of ones within the training sequences, i.e., we constrain the number of possible training sequences in the beginning until first convergence. By consecutively adding more and more possible sequences to the training set, we finally achieve training success in cases that did not converge before via naive training. Further, we show that our network can learn to jointly detect and decode a quadrature phase shift keying (QPSK) modulated code with sub-optimal (anti-Gray) labeling in one-shot at a performance that would require iterations between demapper and decoder in classic detection schemes.

Submitted 21 November, 2019; v1 submitted 23 May, 2019; originally announced May 2019.

Comments: Presented at Asilomar Conf. 2019
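
The training method described in this abstract (starting with sequences that contain few ones and gradually admitting more) can be illustrated with a small data-generation sketch. This is not the authors' code. The function names, the fixed `schedule` of weight limits, and the uniform choice of the number of ones are all assumptions made for illustration; the abstract instead advances the curriculum after "first convergence", which is replaced here by a fixed schedule.

```python
import random


def sample_sequence(length, max_ones, rng):
    """Draw one binary sequence of the given length with at most `max_ones` ones.

    Assumption: the number of ones is drawn uniformly from 0..max_ones,
    which is NOT uniform over all admissible sequences.
    """
    weight = rng.randint(0, min(max_ones, length))
    positions = rng.sample(range(length), weight)
    sequence = [0] * length
    for position in positions:
        sequence[position] = 1
    return sequence


def curriculum_batches(length, batch_size, schedule, seed=0):
    """Yield (max_ones, batch) pairs, one per curriculum stage.

    `schedule` is a hypothetical list of increasing weight limits; a real
    training loop would move to the next stage once the loss has converged.
    """
    rng = random.Random(seed)
    for max_ones in schedule:
        batch = [sample_sequence(length, max_ones, rng) for _ in range(batch_size)]
        yield max_ones, batch


if __name__ == "__main__":
    for max_ones, batch in curriculum_batches(length=8, batch_size=4, schedule=[1, 2, 4]):
        print(max_ones, batch)
```

Each batch would then be encoded with the convolutional code and passed to the network as training data; that step is omitted here.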

arXiv:1903.03128 [pdf, other]

doi 10.1109/ACCESS.2019.2942999

Decoder-in-the-Loop: Genetic Optimization-based LDPC Code Design

Authors: Ahmed Elkelesh, Moustafa Ebada, Sebastian Cammerer, Laurent Schmalen, Stephan ten Brink

Abstract: LDPC code design tools typically rely on asymptotic code behavior and are affected by an unavoidable performance degradation due to model imperfections in the short length regime. We propose an LDPC code design scheme based on an evolutionary algorithm, the Genetic Algorithm (GenAlg), implementing a "decoder-in-the-loop" concept. It inherently takes into consideration the channel, code length and the number of iterations while optimizing the error-rate of the actual decoder hardware architecture. We construct short length LDPC codes (i.e., the parity-check matrix) with error-rate performance comparable to, or even outperforming that of well-designed standardized short length LDPC codes over both AWGN and Rayleigh fading channels. Our proposed algorithm can be used to design LDPC codes with special graph structures (e.g., accumulator-based codes) to facilitate the encoding step, or to satisfy any other practical requirement. Moreover, GenAlg can be used to design LDPC codes with the aim of reducing decoding latency and complexity, leading to coding gains of up to $0.325$ dB and $0.8$ dB at BLER of $10^{-5}$ for both AWGN and Rayleigh fading channels, respectively, when compared to state-of-the-art short LDPC codes. Also, we analyze what can be learned from the resulting codes and, as such, the GenAlg particularly highlights design paradigms of short length LDPC codes (e.g., codes with degree-1 variable nodes obtain very good results).

Submitted 25 September, 2019; v1 submitted 7 March, 2019; originally announced March 2019.

Comments: in IEEE Access, 2019

arXiv:1901.10464 [pdf, other]

Decoder-tailored Polar Code Design Using the Genetic Algorithm

Authors: Ahmed Elkelesh, Moustafa Ebada, Sebastian Cammerer, Stephan ten Brink

Abstract: We propose a new framework for constructing polar codes (i.e., selecting the frozen bit positions) for arbitrary channels, and tailored to a given decoding algorithm, rather than based on the (not necessarily optimal) assumption of successive cancellation (SC) decoding. The proposed framework is based on the Genetic Algorithm (GenAlg), where populations (i.e., collections) of information sets evolve successively via evolutionary transformations based on their individual error-rate performance. These populations converge towards an information set that fits both the decoding behavior and the defined channel. Using our proposed algorithm over the additive white Gaussian noise (AWGN) channel, we construct a polar code of length 2048 with code rate 0.5, without the CRC-aid, tailored to plain successive cancellation list (SCL) decoding, achieving the same error-rate performance as the CRC-aided SCL decoding, and leading to a coding gain of 1 dB at BER of $10^{-6}$. Further, a belief propagation (BP)-tailored construction approaches the SCL error-rate performance without any modifications in the decoding algorithm itself. The performance gains can be attributed to the significant reduction in the total number of low-weight codewords. To demonstrate the flexibility, coding gains for the Rayleigh channel are shown under SCL and BP decoding. Besides improvements in error-rate performance, we show that, when required, the GenAlg can also be set up to reduce the decoding complexity, e.g., the SCL list size or the number of BP iterations can be reduced, while maintaining the same error-rate performance.

Submitted 28 January, 2019; originally announced January 2019.

Comments: This work has been submitted to the IEEE for possible publication. Manuscript submitted September 20, 2018; revised January 28, 2019; date of current version January 28, 2019. arXiv admin note: substantial text overlap with arXiv:1901.06444

arXiv:1901.06444 [pdf, other]

Genetic Algorithm-based Polar Code Construction for the AWGN Channel

Authors: Ahmed Elkelesh, Moustafa Ebada, Sebastian Cammerer, Stephan ten Brink

Abstract: We propose a new polar code construction framework (i.e., selecting the frozen bit positions) for the additive white Gaussian noise (AWGN) channel, tailored to a given decoding algorithm, rather than based on the (not necessarily optimal) assumption of successive cancellation (SC) decoding. The proposed framework is based on the Genetic Algorithm (GenAlg), where populations (i.e., collections) of information sets evolve successively via evolutionary transformations based on their individual error-rate performance. These populations converge towards an information set that fits the decoding behavior. Using our proposed algorithm, we construct a polar code of length 2048 with code rate 0.5, without the CRC-aid, tailored to plain successive cancellation list (SCL) decoding, achieving the same error-rate performance as the CRC-aided SCL decoding, and leading to a coding gain of 1 dB at BER of $10^{-6}$. Further, a belief propagation (BP)-tailored polar code approaches the SCL error-rate performance without any modifications in the decoding algorithm itself.

Submitted 18 January, 2019; originally announced January 2019.

Comments: SCC 2019; 12th International ITG Conference on Systems, Communications and Coding, Rostock, Germany, Feb. 2019

arXiv:1901.05877 [pdf, ps, other]

Spatially Coupled LDPC Codes and the Multiple Access Channel

Authors: Sebastian Cammerer, Xiaojie Wang, Yingyan Ma, Stephan ten Brink

Abstract: We consider spatially coupled low-density parity-check (SC-LDPC) codes within a non-orthogonal interleave division multiple access (IDMA) scheme to avoid cumbersome degree profile matching of the LDPC code components to the iterative multi-user detector (MUD). Besides excellent decoding thresholds, the approach benefits from the possibility of using rather simple and regular underlying block LDPC codes owing to the universal behavior of the resulting coupled code with respect to the channel front-end, i.e., the iterative MUD. Furthermore, an additional outer repetition code makes the scheme flexible to cope with a varying number of users and user rates, as the SC-LDPC itself can be kept constant for a wide range of different user loads. The decoding thresholds are obtained via density evolution (DE) and verified by bit error rate (BER) simulations. To keep decoding complexity and latency small, we introduce a joint iterative windowed detector/decoder imposing carefully adjusted sub-block interleavers. Finally, we show that the proposed coding scheme also works for Rayleigh channels using the same code with tolerable performance loss compared to the additive white Gaussian noise (AWGN) channel.

Submitted 28 January, 2019; v1 submitted 17 January, 2019; originally announced January 2019.

Comments: to be presented at CISS 2019

arXiv:1901.03664 [pdf, other]

Enabling FDD Massive MIMO through Deep Learning-based Channel Prediction

Authors: Maximilian Arnold, Sebastian Dörner, Sebastian Cammerer, Sarah Yan, Jakob Hoydis, Stephan ten Brink

Abstract: A major obstacle for widespread deployment of frequency division duplex (FDD)-based Massive multiple-input multiple-output (MIMO) communications is the large signaling overhead for reporting full downlink (DL) channel state information (CSI) back to the basestation (BS), in order to enable closed-loop precoding. We completely remove this overhead by a deep-learning based channel extrapolation (or "prediction") approach and demonstrate that a neural network (NN) at the BS can infer the DL CSI centered around a frequency $f_\text{DL}$ by solely observing uplink (UL) CSI on a different, yet adjacent frequency band around $f_\text{UL}$; no more pilot/reporting overhead is needed than with a genuine time division duplex (TDD)-based system. The rationale is that scatterers and the large-scale propagation environment are sufficiently similar to allow an NN to learn about the physical connections and constraints between two neighboring frequency bands, and thus provide a well-operating system even when classic extrapolation methods, like the Wiener filter (used as a baseline for comparison throughout), fail. We study its performance for various state-of-the-art Massive MIMO channel models, and, even more so, evaluate the scheme using actual Massive MIMO channel measurements, rendering it to be practically feasible at negligible loss in spectral efficiency when compared to a genuine TDD-based system.

Submitted 8 January, 2019; originally announced January 2019.

Comments: Extended version of the conference paper submitted to SPAWC2019

arXiv:1811.10938 [pdf, other]

Near-Capacity Detection and Decoding: Code Design for Dynamic User Loads in Gaussian Multiple Access Channels

Authors: Xiaojie Wang, Sebastian Cammerer, Stephan ten Brink

Abstract: This paper considers the forward error correction (FEC) code design for approaching the capacity of a dynamic multiple access channel (MAC) where both the number of users and their respective signal powers keep constantly changing, resembling the scenario of an actual wireless cellular system. To obtain a low-complexity non-orthogonal multiple access (NOMA) scheme, we propose a serial concatenation of a low-density parity-check (LDPC) code and a repetition code (REP), this way achieving near Gaussian MAC (GMAC) capacity performance while coping with the dynamics of the MAC system. The joint optimization of the LDPC and REP codes is addressed by matching the analytical extrinsic information transfer (EXIT) functions of the sub-optimal multi-user detector (MUD) and the channel code for a specific and static MAC system, achieving near-GMAC capacity. We show that the near-capacity performance can be flexibly maintained with the same LDPC code regardless of the variations in the number of users and power levels. This flexibility (or elasticity) is provided by the REP code, acting as "user-load and power equalizer", dramatically simplifying the practical implementation of NOMA schemes, as only a single LDPC code is needed to cope with the dynamics of the MAC system.

Submitted 19 July, 2019; v1 submitted 27 November, 2018; originally announced November 2018.

arXiv:1810.01137 [pdf, ps, other]

Avoiding Burst-like Error Patterns in Windowed Decoding of Spatially Coupled LDPC Codes

Authors: Kevin Klaiber, Sebastian Cammerer, Laurent Schmalen, Stephan ten Brink

Abstract: In this work, we analyze efficient window shift schemes for windowed decoding of spatially coupled low-density parity-check (SC-LDPC) codes, which is known to yield close-to-optimal decoding results when compared to full belief propagation (BP) decoding. However, a drawback of windowed decoding is that either a significant number of window updates is required, leading to unnecessarily high decoding complexity, or the decoder suffers from sporadic burst-like error patterns, causing a decoder stall. To tackle this effect and, thus, to reduce the average decoding complexity, the basic idea is to enable adaptive window shifts based on a bit error rate (BER) prediction, which reduces the number of unnecessary updates. As the decoder stall does not occur in analytical investigations such as the density evolution (DE), we examine different schemes on a fixed test-set and exhaustive Monte Carlo simulations based on our graphics processing unit (GPU) simulation framework. As a result, we can reduce the average decoding complexity of the naive windowed decoder while improving the BER performance when compared to a non-adaptive windowed decoding scheme. Furthermore, we show that a foresightful stall prediction does not significantly outperform a retrospective stall detection, which is much easier to implement in practice.

Submitted 2 October, 2018; originally announced October 2018.

Comments: ISTC 2018

arXiv:1807.00747 [pdf, other]

Online Label Recovery for Deep Learning-based Communication through Error Correcting Codes

Authors: Stefan Schibisch, Sebastian Cammerer, Sebastian Dörner, Jakob Hoydis, Stephan ten Brink

Abstract: We demonstrate that error correcting codes (ECCs) can be used to construct a labeled data set for finetuning of "trainable" communication systems without sacrificing resources for the transmission of known symbols. This enables adaptive systems, which can be trained on-the-fly to compensate for slow fluctuations in channel conditions or varying hardware impairments. We examine the influence of corrupted training data and show that it is crucial to train based on correct labels. The proposed method can be applied to fully end-to-end trained communication systems (autoencoders) as well as systems with only some trainable components. This is exemplified by extending a conventional OFDM system with a trainable pre-equalizer neural network (NN) that can be optimized at run time.

Submitted 2 July, 2018; originally announced July 2018.

Comments: accepted for ISWCS 2018

arXiv:1806.10503 [pdf, other]

doi 10.1109/LCOMM.2018.2850772

Belief Propagation List Decoding of Polar Codes

Authors: Ahmed Elkelesh, Moustafa Ebada, Sebastian Cammerer, Stephan ten Brink

Abstract: We propose a belief propagation list (BPL) decoder with comparable performance to the successive cancellation list (SCL) decoder of polar codes, which already achieves the maximum likelihood (ML) bound of polar codes for sufficiently large list size $L$. The proposed decoder is composed of multiple parallel independent belief propagation (BP) decoders based on differently permuted polar code factor graphs. A list of possible transmitted codewords is generated and the one closest to the received vector, in terms of Euclidean distance, is picked. To the best of our knowledge, the proposed BPL decoder provides the best performance of plain polar codes under iterative decoding known so far. The proposed algorithm does not require any changes in the polar code structure itself, rendering the BPL into an alternative to the SCL decoder, equipped with a soft output capability enabling, e.g., iterative detection and decoding to further improve performance. Further benefits are lower decoding latency compared to the SCL decoder and the possibility of high throughput implementations. Additionally, we show that a different selection strategy of frozen bit positions can further enhance the error-rate performance of the proposed decoder.

Submitted 27 June, 2018; originally announced June 2018.

Journal ref: IEEE Communications Letters, vol. 22, no. 8, pp. 1536-1539, Aug. 2018
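
The final selection step named in this abstract (picking, from the list of candidate codewords, the one closest to the received vector in Euclidean distance) can be sketched in a few lines. This is an illustrative sketch, not the paper's implementation. The function name `pick_closest_codeword` and the BPSK mapping (bit 0 to +1, bit 1 to -1) are assumptions; the parallel BP decoders that would produce the candidates are not shown.

```python
def pick_closest_codeword(candidates, received):
    """Return (index, codeword) of the candidate closest to `received`.

    candidates: list of equal-length bit lists, e.g. one per BP decoder.
    received:   list of real-valued channel outputs of the same length.
    Assumption: BPSK mapping, bit 0 -> +1.0 and bit 1 -> -1.0.
    """

    def squared_distance(bits):
        # Squared Euclidean distance between the BPSK image and the received vector.
        return sum(((1.0 - 2.0 * b) - y) ** 2 for b, y in zip(bits, received))

    best_index = min(range(len(candidates)), key=lambda i: squared_distance(candidates[i]))
    return best_index, candidates[best_index]


if __name__ == "__main__":
    candidate_list = [[0, 0, 1, 1], [0, 1, 1, 0], [1, 1, 1, 1]]
    channel_output = [0.9, -0.2, -1.1, 0.3]
    print(pick_closest_codeword(candidate_list, channel_output))
```

Minimizing the squared distance selects the same candidate as minimizing the distance itself, so the square root is skipped.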

arXiv:1804.04826 [pdf, other]

On Deep Learning-based Massive MIMO Indoor User Localization

Authors: Maximilian Arnold, Sebastian Dörner, Sebastian Cammerer, Stephan ten Brink

Abstract: We examine the usability of deep neural networks for multiple-input multiple-output (MIMO) user positioning solely based on the orthogonal frequency division multiplex (OFDM) complex channel coefficients. In contrast to other indoor positioning systems (IPSs), the proposed method does not require any additional piloting overhead or any other changes in the communications system itself as it is deployed on top of an existing OFDM MIMO system. Supported by actual measurements, we are mainly interested in the more challenging non-line of sight (NLoS) scenario. However, gradient descent optimization is known to require a large amount of data-points for training, i.e., the required database would be too large when compared to conventional methods. Thus, we propose a two-step training procedure, with training on simulated line of sight (LoS) data in the first step, and finetuning on measured NLoS positions in the second step. This turns out to reduce the required measured training positions and thus, reduces the effort for data acquisition.

Submitted 13 April, 2018; originally announced April 2018.

Comments: submitted to SPAWC 2018

arXiv:1803.05815 [pdf, other]

OFDM-Autoencoder for End-to-End Learning of Communications Systems

Authors: Alexander Felix, Sebastian Cammerer, Sebastian Dörner, Jakob Hoydis, Stephan ten Brink

Abstract: We extend the idea of end-to-end learning of communications systems through deep neural network (NN)-based autoencoders to orthogonal frequency division multiplexing (OFDM) with cyclic prefix (CP). Our implementation has the same benefits as a conventional OFDM system, namely single-tap equalization and robustness against sampling synchronization errors, which turned out to be one of the major challenges in previous single-carrier implementations. This enables reliable communication over multipath channels and makes the communication scheme suitable for commodity hardware with imprecise oscillators. We show that the proposed scheme can be realized with state-of-the-art deep learning software libraries as transmitter and receiver solely consist of differentiable layers required for gradient-based training. We compare the performance of the autoencoder-based system against that of a state-of-the-art OFDM baseline over frequency-selective fading channels. Finally, the impact of a non-linear amplifier is investigated and we show that the autoencoder inherently learns how to deal with such hardware impairments.

Submitted 15 March, 2018; originally announced March 2018.

Comments: submitted to SPAWC 2018

arXiv:1801.04299 [pdf, other]

Belief Propagation Decoding of Polar Codes on Permuted Factor Graphs

Authors: Ahmed Elkelesh, Moustafa Ebada, Sebastian Cammerer, Stephan ten Brink

Abstract: We show that the performance of iterative belief propagation (BP) decoding of polar codes can be enhanced by decoding over different carefully chosen factor graph realizations. With a genie-aided stopping condition, it can achieve the successive cancellation list (SCL) decoding performance which has already been shown to achieve the maximum likelihood (ML) bound provided that the list size is sufficiently large. The proposed decoder is based on different realizations of the polar code factor graph with randomly permuted stages during decoding. Additionally, a different way of visualizing the polar code factor graph is presented, facilitating the analysis of the underlying factor graph and the comparison of different graph permutations. In our proposed decoder, a high rate Cyclic Redundancy Check (CRC) code is concatenated with a polar code and used as an iteration stopping criterion (i.e., genie) to even outperform the SCL decoder of the plain polar code (without the CRC-aid). Although our permuted factor graph-based decoder does not outperform the SCL-CRC decoder, it achieves, to the best of our knowledge, the best performance of all iterative polar decoders presented thus far.

Submitted 6 April, 2018; v1 submitted 12 January, 2018; originally announced January 2018.

Comments: in IEEE Wireless Commun. and Networking Conf. (WCNC), April 2018

arXiv:1712.08538 [pdf, other]

Sparse Graphs for Belief Propagation Decoding of Polar Codes

Authors: Sebastian Cammerer, Moustafa Ebada, Ahmed Elkelesh, Stephan ten Brink

Abstract: We describe a novel approach to interpret a polar code as a low-density parity-check (LDPC)-like code with an underlying sparse decoding graph. This sparse graph is based on the encoding factor graph of polar codes and is suitable for conventional belief propagation (BP) decoding. We discuss several pruning techniques based on the check node decoder (CND) and variable node decoder (VND) update equations, significantly reducing the size (i.e., decoding complexity) of the parity-check matrix. As a result, iterative polar decoding can then be conducted on a sparse graph, akin to the traditional well-established LDPC decoding, e.g., using a fully parallel sum-product algorithm (SPA). This facilitates the systematic analysis and design of polar codes using the well-established tools known from analyzing LDPC codes. We show that the proposed iterative polar decoder has a negligible performance loss for short-to-intermediate codelengths compared to Arikan's original BP decoder. Finally, the proposed decoder is shown to benefit from both reduced complexity and reduced memory requirements and, thus, is more suitable for hardware implementations.

Submitted 14 May, 2018; v1 submitted 22 December, 2017; originally announced December 2017.

Comments: IEEE International Symposium on Information Theory (ISIT) 2018

arXiv:1711.05572 [pdf, other]

Mitigating Clipping Effects on Error Floors under Belief Propagation Decoding of Polar Codes

Authors: Ahmed Elkelesh, Sebastian Cammerer, Moustafa Ebada, Stephan ten Brink

Abstract: In this work, we show that polar belief propagation (BP) decoding exhibits an error floor behavior which is caused by clipping of the log-likelihood ratios (LLR). The error floor becomes more pronounced for clipping to smaller LLR-values. We introduce a single-value measure quantifying a "relative error floor", showing, by exhaustive simulations for different lengths, that the error floor is mainly caused by inadequate clipping values. We propose four modifications to the conventional BP decoding algorithm to mitigate this error floor behavior, demonstrating that the error floor is a decoder property, and not a code property. The results agree with the fact that polar codes are theoretically proven to not suffer from error floors. Finally, we show that another cause of error floors can be an improper selection of frozen bit positions.

Submitted 28 November, 2017; v1 submitted 15 November, 2017; originally announced November 2017.

Comments: ISWCS2017

arXiv:1707.03384 [pdf, other]

doi 10.1109/JSTSP.2017.2784180

Deep Learning-Based Communication Over the Air

Authors: Sebastian Dörner, Sebastian Cammerer, Jakob Hoydis, Stephan ten Brink

Abstract: End-to-end learning of communications systems is a fascinating novel concept that has so far only been validated by simulations for block-based transmissions. It allows learning of transmitter and receiver implementations as deep neural networks (NNs) that are optimized for an arbitrary differentiable end-to-end performance metric, e.g., block error rate (BLER). In this paper, we demonstrate that over-the-air transmissions are possible: We build, train, and run a complete communications system solely composed of NNs using unsynchronized off-the-shelf software-defined radios (SDRs) and open-source deep learning (DL) software libraries. We extend the existing ideas towards continuous data transmission which eases their current restriction to short block lengths but also entails the issue of receiver synchronization. We overcome this problem by introducing a frame synchronization module based on another NN. A comparison of the BLER performance of the "learned" system with that of a practical baseline shows competitive performance close to 1 dB, even without extensive hyperparameter tuning. We identify several practical challenges of training such a system over actual channels, in particular the missing channel gradient, and propose a two-step learning procedure based on the idea of transfer learning that circumvents this issue.

Submitted 11 July, 2017; originally announced July 2017.

arXiv:1706.09239 [pdf, other]

Scattered EXIT Charts for Finite Length LDPC Code Design

Authors: Moustafa Ebada, Ahmed Elkelesh, Sebastian Cammerer, Stephan ten Brink

Abstract: We introduce the Scattered Extrinsic Information Transfer (S-EXIT) chart as a tool for optimizing degree profiles of short length Low-Density Parity-Check (LDPC) codes under iterative decoding. As degree profile optimization is typically done in the asymptotic length regime, there is room for further improvement when considering the finite length behavior. We propose to consider the average extrinsic information as a random variable, exploiting its specific distribution properties for guiding code design. We explain, step-by-step, how to generate an S-EXIT chart for short-length LDPC codes. We show that this approach achieves gains in terms of bit error rate (BER) of 0.5 dB and 0.6 dB over the additive white Gaussian noise (AWGN) channel for codeword lengths of 128 and 180 bits, respectively, at a target BER of $10^{-4}$ when compared to conventional Extrinsic Information Transfer (EXIT) chart-based optimization. Also, a performance gain for the Binary Erasure Channel (BEC) for a block (i.e., codeword) length of 180 bits is shown.

Submitted 6 April, 2018; v1 submitted 28 June, 2017; originally announced June 2017.

Comments: in IEEE International Conference on Communications (ICC), May 2018

arXiv:1704.05408 [pdf, other]

Wave-like Decoding of Tail-biting Spatially Coupled LDPC Codes Through Iterative Demapping

Authors: Sebastian Cammerer, Laurent Schmalen, Vahid Aref, Stephan ten Brink

Abstract: For finite coupling lengths, terminated spatially coupled low-density parity-check (SC-LDPC) codes show a non-negligible rate-loss. In this paper, we investigate if this rate loss can be mitigated by tail-biting SC-LDPC codes in conjunction with iterative demapping of higher order modulation formats. Therefore, we examine the BP threshold of different coupled and uncoupled ensembles. A comparison between the decoding thresholds approximated by EXIT charts and the density evolution results of the coupled and uncoupled ensemble is given. We investigate the effect and potential of different labelings for such a set-up using per-bit EXIT curves, and exemplify the method for a 16-QAM system, e.g., using set partitioning labelings. A hybrid mapping is proposed, where different sub-blocks use different labelings in order to further optimize the decoding thresholds of tail-biting codes, while the computational complexity overhead through iterative demapping remains small.

Submitted 18 April, 2017; originally announced April 2017.

Comments: presented at the International Symposium on Turbo Codes & Iterative Information Processing (ISTC), Brest, Sept. 2016

arXiv:1702.06901 [pdf, other]

Scaling Deep Learning-based Decoding of Polar Codes via Partitioning

Authors: Sebastian Cammerer, Tobias Gruber, Jakob Hoydis, Stephan ten Brink

Abstract: The training complexity of deep learning-based channel decoders scales exponentially with the codebook size and therefore with the number of information bits. Thus, neural network decoding (NND) is currently only feasible for very short block lengths. In this work, we show that the conventional iterative decoding algorithm for polar codes can be enhanced when sub-blocks of the decoder are replaced by neural network (NN) based components. Thus, we partition the encoding graph into smaller sub-blocks and train them individually, closely approaching maximum a posteriori (MAP) performance per sub-block. These blocks are then connected via the remaining conventional belief propagation decoding stage(s). The resulting decoding algorithm is non-iterative and inherently enables a high level of parallelization, while showing a competitive bit error rate (BER) performance. We examine the degradation through partitioning and compare the resulting decoder to state-of-the-art polar decoders such as successive cancellation list and belief propagation decoding.

Submitted 22 February, 2017; originally announced February 2017.

Comments: Submitted to Globecom 2017

arXiv:1701.07738 [pdf, ps, other]

On Deep Learning-Based Channel Decoding

Authors: Tobias Gruber, Sebastian Cammerer, Jakob Hoydis, Stephan ten Brink

Abstract: We revisit the idea of using deep neural networks for one-shot decoding of random and structured codes, such as polar codes. Although it is possible to achieve maximum a posteriori (MAP) bit error rate (BER) performance for both code families and for short codeword lengths, we observe that (i) structured codes are easier to learn and (ii) the neural network is able to generalize to codewords that it has never seen during training for structured, but not for random codes. These results provide some evidence that neural networks can learn a form of decoding algorithm, rather than only a simple classifier. We introduce the metric normalized validation error (NVE) in order to further investigate the potential and limitations of deep learning-based decoding with respect to performance and complexity.

Submitted 26 January, 2017; originally announced January 2017.

Comments: accepted for CISS 2017

arXiv:1611.03796 [pdf]

Flexible Length Polar Codes through Graph Based Augmentation

Authors: A. Elkelesh, M. Ebada, S. Cammerer, S. ten Brink

Abstract: The structure of polar codes inherently requires block lengths to be powers of two. In this paper, we investigate how different block lengths can be realized by coupling of several short-length polar codes. For this, we first analyze "code augmentation" to better protect the semipolarized channels, improving the BER performance under belief propagation decoding. Several serial and parallel augmentation schemes are discussed. A coding gain of $0.3$ dB at a BER of $10^{-5}$ can be observed for the same total rate and length. Further, we extend this approach towards coupling of several "sub-polar codes", leading to a reduced computational complexity and enabling the construction of flexible length polar codes.

Submitted 11 November, 2016; originally announced November 2016.

Comments: 11th International ITG Conference on Systems, Communications and Coding (SCC) 2017, Hamburg, Germany

arXiv:1611.03655 [pdf]

Improving Belief Propagation Decoding of Polar Codes Using Scattered EXIT Charts

Authors: A. Elkelesh, M. Ebada, S. Cammerer, S. ten Brink

Abstract: For finite length polar codes, channel polarization leaves a significant number of channels not fully polarized. Adding a Cyclic Redundancy Check (CRC) to better protect information on the semi-polarized channels has already been successfully applied in the literature, and is straightforward to use in combination with Successive Cancellation List (SCL) decoding. Belief Propagation (BP) decoding, however, offers more potential for exploiting parallelism in hardware implementation, and thus, we focus our attention on improving the BP decoder. Specifically, similar to the CRC strategy in the SCL-case, we use a short-length "auxiliary" LDPC code together with the polar code to provide a significant improvement in terms of BER. We present the novel concept of "scattered" EXIT charts to design such auxiliary LDPC codes, and achieve net coding gains (i.e., for the same total rate) of 0.4 dB at BER of 1E-5 compared to the conventional BP decoder.

Submitted 11 November, 2016; originally announced November 2016.

Comments: 6 pages, 2016 IEEE Information Theory Workshop (ITW)

arXiv:1609.09358 [pdf, ps, other]

Combining Belief Propagation and Successive Cancellation List Decoding of Polar Codes on a GPU Platform

Authors: Sebastian Cammerer, Benedikt Leible, Matthias Stahl, Jakob Hoydis, Stephan ten Brink

Abstract: The decoding performance of polar codes strongly depends on the decoding algorithm used, while also the decoder throughput and its latency mainly depend on the decoding algorithm. In this work, we implement the powerful successive cancellation list (SCL) decoder on a GPU and identify the bottlenecks of this algorithm with respect to parallel computing and its difficulties. The inherent serial decoding property of the SCL algorithm naturally limits the achievable speed-up gains on GPUs when compared to CPU implementations. In order to increase the decoding throughput, we use a hybrid decoding scheme based on the belief propagation (BP) decoder, which can be intra- and inter-frame parallelized. The proposed scheme combines excellent decoding performance and high throughput within the signal-to-noise ratio (SNR) region of interest.

Submitted 21 December, 2016; v1 submitted 29 September, 2016; originally announced September 2016.

Comments: accepted for ICASSP'17

arXiv:1507.01970 [pdf, other]

Triggering Wave-Like Convergence of Tail-biting Spatially Coupled LDPC Codes

Authors: Sebastian Cammerer, Vahid Aref, Laurent Schmalen, Stephan ten Brink

Abstract: Spatially coupled low-density parity-check (SC-LDPC) codes can achieve the channel capacity under low-complexity belief propagation (BP) decoding; however, there is a non-negligible rate-loss because of termination effects for practical finite coupling lengths. In this paper, we study how we can approach the performance of terminated SC-LDPC codes by random shortening of tail-biting SC-LDPC codes. We find the minimum required rate-loss in order to achieve the same performance as terminated codes. We additionally study the use of tail-biting SC-LDPC codes for transmission over parallel channels (e.g., bit-interleaved-coded-modulation (BICM)) and investigate how the distribution of the coded bits between two parallel channels can change the performance of the code. We show that a tail-biting SC-LDPC code can be used with BP decoding almost anywhere within the achievable region of MAP decoding. The optimization comes with a mandatory buffer at the encoder side. We evaluate different distributions of coded bits in order to reduce this buffer length.

Submitted 15 December, 2015; v1 submitted 7 July, 2015; originally announced July 2015.

Comments: Submitted to 2016 Annual Conference on Information Science and Systems (CISS)

Showing 1–45 of 45 results for author: Cammerer, S