-
A Generalized Adjusted Min-Sum Decoder for 5G LDPC Codes: Algorithm and Implementation
Authors:
Yuqing Ren,
Hassan Harb,
Yifei Shen,
Alexios Balatsoukas-Stimming,
Andreas Burg
Abstract:
5G New Radio (NR) has stringent demands on both performance and complexity for the design of low-density parity-check (LDPC) decoding algorithms and corresponding VLSI implementations. Furthermore, decoders must fully support the wide range of all 5G NR blocklengths and code rates, which is a significant challenge. In this paper, we present a high-performance and low-complexity LDPC decoder, tailo…
▽ More
5G New Radio (NR) has stringent demands on both performance and complexity for the design of low-density parity-check (LDPC) decoding algorithms and corresponding VLSI implementations. Furthermore, decoders must fully support the wide range of all 5G NR blocklengths and code rates, which is a significant challenge. In this paper, we present a high-performance and low-complexity LDPC decoder, tailor-made to fulfill the 5G requirements. First, to close the gap between belief propagation (BP) decoding and its approximations in hardware, we propose an extension of adjusted min-sum decoding, called generalized adjusted min-sum (GA-MS) decoding. This decoding algorithm flexibly truncates the incoming messages at the check node level and carefully approximates the non-linear functions of BP decoding to balance the error-rate and hardware complexity. Numerical results demonstrate that the proposed fixed-point GAMS has only a minor gap of 0.1 dB compared to floating-point BP under various scenarios of 5G standard specifications. Secondly, we present a fully reconfigurable 5G NR LDPC decoder implementation based on GA-MS decoding. Given that memory occupies a substantial portion of the decoder area, we adopt multiple data compression and approximation techniques to reduce 42.2% of the memory overhead. The corresponding 28nm FD-SOI ASIC decoder has a core area of 1.823 mm2 and operates at 895 MHz. It is compatible with all 5G NR LDPC codes and achieves a peak throughput of 24.42 Gbps and a maximum area efficiency of 13.40 Gbps/mm2 at 4 decoding iterations.
△ Less
Submitted 17 February, 2024; v1 submitted 24 October, 2023;
originally announced October 2023.
-
Band-of-Interest-based Channel Impulse Response Fusion for Breathing Rate Estimation with UWB
Authors:
Sitian Li,
Alexios Balatsoukas-Stimming,
Andreas Burg
Abstract:
The channel impulse response (CIR) obtained from the channel estimation step of various wireless systems is a widely used source of information in wireless sensing. Breathing rate is one of the important vital signs that can be retrieved from the CIR. Recently, there have been various works that extract the breathing rate from one carefully selected CIR delay bin that contains the breathing inform…
▽ More
The channel impulse response (CIR) obtained from the channel estimation step of various wireless systems is a widely used source of information in wireless sensing. Breathing rate is one of the important vital signs that can be retrieved from the CIR. Recently, there have been various works that extract the breathing rate from one carefully selected CIR delay bin that contains the breathing information. However, it has also been shown that the accuracy of this estimation is very sensitive to the measurement scenario, e.g., if there is any obstacle between the transceivers and the target, the position of the target, and the orientation of the target, since only one CIR delay bin does not contain a sufficient periodic component to retrieve the breathing rate. We focus on such scenarios and propose a CIR delay bin fusion method to merge several CIR bins to achieve a more accurate and reliable breathing rate estimate. We take measurements and showcase the advantages of the proposed method across scenarios.
△ Less
Submitted 6 February, 2023;
originally announced February 2023.
-
Single-anchor UWB Localization using Channel Impulse Response Distributions
Authors:
Sitian Li,
Alexios Balatsoukas-Stimming,
Andreas Burg
Abstract:
Ultra-wideband (UWB) devices are widely used in indoor localization scenarios. Single-anchor UWB localization shows advantages because of its simple system setup compared to conventional two-way ranging (TWR) and trilateration localization methods. In this work, we focus on single-anchor UWB localization methods that learn statistical features of the channel impulse response (CIR) in different loc…
▽ More
Ultra-wideband (UWB) devices are widely used in indoor localization scenarios. Single-anchor UWB localization shows advantages because of its simple system setup compared to conventional two-way ranging (TWR) and trilateration localization methods. In this work, we focus on single-anchor UWB localization methods that learn statistical features of the channel impulse response (CIR) in different location areas using a Gaussian mixture model (GMM). We show that by learning the joint distributions of the amplitudes of different delay components, we achieve a more accurate location estimate compared to considering each delay bin independently. Moreover, we develop a similarity metric between sets of CIRs. With this set-based similarity metric, we can further improve the estimation performance, compared to treating each snapshot separately. We showcase the advantages of the proposed methods in multiple application scenarios.
△ Less
Submitted 8 November, 2022;
originally announced November 2022.
-
High-Throughput Flexible Belief Propagation List Decoder for Polar Codes
Authors:
Yuqing Ren,
Yifei Shen,
Leyu Zhang,
Andreas Toftegaard Kristensen,
Alexios Balatsoukas-Stimming,
Andreas Burg,
Chuan Zhang
Abstract:
Owing to its high parallelism, belief propagation (BP) decoding is highly amenable to high-throughput implementations and thus represents a promising solution for meeting the ultra-high peak data rate of future communication systems. However, for polar codes, the error-correcting performance of BP decoding is far inferior to that of the widely used CRC-aided successive cancellation list (SCL) deco…
▽ More
Owing to its high parallelism, belief propagation (BP) decoding is highly amenable to high-throughput implementations and thus represents a promising solution for meeting the ultra-high peak data rate of future communication systems. However, for polar codes, the error-correcting performance of BP decoding is far inferior to that of the widely used CRC-aided successive cancellation list (SCL) decoding algorithm. To close the performance gap to SCL, BP list (BPL) decoding expands the exploration of candidate codewords through multiple permuted factor graphs (PFGs). From an implementation perspective, designing a unified and flexible hardware architecture for BPL decoding that supports various PFGs and code configurations presents a big challenge. In this paper, we propose the first hardware implementation of a BPL decoder for polar codes and overcome the implementation challenge by applying a hardware-friendly algorithm that generates flexible permutations on-the-fly. First, we derive the graph selection gain and provide a sequential generation (SG) algorithm to obtain a near-optimal PFG set. We further prove that any permutation can be decomposed into a combination of multiple fixed routings, and we design a low-complexity permutation network to satisfy the decoding schedule. Our BPL decoder not only has a low decoding latency by executing the decoding and permutation generation in parallel, but also supports an arbitrary list size without any area overhead. Experimental results show that, for length-1024 polar codes with a code rate of one-half, our BPL decoder with 32 PFGs has a similar error-correcting performance to SCL with a list size of 4 and achieves a throughput of 25.63 Gbps and an area efficiency of 29.46 Gbps/mm$^{2}$ at SNR=4.0dB, which is 1.82$\times$ and 4.33$\times$ faster than the state-of-the-art BP flip and SCL decoders,~respectively
△ Less
Submitted 19 March, 2023; v1 submitted 25 October, 2022;
originally announced October 2022.
-
Increasing Cellular Network Energy Efficiency for Railway Corridors
Authors:
Adrian Schumacher,
Ruben Merz,
Andreas Burg
Abstract:
Modern trains act as Faraday cages making it challenging to provide high cellular data capacities to passengers. A solution is the deployment of linear cells along railway tracks, forming a cellular corridor. To provide a sufficiently high data capacity, many cell sites need to be installed at regular distances. However, such cellular corridors with high power sites in short distance intervals are…
▽ More
Modern trains act as Faraday cages making it challenging to provide high cellular data capacities to passengers. A solution is the deployment of linear cells along railway tracks, forming a cellular corridor. To provide a sufficiently high data capacity, many cell sites need to be installed at regular distances. However, such cellular corridors with high power sites in short distance intervals are not sustainable due to the infrastructure power consumption. To render railway connectivity more sustainable, we propose to deploy fewer high-power radio units with intermediate low-power support repeater nodes. We show that these repeaters consume only 5 % of the energy of a regular cell site and help to maintain the same data capacity in the trains. In a further step, we introduce a sleep mode for the repeater nodes that enables autonomous solar powering and even eases installation because no cables to the relays are needed.
△ Less
Submitted 24 May, 2022;
originally announced May 2022.
-
Spreading Factor assisted LoRa Localization with Deep Reinforcement Learning
Authors:
Yaya Etiabi,
Mohammed JOUHARI,
Andreas Burg,
El Mehdi Amhoud
Abstract:
Most of the developed localization solutions rely on RSSI fingerprinting. However, in the LoRa networks, due to the spreading factor (SF) in the network setting, traditional fingerprinting may lack representativeness of the radio map, leading to inaccurate position estimates. As such, in this work, we propose a novel LoRa RSSI fingerprinting approach that takes into account the SF. The performance…
▽ More
Most of the developed localization solutions rely on RSSI fingerprinting. However, in the LoRa networks, due to the spreading factor (SF) in the network setting, traditional fingerprinting may lack representativeness of the radio map, leading to inaccurate position estimates. As such, in this work, we propose a novel LoRa RSSI fingerprinting approach that takes into account the SF. The performance evaluation shows the prominence of our proposed approach since we achieved an improvement in localization accuracy by up to 6.67% compared to the state-of-the-art methods. The evaluation has been done using a fully connected deep neural network (DNN) set as the baseline. To further improve the localization accuracy, we propose a deep reinforcement learning model that captures the ever-growing complexity of LoRa networks and copes with their scalability. The obtained results show an improvement of 48.10% in the localization accuracy compared to the baseline DNN model.
△ Less
Submitted 10 May, 2023; v1 submitted 23 May, 2022;
originally announced May 2022.
-
A Sequence Repetition Node-Based Successive Cancellation List Decoder for 5G Polar Codes: Algorithm and Implementation
Authors:
Yuqing Ren,
Andreas Toftegaard Kristensen,
Yifei Shen,
Alexios Balatsoukas-Stimming,
Chuan Zhang,
Andreas Burg
Abstract:
Due to the low-latency and high-reliability requirements of 5G, low-complexity node-based successive cancellation list (SCL) decoding has received considerable attention for use in 5G communications systems. By identifying special constituent codes in the decoding tree and immediately decoding these, node-based SCL decoding provides a significant reduction in decoding latency compared to conventio…
▽ More
Due to the low-latency and high-reliability requirements of 5G, low-complexity node-based successive cancellation list (SCL) decoding has received considerable attention for use in 5G communications systems. By identifying special constituent codes in the decoding tree and immediately decoding these, node-based SCL decoding provides a significant reduction in decoding latency compared to conventional SCL decoding. However, while there exists many types of nodes, the current node-based SCL decoders are limited by the lack of a more generalized node that can efficiently decode a larger number of different constituent codes to further reduce the decoding time. In this paper, we extend a recent generalized node, the sequence repetition (SR) node to SCL decoding and we describe the first implementation of an SR-List decoder. By merging certain SR-List decoding operations and applying various optimizations for 5G New Radio (NR) polar codes, our optimized SR-List decoding algorithm increases the throughput by almost ${2\times}$ compared to a similar state-of-the-art node-based SCL decoder. We also present our hardware implementation of the optimized SR-List decoding algorithm which supports all 5G NR polar codes. Synthesis results show that our SR-List decoder can achieve a $2.94 \, \mathrm{Gbps}$ throughput and $6.70\, \mathrm{Gbps} / \mathrm{mm}^2$ area efficiency for ${L=8}$.
△ Less
Submitted 26 August, 2022; v1 submitted 18 May, 2022;
originally announced May 2022.
-
Adding Indoor Capacity Without Fiber Backhaul: A mmWave Bridge Prototype
Authors:
Adrian Schumacher,
Ruben Merz,
Andreas Burg
Abstract:
Today, a large portion of the mobile data traffic is consumed behind the shielding walls of buildings or in the Faraday cage of trains. This renders cellular network coverage from outdoor cell sites difficult. Indoor small cells and distributed antennas along train tracks are often considered as a solution, but the cost and the need for optical fiber backhaul are often prohibitive. To alleviate th…
▽ More
Today, a large portion of the mobile data traffic is consumed behind the shielding walls of buildings or in the Faraday cage of trains. This renders cellular network coverage from outdoor cell sites difficult. Indoor small cells and distributed antennas along train tracks are often considered as a solution, but the cost and the need for optical fiber backhaul are often prohibitive. To alleviate this issue, we describe an out-of-band repeater that converts a sub-6 GHz cell signal from a small cell installed at a cell tower to a mmWave frequency for the fronthaul to buildings or distributed antenna sites, where the signal is downconverted to the original frequency and emitted for example inside a building. This concept does not require fiber deployment, provides backward compatibility to equipment already in use, and additional indoor capacity is gained while outdoor networks are offloaded. The architecture and hardware prototype implementation are described, and measurements are reported to demonstrate the functionality and compatibility with commercial infrastructure and mobile terminals.
△ Less
Submitted 14 May, 2021;
originally announced May 2021.
-
A mmWave Bridge Concept to Solve the Cellular Outdoor-to-Indoor Challenge
Authors:
Adrian Schumacher,
Ruben Merz,
Andreas Burg
Abstract:
Wireless indoor coverage and data capacity are important aspects of cellular networks. With the ever-increasing data traffic, demand for more data capacity indoors is also growing. The lower frequencies of the legacy frequency bands of macro outdoor cells manage to provide coverage inside buildings, however, new frequencies foreseen for the 5th generation (5G) of mobile communications in the milli…
▽ More
Wireless indoor coverage and data capacity are important aspects of cellular networks. With the ever-increasing data traffic, demand for more data capacity indoors is also growing. The lower frequencies of the legacy frequency bands of macro outdoor cells manage to provide coverage inside buildings, however, new frequencies foreseen for the 5th generation (5G) of mobile communications in the millimeter wave (mmWave) spectrum penetrate very poorly into buildings. Therefore, a massive densification of the network would require to deploy a large number of indoor small cells, which would lead to high deployment costs to install the necessary wired/optical backhaul. Hence, other methods are needed that allow an increase of the data capacity indoors, bearing a lower cost than a fiber deployment. We propose a cost-efficient out-of-band repeater architecture that provides more data capacity indoors than an outdoor macro/micro network can provide to indoor, without adversely affecting a legacy network, and which readily works with the established cellular infrastructure as well as standard handsets/smartphones. This proposal is compared to conventional in- and out-of-band repeaters and relay nodes in order to highlight the advantages of our solution. While the data capacity for a single link is similar to that of repeaters and relays, a macro cell can be effectively offloaded. Cell capacities corresponding to at least 3--4 times that of a repeater or relay solution can be provided, depending on the number of parallel installed links and the bandwidth in the mmWave spectrum.
△ Less
Submitted 14 May, 2021;
originally announced May 2021.
-
3.5 GHz Coverage Assessment with a 5G Testbed
Authors:
Adrian Schumacher,
Ruben Merz,
Andreas Burg
Abstract:
Today, cellular networks have saturated frequencies below 3\,GHz. Because of increasing capacity requirements, 5th generation (5G) mobile networks target the 3.5\,GHz band (3.4 to 3.8\,GHz). Despite its expected wide usage, there is little empirical path loss data and mobile radio network planning experience for the 3.5\,GHz band available. This paper presents the results of rural, suburban, and u…
▽ More
Today, cellular networks have saturated frequencies below 3\,GHz. Because of increasing capacity requirements, 5th generation (5G) mobile networks target the 3.5\,GHz band (3.4 to 3.8\,GHz). Despite its expected wide usage, there is little empirical path loss data and mobile radio network planning experience for the 3.5\,GHz band available. This paper presents the results of rural, suburban, and urban measurement campaigns using a pre-standard 5G prototype testbed operating at 3.5\,GHz, with outdoor as well as outdoor-to-indoor scenarios. Based on the measurement results, path loss models are evaluated, which are essential for network planning.
△ Less
Submitted 14 May, 2021;
originally announced May 2021.
-
OpenCSI: An Open-Source Dataset for Indoor Localization Using CSI-Based Fingerprinting
Authors:
Arthur Gassner,
Claudiu Musat,
Alexandru Rusu,
Andreas Burg
Abstract:
Many applications require accurate indoor localization. Fingerprint-based localization methods propose a solution to this problem, but rely on a radio map that is effort-intensive to acquire. We automate the radio map acquisition phase using a software-defined radio (SDR) and a wheeled robot. Furthermore, we open-source a radio map acquired with our automated tool for a 3GPP Long-Term Evolution (L…
▽ More
Many applications require accurate indoor localization. Fingerprint-based localization methods propose a solution to this problem, but rely on a radio map that is effort-intensive to acquire. We automate the radio map acquisition phase using a software-defined radio (SDR) and a wheeled robot. Furthermore, we open-source a radio map acquired with our automated tool for a 3GPP Long-Term Evolution (LTE) wireless link. To the best of our knowledge, this is the first publicly available radio map containing channel state information (CSI). Finally, we describe first localization experiments on this radio map using a convolutional neural network to regress for location coordinates.
△ Less
Submitted 14 April, 2022; v1 submitted 16 April, 2021;
originally announced April 2021.
-
On the Implementation Complexity of Digital Full-Duplex Self-Interference Cancellation
Authors:
Andreas Toftegaard Kristensen,
Alexios Balatsoukas-Stimming,
Andreas Burg
Abstract:
In-band full-duplex systems promise to further increase the throughput of wireless systems, by simultaneously transmitting and receiving on the same frequency band. However, concurrent transmission generates a strong self-interference signal at the receiver, which requires the use of cancellation techniques. A wide range of techniques for analog and digital self-interference cancellation have alre…
▽ More
In-band full-duplex systems promise to further increase the throughput of wireless systems, by simultaneously transmitting and receiving on the same frequency band. However, concurrent transmission generates a strong self-interference signal at the receiver, which requires the use of cancellation techniques. A wide range of techniques for analog and digital self-interference cancellation have already been presented in the literature. However, their evaluation focuses on cases where the underlying physical parameters of the full-duplex system do not vary significantly. In this paper, we focus on adaptive digital cancellation, motivated by the fact that physical systems change over time. We examine some of the different cancellation methods in terms of their performance and implementation complexity, considering the cost of both cancellation and training. We then present a comparative analysis of all these methods to determine which perform better under different system performance requirements. We demonstrate that with a neural network approach, the reduction in arithmetic complexity for the same cancellation performance relative to a state-of-the-art polynomial model is several orders of magnitude.
△ Less
Submitted 9 January, 2021;
originally announced January 2021.
-
A Maximum-Likelihood-based Multi-User LoRa Receiver Implemented in GNU Radio
Authors:
Mathieu Xhonneux,
Joachim Tapparel,
Orion Afisiadis,
Alexios Balatsoukas-Stimming,
Andreas Burg
Abstract:
LoRa is a popular low-power wide-area network (LPWAN) technology that uses spread-spectrum to achieve long-range connectivity and resilience to noise and interference. For energy efficiency reasons, LoRa adopts a pure ALOHA access scheme, which leads to reduced network throughput due to packet collisions at the gateways. To alleviate this issue, in this paper we analyze and implement a LoRa receiv…
▽ More
LoRa is a popular low-power wide-area network (LPWAN) technology that uses spread-spectrum to achieve long-range connectivity and resilience to noise and interference. For energy efficiency reasons, LoRa adopts a pure ALOHA access scheme, which leads to reduced network throughput due to packet collisions at the gateways. To alleviate this issue, in this paper we analyze and implement a LoRa receiver that is able to decode LoRa packets from two interfering users. Our main contribution is a two-user detector derived in a maximum-likelihood fashion using a detailed interference model. As the complexity of the maximum-likelihood sequence estimation is prohibitive, a complexity-reduction technique is introduced to enable a practical implementation of the proposed two-user detector. This detector has been implemented along with an interference-robust synchronization algorithm on the GNU Radio Software-Defined-Radio (SDR) platform. The SDR implementation shows the effectiveness of the proposed method and also allows its experimental evaluation. Measurements indicate that our detector inherently leverages the time offset between the two colliding users to separate and demodulate their contributions.
△ Less
Submitted 15 December, 2020;
originally announced December 2020.
-
On the Advantage of Coherent LoRa Detection in the Presence of Interference
Authors:
Orion Afisiadis,
Sitian Li,
Andreas Burg,
Alexios Balatsoukas-Stimming
Abstract:
It has been shown that the coherent detection of LoRa signals only provides marginal gains of around 0.7 dB on the additive white Gaussian noise (AWGN) channel. However, ALOHA-based massive Internet of Things systems, including LoRa, often operate in the interference-limited regime. Therefore, in this work, we examine the performance of the LoRa modulation with coherent detection in the presence o…
▽ More
It has been shown that the coherent detection of LoRa signals only provides marginal gains of around 0.7 dB on the additive white Gaussian noise (AWGN) channel. However, ALOHA-based massive Internet of Things systems, including LoRa, often operate in the interference-limited regime. Therefore, in this work, we examine the performance of the LoRa modulation with coherent detection in the presence of interference from another LoRa user with the same spreading factor. We derive rigorous symbol- and frame error rate expressions as well as bounds and approximations for evaluating the error rates. The error rates predicted by these approximations are compared against error rates found by Monte Carlo simulations and shown to be very accurate. We also compare the performance of LoRa with coherent and non-coherent receivers and we show that the coherent detection of LoRa is significantly more beneficial in interference scenarios than in the presence of only AWGN. For example, we show that coherent detection leads to a 2.5 dB gain over the standard non-coherent detection for a signal-to-interference ratio (SIR) of 3 dB and up to a 10 dB gain for an SIR of 0 dB.
△ Less
Submitted 1 October, 2020;
originally announced October 2020.
-
Lupulus: A Flexible Hardware Accelerator for Neural Networks
Authors:
Andreas Toftegaard Kristensen,
Robert Giterman,
Alexios Balatsoukas-Stimming,
Andreas Burg
Abstract:
Neural networks have become indispensable for a wide range of applications, but they suffer from high computational- and memory-requirements, requiring optimizations from the algorithmic description of the network to the hardware implementation. Moreover, the high rate of innovation in machine learning makes it important that hardware implementations provide a high level of programmability to supp…
▽ More
Neural networks have become indispensable for a wide range of applications, but they suffer from high computational- and memory-requirements, requiring optimizations from the algorithmic description of the network to the hardware implementation. Moreover, the high rate of innovation in machine learning makes it important that hardware implementations provide a high level of programmability to support current and future requirements of neural networks. In this work, we present a flexible hardware accelerator for neural networks, called Lupulus, supporting various methods for scheduling and map** of operations onto the accelerator. Lupulus was implemented in a 28nm FD-SOI technology and demonstrates a peak performance of 380 GOPS/GHz with latencies of 21.4ms and 183.6ms for the convolutional layers of AlexNet and VGG-16, respectively.
△ Less
Submitted 3 May, 2020;
originally announced May 2020.
-
An Open-Source LoRa Physical Layer Prototype on GNU Radio
Authors:
Joachim Tapparel,
Orion Afisiadis,
Paul Mayoraz,
Alexios Balatsoukas-Stimming,
Andreas Burg
Abstract:
LoRa is the proprietary physical layer (PHY) of LoRaWAN, which is a popular Internet-of-Things (IoT) protocol enabling low-power devices to communicate over long ranges. A number of reverse engineering attempts have been published in the last few years that helped to reveal many of the LoRa PHY details. In this work, we describe our standard compatible LoRa PHY software-defined radio (SDR) prototy…
▽ More
LoRa is the proprietary physical layer (PHY) of LoRaWAN, which is a popular Internet-of-Things (IoT) protocol enabling low-power devices to communicate over long ranges. A number of reverse engineering attempts have been published in the last few years that helped to reveal many of the LoRa PHY details. In this work, we describe our standard compatible LoRa PHY software-defined radio (SDR) prototype based on GNU Radio. We show how this SDR prototype can be used to develop and evaluate receiver algorithms for LoRa. As an example, we describe the sampling time offset and the carrier frequency offset estimation and compensation blocks. We experimentally evaluate the error rate of LoRa, both for the uncoded and the coded cases, to illustrate that our publicly available open-source implementation is a solid basis for further research.
△ Less
Submitted 20 February, 2020; v1 submitted 19 February, 2020;
originally announced February 2020.
-
PAC Codes: Sequential Decoding vs List Decoding
Authors:
Mohammad Rowshan,
Andreas Burg,
Emanuele Viterbo
Abstract:
In the Shannon lecture at the 2019 International Symposium on Information Theory (ISIT), Arıkan proposed to employ a one-to-one convolutional transform as a pre-coding step before the polar transform. The resulting codes of this concatenation are called polarization-adjusted convolutional (PAC) codes. In this scheme, a pair of polar mapper and demapper as pre- and postprocessing devices are deploy…
▽ More
In the Shannon lecture at the 2019 International Symposium on Information Theory (ISIT), Arıkan proposed to employ a one-to-one convolutional transform as a pre-coding step before the polar transform. The resulting codes of this concatenation are called polarization-adjusted convolutional (PAC) codes. In this scheme, a pair of polar mapper and demapper as pre- and postprocessing devices are deployed around a memoryless channel, which provides polarized information to an outer decoder leading to improved error correction performance of the outer code. In this paper, the list decoding and sequential decoding (including Fano decoding and stack decoding) are first adapted for use to decode PAC codes. Then, to reduce the complexity of sequential decoding of PAC/polar codes, we propose (i) an adaptive heuristic metric, (ii) tree search constraints for backtracking to avoid exploration of unlikely sub-paths, and (iii) tree search strategies consistent with the pattern of error occurrence in polar codes. These contribute to the reduction of the average decoding time complexity from 50% to 80%, trading with 0.05 to 0.3 dB degradation in error correction performance within FER=10^-3 range, respectively, relative to not applying the corresponding search strategies. Additionally, as an important ingredient in Fano decoding of PAC/polar codes, an efficient computation method for the intermediate LLRs and partial sums is provided. This method is effective in backtracking and avoids storing the intermediate information or restarting the decoding process. Eventually, all three decoding algorithms are compared in terms of performance, complexity, and resource requirements.
△ Less
Submitted 17 January, 2024; v1 submitted 17 February, 2020;
originally announced February 2020.
-
Identification of Non-Linear RF Systems Using Backpropagation
Authors:
Andreas Toftegaard Kristensen,
Andreas Burg,
Alexios Balatsoukas-Stimming
Abstract:
In this work, we use deep unfolding to view cascaded non-linear RF systems as model-based neural networks. This view enables the direct use of a wide range of neural network tools and optimizers to efficiently identify such cascaded models. We demonstrate the effectiveness of this approach through the example of digital self-interference cancellation in full-duplex communications where an IQ imbal…
▽ More
In this work, we use deep unfolding to view cascaded non-linear RF systems as model-based neural networks. This view enables the direct use of a wide range of neural network tools and optimizers to efficiently identify such cascaded models. We demonstrate the effectiveness of this approach through the example of digital self-interference cancellation in full-duplex communications where an IQ imbalance model and a non-linear PA model are cascaded in series. For a self-interference cancellation performance of approximately 44.5 dB, the number of model parameters can be reduced by 74% and the number of operations per sample can be reduced by 79% compared to an expanded linear-in-parameters polynomial model.
△ Less
Submitted 31 May, 2020; v1 submitted 27 January, 2020;
originally announced January 2020.
-
Hardware Implementation of Neural Self-Interference Cancellation
Authors:
Yann Kurzo,
Andreas Toftegaard Kristensen,
Andreas Burg,
Alexios Balatsoukas-Stimming
Abstract:
In-band full-duplex systems can transmit and receive information simultaneously on the same frequency band. However, due to the strong self-interference caused by the transmitter to its own receiver, the use of non-linear digital self-interference cancellation is essential. In this work, we describe a hardware architecture for a neural network-based non-linear self-interference (SI) canceller and…
▽ More
In-band full-duplex systems can transmit and receive information simultaneously on the same frequency band. However, due to the strong self-interference caused by the transmitter to its own receiver, the use of non-linear digital self-interference cancellation is essential. In this work, we describe a hardware architecture for a neural network-based non-linear self-interference (SI) canceller and we compare it with our own hardware implementation of a conventional polynomial based SI canceller. In particular, we present implementation results for a shallow and a deep neural network SI canceller as well as for a polynomial SI canceller. Our results show that the deep neural network canceller achieves a hardware efficiency of up to $312.8$ Msamples/s/mm$^2$ and an energy efficiency of up to $0.9$ nJ/sample, which is $2.1\times$ and $2\times$ better than the polynomial SI canceller, respectively. These results show that NN-based methods applied to communications are not only useful from a performance perspective, but can also be a very effective means to reduce the implementation complexity.
△ Less
Submitted 7 May, 2020; v1 submitted 13 January, 2020;
originally announced January 2020.
-
Advanced Machine Learning Techniques for Self-Interference Cancellation in Full-Duplex Radios
Authors:
Andreas Toftegaard Kristensen,
Andreas Burg,
Alexios Balatsoukas-Stimming
Abstract:
In-band full-duplex systems allow for more efficient use of temporal and spectral resources by transmitting and receiving information at the same time and on the same frequency. However, this creates a strong self-interference signal at the receiver, making the use of self-interference cancellation critical. Recently, neural networks have been used to perform digital self-interference with lower c…
▽ More
In-band full-duplex systems allow for more efficient use of temporal and spectral resources by transmitting and receiving information at the same time and on the same frequency. However, this creates a strong self-interference signal at the receiver, making the use of self-interference cancellation critical. Recently, neural networks have been used to perform digital self-interference with lower computational complexity compared to a traditional polynomial model. In this paper, we examine the use of advanced neural networks, such as recurrent and complex-valued neural networks, and we perform an in-depth network architecture exploration. Our neural network architecture exploration reveals that complex-valued neural networks can significantly reduce both the number of floating-point operations and parameters compared to a polynomial model, whereas the real-valued networks only reduce the number of floating-point operations. For example, at a digital self-interference cancellation of 44.51 dB, a complex-valued neural network requires 33.7 % fewer floating-point operations and 26.9 % fewer parameters compared to the polynomial model.
△ Less
Submitted 14 December, 2019;
originally announced December 2019.
-
Coded LoRa Frame Error Rate Analysis
Authors:
Orion Afisiadis,
Andreas Burg,
Alexios Balatsoukas-Stimming
Abstract:
In this work, we study the coded frame error rate (FER) of LoRa under additive white Gaussian noise (AWGN) and under carrier frequency offset (CFO). To this end, we use existing approximations for the bit error rate (BER) of the LoRa modulation under AWGN and we present a FER analysis that includes the channel coding, interleaving, and Gray map** of the LoRa physical layer. We also derive the Lo…
▽ More
In this work, we study the coded frame error rate (FER) of LoRa under additive white Gaussian noise (AWGN) and under carrier frequency offset (CFO). To this end, we use existing approximations for the bit error rate (BER) of the LoRa modulation under AWGN and we present a FER analysis that includes the channel coding, interleaving, and Gray map** of the LoRa physical layer. We also derive the LoRa BER under carrier frequency offset and we present a corresponding FER analysis. We compare the derived frame error rate expressions to Monte Carlo simulations to verify their accuracy.
△ Less
Submitted 22 November, 2019;
originally announced November 2019.
-
On the Error Rate of the LoRa Modulation with Interference
Authors:
Orion Afisiadis,
Matthieu Cotting,
Andreas Burg,
Alexios Balatsoukas-Stimming
Abstract:
LoRa is a chirp spread-spectrum modulation developed for the Internet of Things. In this work, we examine the performance of LoRa in the presence of both additive white Gaussian noise and interference from another LoRa user. To this end, we extend an existing interference model, which assumes perfect alignment of the signal of interest and the interference, to the more realistic case where the int…
▽ More
LoRa is a chirp spread-spectrum modulation developed for the Internet of Things. In this work, we examine the performance of LoRa in the presence of both additive white Gaussian noise and interference from another LoRa user. To this end, we extend an existing interference model, which assumes perfect alignment of the signal of interest and the interference, to the more realistic case where the interfering user is neither chip- nor phase-aligned with the signal of interest and we derive an expression for the error rate. We show that the existing aligned interference model overestimates the effect of interference on the error rate. Moreover, we prove two symmetries in the interfering signal and we derive low-complexity approximate formulas that can significantly reduce the complexity of computing the symbol and frame error rates compared to the complete expression. Finally, we provide numerical simulations to corroborate the theoretical analysis and to verify the accuracy of our proposed approximations.
△ Less
Submitted 3 December, 2019; v1 submitted 24 May, 2019;
originally announced May 2019.
-
LoRa Symbol Error Rate Under Non-Chip- and Non-Phase-Aligned Interference
Authors:
Orion Afisiadis,
Matthieu Cotting,
Andreas Burg,
Alexios Balatsoukas-Stimming
Abstract:
In this work, we examine the performance of the LoRa chirp spread spectrum modulation in the presence of both additive white Gaussian noise and interference from another LoRa user. To this end, we extend an existing interference model to the more realistic case where the interfering user is neither chip- nor phase-aligned with the signal of interest and we derive an expression for the SER. We show…
▽ More
In this work, we examine the performance of the LoRa chirp spread spectrum modulation in the presence of both additive white Gaussian noise and interference from another LoRa user. To this end, we extend an existing interference model to the more realistic case where the interfering user is neither chip- nor phase-aligned with the signal of interest and we derive an expression for the SER. We show that the existing interference model overestimates the effect of interference on the error rate. Moreover, we derive a low-complexity approximate formula that can significantly reduce the complexity of computing the symbol error rate compared to the complete expression.
△ Less
Submitted 1 May, 2019;
originally announced May 2019.
-
Design and Implementation of a Neural Network Aided Self-Interference Cancellation Scheme for Full-Duplex Radios
Authors:
Yann Kurzo,
Andreas Burg,
Alexios Balatsoukas-Stimming
Abstract:
In-band full-duplex systems are able to transmit and receive information simultaneously on the same frequency band. Due to the strong self-interference caused by the transmitter to its own receiver, the use of non-linear digital self-interference cancellation is essential. In this work, we present a hardware architecture for a neural network based non-linear self-interference canceller and we comp…
▽ More
In-band full-duplex systems are able to transmit and receive information simultaneously on the same frequency band. Due to the strong self-interference caused by the transmitter to its own receiver, the use of non-linear digital self-interference cancellation is essential. In this work, we present a hardware architecture for a neural network based non-linear self-interference canceller and we compare it with our own hardware implementation of a conventional polynomial based canceller. We show that, for the same cancellation performance, the neural network canceller has a significantly higher throughput and requires fewer hardware resources.
△ Less
Submitted 2 December, 2018;
originally announced December 2018.
-
LoRa Digital Receiver Analysis and Implementation
Authors:
Reza Ghanaatian,
Orion Afisiadis,
Matthieu Cotting,
Andreas Burg
Abstract:
Low power wide area network technologies (LPWANs) are attracting attention because they fulfill the need for long range low power communication for the Internet of Things. LoRa is one of the proprietary LPWAN physical layer (PHY) technologies, which provides variable data-rate and long range by using chirp spread spectrum modulation. This paper describes the basic LoRa PHY receiver algorithms and…
▽ More
Low power wide area network technologies (LPWANs) are attracting attention because they fulfill the need for long range low power communication for the Internet of Things. LoRa is one of the proprietary LPWAN physical layer (PHY) technologies, which provides variable data-rate and long range by using chirp spread spectrum modulation. This paper describes the basic LoRa PHY receiver algorithms and studies their performance. The LoRa PHY is first introduced and different demodulation schemes are proposed. The effect of carrier frequency offset and sampling frequency offset are then modeled and corresponding compensation methods are proposed. Finally, a software-defined radio implementation for the LoRa transceiver is briefly presented.
△ Less
Submitted 20 February, 2019; v1 submitted 9 November, 2018;
originally announced November 2018.
-
Feedback-Aware Precoding for Millimeter Wave Massive MIMO Systems
Authors:
Reza Ghanaatian,
Vahid Jamali,
Andreas Burg,
Robert Schober
Abstract:
Millimeter wave (mmWave) communication is a promising solution for co** with the ever-increasing mobile data traffic because of its large bandwidth. To enable a sufficient link margin, a large antenna array employing directional beamforming, which is enabled by the availability of channel state information at the transmitter (CSIT), is required. However, CSIT acquisition for mmWave channels intr…
▽ More
Millimeter wave (mmWave) communication is a promising solution for co** with the ever-increasing mobile data traffic because of its large bandwidth. To enable a sufficient link margin, a large antenna array employing directional beamforming, which is enabled by the availability of channel state information at the transmitter (CSIT), is required. However, CSIT acquisition for mmWave channels introduces a huge feedback overhead due to the typically large number of transmit and receive antennas. Leveraging properties of mmWave channels, this paper proposes a precoding strategy which enables a flexible adjustment of the feedback overhead. In particular, the optimal unconstrained precoder is approximated by selecting a variable number of elements from a basis that is constructed as a function of the transmitter array response, where the number of selected basis elements can be chosen according to the feedback constraint. Simulation results show that the proposed precoding scheme can provide a near-optimal solution if a higher feedback overhead can be afforded. For a low overhead, it can still provide a good approximation of the optimal precoder.
△ Less
Submitted 28 June, 2019; v1 submitted 9 November, 2018;
originally announced November 2018.
-
On the Tradeoff Between Accuracy and Complexity in Blind Detection of Polar Codes
Authors:
Pascal Giard,
Alexios Balatsoukas-Stimming,
Andreas Burg
Abstract:
Polar codes are a recent family of error-correcting codes with a number of desirable characteristics. Their disruptive nature is illustrated by their rapid adoption in the $5^{th}$-generation mobile-communication standard, where they are used to protect control messages. In this work, we describe a two-stage system tasked with identifying the location of control messages that consists of a detecti…
▽ More
Polar codes are a recent family of error-correcting codes with a number of desirable characteristics. Their disruptive nature is illustrated by their rapid adoption in the $5^{th}$-generation mobile-communication standard, where they are used to protect control messages. In this work, we describe a two-stage system tasked with identifying the location of control messages that consists of a detection and selection stage followed by a decoding one. The first stage spurs the need for polar-code detection algorithms with variable effort to balance complexity between the two stages. We illustrate this idea of variable effort for multiple detection algorithms aimed at the first stage. We propose three novel blind detection methods based on belief-propagation decoding inspired by early-stop** criteria. Then we show how their reliability improves with the number of decoding iterations to highlight the possible tradeoffs between accuracy and complexity. Additionally, we show similar tradeoffs for a detection method from previous work. In a setup where only one block encoded with the polar code of interest is present among many other blocks, our results notably show that, depending on the complexity budget, a variable number of undesirable blocks can be dismissed while achieving a missed-detection rate in line with the block-error rate of a complex decoding algorithm.
△ Less
Submitted 11 December, 2018; v1 submitted 2 July, 2018;
originally announced July 2018.
-
Fast-SSC-Flip Decoding of Polar Codes
Authors:
Pascal Giard,
Andreas Burg
Abstract:
Polar codes are widely considered as one of the most exciting recent discoveries in channel coding. For short to moderate block lengths, their error-correction performance under list decoding can outperform that of other modern error-correcting codes. However, high-speed list-based decoders with moderate complexity are challenging to implement. Successive-cancellation (SC)-flip decoding was shown…
▽ More
Polar codes are widely considered as one of the most exciting recent discoveries in channel coding. For short to moderate block lengths, their error-correction performance under list decoding can outperform that of other modern error-correcting codes. However, high-speed list-based decoders with moderate complexity are challenging to implement. Successive-cancellation (SC)-flip decoding was shown to be capable of a competitive error-correction performance compared to that of list decoding with a small list size, at a fraction of the complexity, but suffers from a variable execution time and a higher worst-case latency. In this work, we show how to modify the state-of-the-art high-speed SC decoding algorithm to incorporate the SC-flip ideas. The algorithmic improvements are presented as well as average execution-time results tailored to a hardware implementation. The results show that the proposed fast-SSC-flip algorithm has a decoding speed close to an order of magnitude better than the previous works while retaining a comparable error-correction performance.
△ Less
Submitted 9 May, 2018; v1 submitted 1 December, 2017;
originally announced December 2017.
-
PolarBear: A 28-nm FD-SOI ASIC for Decoding of Polar Codes
Authors:
Pascal Giard,
Alexios Balatsoukas-Stimming,
Thomas Christoph Müller,
Andrea Bonetti,
Claude Thibeault,
Warren J. Gross,
Philippe Flatresse,
Andreas Burg
Abstract:
Polar codes are a recently proposed class of block codes that provably achieve the capacity of various communication channels. They received a lot of attention as they can do so with low-complexity encoding and decoding algorithms, and they have an explicit construction. Their recent inclusion in a 5G communication standard will only spur more research. However, only a couple of ASICs featuring de…
▽ More
Polar codes are a recently proposed class of block codes that provably achieve the capacity of various communication channels. They received a lot of attention as they can do so with low-complexity encoding and decoding algorithms, and they have an explicit construction. Their recent inclusion in a 5G communication standard will only spur more research. However, only a couple of ASICs featuring decoders for polar codes were fabricated, and none of them implements a list-based decoding algorithm. In this paper, we present ASIC measurement results for a fabricated 28 nm CMOS chip that implements two different decoders: the first decoder is tailored toward error-correction performance and flexibility. It supports any code rate as well as three different decoding algorithms: successive cancellation (SC), SC flip and SC list (SCL). The flexible decoder can also decode both non-systematic and systematic polar codes. The second decoder targets speed and energy efficiency. We present measurement results for the first silicon-proven SCL decoder, where its coded throughput is shown to be of 306.8 Mbps with a latency of 3.34 us and an energy per bit of 418.3 pJ/bit at a clock frequency of 721 MHz for a supply of 1.3 V. The energy per bit drops down to 178.1 pJ/bit with a more modest clock frequency of 308 MHz, lower throughput of 130.9 Mbps and a reduced supply voltage of 0.9 V. For the other two operating modes, the energy per bit is shown to be of approximately 95 pJ/bit. The less flexible high-throughput unrolled decoder can achieve a coded throughput of 9.2 Gbps and a latency of 628 ns for a measured energy per bit of 1.15 pJ/bit at 451 MHz.
△ Less
Submitted 1 September, 2017; v1 submitted 31 August, 2017;
originally announced August 2017.
-
Blind Detection of Polar Codes
Authors:
Pascal Giard,
Alexios Balatsoukas-Stimming,
Andreas Burg
Abstract:
Polar codes were recently chosen to protect the control channel information in the next-generation mobile communication standard (5G) defined by the 3GPP. As a result, receivers will have to implement blind detection of polar coded frames in order to keep complexity, latency, and power consumption tractable. As a newly proposed class of block codes, the problem of polar-code blind detection has re…
▽ More
Polar codes were recently chosen to protect the control channel information in the next-generation mobile communication standard (5G) defined by the 3GPP. As a result, receivers will have to implement blind detection of polar coded frames in order to keep complexity, latency, and power consumption tractable. As a newly proposed class of block codes, the problem of polar-code blind detection has received very little attention. In this work, we propose a low-complexity blind-detection algorithm for polar-encoded frames. We base this algorithm on a novel detection metric with update rules that leverage the a priori knowledge of the frozen-bit locations, exploiting the inherent structures that these locations impose on a polar-encoded block of data. We show that the proposed detection metric allows to clearly distinguish polar-encoded frames from other types of data by considering the cumulative distribution functions of the detection metric, and the receiver operating characteristic. The presented results are tailored to the 5G standardization effort discussions, i.e., we consider a short low-rate polar code concatenated with a CRC.
△ Less
Submitted 18 July, 2017; v1 submitted 5 May, 2017;
originally announced May 2017.
-
A 588 Gbps LDPC Decoder Based on Finite-Alphabet Message Passing
Authors:
Reza Ghanaatian,
Alexios Balatsoukas-Stimming,
Christoph Muller,
Michael Meidlinger,
Gerald Matz,
Adam Teman,
Andreas Burg
Abstract:
An ultra-high throughput low-density parity check (LDPC) decoder with an unrolled full-parallel architecture is proposed, which achieves the highest decoding throughput compared to previously reported LDPC decoders in the literature. The decoder benefits from a serial message-transfer approach between the decoding stages to alleviate the well-known routing congestion problem in parallel LDPC decod…
▽ More
An ultra-high throughput low-density parity check (LDPC) decoder with an unrolled full-parallel architecture is proposed, which achieves the highest decoding throughput compared to previously reported LDPC decoders in the literature. The decoder benefits from a serial message-transfer approach between the decoding stages to alleviate the well-known routing congestion problem in parallel LDPC decoders. Furthermore, a finite-alphabet message passing algorithm is employed to replace the variable node update rule of the standard min-sum decoder with look-up tables, which are designed in a way that maximizes the mutual information between decoding messages. The proposed algorithm results in an architecture with reduced bit-width messages, leading to a significantly higher decoding throughput and to a lower area as compared to a min-sum decoder when serial message-transfer is used. The architecture is placed and routed for the standard min-sum reference decoder and for the proposed finite-alphabet decoder using a custom pseudo-hierarchical backend design strategy to further alleviate routing congestions and to handle the large design. Post-layout results show that the finite-alphabet decoder with the serial message-transfer architecture achieves a throughput as large as 588 Gbps with an area of 16.2 mm$^2$ and dissipates an average power of 22.7 pJ per decoded bit in a 28 nm FD-SOI library. Compared to the reference min-sum decoder, this corresponds to 3.1 times smaller area and 2 times better energy efficiency.
△ Less
Submitted 30 December, 2017; v1 submitted 16 March, 2017;
originally announced March 2017.
-
Comparison of Polar Decoders with Existing Low-Density Parity-Check and Turbo Decoders
Authors:
Alexios Balatsoukas-Stimming,
Pascal Giard,
Andreas Burg
Abstract:
Polar codes are a recently proposed family of provably capacity-achieving error-correction codes that received a lot of attention. While their theoretical properties render them interesting, their practicality compared to other types of codes has not been thoroughly studied. Towards this end, in this paper, we perform a comparison of polar decoders against LDPC and Turbo decoders that are used in…
▽ More
Polar codes are a recently proposed family of provably capacity-achieving error-correction codes that received a lot of attention. While their theoretical properties render them interesting, their practicality compared to other types of codes has not been thoroughly studied. Towards this end, in this paper, we perform a comparison of polar decoders against LDPC and Turbo decoders that are used in existing communications standards. More specifically, we compare both the error-correction performance and the hardware efficiency of the corresponding hardware implementations. This comparison enables us to identify applications where polar codes are superior to existing error-correction coding solutions as well as to determine the most promising research direction in terms of the hardware implementation of polar decoders.
△ Less
Submitted 19 April, 2017; v1 submitted 15 February, 2017;
originally announced February 2017.
-
A Multi-Gbps Unrolled Hardware List Decoder for a Systematic Polar Code
Authors:
Pascal Giard,
Alexios Balatsoukas-Stimming,
Thomas Christoph Müller,
Andreas Burg,
Claude Thibeault,
Warren J. Gross
Abstract:
Polar codes are a new class of block codes with an explicit construction that provably achieve the capacity of various communications channels, even with the low-complexity successive-cancellation (SC) decoding algorithm. Yet, the more complex successive-cancellation list (SCL) decoding algorithm is gathering more attention lately as it significantly improves the error-correction performance of sh…
▽ More
Polar codes are a new class of block codes with an explicit construction that provably achieve the capacity of various communications channels, even with the low-complexity successive-cancellation (SC) decoding algorithm. Yet, the more complex successive-cancellation list (SCL) decoding algorithm is gathering more attention lately as it significantly improves the error-correction performance of short- to moderate-length polar codes, especially when they are concatenated with a cyclic redundancy check code. However, as SCL decoding explores several decoding paths, existing hardware implementations tend to be significantly slower than SC-based decoders. In this paper, we show how the unrolling technique, which has already been used in the context of SC decoding, can be adapted to SCL decoding yielding a multi-Gbps SCL-based polar decoder with an error-correction performance that is competitive when compared to an LDPC code of similar length and rate. Post-place-and-route ASIC results for 28 nm CMOS are provided showing that this decoder can sustain a throughput greater than 10 Gbps at 468 MHz with an energy efficiency of 7.25 pJ/bit.
△ Less
Submitted 3 February, 2017;
originally announced February 2017.
-
Sliding Window Spectrum Sensing for Full-Duplex Cognitive Radios with Low Access-Latency
Authors:
Orion Afisiadis,
Andrew C. M. Austin,
Alexios Balatsoukas-Stimming,
Andreas Burg
Abstract:
In a cognitive radio system the failure of secondary user (SU) transceivers to promptly vacate the channel can introduce significant access-latency for primary or high-priority users (PU). In conventional cognitive radio systems, the backoff latency is exacerbated by frame structures that only allow sensing at periodic intervals. Concurrent transmission and sensing using self-interference suppress…
▽ More
In a cognitive radio system the failure of secondary user (SU) transceivers to promptly vacate the channel can introduce significant access-latency for primary or high-priority users (PU). In conventional cognitive radio systems, the backoff latency is exacerbated by frame structures that only allow sensing at periodic intervals. Concurrent transmission and sensing using self-interference suppression has been suggested to improve the performance of cognitive radio systems, allowing decisions to be taken at multiple points within the frame. In this paper, we extend this approach by proposing a sliding-window full-duplex model allowing decisions to be taken on a sample-by-sample basis. We also derive the access-latency for both the existing and the proposed schemes. Our results show that the access-latency of the sliding scheme is decreased by a factor of 2.6 compared to the existing slotted full-duplex scheme and by a factor of approximately 16 compared to a half-duplex cognitive radio system. Moreover, the proposed scheme is significantly more resilient to the destructive effects of residual self-interference compared to previous approaches.
△ Less
Submitted 5 September, 2016;
originally announced September 2016.
-
Spatial Multiplexing of QPSK Signals with a Single Radio: Antenna Design and Over-the-Air Experiments
Authors:
Mohsen Yousefbeiki,
Andrew C. M. Austin,
Juan R. Mosig,
Andreas Burg,
Julien Perruisseau-Carrier
Abstract:
The paper describes the implementation and performance analysis of the first fully-operational beam-space MIMO antenna for the spatial multiplexing of two QPSK streams. The antenna is composed of a planar three-port radiator with two varactor diodes terminating the passive ports. Pattern reconfiguration is used to encode the MIMO information onto orthogonal virtual basis patterns in the far-field.…
▽ More
The paper describes the implementation and performance analysis of the first fully-operational beam-space MIMO antenna for the spatial multiplexing of two QPSK streams. The antenna is composed of a planar three-port radiator with two varactor diodes terminating the passive ports. Pattern reconfiguration is used to encode the MIMO information onto orthogonal virtual basis patterns in the far-field. A measurement campaign was conducted to compare the performance of the beam-space MIMO system with a conventional 2-by-?2 MIMO system under realistic propagation conditions. Propagation measurements were conducted for both systems and the mutual information and symbol error rates were estimated from Monte-Carlo simulations over the measured channel matrices. The results show the beam-space MIMO system and the conventional MIMO system exhibit similar finite-constellation capacity and error performance in NLOS scenarios when there is sufficient scattering in the channel. In comparison, in LOS channels, the capacity performance is observed to depend on the relative polarization of the receiving antennas.
△ Less
Submitted 30 August, 2016;
originally announced August 2016.
-
Near-Field Perturbation Effect on Constellation Error in Beam-Space MIMO
Authors:
Mohsen Yousefbeiki,
Juan R. Mosig,
Andreas P. Burg
Abstract:
Beam-space MIMO has recently been proposed as a promising solution to enable transmitting multiple data streams using a single RF chain and a single pattern-reconfigurable antenna. Since in a beam-space MIMO system radiation pattern of the transmit antenna is exploited as extra dimension for encoding information, near-field interaction of the transmit antenna with its surrounding objects affects s…
▽ More
Beam-space MIMO has recently been proposed as a promising solution to enable transmitting multiple data streams using a single RF chain and a single pattern-reconfigurable antenna. Since in a beam-space MIMO system radiation pattern of the transmit antenna is exploited as extra dimension for encoding information, near-field interaction of the transmit antenna with its surrounding objects affects spatial multiplexing performance of the system. Through numerical simulations in the previous work, it has been concluded that under BPSK signaling beam-space MIMO is not more vulnerable to near-field coupling than its conventional counterpart. In this work, we extend the study to the case of higher-order modulation schemes, where the presence of external perturbation also affects the data constellation points transmitted by a beam-space MIMO antenna. To this aim, the error vector magnitude of the transmitted signal is evaluated when placing a QPSK beam-space MIMO antenna in close proximity to a hand model of the user. The obtained results emphasize the importance of reconsidering the decoding approach for beam-space MIMO systems in practical applications.
△ Less
Submitted 1 August, 2016;
originally announced August 2016.
-
Hardware Decoders for Polar Codes: An Overview
Authors:
Pascal Giard,
Gabi Sarkis,
Alexios Balatsoukas-Stimming,
YouZhe Fan,
Chi-ying Tsui,
Andreas Burg,
Claude Thibeault,
Warren J. Gross
Abstract:
Polar codes are an exciting new class of error correcting codes that achieve the symmetric capacity of memoryless channels. Many decoding algorithms were developed and implemented, addressing various application requirements: from error-correction performance rivaling that of LDPC codes to very high throughput or low-complexity decoders. In this work, we review the state of the art in polar decode…
▽ More
Polar codes are an exciting new class of error correcting codes that achieve the symmetric capacity of memoryless channels. Many decoding algorithms were developed and implemented, addressing various application requirements: from error-correction performance rivaling that of LDPC codes to very high throughput or low-complexity decoders. In this work, we review the state of the art in polar decoders implementing the successive-cancellation, belief propagation, and list decoding algorithms, illustrating their advantages.
△ Less
Submitted 2 June, 2016;
originally announced June 2016.
-
Quantized Message Passing for LDPC Codes
Authors:
Michael Meidlinger,
Alexios Balatsoukas-Stimming,
Andreas Burg,
Gerald Matz
Abstract:
We propose a quantized decoding algorithm for low- density parity-check codes where the variable node update rule of the standard min-sum algorithm is replaced with a look-up table (LUT) that is designed using an information-theoretic criterion. We show that even with message resolutions as low as 3 bits, the proposed algorithm can achieve better error rates than a floating-point min-sum decoder.…
▽ More
We propose a quantized decoding algorithm for low- density parity-check codes where the variable node update rule of the standard min-sum algorithm is replaced with a look-up table (LUT) that is designed using an information-theoretic criterion. We show that even with message resolutions as low as 3 bits, the proposed algorithm can achieve better error rates than a floating-point min-sum decoder. Moreover, we study in detail the effect of different decoder design parameters, like the design SNR and the LUT tree structure on the performance of our decoder, and we propose some complexity reduction techniques, such as LUT re-use and message alphabet downsizing.
△ Less
Submitted 1 December, 2015;
originally announced December 2015.
-
A Fully-Unrolled LDPC Decoder Based on Quantized Message Passing
Authors:
Alexios Balatsoukas-Stimming,
Michael Meidlinger,
Reza Ghanaatian,
Gerald Matz,
Andreas Burg
Abstract:
In this paper, we propose a finite alphabet message passing algorithm for LDPC codes that replaces the standard min-sum variable node update rule by a map** based on generic look-up tables. This map** is designed in a way that maximizes the mutual information between the decoder messages and the codeword bits. We show that our decoder can deliver the same error rate performance as the conventi…
▽ More
In this paper, we propose a finite alphabet message passing algorithm for LDPC codes that replaces the standard min-sum variable node update rule by a map** based on generic look-up tables. This map** is designed in a way that maximizes the mutual information between the decoder messages and the codeword bits. We show that our decoder can deliver the same error rate performance as the conventional decoder with a much smaller message bit-width. Finally, we use the proposed algorithm to design a fully unrolled LDPC decoder hardware architecture.
△ Less
Submitted 15 October, 2015;
originally announced October 2015.
-
Faulty Successive Cancellation Decoding of Polar Codes for the Binary Erasure Channel
Authors:
Alexios Balatsoukas-Stimming,
Andreas Burg
Abstract:
In this paper, faulty successive cancellation decoding of polar codes for the binary erasure channel is studied. To this end, a simple erasure-based fault model is introduced to represent errors in the decoder and it is shown that, under this model, polarization does not happen, meaning that fully reliable communication is not possible at any rate. Furthermore, a lower bound on the frame error rat…
▽ More
In this paper, faulty successive cancellation decoding of polar codes for the binary erasure channel is studied. To this end, a simple erasure-based fault model is introduced to represent errors in the decoder and it is shown that, under this model, polarization does not happen, meaning that fully reliable communication is not possible at any rate. Furthermore, a lower bound on the frame error rate of polar codes under faulty SC decoding is provided, which is then used, along with a well-known upper bound, in order to choose a blocklength that minimizes the erasure probability under faulty decoding. Finally, an unequal error protection scheme that can re-enable asymptotically erasure-free transmission at a small rate loss and by protecting only a constant fraction of the decoder is proposed. The same scheme is also shown to significantly improve the finite-length performance of the faulty successive cancellation decoder by protecting as little as 1.5% of the decoder.
△ Less
Submitted 10 November, 2017; v1 submitted 20 May, 2015;
originally announced May 2015.
-
Cross-layer Energy-Efficiency Optimization of Packet Based Wireless MIMO Communication Systems
Authors:
Christian Senning,
Georgios Karakonstantis,
Andreas Burg
Abstract:
Energy in today's short-range wireless communication is mostly spent on the analog- and digital hardware rather than on radiated power. Hence, purely information-theoretic considerations fail to achieve the lowest energy per information bit and the optimization process must carefully consider the overall transceiver. In this paper, we propose to perform cross-layer optimization, based on an energy…
▽ More
Energy in today's short-range wireless communication is mostly spent on the analog- and digital hardware rather than on radiated power. Hence, purely information-theoretic considerations fail to achieve the lowest energy per information bit and the optimization process must carefully consider the overall transceiver. In this paper, we propose to perform cross-layer optimization, based on an energy-aware rate adaptation scheme combined with a physical layer that is able to properly adjust its processing effort to the data rate and the channel conditions to minimize the energy consumption per information bit. This energy proportional behavior is enabled by extending the classical system modes with additional configuration parameters at the various layers. Fine grained models of the power consumption of the hardware are developed to provide awareness of the physical layer capabilities to the medium access control layer. The joint application of the proposed energy-aware rate adaptation and modifications to the physical layer of an IEEE 802.11n system, improves energy-efficiency (averaged over many noise and channel realizations) in all considered scenarios by up to 44%.
△ Less
Submitted 1 April, 2015;
originally announced April 2015.
-
A Low-Complexity Improved Successive Cancellation Decoder for Polar Codes
Authors:
Orion Afisiadis,
Alexios Balatsoukas-Stimming,
Andreas Burg
Abstract:
Under successive cancellation (SC) decoding, polar codes are inferior to other codes of similar blocklength in terms of frame error rate. While more sophisticated decoding algorithms such as list- or stack-decoding partially mitigate this performance loss, they suffer from an increase in complexity. In this paper, we describe a new flavor of the SC decoder, called the SC flip decoder. Our algorith…
▽ More
Under successive cancellation (SC) decoding, polar codes are inferior to other codes of similar blocklength in terms of frame error rate. While more sophisticated decoding algorithms such as list- or stack-decoding partially mitigate this performance loss, they suffer from an increase in complexity. In this paper, we describe a new flavor of the SC decoder, called the SC flip decoder. Our algorithm preserves the low memory requirements of the basic SC decoder and adjusts the required decoding effort to the signal quality. In the waterfall region, its average computational complexity is almost as low as that of the SC decoder.
△ Less
Submitted 17 December, 2014;
originally announced December 2014.
-
Baseband and RF Hardware Impairments in Full-Duplex Wireless Systems: Experimental Characterisation and Suppression
Authors:
Alexios Balatsoukas-Stimming,
Andrew Charles Mallory Austin,
Pavle Belanovic,
Andreas Burg
Abstract:
Hardware imperfections can significantly reduce the performance of full-duplex wireless systems by introducing non-idealities and random effects that make it challenging to fully suppress self-interference. Previous research has mostly focused on analyzing the impact of hardware imperfections on full-duplex systems, based on simulations and theoretical models. In this paper, we follow a measuremen…
▽ More
Hardware imperfections can significantly reduce the performance of full-duplex wireless systems by introducing non-idealities and random effects that make it challenging to fully suppress self-interference. Previous research has mostly focused on analyzing the impact of hardware imperfections on full-duplex systems, based on simulations and theoretical models. In this paper, we follow a measurement-based approach to experimentally identify and isolate these hardware imperfections leading to residual self-interference in full-duplex nodes. Our measurements show the important role of images arising from in-phase and quadrature (IQ) imbalance in the mixers. We also observe base-band non-linearities in the digital-to-analog converters (DAC), which can introduce strong harmonic components that have not been previously considered. A corresponding general mathematical model to suppress these components of the self-interference signal arising from the hardware non-idealities is developed from the observations and measurements. Results from a 10 MHz bandwidth full-duplex OFDM system, operating at 2.48 GHz, show up to 13 dB additional suppression, relative to state-of-the-art implementations can be achieved by jointly compensating for IQ imbalance and DAC non-linearities.
△ Less
Submitted 15 December, 2014;
originally announced December 2014.
-
On Metric Sorting for Successive Cancellation List Decoding of Polar Codes
Authors:
Alexios Balatsoukas-Stimming,
Mani Bastani Parizi,
Andreas Burg
Abstract:
We focus on the metric sorter unit of successive cancellation list decoders for polar codes, which lies on the critical path in all current hardware implementations of the decoder. We review existing metric sorter architectures and we propose two new architectures that exploit the structure of the path metrics in a log-likelihood ratio based formulation of successive cancellation list decoding. Ou…
▽ More
We focus on the metric sorter unit of successive cancellation list decoders for polar codes, which lies on the critical path in all current hardware implementations of the decoder. We review existing metric sorter architectures and we propose two new architectures that exploit the structure of the path metrics in a log-likelihood ratio based formulation of successive cancellation list decoding. Our synthesis results show that, for the list size of $L=32$, our first proposed sorter is $14\%$ faster and $45\%$ smaller than existing sorters, while for smaller list sizes, our second sorter has a higher delay in return for up to $36\%$ reduction in the area.
△ Less
Submitted 26 January, 2015; v1 submitted 16 October, 2014;
originally announced October 2014.
-
A Signal Processor for Gaussian Message Passing
Authors:
Harald Kröll,
Stefan Zwicky,
Reto Odermatt,
Lukas Bruderer,
Andreas Burg,
Qiuting Huang
Abstract:
In this paper, we present a novel signal processing unit built upon the theory of factor graphs, which is able to address a wide range of signal processing algorithms. More specifically, the demonstrated factor graph processor (FGP) is tailored to Gaussian message passing algorithms. We show how to use a highly configurable systolic array to solve the message update equations of nodes in a factor…
▽ More
In this paper, we present a novel signal processing unit built upon the theory of factor graphs, which is able to address a wide range of signal processing algorithms. More specifically, the demonstrated factor graph processor (FGP) is tailored to Gaussian message passing algorithms. We show how to use a highly configurable systolic array to solve the message update equations of nodes in a factor graph efficiently. A proper instruction set and compilation procedure is presented. In a recursive least squares channel estimation example we show that the FGP can compute a message update faster than a state-ofthe- art DSP. The results demonstrate the usabilty of the FGP architecture as a flexible HW accelerator for signal-processing and communication systems.
△ Less
Submitted 11 April, 2014;
originally announced April 2014.
-
Faulty Successive Cancellation Decoding of Polar Codes for the Binary Erasure Channel
Authors:
Alexios Balatsoukas-Stimming,
Andreas Burg
Abstract:
We study faulty successive cancellation decoding of polar codes for the binary erasure channel. To this end, we introduce a simple erasure-based fault model and we show that, under this model, polarization does not happen, meaning that fully reliable communication is not possible at any rate. Moreover, we provide numerical results for the frame erasure rate and bit erasure rate and we study an une…
▽ More
We study faulty successive cancellation decoding of polar codes for the binary erasure channel. To this end, we introduce a simple erasure-based fault model and we show that, under this model, polarization does not happen, meaning that fully reliable communication is not possible at any rate. Moreover, we provide numerical results for the frame erasure rate and bit erasure rate and we study an unequal error protection scheme that can significantly improve the performance of the faulty successive cancellation decoder with negligible overhead.
△ Less
Submitted 9 February, 2015; v1 submitted 7 April, 2014;
originally announced April 2014.
-
Density Evolution for Min-Sum Decoding of LDPC Codes Under Unreliable Message Storage
Authors:
Alexios Balatsoukas-Stimming,
Andreas Burg
Abstract:
We analyze the performance of quantized min-sum decoding of low-density parity-check codes under unreliable message storage. To this end, we introduce a simple bit-level error model and show that decoder symmetry is preserved under this model. Subsequently, we formulate the corresponding density evolution equations to predict the average bit error probability in the limit of infinite blocklength.…
▽ More
We analyze the performance of quantized min-sum decoding of low-density parity-check codes under unreliable message storage. To this end, we introduce a simple bit-level error model and show that decoder symmetry is preserved under this model. Subsequently, we formulate the corresponding density evolution equations to predict the average bit error probability in the limit of infinite blocklength. We present numerical threshold results and we show that using more quantization bits is not always beneficial in the context of faulty decoders.
△ Less
Submitted 6 April, 2014;
originally announced April 2014.
-
LLR-based Successive Cancellation List Decoding of Polar Codes
Authors:
Alexios Balatsoukas-Stimming,
Mani Bastani Parizi,
Andreas Burg
Abstract:
We show that successive cancellation list decoding can be formulated exclusively using log-likelihood ratios. In addition to numerical stability, the log-likelihood ratio based formulation has useful properties which simplify the sorting step involved in successive cancellation list decoding. We propose a hardware architecture of the successive cancellation list decoder in the log-likelihood ratio…
▽ More
We show that successive cancellation list decoding can be formulated exclusively using log-likelihood ratios. In addition to numerical stability, the log-likelihood ratio based formulation has useful properties which simplify the sorting step involved in successive cancellation list decoding. We propose a hardware architecture of the successive cancellation list decoder in the log-likelihood ratio domain which, compared to a log-likelihood domain implementation, requires less irregular and smaller memories. This simplification together with the gains in the metric sorter, lead to $56\%$ to $137\%$ higher throughput per unit area than other recently proposed architectures. We then evaluate the empirical performance of the CRC-aided successive cancellation list decoder at different list sizes using different CRCs and conclude that it is important to adapt the CRC length to the list size in order to achieve the best error-rate performance of concatenated polar codes. Finally, we synthesize conventional successive cancellation decoders at large block-lengths with the same block-error probability as our proposed CRC-aided successive cancellation list decoders to demonstrate that, while our decoders have slightly lower throughput and larger area, they have a significantly smaller decoding latency.
△ Less
Submitted 6 March, 2015; v1 submitted 31 October, 2013;
originally announced January 2014.
-
A fast and versatile QKD system with hardware key distillation and wavelength multiplexing
Authors:
Nino Walenta,
Andreas Burg,
Dario Caselunghe,
Jeremy Constantin,
Nicolas Gisin,
Olivier Guinnard,
Raphael Houlmann,
Pascal Junod,
Boris Korzh,
Natalia Kulesza,
Matthieu Legré,
Charles Ci Wen Lim,
Tommaso Lunghi,
Laurent Monat,
Christopher Portmann,
Mathilde Soucarros,
Patrick Trinkler,
Gregory Trolliet,
Fabien Vannel,
Hugo Zbinden
Abstract:
We present a 625 MHz clocked coherent one-way quantum key distribution (QKD) system which continuously distributes secret keys over an optical fibre link. To support high secret key rates, we implemented a fast hardware key distillation engine which allows for key distillation rates up to 4 Mbps in real time. The system employs wavelength multiplexing in order to run over only a single optical fib…
▽ More
We present a 625 MHz clocked coherent one-way quantum key distribution (QKD) system which continuously distributes secret keys over an optical fibre link. To support high secret key rates, we implemented a fast hardware key distillation engine which allows for key distillation rates up to 4 Mbps in real time. The system employs wavelength multiplexing in order to run over only a single optical fibre and is compactly integrated in 19-inch 2U racks. We optimized the system considering a security analysis that respects finite-key-size effects, authentication costs, and system errors. Using fast gated InGaAs single photon detectors, we reliably distribute secret keys with rates up to 140 kbps and over 25 km of optical fibre, for a security parameter of 4E-9.
△ Less
Submitted 11 September, 2013; v1 submitted 10 September, 2013;
originally announced September 2013.
-
Enabling Complexity-Performance Trade-Offs for Successive Cancellation Decoding of Polar Codes
Authors:
Alexios Balatsoukas-Stimming,
Georgios Karakonstantis,
Andreas Burg
Abstract:
Polar codes are one of the most recent advancements in coding theory and they have attracted significant interest. While they are provably capacity achieving over various channels, they have seen limited practical applications. Unfortunately, the successive nature of successive cancellation based decoders hinders fine-grained adaptation of the decoding complexity to design constraints and operatin…
▽ More
Polar codes are one of the most recent advancements in coding theory and they have attracted significant interest. While they are provably capacity achieving over various channels, they have seen limited practical applications. Unfortunately, the successive nature of successive cancellation based decoders hinders fine-grained adaptation of the decoding complexity to design constraints and operating conditions. In this paper, we propose a systematic method for enabling complexity-performance trade-offs by constructing polar codes based on an optimization problem which minimizes the complexity under a suitably defined mutual information based performance constraint. Moreover, a low-complexity greedy algorithm is proposed in order to solve the optimization problem efficiently for very large code lengths.
△ Less
Submitted 9 April, 2014; v1 submitted 18 July, 2013;
originally announced July 2013.