-
High-Throughput Parallel Viterbi Decoder on GPU Tensor Cores
Abstract: Many research works have been performed on implementation of Vitrerbi decoding algorithm on GPU instead of FPGA because this platform provides considerable flexibility in addition to great performance. Recently, the recently-introduced Tensor cores in modern GPU architectures provide incredible computing capability. This paper proposes a novel parallel implementation of Viterbi decoding algorithm… ▽ More
Submitted 27 November, 2020; originally announced November 2020.
Comments: arXiv admin note: substantial text overlap with arXiv:2011.09337
-
High-Throughput and Memory-Efficient Parallel Viterbi Decoder for Convolutional Codes on GPU
Abstract: This paper describes a parallel implementation of Viterbi decoding algorithm. Viterbi decoder is widely used in many state-of-the-art wireless systems. The proposed solution optimizes both throughput and memory usage by applying optimizations such as unified kernel implementation and parallel traceback. Experimental evaluations show that the proposed solution achieves higher throughput compared to… ▽ More
Submitted 18 November, 2020; originally announced November 2020.