-
Ordered Reliability Direct Error Pattern Testing Decoding Algorithm
Authors:
Reza Hadavian,
Xiaoting Huang,
Dmitri Truhachev,
Kamal El-Sankary,
Hamid Ebrahimzad,
Hossein Najafi
Abstract:
We introduce a novel universal soft-decision decoding algorithm for binary block codes called ordered reliability direct error pattern testing (ORDEPT). Our results, obtained for a variety of popular short high-rate codes, demonstrate that ORDEPT outperforms state-of-the-art decoding algorithms of comparable complexity such as ordered reliability bits guessing random additive noise decoding (ORBGR…
▽ More
We introduce a novel universal soft-decision decoding algorithm for binary block codes called ordered reliability direct error pattern testing (ORDEPT). Our results, obtained for a variety of popular short high-rate codes, demonstrate that ORDEPT outperforms state-of-the-art decoding algorithms of comparable complexity such as ordered reliability bits guessing random additive noise decoding (ORBGRAND) in terms of the decoding error probability and latency. The improvements carry on to the iterative decoding of product codes and convolutional product-like codes, where we present a new adaptive decoding algorithm and demonstrate the ability of ORDEPT to efficiently find multiple candidate codewords to produce soft output.
△ Less
Submitted 18 October, 2023;
originally announced October 2023.
-
Subtractor-Based CNN Inference Accelerator
Authors:
Victor Gao,
Issam Hammad,
Kamal El-Sankary,
Jason Gu
Abstract:
This paper presents a novel method to boost the performance of CNN inference accelerators by utilizing subtractors. The proposed CNN preprocessing accelerator relies on sorting, grou**, and rounding the weights to create combinations that allow for the replacement of one multiplication operation and addition operation by a single subtraction operation when applying convolution during inference.…
▽ More
This paper presents a novel method to boost the performance of CNN inference accelerators by utilizing subtractors. The proposed CNN preprocessing accelerator relies on sorting, grou**, and rounding the weights to create combinations that allow for the replacement of one multiplication operation and addition operation by a single subtraction operation when applying convolution during inference. Given the high cost of multiplication in terms of power and area, replacing it with subtraction allows for a performance boost by reducing power and area. The proposed method allows for controlling the trade-off between performance gains and accuracy loss through increasing or decreasing the usage of subtractors. With a rounding size of 0.05 and by utilizing LeNet-5 with the MNIST dataset, the proposed design can achieve 32.03% power savings and a 24.59% reduction in area at the cost of only 0.1% in terms of accuracy loss.
△ Less
Submitted 2 October, 2023;
originally announced October 2023.
-
Deep Learning Training with Simulated Approximate Multipliers
Authors:
Issam Hammad,
Kamal El-Sankary,
Jason Gu
Abstract:
This paper presents by simulation how approximate multipliers can be utilized to enhance the training performance of convolutional neural networks (CNNs). Approximate multipliers have significantly better performance in terms of speed, power, and area compared to exact multipliers. However, approximate multipliers have an inaccuracy which is defined in terms of the Mean Relative Error (MRE). To as…
▽ More
This paper presents by simulation how approximate multipliers can be utilized to enhance the training performance of convolutional neural networks (CNNs). Approximate multipliers have significantly better performance in terms of speed, power, and area compared to exact multipliers. However, approximate multipliers have an inaccuracy which is defined in terms of the Mean Relative Error (MRE). To assess the applicability of approximate multipliers in enhancing CNN training performance, a simulation for the impact of approximate multipliers error on CNN training is presented. The paper demonstrates that using approximate multipliers for CNN training can significantly enhance the performance in terms of speed, power, and area at the cost of a small negative impact on the achieved accuracy. Additionally, the paper proposes a hybrid training method which mitigates this negative impact on the accuracy. Using the proposed hybrid method, the training can start using approximate multipliers then switches to exact multipliers for the last few epochs. Using this method, the performance benefits of approximate multipliers in terms of speed, power, and area can be attained for a large portion of the training stage. On the other hand, the negative impact on the accuracy is diminished by using the exact multipliers for the last epochs of training.
△ Less
Submitted 18 April, 2020; v1 submitted 26 December, 2019;
originally announced January 2020.
-
A Comparative Study on Machine Learning Algorithms for the Control of a Wall Following Robot
Authors:
Issam Hammad,
Kamal El-Sankary,
Jason Gu
Abstract:
A comparison of the performance of various machine learning models to predict the direction of a wall following robot is presented in this paper. The models were trained using an open-source dataset that contains 24 ultrasound sensors readings and the corresponding direction for each sample. This dataset was captured using SCITOS G5 mobile robot by placing the sensors on the robot waist. In additi…
▽ More
A comparison of the performance of various machine learning models to predict the direction of a wall following robot is presented in this paper. The models were trained using an open-source dataset that contains 24 ultrasound sensors readings and the corresponding direction for each sample. This dataset was captured using SCITOS G5 mobile robot by placing the sensors on the robot waist. In addition to the full format with 24 sensors per record, the dataset has two simplified formats with 4 and 2 input sensor readings per record. Several control models were proposed previously for this dataset using all three dataset formats. In this paper, two primary research contributions are presented. First, presenting machine learning models with accuracies higher than all previously proposed models for this dataset using all three formats. A perfect solution for the 4 and 2 inputs sensors formats is presented using Decision Tree Classifier by achieving a mean accuracy of 100%. On the other hand, a mean accuracy of 99.82% was achieves using the 24 sensor inputs by employing the Gradient Boost Classifier. Second, presenting a comparative study on the performance of different machine learning and deep learning algorithms on this dataset. Therefore, providing an overall insight on the performance of these algorithms for similar sensor fusion problems. All the models in this paper were evaluated using Monte-Carlo cross-validation.
△ Less
Submitted 18 April, 2020; v1 submitted 26 December, 2019;
originally announced December 2019.
-
Gemini Infrared Multi-Object Spectrograph: Instrument Overview
Authors:
Suresh Sivanandam,
Scott Chapman,
Luc Simard,
Paul Hickson,
Kim Venn,
Simon Thibault,
Marcin Sawicki,
Adam Muzzin,
Darren Erickson,
Roberto Abraham,
Masayuki Akiyama,
David Andersen,
Colin Bradley,
Raymond Carlberg,
Shaojie Chen,
Carlos Correia,
Tim Davidge,
Sara Ellison,
Kamal El-Sankary,
Gregory Fahlman,
Masen Lamb,
Olivier Lardiere,
Marie Lemoine-Busserolle,
Dae-Sik Moon,
Norman Murray
, et al. (5 additional authors not shown)
Abstract:
The Gemini Infrared Multi-Object Spectrograph (GIRMOS) is a powerful new instrument being built to facility-class standards for the Gemini telescope. It takes advantage of the latest developments in adaptive optics and integral field spectrographs. GIRMOS will carry out simultaneous high-angular-resolution, spatially-resolved infrared ($1-2.4$ $μ$m) spectroscopy of four objects within a two-arcmin…
▽ More
The Gemini Infrared Multi-Object Spectrograph (GIRMOS) is a powerful new instrument being built to facility-class standards for the Gemini telescope. It takes advantage of the latest developments in adaptive optics and integral field spectrographs. GIRMOS will carry out simultaneous high-angular-resolution, spatially-resolved infrared ($1-2.4$ $μ$m) spectroscopy of four objects within a two-arcminute field-of-regard by taking advantage of multi-object adaptive optics. This capability does not currently exist anywhere in the world and therefore offers significant scientific gains over a very broad range of topics in astronomical research. For example, current programs for high redshift galaxies are pushing the limits of what is possible with infrared spectroscopy at $8-10$-meter class facilities by requiring up to several nights of observing time per target. Therefore, the observation of multiple objects simultaneously with adaptive optics is absolutely necessary to make effective use of telescope time and obtain statistically significant samples for high redshift science. With an expected commissioning date of 2023, GIRMOS's capabilities will also make it a key followup instrument for the James Webb Space Telescope when it is launched in 2021, as well as a true scientific and technical pathfinder for future Thirty Meter Telescope (TMT) multi-object spectroscopic instrumentation. In this paper, we will present an overview of this instrument's capabilities and overall architecture. We also highlight how this instrument lays the ground work for a future TMT early-light instrument.
△ Less
Submitted 3 August, 2018; v1 submitted 10 July, 2018;
originally announced July 2018.