Showing 1–18 of 18 results for author: Maida, A S

  1. arXiv:2301.04794

    cs.LG

    LiteLSTM Architecture Based on Weights Sharing for Recurrent Neural Networks

    Authors: Nelly Elsayed, Zag ElSayed, Anthony S. Maida

    Abstract: Long short-term memory (LSTM) is one of the robust recurrent neural network architectures for learning sequential data. However, it requires considerable computational power to learn and implement both software and hardware aspects. This paper proposed a novel LiteLSTM architecture based on reducing the LSTM computation components via the weights sharing concept to reduce the overall architecture…

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: Under the second reviewing round in the SN Computer Science Journal. Extended version of the LiteLSTM Architecture for Deep Recurrent Neural Networks paper that was published in the IEEE ISCAS 2022 conference. arXiv admin note: substantial text overlap with arXiv:2201.11624

  2. arXiv:2301.04631

    cs.CV eess.IV

    Deep Residual Axial Networks

    Authors: Nazmul Shahadat, Anthony S. Maida

    Abstract: While convolutional neural networks (CNNs) demonstrate outstanding performance on computer vision tasks, their computational costs remain high. Several techniques are used to reduce these costs, like reducing channel count, and using separable and depthwise separable convolutions. This paper reduces computational costs by introducing a novel architecture, axial CNNs, which replaces spatial 2D conv…

    Submitted 17 March, 2023; v1 submitted 11 January, 2023; originally announced January 2023.

  3. arXiv:2301.04626

    cs.CV

    Deep Axial Hypercomplex Networks

    Authors: Nazmul Shahadat, Anthony S. Maida

    Abstract: Over the past decade, deep hypercomplex-inspired networks have enhanced feature extraction for image classification by enabling weight sharing across input channels. Recent works make it possible to improve representational capabilities by using hypercomplex-inspired networks, which incur high computational costs. This paper reduces this cost by factorizing a quaternion 2D convolutional module in…

    Submitted 11 January, 2023; originally announced January 2023.

  4. arXiv:2301.04623

    cs.CV

    Enhancing ResNet Image Classification Performance by using Parameterized Hypercomplex Multiplication

    Authors: Nazmul Shahadat, Anthony S. Maida

    Abstract: Recently, many deep networks have introduced hypercomplex and related calculations into their architectures. In regard to convolutional networks for classification, these enhancements have been applied to the convolution operations in the frontend to enhance accuracy and/or reduce the parameter requirements while maintaining accuracy. Although these enhancements have been applied to the convolutio…

    Submitted 11 January, 2023; originally announced January 2023.

  5. Vision-Based American Sign Language Classification Approach via Deep Learning

    Authors: Nelly Elsayed, Zag ElSayed, Anthony S. Maida

    Abstract: Hearing impairment is a partial or total loss of hearing that causes significant problems for communication with other people in society. American Sign Language (ASL) is one of the sign languages most commonly used by hearing-impaired communities to communicate with each other. In this paper, we proposed a simple deep learning model that aims to classify the American…

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: 4 pages. Accepted in the Florida AI Research Society (FLAIRS-35) 2022

  6. arXiv:2201.11624

    cs.LG cs.AI

    LiteLSTM Architecture for Deep Recurrent Neural Networks

    Authors: Nelly Elsayed, Zag ElSayed, Anthony S. Maida

    Abstract: Long short-term memory (LSTM) is a robust recurrent neural network architecture for learning spatiotemporal sequential data. However, it requires significant computational power for learning and implementing from both software and hardware aspects. This paper proposes a novel LiteLSTM architecture based on reducing the computation components of the LSTM using the weights sharing concept to reduce…

    Submitted 24 October, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: Accepted in the IEEE International Symposium on Circuits and Systems (ISCAS) 2022

  7. arXiv:2110.01185

    cs.CV cs.LG

    Improving Axial-Attention Network Classification via Cross-Channel Weight Sharing

    Authors: Nazmul Shahadat, Anthony S. Maida

    Abstract: In recent years, hypercomplex-inspired neural networks (HCNNs) have been used to improve deep learning architectures due to their ability to enable channel-based weight sharing, treat colors as a single entity, and improve representational coherence within the layers. The work described herein studies the effect of replacing existing layers in an Axial Attention network with their representational…

    Submitted 12 January, 2023; v1 submitted 4 October, 2021; originally announced October 2021.

  8. arXiv:2101.06850

    cs.LG cs.AI

    Stacked LSTM Based Deep Recurrent Neural Network with Kalman Smoothing for Blood Glucose Prediction

    Authors: Md Fazle Rabby, Yazhou Tu, Md Imran Hossen, Insup Le, Anthony S Maida, Xiali Hei

    Abstract: Blood glucose (BG) management is crucial for type-1 diabetes patients resulting in the necessity of reliable artificial pancreas or insulin infusion systems. In recent years, deep learning techniques have been utilized for a more accurate BG level prediction system. However, continuous glucose monitoring (CGM) readings are susceptible to sensor errors. As a result, inaccurate CGM readings would af…

    Submitted 17 January, 2021; originally announced January 2021.

  9. arXiv:2009.04083

    cs.NE cs.CV eess.IV

    Generalizing Complex/Hyper-complex Convolutions to Vector Map Convolutions

    Authors: Chase J Gaudet, Anthony S Maida

    Abstract: We show that the core reasons that complex and hypercomplex valued neural networks offer improvements over their real-valued counterparts are the weight sharing mechanism and treating multidimensional data as a single entity. Their algebra linearly combines the dimensions, making each dimension related to the others. However, both are constrained to a set number of dimensions, two for complex and f…

    Submitted 8 September, 2020; originally announced September 2020.

  10. arXiv:1909.05622

    cs.CV cs.LG stat.ML

    Inception-inspired LSTM for Next-frame Video Prediction

    Authors: Matin Hosseini, Anthony S. Maida, Majid Hosseini, Gottumukkala Raju

    Abstract: The problem of video frame prediction has received much interest due to its relevance to many computer vision applications such as autonomous vehicles or robotics. Supervised methods for video frame prediction rely on labeled data, which may not always be available. In this paper, we provide a novel unsupervised deep-learning method called Inception-based LSTM for video frame prediction. The gener…

    Submitted 24 April, 2020; v1 submitted 27 August, 2019; originally announced September 2019.

  11. Deep Gated Recurrent and Convolutional Network Hybrid Model for Univariate Time Series Classification

    Authors: Nelly Elsayed, Anthony S. Maida, Magdy Bayoumi

    Abstract: Hybrid LSTM-fully convolutional networks (LSTM-FCN) for time series classification have produced state-of-the-art classification results on univariate time series. We show that replacing the LSTM with a gated recurrent unit (GRU) to create a GRU-fully convolutional network hybrid model (GRU-FCN) can offer even better performance on many time series datasets. The proposed GRU-FCN model outperforms…

    Submitted 19 February, 2019; v1 submitted 18 December, 2018; originally announced December 2018.

    Comments: The paper has been modified and includes several new results

    Journal ref: International Journal of Advanced Computer Science and Applications (IJACSA), 10(5), 2019

  12. arXiv:1810.07251

    cs.LG cs.CV stat.ML

    Reduced-Gate Convolutional LSTM Using Predictive Coding for Spatiotemporal Prediction

    Authors: Nelly Elsayed, Anthony S. Maida, Magdy Bayoumi

    Abstract: Spatiotemporal sequence prediction is an important problem in deep learning. We study next-frame(s) video prediction using a deep-learning-based predictive coding framework that uses convolutional, long short-term memory (convLSTM) modules. We introduce a novel reduced-gate convolutional LSTM (rgcLSTM) architecture that requires a significantly lower parameter budget than a comparable convLSTM. By…

    Submitted 22 December, 2019; v1 submitted 16 October, 2018; originally announced October 2018.

    Comments: A novel rgcLSTM model for spatiotemporal prediction. This version contains the full description and detailed empirical study of the rgcLSTM architecture. 28 pages, 12 figures, 20 tables

    Report number: COIN12277

  13. Deep Learning in Spiking Neural Networks

    Authors: Amirhossein Tavanaei, Masoud Ghodrati, Saeed Reza Kheradpisheh, Timothee Masquelier, Anthony S. Maida

    Abstract: In recent years, deep learning has been a revolution in the field of machine learning, for computer vision in particular. In this approach, a deep (multilayer) artificial neural network (ANN) is trained in a supervised manner using backpropagation. Huge amounts of labeled examples are required, but the resulting classification accuracy is truly impressive, sometimes outperforming humans. Neurons i…

    Submitted 20 January, 2019; v1 submitted 22 April, 2018; originally announced April 2018.

    Journal ref: Neural Networks (2018)

  14. arXiv:1711.04214

    cs.NE

    BP-STDP: Approximating Backpropagation using Spike Timing Dependent Plasticity

    Authors: Amirhossein Tavanaei, Anthony S. Maida

    Abstract: The problem of training spiking neural networks (SNNs) is a necessary precondition to understanding computations within the brain, a field still in its infancy. Previous work has shown that supervised learning in multi-layer SNNs enables bio-inspired networks to recognize patterns of stimuli through hierarchical feature acquisition. Although gradient descent has shown impressive performance in mul…

    Submitted 9 March, 2018; v1 submitted 11 November, 2017; originally announced November 2017.

  15. arXiv:1611.03000

    cs.NE

    Bio-Inspired Spiking Convolutional Neural Network using Layer-wise Sparse Coding and STDP Learning

    Authors: Amirhossein Tavanaei, Anthony S. Maida

    Abstract: Hierarchical feature discovery using non-spiking convolutional neural networks (CNNs) has attracted much recent interest in machine learning and computer vision. However, it is still not well understood how to create a biologically plausible network of brain-like, spiking neurons with multi-layer, unsupervised learning. This paper explores a novel bio-inspired spiking CNN that is trained in a gree…

    Submitted 23 June, 2017; v1 submitted 9 November, 2016; originally announced November 2016.

  16. Acquisition of Visual Features Through Probabilistic Spike-Timing-Dependent Plasticity

    Authors: Amirhossein Tavanaei, Timothee Masquelier, Anthony S Maida

    Abstract: The final version of this paper has been published in IEEEXplore available at http://ieeexplore.ieee.org/document/7727213. Please cite this paper as: Amirhossein Tavanaei, Timothee Masquelier, and Anthony Maida, Acquisition of visual features through probabilistic spike-timing-dependent plasticity. IEEE International Joint Conference on Neural Networks. pp. 307-314, IJCNN 2016. This paper explor…

    Submitted 8 November, 2016; v1 submitted 3 June, 2016; originally announced June 2016.

    Comments: IEEE-IJCNN 2016

    Journal ref: 2016 International Joint Conference on Neural Networks

  17. arXiv:1606.00825

    cs.NE

    Training a Hidden Markov Model with a Bayesian Spiking Neural Network

    Authors: Amirhossein Tavanaei, Anthony S Maida

    Abstract: It is of some interest to understand how statistically based mechanisms for signal processing might be integrated with biologically motivated mechanisms such as neural networks. This paper explores a novel hybrid approach for classifying segments of sequential data, such as individual spoken words. The approach combines a hidden Markov model (HMM) with a spiking neural network (SNN). The HMM, cons…

    Submitted 20 July, 2016; v1 submitted 2 June, 2016; originally announced June 2016.

    Comments: Bayesian Spiking Neural Network, Revision submitted: April-27-2016

    Journal ref: Journal of Signal Processing Systems, (2016), 1-10

  18. A Spiking Network that Learns to Extract Spike Signatures from Speech Signals

    Authors: Amirhossein Tavanaei, Anthony S Maida

    Abstract: Spiking neural networks (SNNs) with adaptive synapses reflect core properties of biological neural networks. Speech recognition, as an application involving audio coding and dynamic learning, provides a good test problem to study SNN functionality. We present a simple, novel, and efficient nonrecurrent SNN that learns to convert a speech signal into a spike train signature. The signature is distin…

    Submitted 11 March, 2017; v1 submitted 2 June, 2016; originally announced June 2016.

    Comments: Published in Neurocomputing Journal, Elsevier

    Journal ref: Neurocomputing, 140:191-199, 2017