Search | arXiv e-print repository

arXiv:2005.04167 [pdf, ps, other]

doi 10.1145/3407197.3407213

Continuous Learning in a Single-Incremental-Task Scenario with Spike Features

Authors: Ruthvik Vaila, John Chiasson, Vishal Saxena

Abstract: Deep Neural Networks (DNNs) have two key deficiencies: their dependence on high-precision computing and their inability to perform sequential learning. That is, when a DNN is trained on a first task and the same DNN is then trained on the next task, it forgets the first task. This phenomenon of forgetting previous tasks is also referred to as catastrophic forgetting. On the other hand, a mammalian brain outperforms DNNs in terms of energy efficiency and the ability to learn sequentially without catastrophically forgetting. Here, we use bio-inspired Spike Timing Dependent Plasticity (STDP) in the feature extraction layers of the network with instantaneous neurons to extract meaningful features. In the classification sections of the network we use a modified synaptic intelligence, which we refer to as the cost per synapse metric, as a regularizer to immunize the network against catastrophic forgetting in a Single-Incremental-Task (SIT) scenario. In this study, we use the MNIST handwritten digits dataset, divided into five sub-tasks.

Submitted 3 May, 2020; originally announced May 2020.

Comments: Submitted to ICONS 2020

Journal ref: International Conference on Neuromorphic Systems 2020

arXiv:2002.11843 [pdf, ps, other]

A Deep Unsupervised Feature Learning Spiking Neural Network with Binarized Classification Layers for EMNIST Classification using SpykeFlow

Authors: Ruthvik Vaila, John Chiasson, Vishal Saxena

Abstract: End-user AI is trained on large server farms with data collected from the users. With ever-increasing demand for IoT devices, there is a need for deep learning approaches that can be implemented (at the edge) in an energy-efficient manner. In this work we approach this using spiking neural networks. The unsupervised learning technique of spike timing dependent plasticity (STDP) with binary activations is used to extract features from spiking input data. Gradient descent (backpropagation) is used only on the output layer to perform the training for classification. The accuracies obtained for the balanced EMNIST data set compare favorably with other approaches. The effect of stochastic gradient descent (SGD) approximations on the learning capabilities of our network is also explored.

Submitted 28 October, 2020; v1 submitted 26 February, 2020; originally announced February 2020.

Comments: A section of this work is submitted to IEEE TETCI 2020 Journal

arXiv:2002.11044 [pdf, ps, other]

Regression with Deep Learning for Sensor Performance Optimization

Authors: Ruthvik Vaila, Denver Lloyd, Kevin Tetz

Abstract: Neural networks with at least two hidden layers are called deep networks. Recent developments in AI and computer programming in general have led to the development of tools such as TensorFlow, Keras, and NumPy, making it easier to model and draw conclusions from data. In this work we re-approach non-linear regression with deep learning enabled by Keras and TensorFlow. In particular, we use deep learning to parametrize a non-linear multivariate relationship between the inputs and outputs of an industrial sensor, with the intent of optimizing the sensor performance based on selected key metrics.

Submitted 27 March, 2021; v1 submitted 22 February, 2020; originally announced February 2020.

Comments: Accepted in Workshop on Microelectronics and Electron Devices March 30th, 2020

Journal ref: Workshop on Microelectronics and Electron Devices. March 30th, 2020

arXiv:1903.12272 [pdf, other]

Deep Convolutional Spiking Neural Networks for Image Classification

Authors: Ruthvik Vaila, John Chiasson, Vishal Saxena

Abstract: Spiking neural networks are biologically plausible counterparts of artificial neural networks. Artificial neural networks are usually trained with stochastic gradient descent, whereas spiking neural networks are trained with spike timing dependent plasticity. Training deep convolutional neural networks is a memory- and power-intensive job. Spiking networks could potentially help in reducing the power usage. There is a large pool of tools to choose from for training artificial neural networks of any size; on the other hand, the available tools for simulating spiking neural networks are geared towards computational neuroscience applications and are not suitable for real-life applications. In this work we focus on implementing a spiking CNN using TensorFlow to examine the behaviour of the network and empirically study the effect of various parameters on its learning capabilities. We also study catastrophic forgetting in the spiking CNN and the weight initialization problem in R-STDP using the MNIST and N-MNIST data sets.

Submitted 25 September, 2019; v1 submitted 28 March, 2019; originally announced March 2019.

Showing 1–4 of 4 results for author: Vaila, R