Convolutional Networks for Fast, Energy-Efficient Neuromorphic Computing
Authors:
Steven K. Esser,
Paul A. Merolla,
John V. Arthur,
Andrew S. Cassidy,
Rathinakumar Appuswamy,
Alexander Andreopoulos,
David J. Berg,
Jeffrey L. McKinstry,
Timothy Melano,
Davis R. Barch,
Carmelo di Nolfo,
Pallab Datta,
Arnon Amir,
Brian Taba,
Myron D. Flickner,
Dharmendra S. Modha
Abstract:
Deep networks are now able to achieve human-level performance on a broad spectrum of recognition tasks. Independently, neuromorphic computing has now demonstrated unprecedented energy-efficiency through a new chip architecture based on spiking neurons, low precision synapses, and a scalable communication network. Here, we demonstrate that neuromorphic computing, despite its novel architectural pri…
▽ More
Deep networks are now able to achieve human-level performance on a broad spectrum of recognition tasks. Independently, neuromorphic computing has now demonstrated unprecedented energy-efficiency through a new chip architecture based on spiking neurons, low precision synapses, and a scalable communication network. Here, we demonstrate that neuromorphic computing, despite its novel architectural primitives, can implement deep convolution networks that i) approach state-of-the-art classification accuracy across 8 standard datasets, encompassing vision and speech, ii) perform inference while preserving the hardware's underlying energy-efficiency and high throughput, running on the aforementioned datasets at between 1200 and 2600 frames per second and using between 25 and 275 mW (effectively > 6000 frames / sec / W) and iii) can be specified and trained using backpropagation with the same ease-of-use as contemporary deep learning. For the first time, the algorithmic power of deep learning can be merged with the efficiency of neuromorphic processors, bringing the promise of embedded, intelligent, brain-inspired computing one step closer.
△ Less
Submitted 24 May, 2016; v1 submitted 27 March, 2016;
originally announced March 2016.
Gibbs Sampling with Low-Power Spiking Digital Neurons
Authors:
Srinjoy Das,
Bruno Umbria Pedroni,
Paul Merolla,
John Arthur,
Andrew S. Cassidy,
Bryan L. Jackson,
Dharmendra Modha,
Gert Cauwenberghs,
Ken Kreutz-Delgado
Abstract:
Restricted Boltzmann Machines and Deep Belief Networks have been successfully used in a wide variety of applications including image classification and speech recognition. Inference and learning in these algorithms uses a Markov Chain Monte Carlo procedure called Gibbs sampling. A sigmoidal function forms the kernel of this sampler which can be realized from the firing statistics of noisy integrat…
▽ More
Restricted Boltzmann Machines and Deep Belief Networks have been successfully used in a wide variety of applications including image classification and speech recognition. Inference and learning in these algorithms uses a Markov Chain Monte Carlo procedure called Gibbs sampling. A sigmoidal function forms the kernel of this sampler which can be realized from the firing statistics of noisy integrate-and-fire neurons on a neuromorphic VLSI substrate. This paper demonstrates such an implementation on an array of digital spiking neurons with stochastic leak and threshold properties for inference tasks and presents some key performance metrics for such a hardware-based sampler in both the generative and discriminative contexts.
△ Less
Submitted 27 March, 2015; v1 submitted 26 March, 2015;
originally announced March 2015.
Querying Databases of Annotated Speech
Authors:
Steve Cassidy,
Steven Bird
Abstract:
Annotated speech corpora are databases consisting of signal data along with time-aligned symbolic `transcriptions'. Such databases are typically multidimensional, heterogeneous and dynamic. These properties present a number of tough challenges for representation and query. The temporal nature of the data adds an additional layer of complexity. This paper presents and harmonises two independent e…
▽ More
Annotated speech corpora are databases consisting of signal data along with time-aligned symbolic `transcriptions'. Such databases are typically multidimensional, heterogeneous and dynamic. These properties present a number of tough challenges for representation and query. The temporal nature of the data adds an additional layer of complexity. This paper presents and harmonises two independent efforts to model annotated speech databases, one at Macquarie University and one at the University of Pennsylvania. Various query languages are described, along with illustrative applications to a variety of analytical problems. The research reported here forms a part of several ongoing projects to develop platform-independent open-source tools for creating, browsing, searching, querying and transforming linguistic databases, and to disseminate large linguistic databases over the internet.
△ Less
Submitted 11 April, 2002;
originally announced April 2002.