THOR -- A Neuromorphic Processor with 7.29G TSOP$^2$/mm$^2$Js Energy-Throughput Efficiency
Authors:
Mayank Senapati,
Manil Dev Gomony,
Sherif Eissa,
Charlotte Frenkel,
Henk Corporaal
Abstract:
Neuromorphic computing using biologically inspired Spiking Neural Networks (SNNs) is a promising solution to meet Energy-Throughput (ET) efficiency needed for edge computing devices. Neuromorphic hardware architectures that emulate SNNs in analog/mixed-signal domains have been proposed to achieve order-of-magnitude higher energy efficiency than all-digital architectures, however at the expense of…
▽ More
Neuromorphic computing using biologically inspired Spiking Neural Networks (SNNs) is a promising solution to meet Energy-Throughput (ET) efficiency needed for edge computing devices. Neuromorphic hardware architectures that emulate SNNs in analog/mixed-signal domains have been proposed to achieve order-of-magnitude higher energy efficiency than all-digital architectures, however at the expense of limited scalability, susceptibility to noise, complex verification, and poor flexibility. On the other hand, state-of-the-art digital neuromorphic architectures focus either on achieving high energy efficiency (Joules/synaptic operation (SOP)) or throughput efficiency (SOPs/second/area), resulting in poor ET efficiency. In this work, we present THOR, an all-digital neuromorphic processor with a novel memory hierarchy and neuron update architecture that addresses both energy consumption and throughput bottlenecks. We implemented THOR in 28nm FDSOI CMOS technology and our post-layout results demonstrate an ET efficiency of 7.29G $\text{TSOP}^2/\text{mm}^2\text{Js}$ at 0.9V, 400 MHz, which represents a 3X improvement over state-of-the-art digital neuromorphic processors.
△ Less
Submitted 3 December, 2022;
originally announced December 2022.
CONVOLVE: Smart and seamless design of smart edge processors
Authors:
M. Gomony,
F. Putter,
A. Gebregiorgis,
G. Paulin,
L. Mei,
V. Jain,
S. Hamdioui,
V. Sanchez,
T. Grosser,
M. Geilen,
M. Verhelst,
F. Zenke,
F. Gurkaynak,
B. Bruin,
S. Stuijk,
S. Davidson,
S. De,
M. Ghogho,
A. Jimborean,
S. Eissa,
L. Benini,
D. Soudris,
R. Bishnoi,
S. Ainsworth,
F. Corradi
, et al. (3 additional authors not shown)
Abstract:
With the rise of Deep Learning (DL), our world braces for AI in every edge device, creating an urgent need for edge-AI SoCs. This SoC hardware needs to support high throughput, reliable and secure AI processing at Ultra Low Power (ULP), with a very short time to market. With its strong legacy in edge solutions and open processing platforms, the EU is well-positioned to become a leader in this SoC…
▽ More
With the rise of Deep Learning (DL), our world braces for AI in every edge device, creating an urgent need for edge-AI SoCs. This SoC hardware needs to support high throughput, reliable and secure AI processing at Ultra Low Power (ULP), with a very short time to market. With its strong legacy in edge solutions and open processing platforms, the EU is well-positioned to become a leader in this SoC market. However, this requires AI edge processing to become at least 100 times more energy-efficient, while offering sufficient flexibility and scalability to deal with AI as a fast-moving target. Since the design space of these complex SoCs is huge, advanced tooling is needed to make their design tractable. The CONVOLVE project (currently in Inital stage) addresses these roadblocks. It takes a holistic approach with innovations at all levels of the design hierarchy. Starting with an overview of SOTA DL processing support and our project methodology, this paper presents 8 important design choices largely impacting the energy efficiency and flexibility of DL hardware. Finding good solutions is key to making smart-edge computing a reality.
△ Less
Submitted 2 May, 2023; v1 submitted 1 December, 2022;
originally announced December 2022.
A Prioritized Access Point Algorithm for 802.11b Networks in a Lossy Environment
Authors:
A. N. Omara,
Sherine M. Abd El-Kader,
Hussein S. Eissa,
S. El-Ramly
Abstract:
In recent years, WLAN technology has been gaining popularity around the world with its sub standard 802.11b receiving major deployments in many indoor and outdoor environments. In this article we investigate the performance of IEEE 802.11b infrastructure networks in the lossless and lossy environments by means of a simulation study. Also, this study shows how the FIFO discipline of the 802.11b MAC…
▽ More
In recent years, WLAN technology has been gaining popularity around the world with its sub standard 802.11b receiving major deployments in many indoor and outdoor environments. In this article we investigate the performance of IEEE 802.11b infrastructure networks in the lossless and lossy environments by means of a simulation study. Also, this study shows how the FIFO discipline of the 802.11b MAC affects on the global performance when at least one channel is under the influence of the bursty errors. Furthermore, this paper proposes a channel aware backoff algorithm for the Access Point (AP) to prioritize its transmissions and to accelerate the transmissions in the poor radio channels to enhance the performance of the real time applications. The final results of this simulation study showed that the proposed algorithm is able to enhance the throughput and the delay in lossy environment by an average of 49% and 83% respectively.
△ Less
Submitted 21 May, 2010;
originally announced May 2010.