-
A Comprehensive Convolutional Neural Network Architecture Design using Magnetic Skyrmion and Domain Wall
Authors:
Saumya Gupta,
Venkatesh Vadde,
Bhaskaran Muralidharan,
Abhishek Sharma
Abstract:
Spintronic-based neuromorphic hardware enables high-density and rapid data processing at nanoscale lengths. leveraged by the topologically protected spin configurations and low current densities to manipulate magnetic structures such as skyrmion and domain wall. The paper presents a compact, energy-efficient multi-bit skyrmionic synapse and domain wall-based ReLU with max-pooling functionalities f…
▽ More
Spintronic-based neuromorphic hardware enables high-density and rapid data processing at nanoscale lengths. leveraged by the topologically protected spin configurations and low current densities to manipulate magnetic structures such as skyrmion and domain wall. The paper presents a compact, energy-efficient multi-bit skyrmionic synapse and domain wall-based ReLU with max-pooling functionalities for hardware neural network applications. A 4-bit,5-bit, and 6-bit skyrmionic synapse is proposed, featuring a circular bilayer vortex-based geometry. The 4-bit skyrmionic synapse consumes an ultra-low energy of 0.8724 fJ per weight update. The proposed skyrmionic synapse comprises an ultra-thin ferromagnetic layer with a strong Dzyaloshinskii-Moriya interaction and a polarizer layer with a vortex-like spin configuration. The interaction between perpendicular current flow and the labyrinth maze-like uniaxial anisotropy profiles induce skyrmionic gyration, resulting in long-term potentiation (LTP) and long-term depression (LTD) that modifies the synaptic weights. We develop a phenomenology of the synaptic device, implementing 16-state (4-bit), 32-state (5-bit), and 64-state (6-bit) skyrmionic synapses, analyzing them quantitatively using micromagnetics simulations. Furthermore, we design a CMOS hybrid domain wall-based ReLU-max pooled circuit. The activation function works on the variation of the domain wall position implying variation in the device resistance on encountering uniaxial anisotropy variation along the track. To demonstrate the practical application of our 4-bit (16-state) skyrmionic synapse with domain wall-based ReLU-Max Pooling circuit we integrate it into an inference-based convolutional neural network (CNN) for pattern recognition, achieving a comparable accuracy of 98.07% to software-based ideal training.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Domain wall and Magnetic Tunnel Junction Hybrid for on-chip Learning in UNet architecture
Authors:
Venkatesh Vadde,
Bhaskaran Muralidharan,
Abhishek Sharma
Abstract:
We present spintronic devices based hardware implementation of UNet for segmentation tasks. Our approach involves designing hardware for convolution, deconvolution, rectified activation function (ReLU), and max pooling layers of the UNet architecture. We designed the convolution and deconvolution layers of the network using the synaptic behavior of the domain wall MTJ. We also construct the ReLU a…
▽ More
We present spintronic devices based hardware implementation of UNet for segmentation tasks. Our approach involves designing hardware for convolution, deconvolution, rectified activation function (ReLU), and max pooling layers of the UNet architecture. We designed the convolution and deconvolution layers of the network using the synaptic behavior of the domain wall MTJ. We also construct the ReLU and max pooling functions of the network utilizing the spin hall driven orthogonal current injected MTJ. To incorporate the diverse physics of spin-transport, magnetization dynamics, and CMOS elements in our UNet design, we employ a hybrid simulation setup that couples micromagnetic simulation, non-equilibrium Green's function, SPICE simulation along with network implementation. We evaluate our UNet design on the CamVid dataset and achieve segmentation accuracies of 83.71$\%$ on test data, on par with the software implementation with 821mJ of energy consumption for on-chip training over 150 epochs. We further demonstrate nearly one order $(10\times)$ improvement in the energy requirement of the network using unstable ferromagnet ($Δ$=4.58) over the stable ferromagnet ($Δ$=45) based ReLU and max pooling functions while maintaining the similar accuracy. The hybrid architecture comprising domain wall MTJ and unstable FM-based MTJ leads to an on-chip energy consumption of 85.79mJ during training, with a testing energy cost of 1.55 $μJ$.
△ Less
Submitted 11 July, 2024; v1 submitted 5 March, 2024;
originally announced March 2024.
-
Power efficient ReLU design for neuromorphic computing using spin Hall effect
Authors:
Venkatesh Vadde,
Bhaskaran Muralidharan,
Abhishek Sharma
Abstract:
We demonstrate a magnetic tunnel junction injected with spin Hall current to exhibit linear rotation of magnetization of the free-ferromagnet using only the spin current. Using the linear resistance change of the MTJ, we devise a circuit for the rectified linear activation (ReLU) function of the artificial neuron. We explore the role of different spin Hall effect (SHE) heavy metal layers on the po…
▽ More
We demonstrate a magnetic tunnel junction injected with spin Hall current to exhibit linear rotation of magnetization of the free-ferromagnet using only the spin current. Using the linear resistance change of the MTJ, we devise a circuit for the rectified linear activation (ReLU) function of the artificial neuron. We explore the role of different spin Hall effect (SHE) heavy metal layers on the power consumption of the ReLU circuit. We benchmark the power consumption of the ReLU circuit with different SHE layers by defining a new parameter called the spin Hall power factor. It combines the spin Hall angle, resistivity, and thickness of the heavy metal layer, which translates to the power consumption of the different SHE layers during spin-orbit switching/rotation of the free FM. We employ a hybrid spintronics-CMOS simulation framework that couples Keldysh non-equilibrium Green's function formalism with Landau-Lifshitz-Gilbert-Slonzewski equations and the HSPICE circuit simulator to account for diverse physics of spin-transport and the CMOS elements in our proposed ReLU design. We also demonstrate the robustness of the proposed ReLU circuit against thermal noise and non-trivial power-error trade-off that enables the use of an unstable free-ferromagnet for energy-efficient design. Using the proposed circuit, we evaluate the performance of the convolutional neural network for MNIST datasets and demonstrate comparable classification accuracies to the ideal ReLU with an energy consumption of 75 $pJ$ per sample.
△ Less
Submitted 11 March, 2023;
originally announced March 2023.
-
Orthogonal Spin Current Injected Magnetic Tunnel Junction for Convolutional Neural Networks
Authors:
Venkatesh Vadde,
Bhaskaran Muralidharan,
Abhishek Sharma
Abstract:
We propose that a spin Hall effect driven magnetic tunnel junction device can be engineered to provide a continuous change in the resistance across it when injected with orthogonal spin currents. Using this concept, we develop a hybrid device-circuit simulation platform to design a network that realizes multiple functionalities of a convolutional neural network. At the atomistic level, we use the…
▽ More
We propose that a spin Hall effect driven magnetic tunnel junction device can be engineered to provide a continuous change in the resistance across it when injected with orthogonal spin currents. Using this concept, we develop a hybrid device-circuit simulation platform to design a network that realizes multiple functionalities of a convolutional neural network. At the atomistic level, we use the Keldysh non-equilibrium Green's function technique that is coupled self-consistently with the stochastic Landau-Lifshitz-Gilbert-Slonczewski equations, which in turn is coupled with the HSPICE circuit simulator. We demonstrate the simultaneous functionality of the proposed network to evaluate the rectified linear unit and max-pooling functionalities. We present a detailed power and error analysis of the designed network against the thermal stability factor of the free ferromagnets. Our results show that there exists a non-trivial power-error trade-off in the proposed network, which enables an energy-efficient network design based on unstable free ferromagnets with reliable outputs. The static power for the proposed ReLU circuit is $0.56μW$ and whereas the energy cost of a nine-input rectified linear unit-max-pooling network with an unstable free ferromagnet($Δ=15$) is $3.4pJ$ in the worst-case scenario. We also rationalize the magnetization stability of the proposed device by analyzing the vanishing torque gradient points.
△ Less
Submitted 4 February, 2023; v1 submitted 29 July, 2022;
originally announced July 2022.