Skip to main content

Showing 1–30 of 30 results for author: Eshraghian, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.02528  [pdf, other

    cs.CL

    Scalable MatMul-free Language Modeling

    Authors: Rui-Jie Zhu, Yu Zhang, Ethan Sifferman, Tyler Sheaves, Yiqiao Wang, Dustin Richmond, Peng Zhou, Jason K. Eshraghian

    Abstract: Matrix multiplication (MatMul) typically dominates the overall computational cost of large language models (LLMs). This cost only grows as LLMs scale to larger embedding dimensions and context lengths. In this work, we show that MatMul operations can be completely eliminated from LLMs while maintaining strong performance at billion-parameter scales. Our experiments show that our proposed MatMul-fr… ▽ More

    Submitted 18 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  2. arXiv:2405.19687  [pdf, other

    cs.NE cs.CV

    Autonomous Driving with Spiking Neural Networks

    Authors: Rui-Jie Zhu, Ziqing Wang, Leilani Gilpin, Jason K. Eshraghian

    Abstract: Autonomous driving demands an integrated approach that encompasses perception, prediction, and planning, all while operating under strict energy constraints to enhance scalability and environmental sustainability. We present Spiking Autonomous Driving (SAD), the first unified Spiking Neural Network (SNN) to address the energy challenges faced by autonomous driving systems through its event-driven… ▽ More

    Submitted 30 May, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2405.13672  [pdf, other

    cs.CV

    Advancing Spiking Neural Networks towards Multiscale Spatiotemporal Interaction Learning

    Authors: Yimeng Shan, Malu Zhang, Rui-jie Zhu, Xuerui Qiu, Jason K. Eshraghian, Haicheng Qu

    Abstract: Recent advancements in neuroscience research have propelled the development of Spiking Neural Networks (SNNs), which not only have the potential to further advance neuroscience research but also serve as an energy-efficient alternative to Artificial Neural Networks (ANNs) due to their spike-driven characteristics. However, previous studies often neglected the multiscale information and its spatiot… ▽ More

    Submitted 27 May, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  4. arXiv:2404.19668  [pdf, other

    cs.NE cs.LG

    SQUAT: Stateful Quantization-Aware Training in Recurrent Spiking Neural Networks

    Authors: Sreyes Venkatesh, Razvan Marinescu, Jason K. Eshraghian

    Abstract: Weight quantization is used to deploy high-performance deep learning models on resource-limited hardware, enabling the use of low-precision integers for storage and computation. Spiking neural networks (SNNs) share the goal of enhancing efficiency, but adopt an 'event-driven' approach to reduce the power consumption of neural network inference. While extensive research has focused on weight quanti… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 10 pages, 4 figures, accepted at NICE 2024

  5. arXiv:2311.14641  [pdf, other

    cs.NE

    Neuromorphic Intermediate Representation: A Unified Instruction Set for Interoperable Brain-Inspired Computing

    Authors: Jens E. Pedersen, Steven Abreu, Matthias Jobst, Gregor Lenz, Vittorio Fra, Felix C. Bauer, Dylan R. Muir, Peng Zhou, Bernhard Vogginger, Kade Heckel, Gianvito Urgese, Sadasivan Shankar, Terrence C. Stewart, Jason K. Eshraghian, Sadique Sheik

    Abstract: Spiking neural networks and neuromorphic hardware platforms that emulate neural dynamics are slowly gaining momentum and entering main-stream usage. Despite a well-established mathematical foundation for neural dynamics, the implementation details vary greatly across different platforms. Correspondingly, there are a plethora of software and hardware implementations with their own unique technology… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: NIR is available at https://github.com/neuromorphs/NIR

  6. arXiv:2311.06570  [pdf, other

    cs.CV

    SynA-ResNet: Spike-driven ResNet Achieved through OR Residual Connection

    Authors: Yimeng Shan, Xuerui Qiu, Rui-jie Zhu, Jason K. Eshraghian, Malu Zhang, Haicheng Qu

    Abstract: Spiking Neural Networks (SNNs) have garnered substantial attention in brain-like computing for their biological fidelity and the capacity to execute energy-efficient spike-driven operations. As the demand for heightened performance in SNNs surges, the trend towards training deeper networks becomes imperative, while residual learning stands as a pivotal method for training deep neural networks. In… ▽ More

    Submitted 7 July, 2024; v1 submitted 11 November, 2023; originally announced November 2023.

    Comments: 12 pages, 5 figures and 10 tables

  7. arXiv:2307.12471  [pdf, other

    cs.AR

    Neuromorphic Neuromodulation: Towards the next generation of on-device AI-revolution in electroceuticals

    Authors: Luis Fernando Herbozo Contreras, Nhan Duy Truong, Jason K. Eshraghian, Zhangyu Xu, Zhao**g Huang, Armin Nikpour, Omid Kavehei

    Abstract: Neuromodulation techniques have emerged as promising approaches for treating a wide range of neurological disorders, precisely delivering electrical stimulation to modulate abnormal neuronal activity. While leveraging the unique capabilities of artificial intelligence (AI) holds immense potential for responsive neurostimulation, it appears as an extremely challenging proposition where real-time (l… ▽ More

    Submitted 28 July, 2023; v1 submitted 23 July, 2023; originally announced July 2023.

  8. arXiv:2306.15749  [pdf, other

    cs.NE cs.AI cs.AR cs.LG

    To Spike or Not To Spike: A Digital Hardware Perspective on Deep Learning Acceleration

    Authors: Fabrizio Ottati, Chang Gao, Qinyu Chen, Giovanni Brignone, Mario R. Casu, Jason K. Eshraghian, Luciano Lavagno

    Abstract: As deep learning models scale, they become increasingly competitive from domains spanning from computer vision to natural language processing; however, this happens at the expense of efficiency since they require increasingly more memory and computing power. The power efficiency of the biological brain outperforms any large-scale deep learning ( DL ) model; thus, neuromorphic computing tries to mi… ▽ More

    Submitted 28 January, 2024; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: Fixed error in bio

  9. arXiv:2306.12676  [pdf, other

    cond-mat.dis-nn cs.AI

    Memristive Reservoirs Learn to Learn

    Authors: Ruomin Zhu, Jason K. Eshraghian, Zdenka Kuncic

    Abstract: Memristive reservoirs draw inspiration from a novel class of neuromorphic hardware known as nanowire networks. These systems display emergent brain-like dynamics, with optimal performance demonstrated at dynamical phase transitions. In these networks, a limited number of electrodes are available to modulate system dynamics, in contrast to the global controllability offered by neuromorphic hardware… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

    Comments: 7 pages, 6 figures, ICONS 2023, accepted

  10. arXiv:2304.11056  [pdf, other

    cs.CR cs.LG

    PowerGAN: A Machine Learning Approach for Power Side-Channel Attack on Compute-in-Memory Accelerators

    Authors: Ziyu Wang, Yuting Wu, Yongmo Park, Sangmin Yoo, Xinxin Wang, Jason K. Eshraghian, Wei D. Lu

    Abstract: Analog compute-in-memory (CIM) systems are promising for deep neural network (DNN) inference acceleration due to their energy efficiency and high throughput. However, as the use of DNNs expands, protecting user input privacy has become increasingly important. In this paper, we identify a potential security vulnerability wherein an adversary can reconstruct the user's private input data from a powe… ▽ More

    Submitted 27 May, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

  11. arXiv:2302.13939  [pdf, other

    cs.CL cs.LG cs.NE

    SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks

    Authors: Rui-Jie Zhu, Qihang Zhao, Guoqi Li, Jason K. Eshraghian

    Abstract: As the size of large language models continue to scale, so does the computational resources required to run it. Spiking Neural Networks (SNNs) have emerged as an energy-efficient approach to deep learning that leverage sparse and event-driven activations to reduce the computational overhead associated with model inference. While they have become competitive with non-spiking models on many computer… ▽ More

    Submitted 26 June, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

  12. arXiv:2302.01015  [pdf, other

    cs.AR cs.NE

    OpenSpike: An OpenRAM SNN Accelerator

    Authors: Farhad Modaresi, Matthew Guthaus, Jason K. Eshraghian

    Abstract: This paper presents a spiking neural network (SNN) accelerator made using fully open-source EDA tools, process design kit (PDK), and memory macros synthesized using OpenRAM. The chip is taped out in the 130 nm SkyWater process and integrates over 1 million synaptic weights, and offers a reprogrammable architecture. It operates at a clock speed of 40 MHz, a supply of 1.8 V, uses a PicoRV32 core for… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

    Comments: The design is open sourced and available online: https://github.com/sfmth/OpenSpike

  13. arXiv:2211.10725  [pdf, other

    cs.LG

    Intelligence Processing Units Accelerate Neuromorphic Learning

    Authors: Pao-Sheng Vincent Sun, Alexander Titterton, Anjlee Gopiani, Tim Santos, Arindam Basu, Wei D. Lu, Jason K. Eshraghian

    Abstract: Spiking neural networks (SNNs) have achieved orders of magnitude improvement in terms of energy consumption and latency when performing inference with deep learning workloads. Error backpropagation is presently regarded as the most effective method for training SNNs, but in a twist of irony, when training on modern graphics processing units (GPUs) this becomes more expensive than non-spiking netwo… ▽ More

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: 10 pages, 9 figures, journal

  14. arXiv:2210.03515  [pdf, other

    cs.NE cs.LG

    Spiking neural networks for nonlinear regression

    Authors: Alexander Henkes, Jason K. Eshraghian, Henning Wessels

    Abstract: Spiking neural networks, also often referred to as the third generation of neural networks, carry the potential for a massive reduction in memory and energy consumption over traditional, second-generation neural networks. Inspired by the undisputed efficiency of the human brain, they introduce temporal and neuronal sparsity, which can be exploited by next-generation neuromorphic hardware. To open… ▽ More

    Submitted 26 October, 2022; v1 submitted 6 October, 2022; originally announced October 2022.

  15. Side-channel attack analysis on in-memory computing architectures

    Authors: Ziyu Wang, Fan-hsuan Meng, Yongmo Park, Jason K. Eshraghian, Wei D. Lu

    Abstract: In-memory computing (IMC) systems have great potential for accelerating data-intensive tasks such as deep neural networks (DNNs). As DNN models are generally highly proprietary, the neural network architectures become valuable targets for attacks. In IMC systems, since the whole model is mapped on chip and weight memory read can be restricted, the pre-mapped DNN model acts as a ``black box'' for u… ▽ More

    Submitted 25 March, 2023; v1 submitted 6 September, 2022; originally announced September 2022.

    Journal ref: IEEE Transactions on Emerging Topics in Computing (2023)

  16. arXiv:2206.12992  [pdf, other

    cs.NE cs.AI cs.AR cs.ET

    Gradient-based Neuromorphic Learning on Dynamical RRAM Arrays

    Authors: Peng Zhou, Jason K. Eshraghian, Dong-Uk Choi, Wei D. Lu, Sung-Mo Kang

    Abstract: We present MEMprop, the adoption of gradient-based learning to train fully memristive spiking neural networks (MSNNs). Our approach harnesses intrinsic device dynamics to trigger naturally arising voltage spikes. These spikes emitted by memristive dynamics are analog in nature, and thus fully differentiable, which eliminates the need for surrogate gradient methods that are prevalent in the spiking… ▽ More

    Submitted 26 June, 2022; originally announced June 2022.

  17. arXiv:2203.01426  [pdf, other

    cs.NE cs.AI cs.ET

    SPICEprop: Backpropagating Errors Through Memristive Spiking Neural Networks

    Authors: Peng Zhou, Jason K. Eshraghian, Dong-Uk Choi, Sung-Mo Kang

    Abstract: We present a fully memristive spiking neural network (MSNN) consisting of novel memristive neurons trained using the backpropagation through time (BPTT) learning rule. Gradient descent is applied directly to the memristive integrated-and-fire (MIF) neuron designed using analog SPICE circuit models, which generates distinct depolarization, hyperpolarization, and repolarization voltage waveforms. Sy… ▽ More

    Submitted 9 March, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

  18. arXiv:2203.01416  [pdf, other

    cs.NE cs.AI cs.ET

    A Fully Memristive Spiking Neural Network with Unsupervised Learning

    Authors: Peng Zhou, Dong-Uk Choi, Jason K. Eshraghian, Sung-Mo Kang

    Abstract: We present a fully memristive spiking neural network (MSNN) consisting of physically-realizable memristive neurons and memristive synapses to implement an unsupervised Spiking Time Dependent Plasticity (STDP) learning rule. The system is fully memristive in that both neuronal and synaptic dynamics can be realized by using memristors. The neuron is implemented using the SPICE-level memristive integ… ▽ More

    Submitted 9 March, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

  19. arXiv:2202.07221  [pdf, other

    cs.LG cs.NE

    Navigating Local Minima in Quantized Spiking Neural Networks

    Authors: Jason K. Eshraghian, Corey Lammie, Mostafa Rahimi Azghadi, Wei D. Lu

    Abstract: Spiking and Quantized Neural Networks (NNs) are becoming exceedingly important for hyper-efficient implementations of Deep Learning (DL) algorithms. However, these networks face challenges when trained using error backpropagation, due to the absence of gradient signals when applying hard thresholds. The broadly accepted trick to overcoming this is through the use of biased gradient estimators: sur… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

  20. arXiv:2201.11915  [pdf, other

    cs.NE cs.LG q-bio.NC

    The fine line between dead neurons and sparsity in binarized spiking neural networks

    Authors: Jason K. Eshraghian, Wei D. Lu

    Abstract: Spiking neural networks can compensate for quantization error by encoding information either in the temporal domain, or by processing discretized quantities in hidden states of higher precision. In theory, a wide dynamic range state-space enables multiple binarized inputs to be accumulated together, thus improving the representational capacity of individual neurons. This may be achieved by increas… ▽ More

    Submitted 27 January, 2022; originally announced January 2022.

  21. arXiv:2201.06703  [pdf, other

    cs.ET cs.AI cs.AR

    Design Space Exploration of Dense and Sparse Map** Schemes for RRAM Architectures

    Authors: Corey Lammie, Jason K. Eshraghian, Chenqi Li, Amirali Amirsoleimani, Roman Genov, Wei D. Lu, Mostafa Rahimi Azghadi

    Abstract: The impact of device and circuit-level effects in mixed-signal Resistive Random Access Memory (RRAM) accelerators typically manifest as performance degradation of Deep Learning (DL) algorithms, but the degree of impact varies based on algorithmic features. These include network architecture, capacity, weight distribution, and the type of inter-layer connections. Techniques are continuously emergin… ▽ More

    Submitted 24 January, 2022; v1 submitted 17 January, 2022; originally announced January 2022.

    Comments: Accepted at 2022 IEEE International Symposium on Circuits and Systems (ISCAS). [v2] Fixed incorrectly labeled author affiliations for Chenqi Li, Amirali Amirsoleimani, and Roman Genov

  22. arXiv:2109.12894  [pdf, other

    cs.NE cs.ET cs.LG

    Training Spiking Neural Networks Using Lessons From Deep Learning

    Authors: Jason K. Eshraghian, Max Ward, Emre Neftci, Xinxin Wang, Gregor Lenz, Girish Dwivedi, Mohammed Bennamoun, Doo Seok Jeong, Wei D. Lu

    Abstract: The brain is the perfect place to look for inspiration to develop more efficient neural networks. The inner workings of our synapses and neurons provide a glimpse at what the future of deep learning might look like. This paper serves as a tutorial and perspective showing how to apply the lessons learnt from several decades of research in deep learning, gradient descent, backpropagation and neurosc… ▽ More

    Submitted 13 August, 2023; v1 submitted 27 September, 2021; originally announced September 2021.

  23. arXiv:2104.10297  [pdf, other

    cs.ET

    FPGA Synthesis of Ternary Memristor-CMOS Decoders

    Authors: Xiaoyuan Wang, Zhiru Wu, Pengfei Zhou, Herbert H. C. Iu, Jason K. Eshraghian, Sung Mo Kang

    Abstract: The search for a compatible application of memristor-CMOS logic gates has remained elusive, as the data density benefits are offset by slow switching speeds and resistive dissipation. Active microdisplays typically prioritize pixel density (and therefore resolution) over that of speed, where the most widely used refresh rates fall between 25-240 Hz. Therefore, memristor-CMOS logic is a promising f… ▽ More

    Submitted 20 April, 2021; originally announced April 2021.

  24. arXiv:2103.06506  [pdf, other

    cs.ET cs.AI cs.AR cs.LG

    Memristive Stochastic Computing for Deep Learning Parameter Optimization

    Authors: Corey Lammie, Jason K. Eshraghian, Wei D. Lu, Mostafa Rahimi Azghadi

    Abstract: Stochastic Computing (SC) is a computing paradigm that allows for the low-cost and low-power computation of various arithmetic operations using stochastic bit streams and digital logic. In contrast to conventional representation schemes used within the binary domain, the sequence of bit streams in the stochastic domain is inconsequential, and computation is usually non-deterministic. In this brief… ▽ More

    Submitted 11 March, 2021; originally announced March 2021.

    Comments: Accepted by IEEE Transactions on Circuits and Systems Part II: Express Briefs

    Journal ref: IEEE Transactions on Circuits and Systems Part II: Express Briefs, 2021

  25. arXiv:2102.06536  [pdf, other

    cs.AR eess.IV eess.SP

    CrossStack: A 3-D Reconfigurable RRAM Crossbar Inference Engine

    Authors: Jason K. Eshraghian, Kyoungrok Cho, Sung Mo Kang

    Abstract: Deep neural network inference accelerators are rapidly growing in importance as we turn to massively parallelized processing beyond GPUs and ASICs. The dominant operation in feedforward inference is the multiply-and-accumlate process, where each column in a crossbar generates the current response of a single neuron. As a result, memristor crossbar arrays parallelize inference and image processing… ▽ More

    Submitted 7 February, 2021; originally announced February 2021.

    Comments: 5 pages, 4 figures

  26. arXiv:2007.05657  [pdf, other

    cs.AR cs.LG eess.SP

    Hardware Implementation of Deep Network Accelerators Towards Healthcare and Biomedical Applications

    Authors: Mostafa Rahimi Azghadi, Corey Lammie, Jason K. Eshraghian, Melika Payvand, Elisa Donati, Bernabe Linares-Barranco, Giacomo Indiveri

    Abstract: The advent of dedicated Deep Learning (DL) accelerators and neuromorphic processors has brought on new opportunities for applying both Deep and Spiking Neural Network (SNN) algorithms to healthcare and biomedical applications at the edge. This can facilitate the advancement of medical Internet of Things (IoT) systems and Point of Care (PoC) devices. In this paper, we provide a tutorial describing… ▽ More

    Submitted 28 April, 2021; v1 submitted 10 July, 2020; originally announced July 2020.

    Comments: Accepted by IEEE Transactions on Biomedical Circuits and Systems (21 pages, 10 figures, 5 tables)

    Journal ref: IEEE Transactions on Biomedical Circuits and Systems, 2020

  27. arXiv:1906.09395  [pdf, other

    eess.SP cs.AR eess.IV

    Adaptive Precision CNN Accelerator Using Radix-X Parallel Connected Memristor Crossbars

    Authors: Jaeheum Lee, Jason K. Eshraghian, Kyoungrok Cho, Kamran Eshraghian

    Abstract: Neural processor development is reducing our reliance on remote server access to process deep learning operations in an increasingly edge-driven world. By employing in-memory processing, parallelization techniques, and algorithm-hardware co-design, memristor crossbar arrays are known to efficiently compute large scale matrix-vector multiplications. However, state-of-the-art implementations of nega… ▽ More

    Submitted 22 June, 2019; originally announced June 2019.

    Comments: 12 pages, 17 figures

  28. arXiv:1108.4182  [pdf, other

    cond-mat.mtrl-sci cs.ET

    Memristor-based Synaptic Networks and Logical Operations Using In-Situ Computing

    Authors: Omid Kavehei, Said Al-Sarawi, Kyoung-Rok Cho, Nicolangelo Iannella, Sung-** Kim, Kamran Eshraghian, Derek Abbott

    Abstract: We present new computational building blocks based on memristive devices. These blocks, can be used to implement either supervised or unsupervised learning modules. This is achieved using a crosspoint architecture which is an efficient array implementation for nanoscale two-terminal memristive devices. Based on these blocks and an experimentally verified SPICE macromodel for the memristor, we demo… ▽ More

    Submitted 21 August, 2011; originally announced August 2011.

    Comments: 18 pages, 7 figures, 2 tables

  29. arXiv:1108.3716  [pdf, other

    cond-mat.mtrl-sci cs.ET

    Non-volatile Complementary Resistive Switch-based Content Addressable Memory

    Authors: Omid Kavehei, Said Al-Sarawi, Sharath Sriram, Madhu Bhaskaran, Kyoung-Rok Cho, Kamran Eshraghian, Derek Abbott

    Abstract: This paper presents a novel resistive-only Binary and Ternary Content Addressable Memory (B/TCAM) cell that consists of two Complementary Resistive Switches (CRSs). The operation of such a cell relies on a logic$\rightarrow$ON state transition that enables this novel CRS application.

    Submitted 16 October, 2011; v1 submitted 18 August, 2011; originally announced August 2011.

    Comments: 4 pages, 4 figures

  30. arXiv:1106.2927  [pdf, other

    cond-mat.mtrl-sci cs.ET

    An Analytical Approach for Memristive Nanoarchitectures

    Authors: Omid Kavehei, Said Al-Sarawi, Kyoung-Rok Cho, Kamran Eshraghian, Derek Abbott

    Abstract: As conventional memory technologies are challenged by their technological physical limits, emerging technologies driven by novel materials are becoming an attractive option for future memory architectures. Among these technologies, Resistive Memories (ReRAM) created new possibilities because of their nano-features and unique $I$-$V$ characteristics. One particular problem that limits the maximum a… ▽ More

    Submitted 13 December, 2011; v1 submitted 15 June, 2011; originally announced June 2011.

    Comments: 12 pages, 10 figures, 4 tables