Skip to main content

Showing 1–25 of 25 results for author: Kudithipudi, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.13347  [pdf, other

    cs.LG cs.AI cs.AR

    Time-Series Forecasting and Sequence Learning Using Memristor-based Reservoir System

    Authors: Abdullah M. Zyarah, Dhireesha Kudithipudi

    Abstract: Pushing the frontiers of time-series information processing in ever-growing edge devices with stringent resources has been impeded by the system's ability to process information and learn locally on the device. Local processing and learning typically demand intensive computations and massive storage as the process involves retrieving information and tuning hundreds of parameters back in time. In t… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  2. arXiv:2404.10758  [pdf, other

    cs.CV

    Watch Your Step: Optimal Retrieval for Continual Learning at Scale

    Authors: Truman Hickok, Dhireesha Kudithipudi

    Abstract: In continual learning, a model learns incrementally over time while minimizing interference between old and new tasks. One of the most widely used approaches in continual learning is referred to as replay. Replay methods support interleaved learning by storing past experiences in a replay buffer. Although there are methods for selectively constructing the buffer and reprocessing its contents, ther… ▽ More

    Submitted 9 May, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  3. arXiv:2403.05175  [pdf, other

    cs.LG cs.AI cs.CV q-bio.NC stat.ML

    Continual Learning and Catastrophic Forgetting

    Authors: Gido M. van de Ven, Nicholas Soures, Dhireesha Kudithipudi

    Abstract: This book chapter delves into the dynamics of continual learning, which is the process of incrementally learning from a non-stationary stream of data. Although continual learning is a natural skill for the human brain, it is very challenging for artificial neural networks. An important reason is that, when learning something new, these networks tend to quickly and drastically forget what they had… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Preprint of a book chapter; 21 pages, 4 figures

  4. arXiv:2311.11908  [pdf, other

    cs.LG cs.AI cs.CV

    Continual Learning: Applications and the Road Forward

    Authors: Eli Verwimp, Rahaf Aljundi, Shai Ben-David, Matthias Bethge, Andrea Cossu, Alexander Gepperth, Tyler L. Hayes, Eyke Hüllermeier, Christopher Kanan, Dhireesha Kudithipudi, Christoph H. Lampert, Martin Mundt, Razvan Pascanu, Adrian Popescu, Andreas S. Tolias, Joost van de Weijer, Bing Liu, Vincenzo Lomonaco, Tinne Tuytelaars, Gido M. van de Ven

    Abstract: Continual learning is a subfield of machine learning, which aims to allow machine learning models to continuously learn on new data, by accumulating knowledge without forgetting what was learned in the past. In this work, we take a step back, and ask: "Why should one care about continual learning in the first place?". We set the stage by examining recent continual learning papers published at four… ▽ More

    Submitted 28 March, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Journal ref: Transactions on Machine Learning Research (TMLR), 2024

  5. arXiv:2310.04467  [pdf, other

    cs.LG cs.AI eess.SY

    Design Principles for Lifelong Learning AI Accelerators

    Authors: Dhireesha Kudithipudi, Anurag Daram, Abdullah M. Zyarah, Fatima Tuz Zohora, James B. Aimone, Angel Yanguas-Gil, Nicholas Soures, Emre Neftci, Matthew Mattina, Vincenzo Lomonaco, Clare D. Thiem, Benjamin Epstein

    Abstract: Lifelong learning - an agent's ability to learn throughout its lifetime - is a hallmark of biological learning systems and a central challenge for artificial intelligence (AI). The development of lifelong learning algorithms could lead to a range of novel AI applications, but this will also require the development of appropriate hardware accelerators, particularly if the models are to be deployed… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

  6. arXiv:2304.04640  [pdf, other

    cs.AI

    NeuroBench: A Framework for Benchmarking Neuromorphic Computing Algorithms and Systems

    Authors: Jason Yik, Korneel Van den Berghe, Douwe den Blanken, Younes Bouhadjar, Maxime Fabre, Paul Hueber, Denis Kleyko, Noah Pacik-Nelson, Pao-Sheng Vincent Sun, Guangzhi Tang, Shenqi Wang, Biyan Zhou, Soikat Hasan Ahmed, George Vathakkattil Joseph, Benedetto Leto, Aurora Micheli, Anurag Kumar Mishra, Gregor Lenz, Tao Sun, Zergham Ahmed, Mahmoud Akl, Brian Anderson, Andreas G. Andreou, Chiara Bartolozzi, Arindam Basu , et al. (73 additional authors not shown)

    Abstract: Neuromorphic computing shows promise for advancing computing efficiency and capabilities of AI applications using brain-inspired principles. However, the neuromorphic research field currently lacks standardized benchmarks, making it difficult to accurately measure technological advancements, compare performance with conventional methods, and identify promising future research directions. Prior neu… ▽ More

    Submitted 17 January, 2024; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: Updated from whitepaper to full perspective article preprint

  7. A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems

    Authors: Megan M. Baker, Alexander New, Mario Aguilar-Simon, Ziad Al-Halah, Sébastien M. R. Arnold, Ese Ben-Iwhiwhu, Andrew P. Brna, Ethan Brooks, Ryan C. Brown, Zachary Daniels, Anurag Daram, Fabien Delattre, Ryan Dellana, Eric Eaton, Haotian Fu, Kristen Grauman, Jesse Hostetler, Shariq Iqbal, Cassandra Kent, Nicholas Ketz, Soheil Kolouri, George Konidaris, Dhireesha Kudithipudi, Erik Learned-Miller, Seungwon Lee , et al. (22 additional authors not shown)

    Abstract: Despite the advancement of machine learning techniques in recent years, state-of-the-art systems lack robustness to "real world" events, where the input distributions and tasks encountered by the deployed systems will not be limited to the original training context, and systems will instead need to adapt to novel distributions and tasks while deployed. This critical gap may be addressed through th… ▽ More

    Submitted 18 January, 2023; originally announced January 2023.

    Comments: To appear in Neural Networks

  8. arXiv:2104.02233  [pdf, other

    cs.LG cs.AI

    TENT: Efficient Quantization of Neural Networks on the tiny Edge with Tapered FixEd PoiNT

    Authors: Hamed F. Langroudi, Vedant Karia, Tej Pandit, Dhireesha Kudithipudi

    Abstract: In this research, we propose a new low-precision framework, TENT, to leverage the benefits of a tapered fixed-point numerical format in TinyML models. We introduce a tapered fixed-point quantization algorithm that matches the numerical format's dynamic range and distribution to that of the deep neural network model's parameter distribution at each layer. An accelerator architecture for the tapered… ▽ More

    Submitted 5 April, 2021; originally announced April 2021.

    Comments: poster presented at the first tinyML Research Symposium, March 26, 2021

  9. End-to-End Memristive HTM System for Pattern Recognition and Sequence Prediction

    Authors: Abdullah M. Zyarah, Kevin Gomez, Dhireesha Kudithipudi

    Abstract: Neuromorphic systems that learn and predict from streaming inputs hold significant promise in pervasive edge computing and its applications. In this paper, a neuromorphic system that processes spatio-temporal information on the edge is proposed. Algorithmically, the system is based on hierarchical temporal memory that inherently offers online learning, resiliency, and fault tolerance. Architectura… ▽ More

    Submitted 21 June, 2020; originally announced June 2020.

  10. arXiv:2003.11638  [pdf, other

    cs.NE

    Metaplasticity in Multistate Memristor Synaptic Networks

    Authors: Fatima Tuz Zohora, Abdullah M. Zyarah, Nicholas Soures, Dhireesha Kudithipudi

    Abstract: Recent studies have shown that metaplastic synapses can retain information longer than simple binary synapses and are beneficial for continual learning. In this paper, we explore the multistate metaplastic synapse characteristics in the context of high retention and reception of information. Inherent behavior of a memristor emulating the multistate synapse is employed to capture the metaplastic be… ▽ More

    Submitted 25 February, 2020; originally announced March 2020.

  11. arXiv:1908.08380  [pdf, other

    eess.SP cs.LG cs.NE stat.ML

    Analysis of Wide and Deep Echo State Networks for Multiscale Spatiotemporal Time Series Forecasting

    Authors: Zachariah Carmichael, Humza Syed, Dhireesha Kudithipudi

    Abstract: Echo state networks are computationally lightweight reservoir models inspired by the random projections observed in cortical circuitry. As interest in reservoir computing has grown, networks have become deeper and more intricate. While these networks are increasingly applied to nontrivial forecasting tasks, there is a need for comprehensive performance analysis of deep reservoirs. In this work, we… ▽ More

    Submitted 1 July, 2019; originally announced August 2019.

    Comments: 10 pages, 10 figures, Proceedings of the Neuro-inspired Computational Elements Workshop (NICE '19), March 26-28, 2019, Albany, NY, USA

  12. arXiv:1908.02386  [pdf, other

    cs.LG cs.NE stat.ML

    Cheetah: Mixed Low-Precision Hardware & Software Co-Design Framework for DNNs on the Edge

    Authors: Hamed F. Langroudi, Zachariah Carmichael, David Pastuch, Dhireesha Kudithipudi

    Abstract: Low-precision DNNs have been extensively explored in order to reduce the size of DNN models for edge devices. Recently, the posit numerical format has shown promise for DNN data representation and compute with ultra-low precision in [5..8]-bits. However, previous studies were limited to studying posit for DNN inference only. In this paper, we propose the Cheetah framework, which supports both DNN… ▽ More

    Submitted 6 August, 2019; originally announced August 2019.

  13. arXiv:1907.13216  [pdf, other

    cs.LG stat.ML

    Deep Learning Training on the Edge with Low-Precision Posits

    Authors: Hamed F. Langroudi, Zachariah Carmichael, Dhireesha Kudithipudi

    Abstract: Recently, the posit numerical format has shown promise for DNN data representation and compute with ultra-low precision ([5..8]-bit). However, majority of studies focus only on DNN inference. In this work, we propose DNN training using posits and compare with the floating point training. We evaluate on both MNIST and Fashion MNIST corpuses, where 16-bit posits outperform 16-bit floating point for… ▽ More

    Submitted 30 July, 2019; originally announced July 2019.

  14. Performance-Efficiency Trade-off of Low-Precision Numerical Formats in Deep Neural Networks

    Authors: Zachariah Carmichael, Hamed F. Langroudi, Char Khazanov, Jeffrey Lillie, John L. Gustafson, Dhireesha Kudithipudi

    Abstract: Deep neural networks (DNNs) have been demonstrated as effective prognostic models across various domains, e.g. natural language processing, computer vision, and genomics. However, modern-day DNNs demand high compute and memory storage for executing any reasonably complex task. To optimize the inference time and alleviate the power consumption of these networks, DNN accelerators with low-precision… ▽ More

    Submitted 25 March, 2019; originally announced March 2019.

    Comments: 9 pages, Proceedings of the ACM Conference for Next Generation Arithmetic (CoNGA) 2019

  15. arXiv:1812.10730  [pdf, other

    cs.ET cs.LG stat.ML

    Neuromemrisitive Architecture of HTM with On-Device Learning and Neurogenesis

    Authors: Abdullah M. Zyarah, Dhireesha Kudithipudi

    Abstract: Hierarchical temporal memory (HTM) is a biomimetic sequence memory algorithm that holds promise for invariant representations of spatial and spatiotemporal inputs. This paper presents a comprehensive neuromemristive crossbar architecture for the spatial pooler (SP) and the sparse distributed representation classifier, which are fundamental to the algorithm. There are several unique features in the… ▽ More

    Submitted 27 December, 2018; originally announced December 2018.

  16. arXiv:1812.01762  [pdf, other

    cs.DC cs.LG cs.NE

    Deep Positron: A Deep Neural Network Using the Posit Number System

    Authors: Zachariah Carmichael, Hamed F. Langroudi, Char Khazanov, Jeffrey Lillie, John L. Gustafson, Dhireesha Kudithipudi

    Abstract: The recent surge of interest in Deep Neural Networks (DNNs) has led to increasingly complex networks that tax computational and memory resources. Many DNNs presently use 16-bit or 32-bit floating point operations. Significant performance and power gains can be obtained when DNN accelerators support low-precision numerical formats. Despite considerable research, there is still a knowledge gap on ho… ▽ More

    Submitted 18 January, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: 6 pages, Design, Automation and Test in Europe 2019

  17. Semi-Trained Memristive Crossbar Computing Engine with In-Situ Learning Accelerator

    Authors: Abdullah M. Zyarah, Dhireesha Kudithipudi

    Abstract: On-device intelligence is gaining significant attention recently as it offers local data processing and low power consumption. In this research, an on-device training circuitry for threshold-current memristors integrated in a crossbar structure is proposed. Furthermore, alternate approaches of map** the synaptic weights into fully-trained and semi-trained crossbars are investigated. In a semi-tr… ▽ More

    Submitted 22 August, 2018; originally announced August 2018.

    Journal ref: ACM Journal on Emerging Technologies in Computing Systems (JETC), 14, 2018

  18. Neuromorphic Architecture for the Hierarchical Temporal Memory

    Authors: Abdullah M. Zyarah, Dhireesha Kudithipudi

    Abstract: A biomimetic machine intelligence algorithm, that holds promise in creating invariant representations of spatiotemporal input streams is the hierarchical temporal memory (HTM). This unsupervised online algorithm has been demonstrated on several machine-learning tasks, including anomaly detection. Significant effort has been made in formalizing and applying the HTM algorithm to different classes of… ▽ More

    Submitted 17 August, 2018; originally announced August 2018.

  19. arXiv:1808.00523  [pdf, other

    cs.LG stat.ML

    Mod-DeepESN: Modular Deep Echo State Network

    Authors: Zachariah Carmichael, Humza Syed, Stuart Burtner, Dhireesha Kudithipudi

    Abstract: Neuro-inspired recurrent neural network algorithms, such as echo state networks, are computationally lightweight and thereby map well onto untethered devices. The baseline echo state network algorithms are shown to be efficient in solving small-scale spatio-temporal problems. However, they underperform for complex tasks that are characterized by multi-scale structures. In this research, an intrins… ▽ More

    Submitted 25 March, 2019; v1 submitted 1 August, 2018; originally announced August 2018.

    Comments: 4 pages, Cognitive Computational Neuroscience (CCN) 2018 Conference

  20. arXiv:1805.08624  [pdf, other

    cs.CV

    Deep Learning Inference on Embedded Devices: Fixed-Point vs Posit

    Authors: Seyed H. F. Langroudi, Tej Pandit, Dhireesha Kudithipudi

    Abstract: Performing the inference step of deep learning in resource constrained environments, such as embedded devices, is challenging. Success requires optimization at both software and hardware levels. Low precision arithmetic and specifically low precision fixed-point number systems have become the standard for performing deep learning inference. However, representing non-uniform data and distributed pa… ▽ More

    Submitted 22 May, 2018; originally announced May 2018.

  21. arXiv:1802.07369  [pdf, other

    stat.ML cs.LG

    On the Statistical Challenges of Echo State Networks and Some Potential Remedies

    Authors: Qiuyi Wu, Ernest Fokoue, Dhireesha Kudithipudi

    Abstract: Echo state networks are powerful recurrent neural networks. However, they are often unstable and shaky, making the process of finding an good ESN for a specific dataset quite hard. Obtaining a superb accuracy by using the Echo State Network is a challenging task. We create, develop and implement a family of predictably optimal robust and stable ensemble of Echo State Networks via regularizing the… ▽ More

    Submitted 20 February, 2018; originally announced February 2018.

    Comments: 18 pages, 10 figures

    MSC Class: 62F40; 62G90; 62M10; 62M20; 62M45; 62M15; 68T05; 68T10

  22. arXiv:1711.01201  [pdf, ps, other

    cs.CV cs.NE eess.IV

    Convolutional Drift Networks for Video Classification

    Authors: Dillon Graham, Seyed Hamed Fatemi Langroudi, Christopher Kanan, Dhireesha Kudithipudi

    Abstract: Analyzing spatio-temporal data like video is a challenging task that requires processing visual and temporal information effectively. Convolutional Neural Networks have shown promise as baseline fixed feature extractors through transfer learning, a technique that helps minimize the training cost on visual information. Temporal information is often handled using hand-crafted features or Recurrent N… ▽ More

    Submitted 3 November, 2017; originally announced November 2017.

    Comments: Published in IEEE Rebooting Computing

  23. arXiv:1611.02792  [pdf, other

    cs.AR

    Non-volatile Hierarchical Temporal Memory: Hardware for Spatial Pooling

    Authors: Lennard Streat, Dhireesha Kudithipudi, Kevin Gomez

    Abstract: Hierarchical Temporal Memory (HTM) is a biomimetic machine learning algorithm imbibing the structural and algorithmic properties of the neocortex. Two main functional components of HTM that enable spatio-temporal processing are the spatial pooler and temporal memory. In this research, we explore a scalable hardware realization of the spatial pooler closely coupled with the mathematical formulation… ▽ More

    Submitted 8 November, 2016; originally announced November 2016.

  24. arXiv:1601.07482  [pdf, other

    cs.ET cs.LG stat.ML

    Unsupervised Learning in Neuromemristive Systems

    Authors: Cory Merkel, Dhireesha Kudithipudi

    Abstract: Neuromemristive systems (NMSs) currently represent the most promising platform to achieve energy efficient neuro-inspired computation. However, since the research field is less than a decade old, there are still countless algorithms and design paradigms to be explored within these systems. One particular domain that remains to be fully investigated within NMSs is unsupervised learning. In this wor… ▽ More

    Submitted 27 January, 2016; originally announced January 2016.

    Comments: To appear in the proceedings of the National Aerospace & Electronics Conference & Ohio Innovation Summit (NAECON-OIS'15)

  25. arXiv:1601.06116  [pdf, other

    stat.ML cs.LG q-bio.NC

    A Mathematical Formalization of Hierarchical Temporal Memory's Spatial Pooler

    Authors: James Mnatzaganian, Ernest Fokoué, Dhireesha Kudithipudi

    Abstract: Hierarchical temporal memory (HTM) is an emerging machine learning algorithm, with the potential to provide a means to perform predictions on spatiotemporal data. The algorithm, inspired by the neocortex, currently does not have a comprehensive mathematical framework. This work brings together all aspects of the spatial pooler (SP), a critical learning component in HTM, under a single unifying fra… ▽ More

    Submitted 8 September, 2016; v1 submitted 22 January, 2016; originally announced January 2016.

    Comments: This work was submitted for publication and is currently under review. For associated code, see https://github.com/tehtechguy/mHTM