Skip to main content

Showing 1–24 of 24 results for author: Siegelmann, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.10163  [pdf, other

    cs.NE

    Hidden Traveling Waves bind Working Memory Variables in Recurrent Neural Networks

    Authors: Arjun Karuvally, Terrence J. Sejnowski, Hava T. Siegelmann

    Abstract: Traveling waves are a fundamental phenomenon in the brain, playing a crucial role in short-term information storage. In this study, we leverage the concept of traveling wave dynamics within a neural lattice to formulate a theoretical model of neural working memory, study its properties, and its real world implications in AI. The proposed model diverges from traditional approaches, which assume inf… ▽ More

    Submitted 7 April, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  2. arXiv:2310.02430  [pdf, other

    cs.NE cs.AI cs.LG

    Episodic Memory Theory for the Mechanistic Interpretation of Recurrent Neural Networks

    Authors: Arjun Karuvally, Peter Delmastro, Hava T. Siegelmann

    Abstract: Understanding the intricate operations of Recurrent Neural Networks (RNNs) mechanistically is pivotal for advancing their capabilities and applications. In this pursuit, we propose the Episodic Memory Theory (EMT), illustrating that RNNs can be conceptualized as discrete-time analogs of the recently proposed General Sequential Episodic Memory Model. To substantiate EMT, we introduce a novel set of… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  3. arXiv:2306.07125  [pdf, other

    cs.LG cs.NE

    On the Dynamics of Learning Time-Aware Behavior with Recurrent Neural Networks

    Authors: Peter DelMastro, Rushiv Arora, Edward Rietman, Hava T. Siegelmann

    Abstract: Recurrent Neural Networks (RNNs) have shown great success in modeling time-dependent patterns, but there is limited research on their learned representations of latent temporal features and the emergence of these representations during training. To address this gap, we use timed automata (TA) to introduce a family of supervised learning tasks modeling behavior dependent on hidden temporal variable… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: Main paper: 11 pages, 8 figures. Supplemental Material: 6 pages, 5 figures, 1 table

  4. arXiv:2305.18701  [pdf, other

    cs.AI eess.SY

    Temporally Layered Architecture for Efficient Continuous Control

    Authors: Devdhar Patel, Terrence Sejnowski, Hava Siegelmann

    Abstract: We present a temporally layered architecture (TLA) for temporally adaptive control with minimal energy expenditure. The TLA layers a fast and a slow policy together to achieve temporal abstraction that allows each layer to focus on a different time scale. Our design draws on the energy-saving mechanism of the human brain, which executes actions at different timescales depending on the environment'… ▽ More

    Submitted 8 August, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: 10 Pages, 2 Figures, 3 Tables. arXiv admin note: text overlap with arXiv:2301.00723

  5. Neuromorphic High-Frequency 3D Dancing Pose Estimation in Dynamic Environment

    Authors: Zhongyang Zhang, Kaidong Chai, Haowen Yu, Ramzi Majaj, Francesca Walsh, Edward Wang, Upal Mahbub, Hava Siegelmann, Donghyun Kim, Tauhidur Rahman

    Abstract: As a beloved sport worldwide, dancing is getting integrated into traditional and virtual reality-based gaming platforms nowadays. It opens up new opportunities in the technology-mediated dancing space. These platforms primarily rely on passive and continuous human pose estimation as an input capture mechanism. Existing solutions are mainly based on RGB or RGB-Depth cameras for dance games. The for… ▽ More

    Submitted 27 January, 2023; v1 submitted 16 January, 2023; originally announced January 2023.

    Report number: ISSN 0925-2312

    Journal ref: Neurocomputing, Volume 547, 2023, 126388

  6. arXiv:2301.04126  [pdf, other

    cs.NE cs.AI cs.LG

    Temporal Weights

    Authors: Adam Kohan, Ed Rietman, Hava Siegelmann

    Abstract: In artificial neural networks, weights are a static representation of synapses. However, synapses are not static, they have their own interacting dynamics over time. To instill weights with interacting dynamics, we use a model describing synchronization that is capable of capturing core mechanisms of a range of neural and general biological phenomena over time. An ideal fit for these Temporal Weig… ▽ More

    Submitted 13 December, 2022; originally announced January 2023.

  7. arXiv:2301.00723  [pdf, other

    cs.NE cs.AI cs.LG eess.SY

    Temporally Layered Architecture for Adaptive, Distributed and Continuous Control

    Authors: Devdhar Patel, Joshua Russell, Francesca Walsh, Tauhidur Rahman, Terrence Sejnowski, Hava Siegelmann

    Abstract: We present temporally layered architecture (TLA), a biologically inspired system for temporally adaptive distributed control. TLA layers a fast and a slow controller together to achieve temporal abstraction that allows each layer to focus on a different time-scale. Our design is biologically inspired and draws on the architecture of the human brain which executes actions at different timescales de… ▽ More

    Submitted 5 February, 2023; v1 submitted 25 December, 2022; originally announced January 2023.

    Comments: 10 pages, 4 figures

  8. arXiv:2212.12866  [pdf, other

    cs.LG cs.AI cs.NE

    QuickNets: Saving Training and Preventing Overconfidence in Early-Exit Neural Architectures

    Authors: Devdhar Patel, Hava Siegelmann

    Abstract: Deep neural networks have long training and processing times. Early exits added to neural networks allow the network to make early predictions using intermediate activations in the network in time-sensitive applications. However, early exits increase the training time of the neural networks. We introduce QuickNets: a novel cascaded training algorithm for faster training of neural networks. QuickNe… ▽ More

    Submitted 25 December, 2022; originally announced December 2022.

    Comments: 9 pages, 4 figures

  9. arXiv:2212.05563  [pdf, other

    cs.NE

    Energy-based General Sequential Episodic Memory Networks at the Adiabatic Limit

    Authors: Arjun Karuvally, Terry J. Sejnowski, Hava T. Siegelmann

    Abstract: The General Associative Memory Model (GAMM) has a constant state-dependant energy surface that leads the output dynamics to fixed points, retrieving single memories from a collection of memories that can be asynchronously preloaded. We introduce a new class of General Sequential Episodic Memory Models (GSEMM) that, in the adiabatic limit, exhibit temporally changing energy surface, leading to a se… ▽ More

    Submitted 11 December, 2022; originally announced December 2022.

  10. arXiv:2204.01723  [pdf, other

    cs.LG cs.NE q-bio.NC

    Signal Propagation: A Framework for Learning and Inference In a Forward Pass

    Authors: Adam Kohan, Edward A. Rietman, Hava T. Siegelmann

    Abstract: We propose a new learning framework, signal propagation (sigprop), for propagating a learning signal and updating neural network parameters via a forward pass, as an alternative to backpropagation. In sigprop, there is only the forward path for inference and learning. So, there are no structural or computational constraints necessary for learning to take place, beyond the inference model itself, s… ▽ More

    Submitted 17 November, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

  11. arXiv:2202.07132  [pdf

    cs.NE cs.AI cs.LG q-bio.NC stat.CO

    Memory via Temporal Delays in weightless Spiking Neural Network

    Authors: Hananel Hazan, Simon Caby, Christopher Earl, Hava Siegelmann, Michael Levin

    Abstract: A common view in the neuroscience community is that memory is encoded in the connection strength between neurons. This perception led artificial neural network models to focus on connection weights as the key variables to modulate learning. In this paper, we present a prototype for weightless spiking neural networks that can perform a simple classification task. The memory in this network is store… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

  12. arXiv:2104.04132  [pdf, other

    q-bio.NC cs.AI cs.LG

    Replay in Deep Learning: Current Approaches and Missing Biological Elements

    Authors: Tyler L. Hayes, Giri P. Krishnan, Maxim Bazhenov, Hava T. Siegelmann, Terrence J. Sejnowski, Christopher Kanan

    Abstract: Replay is the reactivation of one or more neural patterns, which are similar to the activation patterns experienced during past waking experiences. Replay was first observed in biological neural networks during sleep, and it is now thought to play a critical role in memory formation, retrieval, and consolidation. Replay-like mechanisms have been incorporated into deep artificial neural networks th… ▽ More

    Submitted 28 May, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: Accepted for publication in the MIT Press journal of Neural Computation

  13. arXiv:2005.02434  [pdf

    cs.CY cs.ET

    Nanotechnology-inspired Information Processing Systems of the Future

    Authors: Randy Bryant, Mark Hill, Tom Kazior, Daniel Lee, Jie Liu, Klara Nahrstedt, Vijay Narayanan, Jan Rabaey, Hava Siegelmann, Naresh Shanbhag, Naveen Verma, H. -S. Philip Wong

    Abstract: Nanoscale semiconductor technology has been a key enabler of the computing revolution. It has done so via advances in new materials and manufacturing processes that resulted in the size of the basic building block of computing systems - the logic switch and memory devices - being reduced into the nanoscale regime. Nanotechnology has provided increased computing functionality per unit volume, energ… ▽ More

    Submitted 5 May, 2020; originally announced May 2020.

    Comments: A Computing Community Consortium (CCC) workshop report, 18 pages

    Report number: ccc2016report_3

  14. arXiv:1909.02549  [pdf, other

    cs.NE

    Minibatch Processing in Spiking Neural Networks

    Authors: Daniel J. Saunders, Cooper Sigrist, Kenneth Chaney, Robert Kozma, Hava T. Siegelmann

    Abstract: Spiking neural networks (SNNs) are a promising candidate for biologically-inspired and energy efficient computation. However, their simulation is notoriously time consuming, and may be seen as a bottleneck in develo** competitive training methods with potential deployment on neuromorphic hardware platforms. To address this issue, we provide an implementation of mini-batch processing applied to c… ▽ More

    Submitted 5 September, 2019; originally announced September 2019.

  15. arXiv:1906.11826  [pdf, other

    cs.NE cs.AI cs.LG stat.ML

    Lattice Map Spiking Neural Networks (LM-SNNs) for Clustering and Classifying Image Data

    Authors: Hananel Hazan, Daniel J. Saunders, Darpan T. Sanghavi, Hava Siegelmann, Robert Kozma

    Abstract: Spiking neural networks (SNNs) with a lattice architecture are introduced in this work, combining several desirable properties of SNNs and self-organized maps (SOMs). Networks are trained with biologically motivated, unsupervised learning rules to obtain a self-organized grid of filters via cooperative and competitive excitatory-inhibitory interactions. Several inhibition strategies are developed… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: Original Manuscript Submitted: October 30, 2018. Revised: May 28, 2019. Special Issue: "Cognition and Neurocomputation" of Annals of Mathematics and Artificial Intelligence. arXiv admin note: text overlap with arXiv:1807.09374

  16. arXiv:1905.11515  [pdf, other

    cs.LG q-bio.NC stat.ML

    Abstraction Mechanisms Predict Generalization in Deep Neural Networks

    Authors: Alex Gain, Hava Siegelmann

    Abstract: A longstanding problem for Deep Neural Networks (DNNs) is understanding their puzzling ability to generalize well. We approach this problem through the unconventional angle of \textit{cognitive abstraction mechanisms}, drawing inspiration from recent neuroscience work, allowing us to define the Cognitive Neural Activation metric (CNA) for DNNs, which is the correlation between information complexi… ▽ More

    Submitted 16 April, 2020; v1 submitted 27 May, 2019; originally announced May 2019.

  17. arXiv:1904.06269  [pdf, other

    cs.NE cs.LG

    Locally Connected Spiking Neural Networks for Unsupervised Feature Learning

    Authors: Daniel J. Saunders, Devdhar Patel, Hananel Hazan, Hava T. Siegelmann, Robert Kozma

    Abstract: In recent years, Spiking Neural Networks (SNNs) have demonstrated great successes in completing various Machine Learning tasks. We introduce a method for learning image features by \textit{locally connected layers} in SNNs using spike-timing-dependent plasticity (STDP) rule. In our approach, sub-networks compete via competitive inhibitory interactions to learn features from different locations of… ▽ More

    Submitted 12 April, 2019; originally announced April 2019.

    Comments: 22 pages, 7 figures, and 4 tables

  18. arXiv:1903.11012  [pdf, other

    cs.LG cs.NE stat.ML

    Improved robustness of reinforcement learning policies upon conversion to spiking neuronal network platforms applied to ATARI games

    Authors: Devdhar Patel, Hananel Hazan, Daniel J. Saunders, Hava Siegelmann, Robert Kozma

    Abstract: Deep Reinforcement Learning (RL) demonstrates excellent performance on tasks that can be solved by trained policy. It plays a dominant role among cutting-edge machine learning approaches using multi-layer Neural networks (NNs). At the same time, Deep RL suffers from high sensitivity to noisy, incomplete, and misleading input data. Following biological intuition, we involve Spiking Neural Networks… ▽ More

    Submitted 19 August, 2019; v1 submitted 26 March, 2019; originally announced March 2019.

  19. arXiv:1808.08173  [pdf, other

    cs.NE

    STDP Learning of Image Patches with Convolutional Spiking Neural Networks

    Authors: Daniel J. Saunders, Hava T. Siegelmann, Robert Kozma, Miklós Ruszinkó

    Abstract: Spiking neural networks are motivated from principles of neural systems and may possess unexplored advantages in the context of machine learning. A class of \textit{convolutional spiking neural networks} is introduced, trained to detect image features with an unsupervised, competitive learning mechanism. Image features can be shared within subpopulations of neurons, or each may evolve independentl… ▽ More

    Submitted 24 August, 2018; originally announced August 2018.

    Comments: 7 pages, 9 figures, and 5 tables

  20. arXiv:1808.03357  [pdf, other

    cs.NE q-bio.NC

    Error Forward-Propagation: Reusing Feedforward Connections to Propagate Errors in Deep Learning

    Authors: Adam A. Kohan, Edward A. Rietman, Hava T. Siegelmann

    Abstract: We introduce Error Forward-Propagation, a biologically plausible mechanism to propagate error feedback forward through the network. Architectural constraints on connectivity are virtually eliminated for error feedback in the brain; systematic backward connectivity is not used or needed to deliver error feedback. Feedback as a means of assigning credit to neurons earlier in the forward pathway for… ▽ More

    Submitted 9 August, 2018; originally announced August 2018.

  21. Unsupervised Learning with Self-Organizing Spiking Neural Networks

    Authors: Hananel Hazan, Daniel J. Saunders, Darpan T. Sanghavi, Hava T. Siegelmann, Robert Kozma

    Abstract: We present a system comprising a hybridization of self-organized map (SOM) properties with spiking neural networks (SNNs) that retain many of the features of SOMs. Networks are trained in an unsupervised manner to learn a self-organized lattice of filters via excitatory-inhibitory interactions among populations of neurons. We develop and test various inhibition strategies, such as growing with int… ▽ More

    Submitted 24 July, 2018; originally announced July 2018.

    Journal ref: Proceeding WCCI 2018

  22. BindsNET: A machine learning-oriented spiking neural networks library in Python

    Authors: Hananel Hazan, Daniel J. Saunders, Hassaan Khan, Darpan T. Sanghavi, Hava T. Siegelmann, Robert Kozma

    Abstract: The development of spiking neural network simulation software is a critical component enabling the modeling of neural systems and the development of biologically inspired algorithms. Existing software frameworks support a wide range of neural functionality, software abstraction levels, and hardware devices, yet are typically not suitable for rapid prototy** or application to problems in the doma… ▽ More

    Submitted 10 December, 2018; v1 submitted 4 June, 2018; originally announced June 2018.

    Journal ref: Frontiers in Neuroinformatics. 12 December 2018

  23. arXiv:cs/0304042  [pdf, ps, other

    cs.OH

    On probabilistic analog automata

    Authors: A. Ben-Hur, A. Roitershtein, H. Siegelmann

    Abstract: We consider probabilistic automata on a general state space and study their computational power. The model is based on the concept of language recognition by probabilistic automata due to Rabin and models of analog computation in a noisy environment suggested by Maass and Orponen, and Maass and Sontag. Our main result is a generalization of Rabin's reduction theorem that implies that under very… ▽ More

    Submitted 30 April, 2003; v1 submitted 28 April, 2003; originally announced April 2003.

    ACM Class: F.1.1; F.1.2

  24. arXiv:cs/0110056  [pdf, ps, other

    cs.CC cond-mat.stat-mech math-ph math.OC

    Probabilistic analysis of a differential equation for linear programming

    Authors: Asa Ben-Hur, Joshua Feinberg, Shmuel Fishman, Hava T. Siegelmann

    Abstract: In this paper we address the complexity of solving linear programming problems with a set of differential equations that converge to a fixed point that represents the optimal solution. Assuming a probabilistic model, where the inputs are i.i.d. Gaussian variables, we compute the distribution of the convergence rate to the attracting fixed point. Using the framework of Random Matrix Theory, we de… ▽ More

    Submitted 7 April, 2003; v1 submitted 29 October, 2001; originally announced October 2001.

    Comments: 1+37 pages, latex, 5 eps figures. Version accepted for publication in the Journal of Complexity. Changes made: Presentation reorganized for clarity, expanded discussion of measure of complexity in the non-asymptotic regime (added a new section)

    ACM Class: F.1.3, F.2