Skip to main content

Showing 1–19 of 19 results for author: Sejnowski, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.10163  [pdf, other

    cs.NE

    Hidden Traveling Waves bind Working Memory Variables in Recurrent Neural Networks

    Authors: Arjun Karuvally, Terrence J. Sejnowski, Hava T. Siegelmann

    Abstract: Traveling waves are a fundamental phenomenon in the brain, playing a crucial role in short-term information storage. In this study, we leverage the concept of traveling wave dynamics within a neural lattice to formulate a theoretical model of neural working memory, study its properties, and its real world implications in AI. The proposed model diverges from traditional approaches, which assume inf… ▽ More

    Submitted 7 April, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  2. arXiv:2401.14267  [pdf

    cs.CL cs.AI

    Transformers and Cortical Waves: Encoders for Pulling In Context Across Time

    Authors: Lyle Muller, Patricia S. Churchland, Terrence J. Sejnowski

    Abstract: The capabilities of transformer networks such as ChatGPT and other Large Language Models (LLMs) have captured the world's attention. The crucial computational mechanism underlying their performance relies on transforming a complete input sequence - for example, all the words in a sentence - into a long "encoding vector" that allows transformers to learn long-range temporal dependencies in naturali… ▽ More

    Submitted 2 July, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: 25 pages, 5 figures

  3. arXiv:2309.08045  [pdf, other

    cs.NE cs.AI cs.LG

    Traveling Waves Encode the Recent Past and Enhance Sequence Learning

    Authors: T. Anderson Keller, Lyle Muller, Terrence Sejnowski, Max Welling

    Abstract: Traveling waves of neural activity have been observed throughout the brain at a diversity of regions and scales; however, their precise computational role is still debated. One physically inspired hypothesis suggests that the cortical sheet may act like a wave-propagating system capable of invertibly storing a short-term memory of sequential stimuli through induced waves traveling across the corti… ▽ More

    Submitted 14 March, 2024; v1 submitted 3 September, 2023; originally announced September 2023.

  4. arXiv:2305.18701  [pdf, other

    cs.AI eess.SY

    Temporally Layered Architecture for Efficient Continuous Control

    Authors: Devdhar Patel, Terrence Sejnowski, Hava Siegelmann

    Abstract: We present a temporally layered architecture (TLA) for temporally adaptive control with minimal energy expenditure. The TLA layers a fast and a slow policy together to achieve temporal abstraction that allows each layer to focus on a different time scale. Our design draws on the energy-saving mechanism of the human brain, which executes actions at different timescales depending on the environment'… ▽ More

    Submitted 8 August, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: 10 Pages, 2 Figures, 3 Tables. arXiv admin note: text overlap with arXiv:2301.00723

  5. arXiv:2301.00723  [pdf, other

    cs.NE cs.AI cs.LG eess.SY

    Temporally Layered Architecture for Adaptive, Distributed and Continuous Control

    Authors: Devdhar Patel, Joshua Russell, Francesca Walsh, Tauhidur Rahman, Terrence Sejnowski, Hava Siegelmann

    Abstract: We present temporally layered architecture (TLA), a biologically inspired system for temporally adaptive distributed control. TLA layers a fast and a slow controller together to achieve temporal abstraction that allows each layer to focus on a different time-scale. Our design is biologically inspired and draws on the architecture of the human brain which executes actions at different timescales de… ▽ More

    Submitted 5 February, 2023; v1 submitted 25 December, 2022; originally announced January 2023.

    Comments: 10 pages, 4 figures

  6. arXiv:2212.05563  [pdf, other

    cs.NE

    Energy-based General Sequential Episodic Memory Networks at the Adiabatic Limit

    Authors: Arjun Karuvally, Terry J. Sejnowski, Hava T. Siegelmann

    Abstract: The General Associative Memory Model (GAMM) has a constant state-dependant energy surface that leads the output dynamics to fixed points, retrieving single memories from a collection of memories that can be asynchronously preloaded. We introduce a new class of General Sequential Episodic Memory Models (GSEMM) that, in the adiabatic limit, exhibit temporally changing energy surface, leading to a se… ▽ More

    Submitted 11 December, 2022; originally announced December 2022.

  7. arXiv:2210.08340  [pdf

    cs.AI q-bio.NC

    Toward Next-Generation Artificial Intelligence: Catalyzing the NeuroAI Revolution

    Authors: Anthony Zador, Sean Escola, Blake Richards, Bence Ölveczky, Yoshua Bengio, Kwabena Boahen, Matthew Botvinick, Dmitri Chklovskii, Anne Churchland, Claudia Clopath, James DiCarlo, Surya Ganguli, Jeff Hawkins, Konrad Koerding, Alexei Koulakov, Yann LeCun, Timothy Lillicrap, Adam Marblestone, Bruno Olshausen, Alexandre Pouget, Cristina Savin, Terrence Sejnowski, Eero Simoncelli, Sara Solla, David Sussillo , et al. (2 additional authors not shown)

    Abstract: Neuroscience has long been an essential driver of progress in artificial intelligence (AI). We propose that to accelerate progress in AI, we must invest in fundamental research in NeuroAI. A core component of this is the embodied Turing test, which challenges AI animal models to interact with the sensorimotor world at skill levels akin to their living counterparts. The embodied Turing test shifts… ▽ More

    Submitted 22 February, 2023; v1 submitted 15 October, 2022; originally announced October 2022.

    Comments: White paper, 10 pages + 8 pages of references, 1 figures

  8. arXiv:2207.14382  [pdf

    cs.CL cs.AI cs.LG

    Large Language Models and the Reverse Turing Test

    Authors: Terrence Sejnowski

    Abstract: Large Language Models (LLMs) have been transformative. They are pre-trained foundational models that are self-supervised and can be adapted with fine tuning to a wide range of natural language tasks, each of which previously would have required a separate network model. This is one step closer to the extraordinary versatility of human language. GPT-3 and more recently LaMDA can carry on dialogs wi… ▽ More

    Submitted 15 November, 2022; v1 submitted 28 July, 2022; originally announced July 2022.

    Comments: Are LLMs stochastic parrots?

    ACM Class: I.2

    Journal ref: Neural Computation, 35, 309-342 (2023)

  9. arXiv:2109.05053  [pdf, other

    cs.LG physics.chem-ph physics.comp-ph q-bio.NC

    Physics-based machine learning for modeling stochastic IP3-dependent calcium dynamics

    Authors: Oliver K. Ernst, Tom Bartol, Terrence Sejnowski, Eric Mjolsness

    Abstract: We present a machine learning method for model reduction which incorporates domain-specific physics through candidate functions. Our method estimates an effective probability distribution and differential equation model from stochastic simulations of a reaction network. The close connection between reduced and fine scale descriptions allows approximations derived from the master equation to be int… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: 26 pages

    MSC Class: 68T07 ACM Class: I.2.6; I.2.1; I.2.0

  10. arXiv:2104.04132  [pdf, other

    q-bio.NC cs.AI cs.LG

    Replay in Deep Learning: Current Approaches and Missing Biological Elements

    Authors: Tyler L. Hayes, Giri P. Krishnan, Maxim Bazhenov, Hava T. Siegelmann, Terrence J. Sejnowski, Christopher Kanan

    Abstract: Replay is the reactivation of one or more neural patterns, which are similar to the activation patterns experienced during past waking experiences. Replay was first observed in biological neural networks during sleep, and it is now thought to play a critical role in memory formation, retrieval, and consolidation. Replay-like mechanisms have been incorporated into deep artificial neural networks th… ▽ More

    Submitted 28 May, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: Accepted for publication in the MIT Press journal of Neural Computation

  11. arXiv:2002.04806  [pdf

    q-bio.NC cs.AI cs.LG cs.NE

    The Unreasonable Effectiveness of Deep Learning in Artificial Intelligence

    Authors: Terrence J. Sejnowski

    Abstract: Deep learning networks have been trained to recognize speech, caption photographs and translate text between languages at high levels of performance. Although applications of deep learning networks to real world problems have become ubiquitous, our understanding of why they are so effective is lacking. These empirical results should not be possible according to sample complexity in statistics and… ▽ More

    Submitted 12 February, 2020; originally announced February 2020.

    Journal ref: Proceedings of the National Academy of Sciences U.S.A. (2020) https://www.pnas.org/content/early/2020/01/23/1907373117

  12. arXiv:1909.08601  [pdf, other

    math.OC cs.IT eess.SY q-bio.NC

    Diversity-enabled sweet spots in layered architectures and speed-accuracy trade-offs in sensorimotor control

    Authors: Yorie Nakahira, Quanying Liu, Terrence J. Sejnowski, John C. Doyle

    Abstract: Nervous systems sense, communicate, compute and actuate movement using distributed components with severe trade-offs in speed, accuracy, sparsity, noise and saturation. Nevertheless, brains achieve remarkably fast, accurate, and robust control performance due to a highly effective layered control architecture. Here we introduce a driving task to study how a mountain biker mitigates the immediate d… ▽ More

    Submitted 2 May, 2021; v1 submitted 18 September, 2019; originally announced September 2019.

    Comments: 12 pages, 8 figures

  13. arXiv:1905.12122  [pdf, other

    cs.LG stat.ML

    Deep Learning Moment Closure Approximations using Dynamic Boltzmann Distributions

    Authors: Oliver K. Ernst, Tom Bartol, Terrence Sejnowski, Eric Mjolsness

    Abstract: The moments of spatial probabilistic systems are often given by an infinite hierarchy of coupled differential equations. Moment closure methods are used to approximate a subset of low order moments by terminating the hierarchy at some order and replacing higher order terms with functions of lower order ones. For a given system, it is not known beforehand which closure approximation is optimal, i.e… ▽ More

    Submitted 28 May, 2019; originally announced May 2019.

  14. arXiv:1905.07039  [pdf, other

    cs.LG cs.HC eess.SP stat.ML

    Utilizing Deep Learning Towards Multi-modal Bio-sensing and Vision-based Affective Computing

    Authors: Siddharth Siddharth, Tzyy-** Jung, Terrence J. Sejnowski

    Abstract: In recent years, the use of bio-sensing signals such as electroencephalogram (EEG), electrocardiogram (ECG), etc. have garnered interest towards applications in affective computing. The parallel trend of deep-learning has led to a huge leap in performance towards solving various vision-based research problems such as object detection. Yet, these advances in deep-learning have not adequately transl… ▽ More

    Submitted 16 May, 2019; originally announced May 2019.

    Comments: Accepted for publication in IEEE Transactions on Affective Computing. This version on the arXiv is the updated version of the same manuscript

  15. arXiv:1804.09452  [pdf, other

    cs.HC

    Multi-modal Approach for Affective Computing

    Authors: Siddharth Siddharth, Tzyy-** Jung, Terrence J. Sejnowski

    Abstract: Throughout the past decade, many studies have classified human emotions using only a single sensing modality such as face video, electroencephalogram (EEG), electrocardiogram (ECG), galvanic skin response (GSR), etc. The results of these studies are constrained by the limitations of these modalities such as the absence of physiological biomarkers in the face-video analysis, poor spatial resolution… ▽ More

    Submitted 20 June, 2018; v1 submitted 25 April, 2018; originally announced April 2018.

    Comments: Published in IEEE 40th International Engineering in Medicine and Biology Conference (EMBC) 2018

  16. arXiv:1802.07852  [pdf

    cs.HC

    An Affordable Bio-Sensing and Activity Tagging Platform for HCI Research

    Authors: Siddharth, Aashish Patel, Tzyy-** Jung, Terrence J. Sejnowski

    Abstract: We present a novel multi-modal bio-sensing platform capable of integrating multiple data streams for use in real-time applications. The system is composed of a central compute module and a companion headset. The compute node collects, time-stamps and transmits the data while also providing an interface for a wide range of sensors including electroencephalogram, photoplethysmogram, electrocardiogra… ▽ More

    Submitted 21 February, 2018; originally announced February 2018.

  17. arXiv:1706.04698  [pdf, other

    q-bio.NC cs.LG cs.NE stat.ML

    Gradient Descent for Spiking Neural Networks

    Authors: Dongsung Huh, Terrence J. Sejnowski

    Abstract: Much of studies on neural computation are based on network models of static neurons that produce analog output, despite the fact that information processing in the brain is predominantly carried out by dynamic neurons that produce discrete pulses called spikes. Research in spike-based computation has been impeded by the lack of efficient supervised learning algorithm for spiking networks. Here, we… ▽ More

    Submitted 19 June, 2017; v1 submitted 14 June, 2017; originally announced June 2017.

  18. arXiv:1510.07740  [pdf, other

    stat.ML cond-mat.stat-mech cs.CV cs.LG

    The Wilson Machine for Image Modeling

    Authors: Saeed Saremi, Terrence J. Sejnowski

    Abstract: Learning the distribution of natural images is one of the hardest and most important problems in machine learning. The problem remains open, because the enormous complexity of the structures in natural images spans all length scales. We break down the complexity of the problem and show that the hierarchy of structures in natural images fuels a new class of learning algorithms based on the theory o… ▽ More

    Submitted 11 November, 2015; v1 submitted 26 October, 2015; originally announced October 2015.

  19. arXiv:q-bio/0310011  [pdf, ps, other

    q-bio.QM cs.CE physics.data-an q-bio.NC

    Complex Independent Component Analysis of Frequency-Domain Electroencephalographic Data

    Authors: Jorn Anemuller, Terrence J. Sejnowski, Scott Makeig

    Abstract: Independent component analysis (ICA) has proven useful for modeling brain and electroencephalographic (EEG) data. Here, we present a new, generalized method to better capture the dynamics of brain signals than previous ICA algorithms. We regard EEG sources as eliciting spatio-temporal activity patterns, corresponding to, e.g., trajectories of activation propagating across cortex. This leads to a… ▽ More

    Submitted 25 November, 2003; v1 submitted 10 October, 2003; originally announced October 2003.

    Comments: 21 pages, 11 figures. Added final journal reference, fixed minor typos

    Journal ref: Neural Networks, 16:1311-1323, 2003