Showing 1–16 of 16 results for author: Sussillo, D

Searching in archive cs.
  1. arXiv:2302.14078  [pdf, other]

    cs.LG math.DS

    Analyzing Populations of Neural Networks via Dynamical Model Embedding

    Authors: Jordan Cotler, Kai Sheng Tai, Felipe Hernández, Blake Elias, David Sussillo

    Abstract: A core challenge in the interpretation of deep neural networks is identifying commonalities between the underlying algorithms implemented by distinct networks trained for the same task. Motivated by this problem, we introduce DYNAMO, an algorithm that constructs low-dimensional manifolds where each point corresponds to a neural network model, and two points are nearby if the corresponding neural n…

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: 12+8 pages, 11 figures

  2. arXiv:2210.08340  [pdf]

    cs.AI q-bio.NC

    Toward Next-Generation Artificial Intelligence: Catalyzing the NeuroAI Revolution

    Authors: Anthony Zador, Sean Escola, Blake Richards, Bence Ölveczky, Yoshua Bengio, Kwabena Boahen, Matthew Botvinick, Dmitri Chklovskii, Anne Churchland, Claudia Clopath, James DiCarlo, Surya Ganguli, Jeff Hawkins, Konrad Koerding, Alexei Koulakov, Yann LeCun, Timothy Lillicrap, Adam Marblestone, Bruno Olshausen, Alexandre Pouget, Cristina Savin, Terrence Sejnowski, Eero Simoncelli, Sara Solla, David Sussillo, et al. (2 additional authors not shown)

    Abstract: Neuroscience has long been an essential driver of progress in artificial intelligence (AI). We propose that to accelerate progress in AI, we must invest in fundamental research in NeuroAI. A core component of this is the embodied Turing test, which challenges AI animal models to interact with the sensorimotor world at skill levels akin to their living counterparts. The embodied Turing test shifts…

    Submitted 22 February, 2023; v1 submitted 15 October, 2022; originally announced October 2022.

    Comments: White paper, 10 pages + 8 pages of references, 1 figure

  3. arXiv:2111.01256  [pdf, other]

    cs.LG

    Reverse engineering recurrent neural networks with Jacobian switching linear dynamical systems

    Authors: Jimmy T. H. Smith, Scott W. Linderman, David Sussillo

    Abstract: Recurrent neural networks (RNNs) are powerful models for processing time-series data, but it remains challenging to understand how they function. Improving this understanding is of substantial interest to both the machine learning and neuroscience communities. The framework of reverse engineering a trained RNN by linearizing around its fixed points has provided insight, but the approach has signif…

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: 23 pages, 9 figures

  4. arXiv:2011.02159  [pdf, other]

    cs.LG cs.NE stat.ML

    Reverse engineering learned optimizers reveals known and novel mechanisms

    Authors: Niru Maheswaranathan, David Sussillo, Luke Metz, Ruoxi Sun, Jascha Sohl-Dickstein

    Abstract: Learned optimizers are algorithms that can themselves be trained to solve optimization problems. In contrast to baseline optimizers (such as momentum or Adam) that use simple update rules derived from theoretical principles, learned optimizers use flexible, high-dimensional, nonlinear parameterizations. Although this can lead to better performance in certain settings, their inner workings remain a…

    Submitted 7 December, 2021; v1 submitted 4 November, 2020; originally announced November 2020.

    Comments: Thirty-Fifth Conference on Neural Information Processing Systems. 2021

  5. arXiv:2010.15114  [pdf, other]

    cs.LG cs.CL stat.ML

    The geometry of integration in text classification RNNs

    Authors: Kyle Aitken, Vinay V. Ramasesh, Ankush Garg, Yuan Cao, David Sussillo, Niru Maheswaranathan

    Abstract: Despite the widespread application of recurrent neural networks (RNNs) across a variety of tasks, a unified understanding of how RNNs solve these tasks remains elusive. In particular, it is unclear what dynamical patterns arise in trained RNNs, and how those patterns depend on the training dataset or task. This work addresses these questions in the context of a specific natural language processing…

    Submitted 3 June, 2022; v1 submitted 28 October, 2020; originally announced October 2020.

    Comments: 9+19 pages, 30 figures; v2: smaller file size

  6. arXiv:2004.08013  [pdf, other]

    cs.CL cs.LG stat.ML

    How recurrent networks implement contextual processing in sentiment analysis

    Authors: Niru Maheswaranathan, David Sussillo

    Abstract: Neural networks have a remarkable capacity for contextual processing--using recent or nearby inputs to modify processing of current input. For example, in natural language, contextual processing is necessary to correctly interpret negation (e.g. phrases such as "not bad"). However, our ability to understand how networks process context is limited. Here, we propose general methods for reverse engin…

    Submitted 16 April, 2020; originally announced April 2020.

  7. arXiv:1907.08549  [pdf, other]

    q-bio.NC cs.NE

    Universality and individuality in neural dynamics across large populations of recurrent networks

    Authors: Niru Maheswaranathan, Alex H. Williams, Matthew D. Golub, Surya Ganguli, David Sussillo

    Abstract: Task-based modeling with recurrent neural networks (RNNs) has emerged as a popular way to infer the computational function of different brain regions. These models are quantitatively assessed by comparing the low-dimensional neural representations of the model with the brain, for example using canonical correlation analysis (CCA). However, the nature of the detailed neurobiological inferences one…

    Submitted 4 December, 2019; v1 submitted 19 July, 2019; originally announced July 2019.

    Comments: Presented at NeurIPS 2019

  8. arXiv:1906.10720  [pdf, other]

    cs.LG stat.ML

    Reverse engineering recurrent networks for sentiment classification reveals line attractor dynamics

    Authors: Niru Maheswaranathan, Alex Williams, Matthew D. Golub, Surya Ganguli, David Sussillo

    Abstract: Recurrent neural networks (RNNs) are a widely used tool for modeling sequential data, yet they are often treated as inscrutable black boxes. Given a trained recurrent network, we would like to reverse engineer it--to obtain a quantitative, interpretable description of how it solves a particular task. Even for simple tasks, a detailed understanding of how recurrent networks work, or a prescription…

    Submitted 4 December, 2019; v1 submitted 25 June, 2019; originally announced June 2019.

    Comments: Presented at NeurIPS 2019

  9. arXiv:1807.00053  [pdf, other]

    q-bio.NC cs.AI cs.CV cs.LG cs.NE

    Task-Driven Convolutional Recurrent Models of the Visual System

    Authors: Aran Nayebi, Daniel Bear, Jonas Kubilius, Kohitij Kar, Surya Ganguli, David Sussillo, James J. DiCarlo, Daniel L. K. Yamins

    Abstract: Feed-forward convolutional neural networks (CNNs) are currently state-of-the-art for object classification tasks such as ImageNet. Further, they are quantitatively accurate models of temporally-averaged responses of neurons in the primate brain's visual system. However, biological visual systems have two ubiquitous architectural features not shared with typical CNNs: local recurrence within cortic…

    Submitted 26 October, 2018; v1 submitted 20 June, 2018; originally announced July 2018.

    Comments: NIPS 2018 Camera Ready Version, 16 pages including supplementary information, 6 figures

  10. arXiv:1803.06092  [pdf, other]

    cs.AI cs.CV cs.LG

    A Dataset and Architecture for Visual Reasoning with a Working Memory

    Authors: Guangyu Robert Yang, Igor Ganichev, Xiao-Jing Wang, Jonathon Shlens, David Sussillo

    Abstract: A vexing problem in artificial intelligence is reasoning about events that occur in complex, changing visual stimuli such as in video analysis or game play. Inspired by a rich tradition of visual reasoning and memory in cognitive psychology and neuroscience, we developed an artificial, configurable visual question and answer dataset (COG) to parallel experiments in humans and animals. COG is much…

    Submitted 20 July, 2018; v1 submitted 16 March, 2018; originally announced March 2018.

  11. arXiv:1711.10151  [pdf, other]

    cs.CV

    Recurrent Segmentation for Variable Computational Budgets

    Authors: Lane McIntosh, Niru Maheswaranathan, David Sussillo, Jonathon Shlens

    Abstract: State-of-the-art systems for semantic image segmentation use feed-forward pipelines with fixed computational costs. Building an image segmentation system that works across a range of computational budgets is challenging and time-intensive as new architectures must be designed and trained for every computational setting. To address this problem we develop a recurrent neural network that successivel…

    Submitted 14 March, 2018; v1 submitted 28 November, 2017; originally announced November 2017.

  12. arXiv:1611.09913  [pdf, other]

    stat.ML cs.AI cs.LG cs.NE

    Capacity and Trainability in Recurrent Neural Networks

    Authors: Jasmine Collins, Jascha Sohl-Dickstein, David Sussillo

    Abstract: Two potential bottlenecks on the expressiveness of recurrent neural networks (RNNs) are their ability to store information about the task in their parameters, and to store information about the input history in their units. We show experimentally that all common RNN architectures achieve nearly the same per-task and per-unit capacity bounds with careful training, for a variety of tasks and stackin…

    Submitted 3 March, 2017; v1 submitted 29 November, 2016; originally announced November 2016.

    Comments: Published as a conference paper at ICLR 2017

  13. arXiv:1611.09434  [pdf, other]

    cs.AI cs.CL cs.LG cs.NE

    Input Switched Affine Networks: An RNN Architecture Designed for Interpretability

    Authors: Jakob N. Foerster, Justin Gilmer, Jan Chorowski, Jascha Sohl-Dickstein, David Sussillo

    Abstract: There exist many problem domains where the interpretability of neural network models is essential for deployment. Here we introduce a recurrent architecture composed of input-switched affine transformations - in other words an RNN without any explicit nonlinearities, but with input-dependent recurrent weights. This simple form allows the RNN to be analyzed via straightforward linear methods: we ca…

    Submitted 12 June, 2017; v1 submitted 28 November, 2016; originally announced November 2016.

    Comments: ICLR 2017 submission: https://openreview.net/forum?id=H1MjAnqxg

  14. arXiv:1608.06315  [pdf, other]

    cs.LG q-bio.NC stat.ML

    LFADS - Latent Factor Analysis via Dynamical Systems

    Authors: David Sussillo, Rafal Jozefowicz, L. F. Abbott, Chethan Pandarinath

    Abstract: Neuroscience is experiencing a data revolution in which many hundreds or thousands of neurons are recorded simultaneously. Currently, there is little consensus on how such data should be analyzed. Here we introduce LFADS (Latent Factor Analysis via Dynamical Systems), a method to infer latent dynamics from simultaneously recorded, single-trial, high-dimensional neural spiking data. LFADS is a sequ…

    Submitted 22 August, 2016; originally announced August 2016.

    Comments: 16 pages, 11 figures

  15. arXiv:1511.04868  [pdf, other]

    cs.LG cs.CL cs.NE

    A Neural Transducer

    Authors: Navdeep Jaitly, David Sussillo, Quoc V. Le, Oriol Vinyals, Ilya Sutskever, Samy Bengio

    Abstract: Sequence-to-sequence models have achieved impressive results on various tasks. However, they are unsuitable for tasks that require incremental predictions to be made as more data arrives or tasks that have long input sequences and output sequences. This is because they generate an output sequence conditioned on an entire input sequence. In this paper, we present a Neural Transducer that can make i…

    Submitted 4 August, 2016; v1 submitted 16 November, 2015; originally announced November 2015.

  16. arXiv:1412.6558  [pdf, other]

    cs.NE cs.LG stat.ML

    Random Walk Initialization for Training Very Deep Feedforward Networks

    Authors: David Sussillo, L. F. Abbott

    Abstract: Training very deep networks is an important open problem in machine learning. One of many difficulties is that the norm of the back-propagated error gradient can grow or decay exponentially. Here we show that training very deep feed-forward networks (FFNs) is not as difficult as previously thought. Unlike when back-propagation is applied to a recurrent network, application to an FFN amounts to mul…

    Submitted 27 February, 2015; v1 submitted 19 December, 2014; originally announced December 2014.

    Comments: 10 pages, 4 figures