Showing 1–6 of 6 results for author: McClelland, J L

  1. arXiv:2112.03753  [pdf, other]

    cs.LG cs.AI stat.ML

    Tell me why! Explanations support learning relational and causal structure

    Authors: Andrew K. Lampinen, Nicholas A. Roy, Ishita Dasgupta, Stephanie C. Y. Chan, Allison C. Tam, James L. McClelland, Chen Yan, Adam Santoro, Neil C. Rabinowitz, Jane X. Wang, Felix Hill

    Abstract: Inferring the abstract relational and causal structure of the world is a major challenge for reinforcement-learning (RL) agents. For humans, language--particularly in the form of explanations--plays a considerable role in overcoming this challenge. Here, we show that language can play a similar role for deep RL agents in complex environments. While agents typically struggle to acquire relational a…

    Submitted 25 May, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

    Comments: ICML 2022; 23 pages

    ACM Class: I.2.6

  2. arXiv:2005.04318  [pdf, other]

    cs.LG cs.AI stat.ML

    Transforming task representations to perform novel tasks

    Authors: Andrew K. Lampinen, James L. McClelland

    Abstract: An important aspect of intelligence is the ability to adapt to a novel task without any direct experience (zero-shot), based on its relationship to previous tasks. Humans can exhibit this cognitive flexibility. By contrast, models that achieve superhuman performance in specific tasks often fail to adapt to even slight task alterations. To address this, we propose a general computational framework…

    Submitted 6 October, 2020; v1 submitted 8 May, 2020; originally announced May 2020.

    Comments: 45 pages

    ACM Class: I.2.0; I.2.6

    Journal ref: PNAS December 29, 2020 117 (52) 32970-32981

  3. arXiv:1905.09950  [pdf, other]

    cs.LG cs.NE stat.ML

    Zero-shot task adaptation by homoiconic meta-mapping

    Authors: Andrew K. Lampinen, James L. McClelland

    Abstract: How can deep learning systems flexibly reuse their knowledge? Toward this goal, we propose a new class of challenges, and a class of architectures that can solve them. The challenges are meta-mappings, which involve systematically transforming task behaviors to adapt to new tasks zero-shot. The key to achieving these challenges is representing the task being performed in such a way that this task…

    Submitted 12 November, 2019; v1 submitted 23 May, 2019; originally announced May 2019.

    Comments: 27 pages

    ACM Class: I.2.0; I.2.6

  4. arXiv:1810.10531  [pdf, other]

    cs.LG cs.AI q-bio.NC stat.ML

    A mathematical theory of semantic development in deep neural networks

    Authors: Andrew M. Saxe, James L. McClelland, Surya Ganguli

    Abstract: An extensive body of empirical research has revealed remarkable regularities in the acquisition, organization, deployment, and neural representation of human semantic knowledge, thereby raising a fundamental conceptual question: what are the theoretical principles governing the ability of neural networks to acquire, organize, and deploy abstract knowledge by integrating across many individual expe…

    Submitted 23 October, 2018; originally announced October 2018.

  5. arXiv:1710.10280  [pdf, other]

    cs.CL cs.LG stat.ML

    One-shot and few-shot learning of word embeddings

    Authors: Andrew K. Lampinen, James L. McClelland

    Abstract: Standard deep learning systems require thousands or millions of examples to learn a concept, and cannot integrate new concepts easily. By contrast, humans have an incredible ability to do one-shot or few-shot learning. For instance, from just hearing a word used in a sentence, humans can infer a great deal about it, by leveraging what the syntax and semantics of the surrounding words tells us. Her…

    Submitted 2 January, 2018; v1 submitted 27 October, 2017; originally announced October 2017.

    Comments: 15 pages, 7 figures, under review as a conference paper at ICLR 2018

    ACM Class: I.2.7

  6. arXiv:1312.6120  [pdf, other]

    cs.NE cond-mat.dis-nn cs.CV cs.LG q-bio.NC stat.ML

    Exact solutions to the nonlinear dynamics of learning in deep linear neural networks

    Authors: Andrew M. Saxe, James L. McClelland, Surya Ganguli

    Abstract: Despite the widespread practical success of deep learning methods, our theoretical understanding of the dynamics of learning in deep neural networks remains quite sparse. We attempt to bridge the gap between the theory and practice of deep learning by systematically analyzing learning dynamics for the restricted case of deep linear neural networks. Despite the linearity of their input-output map,…

    Submitted 19 February, 2014; v1 submitted 20 December, 2013; originally announced December 2013.

    Comments: Submission to ICLR2014. Revised based on reviewer feedback