Skip to main content

Showing 1–3 of 3 results for author: Ostrow, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.11993  [pdf, other

    cs.LG cs.NE

    Delay Embedding Theory of Neural Sequence Models

    Authors: Mitchell Ostrow, Adam Eisen, Ila Fiete

    Abstract: To generate coherent responses, language models infer unobserved meaning from their input text sequence. One potential explanation for this capability arises from theories of delay embeddings in dynamical systems, which prove that unobserved variables can be recovered from the history of only a handful of observed variables. To test whether language models are effectively constructing delay embedd… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 14 pages, 9 figures

  2. arXiv:2402.10202  [pdf, other

    cs.LG

    Bridging Associative Memory and Probabilistic Modeling

    Authors: Rylan Schaeffer, Nika Zahedi, Mikail Khona, Dhruv Pai, Sang Truong, Yilun Du, Mitchell Ostrow, Sarthak Chandra, Andres Carranza, Ila Rani Fiete, Andrey Gromov, Sanmi Koyejo

    Abstract: Associative memory and probabilistic modeling are two fundamental topics in artificial intelligence. The first studies recurrent neural networks designed to denoise, complete and retrieve data, whereas the second studies learning and sampling from probability distributions. Based on the observation that associative memory's energy functions can be seen as probabilistic modeling's negative log like… ▽ More

    Submitted 13 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  3. arXiv:2306.10168  [pdf, other

    q-bio.NC cs.LG cs.NE q-bio.QM

    Beyond Geometry: Comparing the Temporal Structure of Computation in Neural Circuits with Dynamical Similarity Analysis

    Authors: Mitchell Ostrow, Adam Eisen, Leo Kozachkov, Ila Fiete

    Abstract: How can we tell whether two neural networks utilize the same internal processes for a particular computation? This question is pertinent for multiple subfields of neuroscience and machine learning, including neuroAI, mechanistic interpretability, and brain-machine interfaces. Standard approaches for comparing neural networks focus on the spatial geometry of latent states. Yet in recurrent networks… ▽ More

    Submitted 29 October, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: 22 pages, 9 figures