Skip to main content

Showing 1–1 of 1 results for author: Sarrof, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.17394  [pdf, other

    cs.CL cs.FL cs.LG

    The Expressive Capacity of State Space Models: A Formal Language Perspective

    Authors: Yash Sarrof, Yana Veitsman, Michael Hahn

    Abstract: Recently, recurrent models based on linear state space models (SSMs) have shown promising performance in language modeling (LM), competititve with transformers. However, there is little understanding of the in-principle abilities of such models, which could provide useful guidance to the search for better LM architectures. We present a comprehensive theoretical study of the capacity of such SSMs a… ▽ More

    Submitted 2 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.