Skip to main content

Showing 1–2 of 2 results for author: Sifferman, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.02528  [pdf, other

    cs.CL

    Scalable MatMul-free Language Modeling

    Authors: Rui-Jie Zhu, Yu Zhang, Ethan Sifferman, Tyler Sheaves, Yiqiao Wang, Dustin Richmond, Peng Zhou, Jason K. Eshraghian

    Abstract: Matrix multiplication (MatMul) typically dominates the overall computational cost of large language models (LLMs). This cost only grows as LLMs scale to larger embedding dimensions and context lengths. In this work, we show that MatMul operations can be completely eliminated from LLMs while maintaining strong performance at billion-parameter scales. Our experiments show that our proposed MatMul-fr… ▽ More

    Submitted 18 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  2. arXiv:2401.16202  [pdf, other

    cs.AR cs.ET

    FPIA: Field-Programmable Ising Arrays with In-Memory Computing

    Authors: George Higgins Hutchinson, Ethan Sifferman, Tinish Bhattacharya, Dmitri B. Strukov

    Abstract: Ising Machine is a promising computing approach for solving combinatorial optimization problems. It is naturally suited for energy-saving and compact in-memory computing implementations with emerging memories. A naïve in-memory computing implementation of a quadratic Ising Machine requires an array of coupling weights that grows quadratically with problem size. However, the resources in such an ap… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 7 pages, 12 figures