Skip to main content

Showing 1–3 of 3 results for author: Odemuyiwa, T O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10491  [pdf, other

    cs.AR

    FuseMax: Leveraging Extended Einsums to Optimize Attention Accelerator Design

    Authors: Nandeeka Nayak, Xinrui Wu, Toluwanimi O. Odemuyiwa, Michael Pellauer, Joel S. Emer, Christopher W. Fletcher

    Abstract: Attention for transformers is a critical workload that has recently received significant "attention" as a target for custom acceleration. Yet, while prior work succeeds in reducing attention's memory-bandwidth requirements, it creates load imbalance between attention operators (resulting in severe compute under-utilization) and requires on-chip memory that scales with sequence length (which is exp… ▽ More

    Submitted 25 June, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

    Comments: 15 pages, 10 figures

  2. arXiv:2404.11591  [pdf, other

    cs.DS

    The EDGE Language: Extended General Einsums for Graph Algorithms

    Authors: Toluwanimi O. Odemuyiwa, Joel S. Emer, John D. Owens

    Abstract: In this work, we propose a unified abstraction for graph algorithms: the Extended General Einsums language, or EDGE. The EDGE language expresses graph algorithms in the language of tensor algebra, providing a rigorous, succinct, and expressive mathematical framework. EDGE leverages two ideas: (1) the well-known foundations provided by the graph-matrix duality, where a graph is simply a 2D tensor,… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 79 pages, 14 figures

  3. TeAAL: A Declarative Framework for Modeling Sparse Tensor Accelerators

    Authors: Nandeeka Nayak, Toluwanimi O. Odemuyiwa, Shubham Ugare, Christopher W. Fletcher, Michael Pellauer, Joel S. Emer

    Abstract: Over the past few years, the explosion in sparse tensor algebra workloads has led to a corresponding rise in domain-specific accelerators to service them. Due to the irregularity present in sparse tensors, these accelerators employ a wide variety of novel solutions to achieve good performance. At the same time, prior work on design-flexible sparse accelerator modeling does not express this full ra… ▽ More

    Submitted 11 June, 2024; v1 submitted 16 April, 2023; originally announced April 2023.

    Comments: 17 pages, 13 figures