Skip to main content

Showing 1–2 of 2 results for author: Richmond, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2303.10271  [pdf, other

    cs.AR

    VPU-EM: An Event-based Modeling Framework to Evaluate NPU Performance and Power Efficiency at Scale

    Authors: Charles Qi, Yi Wang, Hui Wang, Yang Lu, Shiva Shankar Subramanian, Finola Cahill, Conall Tuohy, Victor Li, Xu Qian, Darren Crews, Ling Wang, Shivaji Roy, Andrea Deidda, Martin Power, Niall Hanrahan, Rick Richmond, Umer Cheema, Arnab Raha, Alessandro Palla, Gary Baugh, Deepak Mathaikutty

    Abstract: State-of-art NPUs are typically architected as a self-contained sub-system with multiple heterogeneous hardware computing modules, and a dataflow-driven programming model. There lacks well-established methodology and tools in the industry to evaluate and compare the performance of NPUs from different architectures. We present an event-based performance modeling framework, VPU-EM, targeting scalabl… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: 8 pages, 9 figures

    ACM Class: B.2.2; B.8.2

  2. arXiv:2205.04586  [pdf, other

    cs.LG cs.AI cs.NE

    Towards Optimal VPU Compiler Cost Modeling by using Neural Networks to Infer Hardware Performances

    Authors: Ian Frederick Vigogne Goodbody Hunter, Alessandro Palla, Sebastian Eusebiu Nagy, Richard Richmond, Kyle McAdoo

    Abstract: Calculating the most efficient schedule of work in a neural network compiler is a difficult task. There are many parameters to be accounted for that can positively or adversely affect that schedule depending on their configuration - How work is shared between distributed targets, the subdivision of tensors to fit in memory, toggling the enablement of optimizations, etc. Traditionally, neural netwo… ▽ More

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: 9 pages, 10 figures, 2 tables, Under Review for NeurIPS 2022