Skip to main content

Showing 1–2 of 2 results for author: Soltaniyeh, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2107.13386  [pdf, other

    cs.AR

    SPOTS: An Accelerator for Sparse Convolutional Networks Leveraging Systolic General Matrix-Matrix Multiplication

    Authors: Mohammadreza Soltaniyeh, Richard P. Martin, Santosh Nagarakatte

    Abstract: This paper proposes a new hardware accelerator for sparse convolutional neural networks (CNNs) by building a hardware unit to perform the Image to Column (IM2COL) transformation of the input feature map coupled with a systolic array-based general matrix-matrix multiplication (GEMM) unit. Our design carefully overlaps the IM2COL transformation with the GEMM computation to maximize parallelism. We p… ▽ More

    Submitted 24 November, 2021; v1 submitted 28 July, 2021; originally announced July 2021.

    Comments: 24 pages

    Report number: Rutgers Department of Computer Science Technical Report DCS-TR-756

  2. arXiv:2004.13907  [pdf, other

    cs.DC cs.MS cs.PL

    Synergistic CPU-FPGA Acceleration of Sparse Linear Algebra

    Authors: Mohammadreza Soltaniyeh, Richard P. Martin, Santosh Nagarakatte

    Abstract: This paper describes REAP, a software-hardware approach that enables high performance sparse linear algebra computations on a cooperative CPU-FPGA platform. REAP carefully separates the task of organizing the matrix elements from the computation phase. It uses the CPU to provide a first-pass re-organization of the matrix elements, allowing the FPGA to focus on the computation. We introduce a new i… ▽ More

    Submitted 28 April, 2020; originally announced April 2020.

    Comments: 12 pages

    Report number: Rutgers Computer Science Technical Report DCS-TR-750