Skip to main content

Showing 1–5 of 5 results for author: Agostini, N B

.
  1. arXiv:2404.15510  [pdf, other

    cs.AR cs.DC cs.LG cs.NE

    NeuraChip: Accelerating GNN Computations with a Hash-based Decoupled Spatial Accelerator

    Authors: Kaustubh Shivdikar, Nicolas Bohm Agostini, Malith Jayaweera, Gilbert Jonatan, Jose L. Abellan, Ajay Joshi, John Kim, David Kaeli

    Abstract: Graph Neural Networks (GNNs) are emerging as a formidable tool for processing non-euclidean data across various domains, ranging from social network analysis to bioinformatics. Despite their effectiveness, their adoption has not been pervasive because of scalability challenges associated with large-scale graph datasets, particularly when leveraging message passing. To tackle these challenges, we… ▽ More

    Submitted 26 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: Visit https://neurachip.us for WebGUI based simulations

  2. arXiv:2402.19184  [pdf, other

    cs.PL

    Data Transfer Optimizations for Host-CPU and Accelerators in AXI4MLIR

    Authors: Jude Haris, Nicolas Bohm Agostini, Antonino Tumeo, David Kaeli, José Cano

    Abstract: As custom hardware accelerators become more prevalent, it becomes increasingly important to automatically generate efficient host-driver code that can fully leverage the capabilities of these accelerators. This approach saves time and reduces the likelihood of errors that can occur during manual implementation. AXI4MLIR extends the MLIR compiler framework to generate host-driver code for custom ac… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  3. AXI4MLIR: User-Driven Automatic Host Code Generation for Custom AXI-Based Accelerators

    Authors: Nicolas Bohm Agostini, Jude Haris, Perry Gibson, Malith Jayaweera, Norm Rubin, Antonino Tumeo, José L. Abellán, José Cano, David Kaeli

    Abstract: This paper addresses the need for automatic and efficient generation of host driver code for arbitrary custom AXI-based accelerators targeting linear algebra algorithms, an important workload in various applications, including machine learning and scientific computing. While existing tools have focused on automating accelerator prototy**, little attention has been paid to the host-accelerator in… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: 13 pages, 17 figures, to appear in CGO2024

    ACM Class: D.3.3

  4. arXiv:2110.00478  [pdf, other

    cs.AR cs.DC cs.LG

    SECDA: Efficient Hardware/Software Co-Design of FPGA-based DNN Accelerators for Edge Inference

    Authors: Jude Haris, Perry Gibson, José Cano, Nicolas Bohm Agostini, David Kaeli

    Abstract: Edge computing devices inherently face tight resource constraints, which is especially apparent when deploying Deep Neural Networks (DNN) with high memory and compute demands. FPGAs are commonly available in edge devices. Since these reconfigurable circuits can achieve higher throughput and lower power consumption than general purpose processors, they are especially well-suited for DNN acceleratio… ▽ More

    Submitted 1 October, 2021; originally announced October 2021.

    Comments: This paper is accepted to SBAC-PAD 2021

  5. arXiv:1911.11313  [pdf, other

    cs.DC

    Summarizing CPU and GPU Design Trends with Product Data

    Authors: Yifan Sun, Nicolas Bohm Agostini, Shi Dong, David Kaeli

    Abstract: Moore's Law and Dennard Scaling have guided the semiconductor industry for the past few decades. Recently, both laws have faced validity challenges as transistor sizes approach the practical limits of physics. We are interested in testing the validity of these laws and reflect on the reasons responsible. In this work, we collect data of more than 4000 publicly-available CPU and GPU products. We fi… ▽ More

    Submitted 13 July, 2020; v1 submitted 25 November, 2019; originally announced November 2019.

    Comments: Fix flops/watt error