Skip to main content

Showing 1–4 of 4 results for author: Zacharopoulos, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.19917  [pdf, other

    cs.AR cs.DC

    ReDSEa: Automated Acceleration of Triangular Solver on Supercloud Heterogeneous Systems

    Authors: Georgios Zacharopoulos, Ilias Bournias, Verner Vlacic, Lukas Cavigelli

    Abstract: When utilized effectively, Supercloud heterogeneous systems have the potential to significantly enhance performance. Our ReDSEa tool-chain automates the map**, load balancing, scheduling, parallelism, and overlap** processes for the Triangular System Solver (TS) on a heterogeneous system consisting of a Huawei Kunpeng ARM multi-core CPU and an Ascend 910 AI HW accelerator. We propose an LLVM c… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: 4 pages, SSH-S0C DAC 2023 Workshop

  2. arXiv:2201.08603  [pdf, other

    cs.AR

    Trireme: Exploring Hierarchical Multi-Level Parallelism for Domain Specific Hardware Acceleration

    Authors: Georgios Zacharopoulos, Adel Ejjeh, Ying **g, En-Yu Yang, Tianyu Jia, Iulian Brumar, Jeremy Intan, Muhammad Huzaifa, Sarita Adve, Vikram Adve, Gu-Yeon Wei, David Brooks

    Abstract: The design of heterogeneous systems that include domain specific accelerators is a challenging and time-consuming process. While taking into account area constraints, designers must decide which parts of an application to accelerate in hardware and which to leave in software. Moreover, applications in domains such as Extended Reality (XR) offer opportunities for various forms of parallel execution… ▽ More

    Submitted 21 January, 2022; originally announced January 2022.

    Comments: 20 pages

  3. arXiv:2111.14767  [pdf, other

    cs.AR cs.LG

    A Graph Deep Learning Framework for High-Level Synthesis Design Space Exploration

    Authors: Lorenzo Ferretti, Andrea Cini, Georgios Zacharopoulos, Cesare Alippi, Laura Pozzi

    Abstract: The design of efficient hardware accelerators for high-throughput data-processing applications, e.g., deep neural networks, is a challenging task in computer architecture design. In this regard, High-Level Synthesis (HLS) emerges as a solution for fast prototy** application-specific hardware starting from a behavioural description of the application computational flow. This Design-Space Explorat… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

  4. arXiv:2111.09222  [pdf, other

    cs.AR

    Early DSE and Automatic Generation of Coarse Grained Merged Accelerators

    Authors: Iulian Brumar, Georgios Zacharopoulos, Yuan Yao, Saketh Rama, Gu-Yeon Wei, David Brooks

    Abstract: Post-Moore's law area-constrained systems rely on accelerators to deliver performance enhancements. Coarse grained accelerators can offer substantial domain acceleration, but manual, ad-hoc identification of code to accelerate is prohibitively expensive. Because cycle-accurate simulators and high-level synthesis flows are so time-consuming, manual creation of high-utilization accelerators that exp… ▽ More

    Submitted 17 November, 2021; originally announced November 2021.