Skip to main content

Showing 1–8 of 8 results for author: Sohrabizadeh, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.09606  [pdf, other

    cs.LG cs.AI cs.AR

    Cross-Modality Program Representation Learning for Electronic Design Automation with High-Level Synthesis

    Authors: Zongyue Qin, Yunsheng Bai, Atefeh Sohrabizadeh, Zijian Ding, Ziniu Hu, Yizhou Sun, Jason Cong

    Abstract: In recent years, domain-specific accelerators (DSAs) have gained popularity for applications such as deep learning and autonomous driving. To facilitate DSA designs, programmers use high-level synthesis (HLS) to compile a high-level description written in C/C++ into a design with low-level hardware description languages that eventually synthesize DSAs on circuits. However, creating a high-quality… ▽ More

    Submitted 27 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 14 pages, 8 figures. arXiv admin note: text overlap with arXiv:2305.10838

  2. arXiv:2306.14052  [pdf, other

    cs.LG cs.AR cs.DC

    A Survey on Graph Neural Network Acceleration: Algorithms, Systems, and Customized Hardware

    Authors: Shichang Zhang, Atefeh Sohrabizadeh, Cheng Wan, Zijie Huang, Ziniu Hu, Yewen Wang, Yingyan, Lin, Jason Cong, Yizhou Sun

    Abstract: Graph neural networks (GNNs) are emerging for machine learning research on graph-structured data. GNNs achieve state-of-the-art performance on many tasks, but they face scalability challenges when it comes to real-world applications that have numerous data and strict latency requirements. Many studies have been conducted on how to accelerate GNNs in an effort to address these challenges. These acc… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

  3. arXiv:2305.10838   

    cs.LG cs.PL

    ProgSG: Cross-Modality Representation Learning for Programs in Electronic Design Automation

    Authors: Yunsheng Bai, Atefeh Sohrabizadeh, Zongyue Qin, Ziniu Hu, Yizhou Sun, Jason Cong

    Abstract: Recent years have witnessed the growing popularity of domain-specific accelerators (DSAs), such as Google's TPUs, for accelerating various applications such as deep learning, search, autonomous driving, etc. To facilitate DSA designs, high-level synthesis (HLS) is used, which allows a developer to compile a high-level description in the form of software code in C and C++ into a design in low-level… ▽ More

    Submitted 2 June, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Requires further polishing

  4. arXiv:2209.02951  [pdf, other

    cs.AR cs.PL

    Democratizing Domain-Specific Computing

    Authors: Yuze Chi, Weikang Qiao, Atefeh Sohrabizadeh, Jie Wang, Jason Cong

    Abstract: In the past few years, domain-specific accelerators (DSAs), such as Google's Tensor Processing Units, have shown to offer significant performance and energy efficiency over general-purpose CPUs. An important question is whether typical software developers can design and implement their own customized DSAs, with affordability and efficiency, to accelerate their applications. This article presents o… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: To be published in CACM'22

  5. arXiv:2111.08848  [pdf, other

    cs.AR cs.LG

    Enabling Automated FPGA Accelerator Optimization Using Graph Neural Networks

    Authors: Atefeh Sohrabizadeh, Yunsheng Bai, Yizhou Sun, Jason Cong

    Abstract: High-level synthesis (HLS) has freed the computer architects from develo** their designs in a very low-level language and needing to exactly specify how the data should be transferred in register-level. With the help of HLS, the hardware designers must describe only a high-level behavioral flow of the design. Despite this, it still can take weeks to develop a high-performance architecture mainly… ▽ More

    Submitted 21 November, 2021; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: 12 pages

  6. arXiv:2111.05936  [pdf, other

    cs.LG cs.AR cs.DC

    SPA-GCN: Efficient and Flexible GCN Accelerator with an Application for Graph Similarity Computation

    Authors: Atefeh Sohrabizadeh, Yuze Chi, Jason Cong

    Abstract: While there have been many studies on hardware acceleration for deep learning on images, there has been a rather limited focus on accelerating deep learning applications involving graphs. The unique characteristics of graphs, such as the irregular memory access and dynamic parallelism, impose several challenges when the algorithm is mapped to a CPU or GPU. To address these challenges while exploit… ▽ More

    Submitted 10 November, 2021; originally announced November 2021.

    Comments: 12 pages

  7. arXiv:2109.11081  [pdf, other

    cs.AR cs.DC

    Sextans: A Streaming Accelerator for General-Purpose Sparse-Matrix Dense-Matrix Multiplication

    Authors: Linghao Song, Yuze Chi, Atefeh Sohrabizadeh, Young-kyu Choi, Jason Lau, Jason Cong

    Abstract: Sparse-Matrix Dense-Matrix multiplication (SpMM) is the key operator for a wide range of applications, including scientific computing, graph processing, and deep learning. Architecting accelerators for SpMM is faced with three challenges - (1) the random memory accessing and unbalanced load in processing because of random distribution of elements in sparse matrices, (2) inefficient data handling o… ▽ More

    Submitted 12 January, 2022; v1 submitted 22 September, 2021; originally announced September 2021.

    Comments: To appear in FPGA'22

  8. arXiv:2009.14381  [pdf, other

    cs.AR cs.PL

    AutoDSE: Enabling Software Programmers to Design Efficient FPGA Accelerators

    Authors: Atefeh Sohrabizadeh, Cody Hao Yu, Min Gao, Jason Cong

    Abstract: Adopting FPGA as an accelerator in datacenters is becoming mainstream for customized computing, but the fact that FPGAs are hard to program creates a steep learning curve for software programmers. Even with the help of high-level synthesis (HLS), accelerator designers still have to manually perform code reconstruction and cumbersome parameter tuning to achieve the optimal performance. While many l… ▽ More

    Submitted 31 August, 2021; v1 submitted 29 September, 2020; originally announced September 2020.

    Comments: 25 pages