Skip to main content

Showing 1–7 of 7 results for author: Hoisie, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.16792  [pdf, other

    cs.LG cs.AR

    Learning Generalizable Program and Architecture Representations for Performance Modeling

    Authors: Lingda Li, Thomas Flynn, Adolfy Hoisie

    Abstract: Performance modeling is an essential tool in many areas, including performance characterization/optimization, design space exploration, and resource allocation problems, to name a few. However, existing performance modeling approaches have limitations, such as high computational cost for discrete-event simulators, narrow flexibility of hardware emulators, or restricted accuracy/generality of analy… ▽ More

    Submitted 26 June, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: To be published at SC 2024

  2. arXiv:2310.04610  [pdf, other

    cs.AI cs.LG

    DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies

    Authors: Shuaiwen Leon Song, Bonnie Kruft, Minjia Zhang, Conglong Li, Shiyang Chen, Chengming Zhang, Masahiro Tanaka, Xiaoxia Wu, Jeff Rasley, Ammar Ahmad Awan, Connor Holmes, Martin Cai, Adam Ghanem, Zhongzhu Zhou, Yuxiong He, Pete Luferenko, Divya Kumar, Jonathan Weyn, Ruixiong Zhang, Sylwester Klocek, Volodymyr Vragov, Mohammed AlQuraishi, Gustaf Ahdritz, Christina Floristean, Cristina Negri , et al. (67 additional authors not shown)

    Abstract: In the upcoming decade, deep learning may revolutionize the natural sciences, enhancing our capacity to model and predict natural occurrences. This could herald a new era of scientific exploration, bringing significant advancements across sectors from drug development to renewable energy. To answer this call, we present DeepSpeed4Science initiative (deepspeed4science.ai) which aims to build unique… ▽ More

    Submitted 11 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

  3. arXiv:2105.05821  [pdf, other

    cs.AR cs.LG

    SimNet: Accurate and High-Performance Computer Architecture Simulation using Deep Learning

    Authors: Lingda Li, Santosh Pandey, Thomas Flynn, Hang Liu, Noel Wheeler, Adolfy Hoisie

    Abstract: While discrete-event simulators are essential tools for architecture research, design, and development, their practicality is limited by an extremely long time-to-solution for realistic applications under investigation. This work describes a concerted effort, where machine learning (ML) is used to accelerate discrete-event simulation. First, an ML-based instruction latency prediction framework tha… ▽ More

    Submitted 5 April, 2022; v1 submitted 12 May, 2021; originally announced May 2021.

  4. TRUST: Triangle Counting Reloaded on GPUs

    Authors: Santosh Pandey, Zhibin Wang, Sheng Zhong, Chen Tian, Bolong Zheng, Xiaoye Li, Lingda Li, Adolfy Hoisie, Caiwen Ding, Dong Li, Hang Liu

    Abstract: Triangle counting is a building block for a wide range of graph applications. Traditional wisdom suggests that i) hashing is not suitable for triangle counting, ii) edge-centric triangle counting beats vertex-centric design, and iii) communication-free and workload balanced graph partitioning is a grand challenge for triangle counting. On the contrary, we advocate that i) hashing can help the key… ▽ More

    Submitted 14 March, 2021; originally announced March 2021.

  5. C-SAW: A Framework for Graph Sampling and Random Walk on GPUs

    Authors: Santosh Pandey, Lingda Li, Adolfy Hoisie, Xiaoye S. Li, Hang Liu

    Abstract: Many applications require to learn, mine, analyze and visualize large-scale graphs. These graphs are often too large to be addressed efficiently using conventional graph processing technologies. Many applications have requirements to analyze, transform, visualize and learn large scale graphs. These graphs are often too large to be addressed efficiently using conventional graph processing technolog… ▽ More

    Submitted 18 September, 2020; originally announced September 2020.

    Comments: 12 pages,IEEE Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC20)

  6. arXiv:1905.08764  [pdf, other

    cs.PF cs.DC cs.LG

    Performance Analysis of Deep Learning Workloads on Leading-edge Systems

    Authors: Yihui Ren, Shinjae Yoo, Adolfy Hoisie

    Abstract: This work examines the performance of leading-edge systems designed for machine learning computing, including the NVIDIA DGX-2, Amazon Web Services (AWS) P3, IBM Power System Accelerated Compute Server AC922, and a consumer-grade Exxact TensorEX TS4 GPU server. Representative deep learning workloads from the fields of computer vision and natural language processing are the focus of the analysis. P… ▽ More

    Submitted 1 October, 2019; v1 submitted 21 May, 2019; originally announced May 2019.

    Comments: 11 pages, 9 figures

    ACM Class: D.4.8; C.1.2; I.2.0

  7. arXiv:1406.5161  [pdf, other

    cs.DC cs.LG

    Fast Support Vector Machines Using Parallel Adaptive Shrinking on Distributed Systems

    Authors: Jeyanthi Narasimhan, Abhinav Vishnu, Lawrence Holder, Adolfy Hoisie

    Abstract: Support Vector Machines (SVM), a popular machine learning technique, has been applied to a wide range of domains such as science, finance, and social networks for supervised learning. Whether it is identifying high-risk patients by health-care professionals, or potential high-school students to enroll in college by school districts, SVMs can play a major role for social good. This paper undertakes… ▽ More

    Submitted 19 June, 2014; originally announced June 2014.

    Comments: 10 pages, 9 figures, 3 tables