Showing 1–12 of 12 results for author: Kestor, G

  1. arXiv:2109.07419  [pdf, other]

    cs.AR cs.DC cs.LG

    Union: A Unified HW-SW Co-Design Ecosystem in MLIR for Evaluating Tensor Operations on Spatial Accelerators

    Authors: Geonhwa Jeong, Gokcen Kestor, Prasanth Chatarasi, Angshuman Parashar, Po-An Tsai, Sivasankaran Rajamanickam, Roberto Gioiosa, Tushar Krishna

    Abstract: To meet the extreme compute demands for deep learning across commercial and scientific applications, dataflow accelerators are becoming increasingly popular. While these "domain-specific" accelerators are not fully programmable like CPUs and GPUs, they retain varying levels of flexibility with respect to data orchestration, i.e., dataflow and tiling optimizations to enhance efficiency. There are s…

    Submitted 6 November, 2021; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: This paper is accepted to PACT 2021

  2. arXiv:2102.06827  [pdf, other]

    cs.MS physics.chem-ph

    COMET: A Domain-Specific Compilation of High-Performance Computational Chemistry

    Authors: Erdal Mutlu, Ruiqin Tian, Bin Ren, Sriram Krishnamoorthy, Roberto Gioiosa, Jacques Pienaar, Gokcen Kestor

    Abstract: The computational power increases over the past decades have greatly enhanced the ability to simulate chemical reactions and understand ever more complex transformations. Tensor contractions are the fundamental computational building block of these simulations. These simulations have often been tied to one platform and restricted in generality by the interface provided to the user. The expanding pre…

    Submitted 12 February, 2021; originally announced February 2021.

    Comments: Proceedings of the 33rd Workshop on Languages and Compilers for Parallel Computing (LCPC), October 2020

  3. arXiv:2102.05187  [pdf, other]

    cs.DC cs.PL

    A High-Performance Sparse Tensor Algebra Compiler in Multi-Level IR

    Authors: Ruiqin Tian, Luanzheng Guo, Jiajia Li, Bin Ren, Gokcen Kestor

    Abstract: Tensor algebra is widely used in many applications, such as scientific computing, machine learning, and data analytics. The tensors represented real-world data are usually large and sparse. There are tens of storage formats designed for sparse matrices and/or tensors and the performance of sparse tensor operations depends on a particular architecture and/or selected sparse format, which makes it c…

    Submitted 9 February, 2021; originally announced February 2021.

  4. arXiv:2008.11827  [pdf, other]

    eess.SP cs.LG cs.PF

    Smart-PGSim: Using Neural Network to Accelerate AC-OPF Power Grid Simulation

    Authors: Wenqian Dong, Zhen Xie, Gokcen Kestor, Dong Li

    Abstract: The optimal power flow (OPF) problem is one of the most important optimization problems for the operation of the power grid. It calculates the optimum scheduling of the committed generation units. In this paper, we develop a neural network approach to the problem of accelerating the current optimal power flow (AC-OPF) by generating an intelligent initial solution. The high quality of the initial s…

    Submitted 26 August, 2020; originally announced August 2020.

  5. arXiv:1810.08955  [pdf, other]

    cs.DC cs.LG stat.ML

    Runtime Concurrency Control and Operation Scheduling for High Performance Neural Network Training

    Authors: Jiawen Liu, Dong Li, Gokcen Kestor, Jeffrey Vetter

    Abstract: Training neural network often uses a machine learning framework such as TensorFlow and Caffe2. These frameworks employ a dataflow model where the NN training is modeled as a directed graph composed of a set of nodes. Operations in neural network training are typically implemented by the frameworks as primitives and represented as nodes in the dataflow graph. Training NN models in a dataflow-based…

    Submitted 18 February, 2019; v1 submitted 21 October, 2018; originally announced October 2018.

  6. arXiv:1810.04110  [pdf, other]

    cs.DC

    MPI Windows on Storage for HPC Applications

    Authors: Sergio Rivas-Gomez, Roberto Gioiosa, Ivy Bo Peng, Gokcen Kestor, Sai Narasimhamurthy, Erwin Laure, Stefano Markidis

    Abstract: Upcoming HPC clusters will feature hybrid memories and storage devices per compute node. In this work, we propose to use the MPI one-sided communication model and MPI windows as unique interface for programming memory and storage. We describe the design and implementation of MPI storage windows, and present its benefits for out-of-core execution, parallel I/O and fault-tolerance. In addition, we e…

    Submitted 9 October, 2018; originally announced October 2018.

  7. arXiv:1708.01306  [pdf, other]

    cs.DC

    MPI Streams for HPC Applications

    Authors: Ivy Bo Peng, Stefano Markidis, Roberto Gioiosa, Gokcen Kestor, Erwin Laure

    Abstract: Data streams are a sequence of data flowing between source and destination processes. Streaming is widely used for signal, image and video processing for its efficiency in pipelining and effectiveness in reducing demand for memory. The goal of this work is to extend the use of data streams to support both conventional scientific applications and emerging data analytic applications running on HPC p…

    Submitted 3 August, 2017; originally announced August 2017.

    Comments: Advances in Parallel Computing

  8. arXiv:1708.01304  [pdf, other]

    cs.DC

    Preparing HPC Applications for the Exascale Era: A Decoupling Strategy

    Authors: Ivy Bo Peng, Roberto Gioiosa, Gokcen Kestor, Erwin Laure, Stefano Markidis

    Abstract: Production-quality parallel applications are often a mixture of diverse operations, such as computation- and communication-intensive, regular and irregular, tightly coupled and loosely linked operations. In conventional construction of parallel applications, each process performs all the operations, which might result inefficient and seriously limit scalability, especially at large scale. We propo…

    Submitted 3 August, 2017; originally announced August 2017.

    Comments: The 46th International Conference on Parallel Processing (ICPP-2017)

  9. arXiv:1704.08492  [pdf]

    cs.DC

    Extending Message Passing Interface Windows to Storage

    Authors: Sergio Rivas-Gomez, Stefano Markidis, Ivy Bo Peng, Erwin Laure, Gokcen Kestor, Roberto Gioiosa

    Abstract: This work presents an extension to MPI supporting the one-sided communication model and window allocations in storage. Our design transparently integrates with the current MPI implementations, enabling applications to target MPI windows in storage, memory or both simultaneously, without major modifications. Initial performance results demonstrate that the presented MPI window extension could poten…

    Submitted 27 April, 2017; originally announced April 2017.

  10. Exploring the Performance Benefit of Hybrid Memory System on HPC Environments

    Authors: Ivy Bo Peng, Roberto Gioiosa, Gokcen Kestor, Erwin Laure, Stefano Markidis

    Abstract: Hardware accelerators have become a de-facto standard to achieve high performance on current supercomputers and there are indications that this trend will increase in the future. Modern accelerators feature high-bandwidth memory next to the computing cores. For example, the Intel Knights Landing (KNL) processor is equipped with 16 GB of high-bandwidth memory (HBM) that works together with conventi…

    Submitted 26 April, 2017; originally announced April 2017.

  11. Idle Period Propagation in Message-Passing Applications

    Authors: Ivy Bo Peng, Stefano Markidis, Erwin Laure, Gokcen Kestor, Roberto Gioiosa

    Abstract: Idle periods on different processes of Message Passing applications are unavoidable. While the origin of idle periods on a single process is well understood as the effect of system and architectural random delays, yet it is unclear how these idle periods propagate from one process to another. It is important to understand idle period propagation in Message Passing applications as it allows applica…

    Submitted 26 April, 2017; originally announced April 2017.

    Comments: 18th International Conference on High Performance Computing and Communications, IEEE, 2016

  12. Exploring Application Performance on Emerging Hybrid-Memory Supercomputers

    Authors: Ivy Bo Peng, Stefano Markidis, Erwin Laure, Gokcen Kestor, Roberto Gioiosa

    Abstract: Next-generation supercomputers will feature more hierarchical and heterogeneous memory systems with different memory technologies working side-by-side. A critical question is whether at large scale existing HPC applications and emerging data-analytics workloads will have performance improvement or degradation on these systems. We propose a systematic and fair methodology to identify the trend of a…

    Submitted 26 April, 2017; originally announced April 2017.

    Comments: 18th International Conference on High Performance Computing and Communications, IEEE, 2016