Skip to main content

Showing 1–5 of 5 results for author: Md, V

.
  1. arXiv:2104.06700  [pdf, other

    cs.LG cs.DC

    DistGNN: Scalable Distributed Training for Large-Scale Graph Neural Networks

    Authors: Vasimuddin Md, Sanchit Misra, Guixiang Ma, Ramanarayan Mohanty, Evangelos Georganas, Alexander Heinecke, Dhiraj Kalamkar, Nesreen K. Ahmed, Sasikanth Avancha

    Abstract: Full-batch training on Graph Neural Networks (GNN) to learn the structure of large graphs is a critical problem that needs to scale to hundreds of compute nodes to be feasible. It is challenging due to large memory capacity and bandwidth requirements on a single compute node and high communication volumes across multiple nodes. In this paper, we present DistGNN that optimizes the well-known Deep G… ▽ More

    Submitted 16 April, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

  2. Tensor Processing Primitives: A Programming Abstraction for Efficiency and Portability in Deep Learning & HPC Workloads

    Authors: Evangelos Georganas, Dhiraj Kalamkar, Sasikanth Avancha, Menachem Adelman, Deepti Aggarwal, Cristina Anderson, Alexander Breuer, Jeremy Bruestle, Narendra Chaudhary, Abhisek Kundu, Denise Kutnick, Frank Laub, Vasimuddin Md, Sanchit Misra, Ramanarayan Mohanty, Hans Pabst, Brian Retford, Barukh Ziv, Alexander Heinecke

    Abstract: During the past decade, novel Deep Learning (DL) algorithms, workloads and hardware have been developed to tackle a wide range of problems. Despite the advances in workload and hardware ecosystems, the programming methodology of DL systems is stagnant. DL workloads leverage either highly-optimized, yet platform-specific and inflexible kernels from DL libraries, or in the case of novel operators, r… ▽ More

    Submitted 30 November, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

  3. arXiv:2007.06354  [pdf, other

    cs.DC cs.LG

    Deep Graph Library Optimizations for Intel(R) x86 Architecture

    Authors: Sasikanth Avancha, Vasimuddin Md, Sanchit Misra, Ramanarayan Mohanty

    Abstract: The Deep Graph Library (DGL) was designed as a tool to enable structure learning from graphs, by supporting a core abstraction for graphs, including the popular Graph Neural Networks (GNN). DGL contains implementations of all core graph operations for both the CPU and GPU. In this paper, we focus specifically on CPU implementations and present performance analysis, optimizations and results across… ▽ More

    Submitted 13 July, 2020; originally announced July 2020.

  4. arXiv:1910.04728  [pdf, other

    cs.DB cs.DS cs.LG

    LISA: Towards Learned DNA Sequence Search

    Authors: Darryl Ho, Jialin Ding, Sanchit Misra, Nesime Tatbul, Vikram Nathan, Vasimuddin Md, Tim Kraska

    Abstract: Next-generation sequencing (NGS) technologies have enabled affordable sequencing of billions of short DNA fragments at high throughput, paving the way for population-scale genomics. Genomics data analytics at this scale requires overcoming performance bottlenecks, such as searching for short DNA sequences over long reference sequences. In this paper, we introduce LISA (Learned Indexes for Sequence… ▽ More

    Submitted 10 October, 2019; originally announced October 2019.

  5. arXiv:1907.12931  [pdf, other

    cs.DC cs.CE cs.PF q-bio.GN

    Efficient Architecture-Aware Acceleration of BWA-MEM for Multicore Systems

    Authors: Vasimuddin Md, Sanchit Misra, Heng Li, Srinivas Aluru

    Abstract: Innovations in Next-Generation Sequencing are enabling generation of DNA sequence data at ever faster rates and at very low cost. Large sequencing centers typically employ hundreds of such systems. Such high-throughput and low-cost generation of data underscores the need for commensurate acceleration in downstream computational analysis of the sequencing data. A fundamental step in downstream anal… ▽ More

    Submitted 27 July, 2019; originally announced July 2019.