Skip to main content

Showing 1–9 of 9 results for author: Mailthody, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2307.03760  [pdf, other

    cs.DC

    CODAG: Characterizing and Optimizing Decompression Algorithms for GPUs

    Authors: Jeongmin Park, Zaid Qureshi, Vikram Mailthody, Andrew Gacek, Shunfan Shao, Mohammad AlMasri, Isaac Gelado, **jun Xiong, Chris Newburn, I-hsin Chung, Michael Garland, Nikolay Sakharnykh, Wen-mei Hwu

    Abstract: Data compression and decompression have become vital components of big-data applications to manage the exponential growth in the amount of data collected and stored. Furthermore, big-data applications have increasingly adopted GPUs due to their high compute throughput and memory bandwidth. Prior works presume that decompression is memory-bound and have dedicated most of the GPU's threads to data m… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

  2. arXiv:2306.16384  [pdf, other

    cs.DC cs.AI cs.AR cs.LG

    Accelerating Sampling and Aggregation Operations in GNN Frameworks with GPU Initiated Direct Storage Accesses

    Authors: Jeongmin Brian Park, Vikram Sharma Mailthody, Zaid Qureshi, Wen-mei Hwu

    Abstract: Graph Neural Networks (GNNs) are emerging as a powerful tool for learning from graph-structured data and performing sophisticated inference tasks in various application domains. Although GNNs have been shown to be effective on modest-sized graphs, training them on large-scale graphs remains a significant challenge due to lack of efficient data access and data movement methods. Existing frameworks… ▽ More

    Submitted 6 March, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: Under Submission. Source code: https://github.com/jeongminpark417/GIDS

  3. arXiv:2302.13522  [pdf, other

    cs.LG cs.AI cs.DC cs.IR

    IGB: Addressing The Gaps In Labeling, Features, Heterogeneity, and Size of Public Graph Datasets for Deep Learning Research

    Authors: Arpandeep Khatua, Vikram Sharma Mailthody, Bhagyashree Taleka, Tengfei Ma, Xiang Song, Wen-mei Hwu

    Abstract: Graph neural networks (GNNs) have shown high potential for a variety of real-world, challenging applications, but one of the major obstacles in GNN research is the lack of large-scale flexible datasets. Most existing public datasets for GNNs are relatively small, which limits the ability of GNNs to generalize to unseen data. The few existing large-scale graph datasets provide very limited labeled… ▽ More

    Submitted 21 June, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: Accepted in KDD'23 conference. This is final preprint version

    Journal ref: KDD 2023

  4. arXiv:2203.04910  [pdf, other

    cs.DC cs.AR cs.OS cs.PF

    GPU-Initiated On-Demand High-Throughput Storage Access in the BaM System Architecture

    Authors: Zaid Qureshi, Vikram Sharma Mailthody, Isaac Gelado, Seung Won Min, Amna Masood, Jeongmin Park, **jun Xiong, CJ Newburn, Dmitri Vainbrand, I-Hsin Chung, Michael Garland, William Dally, Wen-mei Hwu

    Abstract: Graphics Processing Units (GPUs) have traditionally relied on the host CPU to initiate access to the data storage. This approach is well-suited for GPU applications with known data access patterns that enable partitioning of their dataset to be processed in a pipelined fashion in the GPU. However, emerging applications such as graph and data analytics, recommender systems, or graph neural networks… ▽ More

    Submitted 6 February, 2023; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: This is an extension to the published conference paper at ASPLOS'23: https://dl.acm.org/doi/abs/10.1145/3575693.3575748

    Journal ref: ASPLOS 2023: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2

  5. arXiv:2101.07897  [pdf, other

    cs.CR cs.CY

    Safer Illinois and RokWall: Privacy Preserving University Health Apps for COVID-19

    Authors: Vikram Sharma Mailthody, James Wei, Nicholas Chen, Mohammad Behnia, Ruihao Yao, Qihao Wang, Vedant Agrawal, Churan He, Lijian Wang, Leihao Chen, Amit Agarwal, Edward Richter, Wen-Mei Hwu, Christopher W. Fletcher, **jun Xiong, Andrew Miller, Sanjay Patel

    Abstract: COVID-19 has fundamentally disrupted the way we live. Government bodies, universities, and companies worldwide are rapidly develo** technologies to combat the COVID-19 pandemic and safely reopen society. Essential analytics tools such as contact tracing, super-spreader event detection, and exposure map** require collecting and analyzing sensitive user information. The increasing use of such po… ▽ More

    Submitted 17 March, 2021; v1 submitted 19 January, 2021; originally announced January 2021.

    Comments: Appears in the Workshop on Secure IT Technologies against COVID-19(CoronaDef) 2021

  6. arXiv:2008.10169  [pdf, other

    cs.AR cs.DC cs.PF

    Tearing Down the Memory Wall

    Authors: Zaid Qureshi, Vikram Sharma Mailthody, Seung Won Min, I-Hsin Chung, **jun Xiong, Wen-mei Hwu

    Abstract: We present a vision for the Erudite architecture that redefines the compute and memory abstractions such that memory bandwidth and capacity become first-class citizens along with compute throughput. In this architecture, we envision coupling a high-density, massively parallel memory technology like Flash with programmable near-data accelerators, like the streaming multiprocessors in modern GPUs. E… ▽ More

    Submitted 23 August, 2020; originally announced August 2020.

    Comments: SRC Techcon 2020 paper. Discusses vision of GPU-Centric architecture, Erudite

  7. arXiv:2007.14152  [pdf, other

    cs.DC cs.LG

    At-Scale Sparse Deep Neural Network Inference with Efficient GPU Implementation

    Authors: Mert Hidayetoglu, Carl Pearson, Vikram Sharma Mailthody, Eiman Ebrahimi, **jun Xiong, Rakesh Nagi, Wen-Mei Hwu

    Abstract: This paper presents GPU performance optimization and scaling results for inference models of the Sparse Deep Neural Network Challenge 2020. Demands for network quality have increased rapidly, pushing the size and thus the memory requirements of many neural networks beyond the capacity of available accelerators. Sparse deep neural networks (SpDNN) have shown promise for reining in the memory footpr… ▽ More

    Submitted 2 September, 2020; v1 submitted 28 July, 2020; originally announced July 2020.

    Comments: 7 pages

    Journal ref: High Performance Extreme Computing (2020)

  8. arXiv:2006.10220  [pdf, other

    cs.LG stat.ML

    I-BERT: Inductive Generalization of Transformer to Arbitrary Context Lengths

    Authors: Hyoungwook Nam, Seung Byum Seo, Vikram Sharma Mailthody, Noor Michael, Lan Li

    Abstract: Self-attention has emerged as a vital component of state-of-the-art sequence-to-sequence models for natural language processing in recent years, brought to the forefront by pre-trained bi-directional Transformer models. Its effectiveness is partly due to its non-sequential architecture, which promotes scalability and parallelism but limits the model to inputs of a bounded length. In particular, su… ▽ More

    Submitted 19 June, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: Submitted to NeurIPS2020

  9. arXiv:2006.06890  [pdf, other

    cs.DC cs.DB

    EMOGI: Efficient Memory-access for Out-of-memory Graph-traversal In GPUs

    Authors: Seung Won Min, Vikram Sharma Mailthody, Zaid Qureshi, **jun Xiong, Eiman Ebrahimi, Wen-mei Hwu

    Abstract: Modern analytics and recommendation systems are increasingly based on graph data that capture the relations between entities being analyzed. Practical graphs come in huge sizes, offer massive parallelism, and are stored in sparse-matrix formats such as CSR. To exploit the massive parallelism, developers are increasingly interested in using GPUs for graph traversal. However, due to their sizes, gra… ▽ More

    Submitted 14 January, 2021; v1 submitted 11 June, 2020; originally announced June 2020.