Skip to main content

Showing 1–10 of 10 results for author: Selvitopi, O

.
  1. arXiv:2306.04014  [pdf, other

    cs.DC

    Evaluating the Potential of Disaggregated Memory Systems for HPC applications

    Authors: Nan Ding, Pieter Maris, Hai Ah Nam, Taylor Groves, Muaaz Gul Awan, LeAnn Lindsey, Christopher Daley, Oguz Selvitopi, Leonid Oliker, Nicholas Wright, Samuel Williams

    Abstract: Disaggregated memory is a promising approach that addresses the limitations of traditional memory architectures by enabling memory to be decoupled from compute nodes and shared across a data center. Cloud platforms have deployed such systems to improve overall system memory utilization, but performance can vary across workloads. High-performance computing (HPC) is crucial in scientific and enginee… ▽ More

    Submitted 16 June, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: The submission builds on the following conference paper: N. Ding, S. Williams, H.A. Nam, et al. Methodology for Evaluating the Potential of Disaggregated Memory Systems,2nd International Workshop on RESource DISaggregation in High-Performance Computing (RESDIS), November 18, 2022. It is now submitted to the CCPE journal for review

  2. arXiv:2303.01845  [pdf, other

    cs.DC cs.PF q-bio.GN

    Extreme-scale many-against-many protein similarity search

    Authors: Oguz Selvitopi, Saliya Ekanayake, Giulia Guidi, Muaaz G. Awan, Georgios A. Pavlopoulos, Ariful Azad, Nikos Kyrpides, Leonid Oliker, Katherine Yelick, Aydın Buluç

    Abstract: Similarity search is one of the most fundamental computations that are regularly performed on ever-increasing protein datasets. Scalability is of paramount importance for uncovering novel phenomena that occur at very large scales. We unleash the power of over 20,000 GPUs on the Summit system to perform all-vs-all protein similarity search on one of the largest publicly available datasets with 405… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: 2022 ACM Gordon Bell Prize Finalist

    Journal ref: SC'22: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, 2022

  3. arXiv:2111.09947  [pdf, other

    cs.DC cs.DS cs.MS

    Parallel Algorithms for Masked Sparse Matrix-Matrix Products

    Authors: Srđan Milaković, Oguz Selvitopi, Israt Nisa, Zoran Budimlić, Aydin Buluc

    Abstract: Computing the product of two sparse matrices (SpGEMM) is a fundamental operation in various combinatorial and graph algorithms as well as various bioinformatics and data analytics applications for computing inner-product similarities. For an important class of algorithms, only a subset of the output entries are needed, and the resulting operation is known as Masked SpGEMM since a subset of the out… ▽ More

    Submitted 18 November, 2021; originally announced November 2021.

  4. arXiv:2106.14402  [pdf, other

    cs.DC cs.DM cs.PF math.CO

    Combinatorial BLAS 2.0: Scaling combinatorial algorithms on distributed-memory systems

    Authors: Ariful Azad, Oguz Selvitopi, Md Taufique Hussain, John R. Gilbert, Aydin Buluc

    Abstract: Combinatorial algorithms such as those that arise in graph analysis, modeling of discrete systems, bioinformatics, and chemistry, are often hard to parallelize. The Combinatorial BLAS library implements key computational primitives for rapid development of combinatorial algorithms in distributed-memory systems. During the decade since its first introduction, the Combinatorial BLAS library has evol… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

    Comments: To appear in IEEE Transactions on Parallel and Distributed Systems

  5. arXiv:2010.10055  [pdf, other

    cs.DC q-bio.GN

    Parallel String Graph Construction and Transitive Reduction for De Novo Genome Assembly

    Authors: Giulia Guidi, Oguz Selvitopi, Marquita Ellis, Leonid Oliker, Katherine Yelick, Aydin Buluc

    Abstract: One of the most computationally intensive tasks in computational biology is de novo genome assembly, the decoding of the sequence of an unknown genome from redundant and erroneous short sequences. A common assembly paradigm identifies overlap** sequences, simplifies their layout, and creates consensus. Despite many algorithms developed in the literature, the efficient assembly of large genomes i… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

  6. arXiv:2010.08526  [pdf, other

    cs.DC

    Communication-Avoiding and Memory-Constrained Sparse Matrix-Matrix Multiplication at Extreme Scale

    Authors: Md Taufique Hussain, Oguz Selvitopi, Aydin Buluç, Ariful Azad

    Abstract: Sparse matrix-matrix multiplication (SpGEMM) is a widely used kernel in various graph, scientific computing and machine learning algorithms. In this paper, we consider SpGEMMs performed on hundreds of thousands of processors generating trillions of nonzeros in the output matrix. Distributed SpGEMM at this extreme scale faces two key challenges: (1) high communication cost and (2) inadequate memory… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: 14 pages, 15 figures

  7. arXiv:2009.14467  [pdf, other

    cs.DC q-bio.GN

    Distributed Many-to-Many Protein Sequence Alignment using Sparse Matrices

    Authors: Oguz Selvitopi, Saliya Ekanayake, Giulia Guidi, Georgios Pavlopoulos, Ariful Azad, Aydin Buluc

    Abstract: Identifying similar protein sequences is a core step in many computational biology pipelines such as detection of homologous protein sequences, generation of similarity protein graphs for downstream analysis, functional annotation and gene location. Performance and scalability of protein similarity searches have proven to be a bottleneck in many bioinformatics pipelines due to increases in cheap a… ▽ More

    Submitted 30 September, 2020; originally announced September 2020.

    Comments: To appear in International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'20)

  8. arXiv:2002.10083  [pdf, other

    cs.DC

    Optimizing High Performance Markov Clustering for Pre-Exascale Architectures

    Authors: Oguz Selvitopi, Md Taufique Hussain, Ariful Azad, Aydın Buluç

    Abstract: HipMCL is a high-performance distributed memory implementation of the popular Markov Cluster Algorithm (MCL) and can cluster large-scale networks within hours using a few thousand CPU-equipped nodes. It relies on sparse matrix computations and heavily makes use of the sparse matrix-sparse matrix multiplication kernel (SpGEMM). The existing parallel algorithms in HipMCL are not scalable to Exascale… ▽ More

    Submitted 24 February, 2020; originally announced February 2020.

    Journal ref: 34th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

  9. The Parallelism Motifs of Genomic Data Analysis

    Authors: Katherine Yelick, Aydin Buluc, Muaaz Awan, Ariful Azad, Benjamin Brock, Rob Egan, Saliya Ekanayake, Marquita Ellis, Evangelos Georganas, Giulia Guidi, Steven Hofmeyr, Oguz Selvitopi, Cristina Teodoropol, Leonid Oliker

    Abstract: Genomic data sets are growing dramatically as the cost of sequencing continues to decline and small sequencing devices become available. Enormous community databases store and share this data with the research community, but some of these genomic data analysis problems require large scale computational platforms to meet both the memory and computational requirements. These applications differ from… ▽ More

    Submitted 20 January, 2020; originally announced January 2020.

  10. A High-Throughput Solver for Marginalized Graph Kernels on GPU

    Authors: Yu-Hang Tang, Oguz Selvitopi, Doru Popovici, Aydın Buluç

    Abstract: We present the design and optimization of a linear solver on General Purpose GPUs for the efficient and high-throughput evaluation of the marginalized graph kernel between pairs of labeled graphs. The solver implements a preconditioned conjugate gradient (PCG) method to compute the solution to a generalized Laplacian equation associated with the tensor product of two graphs. To cope with the gap b… ▽ More

    Submitted 25 February, 2020; v1 submitted 14 October, 2019; originally announced October 2019.