
Showing 1–9 of 9 results for author: Mhembere, D

  1. arXiv:1907.03335  [pdf, other]

    cs.DC cs.DB

    Graphyti: A Semi-External Memory Graph Library for FlashGraph

    Authors: Disa Mhembere, Da Zheng, Carey E. Priebe, Joshua T. Vogelstein, Randal Burns

    Abstract: Graph datasets exceed the in-memory capacity of most standalone machines. Traditionally, graph frameworks have overcome memory limitations through scale-out, distributing computing. Emerging frameworks avoid the network bottleneck of distributed data with Semi-External Memory (SEM) that uses a single multicore node and operates on graphs larger than memory. In SEM, $\mathcal{O}(m)$ data resides on…

    Submitted 7 July, 2019; originally announced July 2019.

  2. arXiv:1902.09527  [pdf, other]

    cs.DC

    clusterNOR: A NUMA-Optimized Clustering Framework

    Authors: Disa Mhembere, Da Zheng, Carey E. Priebe, Joshua T. Vogelstein, Randal Burns

    Abstract: Clustering algorithms are iterative and have complex data access patterns that result in many small random memory accesses. The performance of parallel implementations suffers from synchronous barriers for each iteration and skewed workloads. We rethink the parallelization of clustering for modern non-uniform memory architectures (NUMA) to maximize independent, asynchronous computation. We elimina…

    Submitted 17 January, 2021; v1 submitted 24 February, 2019; originally announced February 2019.

    Comments: arXiv admin note: Journal version of arXiv:1606.08905

  3. arXiv:1806.07300  [pdf, other]

    cs.PF cs.DC

    Forest Packing: Fast, Parallel Decision Forests

    Authors: James Browne, Tyler M. Tomita, Disa Mhembere, Randal Burns, Joshua T. Vogelstein

    Abstract: Machine learning has an emerging critical role in high-performance computing to modulate simulations, extract knowledge from massive data, and replace numerical models with efficient approximations. Decision forests are a critical tool because they provide insight into model operation that is critical to interpreting learned results. While decision forests are trivially parallelizable, the travers…

    Submitted 19 June, 2018; originally announced June 2018.

  4. arXiv:1606.08905  [pdf, other]

    cs.DC

    knor: A NUMA-Optimized In-Memory, Distributed and Semi-External-Memory k-means Library

    Authors: Disa Mhembere, Da Zheng, Carey E. Priebe, Joshua T. Vogelstein, Randal Burns

    Abstract: k-means is one of the most influential and utilized machine learning algorithms. Its computation limits the performance and scalability of many statistical analysis and machine learning tasks. We rethink and optimize k-means in terms of modern NUMA architectures to develop a novel parallelization scheme that delays and minimizes synchronization barriers. The \textit{k-means NUMA Optimized Routine}…

    Submitted 24 June, 2017; v1 submitted 28 June, 2016; originally announced June 2016.

  5. arXiv:1604.06414  [pdf, other]

    cs.DC

    FlashR: R-Programmed Parallel and Scalable Machine Learning using SSDs

    Authors: Da Zheng, Disa Mhembere, Joshua T. Vogelstein, Carey E. Priebe, Randal Burns

    Abstract: R is one of the most popular programming languages for statistics and machine learning, but the R framework is relatively slow and unable to scale to large datasets. The general approach for speeding up an implementation in R is to implement the algorithms in C or FORTRAN and provide an R wrapper. FlashR takes a different approach: it executes R code in parallel and scales the code beyond memory c…

    Submitted 18 May, 2017; v1 submitted 21 April, 2016; originally announced April 2016.

  6. Semi-External Memory Sparse Matrix Multiplication for Billion-Node Graphs

    Authors: Da Zheng, Disa Mhembere, Vince Lyzinski, Joshua Vogelstein, Carey E. Priebe, Randal Burns

    Abstract: Sparse matrix multiplication is traditionally performed in memory and scales to large matrices using the distributed memory of multiple nodes. In contrast, we scale sparse matrix multiplication beyond memory capacity by implementing sparse matrix dense matrix multiplication (SpMM) in a semi-external memory (SEM) fashion; i.e., we keep the sparse matrix on commodity SSDs and dense matrices in memor…

    Submitted 14 October, 2016; v1 submitted 9 February, 2016; originally announced February 2016.

    Comments: published in IEEE Transactions on Parallel and Distributed Systems

  7. arXiv:1408.0500  [pdf, other]

    cs.DC

    FlashGraph: Processing Billion-Node Graphs on an Array of Commodity SSDs

    Authors: Da Zheng, Disa Mhembere, Randal Burns, Joshua Vogelstein, Carey E. Priebe, Alexander S. Szalay

    Abstract: Graph analysis performs many random reads and writes; thus, these workloads are typically performed in memory. Traditionally, analyzing large graphs requires a cluster of machines so the aggregate memory exceeds the graph size. We demonstrate that a multicore server can process graphs with billions of vertices and hundreds of billions of edges, utilizing commodity SSDs with minimal performance los…

    Submitted 25 January, 2015; v1 submitted 3 August, 2014; originally announced August 2014.

    Comments: published in FAST'15

  8. MIGRAINE: MRI Graph Reliability Analysis and Inference for Connectomics

    Authors: William Gray Roncal, Zachary H. Koterba, Disa Mhembere, Dean M. Kleissas, Joshua T. Vogelstein, Randal Burns, Anita R. Bowles, Dimitrios K. Donavos, Sephira Ryman, Rex E. Jung, Lei Wu, Vince Calhoun, R. Jacob Vogelstein

    Abstract: Currently, connectomes (e.g., functional or structural brain graphs) can be estimated in humans at $\approx 1~\mathrm{mm}^3$ scale using a combination of diffusion weighted magnetic resonance imaging, functional magnetic resonance imaging and structural magnetic resonance imaging scans. This manuscript summarizes a novel, scalable implementation of open-source algorithms to rapidly estimate magnetic resona…

    Submitted 17 December, 2013; originally announced December 2013.

    Comments: Published as part of 2013 IEEE GlobalSIP conference

  9. Computing Scalable Multivariate Glocal Invariants of Large (Brain-) Graphs

    Authors: Disa Mhembere, William Gray Roncal, Daniel Sussman, Carey E. Priebe, Rex Jung, Sephira Ryman, R. Jacob Vogelstein, Joshua T. Vogelstein, Randal Burns

    Abstract: Graphs are quickly emerging as a leading abstraction for the representation of data. One important application domain originates from an emerging discipline called "connectomics". Connectomics studies the brain as a graph; vertices correspond to neurons (or collections thereof) and edges correspond to structural or functional connections between them. To explore the variability of connectomes---to…

    Submitted 16 December, 2013; originally announced December 2013.

    Comments: Published as part of 2013 IEEE GlobalSIP conference