Skip to main content

Showing 1–5 of 5 results for author: Nolet, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2308.15136  [pdf, other

    cs.DS cs.CV cs.DB cs.DC cs.IR

    CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUs

    Authors: Hiroyuki Ootomo, Akira Naruse, Corey Nolet, Ray Wang, Tamas Feher, Yong Wang

    Abstract: Approximate Nearest Neighbor Search (ANNS) plays a critical role in various disciplines spanning data mining and artificial intelligence, from information retrieval and computer vision to natural language processing and recommender systems. Data volumes have soared in recent years and the computational cost of an exhaustive exact nearest neighbor search is often prohibitive, necessitating the adop… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  2. arXiv:2306.16354  [pdf, ps, other

    cs.LG stat.ML

    cuSLINK: Single-linkage Agglomerative Clustering on the GPU

    Authors: Corey J. Nolet, Divye Gala, Alex Fender, Mahesh Doijade, Joe Eaton, Edward Raff, John Zedlewski, Brad Rees, Tim Oates

    Abstract: In this paper, we propose cuSLINK, a novel and state-of-the-art reformulation of the SLINK algorithm on the GPU which requires only $O(Nk)$ space and uses a parameter $k$ to trade off space and time. We also propose a set of novel and reusable building blocks that compose cuSLINK. These building blocks include highly optimized computational patterns for $k$-NN graph construction, spanning trees, a… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: To appear in ECML PKDD 2023 by Springer Nature

  3. arXiv:2104.06357  [pdf, ps, other

    cs.LG cs.DC math.RA

    GPU Semiring Primitives for Sparse Neighborhood Methods

    Authors: Corey J. Nolet, Divye Gala, Edward Raff, Joe Eaton, Brad Rees, John Zedlewski, Tim Oates

    Abstract: High-performance primitives for mathematical operations on sparse vectors must deal with the challenges of skewed degree distributions and limits on memory consumption that are typically not issues in dense operations. We demonstrate that a sparse semiring primitive can be flexible enough to support a wide range of critical distance measures while maintaining performance and memory efficiency on t… ▽ More

    Submitted 4 March, 2022; v1 submitted 13 April, 2021; originally announced April 2021.

  4. arXiv:2008.00325  [pdf, other

    cs.LG cs.DS stat.ML

    Bringing UMAP Closer to the Speed of Light with GPU Acceleration

    Authors: Corey J. Nolet, Victor Lafargue, Edward Raff, Thejaswi Nanditale, Tim Oates, John Zedlewski, Joshua Patterson

    Abstract: The Uniform Manifold Approximation and Projection (UMAP) algorithm has become widely popular for its ease of use, quality of results, and support for exploratory, unsupervised, supervised, and semi-supervised learning. While many algorithms can be ported to a GPU in a simple and direct fashion, such efforts have resulted in inefficient and inaccurate versions of UMAP. We show a number of technique… ▽ More

    Submitted 29 March, 2021; v1 submitted 1 August, 2020; originally announced August 2020.

  5. arXiv:2002.04803  [pdf, other

    cs.LG stat.ML

    Machine Learning in Python: Main developments and technology trends in data science, machine learning, and artificial intelligence

    Authors: Sebastian Raschka, Joshua Patterson, Corey Nolet

    Abstract: Smarter applications are making better use of the insights gleaned from data, having an impact on every industry and research discipline. At the core of this revolution lies the tools and the methods that are driving it, from processing the massive piles of data generated each day to learning from and taking useful action. Deep neural networks, along with advancements in classical ML and scalable… ▽ More

    Submitted 31 March, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: Preprint of a manuscript accepted for publication in "Machine Learning with Python," a special issue of Information (ISSN 2078-2489)