Skip to main content

Showing 1–12 of 12 results for author: Priest, B W

.
  1. An Analysis of the Johnson-Lindenstrauss Lemma with the Bivariate Gamma Distribution

    Authors: Jason Bernstein, Alec M. Dunton, Benjamin W. Priest

    Abstract: Probabilistic proofs of the Johnson-Lindenstrauss lemma imply that random projection can reduce the dimension of a data set and approximately preserve pairwise distances. If a distance being approximately preserved is called a success, and the complement of this event is called a failure, then such a random projection likely results in no failures. Assuming a Gaussian random projection, the lemma… ▽ More

    Submitted 12 July, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 20 pages, 5 figures. This revision improves figure 2 and format of citations

    Report number: LLNL-TR-844277

  2. arXiv:2209.11280  [pdf, other

    cs.LG math.NA stat.ML

    Scalable Gaussian Process Hyperparameter Optimization via Coverage Regularization

    Authors: Killian Wood, Alec M. Dunton, Amanda Muyskens, Benjamin W. Priest

    Abstract: Gaussian processes (GPs) are Bayesian non-parametric models popular in a variety of applications due to their accuracy and native uncertainty quantification (UQ). Tuning GP hyperparameters is critical to ensure the validity of prediction accuracy and uncertainty; uniquely estimating multiple hyperparameters in, e.g. the Matern kernel can also be a significant challenge. Moreover, training GPs on l… ▽ More

    Submitted 2 November, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: 4 pages content, 3 figures, 6 tables

    MSC Class: 60G15 ACM Class: G.3

  3. arXiv:2208.14592  [pdf, other

    astro-ph.IM stat.ML

    Light curve completion and forecasting using fast and scalable Gaussian processes (MuyGPs)

    Authors: Imène R. Goumiri, Alec M. Dunton, Amanda L. Muyskens, Benjamin W. Priest, Robert E. Armstrong

    Abstract: Temporal variations of apparent magnitude, called light curves, are observational statistics of interest captured by telescopes over long periods of time. Light curves afford the exploration of Space Domain Awareness (SDA) objectives such as object identification or pose estimation as latent variable inference problems. Ground-based observations from commercial off the shelf (COTS) cameras remain… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: 14 pages, 7 figures, accepted to AMOS 2022 conference

  4. arXiv:2205.10879  [pdf, other

    cs.LG math.NA stat.ML

    Fast Gaussian Process Posterior Mean Prediction via Local Cross Validation and Precomputation

    Authors: Alec M. Dunton, Benjamin W. Priest, Amanda Muyskens

    Abstract: Gaussian processes (GPs) are Bayesian non-parametric models useful in a myriad of applications. Despite their popularity, the cost of GP predictions (quadratic storage and cubic complexity with respect to the number of training points) remains a hurdle in applying GPs to large data. We present a fast posterior mean prediction algorithm called FastMuyGPs to address this shortcoming. FastMuyGPs is b… ▽ More

    Submitted 22 May, 2022; originally announced May 2022.

    Comments: 9 pages content, 4 figures, 3 tables

    MSC Class: 60G15 ACM Class: G.3

  5. arXiv:2107.12330  [pdf, other

    cs.DC

    TriPoll: Computing Surveys of Triangles in Massive-Scale Temporal Graphs with Metadata

    Authors: Trevor Steil, Tahsin Reza, Keita Iwabuchi, Benjamin W. Priest, Geoffrey Sanders, Roger Pearce

    Abstract: Understanding the higher-order interactions within network data is a key objective of network science. Surveys of metadata triangles (or patterned 3-cycles in metadata-enriched graphs) are often of interest in this pursuit. In this work, we develop TriPoll, a prototype distributed HPC system capable of surveying triangles in massive graphs containing metadata on their edges and vertices. We contra… ▽ More

    Submitted 26 July, 2021; originally announced July 2021.

    Comments: 13 pages, 8 figures

    Report number: LLNL-CONF-822890

  6. Gaussian Process Classification for Galaxy Blend Identification in LSST

    Authors: James J. Buchanan, Michael D. Schneider, Robert E. Armstrong, Amanda L. Muyskens, Benjamin W. Priest, Ryan J. Dana

    Abstract: A significant fraction of observed galaxies in the Rubin Observatory Legacy Survey of Space and Time (LSST) will overlap at least one other galaxy along the same line of sight, in a so-called "blend." The current standard method of assessing blend likelihood in LSST images relies on counting up the number of intensity peaks in the smoothed image of a blend candidate, but the reliability of this pr… ▽ More

    Submitted 10 December, 2021; v1 submitted 19 July, 2021; originally announced July 2021.

    Comments: 20 pages, 6 figures, version accepted by ApJ

  7. Star-Galaxy Image Separation with Computationally Efficient Gaussian Process Classification

    Authors: Amanda L. Muyskens, Imène R. Goumiri, Benjamin W. Priest, Michael D. Schneider, Robert E. Armstrong, Jason M. Bernstein, Ryan Dana

    Abstract: We introduce a novel method for discerning optical telescope images of stars from those of galaxies using Gaussian processes (GPs). Although applications of GPs often struggle in high-dimensional data modalities such as optical image classification, we show that a low-dimensional embedding of images into a metric space defined by the principal components of the data suffices to produce high-qualit… ▽ More

    Submitted 3 May, 2021; originally announced May 2021.

    Comments: 18 pages, 8 figures

  8. arXiv:2010.06094  [pdf, other

    astro-ph.IM

    Star-Galaxy Separation via Gaussian Processes with Model Reduction

    Authors: Imène R. Goumiri, Amanda L. Muyskens, Michael D. Schneider, Benjamin W. Priest, Robert E. Armstrong

    Abstract: Modern cosmological surveys such as the Hyper Suprime-Cam (HSC) survey produce a huge volume of low-resolution images of both distant galaxies and dim stars in our own galaxy. Being able to automatically classify these images is a long-standing problem in astronomy and critical to a number of different scientific analyses. Recently, the challenge of "star-galaxy" classification has been approached… ▽ More

    Submitted 12 October, 2020; originally announced October 2020.

  9. arXiv:2007.12669  [pdf, other

    cs.LG stat.ML

    Scaling Graph Clustering with Distributed Sketches

    Authors: Benjamin W. Priest, Alec Dunton, Geoffrey Sanders

    Abstract: The unsupervised learning of community structure, in particular the partitioning vertices into clusters or communities, is a canonical and well-studied problem in exploratory graph analysis. However, like most graph analyses the introduction of immense scale presents challenges to traditional methods. Spectral clustering in distributed memory, for example, requires hundreds of expensive bulk-synch… ▽ More

    Submitted 24 July, 2020; originally announced July 2020.

    Comments: 9 pages, submitted to IEEE HPEC Graph Challenge 2020, comments welcome

    Report number: LLNL-CONF-812693

  10. arXiv:2004.11280  [pdf, other

    quant-ph

    Quantum Machine Learning using Gaussian Processes with Performant Quantum Kernels

    Authors: Matthew Otten, Imène R. Goumiri, Benjamin W. Priest, George F. Chapline, Michael D. Schneider

    Abstract: Quantum computers have the opportunity to be transformative for a variety of computational tasks. Recently, there have been proposals to use the unsimulatably of large quantum devices to perform regression, classification, and other machine learning tasks with quantum advantage by using kernel methods. While unsimulatably is a necessary condition for quantum advantage in machine learning, it is no… ▽ More

    Submitted 23 April, 2020; originally announced April 2020.

  11. arXiv:2004.05198  [pdf, other

    cs.LG eess.SY stat.ML

    Reinforcement Learning via Gaussian Processes with Neural Network Dual Kernels

    Authors: Imène R. Goumiri, Benjamin W. Priest, Michael D. Schneider

    Abstract: While deep neural networks (DNNs) and Gaussian Processes (GPs) are both popularly utilized to solve problems in reinforcement learning, both approaches feature undesirable drawbacks for challenging problems. DNNs learn complex nonlinear embeddings, but do not naturally quantify uncertainty and are often data-inefficient to train. GPs infer posterior distributions over functions, but popular kernel… ▽ More

    Submitted 10 April, 2020; originally announced April 2020.

    Comments: 22 pages, 5 figures

    Report number: LLNL-JRNL-808440

  12. arXiv:2004.04289  [pdf, other

    cs.DC

    DegreeSketch: Distributed Cardinality Sketches on Massive Graphs with Applications

    Authors: Benjamin W. Priest

    Abstract: We present DegreeSketch, a semi-streaming distributed sketch data structure and demonstrate its utility for estimating local neighborhood sizes and local triangle count heavy hitters on massive graphs. DegreeSketch consists of vertex-centric cardinality sketches distributed across a set of processors that are accumulated in a single pass, and then behaves as a persistent query engine capable of ap… ▽ More

    Submitted 8 April, 2020; originally announced April 2020.

    Comments: 22 pages, 8 figures, submitted to VLDB 2020, comments welcome

    Report number: LLNL-CONF-806542