
Showing 1–8 of 8 results for author: Malony, A D

  1. arXiv:2406.19058  [pdf, other]

    physics.comp-ph cs.DC cs.PF physics.plasm-ph

    Understanding the Impact of openPMD on BIT1, a Particle-in-Cell Monte Carlo Code, through Instrumentation, Monitoring, and In-Situ Analysis

    Authors: Jeremy J. Williams, Stefan Costea, Allen D. Malony, David Tskhakaya, Leon Kos, Ales Podolnik, Jakub Hromadka, Kevin Huck, Erwin Laure, Stefano Markidis

    Abstract: Particle-in-Cell Monte Carlo simulations on large-scale systems play a fundamental role in understanding the complexities of plasma dynamics in fusion devices. Efficient handling and analysis of vast datasets are essential for advancing these simulations. Previously, we addressed this challenge by integrating openPMD with BIT1, a Particle-in-Cell Monte Carlo code, streamlining data streaming and s…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Accepted by the Euro-Par 2024 workshops (PHYSHPC 2024). Prepared in the standardized Springer LNCS format; 12 pages, including the main text, references, and figures

  2. arXiv:2304.11205  [pdf, ps, other]

    cs.DC

    STaKTAU: profiling HPC applications' operating system usage

    Authors: Camille Coti, Kevin Huck, Allen D. Malony

    Abstract: This paper presents an approach for measuring the time spent by HPC applications in the operating system's kernel. We use the SystemTap interface to insert timers before and after system calls, and take advantage of its stability to design a tool that can be used with multiple versions of the kernel. We evaluate its performance overhead, using an OS-intensive mini-benchmark and a raytracing mini ap…

    Submitted 21 April, 2023; originally announced April 2023.

  3. HPC Storage Service Autotuning Using Variational-Autoencoder-Guided Asynchronous Bayesian Optimization

    Authors: Matthieu Dorier, Romain Egele, Prasanna Balaprakash, Jaehoon Koo, Sandeep Madireddy, Srinivasan Ramesh, Allen D. Malony, Rob Ross

    Abstract: Distributed data storage services tailored to specific applications have grown popular in the high-performance computing (HPC) community as a way to address I/O and storage challenges. These services offer a variety of specific interfaces, semantics, and data representations. They also expose many tuning parameters, making it difficult for their users to find the best configuration for a given wor…

    Submitted 3 October, 2022; originally announced October 2022.

    Comments: Accepted at IEEE Cluster 2022

  4. arXiv:2202.08948  [pdf, other]

    cs.DC

    SKaMPI-OpenSHMEM: Measuring OpenSHMEM Communication Routines

    Authors: Camille Coti, Allen D. Malony

    Abstract: Benchmarking is an important challenge in HPC, in particular, to be able to tune the basic blocks of the software environment used by applications. The communication library and distributed run-time environment are among the most critical ones. In particular, many of the routines provided by communication libraries can be adjusted using parameters such as buffer sizes and communication algorithm.…

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: 17 pages, OpenSHMEM workshop 2021

  5. arXiv:2105.13395  [pdf, ps, other]

    cs.DC

    Measuring OpenSHMEM Communication Routines with SKaMPI-OpenSHMEM User's manual

    Authors: Camille Coti, Allen D. Malony

    Abstract: This document presents the OpenSHMEM extension for the Special Karlsruhe MPI benchmark and the measurement algorithms used to measure the routines.

    Submitted 27 May, 2021; originally announced May 2021.

    Comments: This paper is a technical report that comes with our benchmarking software. It implements distributed algorithms for the measurement of distributed operations

  6. arXiv:2003.01081  [pdf, other]

    cs.DC

    On-the-fly Optimization of Parallel Computation of Symbolic Symplectic Invariants

    Authors: Joseph Ben Geloun, Camille Coti, Allen D. Malony

    Abstract: Group invariants are used in high energy physics to define quantum field theory interactions. In this paper, we present the parallel algebraic computation of special invariants called symplectic invariants, focusing on one particular invariant that has attracted recent interest in physics. Our results will extend to other invariants. The cost of performing basic computations on the multivariate polyno…

    Submitted 2 March, 2020; originally announced March 2020.

  7. Checkpoint/restart approaches for a thread-based MPI runtime

    Authors: Julien Adam, Maxime Kermarquer, Jean-Baptiste Besnard, Leonardo Bautista-Gomez, Marc Perache, Patrick Carribault, Julien Jaeger, Allen D. Malony, Sameer Shende

    Abstract: Fault-tolerance has always been an important topic when it comes to running massively parallel programs at scale. Statistically, hardware and software failures are expected to occur more often on systems gathering millions of computing units. Moreover, the larger jobs are, the more computing hours would be wasted by a crash. In this paper, we describe the work done in our MPI runtime to enable bot…

    Submitted 12 June, 2019; originally announced June 2019.

    Comments: This research has been partially sponsored by the European Union's Horizon 2020 Programme under the LEGaTO Project (www.legato-project.eu), grant agreement 780681 and the Mont-Blanc 2020 project, grant agreement no. 779877

  8. arXiv:1701.08547  [pdf, other]

    cs.DC cs.PF

    Autotuning GPU Kernels via Static and Predictive Analysis

    Authors: Robert V. Lim, Boyana Norris, Allen D. Malony

    Abstract: Optimizing the performance of GPU kernels is challenging for both human programmers and code generators. For example, CUDA programmers must set thread and block parameters for a kernel, but might not have the intuition to make a good choice. Similarly, compilers can generate working code, but may miss tuning opportunities by not targeting GPU models or performing code transformations. Although emp…

    Submitted 29 June, 2017; v1 submitted 30 January, 2017; originally announced January 2017.