Skip to main content

Showing 1–10 of 10 results for author: Doerfert, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.08843  [pdf, other

    cs.SE cs.PF cs.PL

    Input-Gen: Guided Generation of Stateful Inputs for Testing, Tuning, and Training

    Authors: Ivan R. Ivanov, Joachim Meyer, Aiden Grossman, William S. Moses, Johannes Doerfert

    Abstract: The size and complexity of software applications is increasing at an accelerating pace. Source code repositories (along with their dependencies) require vast amounts of labor to keep them tested, maintained, and up to date. As the discipline now begins to also incorporate automatically generated programs, automation in testing and tuning is required to keep up with the pace - let alone reduce the… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2403.12345  [pdf, other

    cs.DC

    Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs

    Authors: John Tramm, Paul Romano, Patrick Shriwise, Amanda Lund, Johannes Doerfert, Patrick Steinbrecher, Andrew Siegel, Gavin Ridley

    Abstract: OpenMC is an open source Monte Carlo neutral particle transport application that has recently been ported to GPU using the OpenMP target offloading model. We examine the performance of OpenMC at scale on the Frontier, Polaris, and Aurora supercomputers, demonstrating that performance portability has been achieved by OpenMC across all three major GPU vendors (AMD, NVIDIA, and Intel). OpenMC's GPU p… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  3. arXiv:2311.03210  [pdf, other

    cs.DC

    Quantum Task Offloading with the OpenMP API

    Authors: Joseph K. L. Lee, Oliver T. Brown, Mark Bull, Martin Ruefenacht, Johannes Doerfert, Michael Klemm, Martin Schulz

    Abstract: Most of the widely used quantum programming languages and libraries are not designed for the tightly coupled nature of hybrid quantum-classical algorithms, which run on quantum resources that are integrated on-premise with classical HPC infrastructure. We propose a programming model using the API provided by OpenMP to target quantum devices, which provides an easy-to-use and efficient interface fo… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Poster extended abstract for Supercomputing 2023 (SC23)

  4. arXiv:2309.15432  [pdf, other

    cs.PL

    ComPile: A Large IR Dataset from Production Sources

    Authors: Aiden Grossman, Ludger Paehler, Konstantinos Parasyris, Tal Ben-Nun, Jacob Hegna, William Moses, Jose M Monsalve Diaz, Mircea Trofin, Johannes Doerfert

    Abstract: Code is increasingly becoming a core data modality of modern machine learning research impacting not only the way we write code with conversational agents like OpenAI's ChatGPT, Google's Bard, or Anthropic's Claude, the way we translate code from one language into another, but also the compiler infrastructure underlying the language. While modeling approaches may vary and representations differ, t… ▽ More

    Submitted 30 April, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

  5. arXiv:2306.11686  [pdf, other

    cs.DC

    GPU First -- Execution of Legacy CPU Codes on GPUs

    Authors: Shilei Tian, Tom Scogland, Barbara Chapman, Johannes Doerfert

    Abstract: Utilizing GPUs is critical for high performance on heterogeneous systems. However, leveraging the full potential of GPUs for accelerating legacy CPU applications can be a challenging task for developers. The porting process requires identifying code regions amenable to acceleration, managing distinct memories, synchronizing host and device execution, and handling library functions that may not be… ▽ More

    Submitted 26 June, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

  6. arXiv:2207.00257  [pdf, other

    cs.PL cs.DC

    High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs

    Authors: William S. Moses, Ivan R. Ivanov, Jens Domke, Toshio Endo, Johannes Doerfert, Oleksandr Zinenko

    Abstract: While parallelism remains the main source of performance, architectural implementations and programming models change with each new hardware generation, often leading to costly application re-engineering. Most tools for performance portability require manual and costly application porting to yet another programming model. We propose an alternative approach that automatically translates programs… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

  7. arXiv:2106.14332  [pdf, other

    cs.DC cond-mat.mtrl-sci cs.AR cs.CL cs.SE

    A Case Study of LLVM-Based Analysis for Optimizing SIMD Code Generation

    Authors: Joseph Huber, Weile Wei, Giorgis Georgakoudis, Johannes Doerfert, Oscar Hernandez

    Abstract: This paper presents a methodology for using LLVM-based tools to tune the DCA++ (dynamical clusterapproximation) application that targets the new ARM A64FX processor. The goal is to describethe changes required for the new architecture and generate efficient single instruction/multiple data(SIMD) instructions that target the new Scalable Vector Extension instruction set. During manualtuning, the au… ▽ More

    Submitted 27 June, 2021; originally announced June 2021.

  8. Experience Report: Writing A Portable GPU Runtime with OpenMP 5.1

    Authors: Shilei Tian, Jon Chesterfield, Johannes Doerfert, Barbara Chapman

    Abstract: GPU runtimes are historically implemented in CUDA or other vendor specific languages dedicated to GPU programming. In this work we show that OpenMP 5.1, with minor compiler extensions, is capable of replacing existing solutions without a performance penalty. The result is a performant and portable GPU runtime that can be compiled with LLVM/Clang to Nvidia and AMD GPUs without the need for CUDA or… ▽ More

    Submitted 6 June, 2021; originally announced June 2021.

  9. arXiv:2010.08439  [pdf, other

    cs.PL

    Really Embedding Domain-Specific Languages into C++

    Authors: Hal Finkel, Alexander McCaskey, Tobi Popoola, Dmitry Lyakh, Johannes Doerfert

    Abstract: Domain-specific languages (DSLs) are both pervasive and powerful, but remain difficult to integrate into large projects. As a result, while DSLs can bring distinct advantages in performance, reliability, and maintainability, their use often involves trading off other good software-engineering practices. In this paper, we describe an extension to the Clang C++ compiler to support syntax plugins, an… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

  10. arXiv:1505.07716  [pdf, other

    cs.PL cs.DC

    Polly's Polyhedral Scheduling in the Presence of Reductions

    Authors: Johannes Doerfert, Kevin Streit, Sebastian Hack, Zino Benaissa

    Abstract: The polyhedral model provides a powerful mathematical abstraction to enable effective optimization of loop nests with respect to a given optimization goal, e.g., exploiting parallelism. Unexploited reduction properties are a frequent reason for polyhedral optimizers to assume parallelism prohibiting dependences. To our knowledge, no polyhedral loop optimizer available in any production compiler pr… ▽ More

    Submitted 28 May, 2015; originally announced May 2015.

    Comments: Presented at the IMPACT15 workshop

    ACM Class: D.3.4