Skip to main content

Showing 1–16 of 16 results for author: Finkel, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2104.13242  [pdf, other

    cs.LG cs.PF

    Autotuning PolyBench Benchmarks with LLVM Clang/Polly Loop Optimization Pragmas Using Bayesian Optimization (extended version)

    Authors: Xingfu Wu, Michael Kruse, Prasanna Balaprakash, Hal Finkel, Paul Hovland, Valerie Taylor, Mary Hall

    Abstract: In this paper, we develop a ytopt autotuning framework that leverages Bayesian optimization to explore the parameter space search and compare four different supervised learning methods within Bayesian optimization and evaluate their effectiveness. We select six of the most complex PolyBench benchmarks and apply the newly developed LLVM Clang/Polly loop optimization pragmas to the benchmarks to opt… ▽ More

    Submitted 27 April, 2021; originally announced April 2021.

    Comments: Submitted to CCPE journal. arXiv admin note: substantial text overlap with arXiv:2010.08040

  2. arXiv:2102.01687  [pdf, other

    cs.LG cs.AI cs.PL cs.SE

    Report of the Workshop on Program Synthesis for Scientific Computing

    Authors: Hal Finkel, Ignacio Laguna

    Abstract: Program synthesis is an active research field in academia, national labs, and industry. Yet, work directly applicable to scientific computing, while having some impressive successes, has been limited. This report reviews the relevant areas of program synthesis work for scientific computing, discusses successes to date, and outlines opportunities for future work. This report is the result of the Wo… ▽ More

    Submitted 2 February, 2021; originally announced February 2021.

    Comments: 29 pages, workshop website: https://prog-synth-science.github.io/2020/

  3. arXiv:2010.08439  [pdf, other

    cs.PL

    Really Embedding Domain-Specific Languages into C++

    Authors: Hal Finkel, Alexander McCaskey, Tobi Popoola, Dmitry Lyakh, Johannes Doerfert

    Abstract: Domain-specific languages (DSLs) are both pervasive and powerful, but remain difficult to integrate into large projects. As a result, while DSLs can bring distinct advantages in performance, reliability, and maintainability, their use often involves trading off other good software-engineering practices. In this paper, we describe an extension to the Clang C++ compiler to support syntax plugins, an… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

  4. arXiv:2010.08040  [pdf, other

    cs.PF cs.LG cs.PL

    Autotuning PolyBench Benchmarks with LLVM Clang/Polly Loop Optimization Pragmas Using Bayesian Optimization

    Authors: Xingfu Wu, Michael Kruse, Prasanna Balaprakash, Hal Finkel, Paul Hovland, Valerie Taylor, Mary Hall

    Abstract: An autotuning is an approach that explores a search space of possible implementations/configurations of a kernel or an application by selecting and evaluating a subset of implementations/configurations on a target platform and/or use models to identify a high performance implementation/configuration. In this paper, we develop an autotuning framework that leverages Bayesian optimization to explore… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: to be published in the 11th International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS20)

  5. arXiv:2010.06521  [pdf, other

    cs.DC cs.CL

    Autotuning Search Space for Loop Transformations

    Authors: Michael Kruse, Hal Finkel, Xingfu Wu

    Abstract: One of the challenges for optimizing compilers is to predict whether applying an optimization will improve its execution speed. Programmers may override the compiler's profitability heuristic using optimization directives such as pragmas in the source code. Machine learning in the form of autotuning can assist users in finding the best optimizations for each platform. In this paper we propose a… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: LLVM-in-HPC 2020 preprint

  6. arXiv:2010.03935  [pdf, other

    quant-ph cs.MS

    Extending C++ for Heterogeneous Quantum-Classical Computing

    Authors: Thien Nguyen, Anthony Santana, Tyler Kharazi, Daniel Claudino, Hal Finkel, Alexander McCaskey

    Abstract: We present qcor - a language extension to C++ and compiler implementation that enables heterogeneous quantum-classical programming, compilation, and execution in a single-source context. Our work provides a first-of-its-kind C++ compiler enabling high-level quantum kernel (function) expression in a quantum-language agnostic manner, as well as a hardware-agnostic, retargetable compiler workflow tar… ▽ More

    Submitted 8 October, 2020; originally announced October 2020.

  7. Design and Use of Loop-Transformation Pragmas

    Authors: Michael Kruse, Hal Finkel

    Abstract: Adding a pragma directive into the source code is arguably easier than rewriting it, for instance for loop unrolling. Moreover, if the application is maintained for multiple platforms, their difference in performance characteristics may require different code transformations. Code transformation directives allow replacing the directives depending on the platform, i.e. separation of code semantics… ▽ More

    Submitted 6 October, 2019; originally announced October 2019.

    Comments: IWOMP 2019, September 11-13, Auckland, preprint

  8. arXiv:1904.08555  [pdf, other

    cs.PL

    ClangJIT: Enhancing C++ with Just-in-Time Compilation

    Authors: Hal Finkel, David Poliakoff, David F. Richards

    Abstract: The C++ programming language is not only a keystone of the high-performance-computing ecosystem but has proven to be a successful base for portable parallel-programming frameworks. As is well known, C++ programmers use templates to specialize algorithms, thus allowing the compiler to generate highly-efficient code for specific parameters, data structures, and so on. This capability has been limite… ▽ More

    Submitted 27 April, 2019; v1 submitted 17 April, 2019; originally announced April 2019.

    Report number: LLNL-CONF-772305, APT-151745

  9. arXiv:1811.05630  [pdf, other

    quant-ph cs.CC cs.ET

    Memory-Efficient Quantum Circuit Simulation by Using Lossy Data Compression

    Authors: Xin-Chuan Wu, Sheng Di, Franck Cappello, Hal Finkel, Yuri Alexeev, Frederic T. Chong

    Abstract: In order to evaluate, validate, and refine the design of new quantum algorithms or quantum computers, researchers and developers need methods to assess their correctness and fidelity. This requires the capabilities of quantum circuit simulations. However, the number of quantum state amplitudes increases exponentially with the number of qubits, leading to the exponential growth of the memory requir… ▽ More

    Submitted 14 November, 2018; v1 submitted 13 November, 2018; originally announced November 2018.

    Comments: 2 pages, 2 figures. The 3rd International Workshop on Post-Moore Era Supercomputing (PMES)

  10. arXiv:1811.05140  [pdf, other

    quant-ph cs.ET

    Amplitude-Aware Lossy Compression for Quantum Circuit Simulation

    Authors: Xin-Chuan Wu, Sheng Di, Franck Cappello, Hal Finkel, Yuri Alexeev, Frederic T. Chong

    Abstract: Classical simulation of quantum circuits is crucial for evaluating and validating the design of new quantum algorithms. However, the number of quantum state amplitudes increases exponentially with the number of qubits, leading to the exponential growth of the memory requirement for the simulations. In this paper, we present a new data reduction technique to reduce the memory requirement of quantum… ▽ More

    Submitted 14 November, 2018; v1 submitted 13 November, 2018; originally announced November 2018.

    Comments: 6pages, 6 figures. The 4th International Workshop on Data Reduction for Big Scientific Data (DRBSD-4)

  11. arXiv:1811.00632  [pdf, other

    cs.PL

    Loop Optimization Framework

    Authors: Michael Kruse, Hal Finkel

    Abstract: The LLVM compiler framework supports a selection of loop transformations such as vectorization, distribution and unrolling. Each transformation is carried-out by specialized passes that have been developed independently. In this paper we propose an integrated approach to loop optimizations: A single dedicated pass that mutates a Loop Structure DAG. Each transformation can make use of a common infr… ▽ More

    Submitted 1 November, 2018; originally announced November 2018.

    Comments: LCPC'18 preprint

  12. arXiv:1811.00624  [pdf, other

    cs.PL

    User-Directed Loop-Transformations in Clang

    Authors: Michael Kruse, Hal Finkel

    Abstract: Directives for the compiler such as pragmas can help programmers to separate an algorithm's semantics from its optimization. This keeps the code understandable and easier to optimize for different platforms. Simple transformations such as loop unrolling are already implemented in most mainstream compilers. We recently submitted a proposal to add generalized loop transformations to the OpenMP stand… ▽ More

    Submitted 1 November, 2018; originally announced November 2018.

    Comments: LLVM-HPC Workshop 2018 preprint

  13. A Proposal for Loop-Transformation Pragmas

    Authors: Michael Kruse, Hal Finkel

    Abstract: Pragmas for loop transformations, such as unrolling, are implemented in most mainstream compilers. They are used by application programmers because of their ease of use compared to directly modifying the source code of the relevant loops. We propose additional pragmas for common loop transformations that go far beyond the transformations today's compilers provide and should make most source rewrit… ▽ More

    Submitted 11 June, 2018; v1 submitted 9 May, 2018; originally announced May 2018.

    Comments: IWOMP'18 preprint

  14. arXiv:1610.02606  [pdf, other

    cs.OH

    Doing Moore with Less -- Leapfrogging Moore's Law with Inexactness for Supercomputing

    Authors: Sven Leyffer, Stefan M. Wild, Mike Fagan, Marc Snir, Krishna Palem, Kazutomo Yoshii, Hal Finkel

    Abstract: Energy and power consumption are major limitations to continued scaling of computing systems. Inexactness, where the quality of the solution can be traded for energy savings, has been proposed as an approach to overcoming those limitations. In the past, however, inexactness necessitated the need for highly customized or specialized hardware. The current evolution of commercial off-the-shelf(COTS)… ▽ More

    Submitted 12 October, 2016; v1 submitted 8 October, 2016; originally announced October 2016.

    Comments: 9 pages, 12 figures, PDFLaTeX. 12 Oct 2016: Corrected author Hal Finkel's affiliation to show ALCF/Argonne

    ACM Class: F.2.1; G.1.5

  15. arXiv:1510.08545  [pdf, ps, other

    physics.comp-ph cs.CE cs.DC hep-ex

    High Energy Physics Forum for Computational Excellence: Working Group Reports (I. Applications Software II. Software Libraries and Tools III. Systems)

    Authors: Salman Habib, Robert Roser, Tom LeCompte, Zach Marshall, Anders Borgland, Brett Viren, Peter Nugent, Makoto Asai, Lothar Bauerdick, Hal Finkel, Steve Gottlieb, Stefan Hoeche, Paul Sheldon, Jean-Luc Vay, Peter Elmer, Michael Kirby, Simon Patton, Maxim Potekhin, Brian Yanny, Paolo Calafiura, Eli Dart, Oliver Gutsche, Taku Izubuchi, Adam Lyon, Don Petravick

    Abstract: Computing plays an essential role in all aspects of high energy physics. As computational technology evolves rapidly in new directions, and data throughput and volume continue to follow a steep trend-line, it is important for the HEP community to develop an effective response to a series of expected challenges. In order to help shape the desired response, the HEP Forum for Computational Excellence… ▽ More

    Submitted 28 October, 2015; originally announced October 2015.

    Comments: 72 pages

  16. arXiv:1211.4864  [pdf, other

    cs.DC astro-ph.CO astro-ph.IM cs.PF physics.comp-ph

    The Universe at Extreme Scale: Multi-Petaflop Sky Simulation on the BG/Q

    Authors: Salman Habib, Vitali Morozov, Hal Finkel, Adrian Pope, Katrin Heitmann, Kalyan Kumaran, Tom Peterka, Joe Insley, David Daniel, Patricia Fasel, Nicholas Frontiere, Zarija Lukic

    Abstract: Remarkable observational advances have established a compelling cross-validated model of the Universe. Yet, two key pillars of this model -- dark matter and dark energy -- remain mysterious. Sky surveys that map billions of galaxies to explore the `Dark Universe', demand a corresponding extreme-scale simulation capability; the HACC (Hybrid/Hardware Accelerated Cosmology Code) framework has been de… ▽ More

    Submitted 19 November, 2012; originally announced November 2012.

    Comments: 11 pages, 11 figures, final version of paper for talk presented at SC12