Showing 1–15 of 15 results for author: Anzt, H

  1. Fast Truncated SVD of Sparse and Dense Matrices on Graphics Processors

    Authors: Andres E. Tomas, Enrique S. Quintana-Orti, Hartwig Anzt

    Abstract: We investigate the solution of low-rank matrix approximation problems using the truncated SVD. For this purpose, we develop and optimize GPU implementations for the randomized SVD and a blocked variant of the Lanczos approach. Our work takes advantage of the fact that the two methods are composed of very similar linear algebra building blocks, which can be assembled using numerical kernels from ex…

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: 16 pages, 4 figures

    ACM Class: G.4.5

    Journal ref: The International Journal of High Performance Computing Applications. 2023;37(3-4):380-393

  2. arXiv:2309.09002

    physics.ao-ph cs.AI cs.CE cs.CY physics.soc-ph

    Earth Virtualization Engines -- A Technical Perspective

    Authors: Torsten Hoefler, Bjorn Stevens, Andreas F. Prein, Johanna Baehr, Thomas Schulthess, Thomas F. Stocker, John Taylor, Daniel Klocke, Pekka Manninen, Piers M. Forster, Tobias Kölling, Nicolas Gruber, Hartwig Anzt, Claudia Frauen, Florian Ziemen, Milan Klöwer, Karthik Kashinath, Christoph Schär, Oliver Fuhrer, Bryan N. Lawrence

    Abstract: Participants of the Berlin Summit on Earth Virtualization Engines (EVEs) discussed ideas and concepts to improve our ability to cope with climate change. EVEs aim to provide interactive and accessible climate simulations and data for a wide range of users. They combine high-resolution physics-based models with machine learning techniques to improve the fidelity, efficiency, and interpretability of…

    Submitted 16 September, 2023; originally announced September 2023.

  3. Porting Batched Iterative Solvers onto Intel GPUs with SYCL

    Authors: Phuong Nguyen, Pratik Nayak, Hartwig Anzt

    Abstract: Batched linear solvers play a vital role in computational sciences, especially in the fields of plasma physics and combustion simulations. With the imminent deployment of the Aurora Supercomputer and other upcoming systems equipped with Intel GPUs, there is a compelling demand to expand the capabilities of these solvers for Intel GPU architectures. In this paper, we present our efforts in portin…

    Submitted 26 September, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

    Comments: 11 pages, 8 figures

    ACM Class: G.4

    Journal ref: In Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis (SC-W 2023)

  4. arXiv:2306.14337

    cs.CE

    GPU-Resident Sparse Direct Linear Solvers for Alternating Current Optimal Power Flow Analysis

    Authors: Kasia Świrydowicz, Nicholson Koukpaizan, Tobias Ribizel, Fritz Göbel, Shrirang Abhyankar, Hartwig Anzt, Slaven Peleš

    Abstract: Integrating renewable resources within the transmission grid at a wide scale poses significant challenges for economic dispatch as it requires analysis with more optimization parameters, constraints, and sources of uncertainty. This motivates the investigation of more efficient computational methods, especially those for solving the underlying linear systems, which typically take more than half of…

    Submitted 15 August, 2023; v1 submitted 25 June, 2023; originally announced June 2023.

    MSC Class: 65F05; 65F10; 65F50; 65K10; 65Y05; 65Y10; 90C51

  5. arXiv:2103.10116

    cs.DC cs.MS cs.PF

    Porting a sparse linear algebra math library to Intel GPUs

    Authors: Yuhsiang M. Tsai, Terry Cojean, Hartwig Anzt

    Abstract: With the announcement that the Aurora Supercomputer will be composed of general purpose Intel CPUs complemented by discrete high performance Intel GPUs, and the deployment of the oneAPI ecosystem, Intel has committed to enter the arena of discrete high performance GPUs. A central requirement for the scientific computing community is the availability of production-ready software stacks and a glimps…

    Submitted 18 March, 2021; originally announced March 2021.

    Comments: preprint, not submitted

  6. arXiv:2101.10883

    cs.SE

    A Fresh Look at FAIR for Research Software

    Authors: Daniel S. Katz, Morane Gruenpeter, Tom Honeyman, Lorraine Hwang, Mark D. Wilkinson, Vanessa Sochat, Hartwig Anzt, Carole Goble, for FAIR4RS Subgroup 1

    Abstract: This document captures the discussion and deliberation of the FAIR for Research Software (FAIR4RS) subgroup that took a fresh look at the applicability of the FAIR Guiding Principles for scientific data management and stewardship for research software. We discuss the vision of research software as ideally reproducible, open, usable, recognized, sustained and robust, and then review both the charac…

    Submitted 9 February, 2021; v1 submitted 26 January, 2021; originally announced January 2021.

  7. arXiv:2011.08879

    cs.DC cs.MS cs.PF cs.SE

    Ginkgo -- A Math Library designed for Platform Portability

    Authors: Terry Cojean, Yu-Hsiang "Mike" Tsai, Hartwig Anzt

    Abstract: The first associations to software sustainability might be the existence of a continuous integration (CI) framework; the existence of a testing framework composed of unit tests, integration tests, and end-to-end tests; and also the existence of software documentation. However, when asking what is a common deathblow for a scientific software product, it is often the lack of platform and performance…

    Submitted 17 November, 2020; originally announced November 2020.

    Comments: Submitted to Parallel Computing Journal (PARCO)

  8. arXiv:2010.13342

    cs.DC

    Resiliency in Numerical Algorithm Design for Extreme Scale Simulations

    Authors: Emmanuel Agullo, Mirco Altenbernd, Hartwig Anzt, Leonardo Bautista-Gomez, Tommaso Benacchio, Luca Bonaventura, Hans-Joachim Bungartz, Sanjay Chatterjee, Florina M. Ciorba, Nathan DeBardeleben, Daniel Drzisga, Sebastian Eibl, Christian Engelmann, Wilfried N. Gansterer, Luc Giraud, Dominik Goeddeke, Marco Heisig, Fabienne Jezequel, Nils Kohl, Xiaoye Sherry Li, Romain Lion, Miriam Mehl, Paul Mycek, Michael Obersteiner, Enrique S. Quintana-Orti, et al. (11 additional authors not shown)

    Abstract: This work is based on the seminar titled "Resiliency in Numerical Algorithm Design for Extreme Scale Simulations" held March 1-6, 2020 at Schloss Dagstuhl, that was attended by all the authors. Naive versions of conventional resilience techniques will not scale to the exascale regime: with a main memory footprint of tens of Petabytes, synchronously writing checkpoint data all the way to backgr…

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: 45 pages, 3 figures, submitted to The International Journal of High Performance Computing Applications

    ACM Class: D.4.5; G.4; G.1; D.4.4

  9. arXiv:2009.12101

    cs.MS

    Compressed Basis GMRES on High Performance GPUs

    Authors: José I. Aliaga, Hartwig Anzt, Thomas Grützmacher, Enrique S. Quintana-Ortí, Andrés E. Tomás

    Abstract: Krylov methods provide a fast and highly parallel numerical tool for the iterative solution of many large-scale sparse linear systems. To a large extent, the performance of practical realizations of these methods is constrained by the communication bandwidth in all current computer architectures, motivating the recent investigation of sophisticated techniques to avoid, reduce, and/or hide the mess…

    Submitted 25 September, 2020; originally announced September 2020.

  10. arXiv:2008.08478

    cs.MS cs.PF

    Evaluating the Performance of NVIDIA's A100 Ampere GPU for Sparse Linear Algebra Computations

    Authors: Yuhsiang Mike Tsai, Terry Cojean, Hartwig Anzt

    Abstract: GPU accelerators have become an important backbone for scientific high performance computing, and the performance advances obtained from adopting new GPU hardware are significant. In this paper we take a first look at NVIDIA's newest server line GPU, the A100 architecture part of the Ampere generation. Specifically, we assess its performance for sparse linear algebra operations that form the backb…

    Submitted 19 August, 2020; originally announced August 2020.

  11. arXiv:2007.06674

    cs.MS math.NA

    A Survey of Numerical Methods Utilizing Mixed Precision Arithmetic

    Authors: Ahmad Abdelfattah, Hartwig Anzt, Erik G. Boman, Erin Carson, Terry Cojean, Jack Dongarra, Mark Gates, Thomas Grützmacher, Nicholas J. Higham, Sherry Li, Neil Lindquist, Yang Liu, Jennifer Loe, Piotr Luszczek, Pratik Nayak, Sri Pranesh, Siva Rajamanickam, Tobias Ribizel, Barry Smith, Kasia Swirydowicz, Stephen Thomas, Stanimire Tomov, Yaohung M. Tsai, Ichitaro Yamazaki, Ulrike Meier Yang

    Abstract: Within the past years, hardware vendors have started designing low precision special function units in response to the demand of the Machine Learning community and their demand for high compute power in low precision formats. Also the server-line products are increasingly featuring low-precision special function units, such as the NVIDIA tensor cores in ORNL's Summit supercomputer providing more t…

    Submitted 13 July, 2020; originally announced July 2020.

    Comments: Technical report as a part of the Exascale computing project (ECP)

    ACM Class: G.1.3; G.4

  12. arXiv:2006.16852

    cs.MS

    Ginkgo: A Modern Linear Operator Algebra Framework for High Performance Computing

    Authors: Hartwig Anzt, Terry Cojean, Goran Flegar, Fritz Göbel, Thomas Grützmacher, Pratik Nayak, Tobias Ribizel, Yuhsiang Mike Tsai, Enrique S. Quintana-Ortí

    Abstract: In this paper, we present Ginkgo, a modern C++ math library for scientific high performance computing. While classical linear algebra libraries act on matrix and vector objects, Ginkgo's design principle abstracts all functionality as "linear operators", motivating the notation of a "linear operator algebra library". Ginkgo's current focus is oriented towards providing sparse linear algebra functi…

    Submitted 1 July, 2020; v1 submitted 30 June, 2020; originally announced June 2020.

    Comments: Preprint submitted to ACM Transactions on Mathematical Software

    ACM Class: D.2; G.1.3; G.4

  13. arXiv:2006.14290

    cs.MS

    Preparing Ginkgo for AMD GPUs -- A Testimonial on Porting CUDA Code to HIP

    Authors: Yuhsiang M. Tsai, Terry Cojean, Tobias Ribizel, Hartwig Anzt

    Abstract: With AMD reinforcing their ambition in the scientific high performance computing ecosystem, we extend the hardware scope of the Ginkgo linear algebra package to feature a HIP backend for AMD GPUs. In this paper, we report and discuss the porting effort from CUDA, the extension of the HIP framework to add missing features such as cooperative groups, the performance price of compiling HIP code for A…

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: Preprint submitted to HeteroPar

  14. An Environment for Sustainable Research Software in Germany and Beyond: Current State, Open Challenges, and Call for Action

    Authors: Hartwig Anzt, Felix Bach, Stephan Druskat, Frank Löffler, Axel Loewe, Bernhard Y. Renard, Gunnar Seemann, Alexander Struck, Elke Achhammer, Piush Aggarwal, Franziska Appel, Michael Bader, Lutz Brusch, Christian Busse, Gerasimos Chourdakis, Piotr W. Dabrowski, Peter Ebert, Bernd Flemisch, Sven Friedl, Bernadette Fritzsch, Maximilian D. Funk, Volker Gast, Florian Goth, Jean-Noël Grad, Sibylle Hermann, et al. (18 additional authors not shown)

    Abstract: Research software has become a central asset in academic research. It optimizes existing and enables new research methods, implements and embeds research knowledge, and constitutes an essential research product in itself. Research software must be sustainable in order to understand, replicate, reproduce, and build upon existing research or conduct new research effectively. In other words, software…

    Submitted 5 May, 2020; v1 submitted 27 April, 2020; originally announced May 2020.

    Comments: Official position paper 001 of de-RSE e.V. - Society for Research Software (https://de-rse.org) --- 16 pages, 1 figure + 1 page supplementary material, 4 figures --- Submitted to the F1000 Research Science Policy Research Gateway on 2020-04-03

    Journal ref: F1000Research 2020

  15. arXiv:2003.05361

    cs.DC cs.MS

    Evaluating Abstract Asynchronous Schwarz solvers on GPUs

    Authors: Pratik Nayak, Terry Cojean, Hartwig Anzt

    Abstract: With the commencement of the exascale computing era, we realize that the majority of the leadership supercomputers are heterogeneous and massively parallel even on a single node with multiple co-processors such as GPUs and multiple cores on each node. For example, ORNL's Summit accumulates six NVIDIA Tesla V100s and 42 core IBM Power9s on each node. Synchronizing across all these compute resources…

    Submitted 5 May, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

    Comments: Preprint submitted to IJHPCA