Skip to main content

Showing 1–9 of 9 results for author: Kreutzer, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.17480  [pdf, other

    cs.NI

    Resilience-by-Design Concepts for 6G Communication Networks

    Authors: Ladan Khaloopour, Yanpeng Su, Florian Raskob, Tobias Meuser, Roland Bless, Leon Würsching, Kamyar Abedi, Marko Andjelkovic, Hekma Chaari, Pousali Chakraborty, Michael Kreutzer, Matthias Hollick, Thorsten Strufe, Norman Franchi, Vahid Jamali

    Abstract: The sixth generation (6G) mobile communication networks are expected to intelligently integrate into various aspects of modern digital society, including smart cities, homes, healthcare, transportation, and factories. While offering a multitude of services, it is likely that societies become increasingly reliant on 6G infrastructure. Any disruption to these digital services, whether due to human o… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  2. MLComp: A Methodology for Machine Learning-based Performance Estimation and Adaptive Selection of Pareto-Optimal Compiler Optimization Sequences

    Authors: Alessio Colucci, Dávid Juhász, Martin Mosbeck, Alberto Marchisio, Semeen Rehman, Manfred Kreutzer, Guenther Nadbath, Axel Jantsch, Muhammad Shafique

    Abstract: Embedded systems have proliferated in various consumer and industrial applications with the evolution of Cyber-Physical Systems and the Internet of Things. These systems are subjected to stringent constraints so that embedded software must be optimized for multiple objectives simultaneously, namely reduced energy consumption, execution time, and code size. Compilers offer optimization phases to im… ▽ More

    Submitted 11 December, 2020; v1 submitted 9 December, 2020; originally announced December 2020.

    Comments: Accepted for publication at the 24th IEEE/ACM Design, Automation and Test in Europe (DATE'21) Conference, February, 2021

  3. arXiv:1803.02156  [pdf, ps, other

    cs.MS cs.PF physics.comp-ph

    Chebyshev Filter Diagonalization on Modern Manycore Processors and GPGPUs

    Authors: Moritz Kreutzer, Georg Hager, Dominik Ernst, Holger Fehske, Alan R. Bishop, Gerhard Wellein

    Abstract: Chebyshev filter diagonalization is well established in quantum chemistry and quantum physics to compute bulks of eigenvalues of large sparse matrices. Choosing a block vector implementation, we investigate optimization opportunities on the new class of high-performance compute devices featuring both high-bandwidth and low-bandwidth memory. We focus on the transparent access to the full address sp… ▽ More

    Submitted 6 March, 2018; originally announced March 2018.

    Comments: 18 pages, 8 figures

  4. arXiv:1708.02030  [pdf, ps, other

    cs.DC

    CRAFT: A library for easier application-level Checkpoint/Restart and Automatic Fault Tolerance

    Authors: Faisal Shahzad, Jonas Thies, Moritz Kreutzer, Thomas Zeiser, Georg Hager, Gerhard Wellein

    Abstract: In order to efficiently use the future generations of supercomputers, fault tolerance and power consumption are two of the prime challenges anticipated by the High Performance Computing (HPC) community. Checkpoint/Restart (CR) has been and still is the most widely used technique to deal with hard failures. Application-level CR is the most effective CR technique in terms of overhead efficiency but… ▽ More

    Submitted 7 August, 2017; originally announced August 2017.

  5. GHOST: Building blocks for high performance sparse linear algebra on heterogeneous systems

    Authors: Moritz Kreutzer, Jonas Thies, Melven Röhrig-Zöllner, Andreas Pieper, Faisal Shahzad, Martin Galgon, Achim Basermann, Holger Fehske, Georg Hager, Gerhard Wellein

    Abstract: While many of the architectural details of future exascale-class high performance computer systems are still a matter of intense research, there appears to be a general consensus that they will be strongly heterogeneous, featuring "standard" as well as "accelerated" resources. Today, such resources are available as multicore processors, graphics processing units (GPUs), and other accelerators such… ▽ More

    Submitted 15 February, 2016; v1 submitted 29 July, 2015; originally announced July 2015.

    Comments: 32 pages, 11 figures

  6. arXiv:1505.04628  [pdf, ps, other

    cs.DC

    Building a fault tolerant application using the GASPI communication layer

    Authors: Faisal Shahzad, Moritz Kreutzer, Thomas Zeiser, Rui Machado, Andreas Pieper, Georg Hager, Gerhard Wellein

    Abstract: It is commonly agreed that highly parallel software on Exascale computers will suffer from many more runtime failures due to the decreasing trend in the mean time to failures (MTTF). Therefore, it is not surprising that a lot of research is going on in the area of fault tolerance and fault mitigation. Applications should survive a failure and/or be able to recover with minimal cost. MPI is not yet… ▽ More

    Submitted 18 May, 2015; originally announced May 2015.

  7. arXiv:1410.5242  [pdf, ps, other

    cs.CE cond-mat.mes-hall cs.DC cs.PF physics.comp-ph

    Performance Engineering of the Kernel Polynomial Method on Large-Scale CPU-GPU Systems

    Authors: Moritz Kreutzer, Georg Hager, Gerhard Wellein, Andreas Pieper, Andreas Alvermann, Holger Fehske

    Abstract: The Kernel Polynomial Method (KPM) is a well-established scheme in quantum physics and quantum chemistry to determine the eigenvalue density and spectral properties of large sparse matrices. In this work we demonstrate the high optimization potential and feasibility of peta-scale heterogeneous CPU-GPU implementations of the KPM. At the node level we show that it is possible to decouple the sparse… ▽ More

    Submitted 29 July, 2015; v1 submitted 20 October, 2014; originally announced October 2014.

    Comments: 10 pages, 12 figures

    Journal ref: Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 417-426

  8. A unified sparse matrix data format for efficient general sparse matrix-vector multiply on modern processors with wide SIMD units

    Authors: Moritz Kreutzer, Georg Hager, Gerhard Wellein, Holger Fehske, Alan R. Bishop

    Abstract: Sparse matrix-vector multiplication (spMVM) is the most time-consuming kernel in many numerical algorithms and has been studied extensively on all modern processor and accelerator architectures. However, the optimal sparse matrix data storage format is highly hardware-specific, which could become an obstacle when using heterogeneous systems. Also, it is as yet unclear how the wide single instructi… ▽ More

    Submitted 5 March, 2014; v1 submitted 23 July, 2013; originally announced July 2013.

    Comments: 23 pages, 7 figures, 6 listings

    Journal ref: SIAM Journal on Scientific Computing 2014 36:5, C401-C423

  9. arXiv:1112.5588  [pdf, ps, other

    cs.DC cs.MS cs.PF math.NA

    Sparse matrix-vector multiplication on GPGPU clusters: A new storage format and a scalable implementation

    Authors: Moritz Kreutzer, Georg Hager, Gerhard Wellein, Holger Fehske, Achim Basermann, Alan R. Bishop

    Abstract: Sparse matrix-vector multiplication (spMVM) is the dominant operation in many sparse solvers. We investigate performance properties of spMVM with matrices of various sparsity patterns on the nVidia "Fermi" class of GPGPUs. A new "padded jagged diagonals storage" (pJDS) format is proposed which may substantially reduce the memory overhead intrinsic to the widespread ELLPACK-R scheme. In our test sc… ▽ More

    Submitted 29 February, 2012; v1 submitted 23 December, 2011; originally announced December 2011.

    Comments: 10 pages, 5 figures. Added reference to other recent sparse matrix formats