Showing 1–22 of 22 results for author: Brandt, R

  1. arXiv:2407.00026  [pdf, other]

    cs.DC

    Distributed astrophysics simulations using Octo-Tiger with RISC-V CPUs using HPX and Kokkos

    Authors: Patrick Diehl, Gregor Daiß, Steven R. Brandt, Alireza Kheirkhahan, Srinivas Yadav Singanaboina, Dominic Marcello, Chris Taylor, John Leidel, Hartmut Kaiser

    Abstract: In recent years, interest in RISC-V computing architectures has moved from academic to mainstream, especially in the field of High Performance Computing where energy limitations are increasingly a point of concern. The results presented in this paper are part of a longer-term evaluation of RISC-V's viability for HPC applications. In this work, we use the Octo-Tiger multi-physics, multi-scale, 3D…

    Submitted 10 May, 2024; originally announced July 2024.

  2. HPX with Spack and Singularity Containers: Evaluating Overheads for HPX/Kokkos using an astrophysics application

    Authors: Patrick Diehl, Steven R. Brandt, Gregor Daiß, Hartmut Kaiser

    Abstract: Cloud computing for high performance computing resources is an emerging topic. This service is of interest to researchers who care about reproducible computing, for software packages with complex installations, and for companies or researchers who need the compute resources only occasionally or do not want to run and maintain a supercomputer on their own. The connection between HPC and containers…

    Submitted 7 May, 2024; v1 submitted 11 February, 2024; originally announced May 2024.

  3. Evaluating HPX and Kokkos on RISC-V using an Astrophysics Application Octo-Tiger

    Authors: Patrick Diehl, Gregor Daiss, Steven R. Brandt, Alireza Kheirkhahan, Hartmut Kaiser, Christopher Taylor, John Leidel

    Abstract: In recent years, computers based on the RISC-V architecture have raised broad interest in the high-performance computing (HPC) community. As the RISC-V community develops the core instruction set architecture (ISA) along with ISA extensions, the HPC community has been actively ensuring HPC applications and environments are supported. In this context, assessing the performance of asynchronous many-…

    Submitted 17 August, 2023; originally announced September 2023.

  4. arXiv:2308.03161  [pdf, other]

    cs.AI

    Precise Benchmarking of Explainable AI Attribution Methods

    Authors: Rafaël Brandt, Daan Raatjens, Georgi Gaydadjiev

    Abstract: The rationale behind a deep learning model's output is often difficult to understand by humans. EXplainable AI (XAI) aims at solving this by developing methods that improve interpretability and explainability of machine learning models. Reliable evaluation metrics are needed to assess and compare different XAI methods. We propose a novel evaluation approach for benchmarking state-of-the-art XAI at…

    Submitted 6 August, 2023; originally announced August 2023.

  5. Benchmarking the Parallel 1D Heat Equation Solver in Chapel, Charm++, C++, HPX, Go, Julia, Python, Rust, Swift, and Java

    Authors: Patrick Diehl, Steven R. Brandt, Max Morris, Nikunj Gupta, Hartmut Kaiser

    Abstract: Many scientific high performance codes that simulate e.g. black holes, coastal waves, climate and weather, etc. rely on block-structured meshes and use finite differencing methods to iteratively solve the appropriate systems of differential equations. In this paper we investigate implementations of an extremely simple simulation of this type using various programming systems and languages. We focu…

    Submitted 10 July, 2023; v1 submitted 18 May, 2023; originally announced July 2023.

  6. Shared memory parallelism in Modern C++ and HPX

    Authors: Patrick Diehl, Steven R. Brandt, Hartmut Kaiser

    Abstract: Parallel programming remains a daunting challenge, from the struggle to express a parallel algorithm without cluttering the underlying synchronous logic, to describing which devices to employ in a calculation, to correctness. Over the years, numerous solutions have arisen, many of them requiring new programming languages, extensions to programming languages, or the addition of pragmas. Support for…

    Submitted 9 August, 2023; v1 submitted 16 January, 2023; originally announced February 2023.

    Comments: Extended paper for the special issue

  7. arXiv:2208.00109  [pdf, other]

    cs.HC

    Traveler: Navigating Task Parallel Traces for Performance Analysis

    Authors: Sayef Azad Sakin, Alex Bigelow, R. Tohid, Connor Scully-Allison, Carlos Scheidegger, Steven R. Brandt, Christopher Taylor, Kevin A. Huck, Hartmut Kaiser, Katherine E. Isaacs

    Abstract: Understanding the behavior of software in execution is a key step in identifying and fixing performance issues. This is especially important in high performance computing contexts where even minor performance tweaks can translate into large savings in terms of computational resource use. To aid performance analysis, developers may collect an execution trace - a chronological log of program activit…

    Submitted 3 September, 2022; v1 submitted 29 July, 2022; originally announced August 2022.

    Comments: IEEE VIS 2022

  8. Deploying a Task-based Runtime System on Raspberry Pi Clusters

    Authors: Nikunj Gupta, Steve R. Brandt, Bibek Wagle, Nanmiao, Alireza Kheirkhahan, Patrick Diehl, Hartmut Kaiser, Felix W. Baumann

    Abstract: Arm technology is becoming increasingly important in HPC. Recently, Fugaku, an Arm-based system, was awarded the number one place in the Top500 list. Raspberry Pis provide an inexpensive platform to become familiar with this architecture. However, Pis can also be useful on their own. Here we describe our efforts to configure and benchmark the use of a Raspberry Pi cluster with the HPX/Phylanx pla…

    Submitted 9 April, 2021; v1 submitted 8 October, 2020; originally announced October 2020.

  9. arXiv:2006.15373  [pdf, other]

    cs.CV

    MTStereo 2.0: improved accuracy of stereo depth estimation with Max-trees

    Authors: Rafael Brandt, Nicola Strisciuglio, Nicolai Petkov

    Abstract: Efficient yet accurate extraction of depth from stereo image pairs is required by systems with low power resources, such as robotics and embedded systems. State-of-the-art stereo matching methods based on convolutional neural networks require intensive computations on GPUs and are difficult to deploy on embedded systems. In this paper, we propose a stereo matching method, called MTStereo 2.0, for…

    Submitted 27 June, 2020; originally announced June 2020.

  10. arXiv:1910.09902  [pdf]

    cs.SE

    Theory-Software Translation: Research Challenges and Future Directions

    Authors: Caroline Jay, Robert Haines, Daniel S. Katz, Jeffrey Carver, James C. Phillips, Anshu Dubey, Sandra Gesing, Matthew Turk, Hui Wan, Hubertus van Dam, James Howison, Vitali Morozov, Steven R. Brandt

    Abstract: The Theory-Software Translation Workshop, held in New Orleans in February 2019, explored in depth the process of both instantiating theory in software - for example, implementing a mathematical model in code as part of a simulation - and using the outputs of software - such as the behavior of a simulation - to advance knowledge. As computation within research is now ubiquitous, the workshop provid…

    Submitted 22 October, 2019; originally announced October 2019.

  11. arXiv:1904.08500  [pdf, other]

    cs.CV cs.LG eess.IV

    Machine Vision for Natural Gas Methane Emissions Detection Using an Infrared Camera

    Authors: Jingfan Wang, Lyne P. Tchapmi, Arvind P. Ravikumara, Mike McGuire, Clay S. Bell, Daniel Zimmerle, Silvio Savarese, Adam R. Brandt

    Abstract: It is crucial to reduce natural gas methane emissions, which can potentially offset the climate benefits of replacing coal with gas. Optical gas imaging (OGI) is a widely-used method to detect methane leaks, but is labor-intensive and cannot provide leak detection results without operators' judgment. In this paper, we develop a computer vision approach to OGI-based leak detection using convolution…

    Submitted 1 April, 2019; originally announced April 2019.

    Comments: This paper was submitted to Applied Energy

  12. arXiv:1604.05550  [pdf, other]

    cs.IT

    Joint Coordinated Precoding and Discrete Rate Selection in Multicell MIMO Networks

    Authors: Rasmus Brandt, Mats Bengtsson

    Abstract: Many practical wireless communications systems select their transmit rate from a finite set of modulation and coding schemes, which correspond to a set of discrete rates. In this paper, we therefore formulate a joint coordinated precoding and discrete rate selection problem for multiple-input multiple-output (MIMO) multicell networks. Compared to the common assumption of using the continuous Shann…

    Submitted 19 April, 2016; originally announced April 2016.

    Comments: Submitted to IEEE Signal Processing Letters

  13. Globally Optimal Base Station Clustering in Interference Alignment-Based Multicell Networks

    Authors: Rasmus Brandt, Rami Mochaourab, Mats Bengtsson

    Abstract: Coordinated precoding based on interference alignment is a promising technique for improving the throughputs in future wireless multicell networks. In small networks, all base stations can typically jointly coordinate their precoding. In large networks however, base station clustering is necessary due to the otherwise overwhelmingly high channel state information (CSI) acquisition overhead. In thi…

    Submitted 26 February, 2016; originally announced February 2016.

    Comments: Accepted in IEEE Signal Processing Letters. (c) 2016 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE

    Journal ref: IEEE Signal Processing Letters, Year: 2016, Volume: 23, Issue: 4, Pages: 512 - 516

  14. arXiv:1602.07859  [pdf, other]

    cs.IT

    Distributed Long-Term Base Station Clustering in Cellular Networks using Coalition Formation

    Authors: Rasmus Brandt, Rami Mochaourab, Mats Bengtsson

    Abstract: Interference alignment (IA) is a promising technique for interference mitigation in multicell networks due to its ability to completely cancel the intercell interference through linear precoding and receive filtering. In small networks, the amount of required channel state information (CSI) is modest and IA is therefore typically applied jointly over all base stations. In large networks, where the…

    Submitted 25 February, 2016; originally announced February 2016.

    Comments: Submitted to IEEE Transactions on Signal and Information Processing over Networks

  15. Report on the Third Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE3)

    Authors: Daniel S. Katz, Sou-Cheng T. Choi, Kyle E. Niemeyer, James Hetherington, Frank Löffler, Dan Gunter, Ray Idaszak, Steven R. Brandt, Mark A. Miller, Sandra Gesing, Nick D. Jones, Nic Weber, Suresh Marru, Gabrielle Allen, Birgit Penzenstadler, Colin C. Venters, Ethan Davis, Lorraine Hwang, Ilian Todorov, Abani Patra, Miguel de Val-Borro

    Abstract: This report records and discusses the Third Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE3). The report includes a description of the keynote presentation of the workshop, which served as an overview of sustainable scientific software. It also summarizes a set of lightning talks in which speakers highlighted to-the-point lessons and challenges pertaining to sustain…

    Submitted 6 February, 2016; originally announced February 2016.

  16. Interference Alignment-Aided Base Station Clustering using Coalition Formation

    Authors: Rasmus Brandt, Rami Mochaourab, Mats Bengtsson

    Abstract: Base station clustering is necessary in large interference networks, where the channel state information (CSI) acquisition overhead otherwise would be overwhelming. In this paper, we propose a novel long-term throughput model for the clustered users which addresses the balance between interference mitigation capability and CSI acquisition overhead. The model only depends on statistical CSI, thus e…

    Submitted 12 November, 2015; originally announced November 2015.

    Comments: 2nd Prize, Student Paper Contest. Copyright 2015 SS&C. Published in the Proceedings of the 49th Asilomar Conference on Signals, Systems and Computers, Nov 8-11, 2015, Pacific Grove, CA, USA

  17. arXiv:1504.06794  [pdf, other]

    cs.IT

    Overhead-Aware Distributed CSI Selection in the MIMO Interference Channel

    Authors: Rami Mochaourab, Rasmus Brandt, Hadi Ghauch, Mats Bengtsson

    Abstract: We consider a MIMO interference channel in which the transmitters and receivers operate in frequency-division duplex mode. In this setting, interference management through coordinated transceiver design necessitates channel state information at the transmitters (CSI-T). The acquisition of CSI-T is done through feedback from the receivers, which entails a loss in degrees of freedom, due to trainin…

    Submitted 6 July, 2015; v1 submitted 26 April, 2015; originally announced April 2015.

    Comments: 5 pages, 2 figures. To appear at EUSIPCO 2015, Special Session on Algorithms for Distributed Coordination and Learning

  18. arXiv:1410.1764  [pdf, other]

    cs.MS cs.DC

    Chemora: A PDE Solving Framework for Modern HPC Architectures

    Authors: Erik Schnetter, Marek Blazewicz, Steven R. Brandt, David M. Koppelman, Frank Löffler

    Abstract: Modern HPC architectures consist of heterogeneous multi-core, many-node systems with deep memory hierarchies. Modern applications employ ever more advanced discretisation methods to study multi-physics problems. Developing such applications that explore cutting-edge physics on cutting-edge HPC systems has become a complex task that requires significant HPC knowledge and experience. Unfortunately,…

    Submitted 3 October, 2014; originally announced October 2014.

  19. arXiv:1406.7756  [pdf, ps, other]

    cs.IT

    Optimal Scheduling for Interference Mitigation by Range Information

    Authors: Vijaya Yajnanarayana, Klas E. G. Magnusson, Rasmus Brandt, Satyam Dwivedi, Peter Händel

    Abstract: The multiple access scheduling decides how the channel is shared among the nodes in the network. Typical scheduling algorithms aim at increasing the channel utilization and thereby throughput of the network. This paper describes several algorithms for generating an optimal schedule in terms of channel utilization for multiple access by utilizing range information in a fully connected network. We…

    Submitted 1 September, 2016; v1 submitted 30 June, 2014; originally announced June 2014.

  20. arXiv:1309.1812  [pdf, other]

    cs.CE cs.MS cs.SE

    Cactus: Issues for Sustainable Simulation Software

    Authors: Frank Löffler, Steven R. Brandt, Gabrielle Allen, Erik Schnetter

    Abstract: The Cactus Framework is an open-source, modular, portable programming environment for the collaborative development and deployment of scientific applications using high-performance computing. Its roots reach back to 1996 at the National Center for Supercomputer Applications and the Albert Einstein Institute in Germany, where its development jumpstarted. Since then, the Cactus framework has witness…

    Submitted 15 September, 2013; v1 submitted 6 September, 2013; originally announced September 2013.

    Comments: submitted to the Workshop on Sustainable Software for Science: Practice and Experiences 2013

  21. arXiv:1307.6488  [pdf, other]

    physics.comp-ph cs.MS gr-qc

    From Physics Model to Results: An Optimizing Framework for Cross-Architecture Code Generation

    Authors: Marek Blazewicz, Ian Hinder, David M. Koppelman, Steven R. Brandt, Milosz Ciznicki, Michal Kierzynka, Frank Löffler, Erik Schnetter, Jian Tao

    Abstract: Starting from a high-level problem description in terms of partial differential equations using abstract tensor notation, the Chemora framework discretizes, optimizes, and generates complete high performance codes for a wide range of compute architectures. Chemora extends the capabilities of Cactus, facilitating the usage of large-scale CPU/GPU systems in an efficient manner for complex applicatio…

    Submitted 24 July, 2013; originally announced July 2013.

    Comments: 18 pages, 4 figures, accepted for publication in Scientific Programming

    Report number: AEI-2013-227

  22. arXiv:1201.2118  [pdf, other]

    cs.DC

    A Massive Data Parallel Computational Framework for Petascale/Exascale Hybrid Computer Systems

    Authors: Marek Blazewicz, Steven R. Brandt, Peter Diener, David M. Koppelman, Krzysztof Kurowski, Frank Löffler, Erik Schnetter, Jian Tao

    Abstract: Heterogeneous systems are becoming more common on High Performance Computing (HPC) systems. Even using tools like CUDA and OpenCL it is a non-trivial task to obtain optimal performance on the GPU. Approaches to simplifying this task include Merge (a library based framework for heterogeneous multi-core systems), Zippy (a framework for parallel execution of codes on multiple GPUs), BSGP (a new progr…

    Submitted 10 January, 2012; originally announced January 2012.

    Comments: Parallel Computing 2011 (ParCo2011), 30 August -- 2 September 2011, Ghent, Belgium