Showing 1–50 of 51 results for author: Rüde, U

Searching in archive cs.
  1. arXiv:2404.08371

    cs.CE

    Code Generation and Performance Engineering for Matrix-Free Finite Element Methods on Hybrid Tetrahedral Grids

    Authors: Fabian Böhm, Daniel Bauer, Nils Kohl, Christie Alappat, Dominik Thönnes, Marcus Mohr, Harald Köstler, Ulrich Rüde

    Abstract: This paper introduces a code generator designed for node-level optimized, extreme-scalable, matrix-free finite element operators on hybrid tetrahedral grids. It optimizes the local evaluation of bilinear forms through various techniques including tabulation, relocation of loop invariants, and inter-element vectorization - implemented as transformations of an abstract syntax tree. A key contributio…

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 22 pages

    MSC Class: 65F50; 65N30; 65N55; 65Y20; 65F10

  2. A Continuous Benchmarking Infrastructure for High-Performance Computing Applications

    Authors: Christoph Alt, Martin Lanser, Jonas Plewinski, Atin Janki, Axel Klawonn, Harald Köstler, Michael Selzer, Ulrich Rüde

    Abstract: For scientific software, especially software used for large-scale simulations, achieving good performance and efficiently using the available hardware resources is essential. It is important to regularly perform benchmarks to ensure the efficient use of hardware and software when systems are changing and the software evolves. However, this can quickly become very tedious when many options for paramet…

    Submitted 3 March, 2024; originally announced March 2024.

    Journal ref: International Journal of Parallel, Emergent & Distributed Systems, 2024

  3. arXiv:2402.13171

    cs.CE cs.DC physics.flu-dyn

    waLBerla-wind: a lattice-Boltzmann-based high-performance flow solver for wind energy applications

    Authors: Helen Schottenhamml, Ani Anciaux-Sedrakian, Frédéric Blondel, Harald Köstler, Ulrich Rüde

    Abstract: This article presents the development of a new wind turbine simulation software to study wake flow physics. To this end, the design and development of waLBerla-wind, a new simulator based on the lattice-Boltzmann method that is known for its excellent performance and scaling properties, will be presented. Here it will be used for large eddy simulations (LES) coupled with actuator wind turbine mode…

    Submitted 8 December, 2023; originally announced February 2024.

    Journal ref: Concurrency Computat Pract Exper. 2024;e8117

  4. arXiv:2308.01792

    cs.CE

    Fundamental Data Structures for Matrix-Free Finite Elements on Hybrid Tetrahedral Grids

    Authors: Nils Kohl, Daniel Bauer, Fabian Böhm, Ulrich Rüde

    Abstract: This paper presents efficient data structures for the implementation of matrix-free finite element methods on block-structured, hybrid tetrahedral grids. It provides a complete categorization of all geometric sub-objects that emerge from the regular refinement of the unstructured, tetrahedral coarse grid and describes efficient iteration patterns and analytical linearization functions for the mapp…

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: 21 pages

  5. Model-Based Performance Analysis of the HyTeG Finite Element Framework

    Authors: Dominik Thönnes, Ulrich Rüde

    Abstract: In this work, we present how code generation techniques significantly improve the performance of the computational kernels in the HyTeG software framework. This HPC framework combines the performance and memory advantages of matrix-free multigrid solvers with the flexibility of unstructured meshes. The pystencils code generation toolbox is used to replace the original abstract C++ kernels with hig…

    Submitted 24 May, 2023; originally announced May 2023.

  6. arXiv:2211.02435

    cs.MS cs.CE math.NA physics.comp-ph

    Advanced Automatic Code Generation for Multiple Relaxation-Time Lattice Boltzmann Methods

    Authors: Frederik Hennig, Markus Holzer, Ulrich Rüde

    Abstract: The scientific code generation package lbmpy supports the automated design and the efficient implementation of lattice Boltzmann methods (LBMs) through metaprogramming. It is based on a new, concise calculus for describing multiple relaxation-time LBMs, including techniques that enable the numerically advantageous subtraction of the constant background component from the populations. These techniq…

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: 23 pages, 6 figures

  7. arXiv:2103.02388

    cs.CE math.NA

    A massively parallel Eulerian-Lagrangian method for advection-dominated transport in viscous fluids

    Authors: Nils Kohl, Marcus Mohr, Sebastian Eibl, Ulrich Rüde

    Abstract: Motivated by challenges in Earth mantle convection, we present a massively parallel implementation of an Eulerian-Lagrangian method for the advection-diffusion equation in the advection-dominated regime. The advection term is treated by a particle-based, characteristics method coupled to a block-structured finite-element framework. Its numerical and computational performance is evaluated in multip…

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: 22 pages

    MSC Class: 65M25; 65Y05; 65M60

  8. arXiv:2012.06144

    physics.flu-dyn cs.MS cs.PF

    Highly Efficient Lattice-Boltzmann Multiphase Simulations of Immiscible Fluids at High-Density Ratios on CPUs and GPUs through Code Generation

    Authors: Markus Holzer, Martin Bauer, Ulrich Rüde

    Abstract: A high-performance implementation of a multiphase lattice Boltzmann method based on the conservative Allen-Cahn model supporting high-density ratios and high Reynolds numbers is presented. Metaprogramming techniques are used to generate optimized code for CPUs and GPUs automatically. The coupled model is specified in a high-level symbolic description and optimized through automatic transformations…

    Submitted 11 December, 2020; originally announced December 2020.

    Comments: 17 pages, 9 figures

  9. arXiv:2010.13513

    cs.CE cs.MS math.NA

    Textbook efficiency: massively parallel matrix-free multigrid for the Stokes system

    Authors: Nils Kohl, Ulrich Rüde

    Abstract: We employ textbook multigrid efficiency (TME), as introduced by Achi Brandt, to construct an asymptotically optimal monolithic multigrid solver for the Stokes system. The geometric multigrid solver builds upon the concept of hierarchical hybrid grids (HHG), which is extended to higher-order finite-element discretizations, and a corresponding matrix-free implementation. The computational cost of th…

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: 22 pages, 7 figures

    MSC Class: 65F10; 65N30; 65N55

  10. arXiv:2010.13342

    cs.DC

    Resiliency in Numerical Algorithm Design for Extreme Scale Simulations

    Authors: Emmanuel Agullo, Mirco Altenbernd, Hartwig Anzt, Leonardo Bautista-Gomez, Tommaso Benacchio, Luca Bonaventura, Hans-Joachim Bungartz, Sanjay Chatterjee, Florina M. Ciorba, Nathan DeBardeleben, Daniel Drzisga, Sebastian Eibl, Christian Engelmann, Wilfried N. Gansterer, Luc Giraud, Dominik Goeddeke, Marco Heisig, Fabienne Jezequel, Nils Kohl, Xiaoye Sherry Li, Romain Lion, Miriam Mehl, Paul Mycek, Michael Obersteiner, Enrique S. Quintana-Orti, et al. (11 additional authors not shown)

    Abstract: This work is based on the seminar titled "Resiliency in Numerical Algorithm Design for Extreme Scale Simulations" held March 1-6, 2020 at Schloss Dagstuhl, that was attended by all the authors. Naive versions of conventional resilience techniques will not scale to the exascale regime: with a main memory footprint of tens of Petabytes, synchronously writing checkpoint data all the way to backgr…

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: 45 pages, 3 figures, submitted to The International Journal of High Performance Computing Applications

    ACM Class: D.4.5; G.4; G.1; D.4.4

  11. arXiv:2001.11806

    cs.MS cs.CE cs.DC

    lbmpy: Automatic code generation for efficient parallel lattice Boltzmann methods

    Authors: Martin Bauer, Harald Köstler, Ulrich Rüde

    Abstract: Lattice Boltzmann methods are a popular mesoscopic alternative to macroscopic computational fluid dynamics solvers. Many variants have been developed that vary in complexity, accuracy, and computational cost. Extensions are available to simulate multi-phase, multi-component, turbulent, or non-Newtonian flows. In this work we present lbmpy, a code generation package that supports a wide variety of…

    Submitted 11 April, 2020; v1 submitted 31 January, 2020; originally announced January 2020.

  12. arXiv:2001.10424

    math.NA cs.DC

    Parallel solution of saddle point systems with nested iterative solvers based on the Golub-Kahan Bidiagonalization

    Authors: Carola Kruse, Masha Sosonkina, Mario Arioli, Nicolas Tardieu, Ulrich Ruede

    Abstract: We present a scalability study of Golub-Kahan bidiagonalization for the parallel iterative solution of symmetric indefinite linear systems with a 2x2 block structure. The algorithms have been implemented within the parallel numerical library PETSc. Since a nested inner-outer iteration strategy may be necessary, we investigate different choices for the inner solvers, including parallel sparse direc…

    Submitted 28 January, 2020; originally announced January 2020.

  13. arXiv:1909.13772

    cs.DC cs.CE physics.comp-ph

    waLBerla: A block-structured high-performance framework for multiphysics simulations

    Authors: Martin Bauer, Sebastian Eibl, Christian Godenschwager, Nils Kohl, Michael Kuron, Christoph Rettinger, Florian Schornbaum, Christoph Schwarzmeier, Dominik Thönnes, Harald Köstler, Ulrich Rüde

    Abstract: Programming current supercomputers efficiently is a challenging task. Multiple levels of parallelism on the core, on the compute node, and between nodes need to be exploited to make full use of the system. Heterogeneous hardware architectures with accelerators further complicate the development process. waLBerla addresses these challenges by providing the user with highly efficient building blocks…

    Submitted 30 September, 2019; originally announced September 2019.

  14. arXiv:1908.08666

    cs.CE

    Stencil scaling for vector-valued PDEs on hybrid grids with applications to generalized Newtonian fluids

    Authors: Daniel Drzisga, Ulrich Rüde, Barbara Wohlmuth

    Abstract: Matrix-free finite element implementations for large applications provide an attractive alternative to standard sparse matrix data formats due to the significantly reduced memory consumption. Here, we show that they are also competitive with respect to the run time in the low order case if combined with suitable stencil scaling techniques. We focus on variable coefficient vector-valued partial dif…

    Submitted 18 March, 2020; v1 submitted 23 August, 2019; originally announced August 2019.

  15. arXiv:1906.10963

    cs.SE cs.MS

    A Modular and Extensible Software Architecture for Particle Dynamics

    Authors: Sebastian Eibl, Ulrich Rüde

    Abstract: Creating a highly parallel and flexible discrete element software requires an interdisciplinary approach, where expertise from different disciplines is combined. On the one hand domain specialists provide interaction models between particles. On the other hand high-performance computing specialists optimize the code to achieve good performance on different hardware architectures. In particular, th…

    Submitted 26 June, 2019; originally announced June 2019.

    Comments: Proceedings of the 8th International Conference on Discrete Element Methods

  16. arXiv:1905.05042

    cond-mat.mes-hall cs.DC

    Computational Study of Ultrathin CNT Films with the Scalable Mesoscopic Distinct Element Method

    Authors: Igor Ostanin, Traian Dumitrică, Sebastian Eibl, Ulrich Rüde

    Abstract: In this work we present a computational study of the small strain mechanics of freestanding ultrathin CNT films under in-plane loading. The numerical modeling of the mechanics of representatively large specimens with realistic micro- and nanostructure is presented. Our simulations utilize the scalable implementation of the mesoscopic distinct element method of the waLBerla multi-physics framework.…

    Submitted 19 October, 2019; v1 submitted 13 May, 2019; originally announced May 2019.

  17. arXiv:1811.12742

    cs.DC physics.comp-ph

    Dynamic Load Balancing Techniques for Particulate Flow Simulations

    Authors: Christoph Rettinger, Ulrich Rüde

    Abstract: Parallel multiphysics simulations often suffer from load imbalances originating from the applied coupling of algorithms with spatially and temporally varying workloads. It is thus desirable to minimize these imbalances to reduce the time to solution and to better utilize the available hardware resources. Taking particulate flows as an illustrating example application, we present and evaluate load…

    Submitted 30 November, 2018; originally announced November 2018.

  18. arXiv:1808.07677

    cs.CE math.NA

    An iterative generalized Golub-Kahan algorithm for problems in structural mechanics

    Authors: Mario Arioli, Carola Kruse, Ulrich Ruede, Nicolas Tardieu

    Abstract: This paper studies the Craig variant of the Golub-Kahan bidiagonalization algorithm as an iterative solver for linear systems with saddle point structure. Such symmetric indefinite systems in 2x2 block form arise in many applications, but standard iterative solvers are often found to perform poorly on them and robust preconditioners may not be available. Specifically, such systems arise in structu…

    Submitted 23 August, 2018; originally announced August 2018.

  19. A Systematic Comparison of Dynamic Load Balancing Algorithms for Massively Parallel Rigid Particle Dynamics

    Authors: Sebastian Eibl, Ulrich Rüde

    Abstract: As compute power increases with time, more involved and larger simulations become possible. However, it gets increasingly difficult to efficiently use the provided computational resources. Especially in particle-based simulations with a spatial domain partitioning large load imbalances can occur due to the simulation being dynamic. Then a static domain partitioning may not be suitable. This can de…

    Submitted 2 August, 2019; v1 submitted 2 August, 2018; originally announced August 2018.

  20. arXiv:1805.10167

    cs.MS cs.DC

    A Scalable and Modular Software Architecture for Finite Elements on Hierarchical Hybrid Grids

    Authors: Nils Kohl, Dominik Thönnes, Daniel Drzisga, Dominik Bartuschat, Ulrich Rüde

    Abstract: In this article, a new generic higher-order finite-element framework for massively parallel simulations is presented. The modular software architecture is carefully designed to exploit the resources of modern and future supercomputers. Combining an unstructured topology with structured grid refinement facilitates high geometric adaptability and matrix-free multigrid implementations with excellent…

    Submitted 25 May, 2018; originally announced May 2018.

    Comments: Preprint of an article submitted to International Journal of Parallel, Emergent and Distributed Systems (Taylor & Francis)

  21. arXiv:1804.06373

    cs.MS

    Adaptive control in rollforward recovery for extreme scale multigrid

    Authors: Markus Huber, Ulrich Rüde, Barbara Wohlmuth

    Abstract: With the increasing number of compute components, failures in future exa-scale computer systems are expected to become more frequent. This motivates the study of novel resilience techniques. Here, we extend a recently proposed algorithm-based recovery method for multigrid iterations by introducing an adaptive control. After a fault, the healthy part of the system continues the iterative solution p…

    Submitted 17 April, 2018; originally announced April 2018.

  22. arXiv:1802.02765

    cs.DC cs.PF physics.comp-ph

    A local parallel communication algorithm for polydisperse rigid body dynamics

    Authors: Sebastian Eibl, Ulrich Rüde

    Abstract: The simulation of large ensembles of particles is usually parallelized by partitioning the domain spatially and using message passing to communicate between the processes handling neighboring subdomains. The particles are represented as individual geometric objects and are associated to the subdomains. Handling collisions and migrating particles between subdomains, as required for proper parallel…

    Submitted 2 August, 2018; v1 submitted 8 February, 2018; originally announced February 2018.

  23. arXiv:1711.00336

    cs.CE physics.comp-ph

    A Coupled Lattice Boltzmann Method and Discrete Element Method for Discrete Particle Simulations of Particulate Flows

    Authors: Christoph Rettinger, Ulrich Rüde

    Abstract: Discrete particle simulations are widely used to study large-scale particulate flows in complex geometries where particle-particle and particle-fluid interactions require an adequate representation but the computational cost has to be kept low. In this work, we present a novel coupling approach for such simulations. A lattice Boltzmann formulation of the generalized Navier-Stokes equations is used…

    Submitted 1 November, 2017; originally announced November 2017.

  24. arXiv:1709.06793

    math.NA cs.CE

    A stencil scaling approach for accelerating matrix-free finite element implementations

    Authors: Simon Bauer, Daniel Drzisga, Marcus Mohr, Ulrich Ruede, Christian Waluga, Barbara Wohlmuth

    Abstract: We present a novel approach to fast on-the-fly low order finite element assembly for scalar elliptic partial differential equations of Darcy type with variable coefficients optimized for matrix-free implementations. Our approach introduces a new operator that is obtained by appropriately scaling the reference stiffness matrix from the constant coefficient case. Assuming sufficient regularity, an a…

    Submitted 23 July, 2018; v1 submitted 20 September, 2017; originally announced September 2017.

  25. A Scalable Multiphysics Algorithm for Massively Parallel Direct Numerical Simulations of Electrophoresis

    Authors: Dominik Bartuschat, Ulrich Rüde

    Abstract: In this article we introduce a novel coupled algorithm for massively parallel direct numerical simulations of electrophoresis in microfluidic flows. This multiphysics algorithm employs an Eulerian description of fluid and ions, combined with a Lagrangian representation of moving charged particles. The fixed grid facilitates efficient solvers and the employed lattice Boltzmann method can efficientl…

    Submitted 25 May, 2018; v1 submitted 29 August, 2017; originally announced August 2017.

    Comments: Accepted manuscript of publication in Journal of Computational Science (Elsevier)

  26. arXiv:1708.08286

    cs.DC

    A Scalable and Extensible Checkpointing Scheme for Massively Parallel Simulations

    Authors: Nils Kohl, Johannes Hötzer, Florian Schornbaum, Martin Bauer, Christian Godenschwager, Harald Köstler, Britta Nestler, Ulrich Rüde

    Abstract: Realistic simulations in engineering or in the materials sciences can consume enormous computing resources and thus require the use of massively parallel supercomputers. The probability of a failure increases both with the runtime and with the number of system components. For future exascale systems it is therefore considered critical that strategies are developed to make software resilient agains…

    Submitted 29 January, 2018; v1 submitted 28 August, 2017; originally announced August 2017.

  27. The Maximum Dissipation Principle in Rigid-Body Dynamics with Purely Inelastic Impacts

    Authors: Tobias Preclik, Sebastian Eibl, Ulrich Rüde

    Abstract: Formulating a consistent theory for rigid-body dynamics with impacts is an intricate problem. Twenty years ago Stewart published the first consistent theory with purely inelastic impacts and an impulsive friction model analogous to Coulomb friction. In this paper we demonstrate that the consistent impact model can exhibit multiple solutions with a varying degree of dissipation even in the single-c…

    Submitted 1 June, 2017; originally announced June 2017.

    Journal ref: Springer, Computational Mechanics, 2017

  28. Extreme-Scale Block-Structured Adaptive Mesh Refinement

    Authors: Florian Schornbaum, Ulrich Rüde

    Abstract: In this article, we present a novel approach for block-structured adaptive mesh refinement (AMR) that is suitable for extreme-scale parallelism. All data structures are designed such that the size of the meta data in each distributed processor memory remains bounded independent of the processor number. In all stages of the AMR process, we use only distributed algorithms. No central resources such…

    Submitted 13 April, 2018; v1 submitted 22 April, 2017; originally announced April 2017.

    Comments: 38 pages, 17 figures, 11 tables

    MSC Class: 68W10; 68W15; 68U20; 65Y05; 65Y20; 76P05

    Journal ref: SIAM J. Sci. Comput. 40-3 (2018), pp. C358-C387

  29. arXiv:1702.04910

    cs.CE physics.comp-ph physics.flu-dyn

    A comparative study of fluid-particle coupling methods for fully resolved lattice Boltzmann simulations

    Authors: Christoph Rettinger, Ulrich Rüde

    Abstract: The direct numerical simulation of particulate systems offers a unique approach to study the dynamics of fluid-solid suspensions by fully resolving the submerged particles and without introducing empirical models. For the lattice Boltzmann method, different variants exist to incorporate the fluid-particle interaction into the simulation. This paper provides a detailed and systematic comparison of…

    Submitted 16 February, 2017; originally announced February 2017.

    Comments: 29 pages, 13 figures, 4 tables

  30. arXiv:1610.02608

    cs.CE math.HO stat.OT

    Research and Education in Computational Science and Engineering

    Authors: Ulrich Rüde, Karen Willcox, Lois Curfman McInnes, Hans De Sterck, George Biros, Hans Bungartz, James Corones, Evin Cramer, James Crowley, Omar Ghattas, Max Gunzburger, Michael Hanke, Robert Harrison, Michael Heroux, Jan Hesthaven, Peter Jimack, Chris Johnson, Kirk E. Jordan, David E. Keyes, Rolf Krause, Vipin Kumar, Stefan Mayer, Juan Meza, Knut Martin Mørken, J. Tinsley Oden, et al. (8 additional authors not shown)

    Abstract: Over the past two decades the field of computational science and engineering (CSE) has penetrated both basic and applied research in academia, industry, and laboratories to advance discovery, optimize systems, support decision-makers, and educate the scientific and engineering workforce. Informed by centuries of theory and experiment, CSE performs computational experiments to answer questions that…

    Submitted 31 December, 2017; v1 submitted 8 October, 2016; originally announced October 2016.

    Comments: Major revision, to appear in SIAM Review

    Report number: Argonne National Laboratory Preprint ANL/MCS-P6054-0916

    MSC Class: 00A72; 62-07; 68U20; 68W01; 68W10; 97A99; 97M10; 97N80; 97R20; 97R30

    ACM Class: G.0; G.4; I.6; J.0; J.2; J.3; J.4; J.6; J.7; K.3.2

  31. arXiv:1607.03252

    cs.CE cs.DC cs.MS math.NA

    Scheduling massively parallel multigrid for multilevel Monte Carlo methods

    Authors: Björn Gmeiner, Daniel Drzisga, Ulrich Ruede, Robert Scheichl, Barbara Wohlmuth

    Abstract: The computational complexity of naive, sampling-based uncertainty quantification for 3D partial differential equations is extremely high. Multilevel approaches, such as multilevel Monte Carlo (MLMC), can reduce the complexity significantly, but to exploit them fully in a parallel environment, sophisticated scheduling strategies are needed. Often fast algorithms that are executed in parallel are es…

    Submitted 12 July, 2016; originally announced July 2016.

    MSC Class: G.1.8

  32. arXiv:1603.04633

    cond-mat.soft cs.CE physics.flu-dyn

    Microswimming with inertia

    Authors: Jayant Pande, Kristina Pickl, Oleg Trosman, Ulrich Rüde, Ana-Sunčana Smith

    Abstract: Microswimmers, especially in theoretical treatments, are generally taken to be completely inertia-free, since inertial effects on their motion are typically small and assuming their absence simplifies the problem considerably. Yet in nature there is no discrete break between swimmers for which inertia is negligibly small and for which it is detectable. Here we study a microswimming model for which…

    Submitted 6 November, 2016; v1 submitted 15 March, 2016; originally announced March 2016.

  33. A Python Extension for the Massively Parallel Multiphysics Simulation Framework waLBerla

    Authors: Martin Bauer, Florian Schornbaum, Christian Godenschwager, Matthias Markl, Daniela Anderl, Harald Köstler, Ulrich Rüde

    Abstract: We present a Python extension to the massively parallel HPC simulation toolkit waLBerla. waLBerla is a framework for stencil based algorithms operating on block-structured grids, with the main application field being fluid simulations in complex geometries using the lattice Boltzmann method. Careful performance engineering results in excellent node performance and good scalability to over 400,000…

    Submitted 23 November, 2015; originally announced November 2015.

  34. arXiv:1511.02134

    cs.CE cs.MS math.NA

    A quantitative performance analysis for Stokes solvers at the extreme scale

    Authors: Björn Gmeiner, Markus Huber, Lorenz John, Ulrich Rüde, Barbara Wohlmuth

    Abstract: This article presents a systematic quantitative performance analysis for large finite element computations on extreme scale computing systems. Three parallel iterative solvers for the Stokes system, discretized by low order tetrahedral elements, are compared with respect to their numerical efficiency and their scalability running on up to $786\,432$ parallel threads. A genuine multigrid method for…

    Submitted 6 November, 2015; originally announced November 2015.

    MSC Class: 65N55; 65Y05; 68Q25

  35. Massively Parallel Algorithms for the Lattice Boltzmann Method on Non-uniform Grids

    Authors: Florian Schornbaum, Ulrich Rüde

    Abstract: The lattice Boltzmann method exhibits excellent scalability on current supercomputing systems and has thus increasingly become an alternative method for large-scale non-stationary flow simulations, reaching up to a trillion grid nodes. Additionally, grid refinement can lead to substantial savings in memory and compute time. These savings, however, come at the cost of much more complex data structur…

    Submitted 21 January, 2016; v1 submitted 31 August, 2015; originally announced August 2015.

    Comments: 32 pages, 20 figures, 4 tables

    Journal ref: SIAM J. Sci. Comput. 38-2 (2016), pp. C96-C126

  36. arXiv:1508.02960

    cs.CE physics.flu-dyn

    Pore-scale lattice Boltzmann simulation of laminar and turbulent flow through a sphere pack

    Authors: Ehsan Fattahi, Christian Waluga, Barbara Wohlmuth, Ulrich Rüde, Michael Manhart, Rainer Helmig

    Abstract: The lattice Boltzmann method can be used to simulate flow through porous media with full geometrical resolution. With such a direct numerical simulation, it becomes possible to study fundamental effects which are difficult to assess either by developing macroscopic mathematical models or experiments. We first evaluate the lattice Boltzmann method with various boundary handling of the solid-wall an…

    Submitted 12 August, 2015; originally announced August 2015.

  37. arXiv:1507.06565

    cs.CE

    Large scale lattice Boltzmann simulation for the coupling of free and porous media flow

    Authors: Ehsan Fattahi, Christian Waluga, Barbara Wohlmuth, Ulrich Rüde

    Abstract: In this work, we investigate the interaction of free and porous media flow by large scale lattice Boltzmann simulations. We study the transport phenomena at the porous interface on multiple scales, i.e., we consider both, computationally generated pore-scale geometries and homogenized models at a macroscopic scale. The pore-scale results are compared to those obtained by using different transmissi…

    Submitted 23 July, 2015; originally announced July 2015.

  38. arXiv:1506.06185

    cs.MS math.NA

    Resilience for Multigrid Software at the Extreme Scale

    Authors: Markus Huber, Björn Gmeiner, Ulrich Rüde, Barbara Wohlmuth

    Abstract: Fault tolerant algorithms for the numerical approximation of elliptic partial differential equations on modern supercomputers play a more and more important role in the future design of exa-scale enabled iterative solvers. Here, we combine domain partitioning with highly scalable geometric multigrid schemes to obtain fast and fault-robust solvers in three dimensions. The recovery strategy is based…

    Submitted 19 June, 2015; originally announced June 2015.

    MSC Class: 65N55; 65Y05; 68Q85

  39. arXiv:1506.01684

    cs.DC physics.comp-ph

    Massively Parallel Phase-Field Simulations for Ternary Eutectic Directional Solidification

    Authors: Martin Bauer, Johannes Hötzer, Philipp Steinmetz, Marcus Jainta, Marco Berghoff, Florian Schornbaum, Christian Godenschwager, Harald Köstler, Britta Nestler, Ulrich Rüde

    Abstract: Microstructures forming during ternary eutectic directional solidification processes have significant influence on the macroscopic mechanical properties of metal alloys. For a realistic simulation, we use the well established thermodynamically consistent phase-field method and improve it with a new grand potential formulation to couple the concentration evolution. This extension is very compute in…

    Submitted 4 June, 2015; originally announced June 2015.

    Comments: submitted to Supercomputing 2015

  40. arXiv:1503.06869

    cs.CE physics.comp-ph physics.flu-dyn

    Two Computational Models for Simulating the Tumbling Motion of Elongated Particles in Fluids

    Authors: Dominik Bartuschat, Ellen Fischermeier, Katarina Gustavsson, Ulrich Rüde

    Abstract: Suspensions with fiber-like particles in the low Reynolds number regime are modeled by two different approaches that both use a Lagrangian representation of individual particles. The first method is the well-established formulation based on Stokes flow that is formulated as integral equations. It uses a slender body approximation for the fibers to represent the interaction between them directly wi…

    Submitted 23 March, 2015; originally announced March 2015.

    Comments: Submitted to the Journal Computers & Fluids (Elsevier)

  41. arXiv:1501.07400

    cs.CE cs.DC

    Resilience for Exascale Enabled Multigrid Methods

    Authors: Markus Huber, Björn Gmeiner, Ulrich Rüde, Barbara Wohlmuth

    Abstract: With the increasing number of components and further miniaturization the mean time between faults in supercomputers will decrease. System level fault tolerance techniques are expensive and cost energy, since they are often based on redundancy. Also classical check-point-restart techniques reach their limits when the time for storing the system state to backup memory becomes excessive. Therefore, a…

    Submitted 29 January, 2015; originally announced January 2015.

    MSC Class: 68W10; 68N30; 65N55

  42. arXiv:1501.05810  [pdf, other]

    cs.CE physics.comp-ph

    Ultrascale Simulations of Non-smooth Granular Dynamics

    Authors: Tobias Preclik, Ulrich Rüde

    Abstract: This article presents new algorithms for massively parallel granular dynamics simulations on distributed memory architectures using a domain partitioning approach. Collisions are modelled with hard contacts in order to hide their micro-dynamics and thus to extend the time and length scales that can be simulated. The multi-contact problem is solved using a non-linear block Gauss-Seidel method that…

    Submitted 23 January, 2015; originally announced January 2015.

    MSC Class: 65Y05 (Primary); 70F35; 70F40; 70E55 (Secondary)

    ACM Class: I.6.0

  43. arXiv:1410.6609  [pdf, other]

    cs.CE physics.comp-ph

    Parallel Multiphysics Simulations of Charged Particles in Microfluidic Flows

    Authors: Dominik Bartuschat, Ulrich Rüde

    Abstract: The article describes parallel multiphysics simulations of charged particles in microfluidic flows with the waLBerla framework. To this end, three physical effects are coupled: rigid body dynamics, fluid flow modelled by a lattice Boltzmann algorithm, and electric potentials represented by a finite volume discretisation. For solving the finite volume discretisation for the electrostatic forces, a…

    Submitted 24 October, 2014; originally announced October 2014.

    Comments: Submitted to Journal of Computational Science (Elsevier)

  44. arXiv:1406.5369  [pdf, other]

    cs.MS

    A Scala Prototype to Generate Multigrid Solver Implementations for Different Problems and Target Multi-Core Platforms

    Authors: Harald Koestler, Christian Schmitt, Sebastian Kuckuk, Frank Hannig, Juergen Teich, Ulrich Ruede

    Abstract: Many problems in computational science and engineering involve partial differential equations and thus require the numerical solution of large, sparse (non)linear systems of equations. Multigrid is known to be one of the most efficient methods for this purpose. However, the concrete multigrid algorithm and its implementation highly depend on the underlying problem and hardware. Therefore, changes…

    Submitted 20 June, 2014; originally announced June 2014.

  45. Numerical Investigations on Hatching Process Strategies for Powder Bed Based Additive Manufacturing using an Electron Beam

    Authors: Matthias Markl, Regina Ammer, Ulrich Rüde, Carolin Körner

    Abstract: This paper investigates hatching process strategies for additive manufacturing using an electron beam by numerical simulations. The underlying physical model and the corresponding three-dimensional thermal free surface lattice Boltzmann method of the simulation software are briefly presented. The simulation software has already been validated on the basis of experiments up to 1.2 kW beam power…

    Submitted 30 March, 2015; v1 submitted 13 March, 2014; originally announced March 2014.

    Journal ref: The International Journal of Advanced Manufacturing Technology: Volume 78, Issue 1 (2015), Page 239-247

  46. arXiv:1402.2440  [pdf, ps, other]

    cs.CE

    Validation Experiments for LBM Simulations of Electron Beam Melting

    Authors: Regina Ammer, Matthias Markl, Vera Jüchter, Carolin Körner, Ulrich Rüde

    Abstract: This paper validates 3D simulation results of electron beam melting (EBM) processes by comparing experimental and numerical data. The physical setup is presented and discretized by a three-dimensional (3D) thermal lattice Boltzmann method (LBM). An experimental process window is used for the validation depending on the line energy injected into the metal powder bed and the scan velocity of the e…

    Submitted 11 February, 2014; originally announced February 2014.

    Comments: submitted to "International Journal of Modern Physics C"

  47. arXiv:1303.1651  [pdf, ps, other]

    cs.PF cs.MS

    Model-guided Performance Analysis of the Sparse Matrix-Matrix Multiplication

    Authors: Tobias Scharpff, Klaus Iglberger, Georg Hager, Ulrich Ruede

    Abstract: Achieving high efficiency with numerical kernels for sparse matrices is of utmost importance, since they are part of many simulation codes and tend to use most of the available compute time and resources. In addition, especially in large-scale simulation frameworks, the readability and ease of use of mathematical expressions are essential components for the continuous maintenance, modification, and…

    Submitted 6 May, 2013; v1 submitted 7 March, 2013; originally announced March 2013.

    Comments: 8 pages, 12 figures. Small corrections w.r.t. previous version

  48. arXiv:1201.0351  [pdf, ps, other]

    cs.CE physics.flu-dyn

    Liquid-gas-solid flows with lattice Boltzmann: Simulation of floating bodies

    Authors: Simon Bogner, Ulrich Rüde

    Abstract: This paper presents a model for the simulation of liquid-gas-solid flows by means of the lattice Boltzmann method. The approach is built upon previous works for the simulation of liquid-solid particle suspensions on the one hand, and on a liquid-gas free surface model on the other. We show how the two approaches can be unified by a novel set of dynamic cell conversion rules. For evaluation, we con…

    Submitted 1 January, 2012; originally announced January 2012.

    Comments: 22 pages, Preprint submitted to Computers and Mathematics with Applications Special Issue ICMMES 2011, Proceedings of the Eighth International Conference for Mesoscopic Methods in Engineering and Science

  49. arXiv:1108.0786  [pdf, other]

    cs.CE cond-mat.soft physics.flu-dyn

    All good things come in threes - Three beads learn to swim with lattice Boltzmann and a rigid body solver

    Authors: Kristina Pickl, Jan Götz, Klaus Iglberger, Jayant Pande, Klaus Mecke, Ana-Suncana Smith, Ulrich Rüde

    Abstract: We simulate the self-propulsion of devices in a fluid in the regime of low Reynolds numbers. Each device consists of three bodies (spheres or capsules) connected with two damped harmonic springs. Sinusoidal driving forces compress the springs which are resolved within a rigid body physics engine. The latter is consistently coupled to a 3D lattice Boltzmann framework for the fluid dynamics. In simu…

    Submitted 3 August, 2011; originally announced August 2011.

  50. arXiv:1104.1729  [pdf, ps, other]

    cs.PF cs.PL

    Expression Templates Revisited: A Performance Analysis of the Current ET Methodology

    Authors: Klaus Iglberger, Georg Hager, Jan Treibig, Ulrich Ruede

    Abstract: In the last decade, Expression Templates (ET) have gained a reputation as an efficient performance optimization tool for C++ codes. This reputation builds on several ET-based linear algebra frameworks focused on combining both elegant and high-performance C++ code. However, on closer examination the assumption that ETs are a performance optimization technique cannot be maintained. In this paper we…

    Submitted 9 April, 2011; originally announced April 2011.

    Comments: 16 pages, 7 figures

    Journal ref: SIAM Journal on Scientific Computing 34(2), C42-C69 (2012)