Skip to main content

Showing 1–50 of 80 results for author: Rüde, U

.
  1. arXiv:2404.08371  [pdf, other

    cs.CE

    Code Generation and Performance Engineering for Matrix-Free Finite Element Methods on Hybrid Tetrahedral Grids

    Authors: Fabian Böhm, Daniel Bauer, Nils Kohl, Christie Alappat, Dominik Thönnes, Marcus Mohr, Harald Köstler, Ulrich Rüde

    Abstract: This paper introduces a code generator designed for node-level optimized, extreme-scalable, matrix-free finite element operators on hybrid tetrahedral grids. It optimizes the local evaluation of bilinear forms through various techniques including tabulation, relocation of loop invariants, and inter-element vectorization - implemented as transformations of an abstract syntax tree. A key contributio… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 22 pages

    MSC Class: 65F50; 65N30; 65N55; 65Y20; 65F10

  2. A Continuous Benchmarking Infrastructure for High-Performance Computing Applications

    Authors: Christoph Alt, Martin Lanser, Jonas Plewinski, Atin Janki, Axel Klawonn, Harald Köstler, Michael Selzer, Ulrich Rüde

    Abstract: For scientific software, especially those used for large-scale simulations, achieving good performance and efficiently using the available hardware resources is essential. It is important to regularly perform benchmarks to ensure the efficient use of hardware and software when systems are changing and the software evolves. However, this can become quickly very tedious when many options for paramet… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

    Journal ref: International Journal of Parallel, Emergent & Distributed Systems, 2024

  3. arXiv:2402.13171  [pdf, other

    cs.CE cs.DC physics.flu-dyn

    waLBerla-wind: a lattice-Boltzmann-based high-performance flow solver for wind energy applications

    Authors: Helen Schottenhamml, Ani Anciaux-Sedrakian, Frédéric Blondel, Harald Köstler, Ulrich Rüde

    Abstract: This article presents the development of a new wind turbine simulation software to study wake flow physics. To this end, the design and development of waLBerla-wind, a new simulator based on the lattice-Boltzmann method that is known for its excellent performance and scaling properties, will be presented. Here it will be used for large eddy simulations (LES) coupled with actuator wind turbine mode… ▽ More

    Submitted 8 December, 2023; originally announced February 2024.

    Journal ref: Concurrency Computat Pract Exper. 2024;e8117

  4. arXiv:2401.03041  [pdf, other

    physics.flu-dyn math.NA

    Development of a central-moment phase-field lattice Boltzmann model for thermocapillary flows: Droplet capture and computational performance

    Authors: Markus Holzer, Travis Mitchell, Christopher R. Leonardi, Ulrich Ruede

    Abstract: This study develops a computationally efficient phase-field lattice Boltzmann model with the capability to simulate thermocapillary flows. The model was implemented into the open-source simulation framework, waLBerla, and extended to conduct the collision stage using central moments. The multiphase model was coupled with both a passive-scalar thermal LB, and a RK solution to the energy equation in… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  5. arXiv:2310.06952  [pdf, ps, other

    math.NA

    Generalized Golub-Kahan bidiagonalization for nonsymmetric saddle point systems

    Authors: Andrei Dumitrasc, Carola Kruse, Ulrich Ruede

    Abstract: The generalized Golub-Kahan bidiagonalization has been used to solve saddle-point systems where the leading block is symmetric and positive definite. We extend this iterative method for the case where the symmetry condition no longer holds. We do so by relying on the known connection the algorithm has with the Conjugate Gradient method and following the line of reasoning that adapts the latter int… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 18 pages, 3 figures

    MSC Class: 65F10 (Primary) 65F50; 65N22 (Secondary)

  6. arXiv:2308.01792  [pdf, other

    cs.CE

    Fundamental Data Structures for Matrix-Free Finite Elements on Hybrid Tetrahedral Grids

    Authors: Nils Kohl, Daniel Bauer, Fabian Böhm, Ulrich Rüde

    Abstract: This paper presents efficient data structures for the implementation of matrix-free finite element methods on block-structured, hybrid tetrahedral grids. It provides a complete categorization of all geometric sub-objects that emerge from the regular refinement of the unstructured, tetrahedral coarse grid and describes efficient iteration patterns and analytical linearization functions for the mapp… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: 21 pages

  7. Deflation for the off-diagonal block in symmetric saddle point systems

    Authors: Andrei Dumitrasc, Carola Kruse, Ulrich Ruede

    Abstract: Deflation techniques are typically used to shift isolated clusters of small eigenvalues in order to obtain a tighter distribution and a smaller condition number. Such changes induce a positive effect in the convergence behavior of Krylov subspace methods, which are among the most popular iterative solvers for large sparse linear systems. We develop a deflation strategy for symmetric saddle point m… ▽ More

    Submitted 14 May, 2024; v1 submitted 28 May, 2023; originally announced May 2023.

    Comments: 28 pages, 13 figures

    MSC Class: 15A18 (Primary) 65F10; 65F15 (Secondary)

  8. Model-Based Performance Analysis of the HyTeG Finite Element Framework

    Authors: Dominik Thönnes, Ulrich Rüde

    Abstract: In this work, we present how code generation techniques significantly improve the performance of the computational kernels in the HyTeG software framework. This HPC framework combines the performance and memory advantages of matrix-free multigrid solvers with the flexibility of unstructured meshes. The pystencils code generation toolbox is used to replace the original abstract C++ kernels with hig… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  9. arXiv:2305.09910  [pdf, other

    physics.comp-ph math.NA

    Scalable Flow Simulations with the Lattice Boltzmann Method

    Authors: Markus Holzer, Jayesh Badwaik, Radim Vavrik, Gabriel Staffelbach, Andreas Herten, Ondrej Vysocky, Ilan Rocchi, Lubomir Riha, Romain Cuidard, Ulrich Ruede

    Abstract: The primary goal of the EuroHPC JU project SCALABLE is to develop an industrial Lattice Boltzmann Method (LBM)-based computational fluid dynamics (CFD) solver capable of exploiting current and future extreme scale architectures, expanding current capabilities of existing industrial LBM solvers by at least two orders of magnitude in terms of processor cores and lattice cells, while preserving its a… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  10. arXiv:2301.10674  [pdf, other

    physics.flu-dyn physics.comp-ph physics.geo-ph

    Particle-resolved simulation of antidunes in free-surface flows

    Authors: Christoph Schwarzmeier, Christoph Rettinger, Samuel Kemmler, Jonas Plewinski, Francisco Núñez-González, Harald Köstler, Ulrich Rüde, Bernhard Vowinckel

    Abstract: The interaction of supercritical turbulent flows with granular sediment beds is challenging to study both experimentally and numerically; this challenging task has hampered the advances in understanding antidunes, the most characteristic bedform of supercritical flows. This article presents the first numerical attempt to simulate upstream-migrating antidunes with geometrically resolved particles a… ▽ More

    Submitted 23 March, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Journal ref: Journal of Fluid Mechanics 961 (2023)

  11. arXiv:2211.02435  [pdf, other

    cs.MS cs.CE math.NA physics.comp-ph

    Advanced Automatic Code Generation for Multiple Relaxation-Time Lattice Boltzmann Methods

    Authors: Frederik Hennig, Markus Holzer, Ulrich Rüde

    Abstract: The scientific code generation package lbmpy supports the automated design and the efficient implementation of lattice Boltzmann methods (LBMs) through metaprogramming. It is based on a new, concise calculus for describing multiple relaxation-time LBMs, including techniques that enable the numerically advantageous subtraction of the constant background component from the populations. These techniq… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: 23 pages, 6 figures

  12. Inexact inner-outer Golub-Kahan bidiagonalization method: A relaxation strategy

    Authors: Vincent Darrigrand, Andrei Dumitrasc, Carola Kruse, Ulrich Ruede

    Abstract: We study an inexact inner-outer generalized Golub-Kahan algorithm for the solution of saddle-point problems with a two-times-two block structure. In each outer iteration, an inner system has to be solved which in theory has to be done exactly. Whenever the system is getting large, an inner exact solver is, however, no longer efficient or even feasible and iterative methods must be used. We focus t… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: 25 pages, 9 figures

    MSC Class: 65F10; 65F50; 65N22

  13. arXiv:2207.14496  [pdf, other

    physics.flu-dyn physics.comp-ph

    Comparison of refilling schemes in the free-surface lattice Boltzmann method

    Authors: Christoph Schwarzmeier, Ulrich Rüde

    Abstract: Simulating mobile liquid-gas interfaces with the free-surface lattice Boltzmann method (FSLBM) requires frequent re-initialization of fluid flow information in computational cells that convert from gas to liquid. The corresponding algorithm, here referred to as the refilling scheme, is crucial for the successful application of the FSLBM in terms of accuracy and numerical stability. This study comp… ▽ More

    Submitted 24 November, 2022; v1 submitted 29 July, 2022; originally announced July 2022.

    Comments: arXiv admin note: text overlap with arXiv:2207.13962

    Journal ref: AIP Advances 12 (2022)

  14. arXiv:2207.13962  [pdf, other

    physics.flu-dyn physics.comp-ph

    Analysis and comparison of boundary condition variants in the free-surface lattice Boltzmann method

    Authors: Christoph Schwarzmeier, Ulrich Rüde

    Abstract: The accuracy of the free-surface lattice Boltzmann method (FSLBM) depends significantly on the boundary condition employed at the free interface. Ideally, the chosen boundary condition balances the forces exerted by the liquid and gas pressure. Different variants of the same boundary condition are possible, depending on the number and choice of the particle distribution functions (PDFs) to which i… ▽ More

    Submitted 22 January, 2023; v1 submitted 28 July, 2022; originally announced July 2022.

    Journal ref: International Journal for Numerical Methods in Fluids (2023)

  15. arXiv:2206.11637  [pdf, other

    physics.flu-dyn physics.comp-ph

    Comparison of free-surface and conservative Allen-Cahn phase-field lattice Boltzmann method

    Authors: Christoph Schwarzmeier, Markus Holzer, Travis Mitchell, Moritz Lehmann, Fabian Häusl, Ulrich Rüde

    Abstract: This study compares the free-surface lattice Boltzmann method (FSLBM) with the conservative Allen-Cahn phase-field lattice Boltzmann method (PFLBM) in their ability to model two-phase flows in which the behavior of the system is dominated by the heavy phase. Both models are introduced and their individual properties, strengths and weaknesses are thoroughly discussed. Six numerical benchmark cases… ▽ More

    Submitted 24 November, 2022; v1 submitted 23 June, 2022; originally announced June 2022.

    Journal ref: Journal of Computational Physics 473 (2023)

  16. arXiv:2205.07543  [pdf, other

    physics.geo-ph physics.comp-ph

    Effect of Sediment Form and Form Distribution on Porosity: A Simulation Study Based on the Discrete Element Method

    Authors: Christoph Rettinger, Ulrich Rüde, Stefan Vollmer, Roy M. Frings

    Abstract: Porosity is one of the key properties of dense particle packings like sediment deposits and is influenced by a multitude of grain characteristics such as their size distribution and shape. In the present work, we focus on the form, a specific aspect of the overall shape, of sedimentary grains in order to investigate and quantify its effect on porosity, ultimately deriving novel porosity-prediction… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

  17. arXiv:2112.04353  [pdf, ps, other

    math.NA

    A decoupled numerical method for two-phase flows of different densities and viscosities in superposed fluid and porous layers

    Authors: Yali Gao, Daozhi Han, Xiaoming He, Ulrich Rüde

    Abstract: In this article we consider the numerical modeling and simulation via the phase field approach of two-phase flows of different densities and viscosities in superposed fluid and porous layers. The model consists of the Cahn-Hilliard-Navier-Stokes equations in the free flow region and the Cahn-Hilliard-Darcy equations in porous media that are coupled by seven domain interface boundary conditions. We… ▽ More

    Submitted 8 December, 2021; originally announced December 2021.

  18. arXiv:2103.10882  [pdf, other

    physics.flu-dyn physics.comp-ph

    Coupling fully resolved light particles with the Lattice Boltzmann method on adaptively refined grids

    Authors: Lukas Werner, Christoph Rettinger, Ulrich Rüde

    Abstract: The simulation of geometrically resolved rigid particles in a fluid relies on coupling algorithms to transfer momentum both ways between the particles and the fluid. In this article, the fluid flow is modeled with a parallel Lattice Boltzmann method using adaptive grid refinement to improve numerical efficiency. The coupling with the particles is realized with the momentum exchange method. When im… ▽ More

    Submitted 19 March, 2021; originally announced March 2021.

  19. arXiv:2103.04103  [pdf, other

    physics.flu-dyn cond-mat.soft physics.geo-ph

    Rheology of mobile sediment beds in laminar shear flow: effects of creep and polydispersity

    Authors: Christoph Rettinger, Sebastian Eibl, Ulrich Rüde, Bernhard Vowinckel

    Abstract: Classical scaling relationships for rheological quantities such as the $μ(J)$-rheology have become increasingly popular for closures of two-phase flow modeling. However, these frameworks have been derived for monodisperse particles. We aim to extend these considerations to sediment transport modeling by using a more realistic sediment composition. We investigate the rheological behavior of sheared… ▽ More

    Submitted 30 June, 2021; v1 submitted 6 March, 2021; originally announced March 2021.

  20. arXiv:2103.02388  [pdf, other

    cs.CE math.NA

    A massively parallel Eulerian-Lagrangian method for advection-dominated transport in viscous fluids

    Authors: Nils Kohl, Marcus Mohr, Sebastian Eibl, Ulrich Rüde

    Abstract: Motivated by challenges in Earth mantle convection, we present a massively parallel implementation of an Eulerian-Lagrangian method for the advection-diffusion equation in the advection-dominated regime. The advection term is treated by a particle-based, characteristics method coupled to a block-structured finite-element framework. Its numerical and computational performance is evaluated in multip… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: 22 pages

    MSC Class: 65M25; 65Y05; 65M60

  21. arXiv:2012.06144  [pdf, other

    physics.flu-dyn cs.MS cs.PF

    Highly Efficient Lattice-Boltzmann Multiphase Simulations of Immiscible Fluids at High-Density Ratios on CPUs and GPUs through Code Generation

    Authors: Markus Holzer, Martin Bauer, Ulrich Rüde

    Abstract: A high-performance implementation of a multiphase lattice Boltzmann method based on the conservative Allen-Cahn model supporting high-density ratios and high Reynolds numbers is presented. Metaprogramming techniques are used to generate optimized code for CPUs and GPUs automatically. The coupled model is specified in a high-level symbolic description and optimized through automatic transformations… ▽ More

    Submitted 11 December, 2020; originally announced December 2020.

    Comments: 17 pages, 9 figures

  22. arXiv:2010.13513  [pdf, other

    cs.CE cs.MS math.NA

    Textbook efficiency: massively parallel matrix-free multigrid for the Stokes system

    Authors: Nils Kohl, Ulrich Rüde

    Abstract: We employ textbook multigrid efficiency (TME), as introduced by Achi Brandt, to construct an asymptotically optimal monolithic multigrid solver for the Stokes system. The geometric multigrid solver builds upon the concept of hierarchical hybrid grids (HHG), which is extended to higher-order finite-element discretizations, and a corresponding matrix-free implementation. The computational cost of th… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: 22 pages, 7 figures

    MSC Class: 65F10; 65N30; 65N55

  23. arXiv:2010.13342  [pdf, other

    cs.DC

    Resiliency in Numerical Algorithm Design for Extreme Scale Simulations

    Authors: Emmanuel Agullo, Mirco Altenbernd, Hartwig Anzt, Leonardo Bautista-Gomez, Tommaso Benacchio, Luca Bonaventura, Hans-Joachim Bungartz, Sanjay Chatterjee, Florina M. Ciorba, Nathan DeBardeleben, Daniel Drzisga, Sebastian Eibl, Christian Engelmann, Wilfried N. Gansterer, Luc Giraud, Dominik Goeddeke, Marco Heisig, Fabienne Jezequel, Nils Kohl, Xiaoye Sherry Li, Romain Lion, Miriam Mehl, Paul Mycek, Michael Obersteiner, Enrique S. Quintana-Orti , et al. (11 additional authors not shown)

    Abstract: This work is based on the seminar titled ``Resiliency in Numerical Algorithm Design for Extreme Scale Simulations'' held March 1-6, 2020 at Schloss Dagstuhl, that was attended by all the authors. Naive versions of conventional resilience techniques will not scale to the exascale regime: with a main memory footprint of tens of Petabytes, synchronously writing checkpoint data all the way to backgr… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: 45 pages, 3 figures, submitted to The International Journal of High Performance Computing Applications

    ACM Class: D.4.5; G.4; G.1; D.4.4

  24. arXiv:2008.13046  [pdf

    cond-mat.mtrl-sci

    Densification of Single-Walled Carbon Nanotube Films: Mesoscopic Distinct Element Method Simulations and Experimental Validation

    Authors: Grigorii Drozdov, Igor Ostanin, Hao Xu, Yuezhou Wang, Traian Dumitrică, Artem Grebenko, Alexey P. Tsapenko, Yuriy Gladush, Georgy Ermolaev, Valentyn S. Volkov, Sebastian Eibl, Ulrich Rüde, Albert G. Nasibulin

    Abstract: Nanometer thin single-walled carbon nanotube (CNT) films collected from the aerosol chemical deposition reactors have gathered attention for their promising applications. Densification of these pristine films provides an important way to manipulate the mechanical, electronic, and optical properties. To elucidate the underlying microstructural level restructuring, which is ultimately responsible fo… ▽ More

    Submitted 29 August, 2020; originally announced August 2020.

    Comments: 12 figures

  25. arXiv:2003.01490  [pdf, other

    physics.comp-ph

    An efficient four-way coupled lattice Boltzmann - discrete element method for fully resolved simulations of particle-laden flows

    Authors: Christoph Rettinger, Ulrich Rüde

    Abstract: A four-way coupling scheme for the direct numerical simulation of particle-laden flows is developed and analyzed. It employs a novel adaptive multi-relaxation time lattice Boltzmann method to simulate the fluid phase efficiently. The momentum exchange method is used to couple the fluid and the particulate phase. The particle interactions in normal and tangential direction are accounted for by a di… ▽ More

    Submitted 3 March, 2020; originally announced March 2020.

  26. arXiv:2001.11806  [pdf, other

    cs.MS cs.CE cs.DC

    lbmpy: Automatic code generation for efficient parallel lattice Boltzmann methods

    Authors: Martin Bauer, Harald Köstler, Ulrich Rüde

    Abstract: Lattice Boltzmann methods are a popular mesoscopic alternative to macroscopic computational fluid dynamics solvers. Many variants have been developed that vary in complexity, accuracy, and computational cost. Extensions are available to simulate multi-phase, multi-component, turbulent, or non-Newtonian flows. In this work we present lbmpy, a code generation package that supports a wide variety of… ▽ More

    Submitted 11 April, 2020; v1 submitted 31 January, 2020; originally announced January 2020.

  27. arXiv:2001.10424  [pdf, ps, other

    math.NA cs.DC

    Parallel solution of saddle point systems with nested iterative solvers based on the Golub-Kahan Bidiagonalization

    Authors: Carola Kruse, Masha Sosonkina, Mario Arioli, Nicolas Tardieu, Ulrich Ruede

    Abstract: We present a scalability study of Golub-Kahan bidiagonalization for the parallel iterative solution of symmetric indefinite linear systems with a 2x2 block structure. The algorithms have been implemented within the parallel numerical library PETSc. Since a nested inner-outer iteration strategy may be necessary, we investigate different choices for the inner solvers, including parallel sparse direc… ▽ More

    Submitted 28 January, 2020; originally announced January 2020.

  28. arXiv:1909.13772  [pdf, other

    cs.DC cs.CE physics.comp-ph

    waLBerla: A block-structured high-performance framework for multiphysics simulations

    Authors: Martin Bauer, Sebastian Eibl, Christian Godenschwager, Nils Kohl, Michael Kuron, Christoph Rettinger, Florian Schornbaum, Christoph Schwarzmeier, Dominik Thönnes, Harald Köstler, Ulrich Rüde

    Abstract: Programming current supercomputers efficiently is a challenging task. Multiple levels of parallelism on the core, on the compute node, and between nodes need to be exploited to make full use of the system. Heterogeneous hardware architectures with accelerators further complicate the development process. waLBerla addresses these challenges by providing the user with highly efficient building blocks… ▽ More

    Submitted 30 September, 2019; originally announced September 2019.

  29. arXiv:1908.11746  [pdf, ps, other

    math.NA

    On numerical solution of full rank linear systems

    Authors: A. Dumitrasc, Ph. Leleux, C. Popa, D. Ruiz, U. Ruede

    Abstract: Matrices can be augmented by adding additional columns such that a partitioning of the matrix in blocks of rows defines mutually orthogonal subspaces. This augmented system can then be solved efficiently by a sum of projections onto these subspaces. The equivalence to the original linear system is ensured by adding additional rows to the matrix in a specific form. The resulting solution method is… ▽ More

    Submitted 30 August, 2019; originally announced August 2019.

  30. arXiv:1908.08666  [pdf, other

    cs.CE

    Stencil scaling for vector-valued PDEs on hybrid grids with applications to generalized Newtonian fluids

    Authors: Daniel Drzisga, Ulrich Rüde, Barbara Wohlmuth

    Abstract: Matrix-free finite element implementations for large applications provide an attractive alternative to standard sparse matrix data formats due to the significantly reduced memory consumption. Here, we show that they are also competitive with respect to the run time in the low order case if combined with suitable stencil scaling techniques. We focus on variable coefficient vector-valued partial dif… ▽ More

    Submitted 18 March, 2020; v1 submitted 23 August, 2019; originally announced August 2019.

  31. arXiv:1906.10963  [pdf, other

    cs.SE cs.MS

    A Modular and Extensible Software Architecture for Particle Dynamics

    Authors: Sebastian Eibl, Ulrich Rüde

    Abstract: Creating a highly parallel and flexible discrete element software requires an interdisciplinary approach, where expertise from different disciplines is combined. On the one hand domain specialists provide interaction models between particles. On the other hand high-performance computing specialists optimize the code to achieve good performance on different hardware architectures. In particular, th… ▽ More

    Submitted 26 June, 2019; originally announced June 2019.

    Comments: Proceedings Of The 8Th International Conference On Discrete Element Methods

  32. arXiv:1906.06884  [pdf, other

    math.NA

    Validation and calibration of coupled porous-medium and free-flow problems using pore-scale resolved models

    Authors: Iryna Rybak, Christoph Schwarzmeier, Elissa Eggenweiler, Ulrich Rüde

    Abstract: The correct choice of interface conditions and effective parameters for coupled macroscale free-flow and porous-medium models is crucial for a complete mathematical description of the problem under consideration and for accurate numerical simulation of applications. We consider single-fluid-phase systems described by the Stokes-Darcy model. Different sets of coupling conditions for this model are… ▽ More

    Submitted 26 June, 2019; v1 submitted 17 June, 2019; originally announced June 2019.

    MSC Class: 68N99; 76D07; 76S05

  33. arXiv:1905.05042  [pdf, other

    cond-mat.mes-hall cs.DC

    Computational Study of Ultrathin CNT Films with the Scalable Mesoscopic Distinct Element Method

    Authors: Igor Ostanin, Traian Dumitrică, Sebastian Eibl, Ulrich Rüde

    Abstract: In this work we present a computational study of the small strain mechanics of freestanding ultrathin CNT films under in-plane loading. The numerical modeling of the mechanics of representatively large specimens with realistic micro- and nanostructure is presented. Our simulations utilize the scalable implementation of the mesoscopic distinct element method of the waLBerla multi-physics framework.… ▽ More

    Submitted 19 October, 2019; v1 submitted 13 May, 2019; originally announced May 2019.

  34. arXiv:1811.12742  [pdf, other

    cs.DC physics.comp-ph

    Dynamic Load Balancing Techniques for Particulate Flow Simulations

    Authors: Christoph Rettinger, Ulrich Rüde

    Abstract: Parallel multiphysics simulations often suffer from load imbalances originating from the applied coupling of algorithms with spatially and temporally varying workloads. It is thus desirable to minimize these imbalances to reduce the time to solution and to better utilize the available hardware resources. Taking particulate flows as an illustrating example application, we present and evaluate load… ▽ More

    Submitted 30 November, 2018; originally announced November 2018.

  35. arXiv:1808.07677  [pdf, other

    cs.CE math.NA

    An iterative generalized Golub-Kahan algorithm for problems in structural mechanics

    Authors: Mario Arioli, Carola Kruse, Ulrich Ruede, Nicolas Tardieu

    Abstract: This paper studies the Craig variant of the Golub-Kahan bidiagonalization algorithm as an iterative solver for linear systems with saddle point structure. Such symmetric indefinite systems in 2x2 block form arise in many applications, but standard iterative solvers are often found to perform poorly on them and robust preconditioners may not be available. Specifically, such systems arise in structu… ▽ More

    Submitted 23 August, 2018; originally announced August 2018.

  36. A Systematic Comparison of Dynamic Load Balancing Algorithms for Massively Parallel Rigid Particle Dynamics

    Authors: Sebastian Eibl, Ulrich Rüde

    Abstract: As compute power increases with time, more involved and larger simulations become possible. However, it gets increasingly difficult to efficiently use the provided computational resources. Especially in particle-based simulations with a spatial domain partitioning large load imbalances can occur due to the simulation being dynamic. Then a static domain partitioning may not be suitable. This can de… ▽ More

    Submitted 2 August, 2019; v1 submitted 2 August, 2018; originally announced August 2018.

  37. arXiv:1805.10167  [pdf, other

    cs.MS cs.DC

    A Scalable and Modular Software Architecture for Finite Elements on Hierarchical Hybrid Grids

    Authors: Nils Kohl, Dominik Thönnes, Daniel Drzisga, Dominik Bartuschat, Ulrich Rüde

    Abstract: In this article, a new generic higher-order finite-element framework for massively parallel simulations is presented. The modular software architecture is carefully designed to exploit the resources of modern and future supercomputers. Combining an unstructured topology with structured grid refinement facilitates high geometric adaptability and matrix-free multigrid implementations with excellent… ▽ More

    Submitted 25 May, 2018; originally announced May 2018.

    Comments: Preprint of an article submitted to International Journal of Parallel, Emergent and Distributed Systems (Taylor & Francis)

  38. arXiv:1804.06373  [pdf, other

    cs.MS

    Adaptive control in rollforward recovery for extreme scale multigrid

    Authors: Markus Huber, Ulrich Rüde, Barbara Wohlmuth

    Abstract: With the increasing number of compute components, failures in future exa-scale computer systems are expected to become more frequent. This motivates the study of novel resilience techniques. Here, we extend a recently proposed algorithm-based recovery method for multigrid iterations by introducing an adaptive control. After a fault, the healthy part of the system continues the iterative solution p… ▽ More

    Submitted 17 April, 2018; originally announced April 2018.

  39. arXiv:1803.04937  [pdf, other

    physics.comp-ph physics.app-ph

    An improved lattice Boltzmann D3Q19 method based on an alternative equilibrium discretization

    Authors: Martin Bauer, Ulrich Rüde

    Abstract: Lattice Boltzmann simulations of three-dimensional, isothermal hydrodynamics often use either the D3Q19 or the D3Q27 velocity sets. While both models correctly approximate Navier-Stokes in the continuum limit, the D3Q19 model is computationally less expensive but has some known deficiencies regarding Galilean invariance, especially for high Reynolds number flows. In this work we present a novel me… ▽ More

    Submitted 28 August, 2018; v1 submitted 13 March, 2018; originally announced March 2018.

  40. arXiv:1802.02765  [pdf, other

    cs.DC cs.PF physics.comp-ph

    A local parallel communication algorithm for polydisperse rigid body dynamics

    Authors: Sebastian Eibl, Ulrich Rüde

    Abstract: The simulation of large ensembles of particles is usually parallelized by partitioning the domain spatially and using message passing to communicate between the processes handling neighboring subdomains. The particles are represented as individual geometric objects and are associated to the subdomains. Handling collisions and migrating particles between subdomains, as required for proper parallel… ▽ More

    Submitted 2 August, 2018; v1 submitted 8 February, 2018; originally announced February 2018.

  41. arXiv:1712.07028  [pdf, other

    physics.comp-ph physics.flu-dyn

    Direct simulation of liquid-gas-solid flow with a free surface lattice Boltzmann method

    Authors: Simon Bogner, Jens Harting, Ulrich Rüde

    Abstract: Direct numerical simulation of liquid-gas-solid flows is uncommon due to the considerable computational cost. As the grid spacing is determined by the smallest involved length scale, large grid sizes become necessary -- in particular if the bubble-particle aspect ratio is on the order of 10 or larger. Hence, it arises the question of both feasibility and reasonability. In this paper, we present a… ▽ More

    Submitted 19 December, 2017; originally announced December 2017.

    Comments: submitted to International Journal of Computational Fluid Dynamics

  42. arXiv:1711.00336  [pdf, other

    cs.CE physics.comp-ph

    A Coupled Lattice Boltzmann Method and Discrete Element Method for Discrete Particle Simulations of Particulate Flows

    Authors: Christoph Rettinger, Ulrich Rüde

    Abstract: Discrete particle simulations are widely used to study large-scale particulate flows in complex geometries where particle-particle and particle-fluid interactions require an adequate representation but the computational cost has to be kept low. In this work, we present a novel coupling approach for such simulations. A lattice Boltzmann formulation of the generalized Navier-Stokes equations is used… ▽ More

    Submitted 1 November, 2017; originally announced November 2017.

  43. arXiv:1709.06793  [pdf, other

    math.NA cs.CE

    A stencil scaling approach for accelerating matrix-free finite element implementations

    Authors: Simon Bauer, Daniel Drzisga, Marcus Mohr, Ulrich Ruede, Christian Waluga, Barbara Wohlmuth

    Abstract: We present a novel approach to fast on-the-fly low order finite element assembly for scalar elliptic partial differential equations of Darcy type with variable coefficients optimized for matrix-free implementations. Our approach introduces a new operator that is obtained by appropriately scaling the reference stiffness matrix from the constant coefficient case. Assuming sufficient regularity, an a… ▽ More

    Submitted 23 July, 2018; v1 submitted 20 September, 2017; originally announced September 2017.

  44. A Scalable Multiphysics Algorithm for Massively Parallel Direct Numerical Simulations of Electrophoresis

    Authors: Dominik Bartuschat, Ulrich Rüde

    Abstract: In this article we introduce a novel coupled algorithm for massively parallel direct numerical simulations of electrophoresis in microfluidic flows. This multiphysics algorithm employs an Eulerian description of fluid and ions, combined with a Lagrangian representation of moving charged particles. The fixed grid facilitates efficient solvers and the employed lattice Boltzmann method can efficientl… ▽ More

    Submitted 25 May, 2018; v1 submitted 29 August, 2017; originally announced August 2017.

    Comments: Accepted manuscript of publication in Journal of Computational Science (Elsevier)

  45. arXiv:1708.08286  [pdf, other

    cs.DC

    A Scalable and Extensible Checkpointing Scheme for Massively Parallel Simulations

    Authors: Nils Kohl, Johannes Hötzer, Florian Schornbaum, Martin Bauer, Christian Godenschwager, Harald Köstler, Britta Nestler, Ulrich Rüde

    Abstract: Realistic simulations in engineering or in the materials sciences can consume enormous computing resources and thus require the use of massively parallel supercomputers. The probability of a failure increases both with the runtime and with the number of system components. For future exascale systems it is therefore considered critical that strategies are developed to make software resilient agains… ▽ More

    Submitted 29 January, 2018; v1 submitted 28 August, 2017; originally announced August 2017.

  46. The Maximum Dissipation Principle in Rigid-Body Dynamics with Purely Inelastic Impacts

    Authors: Tobias Preclik, Sebastian Eibl, Ulrich Rüde

    Abstract: Formulating a consistent theory for rigid-body dynamics with impacts is an intricate problem. Twenty years ago Stewart published the first consistent theory with purely inelastic impacts and an impulsive friction model analogous to Coulomb friction. In this paper we demonstrate that the consistent impact model can exhibit multiple solutions with a varying degree of dissipation even in the single-c… ▽ More

    Submitted 1 June, 2017; originally announced June 2017.

    Journal ref: Springer, Computational Mechanics, 2017

  47. Extreme-Scale Block-Structured Adaptive Mesh Refinement

    Authors: Florian Schornbaum, Ulrich Rüde

    Abstract: In this article, we present a novel approach for block-structured adaptive mesh refinement (AMR) that is suitable for extreme-scale parallelism. All data structures are designed such that the size of the meta data in each distributed processor memory remains bounded independent of the processor number. In all stages of the AMR process, we use only distributed algorithms. No central resources such… ▽ More

    Submitted 13 April, 2018; v1 submitted 22 April, 2017; originally announced April 2017.

    Comments: 38 pages, 17 figures, 11 tables

    MSC Class: 68W10; 68W15; 68U20; 65Y05; 65Y20; 76P05

    Journal ref: SIAM J. Sci. Comput. 40-3 (2018), pp. C358-C387

  48. arXiv:1702.04910  [pdf, other

    cs.CE physics.comp-ph physics.flu-dyn

    A comparative study of fluid-particle coupling methods for fully resolved lattice Boltzmann simulations

    Authors: Christoph Rettinger, Ulrich Rüde

    Abstract: The direct numerical simulation of particulate systems offers a unique approach to study the dynamics of fluid-solid suspensions by fully resolving the submerged particles and without introducing empirical models. For the lattice Boltzmann method, different variants exist to incorporate the fluid-particle interaction into the simulation. This paper provides a detailed and systematic comparison of… ▽ More

    Submitted 16 February, 2017; originally announced February 2017.

    Comments: 29 pages, 13 figures, 4 tables

  49. arXiv:1612.01333  [pdf, ps, other

    math.NA

    On the analysis of block smoothers for saddle point problems

    Authors: Lorenz John, Ulrich Rüde, Barbara Wohlmuth, Walter Zulehner

    Abstract: In this article, we discuss several classes of Uzawa smoothers for the application in multigrid methods in the context of saddle point problems. Beside commonly used variants, such as the inexact and block factorization version, we also introduce a new symmetric method, belonging to the class of Uzawa smoothers. For these variants we unify the analysis of the smoothing properties, which is an impo… ▽ More

    Submitted 5 December, 2016; originally announced December 2016.

  50. arXiv:1610.02608  [pdf, other

    cs.CE math.HO stat.OT

    Research and Education in Computational Science and Engineering

    Authors: Ulrich Rüde, Karen Willcox, Lois Curfman McInnes, Hans De Sterck, George Biros, Hans Bungartz, James Corones, Evin Cramer, James Crowley, Omar Ghattas, Max Gunzburger, Michael Hanke, Robert Harrison, Michael Heroux, Jan Hesthaven, Peter Jimack, Chris Johnson, Kirk E. Jordan, David E. Keyes, Rolf Krause, Vipin Kumar, Stefan Mayer, Juan Meza, Knut Martin Mørken, J. Tinsley Oden , et al. (8 additional authors not shown)

    Abstract: Over the past two decades the field of computational science and engineering (CSE) has penetrated both basic and applied research in academia, industry, and laboratories to advance discovery, optimize systems, support decision-makers, and educate the scientific and engineering workforce. Informed by centuries of theory and experiment, CSE performs computational experiments to answer questions that… ▽ More

    Submitted 31 December, 2017; v1 submitted 8 October, 2016; originally announced October 2016.

    Comments: Major revision, to appear in SIAM Review

    Report number: Argonne National Laboratory Preprint ANL/MCS-P6054-0916 MSC Class: 00A72; 62-07; 68U20; 68W01; 68W10; 97A99; 97M10; 97N80; 97R20; 97R30 ACM Class: G.0; G.4; I.6; J.0; J.2; J.3; J.4; J.6; J.7; K.3.2