Showing 1–2 of 2 results for author: Feichtinger, C

  1. arXiv:1112.0850

    cs.PF

    Performance engineering for the Lattice Boltzmann method on GPGPUs: Architectural requirements and performance results

    Authors: Johannes Habich, Christian Feichtinger, Harald Köstler, Georg Hager, Gerhard Wellein

    Abstract: GPUs offer several times the floating point performance and memory bandwidth of current standard two socket CPU servers, e.g. NVIDIA C2070 vs. Intel Xeon Westmere X5650. The lattice Boltzmann method has been established as a flow solver in recent years and was one of the first flow solvers to be successfully ported and that performs well on GPUs. We demonstrate advanced optimization strategies for…

    Submitted 5 December, 2011; originally announced December 2011.

    Comments: 10 pages, 7 figures, 4 tables, preprint submitted to Computers and Fluids journal

  2. A Flexible Patch-Based Lattice Boltzmann Parallelization Approach for Heterogeneous GPU-CPU Clusters

    Authors: Christian Feichtinger, Johannes Habich, Harald Koestler, Georg Hager, Ulrich Ruede, Gerhard Wellein

    Abstract: Sustaining a large fraction of single GPU performance in parallel computations is considered to be the major problem of GPU-based clusters. In this article, this topic is addressed in the context of a lattice Boltzmann flow solver that is integrated in the WaLBerla software framework. We propose a multi-GPU implementation using a block-structured MPI parallelization, suitable for load balancing an…

    Submitted 8 July, 2010; originally announced July 2010.

    Comments: 20 pages, 12 figures

    Journal ref: Parallel Computing 37(9), 536-549 (2011)