
Showing 1–9 of 9 results for author: Houzeaux, G

  1. arXiv:2403.08777

    cs.DC cs.PF

    Alya towards Exascale: Optimal OpenACC Performance of the Navier-Stokes Finite Element Assembly on GPUs

    Authors: Herbert Owen, Dominik Ernst, Thomas Gruber, Oriol Lemkuhl, Guillaume Houzeaux, Lucas Gasparino, Gerhard Wellein

    Abstract: This paper addresses the challenge of providing portable and highly efficient code structures for CPU and GPU architectures. We choose the assembly of the right-hand term in the incompressible flow module of the High-Performance Computational Mechanics code Alya, which is one of the two CFD codes in the Unified European Benchmark Suite. Starting from an efficient CPU-code and a related OpenACC-por…

    Submitted 22 January, 2024; originally announced March 2024.

  2. arXiv:2401.08447

    cs.DC physics.comp-ph

    Monitoring the development of CFD applications on unstable HPC platforms

    Authors: Damien Dosimont, Guillaume Houzeaux

    Abstract: We tackle the challenging tasks of monitoring on unstable HPC platforms the performance of CFD applications all along their development. We have designed and implemented a monitoring framework, integrated at the end of a CI-CD pipeline. Measures retrieved during the automatic execution of production simulations are analyzed within a visual analytics interface we developed, providing advanced visua…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: ParCFD2023 34th International Conference on Parallel Computational Fluid Dynamics, May 29-31 2023, Cuenca, Ecuador

  3. arXiv:2210.11917

    cs.DC cs.PF physics.app-ph

    A portable coding strategy to exploit vectorization on combustion simulations

    Authors: Fabio Banchelli, Guillermo Oyarzun, Marta Garcia-Gasulla, Filippo Mantovani, Ambrus Both, Guillaume Houzeaux, Daniel Mira

    Abstract: The complexity of combustion simulations demands the latest high-performance computing tools to accelerate its time-to-solution results. A current trend on HPC systems is the utilization of CPUs with SIMD or vector extensions to exploit data parallelism. Our work proposes a strategy to improve the automatic vectorization of finite element-based scientific codes. The approach applies a parametric c…

    Submitted 21 October, 2022; originally announced October 2022.

  4. A parallel algorithm for unilateral contact problems

    Authors: G. Guillamet, M. Rivero, M. Zavala-Aké, M. Vázquez, G. Houzeaux, S. Oller

    Abstract: In this paper, we introduce a novel parallel contact algorithm designed to run efficiently in High-Performance Computing based supercomputers. Particular emphasis is put on its computational implementation in a multiphysics finite element code. The algorithm is based on the method of partial Dirichlet-Neumann boundary conditions and is capable to solve numerically a nonlinear contact problem betwe…

    Submitted 1 August, 2022; v1 submitted 27 July, 2022; originally announced July 2022.

    Comments: 26 pages, 23 figures

    MSC Class: 74M15; 74M20; 74F99; 74S05; 65Y05; ACM Class: J.2; I.6.3

    Journal ref: Computers & Structures, Volume 271 (2022) 106862

  5. Dynamic resource allocation for efficient parallel CFD simulations

    Authors: G. Houzeaux, R. M. Badia, R. Borrell, D. Dosimont, J. Ejarque, M. Garcia-Gasulla, V. López

    Abstract: CFD users of supercomputers usually resort to rule-of-thumb methods to select the number of subdomains (partitions) when relying on MPI-based parallelization. One common approach is to set a minimum number of elements or cells per subdomain, under which the parallel efficiency of the code is "known" to fall below a subjective level, say 80%. The situation is even worse when the user is not aware o…

    Submitted 29 June, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

    Comments: 27 pages, 15 figures

    MSC Class: 35-04; ACM Class: D.1; D.2; J.2; J.6

  6. arXiv:2107.11541

    cs.DC

    Performance assessment of CUDA and OpenACC in large scale combustion simulations

    Authors: Guillermo Oyarzun, Daniel Mira, Guillaume Houzeaux

    Abstract: GPUs have climbed up to the top of supercomputer systems making life harder to many legacy scientific codes. Nowadays, many recipes are being used in such code's portability, without any clarity of which is the best option. We present a comparative analysis of the two most common approaches, CUDA and OpenACC, into the multi-physics CFD code Alya. Our focus is the combustion problems which are one…

    Submitted 31 July, 2021; v1 submitted 24 July, 2021; originally announced July 2021.

  7. HPC compact quasi-Newton algorithm for interface problems

    Authors: A. Santiago, M. Zavala-Aké, R. Borell, G. Houzeaux

    Abstract: In this work we present a robust interface coupling algorithm called Compact Interface quasi-Newton (CIQN). It is designed for computationally intensive applications using an MPI multi-code partitioned scheme. The algorithm allows to reuse information from previous time steps, feature that has been previously proposed to accelerate convergence. Through algebraic manipulation, an efficient usage of…

    Submitted 1 June, 2020; v1 submitted 22 May, 2020; originally announced May 2020.

    Comments: 33 pages: 23 manuscript, 10 appendix. 16 figures: 4 manuscript, 12 appendix. 10 Tables: 3 manuscript, 7 appendix

    MSC Class: 68U20; 00A72; 68Q85; 65-04; 74F10

    Journal ref: Journal of Fluids and Structures (2020) 103009

  8. Heterogeneous CPU/GPU co-execution of CFD simulations on the POWER9 architecture: Application to airplane aerodynamics

    Authors: R. Borrell, D. Dosimont, M. Garcia-Gasulla, G. Houzeaux, O. Lehmkuhl, V. Mehta, H. Owen, M. Vazquez, G. Oyarzun

    Abstract: High fidelity Computational Fluid Dynamics simulations are generally associated with large computing requirements, which are progressively acute with each new generation of supercomputers. However, significant research efforts are required to unlock the computing power of leading-edge systems, currently referred to as pre-Exascale systems, based on increasingly complex architectures. In this paper…

    Submitted 6 July, 2020; v1 submitted 12 May, 2020; originally announced May 2020.

    Journal ref: Future Generation Computer Systems, Volume 107, 2020, Pages 31-48

  9. arXiv:1805.03949

    cs.MS cs.DC cs.PF cs.PL

    MPI+X: task-based parallelization and dynamic load balance of finite element assembly

    Authors: Marta Garcia-Gasulla, Guillaume Houzeaux, Roger Ferrer, Antoni Artigues, Victor López, Jesús Labarta, Mariano Vázquez

    Abstract: The main computing tasks of a finite element code (FE) for solving partial differential equations (PDEs) are the algebraic system assembly and the iterative solver. This work focuses on the first task, in the context of a hybrid MPI+X paradigm. Although we will describe algorithms in the FE context, a similar strategy can be straightforwardly applied to other discretization methods, like the finit…

    Submitted 9 May, 2018; originally announced May 2018.