
Requirements analysis for HPC&HTC infrastructures integration in ESCAPE Science Analysis Platform

Authors: S. Bertocco, D. Goz, S. A. Russo, M. Molinaro, G. Taffoni

Abstract: ESCAPE (European Science Cluster of Astronomy and Particle physics ESFRI research infrastructures) is a project to set up a cluster of ESFRI (European Strategy Forum on Research Infrastructures) facilities for astronomy, astroparticle and particle physics to face the challenges emerging from modern multi-disciplinary, data-driven science. One of the main goals of ESCAPE is to build ESAP (ESFRI Science Analysis Platform), a science platform for the analysis of open access data available through the EOSC (European Open Science Cloud) environment. ESAP will allow EOSC researchers to identify and stage existing data collections for analysis, share data, and share and run scientific workflows. For many of the concerned ESFRIs and RIs, the data scales involved require significant computational resources (storage and compute) to support processing and analysis. The EOSC-ESFRI science platform must therefore implement appropriate interfaces to an underlying HPC (High Performance Computing) or HTC (High Throughput Computing) infrastructure to take advantage of it. This poster describes the analysis done to identify the main requirements for implementing the interfaces that integrate ESAP data access and computation resources with HPC and HTC infrastructures, in terms of authentication and authorization policies, data management, and workflow deployment and execution.

Submitted 22 February, 2023; originally announced February 2023.

Comments: Will appear in Proceedings ADASS XXX ASP Conference Series

arXiv:2301.06061 [pdf, other]

High Performance W-stacking for Imaging Radio Astronomy Data: a Parallel and Accelerated Solution

Authors: Claudio Gheller, Giuliano Taffoni, David Goz

Abstract: Current and upcoming radio-interferometers are expected to produce volumes of data of increasing size that need to be processed in order to generate the corresponding sky brightness distributions through imaging. This represents an outstanding computational challenge, especially when large fields of view and/or high resolution observations are processed. We have investigated the adoption of modern High Performance Computing systems specifically addressing the gridding, FFT-transform and w-correction of imaging, combining parallel and accelerated solutions. We have demonstrated that the code we have developed can support datasets and images of any size compatible with the available hardware, efficiently scaling up to thousands of cores or hundreds of GPUs, keeping the time to solution below one hour even when images of the order of a billion or tens of billions of pixels are generated. In addition, portability has been targeted as a primary objective, both in terms of usability on different computing platforms and in terms of performance. The presented results have been obtained on two different state-of-the-art High Performance Computing architectures.

Submitted 15 January, 2023; originally announced January 2023.

Comments: 16 pages, 12 figures, accepted for publication in RAS Techniques and Instruments

arXiv:2105.14219 [pdf, other]

Machine Learning for Performance Prediction of Channel Bonding in Next-Generation IEEE 802.11 WLANs

Authors: Francesc Wilhelmi, David Góez, Paola Soto, Ramon Vallés, Mohammad Alfaifi, Abdulrahman Algunayah, Jorge Martin-Pérez, Luigi Girletti, Rajasekar Mohan, K Venkat Ramnan, Boris Bellalta

Abstract: With the advent of Artificial Intelligence (AI)-empowered communications, industry, academia, and standardization organizations are progressing on the definition of mechanisms and procedures to address the increasing complexity of future 5G and beyond communications. In this context, the International Telecommunication Union (ITU) organized the first AI for 5G Challenge to bring industry and academia together to introduce and solve representative problems related to the application of Machine Learning (ML) to networks. In this paper, we present the results gathered from Problem Statement 13 (PS-013), organized by Universitat Pompeu Fabra (UPF), whose primary goal was predicting the performance of next-generation Wireless Local Area Networks (WLANs) applying Channel Bonding (CB) techniques. In particular, we overview the ML models proposed by participants (including Artificial Neural Networks, Graph Neural Networks, Random Forest regression, and gradient boosting) and analyze their performance on an open dataset generated using the IEEE 802.11ax-oriented Komondor network simulator. The accuracy achieved by the proposed methods demonstrates the suitability of ML for predicting the performance of WLANs. Moreover, we discuss the importance of abstracting WLAN interactions to achieve better results, and we argue that there is certainly room for improvement in throughput prediction through ML.

Submitted 29 May, 2021; originally announced May 2021.

arXiv:2003.10850 [pdf, other]

doi 10.3233/APC200043

Gadget3 on GPUs with OpenACC

Authors: Antonio Ragagnin, Klaus Dolag, Mathias Wagner, Claudio Gheller, Conradin Roffler, David Goz, David Hubber, Alexander Arth

Abstract: We present preliminary results of a GPU porting of all main Gadget3 modules (gravity computation, SPH density computation, SPH hydrodynamic force, and thermal conduction) using OpenACC directives. Here we assign one GPU to each MPI rank and exploit both the host and accelerator capabilities by overlapping computations on the CPUs and GPUs: while GPUs asynchronously compute interactions between particles within their MPI ranks, CPUs perform tree-walks and MPI communications of neighbouring particles. We profile various portions of the code to understand the origin of our speedup, where we find that a peak speedup is not achieved because of time-steps with few active particles. We run a hydrodynamic cosmological simulation from the Magneticum project, with $2\cdot10^{7}$ particles, where we find a final total speedup of $\approx 2$. We also present the results of an encouraging scaling test of a preliminary gravity-only OpenACC porting, run in the context of the EuroHack17 event, where the prototype of the porting proved to keep a constant speedup up to $1024$ GPUs.

Submitted 24 March, 2020; originally announced March 2020.

Comments: 10 pages, 4 figures, printed by ParCo 2019 (at IOS Advances in Parallel Computing, Volume 36, pages 209 - 218, ISBN 978-1-64368-070-5)

arXiv:2003.03283 [pdf, other]

Performance and energy footprint assessment of FPGAs and GPUs on HPC systems using Astrophysics application

Authors: David Goz, Georgios Ieronymakis, Vassilis Papaefstathiou, Nikolaos Dimou, Sara Bertocco, Francesco Simula, Antonio Ragagnin, Luca Tornatore, Igor Coretti, Giuliano Taffoni

Abstract: New challenges in Astronomy and Astrophysics (AA) are urging the need for a large number of exceptionally computationally intensive simulations. "Exascale" (and beyond) computational facilities are mandatory to address the size of theoretical problems and data coming from the new generation of observational facilities in AA. Currently, the High Performance Computing (HPC) sector is undergoing a profound phase of innovation, in which the primary challenge to the achievement of the "Exascale" is power consumption. The goal of this work is to give some insights into the performance and energy footprint of contemporary architectures for a real astrophysical application in an HPC context. We use a state-of-the-art N-body application that we re-engineered and optimized to fully exploit the heterogeneous underlying hardware. We quantitatively evaluate the impact of computation on energy consumption when running on four different platforms. Two of them represent the current HPC systems (Intel-based and equipped with NVIDIA GPUs), one is a micro-cluster based on ARM-MPSoC, and one is a "prototype towards Exascale" equipped with ARM-MPSoCs tightly coupled with FPGAs. We investigate the behavior of the different devices, where the high-end GPUs excel in terms of time-to-solution while MPSoC-FPGA systems outperform GPUs in power consumption. Our experience reveals that considering FPGAs for computationally intensive applications seems very promising, as their performance is improving to meet the requirements of scientific applications. This work can be a reference for the development of future platforms for astrophysics applications where computationally intensive calculations are required.

Submitted 10 April, 2020; v1 submitted 6 March, 2020; originally announced March 2020.

Comments: 15 pages, 4 figures, 3 tables; Preprint (V2) submitted to MDPI (Special Issue: Energy-Efficient Computing on Parallel Architectures)

arXiv:1912.05340 [pdf, ps, other]

INAF Trieste Astronomical Observatory Information Technology Framework

Authors: S. Bertocco, D. Goz, L. Tornatore, A. Ragagnin, G. Maggio, F. Gasparo, C. Vuerli, G. Taffoni, M. Molinaro

Abstract: INAF Trieste Astronomical Observatory (OATs) has a long tradition in information technology applied to Astronomical and Astrophysical use cases, particularly as regards computing for data reduction, analysis and simulations; data and archives management; space missions data processing; design and software development for ground-based instruments. In recent years, the ensemble of these activities has driven the need to acquire new computing resources and technologies and to deepen competences in their management. In this paper we describe INAF-OATs computing centre technological stuff, our involvement in different EU Projects, both on the path to building EOSC, the European Open Science Cloud, and in the design and prototyping of new Exascale supercomputers in Europe, and the main research activities carried out using our computing centre.

Submitted 11 December, 2019; originally announced December 2019.

arXiv:1910.14496 [pdf, other]

Direct N-body application on low-power and energy-efficient parallel architectures

Authors: D. Goz, G. Ieronymakis, V. Papaefstathiou, N. Dimou, S. Bertocco, A. Ragagnin, L. Tornatore, G. Taffoni, I. Coretti

Abstract: The aim of this work is to quantitatively evaluate the impact of computation on the energy consumption on ARM MPSoC platforms, exploiting CPUs, embedded GPUs and FPGAs. One of them possibly represents the future of High Performance Computing systems: a prototype of an Exascale supercomputer. Performance and energy measurements are made using a state-of-the-art direct $N$-body code from the astrophysical domain. We provide a comparison of the time-to-solution and energy delay product metrics, for different software configurations. We have shown that FPGA technologies can be used for application kernel acceleration and are emerging as a promising alternative to "traditional" technologies for HPC, which focus purely on peak performance rather than on power efficiency.

Submitted 31 October, 2019; originally announced October 2019.

Comments: 10 pages, 5 figures, 2 tables; The final publication will be available at IOS Press
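
Note on the metric named in this abstract: the energy delay product is conventionally the energy-to-solution multiplied by the time-to-solution, so it penalises a platform that saves energy only by running much longer. The sketch below is a minimal illustration under the assumption of a constant mean power draw; the function names and the sample numbers are hypothetical and are not taken from the paper.

```python
def energy_to_solution(mean_power_watts: float, time_seconds: float) -> float:
    """Energy in joules, assuming a constant mean power draw (an assumption)."""
    if mean_power_watts < 0 or time_seconds < 0:
        raise ValueError("power and time must be non-negative")
    return mean_power_watts * time_seconds


def energy_delay_product(energy_joules: float, time_seconds: float) -> float:
    """Energy delay product in joule-seconds: EDP = E * T."""
    if energy_joules < 0 or time_seconds < 0:
        raise ValueError("energy and time must be non-negative")
    return energy_joules * time_seconds


# Hypothetical run: 10 W mean draw for 2 s.
energy = energy_to_solution(10.0, 2.0)   # 20 J
edp = energy_delay_product(energy, 2.0)  # 40 J*s
```

Lower values are better for both metrics; comparing platforms on the product rather than on energy alone keeps time-to-solution in the picture.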

arXiv:1904.11720 [pdf, other]

Shall numerical astrophysics step into the era of Exascale computing?

Authors: Giuliano Taffoni, Giuseppe Murante, Luca Tornatore, David Goz, Stefano Borgani, Manolis Katevenis, Nikolaos Chrysos, Manolis Marazakis

Abstract: High performance computing numerical simulations are today one of the most effective instruments to implement and study new theoretical models, and they are mandatory during the preparatory phase and operational phase of any scientific experiment. New challenges in Cosmology and Astrophysics will require a large number of new extremely computationally intensive simulations to investigate physical processes at different scales. Moreover, the size and complexity of the new generation of observational facilities also imply a new generation of high performance data reduction and analysis tools pushing toward the use of Exascale computing capabilities. Exascale supercomputers cannot be produced today. We discuss the major technological challenges in the design, development and use of such computing capabilities and we report on the progress that has been made in recent years in Europe, in particular in the framework of the ExaNeSt European funded project. We also discuss the impact of these new computing resources on the numerical codes in Astronomy and Astrophysics.

Submitted 26 April, 2019; originally announced April 2019.

Comments: 3 figures, invited talk for proceedings of ADASS XXVI, accepted by ASP Conference Series

arXiv:1901.08532 [pdf, other]

doi 10.1007/978-3-030-22871-2_14

Direct $N$-body code on low-power embedded ARM GPUs

Authors: David Goz, Sara Bertocco, Luca Tornatore, Giuliano Taffoni

Abstract: This work arises in the context of the ExaNeSt project, which aims at the design and development of an exascale ready supercomputer with a low energy consumption profile but able to support the most demanding scientific and technical applications. The ExaNeSt compute unit consists of densely-packed low-power 64-bit ARM processors, embedded within Xilinx FPGA SoCs. SoC boards are heterogeneous architectures where computing power is supplied both by CPUs and GPUs, and are emerging as a possible low-power and low-cost alternative to clusters based on traditional CPUs. A state-of-the-art direct $N$-body code suitable for astrophysical simulations has been re-engineered in order to exploit SoC heterogeneous platforms based on ARM CPUs and embedded GPUs. Performance tests show that embedded GPUs can be effectively used to accelerate real-life scientific calculations, and that they are promising also because of their energy efficiency, which is a crucial design aspect of future exascale platforms.

Submitted 24 January, 2019; originally announced January 2019.

Comments: 16 pages, 7 figures, 1 table, accepted for publication in the Computing Conference 2019 proceedings

Journal ref: Intelligent Computing. CompCom 2019. Advances in Intelligent Systems and Computing, vol 997. Springer, Cham

arXiv:1812.00367 [pdf, ps, other]

Astrophysical code migration into Exascale Era

Authors: David Goz, Sara Bertocco, Luca Tornatore, Giuliano Taffoni

Abstract: The ExaNeSt and EuroExa H2020 EU-funded projects aim to design and develop an exascale ready computing platform prototype based on low-energy-consumption ARM64 cores and FPGA accelerators. We participate in the application-driven design of the hardware solutions and prototype validation. To carry out this work we are using, among others, Hy-Nbody, a state-of-the-art direct N-body code. Core algorithms of Hy-Nbody have been improved in such a way as to increasingly fit them to the exascale target platform. While waiting for the ExaNeSt prototype release, we are performing tests and code tuning operations on an ARM64 SoC facility: a SLURM-managed HPC cluster based on 64-bit ARMv8 Cortex-A72/Cortex-A53 core design and powered by a Mali-T864 embedded GPU. In parallel, we are porting a kernel of Hy-Nbody to FPGA, aiming to test and compare the performance-per-watt of our algorithms on different platforms. In this paper we describe how we re-engineered the application and we show first results on ARM SoC.

Submitted 2 December, 2018; originally announced December 2018.

Comments: 4 pages, 1 figure, 1 table; proceedings of ADASS XXVIII, accepted by ASP Conference Series

arXiv:1712.00252 [pdf, ps, other]

Cosmological Simulations in Exascale Era

Authors: D. Goz, L. Tornatore, G. Taffoni, G. Murante

Abstract: The architecture of Exascale computing facilities, which involves millions of heterogeneous processing units, will deeply impact scientific applications. Future astrophysical HPC applications must be designed to make such computing systems exploitable. The ExaNeSt H2020 EU-funded project aims to design and develop an exascale ready prototype based on low-energy-consumption ARM64 cores and FPGA accelerators. We participate in the design of the platform and in the validation of the prototype with cosmological N-body and hydrodynamical codes suited to perform large-scale, high-resolution numerical simulations of cosmic structure formation and evolution. We discuss our activities on astrophysical applications to take advantage of the underlying architecture.

Submitted 1 December, 2017; originally announced December 2017.

Comments: submitted to ASP

arXiv:1610.09843 [pdf, other]

doi 10.1093/mnras/stx869

Panchromatic Spectral Energy Distributions of simulated galaxies: results at redshift $z=0$

Authors: David Goz, Pierluigi Monaco, Gian Luigi Granato, Giuseppe Murante, Rosa Domínguez-Tenreiro, Aura Obreja, Marianna Annunziatella, Edoardo Tescari

Abstract: We present predictions of Spectral Energy Distributions (SEDs), from the UV to the FIR, of simulated galaxies at $z=0$. These were obtained by post-processing the results of an N-body+hydro simulation of a small cosmological volume, that uses the Multi-Phase Particle Integrator (MUPPI) for star formation and stellar feedback, with the GRASIL-3D radiative transfer code, that includes reprocessing of UV light by dust. Physical properties of galaxies resemble observed ones, though with some tension at small and large stellar masses. Comparing predicted SEDs of simulated galaxies with different samples of local galaxies, we find that these resemble observed ones, when normalised at 3.6 $μ$m. A comparison with the Herschel Reference Survey shows that, when binning galaxies in Star Formation Rate (SFR), average SEDs are reproduced to within a factor of $\sim2$ even in normalization, while binning in stellar mass highlights the same tension that is present in the stellar mass -- SFR plane. We use our sample to investigate the correlation of IR luminosity in Spitzer and Herschel bands with several galaxy properties. SFR is the quantity that best correlates with IR light up to $160\ μ$m, while at longer wavelengths better correlations are found with molecular mass and, at $500\ μ$m, with dust mass. However, using the position of the FIR peak as a proxy for cold dust temperature, we assess that heating of cold dust is mostly determined by SFR, with stellar mass giving only a minor contribution. We finally show how our sample of simulated galaxies can be used as a guide to understand the physical properties and selection biases of observed samples.

Submitted 31 October, 2016; originally announced October 2016.

Comments: 18 pages, 11 figures, Submitted to MNRAS

arXiv:1412.2883 [pdf, ps, other]

Properties of barred spiral disks in hydrodynamical cosmological simulations

Authors: David Goz, Pierluigi Monaco, Giuseppe Murante, Anna Curir

Abstract: We present a quantification of the properties of bars in two N-body+SPH cosmological simulations of spiral galaxies, named GA and AqC. The initial conditions were obtained using the zoom-in technique and represent two dark matter (DM) halos of $2-3\times10^{12}\ {\rm M}_\odot$, available at two different resolutions. The resulting galaxies are presented in the companion paper of Murante et al. (2014). We find that the GA galaxy has a bar of length $8.8$ kpc, present at both resolution levels, albeit with a slightly different strength. Classical bar signatures (e.g. pattern of streaming motions, high $m=2$ Fourier mode with roughly constant phase) are consistently found at both resolutions. Though a close encounter with a merging satellite at $z\sim0.6$ (mass ratio $1:50$) causes a strong, transient spiral pattern and some heating of the disk, we find that bar instability is due to a secular process, caused by a low Toomre parameter $Q\lesssim1$ due to accumulation of mass in the disk. The AqC galaxy has a slightly different history: it suffers a similar tidal disturbance due to a merging satellite at $z\sim0.5$ but with a mass ratio of $1:32$, that triggers a bar in the high-resolution simulation, while at low resolution the merging is found to take place at a later time, so that both secular evolution and merging are plausible triggers for bar instability.

Submitted 9 December, 2014; originally announced December 2014.

Comments: 15 pages, 18 figures, MNRAS accepted
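
Note on the Toomre parameter invoked in this abstract: for a stellar disk it is commonly written as below. The abstract does not say which form the authors evaluated, so the stellar-disk expression is an assumption made here for orientation only.

$$Q = \frac{\sigma_R\,\kappa}{3.36\,G\,\Sigma}$$

Here $\sigma_R$ is the radial velocity dispersion, $\kappa$ the epicyclic frequency, $G$ the gravitational constant and $\Sigma$ the disk surface density. At fixed $\sigma_R$ and $\kappa$, $Q$ scales as $1/\Sigma$, so accumulating mass in the disk lowers $Q$; values of $Q<1$ mark local instability to axisymmetric perturbations.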

arXiv:1411.3671 [pdf, ps, other]

Simulating realistic disk galaxies with a novel sub-resolution ISM model

Authors: Giuseppe Murante, Pierluigi Monaco, Stefano Borgani, Luca Tornatore, Klaus Dolag, David Goz

Abstract: We present results of cosmological simulations of disk galaxies carried out with the GADGET-3 TreePM+SPH code, where star formation and stellar feedback are described using our MUlti Phase Particle Integrator (MUPPI) model. This description is based on a simple multi-phase model of the interstellar medium at unresolved scales, where mass and energy flows among the components are explicitly followed by solving a system of ordinary differential equations. Thermal energy from SNe is injected into the local hot phase, so that it is not promptly radiated away. A kinetic feedback prescription generates the massive outflows needed to avoid the over-production of stars. We use two sets of zoomed-in initial conditions of isolated cosmological halos with masses $(2-3)\times10^{12}\ {\rm M}_\odot$, both available at several resolution levels. In all cases we obtain spiral galaxies with small bulge-over-total stellar mass ratios ($B/T \approx 0.2$), extended stellar and gas disks, flat rotation curves and realistic values of stellar masses. Gas profiles are relatively flat, molecular gas is found to dominate at the centre of galaxies, with star formation rates following the observed Schmidt-Kennicutt relation. Stars kinematically belonging to the bulge form early, while disk stars show a clear inside-out formation pattern and mostly form after redshift $z=2$. However, the baryon conversion efficiencies in our simulations differ from the relation given by Moster et al. (2010) at a 3 sigma level, thus indicating that our stellar disks are still too massive for the Dark Matter halo in which they reside. Results are found to be remarkably stable against resolution. This further demonstrates the feasibility of carrying out simulations producing a realistic population of galaxies within representative cosmological volumes, at a relatively modest resolution.

Submitted 14 November, 2014; v1 submitted 13 November, 2014; originally announced November 2014.

Comments: 19 pages, 21 figures, MNRAS accepted

Showing 1–14 of 14 results for author: Goz, D