Baler -- Machine Learning Based Compression of Scientific Data
Authors:
Fritjof Bengtsson,
Caterina Doglioni,
Per Alexander Ekman,
Axel Gallén,
Pratik Jawahar,
Alma Orucevic-Alagic,
Marta Camps Santasmasas,
Nicola Skidmore,
Oliver Woolland
Abstract:
Storing and sharing increasingly large datasets is a challenge across scientific research and industry. In this paper, we document the development and applications of Baler - a Machine Learning based data compression tool for use across scientific disciplines and industry. Here, we present Baler's performance for the compression of High Energy Physics (HEP) data, as well as its application to Comp…
▽ More
Storing and sharing increasingly large datasets is a challenge across scientific research and industry. In this paper, we document the development and applications of Baler - a Machine Learning based data compression tool for use across scientific disciplines and industry. Here, we present Baler's performance for the compression of High Energy Physics (HEP) data, as well as its application to Computational Fluid Dynamics (CFD) toy data as a proof-of-principle. We also present suggestions for cross-disciplinary guidelines to enable feasibility studies for machine learning based compression for scientific data.
△ Less
Submitted 16 February, 2024; v1 submitted 3 May, 2023;
originally announced May 2023.
Near-wall approximations to speed up simulations for atmosphere boundary layers in the presence of forests using lattice Boltzmann method on GPU
Authors:
Xinyuan Shao,
Marta Camps Santasmasas,
Xiao Xue,
Jiqiang Niu,
Lars Davidson,
Alistair J. Revell,
Hua-Dong Yao
Abstract:
Forests play an important role in influencing the wind resource in atmospheric boundary layers and the fatigue life of wind turbines. Due to turbulence, a difficulty in the simulation of the forest effects is that flow statistical and fluctuating content should be accurately resolved using a turbulence-resolved CFD method, which requires a large amount of computing time and resources. In this pape…
▽ More
Forests play an important role in influencing the wind resource in atmospheric boundary layers and the fatigue life of wind turbines. Due to turbulence, a difficulty in the simulation of the forest effects is that flow statistical and fluctuating content should be accurately resolved using a turbulence-resolved CFD method, which requires a large amount of computing time and resources. In this paper, we demonstrate a fast but accurate simulation platform that uses a lattice Boltzmann method with large eddy simulation on Graphic Processing Units (GPU). The simulation tool is the open-source program, GASCANS, developed at the University of Manchester. The simulation platform is validated based on canonical wall-bounded turbulent flows. A forest is modelled in the form of body forces injected near the wall. Since a uniform cell size is applied throughout the computational domain, the averaged first-layer cell height over the wall reaches to $\langle Δy^+\rangle = 165$. Simulation results agree well with previous experiments and numerical data obtained from finite volume methods. We demonstrate that good results are possible without the use of a wall-function, since the forest forces overwhelm wall friction. This is shown to hold as long as the forest region is resolved with several cells. In addition to the GPU speedup, the approximations also significantly benefit the computation efficiency.
△ Less
Submitted 30 June, 2022;
originally announced June 2022.