Search | arXiv e-print repository

Cache Blocking for Flux Reconstruction: Extension to Navier-Stokes Equations and Anti-aliasing

Authors: Semih Akkurt, Freddie Witherden, Peter Vincent

Abstract: In this article, cache blocking is implemented for the Navier Stokes equations with anti-aliasing support on mixed grids in PyFR for CPUs. In particular, cache blocking is used as an alternative to kernel fusion to eliminate unnecessary data movements between kernels at the main memory level. Specifically, kernels that exchange data are grouped together, and these groups are then executed on small… ▽ More In this article, cache blocking is implemented for the Navier Stokes equations with anti-aliasing support on mixed grids in PyFR for CPUs. In particular, cache blocking is used as an alternative to kernel fusion to eliminate unnecessary data movements between kernels at the main memory level. Specifically, kernels that exchange data are grouped together, and these groups are then executed on small sub-regions of the domain that fit in per-core private data cache. Additionally, cache blocking is also used to efficiently implement a tensor product factorisation of the interpolation operators associated with anti-aliasing. By using cache blocking, the intermediate results between application of the sparse factors are stored in per-core private data cache, and a significant amount of data movement from main memory is avoided. In order to assess the performance gains a theoretical model is developed, and the implementation is benchmarked using a compressible 3D Taylor-Green vortex test case on both hexahedral and prismatic grids, with third- and forth-order solution polynomials. The expected performance gains based on the theoretical model range from 1.99 to 2.62, and the speedups obtained in practice range from 1.67 to 3.67 compared to PyFR v1.11.0. △ Less

Submitted 6 November, 2023; originally announced January 2024.

arXiv:2308.00700 [pdf, other]

Validation of wall boundary conditions for simulating complex fluid flows via the Boltzmann equation: Momentum transport and skin friction

Authors: Tarik Dzanic, Freddie D. Witherden, Luigi Martinelli

Abstract: The influence and validity of wall boundary conditions for non-equilibrium fluid flows described by the Boltzmann equation remains an open problem. The substantial computational cost of directly solving the Boltzmann equation has limited the extent of numerical validation studies to simple, often two-dimensional, flow problems. Recent algorithmic advancements for the Boltzmann--BGK equation introd… ▽ More The influence and validity of wall boundary conditions for non-equilibrium fluid flows described by the Boltzmann equation remains an open problem. The substantial computational cost of directly solving the Boltzmann equation has limited the extent of numerical validation studies to simple, often two-dimensional, flow problems. Recent algorithmic advancements for the Boltzmann--BGK equation introduced by the authors, consisting of a high-order spatial discretization augmented with a discretely-conservative velocity model, have made it feasible to accurately simulate unsteady three-dimensional flow problems across both the rarefied and continuum regimes. This work presents a comprehensive evaluation and validation of wall boundary conditions across a variety of flow regimes, primarily for the purpose of exploring their effects on momentum transfer in the low Mach limit. Results are presented for a range of steady and unsteady wall-bounded flow problems across both the rarefied and continuum regimes, from canonical two-dimensional laminar flows to unsteady three-dimensional transitional and turbulent flows, the latter of which are the first instances of wall-bounded turbulent flows computed by directly solving the Boltzmann equation. We show that approximations of the molecular gas dynamics equations can accurately predict both non-equilibrium phenomena and complex hydrodynamic flow instabilities and show how spatial and velocity domain resolution affect the accuracy. The results indicate that an accurate approximation of particle transport (i.e. high spatial resolution) is significantly more important than particle collision (i.e. high velocity domain resolution) for predicting flow instabilities and momentum transfer consistent with that predicted by the hydrodynamic equations and that these effects can be computed accurately even with very few degrees of freedom in the velocity domain. △ Less

Submitted 31 December, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

Comments: 34 pages, 36 figures

arXiv:2306.06744 [pdf, other]

An efficient GPU-based h-adaptation framework via linear trees for the flux reconstruction method

Authors: Lai Wang, Freddie Witherden, Antony Jameson

Abstract: In this paper, we develop the first entirely graphic processing unit (GPU) based h-adaptive flux reconstruction (FR) method with linear trees. The adaptive solver fully operates on the GPU hardware, using a linear quadtree for two dimensional (2D) problems and a linear octree for three dimensional (3D) problems. We articulate how to efficiently perform tree construction, 2:1 balancing, connectivit… ▽ More In this paper, we develop the first entirely graphic processing unit (GPU) based h-adaptive flux reconstruction (FR) method with linear trees. The adaptive solver fully operates on the GPU hardware, using a linear quadtree for two dimensional (2D) problems and a linear octree for three dimensional (3D) problems. We articulate how to efficiently perform tree construction, 2:1 balancing, connectivity query, and how to perform adaptation for the flux reconstruction method on the GPU hardware. As a proof of concept, we apply the adaptive flux reconstruction method to solve the inviscid isentropic vortex propagation problem on 2D and 3D meshes to demonstrate the efficiency of the developed adaptive FR method on a single GPU card. Depending on the computational domain size, acceleration of one or two orders of magnitude can be achieved compared to uniform meshing. The total computational cost of adaption, including tree manipulations, connectivity query and data transfer, compared to that of the numerical solver, is insignificant. It can be less than 2% of the total wall clock time for 3D problems even if we perform adaptation as frequent as every 10 time steps with an explicit 3-stage Runge--Kutta time integrator. △ Less

Submitted 11 June, 2023; originally announced June 2023.

arXiv:2302.13359 [pdf, other]

On the anti-aliasing properties of entropy filtering for discontinuous spectral element approximations of under-resolved turbulent flows

Authors: Tarik Dzanic, Will Trojak, Freddie D. Witherden

Abstract: For large Reynolds number flows, it is typically necessary to perform simulations that are under-resolved with respect to the underlying flow physics. For nodal discontinuous spectral element approximations of these under-resolved flows, the collocation projection of the nonlinear flux can introduce aliasing errors which can result in numerical instabilities. In Dzanic and Witherden (J. Comput. Ph… ▽ More For large Reynolds number flows, it is typically necessary to perform simulations that are under-resolved with respect to the underlying flow physics. For nodal discontinuous spectral element approximations of these under-resolved flows, the collocation projection of the nonlinear flux can introduce aliasing errors which can result in numerical instabilities. In Dzanic and Witherden (J. Comput. Phys., 468, 2022), an entropy-based adaptive filtering approach was introduced as a robust, parameter-free shock-capturing method for discontinuous spectral element methods. This work explores the ability of entropy filtering for mitigating aliasing-driven instabilities in the simulation of under-resolved turbulent flows through high-order implicit large eddy simulations of a NACA0021 airfoil in deep stall at a Reynolds number of 270,000. It was observed that entropy filtering can adequately mitigate aliasing-driven instabilities without degrading the accuracy of the underlying high-order scheme on par with standard anti-aliasing methods such as over-integration, albeit with marginally worse performance at higher approximation orders. △ Less

Submitted 31 December, 2023; v1 submitted 26 February, 2023; originally announced February 2023.

Comments: 13 pages, 8 figures

arXiv:2301.03129 [pdf, other]

doi 10.1016/j.compfluid.2023.106056

Positivity-preserving entropy filtering for the ideal magnetohydrodynamics equations

Authors: Tarik Dzanic, Freddie D. Witherden

Abstract: In this work, we present a positivity-preserving adaptive filtering approach for discontinuous spectral element approximations of the ideal magnetohydrodynamics equations. This approach combines the entropy filtering method (Dzanic and Witherden, J. Comput. Phys., 468, 2022) for shock capturing in gas dynamics along with the eight-wave method for enforcing a divergence-free magnetic field. Due to… ▽ More In this work, we present a positivity-preserving adaptive filtering approach for discontinuous spectral element approximations of the ideal magnetohydrodynamics equations. This approach combines the entropy filtering method (Dzanic and Witherden, J. Comput. Phys., 468, 2022) for shock capturing in gas dynamics along with the eight-wave method for enforcing a divergence-free magnetic field. Due to the inclusion of non-conservative source terms, an operator-splitting approach is introduced to ensure that the positivity and entropy constraints remain satisfied by the discrete solution. Furthermore, a computationally efficient algorithm for solving the optimization process for this nonlinear filtering approach is presented. The resulting scheme can robustly resolve strong discontinuities on general unstructured grids without tunable parameters while recovering high-order accuracy for smooth solutions. The efficacy of the scheme is shown in numerical experiments on various problems including extremely magnetized blast waves and three-dimensional magnetohydrodynamic instabilities. △ Less

Submitted 17 September, 2023; v1 submitted 8 January, 2023; originally announced January 2023.

Comments: 24 pages, 17 figures

Journal ref: Computers & Fluids, 266, 106056, 2023

arXiv:2210.15611 [pdf, other]

doi 10.1016/j.jcp.2023.112146

A positivity-preserving and conservative high-order flux reconstruction method for the polyatomic Boltzmann--BGK equation

Authors: Tarik Dzanic, Freddie D. Witherden, Luigi Martinelli

Abstract: In this work, we present a positivity-preserving high-order flux reconstruction method for the polyatomic Boltzmann--BGK equation augmented with a discrete velocity model that ensures the scheme is discretely conservative. Through modeling the internal degrees of freedom, the approach is further extended to polyatomic molecules and can encompass arbitrary constitutive laws. The approach is validat… ▽ More In this work, we present a positivity-preserving high-order flux reconstruction method for the polyatomic Boltzmann--BGK equation augmented with a discrete velocity model that ensures the scheme is discretely conservative. Through modeling the internal degrees of freedom, the approach is further extended to polyatomic molecules and can encompass arbitrary constitutive laws. The approach is validated on a series of large-scale complex numerical experiments, ranging from shock-dominated flows computed on unstructured grids to direct numerical simulation of three-dimensional compressible turbulent flows, the latter of which is the first instance of such a flow computed by directly solving the Boltzmann equation. The results show the ability of the scheme to directly resolve shock structures without any ad hoc numerical shock capturing method and correctly approximate turbulent flow phenomena in a consistent manner with the hydrodynamic equations. △ Less

Submitted 18 April, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

Comments: 31 pages, 20 figures

Journal ref: Journal of Computational Physics, 485, 112146, 2022

arXiv:2202.09733 [pdf, other]

Nonlinear p-multigrid preconditioner for implicit time integration of compressible Navier--Stokes equations

Authors: Lai Wang, Will Trojak, Freddie Witherden, Antony Jameson

Abstract: Within the framework of $ p $-adaptive flux reconstruction, we aim to construct efficient polynomial multigrid ($p$MG) preconditioners for implicit time integration of the Navier--Stokes equations using Jacobian-free Newton--Krylov (JFNK) methods. We hypothesise that in pseudo transient continuation (PTC), as the residual drops, the frequency of error modes that dictates the convergence rate gets… ▽ More Within the framework of $ p $-adaptive flux reconstruction, we aim to construct efficient polynomial multigrid ($p$MG) preconditioners for implicit time integration of the Navier--Stokes equations using Jacobian-free Newton--Krylov (JFNK) methods. We hypothesise that in pseudo transient continuation (PTC), as the residual drops, the frequency of error modes that dictates the convergence rate gets higher and higher. We apply nonlinear $p$MG solvers to stiff steady problems at low Mach number ($\mathrm{Ma}=10^{-3}$) to verify our hypothesis. It is demonstrated that once the residual drops by a few orders of magnitude, improved smoothing on intermediate $ p $-sublevels will not only maintain the stability of $ p $MG at large time steps but also improve the convergence rate. For the unsteady Navier--Stokes equations, we elaborate how to construct nonlinear preconditioners using pseudo transient continuation for the matrix-free generalized minimal residual (GMRES) method used in explicit first stage, singly diagonally implicit Runge--Kutta (ESDIRK) methods, and linearly implicit Rosenbrock--Wanner (ROW) methods. Given that at each time step the initial guess in the nonlinear solver is not distant from the converged solution, we recommend a two-level $p\{p_0\text{-}p_0/2\} $ or even $ p\{p_0\text{-}(p_0-1)\} $ $p$-hierarchy for optimal efficiency with a matrix-based smoother on the coarser level based on our hypothesis. It is demonstrated that insufficient smoothing on intermediate $p$-sublevels will deteriorate the performance of $p$MG preconditioner greatly. (See full abstract in the paper.) △ Less

Submitted 20 February, 2022; originally announced February 2022.

arXiv:2201.10502 [pdf, other]

doi 10.1016/j.jcp.2022.111501

Positivity-Preserving Entropy-Based Adaptive Filtering for Discontinuous Spectral Element Methods

Authors: Tarik Dzanic, Freddie D. Witherden

Abstract: In this work, we present a positivity-preserving entropy-based adaptive filtering method for shock capturing in discontinuous spectral element methods. By adapting the filter strength to enforce positivity and a local discrete minimum entropy principle, the resulting approach can robustly resolve strong discontinuities with sub-element resolution, does not require problem-dependent parameter tunin… ▽ More In this work, we present a positivity-preserving entropy-based adaptive filtering method for shock capturing in discontinuous spectral element methods. By adapting the filter strength to enforce positivity and a local discrete minimum entropy principle, the resulting approach can robustly resolve strong discontinuities with sub-element resolution, does not require problem-dependent parameter tuning, and can be easily implemented on general unstructured meshes with relatively low computational cost. The efficacy of the approach is shown in numerical experiments on hyperbolic and mixed hyperbolic-parabolic conservation laws such as the Euler and Navier-Stokes equations for problems including extreme shocks, shock-vortex interactions, and complex compressible turbulent flows. △ Less

Submitted 28 July, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

Comments: 25 pages, 11 figures

Journal ref: Journal of Computational Physics, 111501, 2022

arXiv:2111.07915 [pdf, other]

doi 10.1016/j.compfluid.2022.105634

Artificial Compressibility Approaches in Flux Reconstruction for Incompressible Viscous Flow Simulations

Authors: Will Trojak, Nagabhushana Rao Vadlamani, James Tyacke, Freddie Witherden, Antony Jameson

Abstract: Several competing artificial compressibility methods for the incompressible flow equations are examined using the high-order flux reconstruction method. The established artificial compressibility method (ACM) of \citet{Chorin1967} is compared to the alternative entropically damped (EDAC) method of \citet{Clausen2013}, as well as an ACM formulation with hyperbolised diffusion. While the former requ… ▽ More Several competing artificial compressibility methods for the incompressible flow equations are examined using the high-order flux reconstruction method. The established artificial compressibility method (ACM) of \citet{Chorin1967} is compared to the alternative entropically damped (EDAC) method of \citet{Clausen2013}, as well as an ACM formulation with hyperbolised diffusion. While the former requires the solution to be converged to a divergence free state at each physical time step through pseudo iterations, the latter can be applied explicitly. We examine the sensitivity of both methods to the parameterisation for a series of test cases over a range of Reynolds numbers. As the compressibility is reduced, EDAC is found to give linear improvements in divergence whereas ACM yields diminishing returns. For the Taylor--Green vortex, EDAC is found to perform well; however on the more challenging circular cylinder at $Re=3900$, EDAC gives rise to early transition of the free shear-layer and over-production of the turbulence kinetic energy. This is attributed to the spatial pressure fluctuations of the method. Similar behaviour is observed for an aerofoil at $Re=60,000$ with an attached transitional boundary layer. It is concluded that hyperbolic diffusion of ACM can be beneficial but at the cost of case setup time, and EDAC can be an efficient method for incompressible flow. However, care must be taken as pressure fluctuations can have a significant impact on physics and the remedy causes the governing equation to become overly stiff. △ Less

Submitted 15 November, 2021; originally announced November 2021.

Journal ref: Computers and Fluids 247 (2022) 105634

arXiv:2110.03653 [pdf, other]

Utilizing Time-Reversibility for Shock Capturing in Nonlinear Hyperbolic Conservation Laws

Authors: Tarik Dzanic, Will Trojak, Freddie D. Witherden

Abstract: In this work, we introduce a novel approach to formulating an artificial viscosity for shock capturing in nonlinear hyperbolic systems by utilizing the property that the solutions of hyperbolic conservation laws are not reversible in time in the vicinity of shocks. The proposed approach does not require any additional governing equations or a priori knowledge of the hyperbolic system in question,… ▽ More In this work, we introduce a novel approach to formulating an artificial viscosity for shock capturing in nonlinear hyperbolic systems by utilizing the property that the solutions of hyperbolic conservation laws are not reversible in time in the vicinity of shocks. The proposed approach does not require any additional governing equations or a priori knowledge of the hyperbolic system in question, is independent of the mesh and approximation order, and requires the use of only one tunable parameter. The primary novelty is that the resulting artificial viscosity is unique for each component of the conservation law which is advantageous for systems in which some components exhibit discontinuities while others do not. The efficacy of the method is shown in numerical experiments of multi-dimensional hyperbolic conservation laws such as nonlinear transport, Euler equations, and ideal magnetohydrodynamics using a high-order discontinuous spectral element method on unstructured grids. △ Less

Submitted 18 April, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

Comments: 20 pages, 14 figures

arXiv:2107.14027 [pdf, other]

doi 10.1016/j.cpc.2021.108235

Hyperbolic Diffusion in Flux Reconstruction: Optimisation through Kernel Fusion within Tensor-Product Elements

Authors: Will Trojak, Rob Watson, Freddie Witherden

Abstract: Novel methods are presented in this initial study for the fusion of GPU kernels in the artificial compressibility method (ACM), using tensor product elements with constant Jacobians and flux reconstruction. This is made possible through the hyperbolisation of the diffusion terms, which eliminates the expensive algorithmic steps needed to form the viscous stresses. Two fusion approaches are present… ▽ More Novel methods are presented in this initial study for the fusion of GPU kernels in the artificial compressibility method (ACM), using tensor product elements with constant Jacobians and flux reconstruction. This is made possible through the hyperbolisation of the diffusion terms, which eliminates the expensive algorithmic steps needed to form the viscous stresses. Two fusion approaches are presented, which offer differing levels of parallelism. This is found to be necessary for the change in workload as the order of accuracy of the elements is increased. Several further optimisations of these approaches are demonstrated, including a generation time memory manager which maximises resource usage. The fused kernels are able to achieve 3-4 times speedup, which compares favourably with a theoretical maximum speedup of 4. In three dimensional test cases, the generated fused kernels are found to reduce total runtime by ${\sim}25\%$, and, when compared to the standard ACM formulation, simulations demonstrate that a speedup of $2.3$ times can be achieved. △ Less

Submitted 15 November, 2021; v1 submitted 22 July, 2021; originally announced July 2021.

arXiv:2106.02582 [pdf, other]

doi 10.1016/j.jcp.2022.110992

Partially-Averaged Navier-Stokes Simulations of Turbulence Within a High-Order Flux Reconstruction Framework

Authors: Tarik Dzanic, Sharath Girimaji, Freddie Witherden

Abstract: High-order methods and hybrid turbulence models have independently shown promise as means of decreasing the computational cost of scale-resolving simulations. The objective of this work is to develop the combination of these methods and analyze the effects of high-order discretizations on hybrid turbulence models, particularly with respect the optimal model parameters and the relative accuracy ben… ▽ More High-order methods and hybrid turbulence models have independently shown promise as means of decreasing the computational cost of scale-resolving simulations. The objective of this work is to develop the combination of these methods and analyze the effects of high-order discretizations on hybrid turbulence models, particularly with respect the optimal model parameters and the relative accuracy benefits compared to approaches such as under-resolved direct numerical simulation (URDNS). We employ the Partially-Averaged Navier-Stokes (PANS) approach using the flux reconstruction scheme on the flow around a periodic hill and the wake flow of a circular cylinder at a Reynolds number of 3900, the latter of which we provide direct numerical simulation results and novel statistical analysis. By increasing the order of the discretization while fixing the total degrees of freedom, it was observed that larger improvements in the prediction of the statistics and flow physics were generally seen with PANS than URDNS. Furthermore, less sensitivity to the resolution-control parameter was observed with a high-order discretization, indicating that high-order discretizations may be an effective approach for increasing the accuracy and reliability of hybrid turbulence models for scale-resolving simulations without a significant increase in computational effort. △ Less

Submitted 4 June, 2021; originally announced June 2021.

Comments: 20 pages, 13 figures

arXiv:2102.01487 [pdf, other]

doi 10.1063/5.0070890

Turbulence closure modeling with data-driven techniques: Investigation of generalizable deep neural networks

Authors: Salar Taghizadeh, Freddie Witherden, Yassin Hassan, Sharath Girimaji

Abstract: Generalizability of machine-learning (ML) based turbulence closures to accurately predict unseen practical flows remains an important challenge. At the Reynolds-averaged Navier-Stokes (RANS) level, NN-based turbulence closure modeling is rendered difficult due to two important reasons: inherent complexity of the constitutive relation arising from flow-dependent non-linearity and bifurcations; and,… ▽ More Generalizability of machine-learning (ML) based turbulence closures to accurately predict unseen practical flows remains an important challenge. At the Reynolds-averaged Navier-Stokes (RANS) level, NN-based turbulence closure modeling is rendered difficult due to two important reasons: inherent complexity of the constitutive relation arising from flow-dependent non-linearity and bifurcations; and, inordinate difficulty in obtaining high-fidelity data covering the entire parameter space of interest. In this context, the objective of the work is to investigate the approximation capabilities of standard moderate-sized fully-connected NNs. We seek to systematically investigate the effects of: (i) intrinsic complexity of the solution manifold; (ii) sampling procedure (interpolation vs. extrapolation) and (iii) optimization procedure. To overcome the data acquisition challenges, three proxy-physics turbulence surrogates of different degrees of complexity (yet significantly simpler than turbulence physics) are employed to generate the parameter-to-solution maps. Even for this simple proxy-physics system, it is demonstrated that feed-forward NNs require more degrees of freedom than the original proxy-physics model to accurately approximate the true model even when trained with data over the entire parameter space (interpolation). Additionally, if deep fully-connected NNs are trained with data only from part of the parameter space (extrapolation), their approximation capability reduces considerably and it is not straightforward to find an optimal architecture. Overall, the findings provide a realistic perspective on the utility of ML turbulence closures for practical applications and identify areas for improvement. △ Less

Submitted 4 November, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

arXiv:2009.06072 [pdf, other]

doi 10.1016/j.compfluid.2021.104922

Accuracy, Stability, and Performance Comparison between the Spectral Difference and Flux Reconstruction Schemes

Authors: Christopher Cox, Will Trojak, Tarik Dzanic, Freddie Witherden, Antony Jameson

Abstract: We report the development of a discontinuous spectral element flow solver that includes the implementation of both spectral difference and flux reconstruction formulations. With this high order framework, we have constructed a foundation upon which to provide a fair and accurate assessment of these two schemes in terms of accuracy, stability, and performance with special attention to the true spec… ▽ More We report the development of a discontinuous spectral element flow solver that includes the implementation of both spectral difference and flux reconstruction formulations. With this high order framework, we have constructed a foundation upon which to provide a fair and accurate assessment of these two schemes in terms of accuracy, stability, and performance with special attention to the true spectral difference scheme and the modified spectral difference scheme recovered via the flux reconstruction formulation. Building on previous analysis of the spectral difference and flux reconstruction schemes, we provide a novel nonlinear stability analysis of the spectral difference scheme. Through various numerical experiments, we demonstrate the additional stability afforded by the true, baseline spectral difference scheme without explicit filtering or de-aliasing due to its inherent feature of staggered flux points. This arrangement leads to favorable suppression of aliasing errors and improves stability needed for under-resolved simulations of turbulent flows. △ Less

Submitted 13 September, 2020; originally announced September 2020.

Comments: 44 pages, 15 figures, 3 tables

Journal ref: Computers & Fluids, Volume 221, 15 May 2021, 104922

arXiv:2008.05463 [pdf, other]

On Fourier analysis of polynomial multigrid for arbitrary multi-stage cycles

Authors: Will Trojak, Freddie D. Witherden

Abstract: The Fourier analysis of the \emph{p}-multigrid acceleration technique is considered for a dual-time scheme applied to the advection-diffusion equation with various cycle configurations. It is found that improved convergence can be achieved through \emph{V}-cycle asymmetry where additional prolongation smoothing is applied. Experiments conducted on the artificial compressibility formulation of the… ▽ More The Fourier analysis of the \emph{p}-multigrid acceleration technique is considered for a dual-time scheme applied to the advection-diffusion equation with various cycle configurations. It is found that improved convergence can be achieved through \emph{V}-cycle asymmetry where additional prolongation smoothing is applied. Experiments conducted on the artificial compressibility formulation of the Navier--Stokes equations found that these analytic findings could be observed numerically in the pressure residual, whereas velocity terms---which are more hyperbolic in character---benefited primarily from increased pseudo-time steps. △ Less

Submitted 12 August, 2020; originally announced August 2020.

MSC Class: 65M60; 65T99; 65M55; 76D99

arXiv:2004.03031 [pdf, other]

doi 10.1088/1367-2630/abadb3

Turbulence closure modeling with data-driven techniques: physical compatibility and consistency considerations

Authors: Salar Taghizadeh, Freddie D. Witherden, Sharath S. Girimaji

Abstract: A recent thrust in turbulence closure modeling research is to incorporate machine learning (ML) elements, such as neural networks, for the purpose of enhancing the predictive capability to a broader class of flows. Such a turbulence closure framework entails solving a system of equations comprised of ML functionals coupled with traditional (physics-based - PB) elements. While combining closure ele… ▽ More A recent thrust in turbulence closure modeling research is to incorporate machine learning (ML) elements, such as neural networks, for the purpose of enhancing the predictive capability to a broader class of flows. Such a turbulence closure framework entails solving a system of equations comprised of ML functionals coupled with traditional (physics-based - PB) elements. While combining closure elements from fundamentally different ideologies can lead to unprecedented progress, there are many critical challenges that must be overcome. This study examines three such challenges: (i) Physical compatibility (or lack thereof) between ML and PB constituents of the modeling system of equations; (ii) Internal (self) consistency of the ML training process; and (iii) Formulation of an optimal objective (or loss) function for training. These issues are critically important for generalization of the ML-enhanced methods to predictive computations of complex engineering flows. Training and implementation strategies in current practice that may lead to significant incompatibilities and inconsistencies are identified. Using the simple test case of turbulent channel flow, key deficiencies are highlighted and proposals for mitigating them are investigated. Compatibility constraints are evaluated and it is demonstrated that an iterative training procedure can help ensure certain degree of consistency. In summary, this work develops foundational tenets to guide development of ML-enhanced turbulence closure models. △ Less

Submitted 6 April, 2020; originally announced April 2020.

Journal ref: New J. Phys. 22 (2020) 093023

arXiv:2003.02633 [pdf, other]

Inline Vector Compression for Computational Physics

Authors: Will Trojak, Freddie Witherden

Abstract: A novel inline data compression method is presented for single-precision vectors in three dimensions. The primary application of the method is for accelerating computational physics calculations where the throughput is bound by memory bandwidth. The scheme employs spherical polar coordinates, angle quantisation, and a bespoke floating-point representation of the magnitude to achieve a fixed compre… ▽ More A novel inline data compression method is presented for single-precision vectors in three dimensions. The primary application of the method is for accelerating computational physics calculations where the throughput is bound by memory bandwidth. The scheme employs spherical polar coordinates, angle quantisation, and a bespoke floating-point representation of the magnitude to achieve a fixed compression ratio of 1.5. The anisotropy of this method is considered, along with companding and fractional splitting techniques to improve the efficiency of the representation. We evaluate the scheme numerically within the context of high-order computational fluid dynamics. For both the isentropic convecting vortex and the Taylor--Green vortex test cases, the results are found to be comparable to those without compression. Performance is evaluated for a vector addition kernel on an NVIDIA Titan V GPU; it is demonstrated that a speedup of 1.5 can be achieved. △ Less

Submitted 24 June, 2020; v1 submitted 27 February, 2020; originally announced March 2020.

Comments: The extra supplementary material can be found at: https://github.com/WillTrojak/compression_examples

MSC Class: 68U20; 68W40; 68P30; 65M60; 76F65

arXiv:1912.06752 [pdf, other]

Parameter-Conditioned Sequential Generative Modeling of Fluid Flows

Authors: Jeremy Morton, Freddie D. Witherden, Mykel J. Kochenderfer

Abstract: The computational cost associated with simulating fluid flows can make it infeasible to run many simulations across multiple flow conditions. Building upon concepts from generative modeling, we introduce a new method for learning neural network models capable of performing efficient parameterized simulations of fluid flows. Evaluated on their ability to simulate both two-dimensional and three-dime… ▽ More The computational cost associated with simulating fluid flows can make it infeasible to run many simulations across multiple flow conditions. Building upon concepts from generative modeling, we introduce a new method for learning neural network models capable of performing efficient parameterized simulations of fluid flows. Evaluated on their ability to simulate both two-dimensional and three-dimensional fluid flows, trained models are shown to capture local and global properties of the flow fields at a wide array of flow conditions. Furthermore, flow simulations generated by the trained models are shown to be orders of magnitude faster than the corresponding computational fluid dynamics simulations. △ Less

Submitted 13 December, 2019; originally announced December 2019.

Comments: 29 pages, 21 figures

arXiv:1812.01177 [pdf, other]

doi 10.1016/j.jcp.2019.05.041

Recovering missing CFD data for high-order discretizations using deep neural networks and dynamics learning

Authors: Kevin T. Carlberg, Antony Jameson, Mykel J. Kochenderfer, Jeremy Morton, Liqian Peng, Freddie D. Witherden

Abstract: Data I/O poses a significant bottleneck in large-scale CFD simulations; thus, practitioners would like to significantly reduce the number of times the solution is saved to disk, yet retain the ability to recover any field quantity (at any time instance) a posteriori. The objective of this work is therefore to accurately recover missing CFD data a posteriori at any time instance, given that the sol… ▽ More Data I/O poses a significant bottleneck in large-scale CFD simulations; thus, practitioners would like to significantly reduce the number of times the solution is saved to disk, yet retain the ability to recover any field quantity (at any time instance) a posteriori. The objective of this work is therefore to accurately recover missing CFD data a posteriori at any time instance, given that the solution has been written to disk at only a relatively small number of time instances. We consider in particular high-order discretizations (e.g., discontinuous Galerkin), as such techniques are becoming increasingly popular for the simulation of highly separated flows. To satisfy this objective, this work proposes a methodology consisting of two stages: 1) dimensionality reduction and 2) dynamics learning. For dimensionality reduction, we propose a novel hierarchical approach. First, the method reduces the number of degrees of freedom within each element of the high-order discretization by applying autoencoders from deep learning. Second, the methodology applies principal component analysis to compress the global vector of encodings. This leads to a low-dimensional state, which associates with a nonlinear embedding of the original CFD data. For dynamics learning, we propose to apply regression techniques (e.g., kernel methods) to learn the discrete-time velocity characterizing the time evolution of this low-dimensional state. A numerical example on a large-scale CFD example characterized by nearly 13 million degrees of freedom illustrates the suitability of the proposed method in an industrial setting. △ Less

Submitted 28 May, 2019; v1 submitted 30 November, 2018; originally announced December 2018.

Comments: Accepted in Journal of Computational Physics

arXiv:1806.00378 [pdf, other]

High-Order Computational Fluid Dynamics Simulations of a Spinning Golf Ball

Authors: Jacob Crabill, Freddie Witherden, Antony Jameson

Abstract: This paper presents the first high-order computational fluid dynamics (CFD) simulations of static and spinning golf balls at realistic flow conditions. The present results are shown to capture the complex fluid dynamics inside the dimples which lead to drag reduction versus a smooth sphere, and compare well to previous experimental and computational studies. The high--order Flux Reconstruction met… ▽ More This paper presents the first high-order computational fluid dynamics (CFD) simulations of static and spinning golf balls at realistic flow conditions. The present results are shown to capture the complex fluid dynamics inside the dimples which lead to drag reduction versus a smooth sphere, and compare well to previous experimental and computational studies. The high--order Flux Reconstruction method has been paired with the Artificial Boundary overset method to enable simplified mesh generation and grid motion. The compressible Navier--Stokes equations are modeled using a scale--resolving Large Eddy Simulation (LES) approach with no sub--grid models. The codes implementing these methods have been implemented for NVIDIA Graphical Processing Units (GPUs), enabling large speedups over traditional computer hardware. The new method allows for the simulation of golf balls, and other objects at similar moderate Reynolds numbers, to be simulated in a matter of days on large computing clusters. The use of CFD for the design of objects such as golf balls and other sports balls is now within reach. △ Less

Submitted 29 May, 2018; originally announced June 2018.

Comments: Preprint submitted to Sports Engineering. arXiv admin note: substantial text overlap with arXiv:1711.07663

MSC Class: 76G25

arXiv:1711.07663 [pdf, other]

doi 10.1016/j.jcp.2018.05.036

A Parallel Direct Cut Algorithm for High-Order Overset Methods with Application to a Spinning Golf Ball

Authors: Jacob A. Crabill, Freddie D. Witherden, Antony Jameson

Abstract: Overset methods are commonly employed to enable the effective simulation of problems involving complex geometries and moving objects such as rotorcraft. This paper presents a novel overset domain connectivity algorithm based upon the direct cut approach suitable for use with GPU-accelerated solvers on high-order curved grids. In contrast to previous methods it is capable of exploiting the highly d… ▽ More Overset methods are commonly employed to enable the effective simulation of problems involving complex geometries and moving objects such as rotorcraft. This paper presents a novel overset domain connectivity algorithm based upon the direct cut approach suitable for use with GPU-accelerated solvers on high-order curved grids. In contrast to previous methods it is capable of exploiting the highly data-parallel nature of modern accelerators. Further, the approach is also substantially more efficient at handling the curved grids which arise within the context of high-order methods. An implementation of this new algorithm is presented and combined with a high-order fluid dynamics code. The algorithm is validated against several benchmark problems, including flow over a spinning golf ball at a Reynolds number of 150,000. △ Less

Submitted 23 May, 2018; v1 submitted 21 November, 2017; originally announced November 2017.

Comments: Preprint accepted for publication in the Journal of Computational Physics. From previous version: Fixed typos Fixed incorrect plot of spinning golf ball forces; Reorganized discussion of code performance for clarity; Added additional code performance profiling results

MSC Class: 76G25 68W10 65M70

arXiv:1409.0405 [pdf, other]

Heterogeneous Computing on Mixed Unstructured Grids with PyFR

Authors: F. D. Witherden, B. C. Vermeire, P. E. Vincent

Abstract: PyFR is an open-source high-order accurate computational fluid dynamics solver for mixed unstructured grids that can target a range of hardware platforms from a single codebase. In this paper we demonstrate the ability of PyFR to perform high-order accurate unsteady simulations of flow on mixed unstructured grids using heterogeneous multi-node hardware. Specifically, after benchmarking single-node… ▽ More PyFR is an open-source high-order accurate computational fluid dynamics solver for mixed unstructured grids that can target a range of hardware platforms from a single codebase. In this paper we demonstrate the ability of PyFR to perform high-order accurate unsteady simulations of flow on mixed unstructured grids using heterogeneous multi-node hardware. Specifically, after benchmarking single-node performance for various platforms, PyFR v0.2.2 is used to undertake simulations of unsteady flow over a circular cylinder at Reynolds number 3 900 using a mixed unstructured grid of prismatic and tetrahedral elements on a desktop workstation containing an Intel Xeon E5-2697 v2 CPU, an NVIDIA Tesla K40c GPU, and an AMD FirePro W9100 GPU. Both the performance and accuracy of PyFR are assessed. PyFR v0.2.2 is freely available under a 3-Clause New Style BSD license (see www.pyfr.org). △ Less

Submitted 1 September, 2014; originally announced September 2014.

Comments: 21 pages, 9 figures, 6 tables

arXiv:1312.1638 [pdf, other]

doi 10.1016/j.cpc.2014.07.011

PyFR: An Open Source Framework for Solving Advection-Diffusion Type Problems on Streaming Architectures using the Flux Reconstruction Approach

Authors: Freddie D Witherden, Antony M Farrington, Peter E Vincent

Abstract: High-order numerical methods for unstructured grids combine the superior accuracy of high-order spectral or finite difference methods with the geometric flexibility of low-order finite volume or finite element schemes. The Flux Reconstruction (FR) approach unifies various high-order schemes for unstructured grids within a single framework. Additionally, the FR approach exhibits a significant degre… ▽ More High-order numerical methods for unstructured grids combine the superior accuracy of high-order spectral or finite difference methods with the geometric flexibility of low-order finite volume or finite element schemes. The Flux Reconstruction (FR) approach unifies various high-order schemes for unstructured grids within a single framework. Additionally, the FR approach exhibits a significant degree of element locality, and is thus able to run efficiently on modern streaming architectures, such as Graphical Processing Units (GPUs). The aforementioned properties of FR mean it offers a promising route to performing affordable, and hence industrially relevant, scale-resolving simulations of hitherto intractable unsteady flows within the vicinity of real-world engineering geometries. In this paper we present PyFR, an open-source Python based framework for solving advection-diffusion type problems on streaming architectures using the FR approach. The framework is designed to solve a range of governing systems on mixed unstructured grids containing various element types. It is also designed to target a range of hardware platforms via use of an in-built domain specific language based on the Mako templating engine. The current release of PyFR is able to solve the compressible Euler and Navier-Stokes equations on grids of quadrilateral and triangular elements in two dimensions, and hexahedral elements in three dimensions, targeting clusters of CPUs, and NVIDIA GPUs. Results are presented for various benchmark flow problems, single-node performance is discussed, and scalability of the code is demonstrated on up to 104 NVIDIA M2090 GPUs. The software is freely available under a 3-Clause New Style BSD license (see www.pyfr.org). △ Less

Submitted 7 May, 2014; v1 submitted 5 December, 2013; originally announced December 2013.

MSC Class: 65M60 65M70

Showing 1–23 of 23 results for author: Witherden, F