-
Efficient Interleaved Batch Matrix Solvers for CUDA
Authors:
Andrew Gloster,
Enda Carroll,
Miguel Bustamante,
Lennon O'Naraigh
Abstract:
In this paper we present a new methodology for data accesses when solving batches of Tridiagonal and Pentadiagonal matrices that all share the same LHS matrix. By only storing one copy of this matrix there is a significant reduction in storage overheads and the authors show that there is also a performance increase in terms of compute time. These two results combined lead to an overall more effici…
▽ More
In this paper we present a new methodology for data accesses when solving batches of Tridiagonal and Pentadiagonal matrices that all share the same LHS matrix. By only storing one copy of this matrix there is a significant reduction in storage overheads and the authors show that there is also a performance increase in terms of compute time. These two results combined lead to an overall more efficient implementation over the current state of the art algorithms cuThomasBatch and cuPentBatch, allowing for a greater number of systems to be solved on a single GPU.
△ Less
Submitted 12 September, 2019; v1 submitted 10 September, 2019;
originally announced September 2019.
-
cuSten -- CUDA Finite Difference and Stencil Library
Authors:
Andrew Gloster,
Lennon O'Naraigh
Abstract:
In this paper we present cuSten, a new library of functions to handle the implementation of 2D and batched 1D finite-difference/stencil programs in CUDA. cuSten wraps data handling, kernel calls and streaming into four easy to use functions that speed up development of numerical codes on GPU platforms. The paper also presents an example of this library applied to solve the Cahn-Hilliard equation u…
▽ More
In this paper we present cuSten, a new library of functions to handle the implementation of 2D and batched 1D finite-difference/stencil programs in CUDA. cuSten wraps data handling, kernel calls and streaming into four easy to use functions that speed up development of numerical codes on GPU platforms. The paper also presents an example of this library applied to solve the Cahn-Hilliard equation utilizing an ADI method with periodic boundary conditions, this solver is also used to benchmark the cuSten library performance against a serial implementation.
△ Less
Submitted 4 September, 2019; v1 submitted 26 February, 2019;
originally announced February 2019.
-
Travelling-wave spatially periodic forcing of asymmetric binary mixtures
Authors:
Lennon O'Naraigh
Abstract:
We study travelling-wave spatially periodic solutions of a forced Cahn-Hilliard equation. This is a model for phase separation of a binary mixture, subject to external forcing. We look at arbitrary values of the mean mixture concentration, corresponding to asymmetric mixtures (previous studies have only considered the symmetric case). We characterize in depth one particular solution which consists…
▽ More
We study travelling-wave spatially periodic solutions of a forced Cahn-Hilliard equation. This is a model for phase separation of a binary mixture, subject to external forcing. We look at arbitrary values of the mean mixture concentration, corresponding to asymmetric mixtures (previous studies have only considered the symmetric case). We characterize in depth one particular solution which consists of an oscillation around the mean concentration level, using a range of techniques, both numerical and analytical. We determine the stability of this solution to small-amplitude perturbations. Next, we use methods developed elsewhere in the context of shallow-water waves to uncover a (possibly infinite) family of multiple-spike solutions for the concentration profile, which linear stability analysis demonstrates to be unstable. Throughout the work, we perform thorough parametric studies to outline for which parameter values the different solution types occur.
△ Less
Submitted 30 August, 2018; v1 submitted 23 July, 2018;
originally announced July 2018.
-
A flow-pattern map for phase separation using the Navier-Stokes Cahn-Hilliard model
Authors:
Aurore Naso,
Lennon O'Naraigh
Abstract:
We use the Navier-Stokes-Cahn-Hilliard model equations to simulate phase separation with flow. We study coarsening - the growth of extended domains wherein the binary mixture phase separates into its component parts. The coarsening is characterized by two competing effects: flow, and the Cahn-Hilliard diffusion term, which drives the phase separation. Based on extensive two-dimensional direct nume…
▽ More
We use the Navier-Stokes-Cahn-Hilliard model equations to simulate phase separation with flow. We study coarsening - the growth of extended domains wherein the binary mixture phase separates into its component parts. The coarsening is characterized by two competing effects: flow, and the Cahn-Hilliard diffusion term, which drives the phase separation. Based on extensive two-dimensional direct numerical simulations, we construct a flow-pattern map outlining the relative strength of these effects in different parts of the parameter space. The map reveals large regions of parameter space where a standard theory applies, and where the domains grow algebraically in time. However, there are significant parts of the parameter space where the standard theory does not apply. In one region, corresponding to low values of viscosity and diffusion, the coarsening is accelerated compared to the standard theory. Previous studies involving Stokes flow report on this phenomenon; we complete the picture by demonstrating that this anomalous regime occurs not only for Stokes flow, but also, for flows dominated by inertia. In a second region, corresponding to arbitrary viscosities and high Cahn-Hilliard diffusion, the diffusion overwhelms the hydrodynamics altogether, and the latter can effectively be ignored, in contrast to the prediction of the standard scaling theory. Based on further high-resolution simulations in three dimensions, we find that broadly speaking, the above description holds there also, although the formation of the anomalous domains in the low-viscosity-low-diffusion part of the parameter space is delayed in three dimensions compared to two.
△ Less
Submitted 19 July, 2017;
originally announced July 2017.
-
Advection of nematic liquid crystals by chaotic flow
Authors:
Lennon O'Naraigh
Abstract:
Consideration is given to the effects of inhomogeneous shear flow (both regular and chaotic) on nematic liquid crystals in a planar geometry. The Landau--de Gennes equation coupled to an externally-prescribed flow field is the basis for the study: this is solved numerically in a periodic spatial domain. The focus is on a limiting case where the advection is passive, such that variations in the liq…
▽ More
Consideration is given to the effects of inhomogeneous shear flow (both regular and chaotic) on nematic liquid crystals in a planar geometry. The Landau--de Gennes equation coupled to an externally-prescribed flow field is the basis for the study: this is solved numerically in a periodic spatial domain. The focus is on a limiting case where the advection is passive, such that variations in the liquid-crystal properties do not feed back into the equation for the fluid velocity. The main tool for analyzing the results (both with and without flow) is the identification of the fixed points of the dynamical equations without flow, which are relevant (to varying degrees) when flow is introduced. The fixed points are classified as stable/unstable and further as either uniaxial or biaxial. Accordingly, various models of passive shear flow are investigated, with the main focus being on the case where tumbling is absent from the model. In this scenario, not only must advection of the $Q$-tensor be considered, but also its co-rotation relative to the local vorticity field. Thus, only the biaxial fixed point survives as a solution of the $Q$-tensor dynamics under the imposition of a general flow field. For certain flows for which co-rotation effects can effectively be ignored along trajectories, both families of fixed points survive. In this scenario, the system exhibits coarsening arrest, whereby the liquid-crystal domains are `frozen in' to the flow structures and the growth in their size is thus limited. The outcome (biaxial final state or a mixture of both families of fixed points) can be understood in terms of the flow time- and length-scales that manifest themselves through the co-rotational derivative. Some consideration is also given to the specific case where tumbling is present; here, the flow has a strong effect on the liquid-crystal morphology.
△ Less
Submitted 19 July, 2017; v1 submitted 18 May, 2016;
originally announced May 2016.
-
Particle-laden viscous channel flows - model regularization and parameter study
Authors:
Lennon O'Naraigh,
Ricardo Barros
Abstract:
We characterize the flow of a viscous suspension in an inclined channel where the flow is maintained in a steady state under the competing influences of gravity and an applied pressure drop. The basic model relies on a diffusive-flux formalism. Such models are common in the literature, yet many of them possess an unphysical singularity at the channel centreline where the shear rate vanishes. We th…
▽ More
We characterize the flow of a viscous suspension in an inclined channel where the flow is maintained in a steady state under the competing influences of gravity and an applied pressure drop. The basic model relies on a diffusive-flux formalism. Such models are common in the literature, yet many of them possess an unphysical singularity at the channel centreline where the shear rate vanishes. We therefore present a regularization of the basic diffusive-flux model that removes this singularity. This introduces an explicit (physical) dependence on the particle size into the model equations. This approach enables us to carry out a detailed parameter study showing in particular the opposing effects of the pressure drop and gravity. Conditions for counter-current flow and complete flow reversal are obtained from numerical solutions of the model equations. These are supplemented by an analytic lower bound on the ratio of the gravitational force to the applied pressure drop necessary to bring about complete flow reversal.
△ Less
Submitted 14 January, 2016;
originally announced January 2016.
-
Simplified TPLS as a learning tool for high-performance computational fluid dynamics
Authors:
James Fannon,
Jean-Christophe Loiseau,
Prashant Valluri,
Iain Bethune,
Lennon O'Naraigh
Abstract:
We introduce a modified and simplified version of the pre-existing fully parallelized three-dimensional Navier--Stokes flow solver known as TPLS. We demonstrate how the simplified version can be used as a pedagogical tool for the study of computational fluid dynamics and parallel computing. TPLS is at its heart a two-phase flow solver, and uses calls to a range of external libraries to accelerate…
▽ More
We introduce a modified and simplified version of the pre-existing fully parallelized three-dimensional Navier--Stokes flow solver known as TPLS. We demonstrate how the simplified version can be used as a pedagogical tool for the study of computational fluid dynamics and parallel computing. TPLS is at its heart a two-phase flow solver, and uses calls to a range of external libraries to accelerate its performance. However, in the present context we narrow the focus of the study to basic hydrodynamics and parallel computing techniques, and the code is therefore simplified and modified to simulate pressure-driven single-phase flow in a channel, using only relatively simple Fortran 90 code with MPI parallelization, but no calls to any other external libraries. The modified code is analysed in order to both validate its accuracy and investigate its scalability up to 1000 CPU cores. Simulations are performed for several benchmark cases in pressure-driven channel flow, including a turbulent simulation, wherein the turbulence is incorporated via the large-eddy simulation technique. The work may be of use to advanced undergraduate and graduate students as an introductory study in computational fluid dynamics, while also providing insight for those interested in more general aspects of high-performance computing.
△ Less
Submitted 24 November, 2015;
originally announced November 2015.
-
Linear and nonlinear instability in vertical counter-current laminar gas-liquid flows
Authors:
Patrick Schmidt,
Lennon Ó'Náraigh,
Mathieu Lucquiaud,
Prashant Valluri
Abstract:
We consider the genesis and dynamics of interfacial instability in gas-liquid flows, using as a model the two-dimensional channel flow of a thin falling film sheared by counter-current gas. The methodology is linear stability theory (Orr-Sommerfeld analysis) together with direct numerical simulation of the two-phase flow in the case of nonlinear disturbances. We investigate the influence of three…
▽ More
We consider the genesis and dynamics of interfacial instability in gas-liquid flows, using as a model the two-dimensional channel flow of a thin falling film sheared by counter-current gas. The methodology is linear stability theory (Orr-Sommerfeld analysis) together with direct numerical simulation of the two-phase flow in the case of nonlinear disturbances. We investigate the influence of three main flow parameters (density contrast between liquid and gas, film thickness, pressure drop applied to drive the gas stream) on the interfacial dynamics. Energy budget analyses based on the Orr-Sommerfeld theory reveal various coexisting unstable modes (interfacial, shear, internal) in the case of high density contrasts, which results in mode coalescence and mode competition, but only one dynamically relevant unstable internal mode for low density contrast. The same linear stability approach provides a quantitative prediction for the onset of (partial) liquid flow reversal in terms of the gas and liquid flow rates. A study of absolute and convective instability for low density contrast shows that the system is absolutely unstable for all but two narrow regions of the investigated parameter space. Direct numerical simulations of the same system (low density contrast) show that linear theory holds up remarkably well upon the onset of large-amplitude waves as well as the existence of weakly nonlinear waves. In comparison, for high density contrasts corresponding more closely to an air-water-type system, although the linear stability theory is successful at determining the most-dominant features in the interfacial wave dynamics at early-to-intermediate times, the short waves selected by the linear theory undergo secondary instability and the wave train is no longer regular but rather exhibits chaotic dynamics and eventually, wave overturning.
△ Less
Submitted 16 July, 2015;
originally announced July 2015.
-
Transient growth calculations obtained directly from the Orr-Sommerfeld matrices
Authors:
Lennon O'Naraigh
Abstract:
We introduce and validate an algorithm to compute transient amplication factors for the Orr-Sommerfeld-Squire linear theory for parallel two-phase flow. We further introduce direct numerical simulation as a way of comparing the linear theory with early-stage wave growth in simulations. The simulation results are drawn from a strongly supercritical parameter case wherein the modal growth rates are…
▽ More
We introduce and validate an algorithm to compute transient amplication factors for the Orr-Sommerfeld-Squire linear theory for parallel two-phase flow. We further introduce direct numerical simulation as a way of comparing the linear theory with early-stage wave growth in simulations. The simulation results are drawn from a strongly supercritical parameter case wherein the modal growth rates are strong. In this case, the modal growth dominates the transient growth
△ Less
Submitted 18 October, 2017; v1 submitted 27 April, 2015;
originally announced April 2015.
-
Flow-parametric regulation of shear-driven phase separation in two and three dimensions
Authors:
Lennon O'Naraigh,
Selma Shun,
Aurore Naso
Abstract:
The Cahn-Hilliard equation with an externally-prescribed chaotic shear flow is studied in two and three dimensions. The main goal is to compare and contrast the phase separation in two and three dimensions, using high-resolution numerical simulation as the basis for the study. The model flow is parametrized by its amplitudes (thereby admitting the possibility of anisotropy), lengthscales, and mult…
▽ More
The Cahn-Hilliard equation with an externally-prescribed chaotic shear flow is studied in two and three dimensions. The main goal is to compare and contrast the phase separation in two and three dimensions, using high-resolution numerical simulation as the basis for the study. The model flow is parametrized by its amplitudes (thereby admitting the possibility of anisotropy), lengthscales, and multiple time scales, and the outcome of the phase separation is investigated as a function of these parameters as well as the dimensionality. In this way, a parameter regime is identified wherein the phase separation and the associated coarsening phenomenon are not only arrested but in fact the concentration variance decays, thereby opening up the possibility of describing the dynamics of the concentration field using the theories of advection diffusion. This parameter regime corresponds to long flow correlation times, large flow amplitudes and small diffusivities. The onset of this hyperdiffusive regime is interpreted by introducing Batchelor lengthscales. A key result is that in the hyperdiffusive regime, the distribution of concentration (in particular, the frequency of extreme values of concentration) depends strongly on the dimensionality. Anisotropic scenarios are also investigated: for scenarios wherein the variance saturates (corresponding to coarsening arrest), the direction in which the domains align depends on the flow correlation time. Thus, for correlation times comparable to the inverse of the mean shear rate, the domains align in the direction of maximum flow amplitude, while for short correlation times, the domains initially align in the opposite direction. However, at very late times (after the passage of thousands of correlation times), the fate of the domains is the same regardless of correlation time, namely alignment in the direction of maximum flow amplitude.
△ Less
Submitted 22 December, 2014; v1 submitted 29 July, 2014;
originally announced July 2014.