-
Conditional Karhunen-Loéve regression model with Basis Adaptation for high-dimensional problems: uncertainty quantification and inverse modeling
Authors:
Yu-Hong Yeung,
Ramakrishna Tipireddy,
David A. Barajas-Solano,
Alexandre M. Tartakovsky
Abstract:
We propose a methodology for improving the accuracy of surrogate models of the observable response of physical systems as a function of the systems' spatially heterogeneous parameter fields with applications to uncertainty quantification and parameter estimation in high-dimensional problems. Practitioners often formulate finite-dimensional representations of spatially heterogeneous parameter fields using truncated unconditional Karhunen-Loéve expansions (KLEs) for a certain choice of unconditional covariance kernel and construct surrogate models of the observable response with respect to the random variables in the KLE. When direct measurements of the parameter fields are available, we propose improving the accuracy of these surrogate models by representing the parameter fields via conditional Karhunen-Loéve expansions (CKLEs). CKLEs are constructed by conditioning the covariance kernel of the unconditional expansion on the direct measurements via Gaussian process regression and then truncating the corresponding KLE. We apply the proposed methodology to constructing surrogate models via the Basis Adaptation (BA) method of the stationary hydraulic head response, measured at spatially discrete observation locations, of a groundwater flow model of the Hanford Site, as a function of the 1,000-dimensional representation of the model's log-transmissivity field. We find that BA surrogate models of the hydraulic head based on CKLEs are more accurate than BA surrogate models based on unconditional expansions for forward uncertainty quantification tasks. Furthermore, we find that inverse estimates of the hydraulic transmissivity field computed using CKLE-based BA surrogate models are more accurate than those computed using unconditional BA surrogate models.
Submitted 5 July, 2023;
originally announced July 2023.
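The CKLE construction summarized in the abstract above (condition the covariance kernel on direct measurements via Gaussian process regression, then truncate the eigendecomposition) can be illustrated with a minimal NumPy sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the one-dimensional grid, the squared-exponential kernel, the zero prior mean, the small `noise` regularization, and the function names `sq_exp_kernel`, `conditional_kle`, and `sample_ckle`.

```python
import numpy as np


def sq_exp_kernel(xa, xb, variance=1.0, length=0.2):
    """Squared-exponential covariance between two 1-D point sets (assumed kernel)."""
    d = xa[:, None] - xb[None, :]
    return variance * np.exp(-0.5 * (d / length) ** 2)


def conditional_kle(x, x_obs, y_obs, n_terms, noise=1e-8):
    """Condition a zero-mean Gaussian prior on observations and truncate its KLE.

    Returns the conditional mean on the grid x, the n_terms largest eigenvalues
    of the conditional covariance, and the matching eigenvectors (as columns).
    """
    K_xx = sq_exp_kernel(x, x)
    K_xo = sq_exp_kernel(x, x_obs)
    K_oo = sq_exp_kernel(x_obs, x_obs) + noise * np.eye(len(x_obs))

    # Standard GPR conditioning formulas (zero prior mean is an assumption).
    w = np.linalg.solve(K_oo, K_xo.T)      # K_oo^{-1} K_ox
    mean_c = w.T @ y_obs                   # conditional mean
    cov_c = K_xx - K_xo @ w                # conditional covariance
    cov_c = 0.5 * (cov_c + cov_c.T)        # remove round-off asymmetry

    # Eigendecomposition, sorted by decreasing eigenvalue, then truncated.
    eigvals, eigvecs = np.linalg.eigh(cov_c)
    order = np.argsort(eigvals)[::-1][:n_terms]
    lam = np.clip(eigvals[order], 0.0, None)   # clip tiny negative round-off
    phi = eigvecs[:, order]
    return mean_c, lam, phi


def sample_ckle(mean_c, lam, phi, xi):
    """Evaluate the truncated expansion for a coefficient vector xi."""
    return mean_c + phi @ (np.sqrt(lam) * xi)
```

In this sketch, every realization produced by `sample_ckle` passes (up to the `noise` level) through the measured values, which is the property that distinguishes a conditional expansion from an unconditional one. A surrogate model would then be built with respect to the coefficient vector `xi`.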
-
Gaussian process regression and conditional Karhunen-Loéve models for data assimilation in inverse problems
Authors:
Yu-Hong Yeung,
David A. Barajas-Solano,
Alexandre M. Tartakovsky
Abstract:
We present a model inversion algorithm, CKLEMAP, for data assimilation and parameter estimation in partial differential equation models of physical systems with spatially heterogeneous parameter fields. These fields are approximated using low-dimensional conditional Karhunen-Loéve expansions, which are constructed using Gaussian process regression models of these fields trained on the parameters' measurements. We then assimilate measurements of the state of the system and compute the maximum a posteriori estimate of the CKLE coefficients by solving a nonlinear least-squares problem. When solving this optimization problem, we efficiently compute the Jacobian of the vector objective by exploiting the sparsity structure of the linear system of equations associated with the forward solution of the physics problem. The CKLEMAP method provides better scalability compared to the standard MAP method. In the MAP method, the number of unknowns to be estimated is equal to the number of elements in the numerical forward model. On the other hand, in CKLEMAP, the number of unknowns (CKLE coefficients) is controlled by the smoothness of the parameter field and the number of measurements, and is in general much smaller than the number of discretization nodes, which leads to a significant reduction of computational cost with respect to the standard MAP method. To show its advantage in scalability, we apply CKLEMAP to estimate the transmissivity field in a two-dimensional steady-state subsurface flow model of the Hanford Site by assimilating synthetic measurements of transmissivity and hydraulic head. We find that the execution time of CKLEMAP scales nearly linearly as $N^{1.33}$, where $N$ is the number of discretization nodes, while the execution time of standard MAP scales as $N^{2.91}$. The CKLEMAP method improved execution time without sacrificing accuracy when compared to the standard MAP.
Submitted 26 January, 2023;
originally announced January 2023.
-
Differentiable modeling to unify machine learning and physical models and advance Geosciences
Authors:
Chaopeng Shen,
Alison P. Appling,
Pierre Gentine,
Toshiyuki Bandai,
Hoshin Gupta,
Alexandre Tartakovsky,
Marco Baity-Jesi,
Fabrizio Fenicia,
Daniel Kifer,
Li Li,
Xiaofeng Liu,
Wei Ren,
Yi Zheng,
Ciaran J. Harman,
Martyn Clark,
Matthew Farthing,
Dapeng Feng,
Praveen Kumar,
Doaa Aboelyazeed,
Farshid Rahmani,
Hylke E. Beck,
Tadd Bindas,
Dipankar Dwivedi,
Kuai Fang,
Marvin Höge, et al. (5 additional authors not shown)
Abstract:
Process-Based Modeling (PBM) and Machine Learning (ML) are often perceived as distinct paradigms in the geosciences. Here we present differentiable geoscientific modeling as a powerful pathway toward dissolving the perceived barrier between them and ushering in a paradigm shift. For decades, PBM offered benefits in interpretability and physical consistency but struggled to efficiently leverage large datasets. ML methods, especially deep networks, presented strong predictive skills yet lacked the ability to answer specific scientific questions. While various methods have been proposed for ML-physics integration, an important underlying theme -- differentiable modeling -- is not sufficiently recognized. Here we outline the concepts, applicability, and significance of differentiable geoscientific modeling (DG). "Differentiable" refers to accurately and efficiently calculating gradients with respect to model variables, critically enabling the learning of high-dimensional unknown relationships. DG refers to a range of methods connecting varying amounts of prior knowledge to neural networks and training them together, capturing a different scope than physics-guided machine learning and emphasizing first principles. Preliminary evidence suggests DG offers better interpretability and causality than ML, improved generalizability and extrapolation capability, and strong potential for knowledge discovery, while approaching the performance of purely data-driven ML. DG models require less training data while scaling favorably in performance and efficiency with increasing amounts of data. With DG, geoscientists may be better able to frame and investigate questions, test hypotheses, and discover unrecognized linkages.
Submitted 26 December, 2023; v1 submitted 10 January, 2023;
originally announced January 2023.
-
Enhanced physics-constrained deep neural networks for modeling vanadium redox flow battery
Authors:
QiZhi He,
Yucheng Fu,
Panos Stinis,
Alexandre Tartakovsky
Abstract:
Numerical modeling and simulation have become indispensable tools for advancing a comprehensive understanding of the underlying mechanisms and cost-effective process optimization and control of flow batteries. In this study, we propose an enhanced version of the physics-constrained deep neural network (PCDNN) approach [1] to provide high-accuracy voltage predictions in vanadium redox flow batteries (VRFBs). The purpose of the PCDNN approach is to enforce the physics-based zero-dimensional (0D) VRFB model in a neural network to ensure model generalization for various battery operation conditions. Limited by the simplifications of the 0D model, the PCDNN cannot capture sharp voltage changes in the extreme state-of-charge (SOC) regions. To improve the accuracy of voltage prediction at extreme ranges, we introduce a second (enhanced) DNN to mitigate the prediction errors carried from the 0D model itself and call the resulting approach enhanced PCDNN (ePCDNN). By comparing the model prediction with experimental data, we demonstrate that the ePCDNN approach can accurately capture the voltage response throughout the charge--discharge cycle, including the tail region of the voltage discharge curve. Compared to the standard PCDNN, the prediction accuracy of the ePCDNN is significantly improved. The loss function for training the ePCDNN is designed to be flexible by adjusting the weights of the physics-constrained DNN and the enhanced DNN. This allows the ePCDNN framework to be transferable to battery systems with variable physical model fidelity.
Submitted 3 March, 2022;
originally announced March 2022.
-
Autonomous Inversion of In Situ Deformation Measurement Data for CO2 Storage Decision Support
Authors:
Jeff Burghardt,
Ting Bao,
Kailai Xu,
Alexandre Tartakovsky,
Eric Darve
Abstract:
Current methods of estimating the change in stress caused by injecting fluid into subsurface formations require choosing the type of constitutive model and the model parameters based on core, log, and geophysical data during the characterization phase, with little feedback from operational observations to validate or refine these choices. It is shown that errors in the assumed constitutive response, even when informed by laboratory tests on core samples, are likely to be common and large, and to underestimate the magnitude of stress change caused by injection. Recent advances in borehole-based strain instruments and borehole and surface-based tilt and displacement instruments have now enabled monitoring of the deformation of the storage system throughout its operational lifespan. This data can enable validation and refinement of the knowledge of the geomechanical properties and state of the system, but brings with it a challenge to transform the raw data into actionable knowledge. We demonstrate a method to perform a gradient-based deterministic inversion of geomechanical monitoring data. This approach allows autonomous integration of the instrument data without the need for time-consuming manual interpretation and selection of updated model parameters. The approach presented is very flexible as to what type of geomechanical constitutive response can be used. The approach is easily adaptable to nonlinear physics-based constitutive models to account for common rock behaviors such as creep and plasticity. The approach also enables training of machine learning-based constitutive models by allowing backpropagation of errors through the finite element calculations. This enables strongly enforcing known physics, such as conservation of momentum and continuity, while allowing data-driven models to learn the truly unknown physics such as the constitutive or petrophysical responses.
Submitted 24 September, 2021;
originally announced September 2021.
-
Physics-Informed Machine Learning Method for Large-Scale Data Assimilation Problems
Authors:
Yu-Hong Yeung,
David A. Barajas-Solano,
Alexandre M. Tartakovsky
Abstract:
We develop a physics-informed machine learning approach for large-scale data assimilation and parameter estimation and apply it to estimate transmissivity and hydraulic head in the two-dimensional steady-state subsurface flow model of the Hanford Site given synthetic measurements of said variables. In our approach, we extend the physics-informed conditional Karhunen-Loéve expansion (PICKLE) method for modeling subsurface flow with unknown flux (Neumann) and varying head (Dirichlet) boundary conditions. We demonstrate that the PICKLE method is comparable in accuracy with the standard maximum a posteriori (MAP) method, but is significantly faster than MAP for large-scale problems. Both methods use a mesh to discretize the computational domain. In MAP, the parameters and states are discretized on the mesh; therefore, the size of the MAP parameter estimation problem directly depends on the mesh size. In PICKLE, the mesh is used to evaluate the residuals of the governing equation, while the parameters and states are approximated by the truncated conditional Karhunen-Loéve expansions with the number of parameters controlled by the smoothness of the parameter and state fields, and not by the mesh size. For the example considered, we demonstrate that the computational cost of PICKLE increases nearly linearly (as $N_{FV}^{1.15}$) with the number of grid points $N_{FV}$, while that of MAP increases much faster, as $N_{FV}^{3.28}$. We also demonstrate that once trained for one set of Dirichlet boundary conditions (i.e., one river stage), the PICKLE method provides accurate estimates of the hydraulic head for any value of the Dirichlet boundary conditions (i.e., for any river stage).
Submitted 30 July, 2021;
originally announced August 2021.
-
Physics-constrained deep neural network method for estimating parameters in a redox flow battery
Authors:
QiZhi He,
Panos Stinis,
Alexandre Tartakovsky
Abstract:
In this paper, we present a physics-constrained deep neural network (PCDNN) method for parameter estimation in the zero-dimensional (0D) model of the vanadium redox flow battery (VRFB). In this approach, we use deep neural networks (DNNs) to approximate the model parameters as functions of the operating conditions. This method allows the integration of the VRFB computational models as the physical constraints in the parameter learning process, leading to enhanced accuracy of parameter estimation and cell voltage prediction. Using an experimental dataset, we demonstrate that the PCDNN method can estimate model parameters for a range of operating conditions and improve the 0D model prediction of voltage compared to the 0D model prediction with constant operation-condition-independent parameters estimated with traditional inverse methods. We also demonstrate that the PCDNN approach has an improved generalization ability for estimating parameter values for operating conditions not used in the DNN training.
Submitted 4 March, 2022; v1 submitted 21 June, 2021;
originally announced June 2021.
-
Physics-informed CoKriging model of a redox flow battery
Authors:
Amanda A. Howard,
Alexandre M. Tartakovsky
Abstract:
Redox flow batteries (RFBs) offer the capability to store large amounts of energy cheaply and efficiently; however, there is a need for fast and accurate models of the charge-discharge curve of an RFB to potentially improve the battery capacity and performance. We develop a multifidelity model for predicting the charge-discharge curve of an RFB. In the multifidelity model, we use the Physics-informed CoKriging (CoPhIK) machine learning method that is trained on experimental data and constrained by the so-called "zero-dimensional" physics-based model. Here we demonstrate that the model shows good agreement with experimental results and significant improvements over existing zero-dimensional models. We show that the proposed model is robust as it is not sensitive to the input parameters in the zero-dimensional model. We also show that only a small number of high-fidelity experimental datasets is needed for accurate predictions for the range of considered input parameters, which include current density, flow rate, and initial concentrations.
Submitted 16 June, 2021;
originally announced June 2021.
-
Physics-Informed Neural Network Method for Forward and Backward Advection-Dispersion Equations
Authors:
QiZhi He,
Alexandre M. Tartakovsky
Abstract:
We propose a discretization-free approach based on the physics-informed neural network (PINN) method for solving coupled advection-dispersion and Darcy flow equations with space-dependent hydraulic conductivity. In this approach, the hydraulic conductivity, hydraulic head, and concentration fields are approximated with deep neural networks (DNNs). We assume that the conductivity field is given by its values on a grid, and we use these values to train the conductivity DNN. The head and concentration DNNs are trained by minimizing the residuals of the flow equation and ADE and using the initial and boundary conditions as additional constraints. The PINN method is applied to one- and two-dimensional forward advection-dispersion equations (ADEs), where its performance for various Péclet numbers ($Pe$) is compared with the analytical and numerical solutions. We find that the PINN method is accurate with errors of less than 1% and outperforms some conventional discretization-based methods for $Pe$ larger than 100. Next, we demonstrate that the PINN method remains accurate for the backward ADEs, with the relative errors in most cases staying under 5% compared to the reference concentration field. Finally, we show that when available, the concentration measurements can be easily incorporated in the PINN method and significantly improve (by more than 50% in the considered cases) the accuracy of the PINN solution of the backward ADE.
Submitted 21 December, 2020;
originally announced December 2020.
-
Stochastically forced ensemble dynamic mode decomposition for forecasting and analysis of near-periodic systems
Authors:
Daniel Dylewsky,
David Barajas-Solano,
Tong Ma,
Alexandre M. Tartakovsky,
J. Nathan Kutz
Abstract:
Time series forecasting remains a central challenge in almost all scientific disciplines. We introduce a novel load forecasting method in which observed dynamics are modeled as a forced linear system using Dynamic Mode Decomposition (DMD) in time delay coordinates. Central to this approach is the insight that grid load, like many observables on complex real-world systems, has an "almost-periodic" character, i.e., a continuous Fourier spectrum punctuated by dominant peaks, which capture regular (e.g., daily or weekly) recurrences in the dynamics. The forecasting method presented takes advantage of this property by (i) regressing to a deterministic linear model whose eigenspectrum maps onto those peaks, and (ii) simultaneously learning a stochastic Gaussian process regression (GPR) model to actuate this system. Our forecasting algorithm is compared against state-of-the-art forecasting techniques not using additional explanatory variables and is shown to produce superior performance. Moreover, its use of linear intrinsic dynamics offers a number of desirable properties in terms of interpretability and parsimony. Results are presented for a test case using load data from an electrical grid. Load forecasting is an essential challenge in power systems engineering, with major implications for real-time control, pricing, maintenance, and security decisions.
Submitted 9 July, 2021; v1 submitted 8 October, 2020;
originally announced October 2020.
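The deterministic half of the method in the abstract above (a linear model fitted by DMD in time delay coordinates) can be sketched as follows. This is a plain, unforced DMD on a Hankel embedding; it omits the stochastic GPR forcing and the ensemble aspect that the abstract describes, and the function names `hankel_embed`, `fit_dmd`, and `forecast`, along with the choice of delay count and rank, are assumptions for illustration only.

```python
import numpy as np


def hankel_embed(series, n_delays):
    """Stack delayed copies of a 1-D series; each column is one delay window."""
    n_cols = len(series) - n_delays + 1
    return np.stack([series[i:i + n_cols] for i in range(n_delays)])


def fit_dmd(series, n_delays, rank):
    """Fit a rank-truncated linear map between consecutive delay windows."""
    H = hankel_embed(series, n_delays)
    X, Y = H[:, :-1], H[:, 1:]              # windows and their one-step shifts
    U, s, Vh = np.linalg.svd(X, full_matrices=False)
    U, s, Vh = U[:, :rank], s[:rank], Vh[:rank]
    # Reduced operator U^T Y V S^{-1}; dividing by s scales each column.
    A_tilde = U.T @ Y @ Vh.T / s
    return U, A_tilde


def forecast(series, U, A_tilde, n_delays, n_steps):
    """Advance the last delay window n_steps ahead; return the new samples."""
    z = U.T @ series[-n_delays:]            # project last window onto the modes
    out = []
    for _ in range(n_steps):
        z = A_tilde @ z
        out.append((U @ z)[-1])             # newest entry of the lifted window
    return np.array(out)
```

For a strictly periodic signal the delay-embedded dynamics are exactly linear, so a low rank suffices. For an almost-periodic signal such as grid load, the eigenvalues of `A_tilde` capture the dominant spectral peaks and the remainder would be left to a forcing model.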
-
Learning Unknown Physics of non-Newtonian Fluids
Authors:
Brandon Reyes,
Amanda A. Howard,
Paris Perdikaris,
Alexandre M. Tartakovsky
Abstract:
We extend the physics-informed neural network (PINN) method to learn viscosity models of two non-Newtonian systems (polymer melts and suspensions of particles) using only velocity measurements. The PINN-inferred viscosity models agree with the empirical models for shear rates with large absolute values but deviate for shear rates near zero where the analytical models have an unphysical singularity. Once a viscosity model is learned, we use the PINN method to solve the momentum conservation equation for non-Newtonian fluid flow using only the boundary conditions.
Submitted 26 August, 2020;
originally announced September 2020.
-
Physics-Informed Neural Networks for Multiphysics Data Assimilation with Application to Subsurface Transport
Authors:
QiZhi He,
David Barajas-Solano,
Guzel Tartakovsky,
Alexandre M. Tartakovsky
Abstract:
Data assimilation for parameter and state estimation in subsurface transport problems remains a significant challenge due to the sparsity of measurements, the heterogeneity of porous media, and the high computational cost of forward numerical models. We present a physics-informed deep neural network (DNN) machine learning method for estimating space-dependent hydraulic conductivity, hydraulic head, and concentration fields from sparse measurements. In this approach, we employ individual DNNs to approximate the unknown parameters (e.g., hydraulic conductivity) and states (e.g., hydraulic head and concentration) of a physical system, and jointly train these DNNs by minimizing a loss function that consists of the residuals of the governing equations in addition to the error with respect to measurement data. We apply this approach to assimilate conductivity, hydraulic head, and concentration measurements for joint inversion of the conductivity, hydraulic head, and concentration fields in a steady-state advection--dispersion problem. We study the accuracy of the physics-informed DNN approach with respect to data size, number of variables (conductivity and head versus conductivity, head, and concentration), DNN size, and DNN initialization during training. We demonstrate that the physics-informed DNNs are significantly more accurate than standard data-driven DNNs when the training set consists of sparse data. We also show that the accuracy of parameter estimation increases as additional variables are inverted jointly.
Submitted 5 December, 2019;
originally announced December 2019.
-
Physics-Informed Machine Learning with Conditional Karhunen-Loève Expansions
Authors:
Alexandre M. Tartakovsky,
David A. Barajas-Solano,
Qizhi He
Abstract:
We present a new physics-informed machine learning approach for the inversion of PDE models with heterogeneous parameters. In our approach, the space-dependent partially-observed parameters and states are approximated via Karhunen-Loève expansions (KLEs). Each of these KLEs is then conditioned on its corresponding measurements, resulting in low-dimensional models of the parameters and states that resolve observed data. Finally, the coefficients of the KLEs are estimated by minimizing the norm of the residual of the PDE model evaluated at a finite set of points in the computational domain, ensuring that the reconstructed parameters and states are consistent with both the observations and the PDE model to an arbitrary level of accuracy.
In our approach, KLEs are constructed using the eigendecomposition of covariance models of spatial variability. For the model parameters, we employ a parameterized covariance model calibrated on parameter observations; for the model states, the covariance is estimated from a number of forward simulations of the PDE model corresponding to realizations of the parameters drawn from their KLE. We apply the proposed approach to identifying heterogeneous log-diffusion coefficients in diffusion equations from spatially sparse measurements of the log-diffusion coefficient and the solution of the diffusion equation. We find that the proposed approach compares favorably against state-of-the-art point estimates such as maximum a posteriori estimation and physics-informed neural networks.
Submitted 4 December, 2019;
originally announced December 2019.
-
Highly-scalable, physics-informed GANs for learning solutions of stochastic PDEs
Authors:
Liu Yang,
Sean Treichler,
Thorsten Kurth,
Keno Fischer,
David Barajas-Solano,
Josh Romero,
Valentin Churavy,
Alexandre Tartakovsky,
Michael Houston,
Prabhat,
George Karniadakis
Abstract:
Uncertainty quantification for forward and inverse problems is a central challenge across physical and biomedical disciplines. We address this challenge for the problem of modeling subsurface flow at the Hanford Site by combining stochastic computational models with observational data using physics-informed GAN models. The geographic extent, spatial heterogeneity, and multiple correlation length scales of the Hanford Site require training a computationally intensive GAN model to thousands of dimensions. We develop a hierarchical scheme for exploiting domain parallelism, map discriminators and generators to multiple GPUs, and employ efficient communication schemes to ensure training stability and convergence. We developed a highly optimized implementation of this scheme that scales to 27,500 NVIDIA Volta GPUs and 4584 nodes on the Summit supercomputer with a 93.1% scaling efficiency, achieving peak and sustained half-precision rates of 1228 PF/s and 1207 PF/s.
Submitted 28 October, 2019;
originally announced October 2019.
-
A New Approach For Learning Coarse-Grained Potentials with Application to Immiscible Fluids
Authors:
Peiyuan Gao,
Xiu Yang,
Alexandre M. Tartakovsky
Abstract:
Even though atomistic and coarse-grained (CG) models have been used to simulate liquid nanodroplets in vapor, very few rigorous studies of the liquid-liquid interface structure are available, and most of them are limited to planar interfaces. In this work, we evaluate several existing force fields (FFs), including two atomistic and three CG FFs, with respect to modeling the interface structure and thermodynamic properties of the water-hexane interface. Both atomistic FFs are able to quantitatively reproduce the interfacial tension and the coexisting densities of the experimentally-observed planar interface. We use the atomistic FFs to model water droplets in hexane and use these simulations to test the CG FFs. We find that the tested CG FFs cannot reproduce the interfacial tensions of planar and/or curved interfaces. Finally, we propose a new approach for learning CG potentials within the CG SDK (Shinoda-DeVane-Klein) FF framework from atomistic simulation data. We demonstrate that the new potential significantly improves the prediction of both the interfacial tension and structure of water-hexane planar and curved interfaces.
Submitted 13 July, 2019;
originally announced July 2019.
-
Non-local model for surface tension in fluid-fluid simulations
Authors:
Amanda A. Howard,
Alexandre M. Tartakovsky
Abstract:
We propose a non-local model for surface tension obtained in the form of an integral of a molecular-force-like function with support $3.5\varepsilon$ added to the Navier-Stokes momentum conservation equation. We demonstrate analytically and numerically that, with the non-local model, interfaces with a radius of curvature larger than the support length behave macroscopically, and microscopically otherwise. For static droplets, the pressure difference $P_{\varepsilon, in} - P_{\varepsilon, out}$ satisfies the Young-Laplace law for droplet radius greater than $3.5\varepsilon$ and otherwise deviates from the Young-Laplace law. The latter indicates that the surface tension in the proposed model decreases with decreasing radius of curvature, which agrees with molecular dynamics and experimental studies of nanodroplets. Using the non-local model, we perform numerical simulations of droplets under dynamic conditions, including a rising droplet, a droplet in shear flow, and two colliding droplets in shear flow, and compare results with a standard Navier-Stokes model subject to the Young-Laplace boundary condition at the fluid-fluid interface implemented via the Conservative Level Set (CLS) method. We find good agreement with existing numerical methods and analytical results for a rising macroscopic droplet and a droplet in a shear flow. For colliding droplets in shear flow, the non-local model converges (with respect to the grid size) to the correct behavior, including sliding, coalescence, and merging and breaking of two droplets depending on the capillary number. In contrast, we find that the results of the CLS model are highly grid-size dependent.
Submitted 30 June, 2020; v1 submitted 24 June, 2019;
originally announced June 2019.
-
Analytical steady-state solutions for pressure with a multiscale non-local model for two-fluid systems
Authors:
Amanda A. Howard,
Yongcheng Zhou,
Alexandre M. Tartakovsky
Abstract:
We consider the nonlocal multiscale model for surface tension \citep{Tartakovsky2018} as an alternative to the (macroscale) Young-Laplace law. The nonlocal model is obtained in the form of an integral of a molecular-force-like function with support $\varepsilon$ added to the Navier-Stokes momentum conservation equation. Using this model, we calculate analytical forms for the steady-state equilibrium pressure gradient and pressure profile for circular and spherical bubbles and flat interfaces in two and three dimensions. According to the analytical solutions, the pressure changes continuously across the interface in a way that is quantitatively similar to what is observed in MD simulations. Furthermore, the pressure difference $P_{\varepsilon, in} - P_{\varepsilon, out}$ satisfies the Young-Laplace law for the radius of curvature greater than $3\varepsilon$ and deviates from the Young-Laplace law otherwise (i.e., $P_{\varepsilon, in} - P_{\varepsilon, out}$ goes to zero as the radius of the curvature goes to zero, where $P_{\varepsilon, out}$ is the pressure outside of the bubble at the distance greater than $3\varepsilon$ from the interface and $P_{\varepsilon, in}$ is the pressure at the center of the bubble). The latter indicates that the surface tension in the proposed model decreases with the decreasing radius of curvature, which agrees with molecular dynamics simulations and laboratory experiments with nanobubbles. Therefore, our results demonstrate that the nonlocal model behaves microscopically at scales smaller than $\varepsilon$ and macroscopically, otherwise.
Submitted 21 May, 2019; v1 submitted 16 May, 2019;
originally announced May 2019.
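For reference, the macroscale Young-Laplace law that the abstract above compares against can be written as follows. This is the standard textbook form, not a result taken from the paper; the symbols $\sigma$ (surface tension) and $R$ (radius of curvature) are notation assumed here, while $P_{\varepsilon, in}$ and $P_{\varepsilon, out}$ follow the abstract. Per the abstract, the nonlocal model recovers this behavior only for $R > 3\varepsilon$.

```latex
\[
P_{\varepsilon, in} - P_{\varepsilon, out} =
\begin{cases}
  \dfrac{\sigma}{R},  & \text{circular bubble (two dimensions)}, \\[2ex]
  \dfrac{2\sigma}{R}, & \text{spherical bubble (three dimensions)}, \\[2ex]
  0,                  & \text{flat interface } (R \to \infty).
\end{cases}
\]
```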
-
A comparative study of physics-informed neural network models for learning unknown dynamics and constitutive relations
Authors:
Ramakrishna Tipireddy,
Paris Perdikaris,
Panos Stinis,
Alexandre Tartakovsky
Abstract:
We investigate the use of discrete and continuous versions of physics-informed neural network methods for learning unknown dynamics or constitutive relations of a dynamical system. For the case of unknown dynamics, we represent all the dynamics with a deep neural network (DNN). When the dynamics of the system are known up to the specification of constitutive relations (that can depend on the state of the system), we represent these constitutive relations with a DNN. The discrete versions combine classical multistep discretization methods for dynamical systems with neural network based machine learning methods. On the other hand, the continuous versions utilize deep neural networks to minimize the residual function for the continuous governing equations. We use the case of a fed-batch bioreactor system to study the effectiveness of these approaches and discuss conditions for their applicability. Our results indicate that the accuracy of the trained neural network models is much higher for the cases where we only have to learn a constitutive relation instead of the whole dynamics. This finding corroborates the well-known fact from scientific computing that building as much structural information as is available into an algorithm can enhance its efficiency and/or accuracy.
Submitted 2 April, 2019;
originally announced April 2019.
-
MARTINI-based Coarse-grained Model for Poly(alpha-peptoid)s
Authors:
Peiyuan Gao,
Alex Tartakovsky
Abstract:
In this paper, we present a new coarse-grained (CG) model for poly(alpha-peptoid)s that is compatible with the MARTINI CG FF. In the proposed model, a CG poly(alpha-peptoid) is composed of a CG backbone (here we select polysarcosine as the backbone) and side chain beads. The CG model of the backbone (polysarcosine) in a solvent is first developed and then extended to poly(alpha-peptoid)s with different side groups that can be obtained from the MARTINI FF. We demonstrate that our CG model has good transferability. For example, the CG potentials for polysarcosine can be transferred to predict the hydration free energy of other peptoids. Also, the CG polypeptoid model accurately predicts the radius of gyration over a wide range of chain lengths and the solvation free energy for relatively short peptoid molecules in good solvents. We use the CG model to study sequenced diblock polypeptoids in binary solvent mixtures and compare the results with the experimentally observed coil-globule transition.
Submitted 5 March, 2019;
originally announced March 2019.
-
Learning Parameters and Constitutive Relationships with Physics Informed Deep Neural Networks
Authors:
Alexandre M. Tartakovsky,
Carlos Ortiz Marrero,
Paris Perdikaris,
Guzel D. Tartakovsky,
David Barajas-Solano
Abstract:
We present a physics informed deep neural network (DNN) method for estimating parameters and unknown physics (constitutive relationships) in partial differential equation (PDE) models. We use PDEs in addition to measurements to train DNNs to approximate unknown parameters and constitutive relationships as well as states. The proposed approach increases the accuracy of DNN approximations of partially known functions when a limited number of measurements is available and allows for training DNNs when no direct measurements of the functions of interest are available. We employ physics informed DNNs to estimate the unknown space-dependent diffusion coefficient in a linear diffusion equation and an unknown constitutive relationship in a non-linear diffusion equation. For the parameter estimation problem, we assume that partial measurements of the coefficient and states are available and demonstrate that under these conditions, the proposed method is more accurate than state-of-the-art methods. For the non-linear diffusion PDE model with a fully unknown constitutive relationship (i.e., no measurements of constitutive relationship are available), the physics informed DNN method can accurately estimate the non-linear constitutive relationship based on state measurements only. Finally, we demonstrate that the proposed method remains accurate in the presence of measurement noise.
Submitted 17 August, 2018; v1 submitted 9 August, 2018;
originally announced August 2018.
-
Discrete-element model for the interaction between ocean waves and sea ice
Authors:
Zhijie Xu,
Alexandre M. Tartakovsky,
Wenxiao Pan
Abstract:
We present a discrete element method (DEM) model to simulate the mechanical behavior of sea ice in response to ocean waves. The interaction of ocean waves and sea ice can potentially lead to the fracture and fragmentation of sea ice depending on the wave amplitude and period. The fracture behavior of sea ice is explicitly modeled by the DEM, where sea ice is modeled by densely packed spherical particles with finite size. These particles are bonded together at their contact points through mechanical bonds that can sustain both tensile and compressive forces and moments. Fracturing can be naturally represented by the sequential breaking of mechanical bonds. For a given amplitude and period of incident ocean wave, the model provides information for the spatial distribution and time evolution of stress and micro-fractures and the fragment size distribution. We demonstrate that the fraction of broken bonds increases with increasing wave amplitude. In contrast, the ice fragment size l decreases with increasing amplitude. This information is important for the understanding of the breakup of individual ice floes and floe fragment size.
Submitted 2 August, 2018;
originally announced August 2018.
-
A dissipative particle dynamics model of biofilm growth
Authors:
Zhijie Xu,
Paul Meakin,
Alexandre Tartakovsky,
Timothy D. Scheibe
Abstract:
A dissipative particle dynamics (DPD) model for the quantitative simulation of biofilm growth controlled by substrate (nutrient) consumption, advective and diffusive substrate transport, and hydrodynamic interactions with fluid flow (including fragmentation and reattachment) is described. The model was used to simulate biomass growth, decay, and spreading. It predicts how the biofilm morphology depends on flow conditions, biofilm growth kinetics, the rheomechanical properties of the biofilm and adhesion to solid surfaces. The morphology of the model biofilm depends strongly on its rigidity and the magnitude of the body force that drives the fluid over the biofilm.
Submitted 2 August, 2018;
originally announced August 2018.
-
A diffuse-interface model for smoothed particle hydrodynamics
Authors:
Zhijie Xu,
Paul Meakin,
Alexandre Tartakovsky
Abstract:
Diffuse-interface theory provides a foundation for the modeling and simulation of microstructure evolution in a very wide range of materials, and for the tracking/capturing of dynamic interfaces between different materials on larger scales. Smoothed particle hydrodynamics (SPH) is also widely used to simulate fluids and solids that are subjected to large deformations and have complex dynamic boundaries and/or interfaces, but no explicit interface tracking/capturing is required, even when topological changes such as fragmentation and coalescence occur, because of its Lagrangian particle nature. Here we developed an SPH model for single-component two-phase fluids that is based on diffuse-interface theory. In the model, the interface has a finite thickness and a surface tension that depend on the coefficient, k, of the gradient contribution to the Helmholtz free energy functional and the density dependent homogeneous free energy. In this model, there is no need to locate the surface (or interface) or to compute the curvature at and near the interface. One- and two-dimensional SPH simulations were used to validate the model.
Submitted 2 August, 2018;
originally announced August 2018.
-
Method of model reduction and multifidelity models for solute transport in random layered porous media
Authors:
Zhijie Xu,
Alexandre M. Tartakovsky
Abstract:
This work presents a hierarchical model for solute transport in bounded layered porous media with random permeability. The model generalizes the Taylor-Aris dispersion theory to stochastic transport in random layered porous media with a known velocity covariance function. In the hierarchical model, we represent (random) concentration in terms of its cross-sectional average and a variation function. We derive a one-dimensional stochastic advection-dispersion-type equation for the average concentration and a stochastic Poisson equation for the variation function, as well as expressions for the effective velocity and dispersion coefficient. We observe that velocity fluctuations enhance dispersion in a non-monotonic fashion: the dispersion initially increases with correlation length λ, reaches a maximum, and decreases to zero at infinity. Maximum enhancement is obtained at a correlation length of about 0.25 times the size of the porous medium perpendicular to the flow.
Submitted 25 June, 2018;
originally announced June 2018.
-
Engineering Structural Robustness in Power Grid Networks Susceptible to Coherent Swing Instability
Authors:
Daniel Dylewsky,
Xiu Yang,
Alexandre Tartakovsky,
J. Nathan Kutz
Abstract:
Networked power grid systems are susceptible to a phenomenon known as Coherent Swing Instability (CSI), in which a subset of machines in the grid loses synchrony with the rest of the network. We develop network-level evaluation metrics to (i) identify community substructures in the power grid network, (ii) determine weak points in the network that are particularly sensitive to CSI, and (iii) produce an engineering approach for the addition of transmission lines to reduce the incidence of CSI in existing networks, or to design new power grid networks that are robust to CSI by their network design. For simulations on a reduced model for the American Northeast power grid, where a block of buses representing the New England region exhibits a strong propensity for CSI, we show that modifying the network's connectivity structure can markedly improve the grid's resilience to CSI. Our analysis provides a versatile diagnostic tool for evaluating the efficacy of adding lines to a power grid that is known to be prone to CSI. This is a particularly relevant problem in large-scale power systems, where improving stability and robustness to interruptions by increasing overall network connectivity is not feasible due to financial and infrastructural constraints.
Submitted 22 May, 2019; v1 submitted 24 May, 2018;
originally announced May 2018.
-
Continuum Model for Nanoscale Multiphase Flows
Authors:
Alexandre M. Tartakovsky
Abstract:
We propose a nonlocal model for surface tension. This model, in combination with the Landau-Lifshitz-Navier-Stokes equations, describes mesoscale features of the multiphase flow, including the static (pressure) tensor and curvature dependence of surface tension. The nonlocal model is obtained in the form of an integral of a molecular-force-like function added into the momentum conservation equation. We present an analytical steady-state solution for fluid pressure at the fluid-fluid interface and numerical Smoothed Particle Hydrodynamics solutions that reveal the mesoscopic features of the proposed model.
Submitted 21 May, 2018;
originally announced May 2018.
-
Persistent incomplete mixing in reactive flows
Authors:
Alexandre M. Tartakovsky,
David Barajas-Solano
Abstract:
We present an effective stochastic advection-diffusion-reaction (SADR) model that explains incomplete mixing typically observed in transport with bimolecular reactions. Unlike traditional advection-dispersion-reaction models, the SADR model describes mechanical and diffusive mixing as two separate processes. In the SADR model, mechanical mixing is driven by random advective velocity with the variance given by the coefficient of mechanical dispersion. The diffusive mixing is modeled as Fickian diffusion with the effective diffusion coefficient. We demonstrate that the sum of the two coefficients is equal to the dispersion coefficient, but only the effective diffusion coefficient contributes to the mixing-controlled reactions, indicating that such systems do not get fully mixed at the Representative Elementary Volume scale where the deterministic equations and dispersion coefficient are defined. We use the experimental results of Gramling et al. \cite{Gramling} to show that for transport and bimolecular reactions in porous media, the SADR model is significantly more accurate than the traditional dispersion model, which overestimates the concentration of the reaction product by as much as 60\%.
Submitted 18 March, 2018;
originally announced March 2018.
-
Modeling Temporal Activity Patterns in Dynamic Social Networks
Authors:
Vasanthan Raghavan,
Greg Ver Steeg,
Aram Galstyan,
Alexander G. Tartakovsky
Abstract:
The focus of this work is on developing probabilistic models for user activity in social networks by incorporating the social network influence as perceived by the user. For this, we propose a coupled Hidden Markov Model, where each user's activity evolves according to a Markov chain with a hidden state that is influenced by the collective activity of the friends of the user. We develop generalized Baum-Welch and Viterbi algorithms for model parameter learning and state estimation for the proposed framework. We then validate the proposed model using a significant corpus of user activity on Twitter. Our numerical studies show that with sufficient observations to ensure accurate model learning, the proposed framework explains the observed data better than either a renewal process-based model or a conventional uncoupled Hidden Markov Model. We also demonstrate the utility of the proposed approach in predicting the time to the next tweet. Finally, clustering in the model parameter space is shown to result in distinct natural clusters of users characterized by the interaction dynamic between a user and his network.
Submitted 8 May, 2013;
originally announced May 2013.
-
Hidden Markov models for the activity profile of terrorist groups
Authors:
Vasanthan Raghavan,
Aram Galstyan,
Alexander G. Tartakovsky
Abstract:
The main focus of this work is on developing models for the activity profile of a terrorist group, detecting sudden spurts and downfalls in this profile, and, in general, tracking it over a period of time. Toward this goal, a $d$-state hidden Markov model (HMM) that captures the latent states underlying the dynamics of the group and thus its activity profile is developed. The simplest setting of $d=2$ corresponds to the case where the dynamics are coarsely quantized as Active and Inactive, respectively. A state estimation strategy that exploits the underlying HMM structure is then developed for spurt detection and tracking. This strategy is shown to track even nonpersistent changes that last only for a short duration at the cost of learning the underlying model. Case studies with real terrorism data from open-source databases are provided to illustrate the performance of the proposed methodology.
Submitted 15 January, 2014; v1 submitted 5 July, 2012;
originally announced July 2012.
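To make the $d=2$ setting in the abstract above concrete, here is a minimal sketch of state estimation in a two-state (Inactive/Active) HMM using the standard Viterbi algorithm. It is not the authors' estimation strategy: the Poisson emission model, the function name `viterbi_poisson`, and every numerical value below are assumptions chosen purely for illustration.

```python
import math


def viterbi_poisson(counts, rates, trans, init):
    """Most likely hidden-state path for an HMM with Poisson emissions.

    counts: observed event counts per time step (illustrative data)
    rates:  assumed Poisson rate for each hidden state
    trans:  trans[r][s] = probability of moving from state r to state s
    init:   initial state probabilities
    """
    n_states = len(rates)

    def log_emit(k, lam):
        # Log of the Poisson pmf: k*log(lam) - lam - log(k!)
        return k * math.log(lam) - lam - math.lgamma(k + 1)

    # score[s] = best log-probability of any path that ends in state s
    score = [math.log(init[s]) + log_emit(counts[0], rates[s])
             for s in range(n_states)]
    back = []  # back[t][s] = best predecessor of state s at step t + 1
    for k in counts[1:]:
        prev = score
        pointers, score = [], []
        for s in range(n_states):
            best = max(range(n_states),
                       key=lambda r: prev[r] + math.log(trans[r][s]))
            pointers.append(best)
            score.append(prev[best] + math.log(trans[best][s])
                         + log_emit(k, rates[s]))
        back.append(pointers)

    # Trace the best path backwards from the most likely final state
    state = max(range(n_states), key=lambda s: score[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


# Hypothetical counts: quiet, then a short spurt, then quiet again
counts = [0, 1, 0, 6, 8, 7, 0, 1]
path = viterbi_poisson(counts,
                       rates=[0.5, 6.0],                 # Inactive, Active
                       trans=[[0.9, 0.1], [0.1, 0.9]],
                       init=[0.5, 0.5])
print(path)  # 0 = Inactive, 1 = Active
```

With these assumed parameters, the decoded path flags the three high-count steps as Active, which is the kind of short-lived spurt the abstract describes tracking.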
-
The Effect of Nonlinearity in Hybrid KMC-Continuum models
Authors:
Ariel Balter,
Guang Lin,
Alexandre M. Tartakovsky
Abstract:
Recently there has been interest in developing efficient ways to model heterogeneous surface reactions with hybrid computational models that couple a KMC model for a surface to a finite difference model for bulk diffusion in a continuous domain. We consider two representative problems that validate a hybrid method and also show that this method captures the combined effects of nonlinearity and stochasticity. We first validate a simple deposition/dissolution model with a linear rate showing that the KMC-continuum hybrid agrees with both a fully deterministic model and its analytical solution. We then study a deposition/dissolution model including competitive adsorption, which leads to a nonlinear rate, and show that, in this case, the KMC-continuum hybrid and fully deterministic simulations do not agree. However, we are able to identify the difference as a natural result of the stochasticity coming from the KMC surface process. Because KMC captures inherent fluctuations, we consider it to be more realistic than a purely deterministic model. Therefore, we consider the KMC-continuum hybrid to be more representative of a real system.
Submitted 26 August, 2011;
originally announced August 2011.
-
Multinomial Diffusion Equation
Authors:
Ariel Balter,
Alexandre Tartakovsky
Abstract:
We describe a new, microscopic model for diffusion that captures diffusion-induced fluctuations at scales where the concept of concentration gives way to discrete particles. We show that in the limit as the number of particles $N \to \infty$, our model is equivalent to the classical stochastic diffusion equation (SDE). We test our new model and the SDE against Langevin dynamics in numerical simulations, and show that our model successfully reproduces the correct ensemble statistics, while the classical model fails.
Submitted 3 May, 2011; v1 submitted 4 October, 2010;
originally announced October 2010.
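The abstract above does not spell out the model, so the following is only a generic sketch of what diffusion with discrete particles and multinomial transfers can look like, not the authors' formulation. It assumes a periodic one-dimensional lattice on which each particle independently hops left, hops right, or stays, so that the split of each cell's population is a single multinomial draw. The function name `multinomial_step` and the parameter `p_hop` are hypothetical.

```python
import random


def multinomial_step(counts, p_hop, rng):
    """Advance integer particle counts on a periodic 1D lattice by one step.

    Each particle hops left with probability p_hop, right with probability
    p_hop, and stays otherwise (so p_hop must not exceed 0.5). Sampling every
    particle independently is equivalent to one multinomial draw per cell.
    """
    n_cells = len(counts)
    new_counts = [0] * n_cells
    for i, n in enumerate(counts):
        for _ in range(n):
            u = rng.random()
            if u < p_hop:
                new_counts[(i - 1) % n_cells] += 1   # hop left
            elif u < 2 * p_hop:
                new_counts[(i + 1) % n_cells] += 1   # hop right
            else:
                new_counts[i] += 1                   # stay in place
    return new_counts


# Illustrative run: 1000 particles released in the middle cell
rng = random.Random(0)
counts = [0] * 21
counts[10] = 1000
for _ in range(50):
    counts = multinomial_step(counts, p_hop=0.25, rng=rng)
print(sum(counts), counts)
```

Because particle numbers stay integer and are moved rather than created, the total is conserved exactly and no cell can go negative, which is the regime the abstract refers to where concentration gives way to discrete particles.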