-
A Multiple Filter Based Neural Network Approach to the Extrapolation of Adsorption Energies on Metal Surfaces for Catalysis Applications
Authors:
Asif J. Chowdhury,
Wenqiang Yang,
Kareem E. Abdelfatah,
Mehdi Zare,
Andreas Heyden,
Gabriel Terejanu
Abstract:
Computational catalyst discovery involves the development of microkinetic reactor models based on estimated parameters determined from density functional theory (DFT). For complex surface chemistries, the cost of calculating the adsorption energies by DFT for a large number of reaction intermediates can become prohibitive. Here, we have identified appropriate descriptors and machine learning model…
▽ More
Computational catalyst discovery involves the development of microkinetic reactor models based on estimated parameters determined from density functional theory (DFT). For complex surface chemistries, the cost of calculating the adsorption energies by DFT for a large number of reaction intermediates can become prohibitive. Here, we have identified appropriate descriptors and machine learning models that can be used to predict part of these adsorption energies given data on the rest of them. Our investigations also included the case when the species data used to train the predictive model is of different size relative to the species the model tries to predict - an extrapolation in the data space which is typically difficult with regular machine learning models. We have developed a neural network based predictive model that combines an established model with the concepts of a convolutional neural network that, when extrapolating, achieves significant improvement over the previous models.
△ Less
Submitted 1 October, 2019;
originally announced October 2019.
-
Identifying Active Sites of the Water-Gas Shift Reaction over Titania Supported Platinum Catalysts under Uncertainty
Authors:
Eric A. Walker,
Donald Mitchell,
Gabriel A. Terejanu,
Andreas Heyden
Abstract:
A comprehensive uncertainty quantification framework has been developed for integrating computational and experimental kinetic data and to identify active sites and reaction mechanisms in catalysis. Three hypotheses regarding the active site for the water-gas shift reaction on Pt/TiO2 catalysts are tested - Pt(111), an edge interface site, and a corner interface site. Uncertainties associated with…
▽ More
A comprehensive uncertainty quantification framework has been developed for integrating computational and experimental kinetic data and to identify active sites and reaction mechanisms in catalysis. Three hypotheses regarding the active site for the water-gas shift reaction on Pt/TiO2 catalysts are tested - Pt(111), an edge interface site, and a corner interface site. Uncertainties associated with DFT calculations and model errors of microkinetic models of the active sites are informed and verified using Bayesian inference and predictive validation. Significant evidence is found for the role of the oxide support in the mechanism. Positive evidence is found in support of the edge interface active site over the corner interface site. For the edge interface site, the CO-promoted redox mechanism is found to be the dominant pathway and only at temperatures above 573 K does the classical redox mechanism contribute significantly to the overall rate. At all reaction conditions, water and surface O-H bond dissociation steps at the Pt/TiO2 interface are the main rate controlling steps.
△ Less
Submitted 10 October, 2017;
originally announced October 2017.
-
Validating Predictions of Unobserved Quantities
Authors:
Todd A. Oliver,
Gabriel Terejanu,
Christopher S. Simmons,
Robert D. Moser
Abstract:
The ultimate purpose of most computational models is to make predictions, commonly in support of some decision-making process (e.g., for design or operation of some system). The quantities that need to be predicted (the quantities of interest or QoIs) are generally not experimentally observable before the prediction, since otherwise no prediction would be needed. Assessing the validity of such ext…
▽ More
The ultimate purpose of most computational models is to make predictions, commonly in support of some decision-making process (e.g., for design or operation of some system). The quantities that need to be predicted (the quantities of interest or QoIs) are generally not experimentally observable before the prediction, since otherwise no prediction would be needed. Assessing the validity of such extrapolative predictions, which is critical to informed decision-making, is challenging. In classical approaches to validation, model outputs for observed quantities are compared to observations to determine if they are consistent. By itself, this consistency only ensures that the model can predict the observed quantities under the conditions of the observations. This limitation dramatically reduces the utility of the validation effort for decision making because it implies nothing about predictions of unobserved QoIs or for scenarios outside of the range of observations. However, there is no agreement in the scientific community today regarding best practices for validation of extrapolative predictions made using computational models. The purpose of this paper is to propose and explore a validation and predictive assessment process that supports extrapolative predictions for models with known sources of error. The process includes stochastic modeling, calibration, validation, and predictive assessment phases where representations of known sources of uncertainty and error are built, informed, and tested. The proposed methodology is applied to an illustrative extrapolation problem involving a misspecified nonlinear oscillator.
△ Less
Submitted 29 April, 2014;
originally announced April 2014.
-
Optimal Data Split Methodology for Model Validation
Authors:
Rebecca Morrison,
Corey Bryant,
Gabriel Terejanu,
Kenji Miki,
Serge Prudhomme
Abstract:
The decision to incorporate cross-validation into validation processes of mathematical models raises an immediate question - how should one partition the data into calibration and validation sets? We answer this question systematically: we present an algorithm to find the optimal partition of the data subject to certain constraints. While doing this, we address two critical issues: 1) that the mod…
▽ More
The decision to incorporate cross-validation into validation processes of mathematical models raises an immediate question - how should one partition the data into calibration and validation sets? We answer this question systematically: we present an algorithm to find the optimal partition of the data subject to certain constraints. While doing this, we address two critical issues: 1) that the model be evaluated with respect to predictions of a given quantity of interest and its ability to reproduce the data, and 2) that the model be highly challenged by the validation set, assuming it is properly informed by the calibration set. This framework also relies on the interaction between the experimentalist and/or modeler, who understand the physical system and the limitations of the model; the decision-maker, who understands and can quantify the cost of model failure; and the computational scientists, who strive to determine if the model satisfies both the modeler's and decision maker's requirements. We also note that our framework is quite general, and may be applied to a wide range of problems. Here, we illustrate it through a specific example involving a data reduction model for an ICCD camera from a shock-tube experiment located at the NASA Ames Research Center (ARC).
△ Less
Submitted 30 August, 2011;
originally announced August 2011.
-
Comparison of SCIPUFF Plume Prediction with Particle Filter Assimilated Prediction for Dipole Pride 26 Data
Authors:
Gabriel Terejanu,
Yang Cheng,
Tarunraj Singh,
Peter D. Scott
Abstract:
This paper presents the application of a particle filter for data assimilation in the context of puff-based dispersion models. Particle filters provide estimates of the higher moments, and are well suited for strongly nonlinear and/or non-Gaussian models. The Gaussian puff model SCIPUFF, is used in predicting the chemical concentration field after a chemical incident. This model is highly nonlinea…
▽ More
This paper presents the application of a particle filter for data assimilation in the context of puff-based dispersion models. Particle filters provide estimates of the higher moments, and are well suited for strongly nonlinear and/or non-Gaussian models. The Gaussian puff model SCIPUFF, is used in predicting the chemical concentration field after a chemical incident. This model is highly nonlinear and evolves with variable state dimension and, after sufficient time, high dimensionality. While the particle filter formalism naturally supports variable state dimensionality high dimensionality represents a challenge in selecting an adequate number of particles, especially for the Bootstrap version. We present an implementation of the Bootstrap particle filter and compare its performance with the SCIPUFF predictions. Both the model and the Particle Filter are evaluated on the Dipole Pride 26 experimental data. Since there is no available ground truth, the data has been divided in two sets: training and testing. We show that even with a modest number of particles, the Bootstrap particle filter provides better estimates of the concentration field compared with the process model, without excessive increase in computational complexity.
△ Less
Submitted 7 July, 2011;
originally announced July 2011.
-
Bayesian experimental design for the active nitridation of graphite by atomic nitrogen
Authors:
Gabriel Terejanu,
Rochan R. Upadhyay,
Kenji Miki
Abstract:
The problem of optimal data collection to efficiently learn the model parameters of a graphite nitridation experiment is studied in the context of Bayesian analysis using both synthetic and real experimental data. The paper emphasizes that the optimal design can be obtained as a result of an information theoretic sensitivity analysis. Thus, the preferred design is where the statistical dependence…
▽ More
The problem of optimal data collection to efficiently learn the model parameters of a graphite nitridation experiment is studied in the context of Bayesian analysis using both synthetic and real experimental data. The paper emphasizes that the optimal design can be obtained as a result of an information theoretic sensitivity analysis. Thus, the preferred design is where the statistical dependence between the model parameters and observables is the highest possible. In this paper, the statistical dependence between random variables is quantified by mutual information and estimated using a k-nearest neighbor based approximation. It is shown, that by monitoring the inference process via measures such as entropy or Kullback-Leibler divergence, one can determine when to stop the data collection process. The methodology is applied to select the most informative designs on both a simulated data set and on an experimental data set, previously published in the literature. It is also shown that the sequential Bayesian analysis used in the experimental design can also be useful in detecting conflicting information between measurements and model predictions.
△ Less
Submitted 7 July, 2011;
originally announced July 2011.
-
Application of Predictive Model Selection to Coupled Models
Authors:
Gabriel Terejanu,
Todd Oliver,
Chris Simmons
Abstract:
A predictive Bayesian model selection approach is presented to discriminate coupled models used to predict an unobserved quantity of interest (QoI). The need for accurate predictions arises in a variety of critical applications such as climate, aerospace and defense. A model problem is introduced to study the prediction yielded by the coupling of two physics/sub-components. For each single physics…
▽ More
A predictive Bayesian model selection approach is presented to discriminate coupled models used to predict an unobserved quantity of interest (QoI). The need for accurate predictions arises in a variety of critical applications such as climate, aerospace and defense. A model problem is introduced to study the prediction yielded by the coupling of two physics/sub-components. For each single physics domain, a set of model classes and a set of sensor observations are available. A goal-oriented algorithm using a predictive approach to Bayesian model selection is then used to select the combination of single physics models that best predict the QoI. It is shown that the best coupled model for prediction is the one that provides the most robust predictive distribution for the QoI.
△ Less
Submitted 5 July, 2011;
originally announced July 2011.