Search | arXiv e-print repository

Variational Bayesian Optimal Experimental Design with Normalizing Flows

Authors: Jiayuan Dong, Christian Jacobsen, Mehdi Khalloufi, Maryam Akram, Wanjiao Liu, Karthik Duraisamy, Xun Huan

Abstract: Bayesian optimal experimental design (OED) seeks experiments that maximize the expected information gain (EIG) in model parameters. Directly estimating the EIG using nested Monte Carlo is computationally expensive and requires an explicit likelihood. Variational OED (vOED), in contrast, estimates a lower bound of the EIG without likelihood evaluations by approximating the posterior distributions w… ▽ More Bayesian optimal experimental design (OED) seeks experiments that maximize the expected information gain (EIG) in model parameters. Directly estimating the EIG using nested Monte Carlo is computationally expensive and requires an explicit likelihood. Variational OED (vOED), in contrast, estimates a lower bound of the EIG without likelihood evaluations by approximating the posterior distributions with variational forms, and then tightens the bound by optimizing its variational parameters. We introduce the use of normalizing flows (NFs) for representing variational distributions in vOED; we call this approach vOED-NFs. Specifically, we adopt NFs with a conditional invertible neural network architecture built from compositions of coupling layers, and enhanced with a summary network for data dimension reduction. We present Monte Carlo estimators to the lower bound along with gradient expressions to enable a gradient-based simultaneous optimization of the variational parameters and the design variables. The vOED-NFs algorithm is then validated in two benchmark problems, and demonstrated on a partial differential equation-governed application of cathodic electrophoretic deposition and an implicit likelihood case with stochastic modeling of aphid population. The findings suggest that a composition of 4--5 coupling layers is able to achieve lower EIG estimation bias, under a fixed budget of forward model runs, compared to previous approaches. The resulting NFs produce approximate posteriors that agree well with the true posteriors, able to capture non-Gaussian and multi-modal features effectively. △ Less

Submitted 8 April, 2024; originally announced April 2024.

MSC Class: 62K05; 94A17; 62C10; 62F15

arXiv:2404.12556 [pdf, other]

Variance-informed Rounding Uncertainty Analysis for Floating-point Statistical Models

Authors: Sahil Bhola, Karthik Duraisamy

Abstract: Advancements in computer hardware have made it possible to utilize low- and mixed-precision arithmetic for enhanced computational efficiency. In practical predictive modeling, however, it is vital to quantify uncertainty due to rounding along other sources like measurement, sampling, and numerical discretization. Traditional deterministic rounding uncertainty analysis (DBEA) assumes that the round… ▽ More Advancements in computer hardware have made it possible to utilize low- and mixed-precision arithmetic for enhanced computational efficiency. In practical predictive modeling, however, it is vital to quantify uncertainty due to rounding along other sources like measurement, sampling, and numerical discretization. Traditional deterministic rounding uncertainty analysis (DBEA) assumes that the rounding errors equal the unit roundoff $u$. However, despite providing strong guarantees, DBEA severely overestimates rounding uncertainty. This work presents a novel probabilistic rounding uncertainty analysis called VIBEA. By treating rounding errors as i.i.d. random variables and leveraging concentration inequalities, VIBEA provides high-confidence estimates for rounding uncertainty using higher-order rounding error statistics. The presented framework is valid for all problem sizes $n$, unlike DBEA, which necessitates $nu<1$. Further, it can account for the potential cancellation of rounding errors, resulting in rounding uncertainty estimates that grow slowly with $n$. We show that for $n>n_c(u)$, VIBEA produces tighter estimates for rounding uncertainty than DBEA. We also show that VIBEA improves existing probabilistic rounding uncertainty analysis techniques for $n\ge3$ by using higher-order rounding error statistics. We conduct numerical experiments on random vector dot products, a linear system solution, and a stochastic boundary value problem. We show that quantifying rounding uncertainty along with traditional sources (numerical discretization, sampling, parameters) enables a more efficient allocation of computational resources, thereby balancing computational efficiency with predictive accuracy. This study is a step towards a comprehensive mixed-precision approach that improves model reliability and enables budgeting of computational resources in predictive modeling and decision-making. △ Less

Submitted 18 April, 2024; originally announced April 2024.

arXiv:2304.02025 [pdf, other]

doi 10.1038/s41598-023-44589-3

Estimating Global Identifiability Using Conditional Mutual Information in a Bayesian Framework

Authors: Sahil Bhola, Karthik Duraisamy

Abstract: A novel information-theoretic approach is proposed to assess the global practical identifiability of Bayesian statistical models. Based on the concept of conditional mutual information, an estimate of information gained for each model parameter is used to quantify the identifiability with practical considerations. No assumptions are made about the structure of the statistical model or the prior di… ▽ More A novel information-theoretic approach is proposed to assess the global practical identifiability of Bayesian statistical models. Based on the concept of conditional mutual information, an estimate of information gained for each model parameter is used to quantify the identifiability with practical considerations. No assumptions are made about the structure of the statistical model or the prior distribution while constructing the estimator. The estimator has the following notable advantages: first, no controlled experiment or data is required to conduct the practical identifiability analysis; second, unlike popular variance-based global sensitivity analysis methods, different forms of uncertainties, such as model-form, parameter, or measurement can be taken into account; third, the identifiability analysis is global, and therefore independent of a realization of the parameters. If an individual parameter has low identifiability, it can belong to an identifiable subset such that parameters within the subset have a functional relationship and thus have a combined effect on the statistical model. The practical identifiability framework is extended to highlight the dependencies between parameter pairs that emerge a posteriori to find identifiable parameter subsets. The applicability of the proposed approach is demonstrated using a linear Gaussian model and a non-linear methane-air reduced kinetics model. It is shown that by examining the information gained for each model parameter along with its dependencies with other parameters, a subset of parameters that can be estimated with high posterior certainty can be found. △ Less

Submitted 21 September, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

arXiv:2002.10637 [pdf, other]

doi 10.1017/jfm.2021.271

Sparsity-promoting algorithms for the discovery of informative Koopman invariant subspaces

Authors: Shaowu Pan, Nicholas Arnold-Medabalimi, Karthik Duraisamy

Abstract: Koopman decomposition is a non-linear generalization of eigen-decomposition, and is being increasingly utilized in the analysis of spatio-temporal dynamics. Well-known techniques such as the dynamic mode decomposition (DMD) and its linear variants provide approximations to the Koopman operator, and have been applied extensively in many fluid dynamic problems. Despite being endowed with a richer di… ▽ More Koopman decomposition is a non-linear generalization of eigen-decomposition, and is being increasingly utilized in the analysis of spatio-temporal dynamics. Well-known techniques such as the dynamic mode decomposition (DMD) and its linear variants provide approximations to the Koopman operator, and have been applied extensively in many fluid dynamic problems. Despite being endowed with a richer dictionary of nonlinear observables, nonlinear variants of the DMD, such as extended/kernel dynamic mode decomposition (EDMD/KDMD) are seldom applied to large-scale problems primarily due to the difficulty of discerning the Koopman invariant subspace from thousands of resulting Koopman eigenmodes. To address this issue, we propose a framework based on multi-task feature learning to extract the most informative Koopman invariant subspace by removing redundant and spurious Koopman triplets. In particular, we develop a pruning procedure that penalizes departure from linear evolution. These algorithms can be viewed as sparsity promoting extensions of EDMD/KDMD. Further, we extend KDMD to a continuous-time setting and show a relationship between the present algorithm, sparsity-promoting DMD, and an empirical criterion from the viewpoint of non-convex optimization. The effectiveness of our algorithm is demonstrated on examples ranging from simple dynamical systems to two-dimensional cylinder wake flows at different Reynolds numbers and a three-dimensional turbulent ship air-wake flow. The latter two problems are designed such that very strong nonlinear transients are present, thus requiring an accurate approximation of the Koopman operator. Underlying physical mechanisms are analyzed, with an emphasis on characterizing transient dynamics. The results are compared to existing theoretical expositions and numerical approximations. △ Less

Submitted 2 January, 2021; v1 submitted 24 February, 2020; originally announced February 2020.

Comments: 48 pages

MSC Class: 37C30; 68T05

arXiv:1906.03663 [pdf, other]

doi 10.1137/19M1267246

Physics-Informed Probabilistic Learning of Linear Embeddings of Non-linear Dynamics With Guaranteed Stability

Authors: Shaowu Pan, Karthik Duraisamy

Abstract: The Koopman operator has emerged as a powerful tool for the analysis of nonlinear dynamical systems as it provides coordinate transformations to globally linearize the dynamics. While recent deep learning approaches have been useful in extracting the Koopman operator from a data-driven perspective, several challenges remain. In this work, we formalize the problem of learning the continuous-time Ko… ▽ More The Koopman operator has emerged as a powerful tool for the analysis of nonlinear dynamical systems as it provides coordinate transformations to globally linearize the dynamics. While recent deep learning approaches have been useful in extracting the Koopman operator from a data-driven perspective, several challenges remain. In this work, we formalize the problem of learning the continuous-time Koopman operator with deep neural networks in a measure-theoretic framework. Our approach induces two types of models: differential and recurrent form, the choice of which depends on the availability of the governing equations and data. We then enforce a structural parameterization that renders the realization of the Koopman operator provably stable. A new autoencoder architecture is constructed, such that only the residual of the dynamic mode decomposition is learned. Finally, we employ mean-field variational inference (MFVI) on the aforementioned framework in a hierarchical Bayesian setting to quantify uncertainties in the characterization and prediction of the dynamics of observables. The framework is evaluated on a simple polynomial system, the Duffing oscillator, and an unstable cylinder wake flow with noisy measurements. △ Less

Submitted 20 June, 2020; v1 submitted 9 June, 2019; originally announced June 2019.

Comments: 31 pages

Journal ref: SIAM Journal on Applied Dynamical Systems 19.1 (2020): 480-509

arXiv:1805.12547 [pdf, other]

doi 10.1155/2018/4801012

Long-time predictive modeling of nonlinear dynamical systems using neural networks

Authors: Shaowu Pan, Karthik Duraisamy

Abstract: We study the use of feedforward neural networks (FNN) to develop models of nonlinear dynamical systems from data. Emphasis is placed on predictions at long times, with limited data availability. Inspired by global stability analysis, and the observation of the strong correlation between the local error and the maximum singular value of the Jacobian of the ANN, we introduce Jacobian regularization… ▽ More We study the use of feedforward neural networks (FNN) to develop models of nonlinear dynamical systems from data. Emphasis is placed on predictions at long times, with limited data availability. Inspired by global stability analysis, and the observation of the strong correlation between the local error and the maximum singular value of the Jacobian of the ANN, we introduce Jacobian regularization in the loss function. This regularization suppresses the sensitivity of the prediction to the local error and is shown to improve accuracy and robustness. Comparison between the proposed approach and sparse polynomial regression is presented in numerical examples ranging from simple ODE systems to nonlinear PDE systems including vortex shedding behind a cylinder, and instability-driven buoyant mixing flow. Furthermore, limitations of feedforward neural networks are highlighted, especially when the training data does not include a low dimensional attractor. Strategies of data augmentation are presented as remedies to address these issues to a certain extent. △ Less

Submitted 14 November, 2018; v1 submitted 31 May, 2018; originally announced May 2018.

Comments: 30 pages. Complexity, 2018

MSC Class: 37M99

arXiv:1803.09318 [pdf, other]

doi 10.1137/18M1177263

Data-driven Discovery of Closure Models

Authors: Shaowu Pan, Karthik Duraisamy

Abstract: Derivation of reduced order representations of dynamical systems requires the modeling of the truncated dynamics on the retained dynamics. In its most general form, this so-called closure model has to account for memory effects. In this work, we present a framework of operator inference to extract the governing dynamics of closure from data in a compact, non-Markovian form. We employ sparse polyno… ▽ More Derivation of reduced order representations of dynamical systems requires the modeling of the truncated dynamics on the retained dynamics. In its most general form, this so-called closure model has to account for memory effects. In this work, we present a framework of operator inference to extract the governing dynamics of closure from data in a compact, non-Markovian form. We employ sparse polynomial regression and artificial neural networks to extract the underlying operator. For a special class of non-linear systems, observability of the closure in terms of the resolved dynamics is analyzed and theoretical results are presented on the compactness of the memory. The proposed framework is evaluated on examples consisting of linear to nonlinear systems with and without chaotic dynamics, with an emphasis on predictive performance on unseen data. △ Less

Submitted 10 September, 2018; v1 submitted 25 March, 2018; originally announced March 2018.

Comments: 33 pages

MSC Class: 70G60; 76F20

Journal ref: SIAM Journal on Applied Dynamical Systems, 17(4), 2381-2413

arXiv:1511.02258 [pdf, other]

Efficient Multiscale Gaussian Process Regression using Hierarchical Clustering

Authors: Z. Zhang, K. Duraisamy, N. A. Gumerov

Abstract: Standard Gaussian Process (GP) regression, a powerful machine learning tool, is computationally expensive when it is applied to large datasets, and potentially inaccurate when data points are sparsely distributed in a high-dimensional feature space. To address these challenges, a new multiscale, sparsified GP algorithm is formulated, with the goal of application to large scientific computing datas… ▽ More Standard Gaussian Process (GP) regression, a powerful machine learning tool, is computationally expensive when it is applied to large datasets, and potentially inaccurate when data points are sparsely distributed in a high-dimensional feature space. To address these challenges, a new multiscale, sparsified GP algorithm is formulated, with the goal of application to large scientific computing datasets. In this approach, the data is partitioned into clusters and the cluster centers are used to define a reduced training set, resulting in an improvement over standard GPs in terms of training and evaluation costs. Further, a hierarchical technique is used to adaptively map the local covariance representation to the underlying sparsity of the feature space, leading to improved prediction accuracy when the data distribution is highly non-uniform. A theoretical investigation of the computational complexity of the algorithm is presented. The efficacy of this method is then demonstrated on smooth and discontinuous analytical functions and on data from a direct numerical simulation of turbulent combustion. △ Less

Submitted 6 March, 2016; v1 submitted 6 November, 2015; originally announced November 2015.

Comments: 22 pages, 9 figures. Preprint. Submitted to Machine Learning Mar. 2016

Showing 1–8 of 8 results for author: Duraisamy, K