-
Bayesian averaging for ground state masses of atomic nuclei in a Machine Learning approach
Authors:
M. R. Mumpower,
M. Li,
T. M. Sprouse,
B. S. Meyer,
A. E. Lovell,
A. T. Mohan
Abstract:
We present global predictions of the ground state mass of atomic nuclei based on a novel Machine Learning (ML) algorithm. We combine precision nuclear experimental measurements together with theoretical predictions of unmeasured nuclei. This hybrid data set is used to train a probabilistic neural network. In addition to training on this data, a physics-based loss function is employed to help refine the solutions. The resultant Bayesian averaged predictions have excellent performance compared to the testing set and come with well-quantified uncertainties which are critical for contemporary scientific applications. We assess extrapolations of the model's predictions and estimate the growth of uncertainties in the region far from measurements.
Submitted 17 April, 2023;
originally announced April 2023.
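One common way to average probabilistic predictions, sketched here as an illustration only: treat each model's output as a Gaussian and combine them as a weighted mixture, so that the combined variance includes both the individual uncertainties and the spread between models. The abstract above does not spell out the averaging scheme, so the function name `average_predictions`, the Gaussian assumption, and the uniform default weights are all assumptions.

```python
import numpy as np

def average_predictions(means, sigmas, weights=None):
    """Combine 1-D arrays of Gaussian predictions into one mean and std.

    Hypothetical helper: assumes each model reports a mean and a standard
    deviation for the same quantity (for example, one nuclear mass).
    """
    means = np.asarray(means, dtype=float)
    sigmas = np.asarray(sigmas, dtype=float)
    if weights is None:
        # Assumption: equal weight for every model.
        weights = np.full(means.shape, 1.0 / means.size)
    weights = np.asarray(weights, dtype=float)
    weights = weights / weights.sum()

    mean = np.sum(weights * means)
    # Variance of a mixture: E[x^2] - E[x]^2, which adds the
    # between-model spread to the within-model variances.
    var = np.sum(weights * (sigmas**2 + means**2)) - mean**2
    return mean, np.sqrt(var)

# Two hypothetical models that disagree by 2 units.
mean, sigma = average_predictions([1.0, 3.0], [0.5, 0.5])
```

The combined standard deviation is larger than either input because the two models disagree, which is the qualitative behavior one would want far from measured data.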
-
Full trajectory optimizing operator inference for reduced-order modeling using differentiable programming
Authors:
Surya Chakrabarti,
Arvind T. Mohan,
Datta V. Gaitonde,
Daniel Livescu
Abstract:
Accurate and inexpensive Reduced Order Models (ROMs) for forecasting turbulent flows can facilitate rapid design iterations and thus prove critical for predictive control in engineering problems. Galerkin projection based Reduced Order Models (GP-ROMs), derived by projecting the Navier-Stokes equations on a truncated Proper Orthogonal Decomposition (POD) basis, are popular because of their low computational costs and theoretical foundations. However, the accuracy of traditional GP-ROMs degrades over long time prediction horizons. To address this issue, we extend the recently proposed Neural Galerkin Projection (NeuralGP) data driven framework to compressibility-dominated transonic flow, considering a prototypical problem of a buffeting NACA0012 airfoil governed by the full Navier-Stokes equations. The algorithm maintains the form of the ROM-ODE obtained from the Galerkin projection; however coefficients are learned directly from the data using gradient descent facilitated by differentiable programming. This blends the strengths of the physics driven GP-ROM and purely data driven neural network-based techniques, resulting in a computationally cheaper model that is easier to interpret. We show that the NeuralGP method minimizes a more rigorous full trajectory error norm compared to a linearized error definition optimized by the calibration procedure. We also find that while both procedures stabilize the ROM by displacing the eigenvalues of the linear dynamics matrix of the ROM-ODE to the complex left half-plane, the NeuralGP algorithm adds more dissipation to the trailing POD modes resulting in its better long-term performance. The results presented highlight the superior accuracy of the NeuralGP technique compared to the traditional calibrated GP-ROM method.
Submitted 25 January, 2023;
originally announced January 2023.
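The stability statement in the abstract above (eigenvalues of the ROM-ODE linear dynamics matrix moved to the left half-plane) can be checked numerically. The sketch below is not the calibration or NeuralGP procedure; it uses a hypothetical 2x2 matrix and a uniform damping shift purely to show what "displacing eigenvalues to the left half-plane" means.

```python
import numpy as np

def max_growth_rate(L):
    """Largest real part among the eigenvalues of a linear dynamics matrix.

    A negative value means every linear mode decays.
    """
    return float(np.max(np.linalg.eigvals(L).real))

# Hypothetical linear operator with a weakly unstable oscillatory pair
# (eigenvalues 0.1 +/- 2i).
L_unstable = np.array([[0.1, -2.0],
                       [2.0,  0.1]])

# Assumption for illustration: uniform damping -alpha * I, which shifts
# every eigenvalue left by alpha without changing its imaginary part.
alpha = 0.3
L_damped = L_unstable - alpha * np.eye(2)

rate_before = max_growth_rate(L_unstable)
rate_after = max_growth_rate(L_damped)
```

A learned or calibrated model would generally add different amounts of dissipation to different modes; the uniform shift here is only the simplest case.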
-
Physics-Constrained Generative Adversarial Networks for 3D Turbulence
Authors:
Dima Tretiak,
Arvind T. Mohan,
Daniel Livescu
Abstract:
Generative Adversarial Networks (GANs) have received wide acclaim among the machine learning (ML) community for their ability to generate realistic 2D images. ML is being applied more often to complex problems beyond those of computer vision. However, current frameworks often serve as black boxes and lack physics embeddings, leading to poor ability in enforcing constraints and unreliable models. In this work, we develop physics embeddings that can be stringently imposed, referred to as hard constraints, in the neural network architecture. We demonstrate their capability for 3D turbulence by embedding them in GANs, particularly to enforce the mass conservation constraint in incompressible fluid turbulence. In doing so, we also explore and contrast the effects of other methods of imposing physics constraints within the GANs framework, especially penalty-based physics constraints popular in literature. By using physics-informed diagnostics and statistics, we evaluate the strengths and weaknesses of our approach and demonstrate its feasibility.
Submitted 30 November, 2022;
originally announced December 2022.
-
Physically Interpretable Machine Learning for nuclear masses
Authors:
M. R. Mumpower,
T. M. Sprouse,
A. E. Lovell,
A. T. Mohan
Abstract:
We present a novel approach to modeling the ground state mass of atomic nuclei based directly on a probabilistic neural network constrained by relevant physics. Our Physically Interpretable Machine Learning (PIML) approach incorporates knowledge of physics by using a physically motivated feature space in addition to a soft physics constraint that is implemented as a penalty to the loss function. We train our PIML model on a random set of $\sim$20\% of the Atomic Mass Evaluation (AME) and predict the remaining $\sim$80\%. The success of our methodology is exhibited by the unprecedented $\sigma_\textrm{RMS}\sim186$ keV match to data for the training set and $\sigma_\textrm{RMS}\sim316$ keV for the entire AME with $Z \geq 20$. We show that our general methodology can be interpreted using feature importance.
Submitted 20 March, 2022;
originally announced March 2022.
-
Nuclear masses learned from a probabilistic neural network
Authors:
A. E. Lovell,
A. T. Mohan,
T. M. Sprouse,
M. R. Mumpower
Abstract:
Machine learning methods and uncertainty quantification have been gaining interest throughout the last several years in low-energy nuclear physics. In particular, Gaussian processes and Bayesian Neural Networks have increasingly been applied to improve mass model predictions while providing well-quantified uncertainties. In this work, we use the probabilistic Mixture Density Network (MDN) to directly predict the mass excess of the 2016 Atomic Mass Evaluation within the range of measured data, and we extrapolate the inferred models beyond available experimental data. The MDN not only provides mean values but also full posterior distributions both within the training set and extrapolated testing set. We show that the addition of physical information to the feature space increases the accuracy of the match to the training data as well as provides for more physically meaningful extrapolations beyond the limits of experimental data.
Submitted 3 January, 2022;
originally announced January 2022.
-
Validation and parameterization of a novel physics-constrained neural dynamics model applied to turbulent fluid flow
Authors:
Varun Shankar,
Gavin D. Portwood,
Arvind T. Mohan,
Peetak P. Mitra,
Dilip Krishnamurthy,
Christopher Rackauckas,
Lucas A. Wilson,
David P. Schmidt,
Venkatasubramanian Viswanathan
Abstract:
In fluid physics, data-driven models to enhance or accelerate solution methods are becoming increasingly popular for many application domains, such as alternatives to turbulence closures, system surrogates, or for new physics discovery. In the context of reduced order models of high-dimensional time-dependent fluid systems, machine learning methods grant the benefit of automated learning from data, but the burden of a model lies on its reduced-order representation of both the fluid state and physical dynamics. In this work, we build a physics-constrained, data-driven reduced order model for the Navier-Stokes equations to approximate spatio-temporal turbulent fluid dynamics. The model design choices mimic numerical and physical constraints by, for example, implicitly enforcing the incompressibility constraint and utilizing continuous Neural Ordinary Differential Equations for tracking the evolution of the differential equation. We demonstrate this technique on three-dimensional, moderate Reynolds number turbulent fluid flow. In assessing the statistical quality and characteristics of the machine-learned model through rigorous diagnostic tests, we find that our model is capable of reconstructing the dynamics of the flow over large integral timescales, favoring accuracy at the larger length scales. More significantly, comprehensive diagnostics suggest that physically-interpretable model parameters, corresponding to the representations of the fluid state and dynamics, have attributable and quantifiable impact on the quality of the model predictions and computational complexity.
Submitted 21 October, 2021;
originally announced October 2021.
-
Learning Stable Galerkin Models of Turbulence with Differentiable Programming
Authors:
Arvind T. Mohan,
Kaushik Nagarajan,
Daniel Livescu
Abstract:
Turbulent flow control has numerous applications, and building reduced-order models (ROMs) of the flow and the associated feedback control laws is extremely challenging. Despite the complexity of building data-driven ROMs for turbulence, the superior representational capacity of deep neural networks has demonstrated considerable success in learning ROMs. Nevertheless, these strategies are typically devoid of physical foundations and often lack interpretability. Conversely, the Proper Orthogonal Decomposition (POD) based Galerkin projection (GP) approach for ROM has been popular in many problems owing to its theoretically consistent and explainable physical foundations. However, a key limitation is that the ordinary differential equations (ODEs) arising from GP ROMs are highly susceptible to instabilities due to truncation of POD modes and lead to deterioration in temporal predictions. In this work, we propose a differentiable programming approach that blends the strengths of both these strategies, by embedding neural networks explicitly into the GP ODE structure, termed Neural Galerkin projection. We demonstrate this approach on the isentropic Navier-Stokes equations for compressible flow over a cavity at a moderate Mach number. When provided the structure of the projected equations, we show that the Neural Galerkin approach implicitly learns stable ODE coefficients from POD coefficients and demonstrates significantly longer and more accurate time horizon predictions, when compared to the classical POD-GP assisted by calibration. We observe that the key benefits of this differentiable programming-based approach include increased flexibility in physics-based learning, very low computational costs, and a significant increase in interpretability, when compared to purely data-driven neural networks.
Submitted 15 July, 2021;
originally announced July 2021.
-
Quantifying Uncertainties on Fission Fragment Mass Yields With Mixture Density Networks
Authors:
A. E. Lovell,
A. T. Mohan,
P. Talou
Abstract:
Probabilistic machine learning techniques can learn both complex relations between input features and output quantities of interest as well as take into account stochasticity or uncertainty within a data set. In this initial work, we explore the use of one such probabilistic network, the Mixture Density Network (MDN), to reproduce fission yields and their uncertainties. We study mass yields for the spontaneous fission of $^{252}$Cf, exploring the number of training samples needed for converged predictions, how different levels of uncertainty propagate from the training set to the MDN predictions, and how well physical constraints of the yields - such as normalization and symmetry - are upheld by the algorithm. Finally, we test the ability of the MDN to interpolate between and extrapolate beyond samples in the training set using energy-dependent mass yields for the neutron-induced fission on $^{235}$U. The MDN provides a reliable way to include and predict uncertainties and is a promising path forward for supplementing sparse sets of nuclear data.
Submitted 6 May, 2020;
originally announced May 2020.
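A Mixture Density Network outputs the parameters of a mixture distribution rather than a single value, and is typically trained by minimizing the negative log-likelihood of the data under that mixture. The sketch below shows that loss for Gaussian components in plain NumPy. The function name `gaussian_mixture_nll` and the choice of Gaussian components are assumptions for illustration; no network is included, and the arrays stand in for what a network's output layer would produce.

```python
import numpy as np

def gaussian_mixture_nll(y, weights, means, sigmas):
    """Mean negative log-likelihood of targets y under a Gaussian mixture.

    y:       shape (n,)
    weights, means, sigmas: shape (k,) or (n, k); weights sum to 1 over k.
    """
    y = np.asarray(y, dtype=float)[:, None]          # (n, 1) for broadcasting
    weights = np.asarray(weights, dtype=float)
    means = np.asarray(means, dtype=float)
    sigmas = np.asarray(sigmas, dtype=float)

    # Log of weight * Normal(y | mean, sigma) for every component.
    log_comp = (np.log(weights)
                - 0.5 * np.log(2.0 * np.pi * sigmas**2)
                - 0.5 * ((y - means) / sigmas) ** 2)  # (n, k)

    # Log-sum-exp over components, for numerical stability.
    m = log_comp.max(axis=1, keepdims=True)
    log_like = m[:, 0] + np.log(np.exp(log_comp - m).sum(axis=1))
    return float(-log_like.mean())

# One standard-normal component evaluated at its mean.
nll_single = gaussian_mixture_nll([0.0], [1.0], [0.0], [1.0])
```

Because the predicted `sigmas` enter the loss directly, a model trained this way is penalized both for overconfident and for needlessly wide uncertainty estimates.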
-
Embedding Hard Physical Constraints in Neural Network Coarse-Graining of 3D Turbulence
Authors:
Arvind T. Mohan,
Nicholas Lubbers,
Daniel Livescu,
Michael Chertkov
Abstract:
In recent years, deep learning approaches have shown much promise in modeling complex systems in the physical sciences. A major challenge in deep learning of PDEs is enforcing physical constraints and boundary conditions. In this work, we propose a general framework to directly embed the notion of an incompressible fluid into Convolutional Neural Networks, and apply this to coarse-graining of turbulent flow. These physics-embedded neural networks leverage interpretable strategies from numerical methods and computational fluid dynamics to enforce physical laws and boundary conditions by taking advantage of the mathematical properties of the underlying equations. We demonstrate results on three-dimensional fully-developed turbulence, showing that this technique drastically improves local conservation of mass, without sacrificing performance according to several other metrics characterizing the fluid flow.
Submitted 15 February, 2020; v1 submitted 31 January, 2020;
originally announced February 2020.
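One standard way to make a velocity field incompressible by construction, shown here as an assumption rather than as the paper's stated mechanism, is to have the model produce a vector potential and take its curl, since the divergence of a curl is identically zero. The sketch below verifies this on a periodic grid with central differences; the helper names `ddx`, `curl`, and `divergence` are hypothetical, and a random array stands in for a network output.

```python
import numpy as np

def ddx(f, axis, h=1.0):
    """Second-order central difference along one axis of a periodic grid."""
    return (np.roll(f, -1, axis=axis) - np.roll(f, 1, axis=axis)) / (2.0 * h)

def curl(A, h=1.0):
    """Curl of a vector field A with shape (3, nx, ny, nz)."""
    Ax, Ay, Az = A
    return np.stack([
        ddx(Az, 1, h) - ddx(Ay, 2, h),   # x component
        ddx(Ax, 2, h) - ddx(Az, 0, h),   # y component
        ddx(Ay, 0, h) - ddx(Ax, 1, h),   # z component
    ])

def divergence(u, h=1.0):
    """Divergence of a vector field u with shape (3, nx, ny, nz)."""
    return ddx(u[0], 0, h) + ddx(u[1], 1, h) + ddx(u[2], 2, h)

# A random potential stands in for the unconstrained model output.
rng = np.random.default_rng(0)
A = rng.standard_normal((3, 8, 8, 8))
u = curl(A)
max_div = float(np.abs(divergence(u)).max())
```

Because the same difference operator is used in both `curl` and `divergence`, the discrete divergence vanishes to rounding error for any potential, which is what distinguishes a hard constraint from a penalty term that only discourages violations.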
-
A Deep Learning based Approach to Reduced Order Modeling for Turbulent Flow Control using LSTM Neural Networks
Authors:
Arvind T. Mohan,
Datta V. Gaitonde
Abstract:
Reduced Order Modeling (ROM) for engineering applications has been a major research focus in the past few decades due to the unprecedented physical insight into turbulence offered by high-fidelity CFD. The primary goal of a ROM is to model the key physics/features of a flow-field without computing the full Navier-Stokes (NS) equations. This is accomplished by projecting the high-dimensional dynamics to a low-dimensional subspace, typically utilizing dimensionality reduction techniques like Proper Orthogonal Decomposition (POD), coupled with Galerkin projection. In this work, we demonstrate a deep learning based approach to build a ROM using the POD basis of canonical DNS datasets, for turbulent flow control applications. We find that a type of Recurrent Neural Network, the Long Short Term Memory (LSTM), which has been primarily utilized for problems like speech modeling and language translation, shows attractive potential in modeling temporal dynamics of turbulence. Additionally, we introduce the Hurst Exponent as a tool to study LSTM behavior for non-stationary data, and uncover useful characteristics that may aid ROM development for a variety of applications.
Submitted 24 April, 2018;
originally announced April 2018.
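The projection step described in the abstract above, reducing high-dimensional snapshots to a few POD coefficients, is commonly computed with a singular value decomposition. The sketch below is a minimal version under stated assumptions: the function name `pod`, the mean subtraction, and the synthetic two-mode data are illustrative choices, not details taken from the paper.

```python
import numpy as np

def pod(snapshots, r):
    """Proper Orthogonal Decomposition of a snapshot matrix.

    snapshots: shape (n_space, n_time), one column per time step.
    Returns the temporal mean, the r leading spatial modes, and the
    time-dependent coefficients of those modes.
    """
    mean = snapshots.mean(axis=1, keepdims=True)
    fluct = snapshots - mean
    # Left singular vectors are the POD modes, ordered by energy.
    U, s, Vt = np.linalg.svd(fluct, full_matrices=False)
    modes = U[:, :r]
    coeffs = modes.T @ fluct          # (r, n_time)
    return mean, modes, coeffs

# Synthetic data built from two space-time separable structures.
x = np.linspace(0.0, 2.0 * np.pi, 64, endpoint=False)[:, None]
t = np.linspace(0.0, 10.0, 200)[None, :]
data = np.sin(x) * np.cos(t) + 0.5 * np.sin(2.0 * x) * np.sin(3.0 * t)

mean, modes, coeffs = pod(data, r=2)
recon = mean + modes @ coeffs
rel_err = np.linalg.norm(data - recon) / np.linalg.norm(data)
```

The rows of `coeffs` are the low-dimensional time series that a temporal model would then be trained to forecast; for real turbulence data many more than two modes are usually needed and the truncation error is not negligible.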