
Bayesian averaging for ground state masses of atomic nuclei in a Machine Learning approach

Authors: M. R. Mumpower, M. Li, T. M. Sprouse, B. S. Meyer, A. E. Lovell, A. T. Mohan

Abstract: We present global predictions of the ground state mass of atomic nuclei based on a novel Machine Learning (ML) algorithm. We combine precision nuclear experimental measurements together with theoretical predictions of unmeasured nuclei. This hybrid data set is used to train a probabilistic neural network. In addition to training on this data, a physics-based loss function is employed to help refine the solutions. The resultant Bayesian averaged predictions have excellent performance compared to the testing set and come with well-quantified uncertainties which are critical for contemporary scientific applications. We assess extrapolations of the model's predictions and estimate the growth of uncertainties in the region far from measurements.

Submitted 17 April, 2023; originally announced April 2023.

Comments: 15 pages, 10 figures, comments welcome

Report number: LA-UR-23-23224

arXiv:2301.10804

Full trajectory optimizing operator inference for reduced-order modeling using differentiable programming

Authors: Surya Chakrabarti, Arvind T. Mohan, Datta V. Gaitonde, Daniel Livescu

Abstract: Accurate and inexpensive Reduced Order Models (ROMs) for forecasting turbulent flows can facilitate rapid design iterations and thus prove critical for predictive control in engineering problems. Galerkin projection based Reduced Order Models (GP-ROMs), derived by projecting the Navier-Stokes equations on a truncated Proper Orthogonal Decomposition (POD) basis, are popular because of their low computational costs and theoretical foundations. However, the accuracy of traditional GP-ROMs degrades over long time prediction horizons. To address this issue, we extend the recently proposed Neural Galerkin Projection (NeuralGP) data driven framework to compressibility-dominated transonic flow, considering a prototypical problem of a buffeting NACA0012 airfoil governed by the full Navier-Stokes equations. The algorithm maintains the form of the ROM-ODE obtained from the Galerkin projection; however coefficients are learned directly from the data using gradient descent facilitated by differentiable programming. This blends the strengths of the physics driven GP-ROM and purely data driven neural network-based techniques, resulting in a computationally cheaper model that is easier to interpret. We show that the NeuralGP method minimizes a more rigorous full trajectory error norm compared to a linearized error definition optimized by the calibration procedure. We also find that while both procedures stabilize the ROM by displacing the eigenvalues of the linear dynamics matrix of the ROM-ODE to the complex left half-plane, the NeuralGP algorithm adds more dissipation to the trailing POD modes resulting in its better long-term performance. The results presented highlight the superior accuracy of the NeuralGP technique compared to the traditional calibrated GP-ROM method.

Submitted 25 January, 2023; originally announced January 2023.

Report number: LA-UR-23-20434

arXiv:2212.00217

Physics-Constrained Generative Adversarial Networks for 3D Turbulence

Authors: Dima Tretiak, Arvind T. Mohan, Daniel Livescu

Abstract: Generative Adversarial Networks (GANs) have received wide acclaim among the machine learning (ML) community for their ability to generate realistic 2D images. ML is being applied more often to complex problems beyond those of computer vision. However, current frameworks often serve as black boxes and lack physics embeddings, leading to poor ability in enforcing constraints and unreliable models. In this work, we develop physics embeddings that can be stringently imposed, referred to as hard constraints, in the neural network architecture. We demonstrate their capability for 3D turbulence by embedding them in GANs, particularly to enforce the mass conservation constraint in incompressible fluid turbulence. In doing so, we also explore and contrast the effects of other methods of imposing physics constraints within the GANs framework, especially penalty-based physics constraints popular in literature. By using physics-informed diagnostics and statistics, we evaluate the strengths and weaknesses of our approach and demonstrate its feasibility.

Submitted 30 November, 2022; originally announced December 2022.

Report number: LA-UR-22-32475

arXiv:2203.10594

doi 10.1103/PhysRevC.106.L021301

Physically Interpretable Machine Learning for nuclear masses

Authors: M. R. Mumpower, T. M. Sprouse, A. E. Lovell, A. T. Mohan

Abstract: We present a novel approach to modeling the ground state mass of atomic nuclei based directly on a probabilistic neural network constrained by relevant physics. Our Physically Interpretable Machine Learning (PIML) approach incorporates knowledge of physics by using a physically motivated feature space in addition to a soft physics constraint that is implemented as a penalty to the loss function. We train our PIML model on a random set of $\sim$20\% of the Atomic Mass Evaluation (AME) and predict the remaining $\sim$80\%. The success of our methodology is exhibited by the unprecedented $\sigma_\textrm{RMS}\sim186$ keV match to data for the training set and $\sigma_\textrm{RMS}\sim316$ keV for the entire AME with $Z \geq 20$. We show that our general methodology can be interpreted using feature importance.

Submitted 20 March, 2022; originally announced March 2022.

Comments: 5 pages, 3 figures, comments welcome

Report number: LA-UR-22-21855

arXiv:2201.00676

doi 10.1103/PhysRevC.106.014305

Nuclear masses learned from a probabilistic neural network

Authors: A. E. Lovell, A. T. Mohan, T. M. Sprouse, M. R. Mumpower

Abstract: Machine learning methods and uncertainty quantification have been gaining interest throughout the last several years in low-energy nuclear physics. In particular, Gaussian processes and Bayesian Neural Networks have increasingly been applied to improve mass model predictions while providing well-quantified uncertainties. In this work, we use the probabilistic Mixture Density Network (MDN) to directly predict the mass excess of the 2016 Atomic Mass Evaluation within the range of measured data, and we extrapolate the inferred models beyond available experimental data. The MDN not only provides mean values but also full posterior distributions both within the training set and extrapolated testing set. We show that the addition of physical information to the feature space increases the accuracy of the match to the training data as well as provides for more physically meaningful extrapolations beyond the limits of experimental data.

Submitted 3 January, 2022; originally announced January 2022.

Comments: 10 pages, 3 figures, under review with Phys. Rev. C

Report number: LA-UR-21-27783

arXiv:2110.11528

doi 10.1063/5.0122115

Validation and parameterization of a novel physics-constrained neural dynamics model applied to turbulent fluid flow

Authors: Varun Shankar, Gavin D. Portwood, Arvind T. Mohan, Peetak P. Mitra, Dilip Krishnamurthy, Christopher Rackauckas, Lucas A. Wilson, David P. Schmidt, Venkatasubramanian Viswanathan

Abstract: In fluid physics, data-driven models to enhance or accelerate solution methods are becoming increasingly popular for many application domains, such as alternatives to turbulence closures, system surrogates, or for new physics discovery. In the context of reduced order models of high-dimensional time-dependent fluid systems, machine learning methods grant the benefit of automated learning from data, but the burden of a model lies on its reduced-order representation of both the fluid state and physical dynamics. In this work, we build a physics-constrained, data-driven reduced order model for the Navier-Stokes equations to approximate spatio-temporal turbulent fluid dynamics. The model design choices mimic numerical and physical constraints by, for example, implicitly enforcing the incompressibility constraint and utilizing continuous Neural Ordinary Differential Equations for tracking the evolution of the differential equation. We demonstrate this technique on three-dimensional, moderate Reynolds number turbulent fluid flow. In assessing the statistical quality and characteristics of the machine-learned model through rigorous diagnostic tests, we find that our model is capable of reconstructing the dynamics of the flow over large integral timescales, favoring accuracy at the larger length scales. More significantly, comprehensive diagnostics suggest that physically-interpretable model parameters, corresponding to the representations of the fluid state and dynamics, have attributable and quantifiable impact on the quality of the model predictions and computational complexity.

Submitted 21 October, 2021; originally announced October 2021.

Comments: Submitted to Physical Review Fluids

arXiv:2107.07559

Learning Stable Galerkin Models of Turbulence with Differentiable Programming

Authors: Arvind T. Mohan, Kaushik Nagarajan, Daniel Livescu

Abstract: Turbulent flow control has numerous applications and building reduced-order models (ROMs) of the flow and the associated feedback control laws is extremely challenging. Despite the complexity of building data-driven ROMs for turbulence, the superior representational capacity of deep neural networks has demonstrated considerable success in learning ROMs. Nevertheless, these strategies are typically devoid of physical foundations and often lack interpretability. Conversely, the Proper Orthogonal Decomposition (POD) based Galerkin projection (GP) approach for ROM has been popular in many problems owing to its theoretically consistent and explainable physical foundations. However, a key limitation is that the ordinary differential equations (ODEs) arising from GP ROMs are highly susceptible to instabilities due to truncation of POD modes and lead to deterioration in temporal predictions. In this work, we propose a \textit{differentiable programming} approach that blends the strengths of both these strategies, by embedding neural networks explicitly into the GP ODE structure, termed Neural Galerkin projection. We demonstrate this approach on the isentropic Navier-Stokes equations for compressible flow over a cavity at a moderate Mach number. When provided the structure of the projected equations, we show that the Neural Galerkin approach implicitly learns stable ODE coefficients from POD coefficients and demonstrates significantly longer and accurate time horizon predictions, when compared to the classical POD-GP assisted by calibration. We observe that the key benefits of this differentiable programming-based approach include increased flexibility in physics-based learning, very low computational costs, and a significant increase in interpretability, when compared to purely data-driven neural networks.

Submitted 15 July, 2021; originally announced July 2021.

Comments: 18 pages

Report number: Los Alamos National Laboratory Unlimited Release. Document Number: LA-UR-21-26236

arXiv:2005.03198

doi 10.1088/1361-6471/ab9f58

Quantifying Uncertainties on Fission Fragment Mass Yields With Mixture Density Networks

Authors: A. E. Lovell, A. T. Mohan, P. Talou

Abstract: Probabilistic machine learning techniques can learn both complex relations between input features and output quantities of interest as well as take into account stochasticity or uncertainty within a data set. In this initial work, we explore the use of one such probabilistic network, the Mixture Density Network (MDN), to reproduce fission yields and their uncertainties. We study mass yields for the spontaneous fission of $^{252}$Cf, exploring the number of training samples needed for converged predictions, how different levels of uncertainty propagate from the training set to the MDN predictions, and how well physical constraints of the yields - such as normalization and symmetry - are upheld by the algorithm. Finally, we test the ability of the MDN to interpolate between and extrapolate beyond samples in the training set using energy-dependent mass yields for the neutron-induced fission on $^{235}$U. The MDN provides a reliable way to include and predict uncertainties and is a promising path forward for supplementing sparse sets of nuclear data.

Submitted 6 May, 2020; originally announced May 2020.

Comments: 20 pages, 8 figures, submitted to J. Phys. G

Report number: LA-UR-20-22632

arXiv:2002.00021

Embedding Hard Physical Constraints in Neural Network Coarse-Graining of 3D Turbulence

Authors: Arvind T. Mohan, Nicholas Lubbers, Daniel Livescu, Michael Chertkov

Abstract: In recent years, deep learning approaches have shown much promise in modeling complex systems in the physical sciences. A major challenge in deep learning of PDEs is enforcing physical constraints and boundary conditions. In this work, we propose a general framework to directly embed the notion of an incompressible fluid into Convolutional Neural Networks, and apply this to coarse-graining of turbulent flow. These physics-embedded neural networks leverage interpretable strategies from numerical methods and computational fluid dynamics to enforce physical laws and boundary conditions by taking advantage of the mathematical properties of the underlying equations. We demonstrate results on three-dimensional fully-developed turbulence, showing that this technique drastically improves local conservation of mass, without sacrificing performance according to several other metrics characterizing the fluid flow.

Submitted 15 February, 2020; v1 submitted 31 January, 2020; originally announced February 2020.

Comments: v2: modified illustration in Eqn. 8 for clarity

Report number: LA-UR-19-31836 (Los Alamos National Laboratory)

arXiv:1804.09269

A Deep Learning based Approach to Reduced Order Modeling for Turbulent Flow Control using LSTM Neural Networks

Authors: Arvind T. Mohan, Datta V. Gaitonde

Abstract: Reduced Order Modeling (ROM) for engineering applications has been a major research focus in the past few decades due to the unprecedented physical insight into turbulence offered by high-fidelity CFD. The primary goal of a ROM is to model the key physics/features of a flow-field without computing the full Navier-Stokes (NS) equations. This is accomplished by projecting the high-dimensional dynamics to a low-dimensional subspace, typically utilizing dimensionality reduction techniques like Proper Orthogonal Decomposition (POD), coupled with Galerkin projection. In this work, we demonstrate a deep learning based approach to build a ROM using the POD basis of canonical DNS datasets, for turbulent flow control applications. We find that a type of Recurrent Neural Network, the Long Short Term Memory (LSTM) which has been primarily utilized for problems like speech modeling and language translation, shows attractive potential in modeling temporal dynamics of turbulence. Additionally, we introduce the Hurst Exponent as a tool to study LSTM behavior for non-stationary data, and uncover useful characteristics that may aid ROM development for a variety of applications.

Submitted 24 April, 2018; originally announced April 2018.

Showing 1–10 of 10 results for author: Mohan, A T