Search | arXiv e-print repository

arXiv:2407.00761 [pdf, other]

Improving the performance of Stein variational inference through extreme sparsification of physically-constrained neural network models

Authors: Govinda Anantha Padmanabha, Jan Niklas Fuhg, Cosmin Safta, Reese E. Jones, Nikolaos Bouklas

Abstract: Most scientific machine learning (SciML) applications of neural networks involve hundreds to thousands of parameters, and hence, uncertainty quantification for such models is plagued by the curse of dimensionality. Using physical applications, we show that $L_0$ sparsification prior to Stein variational gradient descent ($L_0$+SVGD) is a more robust and efficient means of uncertainty quantification, in terms of computational cost and performance, than the direct application of SVGD or projected SVGD methods. Specifically, $L_0$+SVGD demonstrates superior resilience to noise, the ability to perform well in extrapolated regions, and a faster convergence rate to an optimal solution.

Submitted 30 June, 2024; originally announced July 2024.

Comments: 30 pages, 11 figures

arXiv:2405.19082 [pdf, other]

Multiscale simulation of spatially correlated microstructure via a latent space representation

Authors: Reese E. Jones, Craig M. Hamel, Dan Bolintineanu, Kyle Johnson, Robert Buarque de Macedo, Jan Fuhg, Nikolaos Bouklas, Sharlotte Kramer

Abstract: When deformation gradients act on the scale of the microstructure of a part due to geometry and loading, spatial correlations and finite-size effects in simulation cells cannot be neglected. We propose a multiscale method that accounts for these effects using a variational autoencoder to encode the structure-property map of the stochastic volume elements making up the statistical description of the part. In this paradigm the autoencoder can be used to directly encode the microstructure or, alternatively, its latent space can be sampled to provide likely realizations. We demonstrate the method on three examples using the common additively manufactured material AlSi10Mg in: (a) a comparison with direct numerical simulation of the part microstructure, (b) a push forward of microstructural uncertainty to performance quantities of interest, and (c) a simulation of functional gradation of a part with stochastic microstructure.

Submitted 29 May, 2024; originally announced May 2024.

Comments: 23 pages, 25 figures

arXiv:2405.03658 [pdf, other]

A review on data-driven constitutive laws for solids

Authors: Jan Niklas Fuhg, Govinda Anantha Padmanabha, Nikolaos Bouklas, Bahador Bahmani, WaiChing Sun, Nikolaos N. Vlassis, Moritz Flaschel, Pietro Carrara, Laura De Lorenzis

Abstract: This review article highlights state-of-the-art data-driven techniques to discover, encode, surrogate, or emulate constitutive laws that describe the path-independent and path-dependent response of solids. Our objective is to provide an organized taxonomy to a large spectrum of methodologies developed in the past decades and to discuss the benefits and drawbacks of the various techniques for interpreting and forecasting mechanics behavior across different scales. Distinguishing between machine-learning-based and model-free methods, we further categorize approaches based on their interpretability and on their learning process/type of required data, while discussing the key problems of generalization and trustworthiness. We attempt to provide a road map of how these can be reconciled in a data-availability-aware context. We also touch upon relevant aspects such as data sampling techniques, design of experiments, verification, and validation.

Submitted 6 May, 2024; originally announced May 2024.

Comments: 57 pages, 7 Figures

MSC Class: 74-02 (Primary)

arXiv:2404.15562 [pdf, other]

Polyconvex neural network models of thermoelasticity

Authors: Jan N. Fuhg, Asghar Jadoon, Oliver Weeger, D. Thomas Seidl, Reese E. Jones

Abstract: Machine-learning function representations such as neural networks have proven to be excellent constructs for constitutive modeling due to their flexibility to represent highly nonlinear data and their ability to incorporate constitutive constraints, which also allows them to generalize well to unseen data. In this work, we extend a polyconvex hyperelastic neural network framework to thermo-hyperelasticity by specifying the thermodynamic and material theoretic requirements for an expansion of the Helmholtz free energy expressed in terms of deformation invariants and temperature. Different formulations which a priori ensure polyconvexity with respect to deformation and concavity with respect to temperature are proposed and discussed. The physics-augmented neural networks are furthermore calibrated with a recently proposed sparsification algorithm that not only aims to fit the training data but also penalizes the number of active parameters, which prevents overfitting in the low data regime and promotes generalization. The performance of the proposed framework is demonstrated on synthetic data, which illustrate the expected thermomechanical phenomena, and existing temperature-dependent uniaxial tension and tension-torsion experimental datasets.

Submitted 23 April, 2024; originally announced April 2024.

Comments: 18 pages, 9 figures

arXiv:2404.03863 [pdf, other]

doi 10.1016/j.mtla.2024.102151

Establishing the relationship between generalized crystallographic texture and macroscopic yield surfaces using partial input convex neural networks

Authors: Lloyd van Wees, Karthik Shankar, Jan N. Fuhg, Nikolaos Bouklas, Paul Shade, Mark Obstalecki, Matthew Kasemer

Abstract: In this study, we present a methodology to predict the macroscopic yield surface of metals and metallic alloys with general crystallographic textures. In previous work, we have established the use of partially input convex neural networks (pICNN) as macroscopic yield functions of crystal plasticity simulations. However, this work was performed with an over-abundance of data, and on limited crystallographic textures. Here, we extend this study to approach more realistic material states (i.e., complex crystallographic textures), and consider data-availability as a major driver for our approach. We present our modified framework capable of handling generalized material states and demonstrate its effectiveness on samples with multi-modal textures deformed under plane stress conditions. We further describe an adaptive algorithm for the generation of training data as informed by the shape of yield surfaces to reduce the time for both the generation of training data as well as pICNN training. Finally, we will discuss errors in both training and test datasets, limitations, and future extensibility.

Submitted 27 June, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

Comments: 29 pages, 16 figures

arXiv:2310.03652 [pdf, other]

Extreme sparsification of physics-augmented neural networks for interpretable model discovery in mechanics

Authors: Jan N. Fuhg, Reese E. Jones, Nikolaos Bouklas

Abstract: Data-driven constitutive modeling with neural networks has received increased interest in recent years due to its ability to easily incorporate physical and mechanistic constraints and to overcome the challenging and time-consuming task of formulating phenomenological constitutive laws that can accurately capture the observed material response. However, even though neural network-based constitutive laws have been shown to generalize proficiently, the generated representations are not easily interpretable due to their high number of trainable parameters. Sparse regression approaches exist that allow interpretable expressions to be obtained, but the user is tasked with creating a library of model forms, which by construction limits their expressiveness to the functional forms provided in the libraries. In this work, we propose to train regularized physics-augmented neural network-based constitutive models utilizing a smoothed version of $L^{0}$-regularization. This aims to maintain the trustworthiness inherited from the physical constraints, but also enables interpretability, which has not been possible thus far on any type of machine learning-based constitutive model where model forms were not assumed a priori but were actually discovered. During the training process, the network simultaneously fits the training data and penalizes the number of active parameters, while also ensuring constitutive constraints such as thermodynamic consistency. We show that the method can reliably obtain interpretable and trustworthy constitutive models for compressible and incompressible hyperelasticity, yield functions, and hardening models for elastoplasticity, for synthetic and experimental data.

Submitted 5 October, 2023; originally announced October 2023.

Comments: 34 pages, 19 Figures

MSC Class: 74B20 (Primary); 74C05 (Secondary)

arXiv:2308.11080 [pdf, other]

Stress representations for tensor basis neural networks: alternative formulations to Finger-Rivlin-Ericksen

Authors: Jan N. Fuhg, Nikolaos Bouklas, Reese E. Jones

Abstract: Data-driven constitutive modeling frameworks based on neural networks and classical representation theorems have recently gained considerable attention due to their ability to easily incorporate constitutive constraints and their excellent generalization performance. In these models, the stress prediction follows from a linear combination of invariant-dependent coefficient functions and known tensor basis generators. However, thus far the formulations have been limited to stress representations based on the classical Rivlin and Ericksen form, while the performance of alternative representations has yet to be investigated. In this work, we survey a variety of tensor basis neural network models for modeling hyperelastic materials in a finite deformation context, including a number of so far unexplored formulations which use theoretically equivalent invariants and generators to Finger-Rivlin-Ericksen. Furthermore, we compare potential-based and coefficient-based approaches, as well as different calibration techniques. Nine variants are tested against both noisy and noiseless datasets for three different materials. Theoretical and practical insights into the performance of each formulation are given.

Submitted 21 August, 2023; originally announced August 2023.

Comments: 32 pages, 20 figures, 4 appendices

arXiv:2307.04301 [pdf, other]

NN-EVP: A physics informed neural network-based elasto-viscoplastic framework for predictions of grain size-aware flow response under large deformations

Authors: Adnan Eghtesad, Jan Niklas Fuhg, Nikolaos Bouklas

Abstract: We propose a physics informed, neural network-based elasto-viscoplasticity (NN-EVP) constitutive modeling framework for predicting the flow response in metals as a function of underlying grain size. The developed NN-EVP algorithm is based on input convex neural networks as a means to strictly enforce thermodynamic consistency, while allowing high expressivity towards model discovery from limited data. It utilizes state-of-the-art machine learning tools within PyTorch's high-performance library providing a flexible tool for data-driven, automated constitutive modeling. To test the performance of the framework, we generate synthetic stress-strain curves using a power law-based model with phenomenological hardening at small strains and test the trained model for strain amplitudes beyond the training data. Next, experimentally measured flow responses obtained from uniaxial deformations are used to train the framework under large plastic deformations. Ultimately, the Hall-Petch relationship corresponding to grain size strengthening is discovered by training flow response as a function of grain size, also leading to efficient extrapolation. The present work demonstrates a successful integration of neural networks into elasto-viscoplastic constitutive laws, providing a robust automated framework for constitutive model discovery that can efficiently generalize, while also providing insights into predictions of flow response and grain size-property relationships in metals and metallic alloys under large plastic deformations.

Submitted 9 July, 2023; originally announced July 2023.

arXiv:2304.13897 [pdf, other]

Physics-informed Data-driven Discovery of Constitutive Models with Application to Strain-Rate-sensitive Soft Materials

Authors: Kshitiz Upadhyay, Jan N. Fuhg, Nikolaos Bouklas, K. T. Ramesh

Abstract: A novel data-driven constitutive modeling approach is proposed, which combines the physics-informed nature of modeling based on continuum thermodynamics with the benefits of machine learning. This approach is demonstrated on strain-rate-sensitive soft materials. This model is based on the viscous dissipation-based visco-hyperelasticity framework where the total stress is decomposed into volumetric, isochoric hyperelastic, and isochoric viscous overstress contributions. It is shown that each of these stress components can be written as linear combinations of the components of an irreducible integrity basis. Three Gaussian process regression-based surrogate models are trained (one per stress component) between principal invariants of strain and strain rate tensors and the corresponding coefficients of the integrity basis components. It is demonstrated that this type of model construction enforces key physics-based constraints on the predicted responses: the second law of thermodynamics, the principles of local action and determinism, objectivity, the balance of angular momentum, an assumed reference state, isotropy, and limited memory. The three surrogate models that constitute our constitutive model are evaluated by training them on small-size numerically generated data sets corresponding to a single deformation mode and then analyzing their predictions over a much wider testing regime comprising multiple deformation modes. Our physics-informed data-driven constitutive model predictions are compared with the corresponding predictions of classical continuum thermodynamics-based and purely data-driven models. It is shown that our surrogate models can reasonably capture the stress-strain-strain rate responses in both training and testing regimes, and provide improvements in terms of prediction accuracy, generalizability to multiple deformation modes, and compatibility with limited data.

Submitted 26 April, 2023; originally announced April 2023.

arXiv:2210.08343 [pdf, other]

doi 10.1016/j.cma.2023.115930

Modular machine learning-based elastoplasticity: generalization in the context of limited data

Authors: Jan N. Fuhg, Craig M. Hamel, Kyle Johnson, Reese Jones, Nikolaos Bouklas

Abstract: The development of accurate constitutive models for materials that undergo path-dependent processes continues to be a complex challenge in computational solid mechanics. Challenges arise both in considering the appropriate model assumptions and from the viewpoint of data availability, verification, and validation. Recently, data-driven modeling approaches have been proposed that aim to establish stress-evolution laws that avoid user-chosen functional forms by relying on machine learning representations and algorithms. However, these approaches not only require a significant amount of data but also need data that probes the full stress space with a variety of complex loading paths. Furthermore, they rarely enforce all necessary thermodynamic principles as hard constraints. Hence, they are in particular not suitable for low-data or limited-data regimes, where the former arises from the cost of obtaining the data and the latter from the experimental limitations of obtaining labeled data, which is commonly the case in engineering applications. In this work, we discuss a hybrid framework that can work on a variable amount of data by relying on the modularity of the elastoplasticity formulation where each component of the model can be chosen to be either a classical phenomenological or a data-driven model depending on the amount of available information and the complexity of the response. The method is tested on synthetic uniaxial data coming from simulations as well as cyclic experimental data for structural materials. The discovered material models are found to not only interpolate well but also allow for accurate extrapolation in a thermodynamically consistent manner far outside the domain of the training data. Training aspects and details of the implementation of these models into Finite Element simulations are discussed and analyzed.

Submitted 15 October, 2022; originally announced October 2022.

Comments: 36 pages, 25 figures

arXiv:2206.04675 [pdf, other]

Deep Convolutional Ritz Method: Parametric PDE surrogates without labeled data

Authors: Jan Niklas Fuhg, Arnav Karmarkar, Teeratorn Kadeethum, Hongkyu Yoon, Nikolaos Bouklas

Abstract: Parametric surrogate models for partial differential equations (PDEs) are a necessary component for many applications in the computational sciences, and convolutional neural networks (CNNs) have proven to be an excellent tool to generate these surrogates when parametric fields are present. CNNs are commonly trained on labeled data based on one-to-one sets of parameter-input and PDE-output fields. Recently, residual-based convolutional physics-informed neural network (CPINN) solvers for parametric PDEs have been proposed to build surrogates without the need for labeled data. These allow for the generation of surrogates without an expensive offline phase. In this work, we present an alternative formulation termed Deep Convolutional Ritz Method (DCRM) as a parametric PDE solver. The approach is based on the minimization of energy functionals, which lowers the order of the differential operators compared to residual-based methods. Based on studies involving the Poisson equation with a spatially parameterized source term and boundary conditions, we found that CNNs trained on labeled data outperform CPINNs in convergence speed and generalization ability. Surrogates generated from DCRM, however, converge significantly faster than their CPINN counterparts and prove to generalize faster and better than surrogates obtained from both CNNs trained on labeled data and CPINNs. This hints that DCRM could make PDE solution surrogates trained without labeled data possible.

Submitted 7 June, 2022; originally announced June 2022.

Comments: 20 pages, 12 figures

MSC Class: 65N99 (Primary) 35Q62; 35Q68 (Secondary)

ACM Class: G.1.8

arXiv:2204.04529 [pdf, other]

doi 10.1016/j.jmps.2022.105022

Learning hyperelastic anisotropy from data via a tensor basis neural network

Authors: Jan N. Fuhg, Nikolaos Bouklas, Reese E. Jones

Abstract: Anisotropy in the mechanical response of materials with microstructure is common and yet is difficult to assess and model. To construct accurate response models given only stress-strain data, we employ classical representation theory, novel neural network layers, and L1 regularization. The proposed tensor-basis neural network can discover both the type and orientation of the anisotropy and provide an accurate model of the stress response. The method is demonstrated with data from hyperelastic materials with off-axis transverse isotropy and orthotropy, as well as materials with less well-defined symmetries induced by fibers or spherical inclusions. Both plain feed-forward neural networks and input-convex neural network formulations are developed and tested. Using the latter, a polyconvex potential can be established, which, by satisfying the growth condition, can guarantee the existence of boundary value problem solutions.

Submitted 9 April, 2022; originally announced April 2022.

Comments: 36 pages, 20 figures

arXiv:2202.01885 [pdf, other]

Machine-learning convex and texture-dependent macroscopic yield from crystal plasticity simulations

Authors: Jan N. Fuhg, Lloyd van Wees, Mark Obstalecki, Paul Shade, Nikolaos Bouklas, Matthew Kasemer

Abstract: The influence of the microstructure of a polycrystalline material on its macroscopic deformation response is still one of the major problems in materials engineering. For materials characterized by elastic-plastic deformation responses, predictive computational models to characterize crystal-plasticity (CP) have been developed. However, due to their large demand for computational resources, CP simulations cannot be straightforwardly implemented in hierarchical computational models such as FE$^{2}$. This bottleneck intensifies the need for the development of macroscopic simulation tools that can be directly informed by microstructural quantities. Using a 3D Finite-Element solver for CP, we generate a macroscopic yield function database based on general loading conditions and crystallographic texture. We furthermore assume that the yield function is independent of hydrostatic pressure. Leveraging the advancement in statistical modeling, we describe and apply a machine learning framework for predicting macroscopic yield as a function of crystallographic texture. The convexity of the data-driven yield function is guaranteed by using partially input convex neural networks as the predictive tool. Furthermore, in order to allow for the predicted yield function to be directly incorporated in time-integration schemes, as needed for the Finite Element method, the yield surfaces are interpreted as the boundaries of signed distance function level sets.

Submitted 28 January, 2022; originally announced February 2022.

Comments: 23 pages, 16 figures

MSC Class: 74Q15 (Primary); 74C15; 68T07 (Secondary)

arXiv:2109.11028 [pdf, other]

doi 10.1016/j.cma.2022.114915

On physics-informed data-driven isotropic and anisotropic constitutive models through probabilistic machine learning and space-filling sampling

Authors: Jan Niklas Fuhg, Nikolaos Bouklas

Abstract: Data-driven constitutive modeling is an emerging field in computational solid mechanics with the prospect of significantly relieving the computational costs of hierarchical computational methods. Traditionally, these surrogates have been trained using datasets which map strain inputs to stress outputs directly. Data-driven constitutive models for elastic and inelastic materials have commonly been developed based on artificial neural networks (ANNs), which recently enabled the incorporation of physical laws in the construction of these models. However, ANNs do not offer convergence guarantees and are reliant on user-specified parameters. In contrast to ANNs, Gaussian process regression (GPR) is based on nonparametric modeling principles as well as on fundamental statistical knowledge and hence allows for strict convergence guarantees. GPR however has the major disadvantage that it scales poorly as datasets get large. In this work we present a physics-informed data-driven constitutive modeling approach for isotropic and anisotropic materials based on probabilistic machine learning that can be used in the big data context. The trained GPR surrogates are able to respect physical principles such as material frame indifference, material symmetry, thermodynamic consistency, stress-free undeformed configuration, and the local balance of angular momentum. Furthermore, this paper presents the first sampling approach that directly generates space-filling points in the invariant space corresponding to a bounded domain of the deformation gradient tensor. Overall, the presented approach is tested on synthetic data from isotropic and anisotropic constitutive laws and shows surprising accuracy even far beyond the limits of the training domain, indicating that the resulting surrogates can efficiently generalize as they incorporate knowledge about the underlying physics.

Submitted 19 September, 2021; originally announced September 2021.

Comments: 26 pages, 12 figures

MSC Class: 35Q74 (Primary); 35Q62 (Secondary)

ACM Class: J.2; I.2.6; G.1.8

arXiv:2106.13727 [pdf, other]

Interval and fuzzy physics-informed neural networks for uncertain fields

Authors: Jan Niklas Fuhg, Ioannis Kalogeris, Amélie Fau, Nikolaos Bouklas

Abstract: Temporally and spatially dependent uncertain parameters are regularly encountered in engineering applications. Commonly these uncertainties are accounted for using random fields and processes, which require knowledge about the appearing probability distribution functions that is not readily available. In these cases non-probabilistic approaches such as interval analysis and fuzzy set theory are helpful uncertainty measures. Partial differential equations involving fuzzy and interval fields are traditionally solved using the finite element method where the input fields are sampled using some basis function expansion methods. This approach however is problematic, as it is reliant on knowledge about the spatial correlation fields. In this work we utilize physics-informed neural networks (PINNs) to solve interval and fuzzy partial differential equations. The resulting network structures termed interval physics-informed neural networks (iPINNs) and fuzzy physics-informed neural networks (fPINNs) show promising results for obtaining bounded solutions of equations involving spatially and/or temporally uncertain parameter fields. In contrast to finite element approaches, neither a correlation length specification of the input fields nor Monte Carlo simulations are necessary. In fact, information about the input interval fields is obtained directly as a byproduct of the presented solution scheme. Furthermore, all major advantages of PINNs are retained, i.e., the meshfree nature of the scheme and the ease of inverse problem set-up.

Submitted 19 November, 2021; v1 submitted 18 June, 2021; originally announced June 2021.

Comments: Added new author who helped rewrite the paper. Added a new application. Slight rewrite of some sections. 18 pages, 19 figures

MSC Class: 35Q74 ACM Class: J.2; I.2.8

arXiv:2105.13136 [pdf, other]

A framework for data-driven solution and parameter estimation of PDEs using conditional generative adversarial networks

Authors: Teeratorn Kadeethum, Daniel O'Malley, Jan Niklas Fuhg, Youngsoo Choi, Jonghyun Lee, Hari S. Viswanathan, Nikolaos Bouklas

Abstract: This work is the first to employ and adapt the image-to-image translation concept based on conditional generative adversarial networks (cGAN) towards learning a forward and an inverse solution operator of partial differential equations (PDEs). Even though the proposed framework could be applied as a surrogate model for the solution of any PDEs, here we focus on steady-state solutions of coupled hydro-mechanical processes in heterogeneous porous media. Strongly heterogeneous material properties, which translate to the heterogeneity of coefficients of the PDEs and discontinuous features in the solutions, require specialized techniques for the forward and inverse solution of these problems. Additionally, parametrization of the spatially heterogeneous coefficients is excessively difficult by using standard reduced order modeling techniques. In this work, we overcome these challenges by employing the image-to-image translation concept to learn the forward and inverse solution operators and utilize a U-Net generator and a patch-based discriminator. Our results show that the proposed data-driven reduced order model has competitive predictive performance capabilities in accuracy and computational efficiency as well as training time requirements compared to state-of-the-art data-driven methods for both forward and inverse problems.

Submitted 27 May, 2021; originally announced May 2021.

arXiv:2105.04554 [pdf, other]

doi 10.1016/j.cma.2021.114217

Local approximate Gaussian process regression for data-driven constitutive laws: Development and comparison with neural networks

Authors: Jan Niklas Fuhg, Michele Marino, Nikolaos Bouklas

Abstract: Hierarchical computational methods for multiscale mechanics such as the FE$^2$ and FE-FFT methods are generally accompanied by high computational costs. Data-driven approaches are able to speed the process up significantly by incorporating the effective micromechanical response in macroscale simulations without the need to perform additional computations at each Gauss point explicitly. Traditionally artificial neural networks (ANNs) have been the surrogate modeling technique of choice in the solid mechanics community. However they suffer from severe drawbacks due to their parametric nature and suboptimal training and inference properties for the investigated datasets in a three dimensional setting. These problems can be avoided using local approximate Gaussian process regression (laGPR). This method can allow the prediction of stress outputs at particular strain space locations by training local regression models based on Gaussian processes, using only a subset of the data for each local model, offering better and more reliable accuracy than ANNs. A modified Newton-Raphson approach is proposed to accommodate the local nature of the laGPR approximation when solving the global structural problem in a FE setting. Hence, the presented work offers a complete and general framework enabling multiscale calculations combining a data-driven constitutive prediction using laGPR, and macroscopic calculations using an FE scheme that we test for finite-strain three-dimensional hyperelastic problems.

Submitted 7 May, 2021; originally announced May 2021.

Comments: 22 pages, 15 figures

MSC Class: 35Q74 (Primary); 35Q62 ACM Class: J.2; I.2.6; G.1.8

arXiv:2104.09623 [pdf, other]

doi 10.1016/j.jcp.2021.110839

The mixed deep energy method for resolving concentration features in finite strain hyperelasticity

Authors: Jan N. Fuhg, Nikolaos Bouklas

Abstract: The introduction of Physics-informed Neural Networks (PINNs) has led to an increased interest in deep neural networks as universal approximators of PDEs in the solid mechanics community. Recently, the Deep Energy Method (DEM) has been proposed. DEM is based on energy minimization principles, contrary to PINN which is based on the residual of the PDEs. A significant advantage of DEM is that it requires the approximation of lower order derivatives compared to formulations that are based on strong form residuals. However both DEM and classical PINN formulations struggle to resolve fine features of the stress and displacement fields, for example concentration features in solid mechanics applications. We propose an extension to the Deep Energy Method (DEM) to resolve these features for finite strain hyperelasticity. The developed framework termed mixed Deep Energy Method (mDEM) introduces stress measures as an additional output of the NN to the recently introduced pure displacement formulation. Using this approach, Neumann boundary conditions are approximated more accurately and the accuracy around spatial features which are typically responsible for high concentrations is increased. In order to make the proposed approach more versatile, we introduce a numerical integration scheme based on Delaunay integration, which enables the mDEM framework to be used for random training point position sets commonly needed for computational domains with stress concentrations. We highlight the advantages of the proposed approach while showing the shortcomings of classical PINN and DEM formulations. The method offers results comparable to the Finite-Element Method (FEM) on the forward calculation of challenging computational experiments involving domains with fine geometric features and concentrated loads.

Submitted 15 April, 2021; originally announced April 2021.

Comments: 17 pages, 15 figures

arXiv:2104.02650 [pdf, other]

doi 10.1016/j.ijengsci.2021.103522

Model-data-driven constitutive responses: application to a multiscale computational framework

Authors: Jan Niklas Fuhg, Christoph Boehm, Nikolaos Bouklas, Amelie Fau, Peter Wriggers, Michele Marino

Abstract: Computational multiscale methods for analyzing and deriving constitutive responses have been used as a tool in engineering problems because of their ability to combine information at different length scales. However, their application in a nonlinear framework can be limited by high computational costs, numerical difficulties, and/or inaccuracies. In this paper, a hybrid methodology is presented which combines classical constitutive laws (model-based), a data-driven correction component, and computational multiscale approaches. A model-based material representation is locally improved with data from lower scales obtained by means of a nonlinear numerical homogenization procedure leading to a model-data-driven approach. Therefore, macroscale simulations explicitly incorporate the true microscale response, maintaining the same level of accuracy that would be obtained with online micro-macro simulations but with a computational cost comparable to classical model-driven approaches. In the proposed approach, both model and data play a fundamental role allowing for the synergistic integration between a physics-based response and a machine learning black-box. Numerical applications are implemented in two dimensions for different tests investigating both material and structural responses in large deformation.

Submitted 6 April, 2021; originally announced April 2021.

Comments: 43 pages, 28 figures

MSC Class: 74B20; 68T99 ACM Class: G.1.8; I.2.6

Journal ref: International Journal of Engineering Science. 167 (2021) 103522

arXiv:2001.03438 [pdf, other]

doi 10.1016/j.cma.2020.113008

A machine learning based plasticity model using proper orthogonal decomposition

Authors: Dengpeng Huang, Jan Niklas Fuhg, Christian Weißenfels, Peter Wriggers

Abstract: Data-driven material models have many advantages over classical numerical approaches, such as the direct utilization of experimental data and the possibility to improve performance of predictions when additional data is available. One approach to develop a data-driven material model is to use machine learning tools. These can be trained offline to fit an observed material behaviour and then be applied in online applications. However, learning and predicting history dependent material models, such as plasticity, is still challenging. In this work, a machine learning based material modelling framework is proposed for both elasticity and plasticity. The machine learning based hyperelasticity model is developed with the Feed forward Neural Network (FNN) directly whereas the machine learning based plasticity model is developed using a novel method called Proper Orthogonal Decomposition Feed forward Neural Network (PODFNN). In order to account for the loading history, the accumulated absolute strain is proposed to be the history variable of the plasticity model. Additionally, the strain-stress sequence data for plasticity is collected from different loading-unloading paths based on the concept of sequence for plasticity. By means of the POD, the multi-dimensional stress sequence is decoupled leading to independent one dimensional coefficient sequences. In this case, the neural network with multiple outputs is replaced by multiple independent neural networks each possessing a one-dimensional output, which leads to less training time and better training performance. To apply the machine learning based material model in finite element analysis, the tangent matrix is derived by the automatic symbolic differentiation tool AceGen. The effectiveness and generalization of the presented models are investigated by a series of numerical examples using both 2D and 3D finite element analysis.

Submitted 7 January, 2020; originally announced January 2020.

Journal ref: Computer Methods in Applied Mechanics and Engineering, Volume 365, 2020, 113008, ISSN 0045-7825

arXiv:1907.02208 [pdf, other]

Surrogate model approach for investigating the stability of a friction-induced oscillator of Duffing's type

Authors: Jan N. Fuhg, Amelie Fau

Abstract: Parametric studies for dynamic systems are of high interest to detect instability domains. This prediction can be demanding as it requires a refined exploration of the parametric space due to the disrupted mechanical behavior. In this paper, an efficient surrogate strategy is proposed to investigate the behavior of an oscillator of Duffing's type in combination with an elasto-plastic friction force model. Relevant quantities of interest are discussed. Sticking time is considered using a machine learning technique based on Gaussian processes called kriging. The largest Lyapunov exponent is proposed as an efficient indicator of non-regular behavior. This indicator is estimated using a perturbation method. A dedicated adaptive kriging strategy for classification called MiVor is utilized and appears to be highly proficient in order to detect instabilities over the parametric space and can furthermore be used for complex response surfaces in multi-dimensional parametric domains.

Submitted 2 July, 2019; originally announced July 2019.

Comments: 33 pages, 20 Figures

arXiv:1907.01490 [pdf, other]

An innovative adaptive kriging approach for efficient binary classification of mechanical problems

Authors: Jan N. Fuhg, Amelie Fau

Abstract: Kriging is an efficient machine-learning tool, which allows one to obtain an approximate response of an investigated phenomenon on the whole parametric space. Adaptive schemes provide the ability to guide the experiment yielding new sample point positions to enrich the metamodel. Herein a novel adaptive scheme called Monte Carlo-intersite Voronoi (MiVor) is proposed to efficiently identify binary decision regions on the basis of a regression surrogate model. The performance of the innovative approach is tested for analytical functions as well as some mechanical problems and is furthermore compared to two regression-based adaptive schemes. For smooth problems, all three methods have comparable performances. For highly fluctuating response surfaces as encountered e.g. for dynamics or damage problems, the innovative MiVor algorithm performs very well and provides accurate binary classification with only a few observation points.

Submitted 2 July, 2019; originally announced July 2019.

Comments: 62 pages, 26 Figures

arXiv:1905.05345 [pdf, other]

Adaptive surrogate models for parametric studies

Authors: Jan N. Fuhg

Abstract: The computational effort for the evaluation of numerical simulations based on e.g. the finite-element method is high. Metamodels can be utilized to create a low-cost alternative. However the number of required samples for the creation of a sufficient metamodel should be kept low, which can be achieved by using adaptive sampling techniques. In this Master's thesis adaptive sampling techniques are investigated for their use in creating metamodels with the Kriging technique, which interpolates values by a Gaussian process governed by prior covariances. The Kriging framework with extension to multifidelity problems is presented and utilized to compare adaptive sampling techniques found in the literature for benchmark problems as well as applications for contact mechanics. This thesis offers the first comprehensive comparison of a large spectrum of adaptive techniques for the Kriging framework. Furthermore a multitude of adaptive techniques is introduced to multifidelity Kriging as well as to a Kriging model with reduced hyperparameter dimension called partial least squares Kriging. In addition, an innovative adaptive scheme for binary classification is presented and tested for identifying chaotic motion of a Duffing's type oscillator.

Submitted 12 May, 2019; originally announced May 2019.

Comments: 225 pages, Master's thesis, Leibniz University of Hannover, Germany (2019)

Showing 1–23 of 23 results for author: Fuhg, J