-
Improving Explainability of Softmax Classifiers Using a Prototype-Based Joint Embedding Method
Authors:
Hilarie Sit,
Brendan Keith,
Karianne Bergen
Abstract:
We propose a prototype-based approach for improving explainability of softmax classifiers that provides an understandable prediction confidence, generated through stochastic sampling of prototypes, and demonstrates potential for out of distribution detection (OOD). By modifying the model architecture and training to make predictions using similarities to any set of class examples from the training…
▽ More
We propose a prototype-based approach for improving explainability of softmax classifiers that provides an understandable prediction confidence, generated through stochastic sampling of prototypes, and demonstrates potential for out of distribution detection (OOD). By modifying the model architecture and training to make predictions using similarities to any set of class examples from the training dataset, we acquire the ability to sample for prototypical examples that contributed to the prediction, which provide an instance-based explanation for the model's decision. Furthermore, by learning relationships between images from the training dataset through relative distances within the model's latent space, we obtain a metric for uncertainty that is better able to detect out of distribution data than softmax confidence.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
Finite elements for Matérn-type random fields: Uncertainty in computational mechanics and design optimization
Authors:
Tobias Duswald,
Brendan Keith,
Boyan Lazarov,
Socratis Petrides,
Barbara Wohlmuth
Abstract:
This work highlights an approach for incorporating realistic uncertainties into scientific computing workflows based on finite elements, focusing on applications in computational mechanics and design optimization. We leverage Matérn-type Gaussian random fields (GRFs) generated using the SPDE method to model aleatoric uncertainties, including environmental influences, variating material properties,…
▽ More
This work highlights an approach for incorporating realistic uncertainties into scientific computing workflows based on finite elements, focusing on applications in computational mechanics and design optimization. We leverage Matérn-type Gaussian random fields (GRFs) generated using the SPDE method to model aleatoric uncertainties, including environmental influences, variating material properties, and geometric ambiguities. Our focus lies on delivering practical GRF realizations that accurately capture imperfections and variations and understanding how they impact the predictions of computational models and the topology of optimized designs. We describe a numerical algorithm based on solving a generalized SPDE to sample GRFs on arbitrary meshed domains. The algorithm leverages established techniques and integrates seamlessly with the open-source finite element library MFEM and associated scientific computing workflows, like those found in industrial and national laboratory settings. Our solver scales efficiently for large-scale problems and supports various domain types, including surfaces and embedded manifolds. We showcase its versatility through biomechanics and topology optimization applications. The flexibility and efficiency of SPDE-based GRF generation empower us to run large-scale optimization problems on 2D and 3D domains, including finding optimized designs on embedded surfaces, and to generate topologies beyond the reach of conventional techniques. Moreover, these capabilities allow us to model geometric uncertainties of reconstructed submanifolds, such as the surfaces of cerebral aneurysms. In addition to offering benefits in these specific domains, the proposed techniques transcend specific applications and generalize to arbitrary forward and backward problems in uncertainty quantification involving finite elements.
△ Less
Submitted 24 May, 2024; v1 submitted 6 March, 2024;
originally announced March 2024.
-
High-performance finite elements with MFEM
Authors:
Julian Andrej,
Nabil Atallah,
Jan-Phillip Bäcker,
John Camier,
Dylan Copeland,
Veselin Dobrev,
Yohann Dudouit,
Tobias Duswald,
Brendan Keith,
Dohyun Kim,
Tzanio Kolev,
Boyan Lazarov,
Ketan Mittal,
Will Pazner,
Socratis Petrides,
Syun'ichi Shiraiwa,
Mark Stowell,
Vladimir Tomov
Abstract:
The MFEM (Modular Finite Element Methods) library is a high-performance C++ library for finite element discretizations. MFEM supports numerous types of finite element methods and is the discretization engine powering many computational physics and engineering applications across a number of domains. This paper describes some of the recent research and development in MFEM, focusing on performance p…
▽ More
The MFEM (Modular Finite Element Methods) library is a high-performance C++ library for finite element discretizations. MFEM supports numerous types of finite element methods and is the discretization engine powering many computational physics and engineering applications across a number of domains. This paper describes some of the recent research and development in MFEM, focusing on performance portability across leadership-class supercomputing facilities, including exascale supercomputers, as well as new capabilities and functionality, enabling a wider range of applications. Much of this work was undertaken as part of the Department of Energy's Exascale Computing Project (ECP) in collaboration with the Center for Efficient Exascale Discretizations (CEED).
△ Less
Submitted 24 February, 2024;
originally announced February 2024.
-
DynAMO: Multi-agent reinforcement learning for dynamic anticipatory mesh optimization with applications to hyperbolic conservation laws
Authors:
Tarik Dzanic,
Ketan Mittal,
Dohyun Kim,
Jiachen Yang,
Socratis Petrides,
Brendan Keith,
Robert Anderson
Abstract:
We introduce DynAMO, a reinforcement learning paradigm for Dynamic Anticipatory Mesh Optimization. Adaptive mesh refinement is an effective tool for optimizing computational cost and solution accuracy in numerical methods for partial differential equations. However, traditional adaptive mesh refinement approaches for time-dependent problems typically rely only on instantaneous error indicators to…
▽ More
We introduce DynAMO, a reinforcement learning paradigm for Dynamic Anticipatory Mesh Optimization. Adaptive mesh refinement is an effective tool for optimizing computational cost and solution accuracy in numerical methods for partial differential equations. However, traditional adaptive mesh refinement approaches for time-dependent problems typically rely only on instantaneous error indicators to guide adaptivity. As a result, standard strategies often require frequent remeshing to maintain accuracy. In the DynAMO approach, multi-agent reinforcement learning is used to discover new local refinement policies that can anticipate and respond to future solution states by producing meshes that deliver more accurate solutions for longer time intervals. By applying DynAMO to discontinuous Galerkin methods for the linear advection and compressible Euler equations in two dimensions, we demonstrate that this new mesh refinement paradigm can outperform conventional threshold-based strategies while also generalizing to different mesh sizes, remeshing and simulation times, and initial conditions.
△ Less
Submitted 9 March, 2024; v1 submitted 2 October, 2023;
originally announced October 2023.
-
Proximal Galerkin: A structure-preserving finite element method for pointwise bound constraints
Authors:
Brendan Keith,
Thomas M. Surowiec
Abstract:
The proximal Galerkin finite element method is a high-order, low iteration complexity, nonlinear numerical method that preserves the geometric and algebraic structure of pointwise bound constraints in infinite-dimensional function spaces. This paper introduces the proximal Galerkin method and applies it to solve free boundary problems, enforce discrete maximum principles, and develop a scalable, m…
▽ More
The proximal Galerkin finite element method is a high-order, low iteration complexity, nonlinear numerical method that preserves the geometric and algebraic structure of pointwise bound constraints in infinite-dimensional function spaces. This paper introduces the proximal Galerkin method and applies it to solve free boundary problems, enforce discrete maximum principles, and develop a scalable, mesh-independent algorithm for optimal design problems with pointwise bound constraints. This paper also provides a derivation of the latent variable proximal point (LVPP) algorithm, an unconditionally stable alternative to the interior point method. LVPP is an infinite-dimensional optimization algorithm that may be viewed as having an adaptive barrier function that is updated with a new informative prior at each (outer loop) optimization iteration. One of its main benefits is witnessed when analyzing the classical obstacle problem. Therein, we find that the original variational inequality can be replaced by a sequence of partial differential equations (PDEs) that are readily discretized and solved with, e.g., high-order finite elements. Throughout this work, we arrive at several unexpected contributions that may be of independent interest. These include (1) a semilinear PDE we refer to as the entropic Poisson equation; (2) an algebraic/geometric connection between high-order positivity-preserving discretizations and certain infinite-dimensional Lie groups; and (3) a gradient-based, bound-preserving algorithm for two-field density-based topology optimization. The complete latent variable proximal Galerkin methodology combines ideas from nonlinear programming, functional analysis, tropical algebra, and differential geometry and can potentially lead to new synergies among these areas as well as within variational and numerical analysis.
△ Less
Submitted 24 October, 2023; v1 submitted 23 July, 2023;
originally announced July 2023.
-
An Adaptive Sampling Augmented Lagrangian Method for Stochastic Optimization with Deterministic Constraints
Authors:
Raghu Bollapragada,
Cem Karamanli,
Brendan Keith,
Boyan Lazarov,
Socratis Petrides,
**gyi Wang
Abstract:
The primary goal of this paper is to provide an efficient solution algorithm based on the augmented Lagrangian framework for optimization problems with a stochastic objective function and deterministic constraints. Our main contribution is combining the augmented Lagrangian framework with adaptive sampling, resulting in an efficient optimization methodology validated with practical examples. To ac…
▽ More
The primary goal of this paper is to provide an efficient solution algorithm based on the augmented Lagrangian framework for optimization problems with a stochastic objective function and deterministic constraints. Our main contribution is combining the augmented Lagrangian framework with adaptive sampling, resulting in an efficient optimization methodology validated with practical examples. To achieve the presented efficiency, we consider inexact solutions for the augmented Lagrangian subproblems, and through an adaptive sampling mechanism, we control the variance in the gradient estimates. Furthermore, we analyze the theoretical performance of the proposed scheme by showing equivalence to a gradient descent algorithm on a Moreau envelope function, and we prove sublinear convergence for convex objectives and linear convergence for strongly convex objectives with affine equality constraints. The worst-case sample complexity of the resulting algorithm, for an arbitrary choice of penalty parameter in the augmented Lagrangian function, is $O(ε^{-3-δ})$, where $ε> 0$ is the expected error of the solution and $δ> 0$ is a user-defined parameter. If the penalty parameter is chosen to be $O(ε^{-1})$, we demonstrate that the result can be improved to $O(ε^{-2})$, which is competitive with the other methods employed in the literature. Moreover, if the objective function is strongly convex with affine equality constraints, we obtain $O(ε^{-1}\log(1/ε))$ complexity. Finally, we empirically verify the performance of our adaptive sampling augmented Lagrangian framework in machine learning optimization and engineering design problems, including topology optimization of a heat sink with environmental uncertainty.
△ Less
Submitted 26 December, 2023; v1 submitted 1 May, 2023;
originally announced May 2023.
-
Multi-Agent Reinforcement Learning for Adaptive Mesh Refinement
Authors:
Jiachen Yang,
Ketan Mittal,
Tarik Dzanic,
Socratis Petrides,
Brendan Keith,
Brenden Petersen,
Daniel Faissol,
Robert Anderson
Abstract:
Adaptive mesh refinement (AMR) is necessary for efficient finite element simulations of complex physical phenomenon, as it allocates limited computational budget based on the need for higher or lower resolution, which varies over space and time. We present a novel formulation of AMR as a fully-cooperative Markov game, in which each element is an independent agent who makes refinement and de-refine…
▽ More
Adaptive mesh refinement (AMR) is necessary for efficient finite element simulations of complex physical phenomenon, as it allocates limited computational budget based on the need for higher or lower resolution, which varies over space and time. We present a novel formulation of AMR as a fully-cooperative Markov game, in which each element is an independent agent who makes refinement and de-refinement choices based on local information. We design a novel deep multi-agent reinforcement learning (MARL) algorithm called Value Decomposition Graph Network (VDGN), which solves the two core challenges that AMR poses for MARL: posthumous credit assignment due to agent creation and deletion, and unstructured observations due to the diversity of mesh geometries. For the first time, we show that MARL enables anticipatory refinement of regions that will encounter complex features at future times, thereby unlocking entirely new regions of the error-cost objective landscape that are inaccessible by traditional methods based on local error estimators. Comprehensive experiments show that VDGN policies significantly outperform error threshold-based policies in global error and cost metrics. We show that learned policies generalize to test problems with physical features, mesh geometries, and longer simulation times that were not seen in training. We also extend VDGN with multi-objective optimization capabilities to find the Pareto front of the tradeoff between cost and error.
△ Less
Submitted 23 February, 2023; v1 submitted 1 November, 2022;
originally announced November 2022.
-
Learning robust marking policies for adaptive mesh refinement
Authors:
Andrew Gillette,
Brendan Keith,
Socratis Petrides
Abstract:
In this work, we revisit the marking decisions made in the standard adaptive finite element method (AFEM). Experience shows that a naïve marking policy leads to inefficient use of computational resources for adaptive mesh refinement (AMR). Consequently, using AFEM in practice often involves ad-hoc or time-consuming offline parameter tuning to set appropriate parameters for the marking subroutine.…
▽ More
In this work, we revisit the marking decisions made in the standard adaptive finite element method (AFEM). Experience shows that a naïve marking policy leads to inefficient use of computational resources for adaptive mesh refinement (AMR). Consequently, using AFEM in practice often involves ad-hoc or time-consuming offline parameter tuning to set appropriate parameters for the marking subroutine. To address these practical concerns, we recast AMR as a Markov decision process in which refinement parameters can be selected on-the-fly at run time, without the need for pre-tuning by expert users. In this new paradigm, the refinement parameters are also chosen adaptively via a marking policy that can be optimized using methods from reinforcement learning. We use the Poisson equation to demonstrate our techniques on $h$- and $hp$-refinement benchmark problems, and our experiments suggest that superior marking policies remain undiscovered for many classical AFEM applications. Furthermore, an unexpected observation from this work is that marking policies trained on one family of PDEs are sometimes robust enough to perform well on problems far outside the training family. For illustration, we show that a simple $hp$-refinement policy trained on 2D domains with only a single re-entrant corner can be deployed on far more complicated 2D domains, and even 3D domains, without significant performance loss. For reproduction and broader adoption, we accompany this work with an open-source implementation of our methods.
△ Less
Submitted 26 December, 2023; v1 submitted 13 July, 2022;
originally announced July 2022.
-
Risk-averse design of tall buildings for uncertain wind conditions
Authors:
Anoop Kodakkal,
Brendan Keith,
Ustim Khristenko,
Andreas Apostolatos,
Kai-Uwe Bletzinger,
Barbara Wohlmuth,
Roland Wuechner
Abstract:
Reducing the intensity of wind excitation via aerodynamic shape modification is a major strategy to mitigate the reaction forces on supertall buildings, reduce construction and maintenance costs, and improve the comfort of future occupants. To this end, computational fluid dynamics (CFD) combined with state-of-the-art stochastic optimization algorithms is more promising than the trial and error ap…
▽ More
Reducing the intensity of wind excitation via aerodynamic shape modification is a major strategy to mitigate the reaction forces on supertall buildings, reduce construction and maintenance costs, and improve the comfort of future occupants. To this end, computational fluid dynamics (CFD) combined with state-of-the-art stochastic optimization algorithms is more promising than the trial and error approach adopted by the industry. The present study proposes and investigates a novel approach to risk-averse shape optimization of tall building structures that incorporates site-specific uncertainties in the wind velocity, terrain conditions, and wind flow direction. A body-fitted finite element approximation is used for the CFD with different wind directions incorporated by re-meshing the fluid domain. The bending moment at the base of the building is minimized, resulting in a building with reduced cost, material, and hence, a reduced carbon footprint. Both risk-neutral and risk-averse optimization of the twist and tapering of a representative building are presented under uncertain inflow wind conditions that have been calibrated to fit freely-available site-specific data from Basel, Switzerland. The risk-averse strategy uses the conditional value-at-risk to optimize for the low-probability high-consequence events appearing in the worst 10% of loading conditions. Adaptive sampling is used to accelerate the gradient-based stochastic optimization pipeline. The adaptive method is easy to implement and particularly helpful for compute-intensive simulations because the number of gradient samples grows only as the optimal design algorithm converges. The performance of the final risk-averse building geometry is exceptionally favorable when compared to the risk-neutral optimized geometry, thus, demonstrating the effectiveness of the risk-averse design approach in computational wind engineering.
△ Less
Submitted 6 July, 2022; v1 submitted 22 March, 2022;
originally announced March 2022.
-
Learning the structure of wind: A data-driven nonlocal turbulence model for the atmospheric boundary layer
Authors:
Brendan Keith,
Ustim Khristenko,
Barbara Wohlmuth
Abstract:
We develop a novel data-driven approach to modeling the atmospheric boundary layer. This approach leads to a nonlocal, anisotropic synthetic turbulence model which we refer to as the deep rapid distortion (DRD) model. Our approach relies on an operator regression problem which characterizes the best fitting candidate in a general family of nonlocal covariance kernels parameterized in part by a neu…
▽ More
We develop a novel data-driven approach to modeling the atmospheric boundary layer. This approach leads to a nonlocal, anisotropic synthetic turbulence model which we refer to as the deep rapid distortion (DRD) model. Our approach relies on an operator regression problem which characterizes the best fitting candidate in a general family of nonlocal covariance kernels parameterized in part by a neural network. This family of covariance kernels is expressed in Fourier space and is obtained from approximate solutions to the Navier--Stokes equations at very high Reynolds numbers. Each member of the family incorporates important physical properties such as mass conservation and a realistic energy cascade. The DRD model can be calibrated with noisy data from field experiments. After calibration, the model can be used to generate synthetic turbulent velocity fields. To this end, we provide a new numerical method based on domain decomposition which delivers scalable, memory-efficient turbulence generation with the DRD model as well as others. We demonstrate the robustness of our approach with both filtered and noisy data coming from the 1968 Air Force Cambridge Research Laboratory Kansas experiments. Using this data, we witness exceptional accuracy with the DRD model, especially when compared to the International Electrotechnical Commission standard.
△ Less
Submitted 23 July, 2021;
originally announced July 2021.
-
Learning orbital dynamics of binary black hole systems from gravitational wave measurements
Authors:
Brendan Keith,
Akshay Khadse,
Scott E. Field
Abstract:
We introduce a gravitational waveform inversion strategy that discovers mechanical models of binary black hole (BBH) systems. We show that only a single time series of (possibly noisy) waveform data is necessary to construct the equations of motion for a BBH system. Starting with a class of universal differential equations parameterized by feed-forward neural networks, our strategy involves the co…
▽ More
We introduce a gravitational waveform inversion strategy that discovers mechanical models of binary black hole (BBH) systems. We show that only a single time series of (possibly noisy) waveform data is necessary to construct the equations of motion for a BBH system. Starting with a class of universal differential equations parameterized by feed-forward neural networks, our strategy involves the construction of a space of plausible mechanical models and a physics-informed constrained optimization within that space to minimize the waveform error. We apply our method to various BBH systems including extreme and comparable mass ratio systems in eccentric and non-eccentric orbits. We show the resulting differential equations apply to time durations longer than the training interval, and relativistic effects, such as perihelion precession, radiation reaction, and orbital plunge, are automatically accounted for. The methods outlined here provide a new, data-driven approach to studying the dynamics of binary black hole systems.
△ Less
Submitted 8 November, 2021; v1 submitted 25 February, 2021;
originally announced February 2021.
-
A priori error analysis of high-order LL* (FOSLL*) finite element methods
Authors:
Brendan Keith
Abstract:
A number of non-standard finite element methods have been proposed in recent years, each of which derives from a specific class of PDE-constrained norm minimization problems. The most notable examples are $\mathcal{L}\mathcal{L}^*$ methods. In this work, we argue that all high-order methods in this class should be expected to deliver substandard uniform h-refinement convergence rates. In fact, one…
▽ More
A number of non-standard finite element methods have been proposed in recent years, each of which derives from a specific class of PDE-constrained norm minimization problems. The most notable examples are $\mathcal{L}\mathcal{L}^*$ methods. In this work, we argue that all high-order methods in this class should be expected to deliver substandard uniform h-refinement convergence rates. In fact, one may not even see rates proportional to the polynomial order $p > 1$ when the exact solution is a constant function. We show that the convergence rate is limited by the regularity of an extraneous Lagrange multiplier variable which naturally appears via a saddle-point analysis. In turn, limited convergence rates appear because the regularity of this Lagrange multiplier is determined, in part, by the geometry of the domain. Numerical experiments support our conclusions.
△ Less
Submitted 30 August, 2021; v1 submitted 17 December, 2020;
originally announced December 2020.
-
Adaptive sampling strategies for risk-averse stochastic optimization with constraints
Authors:
Florian Beiser,
Brendan Keith,
Simon Urbainczyk,
Barbara Wohlmuth
Abstract:
We introduce adaptive sampling methods for stochastic programs with deterministic constraints. First, we propose and analyze a variant of the stochastic projected gradient method where the sample size used to approximate the reduced gradient is determined on-the-fly and updated adaptively. This method is applicable to a broad class of expectation-based risk measures and leads to a significant redu…
▽ More
We introduce adaptive sampling methods for stochastic programs with deterministic constraints. First, we propose and analyze a variant of the stochastic projected gradient method where the sample size used to approximate the reduced gradient is determined on-the-fly and updated adaptively. This method is applicable to a broad class of expectation-based risk measures and leads to a significant reduction in the individual gradient evaluations used to estimate the objective function gradient. Numerical experiments with expected risk minimization and conditional value-at-risk minimization support this conclusion and demonstrate practical performance and efficacy for both risk-neutral and risk-averse problems. Second, we propose an SQP-type method based on similar adaptive sampling principles. The benefits of this method are demonstrated in a simplified engineering design application featuring risk-averse shape optimization of a steel shell structure subject to uncertain loading conditions and model uncertainty.
△ Less
Submitted 5 February, 2023; v1 submitted 7 December, 2020;
originally announced December 2020.
-
Narrative Maps: An Algorithmic Approach to Represent and Extract Information Narratives
Authors:
Brian Keith,
Tanushree Mitra
Abstract:
Narratives are fundamental to our perception of the world and are pervasive in all activities that involve the representation of events in time. Yet, modern online information systems do not incorporate narratives in their representation of events occurring over time. This article aims to bridge this gap, combining the theory of narrative representations with the data from modern online systems. W…
▽ More
Narratives are fundamental to our perception of the world and are pervasive in all activities that involve the representation of events in time. Yet, modern online information systems do not incorporate narratives in their representation of events occurring over time. This article aims to bridge this gap, combining the theory of narrative representations with the data from modern online systems. We make three key contributions: a theory-driven computational representation of narratives, a novel extraction algorithm to obtain these representations from data, and an evaluation of our approach. In particular, given the effectiveness of visual metaphors, we employ a route map metaphor to design a narrative map representation. The narrative map representation illustrates the events and stories in the narrative as a series of landmarks and routes on the map. Each element of our representation is backed by a corresponding element from formal narrative theory, thus providing a solid theoretical background to our method. Our approach extracts the underlying graph structure of the narrative map using a novel optimization technique focused on maximizing coherence while respecting structural and coverage constraints. We showcase the effectiveness of our approach by performing a user evaluation to assess the quality of the representation, metaphor, and visualization. Evaluation results indicate that the Narrative Map representation is a powerful method to communicate complex narratives to individuals. Our findings have implications for intelligence analysts, computational journalists, and misinformation researchers.
△ Less
Submitted 26 October, 2020; v1 submitted 9 September, 2020;
originally announced September 2020.
-
A fractional PDE model for turbulent velocity fields near solid walls
Authors:
Brendan Keith,
Ustim Khristenko,
Barbara Wohlmuth
Abstract:
This paper presents a class of turbulence models written in terms of fractional partial differential equations (FPDEs) with stochastic loads. Every solution of these FPDE models is an incompressible velocity field and the distribution of solutions is Gaussian. Interaction of the turbulence with solid walls is incorporated through the enforcement of various boundary conditions. The various boundary…
▽ More
This paper presents a class of turbulence models written in terms of fractional partial differential equations (FPDEs) with stochastic loads. Every solution of these FPDE models is an incompressible velocity field and the distribution of solutions is Gaussian. Interaction of the turbulence with solid walls is incorporated through the enforcement of various boundary conditions. The various boundary conditions deliver extensive flexibility in the near-wall statistics that can be modelled. Reproduction of both fully-developed shear-free and uniform shear boundary layer turbulence are highlighted as two simple physical applications; the first of which is also directly validated with experimental data. The rendering of inhomogeneous synthetic turbulence inlet boundary conditions is an additional application, motivated by contemporary numerical wind tunnel simulations. Calibration of model parameters and efficient numerical methods are also conferred upon.
△ Less
Submitted 10 September, 2020; v1 submitted 10 August, 2020;
originally announced August 2020.
-
The surrogate matrix methodology: Accelerating isogeometric analysis of waves
Authors:
Daniel Drzisga,
Brendan Keith,
Barbara Wohlmuth
Abstract:
The surrogate matrix methodology delivers low-cost approximations of matrices (i.e., surrogate matrices) which are normally computed in Galerkin methods via element-scale quadrature formulas. In this paper, the methodology is applied to a number of model problems in wave mechanics treated in the Galerkin isogeometic setting. Herein, the resulting surrogate methods are shown to significantly reduce…
▽ More
The surrogate matrix methodology delivers low-cost approximations of matrices (i.e., surrogate matrices) which are normally computed in Galerkin methods via element-scale quadrature formulas. In this paper, the methodology is applied to a number of model problems in wave mechanics treated in the Galerkin isogeometic setting. Herein, the resulting surrogate methods are shown to significantly reduce the assembly time in high frequency wave propagation problems. In particular, the assembly time is reduced with negligible loss in solution accuracy. This paper also extends the scope of previous articles in its series by considering multi-patch discretizations of time-harmonic, transient, and nonlinear PDEs as particular use cases of the methodology. Our a priori error analysis for the Helmholtz equation demonstrates that the additional consistency error introduced by the presence of surrogate matrices is independent of the wave number. In addition, our floating point analysis establishes that the computational complexity of the methodology compares favorably to other contemporary fast assembly techniques for isogeometric methods. Our numerical experiments demonstrate clear performance gains for time-harmonic problems, both with and without the presence of perfectly matched layers. Notable speed-ups are also presented for a transient problem with a compressible neo-Hookean material.
△ Less
Submitted 1 July, 2020; v1 submitted 10 April, 2020;
originally announced April 2020.
-
The surrogate matrix methodology: A reference implementation for low-cost assembly in isogeometric analysis
Authors:
Daniel Drzisga,
Brendan Keith,
Barbara Wohlmuth
Abstract:
A reference implementation of a new method in isogeometric analysis (IGA) is presented. It delivers low-cost variable-scale approximations (surrogates) of the matrices which IGA conventionally requires to be computed by element-scale quadrature. To generate surrogate matrices, quadrature must only be performed on a fraction of the elements in the computational domain. In this way, quadrature deter…
▽ More
A reference implementation of a new method in isogeometric analysis (IGA) is presented. It delivers low-cost variable-scale approximations (surrogates) of the matrices which IGA conventionally requires to be computed by element-scale quadrature. To generate surrogate matrices, quadrature must only be performed on a fraction of the elements in the computational domain. In this way, quadrature determines only a subset of the entries in the final matrix. The remaining matrix entries are computed by a simple B-spline interpolation procedure. We present the modifications and extensions required for a reference implementation in the open-source IGA software library GeoPDEs. The exposition is fashioned to help facilitate similar modifications in other contemporary software libraries.
△ Less
Submitted 8 September, 2019;
originally announced September 2019.
-
The surrogate matrix methodology: Low-cost assembly for isogeometric analysis
Authors:
Daniel Drzisga,
Brendan Keith,
Barbara Wohlmuth
Abstract:
A new methodology in isogeometric analysis (IGA) is presented. This methodology delivers low-cost variable-scale approximations (surrogates) of the matrices which IGA conventionally requires to be computed from element-scale quadrature formulas. To generate surrogate matrices, quadrature must only be performed on certain elements in the computational domain. This, in turn, determines only a subset…
▽ More
A new methodology in isogeometric analysis (IGA) is presented. This methodology delivers low-cost variable-scale approximations (surrogates) of the matrices which IGA conventionally requires to be computed from element-scale quadrature formulas. To generate surrogate matrices, quadrature must only be performed on certain elements in the computational domain. This, in turn, determines only a subset of the entries in the final matrix. The remaining matrix entries are computed by a simple B-spline interpolation procedure. Poisson's equation, membrane vibration, plate bending, and Stokes' flow problems are studied. In these problems, the use of surrogate matrices has a negligible impact on solution accuracy. Because only a small fraction of the original quadrature must be performed, we are able to report beyond a fifty-fold reduction in overall assembly time in the same software. The capacity for even further speed-ups is clearly demonstrated. The implementation used here was achieved by a small number of modifications to the open-source IGA software library GeoPDEs. Similar modifications could be made to other present-day software libraries.
△ Less
Submitted 17 September, 2019; v1 submitted 15 April, 2019;
originally announced April 2019.
-
The surrogate matrix methodology: a priori error estimation
Authors:
Daniel Drzisga,
Brendan Keith,
Barbara Wohlmuth
Abstract:
We give the first mathematically rigorous analysis of an emerging approach to finite element analysis (see, e.g., Bauer et al. [Appl. Numer. Math., 2017]), which we hereby refer to as the surrogate matrix methodology. This methodology is based on the piece-wise smooth approximation of the matrices involved in a standard finite element discretization. In particular, it relies on the projection of s…
▽ More
We give the first mathematically rigorous analysis of an emerging approach to finite element analysis (see, e.g., Bauer et al. [Appl. Numer. Math., 2017]), which we hereby refer to as the surrogate matrix methodology. This methodology is based on the piece-wise smooth approximation of the matrices involved in a standard finite element discretization. In particular, it relies on the projection of smooth so-called stencil functions onto high-order polynomial subspaces. The performance advantage of the surrogate matrix methodology is seen in constructions where each stencil function uniquely determines the values of a significant collection of matrix entries. Such constructions are shown to be widely achievable through the use of locally-structured meshes. Therefore, this methodology can be applied to a wide variety of physically meaningful problems, including nonlinear problems and problems with curvilinear geometries. Rigorous a priori error analysis certifies the convergence of a novel surrogate method for the variable coefficient Poisson equation. The flexibility of the methodology is also demonstrated through the construction of novel methods for linear elasticity and nonlinear diffusion problems. In numerous numerical experiments, we demonstrate the efficacy of these new methods in a matrix-free environment with geometric multigrid solvers. In our experiments, up to a twenty-fold decrease in computation time is witnessed over the classical method with an otherwise identical implementation.
△ Less
Submitted 19 February, 2019;
originally announced February 2019.
-
The DPG-star method
Authors:
Leszek Demkowicz,
Jay Gopalakrishnan,
Brendan Keith
Abstract:
This article introduces the DPG-star (from now on, denoted DPG$^*$) finite element method. It is a method that is in some sense dual to the discontinuous Petrov-Galerkin (DPG) method. The DPG methodology can be viewed as a means to solve an overdetermined discretization of a boundary value problem. In the same vein, the DPG$^*$ methodology is a means to solve an underdetermined discretization. The…
▽ More
This article introduces the DPG-star (from now on, denoted DPG$^*$) finite element method. It is a method that is in some sense dual to the discontinuous Petrov-Galerkin (DPG) method. The DPG methodology can be viewed as a means to solve an overdetermined discretization of a boundary value problem. In the same vein, the DPG$^*$ methodology is a means to solve an underdetermined discretization. These two viewpoints are developed by embedding the same operator equation into two different saddle-point problems. The analyses of the two problems have many common elements. Comparison to other methods in the literature round out the newly garnered perspective. Notably, DPG$^*$ and DPG methods can be seen as generalizations of $\mathcal{L}\mathcal{L}^\ast$ and least-squares methods, respectively. A priori error analysis and a posteriori error control for the DPG$^*$ method are considered in detail. Reports of several numerical experiments are provided which demonstrate the essential features of the new method. A notable difference between the results from the DPG$^*$ and DPG analyses is that the convergence rates of the former are limited by the regularity of an extraneous Lagrange multiplier variable.
△ Less
Submitted 10 September, 2018;
originally announced September 2018.
-
On perfectly matched layers for discontinuous Petrov-Galerkin methods
Authors:
Ali Vaziri Astaneh,
Brendan Keith,
Leszek Demkowicz
Abstract:
In this article, several discontinuous Petrov-Galerkin (DPG) methods with perfectly matched layers (PMLs) are derived along with their quasi-optimal graph test norms. Ultimately, two different complex coordinate stretching strategies are considered in these derivations. Unlike with classical formulations used by Bubnov-Galerkin methods, with so-called ultraweak variational formulations, these two…
▽ More
In this article, several discontinuous Petrov-Galerkin (DPG) methods with perfectly matched layers (PMLs) are derived along with their quasi-optimal graph test norms. Ultimately, two different complex coordinate stretching strategies are considered in these derivations. Unlike with classical formulations used by Bubnov-Galerkin methods, with so-called ultraweak variational formulations, these two strategies in fact deliver different formulations in the PML region. One of the strategies, which is argued to be more physically natural, is employed for numerically solving two- and three-dimensional time-harmonic acoustic, elastic, and electromagnetic wave propagation problems, defined in unbounded domains. Through these numerical experiments, efficacy of the new DPG methods with PMLs is verified.
△ Less
Submitted 9 April, 2018;
originally announced April 2018.
-
Goal-oriented adaptive mesh refinement for non-symmetric functional settings
Authors:
Brendan Keith,
Ali Vaziri Astaneh,
Leszek Demkowicz
Abstract:
In this article, a new unified duality theory is developed for Petrov-Galerkin finite element methods. This novel theory is then used to motivate goal-oriented adaptive mesh refinement strategies for use with discontinuous Petrov-Galerkin (DPG) methods. The focus of this article is mainly on broken ultraweak variational formulations of stationary boundary value problems, however, many of the ideas…
▽ More
In this article, a new unified duality theory is developed for Petrov-Galerkin finite element methods. This novel theory is then used to motivate goal-oriented adaptive mesh refinement strategies for use with discontinuous Petrov-Galerkin (DPG) methods. The focus of this article is mainly on broken ultraweak variational formulations of stationary boundary value problems, however, many of the ideas presented within are general enough that they be extended to any such well-posed variational formulation. The proposed goal-oriented adaptive mesh refinement procedures require the construction of refinement indicators for both a primal problem and a dual problem. In the DPG context, the primal problem is simply the system of linear equations coming from a standard DPG method and the dual problem is a similar system of equations, coming from a new method which is dual to DPG. This new method has the same coefficient matrix as the associated DPG method but has a different load. We refer to this new finite element method as a DPG* method. A thorough analysis of DPG* methods, as stand-alone finite element methods, is not given here but will be provided in subsequent articles. For DPG methods, the current theory of a posteriori error estimation is reviewed and the reliability estimate in [13, Theorem 2.1] is improved on. For DPG* methods, three different classes of refinement indicators are derived and several contributions are made towards rigorous a posteriori error estimation. At the closure of the article, results of numerical experiments with Poisson's boundary value problem in a three-dimensional domain are provided. These results clearly demonstrate the utility of the goal-oriented adaptive mesh refinement strategies for quantities of interest with either interior or boundary terms.
△ Less
Submitted 24 April, 2018; v1 submitted 6 November, 2017;
originally announced November 2017.
-
DPG* Method
Authors:
Brendan Keith,
Leszek Demkowicz,
Jay Gopalakrishnan
Abstract:
We introduce a cousin of the DPG method - the DPG* method - discuss their relationship and compare the two methods through numerical experiments.
We introduce a cousin of the DPG method - the DPG* method - discuss their relationship and compare the two methods through numerical experiments.
△ Less
Submitted 14 October, 2017;
originally announced October 2017.
-
Discrete least-squares finite element methods
Authors:
Brendan Keith,
Socratis Petrides,
Federico Fuentes,
Leszek Demkowicz
Abstract:
A finite element methodology for large classes of variational boundary value problems is defined which involves discretizing two linear operators: (1) the differential operator defining the spatial boundary value problem; and (2) a Riesz map on the test space. The resulting linear system is overdetermined. Two different approaches for solving the system are suggested (although others are discussed…
▽ More
A finite element methodology for large classes of variational boundary value problems is defined which involves discretizing two linear operators: (1) the differential operator defining the spatial boundary value problem; and (2) a Riesz map on the test space. The resulting linear system is overdetermined. Two different approaches for solving the system are suggested (although others are discussed): (1) solving the associated normal equation with linear solvers for symmetric positive-definite systems (e.g. Cholesky factorization); and (2) solving the overdetermined system with orthogonalization algorithms (e.g. QR factorization). The finite element assembly algorithm for each of these approaches is described in detail. The normal equation approach is usually faster for direct solvers and requires less storage. The second approach reduces the condition number of the system by a power of two and is less sensitive to round-off error. The rectangular stiffness matrix of second approach is demonstrated to have condition number $\mathcal{O}(h^{-1})$ for a variety of formulations of Poisson's equation. The stiffness matrix from the normal equation approach is demonstrated to be related to the monolithic stiffness matrices of least-squares finite element methods and it is proved that the two are identical in some cases. An example with Poisson's equation indicates that the solutions of these two different linear systems can be nearly indistinguishable (if round-off error is not an issue) and rapidly converge to each other. The orthogonalization approach is suggested to be beneficial for problems which induce poorly conditioned linear systems. Experiments with Poisson's equation in single-precision arithmetic as well as the linear acoustics problem near resonance in double-precision arithmetic verify this conclusion.
△ Less
Submitted 4 May, 2017;
originally announced May 2017.
-
An ultraweak DPG method for viscoelastic fluids
Authors:
Brendan Keith,
Philipp Knechtges,
Nathan V. Roberts,
Stefanie Elgeti,
Marek Behr,
Leszek Demkowicz
Abstract:
We explore a vexing benchmark problem for viscoelastic fluid flows with the discontinuous Petrov-Galerkin (DPG) finite element method of Demkowicz and Gopalakrishnan [1,2]. In our analysis, we develop an intrinsic a posteriori error indicator which we use for adaptive mesh generation. The DPG method is useful for the problem we consider because the method is inherently stable---requiring no stabil…
▽ More
We explore a vexing benchmark problem for viscoelastic fluid flows with the discontinuous Petrov-Galerkin (DPG) finite element method of Demkowicz and Gopalakrishnan [1,2]. In our analysis, we develop an intrinsic a posteriori error indicator which we use for adaptive mesh generation. The DPG method is useful for the problem we consider because the method is inherently stable---requiring no stabilization of the linearized discretization in order to handle the advective terms in the model. Because stabilization is a pressing issue in these models, this happens to become a very useful property of the method which simplifies our analysis. This built-in stability at all length scales and the a posteriori error indicator additionally allows for the generation of parameter-specific meshes starting from a common coarse initial mesh. A DPG discretization always produces a symmetric positive definite stiffness matrix. This feature allows us to use the most efficient direct solvers for all of our computations. We use the Camellia finite element software package [3,4] for all of our analysis.
△ Less
Submitted 26 June, 2017; v1 submitted 9 December, 2016;
originally announced December 2016.
-
Coupled variational formulations of linear elasticity and the DPG methodology
Authors:
Federico Fuentes,
Brendan Keith,
Leszek Demkowicz,
Patrick Le Tallec
Abstract:
This article presents a general approach akin to domain-decomposition methods to solve a single linear PDE, but where each subdomain of a partitioned domain is associated to a distinct variational formulation coming from a mutually well-posed family of broken variational formulations of the original PDE. It can be exploited to solve challenging problems in a variety of physical scenarios where sta…
▽ More
This article presents a general approach akin to domain-decomposition methods to solve a single linear PDE, but where each subdomain of a partitioned domain is associated to a distinct variational formulation coming from a mutually well-posed family of broken variational formulations of the original PDE. It can be exploited to solve challenging problems in a variety of physical scenarios where stability or a particular mode of convergence is desired in a part of the domain. The linear elasticity equations are solved in this work, but the approach can be applied to other equations as well. The broken variational formulations, which are essentially extensions of more standard formulations, are characterized by the presence of mesh-dependent broken test spaces and interface trial variables at the boundaries of the elements of the mesh. This allows necessary information to be naturally transmitted between adjacent subdomains, resulting in coupled variational formulations which are then proved to be globally well-posed. They are solved numerically using the DPG methodology, which is especially crafted to produce stable discretizations of broken formulations. Finally, expected convergence rates are verified in two different and illustrative examples.
△ Less
Submitted 25 May, 2017; v1 submitted 26 September, 2016;
originally announced September 2016.
-
The DPG methodology applied to different variational formulations of linear elasticity
Authors:
Brendan Keith,
Federico Fuentes,
Leszek Demkowicz
Abstract:
The flexibility of the DPG methodology is exposed by solving the linear elasticity equations under different variational formulations, including some with non-symmetric functional settings (different infinite-dimensional trial and test spaces). The family of formulations presented are proved to be mutually ill or well-posed when using traditional energy spaces on the whole domain. Moreover, they a…
▽ More
The flexibility of the DPG methodology is exposed by solving the linear elasticity equations under different variational formulations, including some with non-symmetric functional settings (different infinite-dimensional trial and test spaces). The family of formulations presented are proved to be mutually ill or well-posed when using traditional energy spaces on the whole domain. Moreover, they are shown to remain well-posed when using broken energy spaces and interface variables. Four variational formulations are solved in 3D using the DPG methodology. Numerical evidence is given for both smooth and singular solutions and the expected convergence rates are observed.
△ Less
Submitted 27 May, 2016; v1 submitted 28 January, 2016;
originally announced January 2016.
-
Orientation Embedded High Order Shape Functions for the Exact Sequence Elements of All Shapes
Authors:
Federico Fuentes,
Brendan Keith,
Leszek Demkowicz,
Sriram Nagaraj
Abstract:
A unified construction of high order shape functions is given for all four classical energy spaces ($H^1$, $H(\mathrm{curl})$, $H(\mathrm{div})$ and $L^2$) and for elements of "all" shapes (segment, quadrilateral, triangle, hexahedron, tetrahedron, triangular prism and pyramid). The discrete spaces spanned by the shape functions satisfy the commuting exact sequence property for each element. The s…
▽ More
A unified construction of high order shape functions is given for all four classical energy spaces ($H^1$, $H(\mathrm{curl})$, $H(\mathrm{div})$ and $L^2$) and for elements of "all" shapes (segment, quadrilateral, triangle, hexahedron, tetrahedron, triangular prism and pyramid). The discrete spaces spanned by the shape functions satisfy the commuting exact sequence property for each element. The shape functions are conforming, hierarchical and compatible with other neighboring elements across shared boundaries so they may be used in hybrid meshes. Expressions for the shape functions are given in coordinate free format in terms of the relevant affine coordinates of each element shape. The polynomial order is allowed to differ for each separate topological entity (vertex, edge, face or interior) in the mesh, so the shape functions can be used to implement local $p$ adaptive finite element methods. Each topological entity may have its own orientation, and the shape functions can have that orientation embedded by a simple permutation of arguments.
△ Less
Submitted 19 May, 2015; v1 submitted 12 April, 2015;
originally announced April 2015.
-
Phonon Density of States of LaFeAsO1-xFx
Authors:
A. D. Christianson,
M. D. Lumsden,
O. Delaire,
M. B. Stone,
D. L. Abernathy,
M. A. McGuire,
A. S. Sefat,
R. **,
B. C. Sales,
D. Mandrus,
E. D. Mun,
P. C. Canfield,
J. Y. Y. Lin,
M. Lucas,
M. Kresch,
J. B. Keith,
B. Fultz,
E. A. Goremychkin,
R. J. McQueeney
Abstract:
We have studied the phonon density of states (PDOS) in LaFeAsO1-xFx with inelastic neutron scattering methods. The PDOS of the parent compound(x=0) is very similar to the PDOS of samples optimally doped with fluorine to achieve the maximum Tc (x~0.1). Good agreement is found between the experimental PDOS and first-principle calculations with the exception of a small difference in Fe mode frequen…
▽ More
We have studied the phonon density of states (PDOS) in LaFeAsO1-xFx with inelastic neutron scattering methods. The PDOS of the parent compound(x=0) is very similar to the PDOS of samples optimally doped with fluorine to achieve the maximum Tc (x~0.1). Good agreement is found between the experimental PDOS and first-principle calculations with the exception of a small difference in Fe mode frequencies. The PDOS reported here is not consistent with conventional electron-phonon mediated superconductivity.
△ Less
Submitted 22 July, 2008;
originally announced July 2008.
-
The Bianchi identity and weak gravitational lensing
Authors:
Thomas P. Kling,
Brian Keith
Abstract:
We consider the Bianchi identity as a field equation for the distortion of the shapes of images produced by weak gravitational lensing. Using the spin coefficient formalism of Newman and Penrose [1962], we show that certain complex components of the Weyl and Ricci curvature tensors are directly related to fundamental observables in weak gravitational lensing. In the case of weak gravitational fi…
▽ More
We consider the Bianchi identity as a field equation for the distortion of the shapes of images produced by weak gravitational lensing. Using the spin coefficient formalism of Newman and Penrose [1962], we show that certain complex components of the Weyl and Ricci curvature tensors are directly related to fundamental observables in weak gravitational lensing. In the case of weak gravitational fields, we then show that the Bianchi identity provides a field equation for the Ricci tensor assuming a known Weyl tensor. From the Bianchi identity, we derive the integral equation for weak lensing presented by Miralda-Escude [1996], thus making the Bianchi identity a first principles equation of weak gravitational lensing. This equation is integrated in the important case of an axially symmetric lens and explicitly demonstrated in the case of a point lens and a SIS model.
△ Less
Submitted 23 June, 2005;
originally announced June 2005.