-
Considerations in the use of ML interaction potentials for free energy calculations
Authors:
Orlando A. Mendible,
Jonathan K. Whitmer,
Yamil J. Colón
Abstract:
Machine learning potentials (MLPs) offer the potential to accurately model the energy and free energy landscapes of molecules with the precision of quantum mechanics and an efficiency similar to classical simulations. This research focuses on using equivariant graph neural networks MLPs due to their proven effectiveness in modeling equilibrium molecular trajectories. A key issue addressed is the c…
▽ More
Machine learning potentials (MLPs) offer the potential to accurately model the energy and free energy landscapes of molecules with the precision of quantum mechanics and an efficiency similar to classical simulations. This research focuses on using equivariant graph neural networks MLPs due to their proven effectiveness in modeling equilibrium molecular trajectories. A key issue addressed is the capability of MLPs to accurately predict free energies and transition states by considering both the energy and the diversity of molecular configurations. We examined how the distribution of collective variables (CVs) in the training data affects MLP accuracy in determining the free energy surface (FES) of systems, using Metadynamics simulations for butane and alanine dipeptide (ADP). The study involved training forty-three MLPs, half based on classical molecular dynamics data and the rest on ab initio computed energies. The MLPs were trained using different distributions that aim to replicate hypothetical scenarios of sampled CVs obtained if the underlying FES of the system was unknown. Findings for butane revealed that training data coverage of key FES regions ensures model accuracy regardless of CV distribution. However, missing significant FES regions led to correct potential energy predictions but failed free energy reconstruction. For ADP, models trained on classical dynamics data were notably less accurate, while ab initio-based MLPs predicted potential energy well but faltered on free energy predictions. These results emphasize the challenge of assembling an all-encompassing training set for accurate FES prediction and highlight the importance of understanding the FES in preparing training data. The study points out the limitations of MLPs in free energy calculations, stressing the need for comprehensive data that encompasses the system's full FES for effective model training.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
Data science and social justice in the mathematics community
Authors:
Quindel Jones,
Andrés R. Vindas Meléndez,
Ariana Mendible,
Manuchehr Aminian,
Heather Z. Brooks,
Nathan Alexander,
Carrie Diaz Eaton,
Philip Chodrow
Abstract:
Data science for social justice (DS4SJ) is data-scientific work that supports the liberation of oppressed and marginalized people. By nature, this work lies at the intersection of technical scholarship and activist practice. We discuss this growing efforts in DS4SJ within the broad mathematics community. We begin by defining terms and offering a series of guiding principles for engaging in critica…
▽ More
Data science for social justice (DS4SJ) is data-scientific work that supports the liberation of oppressed and marginalized people. By nature, this work lies at the intersection of technical scholarship and activist practice. We discuss this growing efforts in DS4SJ within the broad mathematics community. We begin by defining terms and offering a series of guiding principles for engaging in critical data science work, providing examples of how these principles play out in practice. We then highlight the roles that DS4SJ can play in the scholarship and pedagogy of practicing mathematicians. We focus in particular on the engagement of early-career mathematicians in DS4SJ, which we illustrate through a series of four personal vignettes. While the primary aim of DS4SJ is to achieve impact for marginalized communities, we also argue that engagement with DS4SJ can benefit the entire mathematical ecosystem, including researchers, instructors, students, departments, institutes, and professional societies. We close with reflections on how these various actors can support ongoing efforts in data science for social justice.
△ Less
Submitted 14 March, 2023;
originally announced March 2023.
-
Data-driven Modeling of Two-Dimensional Detonation Wave Fronts
Authors:
Ariana Mendible,
Weston Lowrie,
Steven L. Brunton,
J. Nathan Kutz
Abstract:
Historical experimental testing of high-altitude nuclear explosions (HANEs) are known to cause severe and detrimental effects to radio frequency signals and communications infrastructure. In order to study and predict the impact of HANEs, tractable computational approaches are required to model the complex physical processes involved in the detonation wave physics. Modern reduced-order models (ROM…
▽ More
Historical experimental testing of high-altitude nuclear explosions (HANEs) are known to cause severe and detrimental effects to radio frequency signals and communications infrastructure. In order to study and predict the impact of HANEs, tractable computational approaches are required to model the complex physical processes involved in the detonation wave physics. Modern reduced-order models (ROMs) can enable long-time and many-parameter simulations with minimal computational cost. However, translational and scale invariances inherent to this type of wave propagation problem are known to limit traditional ROM approaches. Specifically, dimensionality reduction methods are typically ineffective in producing low-rank models when invariances are present in the data. In this work, an unsupervised machine learning method is used to discover coordinate systems that make such invariances amenable to traditional dimensionality reduction methods. The method, which has previously been demonstrated on one-dimensional translations, is extended to higher dimensions and additional invariances. A surrogate HANE system, i.e. a HANE-ROM, with one detonation wave is captured well at extremely low-rank. Two detonation-waves are also considered with various amounts of interaction between the waves, with improvements to low-rank models for multiple wave quantities with limited interaction.
△ Less
Submitted 30 June, 2021;
originally announced June 2021.
-
Data-driven Modeling of Rotating Detonation Waves
Authors:
Ariana Mendible,
James Koch,
Henning Lange,
Steven L. Brunton,
J. Nathan Kutz
Abstract:
The direct monitoring of a rotating detonation engine (RDE) combustion chamber has enabled the observation of combustion front dynamics that are composed of a number of co- and/or counter-rotating coherent traveling shock waves whose nonlinear mode-locking behavior exhibit bifurcations and instabilities which are not well understood. Computational fluid dynamics simulations are ubiquitous in chara…
▽ More
The direct monitoring of a rotating detonation engine (RDE) combustion chamber has enabled the observation of combustion front dynamics that are composed of a number of co- and/or counter-rotating coherent traveling shock waves whose nonlinear mode-locking behavior exhibit bifurcations and instabilities which are not well understood. Computational fluid dynamics simulations are ubiquitous in characterizing the dynamics of RDE's reactive, compressible flow. Such simulations are prohibitively expensive when considering multiple engine geometries, different operating conditions, and the long-time dynamics of the mode-locking interactions. Reduced-order models (ROMs) provide a critically enabling simulation framework because they exploit low-rank structure in the data to minimize computational cost and allow for rapid parameterized studies and long-time simulations. However, ROMs are inherently limited by translational invariances manifest by the combustion waves present in RDEs. In this work, we leverage machine learning algorithms to discover moving coordinate frames into which the data is shifted, thus overcoming limitations imposed by the underlying translational invariance of the RDE and allowing for the application of traditional dimensionality reduction techniques. We explore a diverse suite of data-driven ROM strategies for characterizing the complex shock wave dynamics and interactions in the RDE. Specifically, we employ the dynamic mode decomposition and a deep Koopman embedding to give new modeling insights and understanding of combustion wave interactions in RDEs.
△ Less
Submitted 12 August, 2020;
originally announced August 2020.
-
Dimensionality Reduction and Reduced Order Modeling for Traveling Wave Physics
Authors:
Ariana Mendible,
Steven L. Brunton,
Aleksandr Y. Aravkin,
Wes Lowrie,
J. Nathan Kutz
Abstract:
We develop an unsupervised machine learning algorithm for the automated discovery and identification of traveling waves in spatio-temporal systems governed by partial differential equations (PDEs). Our method uses sparse regression and subspace clustering to robustly identify translational invariances that can be leveraged to build improved reduced order models (ROMs). Invariances, whether transla…
▽ More
We develop an unsupervised machine learning algorithm for the automated discovery and identification of traveling waves in spatio-temporal systems governed by partial differential equations (PDEs). Our method uses sparse regression and subspace clustering to robustly identify translational invariances that can be leveraged to build improved reduced order models (ROMs). Invariances, whether translational or rotational, are well known to compromise the ability of ROMs to produce accurate and/or low-rank representations of the spatio-temporal dynamics. However, by discovering translations in a principled way, data can be shifted into a coordinate systems where quality, low-dimensional ROMs can be constructed. This approach can be used on either numerical or experimental data with or without knowledge of the governing equations. We demonstrate our method on a variety of PDEs of increasing difficulty, taken from the field of fluid dynamics, showing the efficacy and robustness of the proposed approach.
△ Less
Submitted 18 May, 2020; v1 submitted 1 November, 2019;
originally announced November 2019.
-
Randomized Nonnegative Matrix Factorization
Authors:
N. Benjamin Erichson,
Ariana Mendible,
Sophie Wihlborn,
J. Nathan Kutz
Abstract:
Nonnegative matrix factorization (NMF) is a powerful tool for data mining. However, the emergence of `big data' has severely challenged our ability to compute this fundamental decomposition using deterministic algorithms. This paper presents a randomized hierarchical alternating least squares (HALS) algorithm to compute the NMF. By deriving a smaller matrix from the nonnegative input data, a more…
▽ More
Nonnegative matrix factorization (NMF) is a powerful tool for data mining. However, the emergence of `big data' has severely challenged our ability to compute this fundamental decomposition using deterministic algorithms. This paper presents a randomized hierarchical alternating least squares (HALS) algorithm to compute the NMF. By deriving a smaller matrix from the nonnegative input data, a more efficient nonnegative decomposition can be computed. Our algorithm scales to big data applications while attaining a near-optimal factorization. The proposed algorithm is evaluated using synthetic and real world data and shows substantial speedups compared to deterministic HALS.
△ Less
Submitted 30 April, 2018; v1 submitted 6 November, 2017;
originally announced November 2017.