-
Benchmarking sparse system identification with low-dimensional chaos
Authors:
Alan A. Kaptanoglu,
Lanyue Zhang,
Zachary G. Nicolaou,
Urban Fasel,
Steven L. Brunton
Abstract:
Sparse system identification is the data-driven process of obtaining parsimonious differential equations that describe the evolution of a dynamical system, balancing model complexity and accuracy. There has been rapid innovation in system identification across scientific domains, but there remains a gap in the literature for large-scale methodological comparisons that are evaluated on a variety of…
▽ More
Sparse system identification is the data-driven process of obtaining parsimonious differential equations that describe the evolution of a dynamical system, balancing model complexity and accuracy. There has been rapid innovation in system identification across scientific domains, but there remains a gap in the literature for large-scale methodological comparisons that are evaluated on a variety of dynamical systems. In this work, we systematically benchmark sparse regression variants by utilizing the dysts standardized database of chaotic systems. In particular, we demonstrate how this open-source tool can be used to quantitatively compare different methods of system identification. To illustrate how this benchmark can be utilized, we perform a large comparison of four algorithms for solving the sparse identification of nonlinear dynamics (SINDy) optimization problem, finding strong performance of the original algorithm and a recent mixed-integer discrete algorithm. In all cases, we used ensembling to improve the noise robustness of SINDy and provide statistical comparisons. In addition, we show very compelling evidence that the weak SINDy formulation provides significant improvements over the traditional method, even on clean data. Lastly, we investigate how Pareto-optimal models generated from SINDy algorithms depend on the properties of the equations, finding that the performance shows no significant dependence on a set of dynamical properties that quantify the amount of chaos, scale separation, degree of nonlinearity, and the syntactic complexity.
△ Less
Submitted 4 February, 2023;
originally announced February 2023.
-
PySINDy: A comprehensive Python package for robust sparse system identification
Authors:
Alan A. Kaptanoglu,
Brian M. de Silva,
Urban Fasel,
Kadierdan Kaheman,
Andy J. Goldschmidt,
Jared L. Callaham,
Charles B. Delahunt,
Zachary G. Nicolaou,
Kathleen Champion,
Jean-Christophe Loiseau,
J. Nathan Kutz,
Steven L. Brunton
Abstract:
Automated data-driven modeling, the process of directly discovering the governing equations of a system from data, is increasingly being used across the scientific community. PySINDy is a Python package that provides tools for applying the sparse identification of nonlinear dynamics (SINDy) approach to data-driven model discovery. In this major update to PySINDy, we implement several advanced feat…
▽ More
Automated data-driven modeling, the process of directly discovering the governing equations of a system from data, is increasingly being used across the scientific community. PySINDy is a Python package that provides tools for applying the sparse identification of nonlinear dynamics (SINDy) approach to data-driven model discovery. In this major update to PySINDy, we implement several advanced features that enable the discovery of more general differential equations from noisy and limited data. The library of candidate terms is extended for the identification of actuated systems, partial differential equations (PDEs), and implicit differential equations. Robust formulations, including the integral form of SINDy and ensembling techniques, are also implemented to improve performance for real-world data. Finally, we provide a range of new optimization algorithms, including several sparse regression techniques and algorithms to enforce and promote inequality constraints and stability. Together, these updates enable entirely new SINDy model discovery capabilities that have not been reported in the literature, such as constrained PDE identification and ensembling with different sparse regression optimizers.
△ Less
Submitted 25 January, 2022; v1 submitted 12 November, 2021;
originally announced November 2021.
-
Heterogeneity-stabilized homogeneous states in driven media
Authors:
Zachary G. Nicolaou,
Daniel J. Case,
Ernest B. van der Wee,
Michelle M. Driscoll,
Adilson E. Motter
Abstract:
Understanding the relationship between symmetry breaking, system properties, and instabilities has been a problem of longstanding scientific interest. Symmetry-breaking instabilities underlie the formation of important patterns in driven systems, but there are many instances in which such instabilities are undesirable. Using parametric resonance as a model process, here we show that a range of sta…
▽ More
Understanding the relationship between symmetry breaking, system properties, and instabilities has been a problem of longstanding scientific interest. Symmetry-breaking instabilities underlie the formation of important patterns in driven systems, but there are many instances in which such instabilities are undesirable. Using parametric resonance as a model process, here we show that a range of states that would be destabilized by symmetry-breaking instabilities can be preserved and stabilized by the introduction of suitable system asymmetry. Because symmetric states are spatially homogeneous and asymmetric systems are spatially heterogeneous, we refer to this effect as heterogeneity-stabilized homogeneity. We illustrate this effect theoretically using driven pendulum array models and demonstrate it experimentally using Faraday wave instabilities. Our results have potential implications for the mitigation of instabilities in engineered systems and the emergence of homogeneous states in natural systems with inherent heterogeneities.
△ Less
Submitted 2 August, 2021;
originally announced August 2021.
-
Missing links as a source of seemingly variable constants in complex reaction networks
Authors:
Zachary G. Nicolaou,
Adilson E. Motter
Abstract:
A major challenge in network science is to determine parameters governing complex network dynamics from experimental observations and theoretical models. In complex chemical reaction networks, for example, such as those describing processes in internal combustion engines and power generators, rate constant estimates vary significantly across studies despite substantial experimental efforts. Here,…
▽ More
A major challenge in network science is to determine parameters governing complex network dynamics from experimental observations and theoretical models. In complex chemical reaction networks, for example, such as those describing processes in internal combustion engines and power generators, rate constant estimates vary significantly across studies despite substantial experimental efforts. Here, we examine the possibility that variability in measured constants can be largely attributed to the impact of missing network information on parameter estimation. Through the numerical simulation of measurements in incomplete chemical reaction networks, we show that unaccountability of network links presumed unimportant (with local sensitivity amounting to less than two percent of that of a measured link) can create apparent rate constant variations as large as one order of magnitude even if no experimental errors are present in the data. Furthermore, the correlation coefficient between the logarithmic deviation of the rate constant estimate and the cumulative relative sensitivity of the neglected reactions was less than $0.5$ in all cases. Thus, for dynamical processes on complex networks, iteratively expanding a model by determining new parameters from data collected under specific conditions is unlikely to produce reliable results.
△ Less
Submitted 28 October, 2020; v1 submitted 16 September, 2020;
originally announced September 2020.
-
Non-normality and non-monotonic dynamics in complex reaction networks
Authors:
Zachary G. Nicolaou,
Takashi Nishikawa,
Schuyler B. Nicholson,
Jason R. Green,
Adilson E. Motter
Abstract:
Complex chemical reaction networks, which underlie many industrial and biological processes, often exhibit non-monotonic changes in chemical species concentrations, typically described using nonlinear models. Such non-monotonic dynamics are in principle possible even in linear models if the matrices defining the models are non-normal, as characterized by a necessarily non-orthogonal set of eigenve…
▽ More
Complex chemical reaction networks, which underlie many industrial and biological processes, often exhibit non-monotonic changes in chemical species concentrations, typically described using nonlinear models. Such non-monotonic dynamics are in principle possible even in linear models if the matrices defining the models are non-normal, as characterized by a necessarily non-orthogonal set of eigenvectors. However, the extent to which non-normality is responsible for non-monotonic behavior remains an open question. Here, using a master equation to model the reaction dynamics, we derive a general condition for observing non-monotonic dynamics of individual species, establishing that non-normality promotes non-monotonicity but is not a requirement for it. In contrast, we show that non-normality is a requirement for non-monotonic dynamics to be observed in the Rényi entropy. Using hydrogen combustion as an example application, we demonstrate that non-monotonic dynamics under experimental conditions are supported by a linear chain of connected components, in contrast with the dominance of a single giant component observed in typical random reaction networks. The exact linearity of the master equation enables development of rigorous theory and simulations for dynamical networks of unprecedented size (approaching $10^5$ dynamical variables, even for a network of only 20 reactions and involving less than 100 atoms). Our conclusions are expected to hold for other combustion processes, and the general theory we develop is applicable to all chemical reaction networks, including biological ones.
△ Less
Submitted 21 August, 2020;
originally announced August 2020.