-
Neural Network Representations of Multiphase Equations of State
Authors:
George A. Kevrekidis,
Daniel A. Serino,
Alexander Kaltenborn,
J. Tinka Gammel,
Joshua W. Burby,
Marc L. Klasky
Abstract:
Equations of State model relations between thermodynamic variables and are ubiquitous in scientific modelling, appearing in modern day applications ranging from Astrophysics to Climate Science. The three desired properties of a general Equation of State model are adherence to the Laws of Thermodynamics, incorporation of phase transitions, and multiscale accuracy. Analytic models that adhere to all three are hard to develop and cumbersome to work with, often resulting in sacrificing one of these elements for the sake of efficiency. In this work, two deep-learning methods are proposed that provably satisfy the first and second conditions on a large-enough region of thermodynamic variable space. The first is based on learning the generating function (thermodynamic potential) while the second is based on structure-preserving, symplectic neural networks, respectively allowing modifications near or on phase transition regions. They can be used either "from scratch" to learn a full Equation of State, or in conjunction with a pre-existing consistent model, functioning as a modification that better adheres to experimental data. We formulate the theory and provide several computational examples to justify both approaches, and highlight their advantages and shortcomings.
Submitted 1 July, 2024; v1 submitted 28 June, 2024;
originally announced June 2024.
-
Practical identifiability and parameter estimation of compartmental epidemiological models
Authors:
Q. Y. Chen,
Z. Rapti,
Y. Drossinos,
J. Cuevas-Maraver,
G. A. Kevrekidis,
P. G. Kevrekidis
Abstract:
Practical parameter identifiability in ODE-based epidemiological models is a known issue, yet one that merits further study. It is essentially ubiquitous due to noise and errors in real data. In this study, to avoid uncertainty stemming from data of unknown quality, simulated data with added noise are used to investigate practical identifiability in two distinct epidemiological models. Particular emphasis is placed on the role of initial conditions, which are assumed unknown, except those that are directly measured. Instead of just focusing on one method of estimation, we use and compare results from various broadly used methods, including maximum likelihood and Markov Chain Monte Carlo (MCMC) estimation.
Among other findings, our analysis revealed that the MCMC estimator is overall more robust than the point estimators considered. Its estimates and predictions are improved when the initial conditions of certain compartments are fixed so that the model becomes globally identifiable. For the point estimators, whether fixing or fitting the initial conditions that are not directly measured improves parameter estimates is model-dependent. Specifically, in the standard SEIR model, fixing the initial condition for the susceptible population S(0) improved parameter estimates, while this was not true when fixing the initial condition of the asymptomatic population in a more involved model. Our study corroborates the change in quality of parameter estimates upon usage of pre-peak or post-peak time-series under consideration. Finally, our examples suggest that in the presence of significantly noisy data, the value of structural identifiability is moot.
Submitted 25 June, 2024;
originally announced June 2024.
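The simulated-data setup described in the abstract above can be illustrated with a minimal Python sketch: it integrates a standard SEIR model and adds multiplicative Gaussian noise to the infected compartment. The parameter values, the noise model, and the function names (`seir_rhs`, `simulate_seir`, `add_noise`) are assumptions chosen for illustration and are not taken from the paper.

```python
import random


def seir_rhs(state, beta, sigma, gamma):
    """Right-hand side of the standard SEIR ODEs (no births or deaths)."""
    s, e, i, r = state
    n = s + e + i + r
    new_exposed = beta * s * i / n
    return (-new_exposed,
            new_exposed - sigma * e,
            sigma * e - gamma * i,
            gamma * i)


def _shift(state, slope, h):
    """Return state + h * slope, componentwise."""
    return tuple(x + h * dx for x, dx in zip(state, slope))


def simulate_seir(state0, beta, sigma, gamma, days, steps_per_day=20):
    """Integrate with classical RK4 and return one state per day."""
    h = 1.0 / steps_per_day
    state = tuple(float(x) for x in state0)
    daily = [state]
    for _ in range(days):
        for _ in range(steps_per_day):
            k1 = seir_rhs(state, beta, sigma, gamma)
            k2 = seir_rhs(_shift(state, k1, h / 2), beta, sigma, gamma)
            k3 = seir_rhs(_shift(state, k2, h / 2), beta, sigma, gamma)
            k4 = seir_rhs(_shift(state, k3, h), beta, sigma, gamma)
            state = tuple(
                x + h / 6.0 * (a + 2 * b + 2 * c + d)
                for x, a, b, c, d in zip(state, k1, k2, k3, k4)
            )
        daily.append(state)
    return daily


def add_noise(series, rel_sd, seed=0):
    """Multiplicative Gaussian noise, clipped at zero (an assumed noise model)."""
    rng = random.Random(seed)
    return [max(0.0, x * (1.0 + rng.gauss(0.0, rel_sd))) for x in series]


# Hypothetical parameters: S(0)=9990, E(0)=0, I(0)=10, R(0)=0.
trajectory = simulate_seir((9990.0, 0.0, 10.0, 0.0),
                           beta=0.5, sigma=0.2, gamma=0.1, days=100)
noisy_infected = add_noise([state[2] for state in trajectory], rel_sd=0.1)
```

An estimator (maximum likelihood, MCMC, or another method) could then be fitted to `noisy_infected`, with S(0) either fixed or treated as unknown, to probe practical identifiability in the sense discussed above.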
-
Machine Learning for the identification of phase-transitions in interacting agent-based systems
Authors:
Nikolaos Evangelou,
Dimitrios G. Giovanis,
George A. Kevrekidis,
Grigorios A. Pavliotis,
Ioannis G. Kevrekidis
Abstract:
Deriving closed-form, analytical expressions for reduced-order models, and judiciously choosing the closures leading to them, has long been the strategy of choice for studying phase- and noise-induced transitions for agent-based models (ABMs). In this paper, we propose a data-driven framework that pinpoints phase transitions for an ABM in its mean-field limit, using a smaller number of variables than traditional closed-form models. To this end, we use the manifold learning algorithm Diffusion Maps to identify a parsimonious set of data-driven latent variables, and show that they are in one-to-one correspondence with the expected theoretical order parameter of the ABM. We then utilize a deep learning framework to obtain a conformal reparametrization of the data-driven coordinates that facilitates, in our example, the identification of a single parameter-dependent ODE in these coordinates. We identify this ODE through a residual neural network inspired by a numerical integration scheme (forward Euler). We then use the identified ODE -- enabled through an odd symmetry transformation -- to construct the bifurcation diagram exhibiting the phase transition.
Submitted 29 October, 2023;
originally announced October 2023.
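The forward-Euler residual structure mentioned in the abstract above can be sketched in a few lines of Python. A residual block computes x_{n+1} = x_n + h f(x_n; lambda), which is one forward Euler step of dx/dt = f(x; lambda). In this sketch the learned network is replaced by a hand-written stand-in, the pitchfork normal form f(x; lambda) = lambda x - x^3, which is odd in x. That choice, the step size, and all function names are assumptions for illustration and are not the paper's identified ODE.

```python
def drift(x, lam):
    """Stand-in for the learned right-hand side (assumed pitchfork normal form)."""
    return lam * x - x ** 3


def euler_residual_step(x, lam, h):
    """One residual block: identity skip connection plus h times the drift."""
    return x + h * drift(x, lam)


def steady_state(x0, lam, h=0.01, steps=5000):
    """Iterate the residual block until the state settles on a stable branch."""
    x = x0
    for _ in range(steps):
        x = euler_residual_step(x, lam, h)
    return x


# Points on the stable branch of the bifurcation diagram for a few parameter values.
branch = [(lam, steady_state(0.5, lam)) for lam in (-1.0, -0.5, 0.5, 1.0)]
```

Because the stand-in drift is odd, starting from -0.5 instead of 0.5 traces the mirror-image branch, so only one side of the diagram needs to be computed.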
-
Vaccination compartmental epidemiological models for the delta and omicron SARS-CoV-2 variants
Authors:
J. Cuevas-Maraver,
P. G. Kevrekidis,
Q. Y. Chen,
G. A. Kevrekidis,
Y. Drossinos
Abstract:
We explore the inclusion of vaccination in compartmental epidemiological models concerning the delta and omicron variants of the SARS-CoV-2 virus that caused the COVID-19 pandemic. We expand on our earlier compartmental-model work by incorporating vaccinated populations. We present two classes of models that differ depending on the immunological properties of the variant. The first one is for the delta variant, where we do not follow the dynamics of the vaccinated individuals since infections of vaccinated individuals were rare. The second one for the far more contagious omicron variant incorporates the evolution of the infections within the vaccinated cohort. We explore comparisons with available data involving two possible classes of counts, fatalities and hospitalizations. We present our results for two regions, Andalusia and Switzerland (including the Principality of Liechtenstein), where the necessary data are available. In the majority of the considered cases, the models are found to yield good agreement with the data and have a reasonable predictive capability beyond their training window, rendering them potentially useful tools for the interpretation of the COVID-19 and further pandemic waves, and for the design of intervention strategies during these waves.
Submitted 20 April, 2023;
originally announced April 2023.
-
Towards fully covariant machine learning
Authors:
Soledad Villar,
David W. Hogg,
Weichi Yao,
George A. Kevrekidis,
Bernhard Schölkopf
Abstract:
Any representation of data involves arbitrary investigator choices. Because those choices are external to the data-generating process, each choice leads to an exact symmetry, corresponding to the group of transformations that takes one possible representation to another. These are the passive symmetries; they include coordinate freedom, gauge symmetry, and units covariance, all of which have led to important results in physics. In machine learning, the most visible passive symmetry is the relabeling or permutation symmetry of graphs. Our goal is to understand the implications for machine learning of the many passive symmetries in play. We discuss dos and don'ts for machine learning practice if passive symmetries are to be respected. We discuss links to causal modeling, and argue that the implementation of passive symmetries is particularly valuable when the goal of the learning problem is to generalize out of sample. This paper is conceptual: It translates among the languages of physics, mathematics, and machine-learning. We believe that consideration and implementation of passive symmetries might help machine learning in the same ways that it transformed physics in the twentieth century.
Submitted 28 June, 2023; v1 submitted 31 January, 2023;
originally announced January 2023.
-
MarkerMap: nonlinear marker selection for single-cell studies
Authors:
Nabeel Sarwar,
Wilson Gregory,
George A Kevrekidis,
Soledad Villar,
Bianca Dumitrascu
Abstract:
Single-cell RNA-seq data allow the quantification of cell type differences across a growing set of biological contexts. However, pinpointing a small subset of genomic features explaining this variability can be ill-defined and computationally intractable. Here we introduce MarkerMap, a generative model for selecting minimal gene sets which are maximally informative of cell type origin and enable whole transcriptome reconstruction. MarkerMap provides a scalable framework for both supervised marker selection, aimed at identifying specific cell type populations, and unsupervised marker selection, aimed at gene expression imputation and reconstruction. We benchmark MarkerMap's competitive performance against previously published approaches on real single cell gene expression data sets. MarkerMap is available as a pip installable package, as a community resource aimed at developing explainable machine learning techniques for enhancing interpretability in single-cell studies.
Submitted 28 July, 2022;
originally announced July 2022.
-
The role of mobility in the dynamics of the COVID-19 epidemic in Andalusia
Authors:
Z. Rapti,
J. Cuevas-Maraver,
E. Kontou,
S. Liu,
Y. Drossinos,
P. G. Kevrekidis,
G. A. Kevrekidis,
M. Barmann,
Q.-Y. Chen
Abstract:
Metapopulation models have been a popular tool for the study of epidemic spread over a network of highly populated nodes (cities, provinces, countries) and have been extensively used in the context of the ongoing COVID-19 pandemic. In the present work, we revisit such a model, bearing a particular case example in mind, namely that of the region of Andalusia in Spain during the period of the summer-fall of 2020 (i.e., between the first and second pandemic waves). Our aim is to consider the possibility of incorporation of mobility across the province nodes focusing on mobile-phone time dependent data, but also discussing the comparison for our case example with a gravity model, as well as with the dynamics in the absence of mobility. Our main finding is that mobility is key towards a quantitative understanding of the emergence of the second wave of the pandemic and that the most accurate way to capture it involves dynamic (rather than static) inclusion of time-dependent mobility matrices based on cell-phone data. Alternatives bearing no mobility are unable to capture the trends revealed by the data in the context of the metapopulation model considered herein.
Submitted 5 July, 2022;
originally announced July 2022.
-
Backcasting COVID-19: A Physics-Informed Estimate for Early Case Incidence
Authors:
G. A. Kevrekidis,
Z. Rapti,
Y. Drossinos,
P. G. Kevrekidis,
M. A. Barmann,
Q. Y. Chen,
J. Cuevas-Maraver
Abstract:
It is widely accepted that the number of reported cases during the first stages of the COVID-19 pandemic severely underestimates the number of actual cases. We leverage delay embedding theorems of Whitney and Takens and use Gaussian Process regression to estimate the number of cases during the first 2020 wave based on the second wave of the epidemic in several European countries, South Korea, and Brazil. We assume that the second wave was more accurately monitored and hence that it can be trusted. We then construct a manifold diffeomorphic to that of the implied original dynamical system, using fatalities or hospitalizations only. Finally, we restrict the diffeomorphism to the reported cases coordinate of the dynamical system. Our main finding is that in the European countries studied, the actual cases are under-reported by as much as 50\%. On the other hand, in South Korea -- which had an exemplary and proactive mitigation approach -- a far smaller discrepancy between the actual and reported cases is predicted, with an approximately 17\% predicted under-estimation. We believe that our backcasting framework is applicable to other epidemic outbreaks where (due to limited or poor quality data) there is uncertainty around the actual cases.
Submitted 30 January, 2022;
originally announced February 2022.
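The delay-embedding step referred to in the abstract above can be illustrated with a short Python sketch that turns a scalar time series (for example, daily fatalities) into delay-coordinate vectors. The function name `delay_embed` and the choice of embedding dimension and lag are assumptions for illustration; the paper's Gaussian Process regression on top of these coordinates is not reproduced here.

```python
def delay_embed(series, dim, lag=1):
    """Return delay vectors (x_t, x_{t+lag}, ..., x_{t+(dim-1)*lag})."""
    if dim < 1 or lag < 1:
        raise ValueError("dim and lag must be positive integers")
    span = (dim - 1) * lag  # distance between first and last coordinate
    return [
        tuple(series[t + j * lag] for j in range(dim))
        for t in range(len(series) - span)
    ]


# Toy series standing in for an observed quantity such as daily fatalities.
vectors = delay_embed([1, 2, 3, 4, 5], dim=3)
```

Each tuple in `vectors` is one point on the reconstructed manifold; a regression model mapping these points to reported cases could then be trained on a trusted period and evaluated on an earlier one.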
-
On the Parameter Combinations That Matter and on Those That do Not
Authors:
Nikolaos Evangelou,
Noah J. Wichrowski,
George A. Kevrekidis,
Felix Dietrich,
Mahdi Kooshkbaghi,
Sarah McFann,
Ioannis G. Kevrekidis
Abstract:
We present a data-driven approach to characterizing nonidentifiability of a model's parameters and illustrate it through dynamic as well as steady kinetic models. By employing Diffusion Maps and their extensions, we discover the minimal combinations of parameters required to characterize the output behavior of a chemical system: a set of effective parameters for the model. Furthermore, we introduce and use a Conformal Autoencoder Neural Network technique, as well as a kernel-based Jointly Smooth Function technique, to disentangle the redundant parameter combinations that do not affect the output behavior from the ones that do. We discuss the interpretability of our data-driven effective parameters, and demonstrate the utility of the approach both for behavior prediction and parameter estimation. In the latter task, it becomes important to describe level sets in parameter space that are consistent with a particular output behavior. We validate our approach on a model of multisite phosphorylation, where a reduced set of effective parameters (nonlinear combinations of the physical ones) has previously been established analytically.
Submitted 9 June, 2022; v1 submitted 13 October, 2021;
originally announced October 2021.
-
Estimation of the effective reproduction number for SARS-CoV-2 infection during the first epidemic wave in the metropolitan area of Athens, Greece
Authors:
Konstantinos Kaloudis,
George A. Kevrekidis,
Helena C. Maltezou,
Cleo Anastassopoulou,
Athanasios Tsakris,
Lucia Russo
Abstract:
Herein, we provide estimations for the effective reproduction number $R_e$ for the greater metropolitan area of Athens, Greece during the first wave of the pandemic (February 26-May 15, 2020). For our calculations, we implemented, in a comparative approach, the two most widely used methods for the estimation of $R_e$, that by Wallinga and Teunis and by Cori et al. Data were retrieved from the national database of SARS-CoV-2 infections in Greece. Our analysis revealed that the expected value of $R_e$ dropped below 1 around March 15, shortly after the suspension of the operation of educational institutions of all levels nationwide on March 10, and the closing of all retail activities (cafes, bars, museums, shopping centres, sports facilities and restaurants) on March 13. On May 4, the date on which the gradual relaxation of the strict lockdown commenced, the expected value of $R_e$ was slightly below 1, however with relatively high levels of uncertainty due to the limited number of notified cases during this period. Finally, we discuss the limitations and pitfalls of the methods utilized for the estimation of the $R_e$, highlighting that the results of such analyses should be considered only as indicative by policy makers.
Submitted 28 December, 2020;
originally announced December 2020.
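As a rough illustration of the Cori et al. style of estimate named in the abstract above, the following Python sketch computes a sliding-window posterior mean for $R_e$ from daily incidence and a discretized serial-interval distribution. With a Gamma prior of shape `a` and scale `b`, the posterior mean over a window is (a + sum of cases) / (1/b + sum of infection pressure). The prior values, the window length, the serial-interval weights, and the toy incidence are all assumptions for illustration and are not the paper's data or settings.

```python
def infection_pressure(incidence, w, t):
    """Lambda_t = sum_k w[k-1] * I_{t-k}, truncated at the start of the series."""
    return sum(
        w[k - 1] * incidence[t - k]
        for k in range(1, len(w) + 1)
        if t - k >= 0
    )


def cori_posterior_mean(incidence, w, t, window=7, a=1.0, b=5.0):
    """Posterior mean of R over the window ending at day t (Gamma prior assumed)."""
    days = range(t - window + 1, t + 1)
    cases = sum(incidence[s] for s in days)
    pressure = sum(infection_pressure(incidence, w, s) for s in days)
    return (a + cases) / (1.0 / b + pressure)


# Toy data: constant incidence, so the estimate should sit close to 1.
toy_incidence = [10] * 30
toy_weights = [0.2, 0.5, 0.3]  # hypothetical serial-interval probabilities
r_estimate = cori_posterior_mean(toy_incidence, toy_weights, t=20)
```

With few notified cases in the window, the prior terms `a` and `1/b` carry more weight, which is one way to see the high uncertainty at low case counts that the abstract cautions about.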
-
Reaction-diffusion spatial modeling of COVID-19: Greece and Andalusia as case examples
Authors:
P. G. Kevrekidis,
J. Cuevas-Maraver,
Y. Drossinos,
Z. Rapti,
G. A. Kevrekidis
Abstract:
We examine the spatial modeling of the outbreak of COVID-19 in two regions: the autonomous community of Andalusia in Spain and the mainland of Greece. We start with a 0D compartmental epidemiological model consisting of Susceptible, Exposed, Asymptomatic, (symptomatically) Infected, Hospitalized, Recovered, and deceased populations. We emphasize the importance of the viral latent period and the key role of an asymptomatic population. We optimize model parameters for both regions by comparing predictions to the cumulative number of infected and total number of deaths via minimizing the $\ell^2$ norm of the difference between predictions and observed data. We consider the sensitivity of model predictions on reasonable variations of model parameters and initial conditions, addressing issues of parameter identifiability. We model both pre-quarantine and post-quarantine evolution of the epidemic by a time-dependent change of the viral transmission rates that arises in response to containment measures. Subsequently, a spatially distributed version of the 0D model in the form of reaction-diffusion equations is developed. We consider that, after an initial localized seeding of the infection, its spread is governed by the diffusion (and 0D model "reactions") of the asymptomatic and symptomatically infected populations, which decrease with the imposed restrictive measures. We inserted the maps of the two regions, and we imported population-density data into COMSOL, which was subsequently used to solve numerically the model PDEs. Upon discussing how to adapt the 0D model to this spatial setting, we show that these models bear significant potential towards capturing both the well-mixed, 0D description and the spatial expansion of the pandemic in the two regions. Veins of potential refinement of the model assumptions towards future work are also explored.
Submitted 1 July, 2021; v1 submitted 9 May, 2020;
originally announced May 2020.