-
Neural Network Representations of Multiphase Equations of State
Authors:
George A. Kevrekidis,
Daniel A. Serino,
Alexander Kaltenborn,
J. Tinka Gammel,
Joshua W. Burby,
Marc L. Klasky
Abstract:
Equations of State model relations between thermodynamic variables and are ubiquitous in scientific modelling, appearing in modern day applications ranging from Astrophysics to Climate Science. The three desired properties of a general Equation of State model are adherence to the Laws of Thermodynamics, incorporation of phase transitions, and multiscale accuracy. Analytic models that adhere to all three are hard to develop and cumbersome to work with, often resulting in sacrificing one of these elements for the sake of efficiency. In this work, two deep-learning methods are proposed that provably satisfy the first and second conditions on a large-enough region of thermodynamic variable space. The first is based on learning the generating function (thermodynamic potential) while the second is based on structure-preserving, symplectic neural networks, respectively allowing modifications near or on phase transition regions. They can be used either "from scratch" to learn a full Equation of State, or in conjunction with a pre-existing consistent model, functioning as a modification that better adheres to experimental data. We formulate the theory and provide several computational examples to justify both approaches, and highlight their advantages and shortcomings.
Submitted 1 July, 2024; v1 submitted 28 June, 2024;
originally announced June 2024.
-
Towards fully covariant machine learning
Authors:
Soledad Villar,
David W. Hogg,
Weichi Yao,
George A. Kevrekidis,
Bernhard Schölkopf
Abstract:
Any representation of data involves arbitrary investigator choices. Because those choices are external to the data-generating process, each choice leads to an exact symmetry, corresponding to the group of transformations that takes one possible representation to another. These are the passive symmetries; they include coordinate freedom, gauge symmetry, and units covariance, all of which have led to important results in physics. In machine learning, the most visible passive symmetry is the relabeling or permutation symmetry of graphs. Our goal is to understand the implications for machine learning of the many passive symmetries in play. We discuss dos and don'ts for machine learning practice if passive symmetries are to be respected. We discuss links to causal modeling, and argue that the implementation of passive symmetries is particularly valuable when the goal of the learning problem is to generalize out of sample. This paper is conceptual: It translates among the languages of physics, mathematics, and machine-learning. We believe that consideration and implementation of passive symmetries might help machine learning in the same ways that it transformed physics in the twentieth century.
Submitted 28 June, 2023; v1 submitted 31 January, 2023;
originally announced January 2023.
-
The role of mobility in the dynamics of the COVID-19 epidemic in Andalusia
Authors:
Z. Rapti,
J. Cuevas-Maraver,
E. Kontou,
S. Liu,
Y. Drossinos,
P. G. Kevrekidis,
G. A. Kevrekidis,
M. Barmann,
Q. -Y. Chen
Abstract:
Metapopulation models have been a popular tool for the study of epidemic spread over a network of highly populated nodes (cities, provinces, countries) and have been extensively used in the context of the ongoing COVID-19 pandemic. In the present work, we revisit such a model, bearing a particular case example in mind, namely that of the region of Andalusia in Spain during the period of the summer-fall of 2020 (i.e., between the first and second pandemic waves). Our aim is to consider the possibility of incorporation of mobility across the province nodes focusing on mobile-phone time dependent data, but also discussing the comparison for our case example with a gravity model, as well as with the dynamics in the absence of mobility. Our main finding is that mobility is key towards a quantitative understanding of the emergence of the second wave of the pandemic and that the most accurate way to capture it involves dynamic (rather than static) inclusion of time-dependent mobility matrices based on cell-phone data. Alternatives bearing no mobility are unable to capture the trends revealed by the data in the context of the metapopulation model considered herein.
Submitted 5 July, 2022;
originally announced July 2022.
-
Backcasting COVID-19: A Physics-Informed Estimate for Early Case Incidence
Authors:
G. A. Kevrekidis,
Z. Rapti,
Y. Drossinos,
P. G. Kevrekidis,
M. A. Barmann,
Q. Y. Chen,
J. Cuevas-Maraver
Abstract:
It is widely accepted that the number of reported cases during the first stages of the COVID-19 pandemic severely underestimates the number of actual cases. We leverage delay embedding theorems of Whitney and Takens and use Gaussian Process regression to estimate the number of cases during the first 2020 wave based on the second wave of the epidemic in several European countries, South Korea, and Brazil. We assume that the second wave was more accurately monitored and hence that it can be trusted. We then construct a manifold diffeomorphic to that of the implied original dynamical system, using fatalities or hospitalizations only. Finally, we restrict the diffeomorphism to the reported cases coordinate of the dynamical system. Our main finding is that in the European countries studied, the actual cases are under-reported by as much as 50\%. On the other hand, in South Korea -- which had an exemplary and proactive mitigation approach -- a far smaller discrepancy between the actual and reported cases is predicted, with an approximately 17\% predicted under-estimation. We believe that our backcasting framework is applicable to other epidemic outbreaks where (due to limited or poor quality data) there is uncertainty around the actual cases.
Submitted 30 January, 2022;
originally announced February 2022.
-
Estimation of the effective reproduction number for SARS-CoV-2 infection during the first epidemic wave in the metropolitan area of Athens, Greece
Authors:
Konstantinos Kaloudis,
George A. Kevrekidis,
Helena C. Maltezou,
Cleo Anastassopoulou,
Athanasios Tsakris,
Lucia Russo
Abstract:
Herein, we provide estimations for the effective reproduction number $R_e$ for the greater metropolitan area of Athens, Greece during the first wave of the pandemic (February 26-May 15, 2020). For our calculations, we implemented, in a comparative approach, the two most widely used methods for the estimation of $R_e$, that by Wallinga and Teunis and by Cori et al. Data were retrieved from the national database of SARS-CoV-2 infections in Greece. Our analysis revealed that the expected value of $R_e$ dropped below 1 around March 15, shortly after the suspension of the operation of educational institutions of all levels nationwide on March 10, and the closing of all retail activities (cafes, bars, museums, shopping centres, sports facilities and restaurants) on March 13. On May 4, the date on which the gradual relaxation of the strict lockdown commenced, the expected value of $R_e$ was slightly below 1, however with relatively high levels of uncertainty due to the limited number of notified cases during this period. Finally, we discuss the limitations and pitfalls of the methods utilized for the estimation of the $R_e$, highlighting that the results of such analyses should be considered only as indicative by policy makers.
Submitted 28 December, 2020;
originally announced December 2020.