-
Implementation of a data-driven equation-discovery mesoscale parameterization into an ocean model
Authors:
Pavel Perezhogin,
Cheng Zhang,
Alistair Adcroft,
Carlos Fernandez-Granda,
Laure Zanna
Abstract:
Mesoscale eddies are poorly represented in climate ocean models, and therefore their effects on the large scale circulation must be parameterized. Classical parameterizations, which represent the bulk effect of the unresolved eddies, can be improved with new subgrid models learned directly from data. Zanna and Bolton (2020) (ZB20) applied an equation-discovery algorithm to reveal an interpretable…
▽ More
Mesoscale eddies are poorly represented in climate ocean models, and therefore their effects on the large scale circulation must be parameterized. Classical parameterizations, which represent the bulk effect of the unresolved eddies, can be improved with new subgrid models learned directly from data. Zanna and Bolton (2020) (ZB20) applied an equation-discovery algorithm to reveal an interpretable expression parameterizing the subgrid mesoscale fluxes through the components of the velocity-gradient tensor. In this work, we implement the ZB20 parameterization into the primitive-equation GFDL MOM6 ocean model and test it in two idealized configurations with significantly different stratification and topography. In addition, we propose an approach based on spatial filtering to improve the representation of large-scale energy backscatter and numerical properties of the parameterization. The ZB20 parameterization led to improved climatological mean flow and energy distributions, compared to the current state-of-the-art energy backscatter parameterizations. The ZB20 is scale-aware and can be used with a single value of the non-dimensional scaling coefficient for a range of resolutions. The successful application of the ZB20 to parameterize mesoscale eddies in two idealized configurations offers a promising opportunity to reduce long-standing biases in global ocean simulations in future studies.
△ Less
Submitted 4 November, 2023;
originally announced November 2023.
-
Reliable coarse-grained turbulent simulations through combined offline learning and neural emulation
Authors:
Christian Pedersen,
Laure Zanna,
Joan Bruna,
Pavel Perezhogin
Abstract:
Integration of machine learning (ML) models of unresolved dynamics into numerical simulations of fluid dynamics has been demonstrated to improve the accuracy of coarse resolution simulations. However, when trained in a purely offline mode, integrating ML models into the numerical scheme can lead to instabilities. In the context of a 2D, quasi-geostrophic turbulent system, we demonstrate that inclu…
▽ More
Integration of machine learning (ML) models of unresolved dynamics into numerical simulations of fluid dynamics has been demonstrated to improve the accuracy of coarse resolution simulations. However, when trained in a purely offline mode, integrating ML models into the numerical scheme can lead to instabilities. In the context of a 2D, quasi-geostrophic turbulent system, we demonstrate that including an additional network in the loss function, which emulates the state of the system into the future, produces offline-trained ML models that capture important subgrid processes, with improved stability properties.
△ Less
Submitted 24 July, 2023;
originally announced July 2023.
-
A data-driven framework for dimensionality reduction and causal inference in climate fields
Authors:
Fabrizio Falasca,
Pavel Perezhogin,
Laure Zanna
Abstract:
We propose a data-driven framework to simplify the description of spatiotemporal climate variability into few entities and their causal linkages. Given a high-dimensional climate field, the methodology first reduces its dimensionality into a set of regionally constrained patterns. Time-dependent causal links are then inferred in the interventional sense through the fluctuation-response formalism,…
▽ More
We propose a data-driven framework to simplify the description of spatiotemporal climate variability into few entities and their causal linkages. Given a high-dimensional climate field, the methodology first reduces its dimensionality into a set of regionally constrained patterns. Time-dependent causal links are then inferred in the interventional sense through the fluctuation-response formalism, as shown in Baldovin et al. (2020). These two steps allow to explore how regional climate variability can influence remote locations. To distinguish between true and spurious responses, we propose a novel analytical null model for the fluctuation-dissipation relation, therefore allowing for uncertainty estimation at a given confidence level. Finally, we select a set of metrics to summarize the results, offering a useful and simplified approach to explore climate dynamics. We showcase the methodology on the monthly sea surface temperature field at global scale. We demonstrate the usefulness of the proposed framework by studying few individual links as well as "link maps", visualizing the cumulative degree of causation between a given region and the whole system. Finally, each pattern is ranked in terms of its "causal strength", quantifying its relative ability to influence the system's dynamics. We argue that the methodology allows to explore and characterize causal relationships in high-dimensional spatiotemporal fields in a rigorous and interpretable way.
△ Less
Submitted 5 April, 2024; v1 submitted 26 June, 2023;
originally announced June 2023.
-
Subgrid parameterizations of ocean mesoscale eddies based on Germano decomposition
Authors:
Pavel Perezhogin,
Andrey Glazunov
Abstract:
Ocean models at intermediate resolution (1/4 degree), which partially resolve mesoscale eddies, can be seen as Large eddy simulations (LES) of the primitive equations, in which the effect of unresolved eddies must be parameterized. In this work, we propose new subgrid models that are consistent with the physics of two-dimensional (2D) flows. We analyze subgrid fluxes in barotropic decaying turbule…
▽ More
Ocean models at intermediate resolution (1/4 degree), which partially resolve mesoscale eddies, can be seen as Large eddy simulations (LES) of the primitive equations, in which the effect of unresolved eddies must be parameterized. In this work, we propose new subgrid models that are consistent with the physics of two-dimensional (2D) flows. We analyze subgrid fluxes in barotropic decaying turbulence using Germano (1986) decomposition. We show that Leonard and Cross stresses are responsible for the enstrophy dissipation, while the Reynolds stress is responsible for additional kinetic energy backscatter. We utilize these findings to propose a new model, consisting of three parts, that is compared to a baseline dynamic Smagorinsky model (DSM). The three-component model accurately simulates the spectral transfer of energy and enstrophy and improves the representation of kinetic energy (KE) spectrum, resolved KE and enstrophy decay in a posteriori experiments. The backscattering component of the new model (Reynolds stress) is implemented both in quasi-geostrophic and primitive equation ocean models and improves statistical characteristics, such as the vertical profile of eddy kinetic energy, meridional overturning circulation and cascades of kinetic and potential energy.
△ Less
Submitted 13 April, 2023;
originally announced April 2023.
-
Implementation and Evaluation of a Machine Learned Mesoscale Eddy Parameterization into a Numerical Ocean Circulation Model
Authors:
Cheng Zhang,
Pavel Perezhogin,
Cem Gultekin,
Alistair Adcroft,
Carlos Fernandez-Granda,
Laure Zanna
Abstract:
We address the question of how to use a machine learned parameterization in a general circulation model, and assess its performance both computationally and physically. We take one particular machine learned parameterization \cite{Guillaumin1&Zanna-JAMES21} and evaluate the online performance in a different model from which it was previously tested. This parameterization is a deep convolutional ne…
▽ More
We address the question of how to use a machine learned parameterization in a general circulation model, and assess its performance both computationally and physically. We take one particular machine learned parameterization \cite{Guillaumin1&Zanna-JAMES21} and evaluate the online performance in a different model from which it was previously tested. This parameterization is a deep convolutional network that predicts parameters for a stochastic model of subgrid momentum forcing by mesoscale eddies. We treat the parameterization as we would a conventional parameterization once implemented in the numerical model. This includes trying the parameterization in a different flow regime from that in which it was trained, at different spatial resolutions, and with other differences, all to test generalization. We assess whether tuning is possible, which is a common practice in general circulation model development. We find the parameterization, without modification or special treatment, to be stable and that the action of the parameterization to be diminishing as spatial resolution is refined. We also find some limitations of the machine learning model in implementation: 1) tuning of the outputs from the parameterization at various depths is necessary; 2) the forcing near boundaries is not predicted as well as in the open ocean; 3) the cost of the parameterization is prohibitively high on CPUs. We discuss these limitations, present some solutions to problems, and conclude that this particular ML parameterization does inject energy, and improve backscatter, as intended but it might need further refinement before we can use it in production mode in contemporary climate models.
△ Less
Submitted 1 March, 2023;
originally announced March 2023.
-
Generative data-driven approaches for stochastic subgrid parameterizations in an idealized ocean model
Authors:
Pavel Perezhogin,
Laure Zanna,
Carlos Fernandez-Granda
Abstract:
Subgrid parameterizations of mesoscale eddies continue to be in demand for climate simulations. These subgrid parameterizations can be powerfully designed using physics and/or data-driven methods, with uncertainty quantification. For example, Guillaumin and Zanna (2021) proposed a Machine Learning (ML) model that predicts subgrid forcing and its local uncertainty. The major assumption and potentia…
▽ More
Subgrid parameterizations of mesoscale eddies continue to be in demand for climate simulations. These subgrid parameterizations can be powerfully designed using physics and/or data-driven methods, with uncertainty quantification. For example, Guillaumin and Zanna (2021) proposed a Machine Learning (ML) model that predicts subgrid forcing and its local uncertainty. The major assumption and potential drawback of this model is the statistical independence of stochastic residuals between grid points. Here, we aim to improve the simulation of stochastic forcing with generative models of ML, such as Generative adversarial network (GAN) and Variational autoencoder (VAE). Generative models learn the distribution of subgrid forcing conditioned on the resolved flow directly from data and they can produce new samples from this distribution. Generative models can potentially capture not only the spatial correlation but any statistically significant property of subgrid forcing. We test the proposed stochastic parameterizations offline and online in an idealized ocean model. We show that generative models are able to predict subgrid forcing and its uncertainty with spatially correlated stochastic forcing. Online simulations for a range of resolutions demonstrated that generative models are superior to the baseline ML model at the coarsest resolution.
△ Less
Submitted 15 February, 2023;
originally announced February 2023.