-
Validating Deep-Learning Weather Forecast Models on Recent High-Impact Extreme Events
Authors:
Olivier C. Pasche,
Jonathan Wider,
Zhongwei Zhang,
Jakob Zscheischler,
Sebastian Engelke
Abstract:
The forecast accuracy of deep-learning-based weather prediction models is improving rapidly, leading many to speak of a "second revolution in weather forecasting". With numerous methods being developed, and limited physical guarantees offered by deep-learning models, there is a critical need for comprehensive evaluation of these emerging techniques. While this need has been partly fulfilled by ben…
▽ More
The forecast accuracy of deep-learning-based weather prediction models is improving rapidly, leading many to speak of a "second revolution in weather forecasting". With numerous methods being developed, and limited physical guarantees offered by deep-learning models, there is a critical need for comprehensive evaluation of these emerging techniques. While this need has been partly fulfilled by benchmark datasets, they provide little information on rare and impactful extreme events, or on compound impact metrics, for which model accuracy might degrade due to misrepresented dependencies between variables. To address these issues, we compare deep-learning weather prediction models (GraphCast, PanguWeather, FourCastNet) and ECMWF's high-resolution forecast (HRES) system in three case studies: the 2021 Pacific Northwest heatwave, the 2023 South Asian humid heatwave, and the North American winter storm in 2021. We find evidence that machine learning (ML) weather prediction models can locally achieve similar accuracy to HRES on record-shattering events such as the 2021 Pacific Northwest heatwave and even forecast the compound 2021 North American winter storm substantially better. However, extrapolating to extreme conditions may impact machine learning models more severely than HRES, as evidenced by the comparable or superior spatially- and temporally-aggregated forecast accuracy of HRES for the two heatwaves studied. The ML forecasts also lack variables required to assess the health risks of events such as the 2023 South Asian humid heatwave. Generally, case-study-driven, impact-centric evaluation can complement existing research, increase public trust, and aid in develo** reliable ML weather prediction models.
△ Less
Submitted 26 April, 2024;
originally announced April 2024.
-
Insights into the drivers and spatio-temporal trends of extreme Mediterranean wildfires with statistical deep-learning
Authors:
Jordan Richards,
Raphaël Huser,
Emanuele Bevacqua,
Jakob Zscheischler
Abstract:
Extreme wildfires are a significant cause of human death and biodiversity destruction within countries that encompass the Mediterranean Basin. Recent worrying trends in wildfire activity (i.e., occurrence and spread) suggest that wildfires are likely to be highly impacted by climate change. In order to facilitate appropriate risk mitigation, we must identify the main drivers of extreme wildfires a…
▽ More
Extreme wildfires are a significant cause of human death and biodiversity destruction within countries that encompass the Mediterranean Basin. Recent worrying trends in wildfire activity (i.e., occurrence and spread) suggest that wildfires are likely to be highly impacted by climate change. In order to facilitate appropriate risk mitigation, we must identify the main drivers of extreme wildfires and assess their spatio-temporal trends, with a view to understanding the impacts of global warming on fire activity. We analyse the monthly burnt area due to wildfires over a region encompassing most of Europe and the Mediterranean Basin from 2001 to 2020, and identify high fire activity during this period in Algeria, Italy and Portugal. We build an extreme quantile regression model with a high-dimensional predictor set describing meteorological conditions, land cover usage, and orography. To model the complex relationships between the predictor variables and wildfires, we use a hybrid statistical deep-learning framework that can disentangle the effects of vapour-pressure deficit (VPD), air temperature, and drought on wildfire activity. Our results highlight that whilst VPD, air temperature, and drought significantly affect wildfire occurrence, only VPD affects wildfire spread. To gain insights into the effect of climate trends on wildfires in the near future, we focus on August 2001 and perturb temperature according to its observed trends (median over Europe: +0.04K per year). We find that, on average over Europe, these trends lead to a relative increase of 17.1\% and 1.6\% in the expected frequency and severity, respectively, of wildfires in August 2001, with spatially non-uniform changes in both aspects.
△ Less
Submitted 5 June, 2023; v1 submitted 4 December, 2022;
originally announced December 2022.
-
Modelling and simulating spatial extremes by combining extreme value theory with generative adversarial networks
Authors:
Younes Boulaguiem,
Jakob Zscheischler,
Edoardo Vignotto,
Karin van der Wiel,
Sebastian Engelke
Abstract:
Modelling dependencies between climate extremes is important for climate risk assessment, for instance when allocating emergency management funds. In statistics, multivariate extreme value theory is often used to model spatial extremes. However, most commonly used approaches require strong assumptions and are either too simplistic or over-parameterized. From a machine learning perspective, Generat…
▽ More
Modelling dependencies between climate extremes is important for climate risk assessment, for instance when allocating emergency management funds. In statistics, multivariate extreme value theory is often used to model spatial extremes. However, most commonly used approaches require strong assumptions and are either too simplistic or over-parameterized. From a machine learning perspective, Generative Adversarial Networks (GANs) are a powerful tool to model dependencies in high-dimensional spaces. Yet in the standard setting, GANs do not well represent dependencies in the extremes. Here we combine GANs with extreme value theory (evtGAN) to model spatial dependencies in summer maxima of temperature and winter maxima in precipitation over a large part of western Europe. We use data from a stationary 2000-year climate model simulation to validate the approach and explore its sensitivity to small sample sizes. Our results show that evtGAN outperforms classical GANs and standard statistical approaches to model spatial extremes. Already with about 50 years of data, which corresponds to commonly available climate records, we obtain reasonably good performance. In general, dependencies between temperature extremes are better captured than dependencies between precipitation extremes due to the high spatial coherence in temperature fields. Our approach can be applied to other climate variables and can be used to emulate climate models when running very long simulations to determine dependencies in the extremes is deemed infeasible.
△ Less
Submitted 19 January, 2022; v1 submitted 30 October, 2021;
originally announced November 2021.
-
Have precipitation extremes and annual totals been increasing in the world's dry regions over the last 60 years?
Authors:
Sebastian Sippel,
Jakob Zscheischler,
Martin Heimann,
Holger Lange,
Miguel D. Mahecha,
Geert Jan van Oldenborgh,
Friederike E. L. Otto,
Markus Reichstein
Abstract:
Daily rainfall extremes and annual totals have increased in large parts of the global land area over the last decades. These observations are consistent with theoretical considerations of a warming climate. However, until recently these global tendencies have not been shown to consistently affect land regions with limited moisture availability. A recent study, published by Donat et al. (2016, Natu…
▽ More
Daily rainfall extremes and annual totals have increased in large parts of the global land area over the last decades. These observations are consistent with theoretical considerations of a warming climate. However, until recently these global tendencies have not been shown to consistently affect land regions with limited moisture availability. A recent study, published by Donat et al. (2016, Nature Climate Change, doi:10.1038/nclimate2941), now identified rapid increases in globally aggregated dry region daily extreme rainfall and annual rainfall totals. Here, we reassess the respective analysis and find that a) statistical artifacts introduced by the choice of the reference period prior to data standardization lead to an overestimation of the reported trends by up to 40%, and also that b) the definition of `dry regions of the globe' affect the reported globally aggregated trends in extreme rainfall. Using the same observational dataset, but accounting for the statistical artifacts and using alternative, well-established dryness definitions, we find no significant increases in heavy precipitation in the world's dry regions. Adequate data pre-processing approaches and accounting for uncertainties regarding the definition of dryness are crucial to the quantification of spatially aggregated trends in the world's dry regions. In view of the high relevance of the question to many potentially affected stakeholders, we call for a cautionary consideration of specific data processing methods, including issues related to the definition of dry areas, to guarantee robustness of communicated climate change relevant findings.
△ Less
Submitted 1 September, 2016;
originally announced September 2016.
-
Distinguishing cause from effect using observational data: methods and benchmarks
Authors:
Joris M. Mooij,
Jonas Peters,
Dominik Janzing,
Jakob Zscheischler,
Bernhard Schölkopf
Abstract:
The discovery of causal relationships from purely observational data is a fundamental problem in science. The most elementary form of such a causal discovery problem is to decide whether X causes Y or, alternatively, Y causes X, given joint observations of two variables X, Y. An example is to decide whether altitude causes temperature, or vice versa, given only joint measurements of both variables…
▽ More
The discovery of causal relationships from purely observational data is a fundamental problem in science. The most elementary form of such a causal discovery problem is to decide whether X causes Y or, alternatively, Y causes X, given joint observations of two variables X, Y. An example is to decide whether altitude causes temperature, or vice versa, given only joint measurements of both variables. Even under the simplifying assumptions of no confounding, no feedback loops, and no selection bias, such bivariate causal discovery problems are challenging. Nevertheless, several approaches for addressing those problems have been proposed in recent years. We review two families of such methods: Additive Noise Methods (ANM) and Information Geometric Causal Inference (IGCI). We present the benchmark CauseEffectPairs that consists of data for 100 different cause-effect pairs selected from 37 datasets from various domains (e.g., meteorology, biology, medicine, engineering, economy, etc.) and motivate our decisions regarding the "ground truth" causal directions of all pairs. We evaluate the performance of several bivariate causal discovery methods on these real-world benchmark data and in addition on artificially simulated data. Our empirical results on real-world data indicate that certain methods are indeed able to distinguish cause from effect using only purely observational data, although more benchmark data would be needed to obtain statistically significant conclusions. One of the best performing methods overall is the additive-noise method originally proposed by Hoyer et al. (2009), which obtains an accuracy of 63+-10 % and an AUC of 0.74+-0.05 on the real-world benchmark. As the main theoretical contribution of this work we prove the consistency of that method.
△ Less
Submitted 24 December, 2015; v1 submitted 11 December, 2014;
originally announced December 2014.
-
Inferring deterministic causal relations
Authors:
Povilas Daniusis,
Dominik Janzing,
Joris Mooij,
Jakob Zscheischler,
Bastian Steudel,
Kun Zhang,
Bernhard Schoelkopf
Abstract:
We consider two variables that are related to each other by an invertible function. While it has previously been shown that the dependence structure of the noise can provide hints to determine which of the two variables is the cause, we presently show that even in the deterministic (noise-free) case, there are asymmetries that can be exploited for causal inference. Our method is based on the idea…
▽ More
We consider two variables that are related to each other by an invertible function. While it has previously been shown that the dependence structure of the noise can provide hints to determine which of the two variables is the cause, we presently show that even in the deterministic (noise-free) case, there are asymmetries that can be exploited for causal inference. Our method is based on the idea that if the function and the probability density of the cause are chosen independently, then the distribution of the effect will, in a certain sense, depend on the function. We provide a theoretical analysis of this method, showing that it also works in the low noise regime, and link it to information geometry. We report strong empirical results on various real-world data sets from different domains.
△ Less
Submitted 15 March, 2012;
originally announced March 2012.
-
Testing whether linear equations are causal: A free probability theory approach
Authors:
Jakob Zscheischler,
Dominik Janzing,
Kun Zhang
Abstract:
We propose a method that infers whether linear relations between two high-dimensional variables X and Y are due to a causal influence from X to Y or from Y to X. The earlier proposed so-called Trace Method is extended to the regime where the dimension of the observed variables exceeds the sample size. Based on previous work, we postulate conditions that characterize a causal relation between X and…
▽ More
We propose a method that infers whether linear relations between two high-dimensional variables X and Y are due to a causal influence from X to Y or from Y to X. The earlier proposed so-called Trace Method is extended to the regime where the dimension of the observed variables exceeds the sample size. Based on previous work, we postulate conditions that characterize a causal relation between X and Y. Moreover, we describe a statistical test and argue that both causal directions are typically rejected if there is a common cause. A full theoretical analysis is presented for the deterministic case but our approach seems to be valid for the noisy case, too, for which we additionally present an approach based on a sparsity constraint. The discussed method yields promising results for both simulated and real world data.
△ Less
Submitted 14 February, 2012;
originally announced February 2012.