[1]Gustau Camps-Valls

[1]\orgdivImage Processing Laboratory, \orgnameUniversitat de València, \countrySpain 2]Chair of Data Science in Earth Observation, Technical University of Munich, Munich, Germany 3]\orgdivDepartment of Electronics, Information, and Bioengineering, \orgnamePolitecnico di Milano, \countryItaly 4]University of Bremen, Institute of Environmental Physics (IUP), Bremen, Germany 5]Max Planck Institute of Biogeochemistry, Jena, Germany 6]Leipzig University, Leipzig, Germany 7]National Technical University of Athens, Greece 8]National Observatory of Athens, Athens, Greece 9]Department of Signal Processing and Communications, Universidad de Alcalá, Madrid, Spain 10]German Aerospace Center (DLR), Jena, Germany 11]IMT Atlantique, Brest, France 12]Institute for Environmental Studies, VU Amsterdam, The Netherlands 13]Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI), Leipzig, Germany 14]Institute for Earth System Science & Remote Sensing, Leipzig University, Leipzig, Germany 15]Remote Sensing Centre for Earth System Research (RSC4Earth), Leipzig University, Leipzig, Germany 16]Deutsches Zentrum für Luft- und Raumfahrt (DLR), Institut für Physik der Atmosphäre, Oberpfaffenhofen, Germany

AI for Extreme Event Modeling and Understanding: Methodologies and Challenges

[email protected]    Miguel-Ángel Fernández-Torres    Kai-Hendrik Cohrs    Adrian Höhl    Andrea Castelletti    Aytac Pacal    Claire Robin    Francesco Martinuzzi    Ioannis Papoutsis    Ioannis Prapas    Jorge Pérez-Aracil    Katja Weigel    Maria Gonzalez-Calabuig    Markus Reichstein    Martin Rabel    Matteo Giuliani    Miguel Mahecha    Oana-Iuliana Popescu    Oscar J. Pellicer-Valero    Said Ouala1111{}^{\text{11}}start_FLOATSUPERSCRIPT 11 end_FLOATSUPERSCRIPT    Sancho Salcedo-Sanz    Sebastian Sippel    Spyros Kondylatos    Tamara Happé    Tristan Williams * [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
Abstract

In recent years, artificial intelligence (AI) has deeply impacted various fields, including Earth system sciences. Here, AI improved weather forecasting, model emulation, parameter estimation, and the prediction of extreme events. However, the latter comes with specific challenges, such as develo** accurate predictors from noisy, heterogeneous and limited annotated data. This paper reviews how AI is being used to analyze extreme events (like floods, droughts, wildfires and heatwaves), highlighting the importance of creating accurate, transparent, and reliable AI models. We discuss the hurdles of dealing with limited data, integrating information in real-time, deploying models, and making them understandable, all crucial for gaining the trust of stakeholders and meeting regulatory needs. We provide an overview of how AI can help identify and explain extreme events more effectively, improving disaster response and communication. We emphasize the need for collaboration across different fields to create AI solutions that are practical, understandable, and trustworthy for analyzing and predicting extreme events. Such collaborative efforts aim to enhance disaster readiness and disaster risk reduction.

keywords:
Artificial intelligence, extreme events, droughts, heatwaves, floods, detection, forecasting, modeling, understanding, attribution, explainable AI, causal inference, communication of risk

1 Introduction

The frequency, intensity, and duration of weather and climate extremes have increased in recent years, posing unprecedented challenges to societal stability, economic security, biodiversity loss, and ecological integrity seneviratne2021weather . These events –ranging from severe storms and floods to droughts and heatwaves– exert profound impacts on human livelihoods and the natural environment, often with long-lasting and sometimes irreversible consequences. Modeling, characterizing, and understanding extreme events is key to advancing mitigation and adaptation strategies.

In this context, Artificial Intelligence (AI) has emerged as a transformative tool for the detection aidetoolbox , forecasting Lam2023 ; ferchichi2022 , analysis of extreme events and generation of worst-case events ragone2021rare , and promises advances in attribution studies, explanation, and communication of risk Yokota2004 . The capabilities of machine learning (ML), and deep learning (DL) in particular, in combination with computer vision techniques, are advancing the detection and localization of events by exploiting climate data, such as reanalysis and observations salcedosanz2024analysis . Modern techniques for quantifying uncertainty gawlikowski2023survey are a necessary step to making progress in climate change risk assessment harrington2021quantifying . Using ensembles in combination with AI models has progressed the field of attribution of extremes stott2016attribution , identification of patterns, trends madakumbura2021anthropogenic and climate analogs chattopadhyay_analog_2020 . Yet, AI does not only excel at prediction but can also explain processes (e.g., via explainable AI FindingtherightXAIMethodBommer2024 and causal inference Hannart2016CausalCT ), which is essential for decision-making and effective mitigation strategies. Recent advances in large language models (LLMs) that retrieve information from heterogeneous text sources allow for effective integration of methods for communication tasks and human-machine interaction in extreme event and situation analysis vaghefi2023chatclimate .

Refer to caption
Figure 1: A general AI-driven extreme event analysis pipeline. Different components in modeling and understanding extreme events using AI methodologies are interconnected, highlighting the flow from data collection and analysis to actionable insights/outputs and the challenges encountered. Note the feedback loops where AI does not only produce some relevant outputs and products from data (predictions, patterns and trends, climate attributions, and causal relations) but also may help suggest areas for improvement and adaptation in methodologies to overcome identified challenges, redefine the hypotheses and challenges themselves, as well as inform data collection and preprocessing.

Here, w review the role of AI for extreme event analysis, its challenges and opportunities for extreme event analysis. The general pipeline of AI-driven extreme event analysis (cf. Fig. 1) encapsulates the entire workflow from data collection and preprocessing to the generation of outputs such as predictions, patterns, trends, climate attributions, and causal relations. The framework also highlights the iterative nature of AI processes, where outputs serve their direct purpose and also serve to inform and improve data collection, preprocessing, and hypothesis formulation. Nevertheless, develo** AI models that are accurate, operational, explainable, and trustworthy still remains a significant challenge. Issues such as data scarcity, the need for high-dimensional data processing, and the interpretation and explanation of AI models are critical challenges that must be addressed. Moreover, communicating risk effectively and ethically, and integrating AI tools with existing systems to ensure operational viability and scientific robustness are nontrivial tasks requiring ongoing research and development.

We provide a comprehensive overview of the AI-driven pipeline for analyzing extreme events, including modeling, detection, forecasting, and communication (Section 2), and discuss key challenges (Section 3). We present case studies on AI applications for droughts, heatwaves, wildfires, and floods (Section 4), and conclude by identifying major challenges and future research opportunities in AI for extreme events (Section 5).

2 Review of AI methods

This section reviews the main methods in all aspects of extreme event analysis (cf. Table 1): data, modeling, understanding and trustworthiness, and the last mile on communication and deployment.

Table 1: A comprehensive categorization of extreme events analysis in four main blocks (data, modeling, understanding and trustworthiness, and the last mile on communication and deployment) regarding their task, standard and current AI methods.

[Uncaptioned image]

2.1 Extreme Event Modeling

AI methodologies for extreme event modeling can be categorized into detection ruff2021unifying , prediction han2019review and impact assessment zennaro2021exploring tasks. Thanks to the emergence and success of DL, all of these tasks can be tackled by designing data-driven models that exploit spatio-temporal and multisource Earth data characteristics, from climate variables to in situ measurements and satellite remote sensing images (see Figure 2 [top row]).

2.1.1 Detection

Detecting and identifying extreme events geographically over time to distinguish them from normal conditions is fundamental for discovering underlying patterns and correlations between climate variables. This allows for a better understanding of their generating processes and mechanisms and facilitates their anticipation.

Classical statistical methods, such as threshold or percentile-based indices, have been widely applied to detect extreme events (see Table 1). However, these often fail to identify the same events as expert or impact-based detection methods Jones_2023 . AI methodologies can help reconcile expert knowledge with data-driven approaches as they capture regularities and nuanced relations in large volumes of observational data. Canonical ML treats the detection problem as a one-class problem or an outlier detection problem. Many methods are thus applied flach2017multivariate , and available in software packages aidetoolbox . Recent advances include deep learning for segmentation and detection of tropical cyclones (TCs) and atmospheric rivers (ARs) in high-resolution climate model output karaismailoglu2021climatenet and semi-supervised localization of extremes racah2017extremeweather . Alternatively, reconstruction-based models (e.g., with autoencoders) are optimized to accurately reconstruct normal data instances, and thus extremes are associated with large reconstruction errors GuancheGarca2018 . Finally, probabilistic approaches try to identify extremes by estimating the data probability density function (PDF) or specific quantiles thereof. Standard Extreme value theory (EVT) approaches, however, struggle with short time-series data, nonlinearities, and nonstationary processes. Alternative probabilistic (ML) models have relied on surrogate meteorological and hydrological seasonal re-forecasts Klehmet2024 , Gaussian processes and nonlinear dependence measures Johnson20sakame and multivariate Gaussianization Johnson20rbig4eo .

2.1.2 Prediction

Designing predictive systems that accurately model extreme events is essential to anticipating the effects of future extreme events and providing critical information for decision-makers to prevent damages. Spatial and temporal predictions aim to provide a quantitative estimate of the future value of the Earth’s state (Table 1).

Many ML algorithms have been proposed for deterministic extreme event prediction, but most are only applied to small regions and specific use cases, though. Prediction can be performed using climate variables alone vijverberg2022role or in combination with satellite imagery kladny2024enhanced . Here, for example, the vegetation response under extreme drought conditions is modeled as a video prediction task of the satellite imagery while conditioning on the climate variables. Another common approach is to directly estimate an indicator that defines the extreme event, e.g. flood risk maps bentivoglio2022deep or drought indices belayneh2014long . The estimated variables can cover different lead times depending on the characteristics of the extreme, spanning from short-term prediction to seasonal prediction kondylatos_wildfire_2022 . Recently, DL-based prediction techniques have gained popularity for their ability to process large data volumes, capture complex nonlinear relationships, and reduce manual feature engineering. These benefits have led to the creation of global models that generalize across various locations, as seen in flood nearing2024global and wildfire zhang2021deep predictions.

Probabilistic models, unlike deterministic ones, focus on predicting the probability distribution of future variable states. The importance of probabilistic forecasts for extreme heatwaves has been emphasized miloshevich2023probabilistic . AI-based techniques, integrated within climate models, can enhance predictions, such as in drought prediction vo2023lstm and extreme convective precipitations downscaling shi2020enabling .

2.1.3 Impact Assessment

Estimating the effects of extreme events on society, the economy, and the environment is crucial for conveying potential future consequences to the public, policymakers, and across disciplines callaghan2021machine ; zennaro2021exploring . Impact assessment involves understanding how a system reacts to extreme event forcings. Unlike extreme event detection and prediction, the focus here is on impact-related outcomes, such as the number of injuries, households affected, or crop losses.

In recent years, there has been an increasing interest in predicting vegetation state using ML ferchichi2022 , as a way to estimate the impact of climatic extremes on the evolution of the vegetation state variable sutanto2019moving ; salakpi2022forecasting . Recent advances methods have used echo state networks martinuzzi2023learning , ConvLSTM-based ahmad2023machine ; kladny2024enhanced , and transformers benson2023forecasting using high-resolution remote sensing and climate data.

Another way to address impact assessment with ML is to analyze changes in the probability density function over time. This approach allows a quantification of the impact of different events, which can be used to improve our understanding of the drivers of vulnerability, zhang2021impact to such as population displacement ronco2023exploring . Alternatively, the impact of extreme events can be detected by analyzing the news coverage based on natural language processing (NLP) and, more recently, on large language models (LLMs) Sodoge2023 .

Refer to caption
Figure 2: Components in an AI pipeline for extreme events. AI mainly exploits spatio-temporal Earth observation, reanalysis, and climate data to answer “what’-questions (top row): detection of events, prediction, and impact assessment. AI can also be used for understanding events and thus answer “what if,” “why,” and “how sure” questions (middle row) and makes use of explainable AI (XAI) to identify relevant drivers of events, causality to understanding the system, estimate causal effects and impacts, and imagine counterfactual scenarios for attribution and uncertainty estimation to quantify trust and robustness for decision-making. Communicating extreme events and their impacts can benefit from statistical/machine learning (bottom row) by improving operationalization, ensuring fair and equitable narratives, and integrating language models in situation rooms for enhanced decision-making.

2.2 Extreme events understanding and trustworthiness

Yet, all previous approaches focus on the ‘what,’ ‘when,’ and ‘where’ questions, not the ‘why,’ ‘what if,’ and ‘how certain’ ones. In environmental sciences, there’s an increasing emphasis on the need for trustworthy machine learning bostromTrustTrustworthy . This is especially important in extreme events, where critical high-stakes decisions need to be made impacting public safety, health, infrastructure, and resource allocation. Disciplines like explainable AI (xAI) and uncertainty quantification (UQ) offer methods to make AI more reliable and trustworthy (see Figure 2 [middle row]). These approaches not only help us interpret AI model predictions but also enhance our understanding of extreme events themselves. Techniques such as causal inference and extreme event attribution further enhance xAI and UQ by understanding the mechanisms behind these events, crucial for improving AI models and gaining trust in the decision-making process.

2.2.1 Extreme Event Attribution

Extreme event attribution (EEA) quantifies the influence of anthropogenic forcings (such as greenhouse gas emissions) on the likelihood of extreme climate events by using numerical simulations with General Circulation Models (GCM) to compare their probabilities under observed conditions (the factual world) and a hypothetical scenario without human emissions (the counterfactual world) naveau2020statistical (cf. Table 1). Two main viewpoints exist: probabilistic EEA employs quantitative statistical methods to estimate this likelihood (e.g., the EE is 600 times more likely due to human emissions), while storyline approaches simulate the evolution of the EE under different forcings to gather a process-based attribution statement (e.g., 50% of the magnitude of an EE is explained by natural variability) otto2023attribution . Ideally, both methods should be combined to provide a complete understanding.

Ensembles of neural networks have been developed to emulate GCMs, creating surrogate models that operate significantly faster pasini2017attribution . These models successfully predict the year based on the global annual temperature field under present conditions but struggle to perform accurately in hypothetical scenarios resembling pre-industrial conditions barnes2019viewing . Despite the scarce literature on this topic, the impressive recent advances in weather and climate emulation with AI Lam2023 suggest that climate emulators will play a significant role shortly, allowing almost real-time EEA.

2.2.2 Explainable AI

Many models are transparent and interpretable-by-design, like linear models or decision trees, but they may not perform well on complex problems and are rarely employed in extreme events (cf. Table 1). Explainable AI (xAI) aims to unveil the decision-making process of AI models. xAI also facilitates debugging, improving models, and gathering scientific insight by revealing the model functioning, learned relationships, and biases. The most commonly used xAI approaches rely on model-agnostic distillation or feature attribution methods ghaffarianExplainableArtificial2023 ; Tuia24perspective , cf. Fig. 2. Distillation methods, such as SHapley Additive exPlanations (SHAP) and Local interpretable model-agnostic explanations (LIME), create surrogate models and have been widely used in geosciences and climate sciences schlund2020constraining ; ronco2023exploring . Feature attribution methods, like Partial Dependence Plot (PDP) or Gradient-based Class Activation Map (Grad-CAM), highlight important features by perturbing the inputs or using backpropagation srinivasan2020machine . Recent approaches explain DL models using attention in drought prediction dikshitSolvingTransparency2022a and prototypes for explaining event localization barnesThisLooks2022 . xAI have been also used to evaluate climate predictions FindingtherightXAIMethodBommer2024 ; mamalakisInvestigatingFidelity2022 , and thus offering an agnostic, data-driven approach for model-data intercomparison.

2.2.3 Causal inference

Causal graphs encode intuitive information about robust system properties. This information is further required for deciding the identifiability of queries about interventional or counterfactual properties (pearl2017causality, ; Peters2017elements, ). For extreme events, most widespread causal inference methods Runge2023 ; CampsValls23physcausaldiscovery cannot be applied directly, as traditional assumptions of normality and mild outliers do not hold, and dependence may manifest only in the extremes (Table 1). This has motivated recent works that deal with causal discovery for extreme event analysis, providing different frameworks for understanding the dependencies and causal structures in extreme values Gnecco2019CausalDI , with applications on, e.g. rivers discharge Pasche2021CausalMO . Another relevant line of work focuses on answering counterfactual questions for climate extreme events, which links to EEA (Hannart2016CausalCT, ; Kiriliouk2019ClimateEE, ).

2.2.4 Uncertainty quantification

Even equipped with explanatory and causal methods, it is still crucial to assess how confident are AI model decisions, as inaccurate warnings or decisions could impact safety and resources (gawlikowski2023survey, ; ghanem2017, ). Understanding the sources of uncertainty is important to inform and disentangle the inherent (aleatoric) uncertainty of the weather phenomenon from the lack of knowledge in the model (epistemic) uncertainty. One can reduce the latter with more data and additional assumptions, but not the former (cf. Table 1). DL approaches with UQ have shown promise for extreme events recently miloshevich2023probabilistic ; xu2022quantifying .

2.3 The Last Mile: operationalization, communication, ethics and decision-making

The previous components (methods and techniques) in the AI pipeline need to be operationalized, be robust, accountable and fair, so they ultimately serve the purpose of evidence-based policy making (see Figure 2 [bottom row]).

2.3.1 Operationalization

Operationalization strictly needs the previous layer in place (xAI, causality, UQ) for improved and accountable explanations of the predictions in early warning systems (EWSs), and for enhanced disaster risk management. Calibration of ML models bella2010calibration , especially when coupled with uncertainty estimation, aligns predicted probabilities to reflect the real-world likelihood of extreme outcomes, enhancing reliability and interpretability essential for informed decision-making. Poor calibration can lead to misguided decisions, such as overestimating or underestimating the likelihood of critical events.

2.3.2 Communication of risk and ethical aspects

History shows that even actionable forecasts can fail if not communicated properly. For instance, despite predictions of the Mediterranean storm Daniel’s landfall four days in advance, the lack of effective communication contributed to the tragic outcome in Libya, with severe casualties and displacement corps2023libya . Even in the more developed country of Germany, the devastating effects of floods were related to ineffective warnings dwd2021weather . This highlights the critical need for robust EWSs that predict events and effectively communicate risks to ensure community preparedness and response (Table 1).

The challenge of false alarms in the context of EWS is a significant challenge, as they can lead to “warning fatigue”, where the public becomes desensitized to alerts and may ignore crucial warnings during actual emergencies tojcic2021performance ; yore2020early ; reichstein2023ews . Addressing this issue requires improving the accuracy and trust in predictive models, refining communication strategies, and engaging the community tamamadin2020automation .

2.3.3 Ethics in AI models and data

The governance of AI ethics calls for systems to be respectful of human dignity, ensure security, and support democratic values RNY004 . The deployment of AI to help manage extremes involves several fundamental principles: ensuring fairness, maintaining privacy, and achieving transparency RNY003 ; kochupillai2022earth . In this context, the rise of Large Language Models (LLMs) heightens ethical risks like outdated or inaccurate data, bias, errors, and misinformation, often obscured by their vast training on web data. Additionally, current AI models (like generative AI and LLMs) often draw on large datasets, so there is a risk of strong biases if these datasets are unrepresentative Tuia24perspective . Spatial sampling and analysis are vital to collecting geographically and environmentally representative, fair, and unbiased data. Global initiatives like the Common Alerting Protocol have standardized warning data, but the success of these systems depends on their inclusivity and ability to adapt to the diverse needs of affected communities, who should participate in develo** EWSs macherera2016review ; reichstein2023ews . AI enhances these systems by enabling the rapid dissemination of personalized alerts tailored to specific locations and individual risk factors like proximity to flood plains or wildfire zones, ensuring warnings are understandable and relevant to all.

Achieving the inclusion of affected communities, especially in the Global South, still remains a significant challenge, as centralized, ’one-size-fits-all’ models are often simpler and cheaper to implement but less effective at addressing local nuances. Designing AI systems, building on large-language models fine-tuned to local communities with users in the loop, offers excellent opportunities to overcome “one-size-fits-all” while being efficient.

2.3.4 Policy and decision-making

Even if AI-assisted, human operators are in charge of implementing the final decision. This implies supporting end-users in making the most out of the generated information Yokota2004 . This operational value is often problem-specific and depends upon the system’s dominant dynamics and the socio-economic sectors considered. Quantifying this value, however, might not be straightforward as a more accurate prediction does not necessarily imply a better decision by the end users Ramos2013 . When multiple forecasts from different systems are available, users should address a number of problems where AI can be helpful, including the selection of the forecast product, the lead time, the variable aggregation, the bias correction, and how to cope with the forecast uncertainty.

Integrating the pipeline in Fig. 1 with impact models that leverage Reinforcement Learning algorithms to simulate optimal decision-making can help quantify how AI-enhanced information about extreme events translates into better decisions Giuliani2015ISA . The co-design of impact models via participatory processes, including end users in the loop, further strengthens the overall model-based investigation by better capturing end users’ requirements, expectations and concerns. End users should understand both the benefits of AI-enhanced information for improved decision-making and the risks of misinformation leading to poor choices.

3 Data, model and integration challenges

Extreme event analysis faces many important challenges related to the data and model characteristics, but also to the integration of AI in decision pipelines (see Table 2.

Table 2: Challenges and risks of using AI for extreme event analysis, and needed advances in AI to tackle them. The same blocks before related to data, model and integration challenges are studied.

[Uncaptioned image]

3.1 Data challenges

A major challenge is the lack of sufficient data with expert annotations, which are essential for training and evaluating AI models. Given their rarity, extreme events can be overlooked during the data preprocessing steps to eliminate noise, gaps, biases, and inconsistencies camps2021deep ; Reichstein19nat . Additionally, AI struggles with integrating and extracting relevant information across various data sources and scales, which can complicate feature extraction and selection Tuia24perspective . Future AI development needs to focus on deriving robust features (or representations) that effectively capture the distinct characteristics of extreme events (cf. Table 2).

AI models are increasingly used to enhance parameterizations of subgrid-scale processes in Earth system models, addressing gaps in traditional methods. However, a key challenge is their numerical instability over extended time horizons, which may produce unrealistic scenarios when simulating extremes due to insufficient training data iles2020benefits . The quality of observational data also challenges AI methods used for data assimilation. Recently, hybrid ML models that combine domain-driven with data-driven models approaches promise more robust and trustworthy AI models cheng2023machine ; Reichstein19nat . Moreover, AI-based methods have improved error characterization, enhancing uncertainty quantification Haynes2023 . Generative models are also used to sample ensemble members more effectively, offering better representations of the system’s state Li2022 (cf. Table 2).

3.2 Model challenges

The lack of a clear statistical definition of extreme events and the mechanisms responsible for their occurrence hamper model development and adoption. For detection, extremes usually constitute not pointwise but complex contextual, group, or conditional anomalies, whose source (changing process, parents, distribution) is often unknown Tuia24perspective . This results in challenges, such as capturing subtle (new) patterns, setting adaptive thresholds, or integrating data across distant points in space and time (cf. Table 2). For prediction and impact assessment, AI models are sensitive to initial conditions and may not capture long-term dependencies camps2021deep . Moreover, data might not reveal the dynamics of extremes, there might be changes to unseen dynamics flach2017multivariate , and stationarity may not hold in general. Using hybrid models that combine data, domain knowledge and models could allow insights into the mechanisms that trigger extreme events Reichstein19nat ; farazmand2019extreme ; cheng2023machine .

The complexity of extremes also makes attribution, causal discovery, and explainability particularly challenging. xAI can only reveal correlations the model learned and has no information about the causal structure. This could lead xAI to exacerbate model biases or spurious correlations. Indeed, different xAI methods can produce very different explanations, whose suitability for different models should be quantitatively evaluated FindingtherightXAIMethodBommer2024 . Causality is not error-free, as a wrong assumption about where the extreme comes from could lead to wrong causal graphs, conclusions and decisions pearl2017causality . Finally, challenges in UQ include differentiating and quantifying aleatoric and epistemic uncertainties, which is complicated by the models’ overparameterization and their lack of robust probabilistic foundations (cf. Table 2).

3.3 Integration challenges

ML models are typically trained on high-quality, well-curated datasets, such as Copernicus ERA 5 reanalysis or cloud-free satellite imagery, which often do not reflect the error-prone meteorological forecasts and cloudy conditions encountered in real-world situations (Table 2). Applying domain adaptation strategies or leveraging invariant features could align model performance from training phases to operational conditions. Besides, leveraging proprietary and trusted geospatial data from operational stakeholders, such as detailed forest fuel maps and elevation models, allows for fine-tuning these models to enhance detection and forecasting accuracy and enable finer spatial and temporal resolution in output products.

4 Case Studies

We showcase four case studies on drought, heatwaves, wildfires, and floods, each of them covering the different aspects of detection, forecasting, impact, explainability, attribution, and communication of risk (Fig. 3).

Refer to caption

Figure 3: Summary of case studies using AI to manage extreme events. Four use cases (drought, heatwaves, wildfires, and floods) are showcased where AI enables detection, forecasting, impact assessment, explanation, understanding, and communication of risk, providing a comprehensive solution for disaster management. (a) Droughts. Top: AI leverages multimodal data to predict Earth’s surface dynamics, enhancing forecasts for crop yields, forest health, and drought impacts. Bottom: XAI techniques, like “neuron integrated gradients,” elucidate the key factors driving severe drought conditions, highlighting variable interactions over time. (b) Heatwaves. Top: Variables of interest are extracted from heterogeneous data sources (images, time series, text) and potentially aggregated over space and/or time. Bottom Left: Relevant features can be extracted from the data using e.g., clustering techniques. Right: Prediction of heatwaves can be done in combination with dimensionality reduction tools or directly from the selected features. (c) Wildfires. Top: AI enhances understanding and prediction of wildfire dynamics, particularly for mega-fires intensified by global warming, by analyzing extensive datasets and differentiating fire types with XAI. Bottom: AI combined with causal inference aims to better detect and understand pyrocumulonimbus clouds, intense storm systems generated by large wildfires that complicate fire behavior prediction. (d) Floods. AI transforms flood risk communication by using realistic 3D visualizations and animations to depict rising water levels’ impact on communities and infrastructure, making the information more relatable (thisclimatedoesnotexist.com). AI-driven platforms analyze vast amounts of data from weather forecasts, river levels, and historical flood patterns to predict future events accurately, integrating this information with digital maps and urban models to identify high-risk areas (climate-viz.github.io.com). This approach enhances flood risk management by allowing for targeted, personalized communication, enabling residents to receive specific alerts and visualize potential impacts on their homes. AI also supports the generation of detailed flood reports from various sources, enhancing preparedness and mitigation efforts (floodbrain.com).

4.1 Droughts: detection, forecasting and explainability

Droughts are among the costliest natural hazards, having destructive effects on the ecological environment, agricultural production, and socioeconomic conditions. They have been traditionally approached with heuristic thresholds and simple parametric models on related variables (e.g., soil moisture, vegetation indices, precipitation, or temperature, etc.). Being drought monitoring historically purely based on station-based measurements, Earth observation technology (e.g., Sentinel and Landsat series) and remote sensing allowed for estimating drought-related variables over larger spatial and temporal scales west2019remote . However, drought definitions presented in the literature (meteorological, hydrological, agricultural, operational, or socioeconomic) are usually subjective and limited zargar2011review . These types are not independent but refer to complementary physical, chemical, and biological processes of the Earth system droughts involve. Although empirical multivariate drought indicators have been proposed to account for these dependencies, the complexity of the combination of drought processes makes their detection, prediction, and characterization (severity, duration, start-end, etc.) a burden to researchers and policymakers, which can benefit from data-driven AI approaches.

Traditional ML approaches have been successfully applied to drought prediction, from support vector machines, decision trees, and random forests to more advanced multivariate density estimation approaches jehanzaib2021comprehensive ; 2024mengxue ; Johnson20rbig4eo . However, DL algorithms are required for efficiently modeling spatial and temporal correlations in data. To this end, neural networks have been used for drought monitoring in a supervised way, mainly using multilayer, convolutional, and/or recurrent neural networks aidetoolbox . Drought detection is an unsupervised problem due to its spatiotemporal variability and the shifting definition of drought under climate change. Thus, unsupervised and semi-supervised methods are recommended for practical applications ruff2021unifying .

To overcome some of the challenges before, recent works have applied AI methods to process multimodal data (satellite imagery at different resolutions, mesoscale climate variables, and static features). Drought detection has been addressed at the European level employing domain knowledge-driven variational autoencoders (VAEs) 2024mengxue , which combine traditional drought indicators with climate data. Forecasting the dynamics of the Earth’s surface with AI may help predict crop yield, forest health, and the effects of drought events benson2023forecasting (Fig. 3a, top). Similar DL drought detectors can be explained with xAI, identifying the most salient regions in time and space and attributing their main drivers dikshit2021interpretable (Fig. 3a, bottom).

4.2 Heatwaves: prediction, attribution and understanding

In the context of climate change, heat extremes are becoming more frequent and intense barriopedro_heat_2023 , but still regional trends are uncertain teng2022warming . For example, a notable warming hotspot in western Europe was identified with heat extremes increasing faster than predicted by the current state-of-the-art climate models vautard_heat_2023 . Understanding the physical drivers of heat extremes is crucial for accurate seasonal prediction and projection of these events. Several drivers of heat extremes remain uncertain though, specifically long-term changes in atmospheric circulation as well as land-atmosphere interactions, that may amplify or enhance specific events barriopedro_heat_2023 .

AI methods can accurately detect and predict heat extremes on different spatial and temporal scales, some lead times and aggregations using deep learning chattopadhyay_analog_2020 ; jacques2022deep ; fister_accurate_2023 , causal-informed ridge regression vijverberg2022role , or hybrid models van_straaten_CorrectingSubseasonal_2023 , among others. These approaches often require dimensionality reduction tools to select predictors from high-dimensional climate data (Fig. 3c). Expressive lower-dimensional feature representations have been extracted from high-dimensional data using VAEs, which can be used to improve the prediction of heat extremes miloshevich2023probabilistic . New methods, such as the EarthFormer, explore transformer networks specifically for the prediction of temperature anomalies, where encoder/decoder structures are combined with spatial attention layers gao2023earthformer . Next to predicting heatwaves, physical drivers of heatwaves can be inferred from the ML models as well. For example, XAI tools can be applied to discover physical drivers of heat extremes, e.g. van_straaten_CorrectingSubseasonal_2023 . Apart from XAI, causal inference methods have been used to understand the drivers of extreme temperatures vijverberg2022role .

Attribution studies for temperature and precipitation have increasingly used AI techniques barnes2019viewing . When it comes to heatwaves the identification of circulation-induced vs. thermodynamical changes becomes crucial. Several statistical and ML methods are currently in use for this purpose singh_circulation_2023 . The next vital step for attribution studies is to use such ML methods to understand the contribution of dynamical changes to the observed trends. This will lighten the discrepancies between circulation trends in climate model observations and reanalysis vautard_heat_2023 . Finally, recent development shows that ML techniques can be successfully applied for rare event modeling, sampling, and prediction miloshevich2023probabilistic .

4.3 Wildfires: predict, explain and understand

Most wildfire damage stems from a few extreme events. Predicting these events is vital for effective fire management and ecosystem conservation. Climate change is expected to increase the frequency, size, and severity of extreme wildfires, exacerbating fire weather conditions jones2022global . Traditional firefighting approaches are increasingly inadequate, with many extreme fires burning until naturally extinguished el_garroussi_europe_2024 . The complexity of nonlinear interactions among fire drivers across various scales hinders the predictability of fire behavior. Therefore, enhancing models to better understand and predict conditions leading to large, uncontrollable fires is crucial.

DL methods can associate weather predictions, satellite observations, and burned area datasets to model wildfires achieving better predictability than conventional methods. DL has been used successfully to forecast wildfire danger kondylatos_wildfire_2022 and for susceptibility map** zhang_forest_2019 . In the short term, wildfires are driven by the daily weather variations but also by the cumulative effects of vegetation and drought. On sub-seasonal to seasonal scales, where weather forecasts are less reliable, wildfires are modulated by Earth system large-scale processes, such as teleconnections. Early work has shown that DL models can leverage information from teleconnections to improve long-term wildfire forecasting li2023attentionfire_v1 . Apart from predicting wildfires, it is very important to understand the reasons behind the predictions to support decision-making and fire management. In this context, xAI can aid in identifying what drives wildfires, supporting, for example, the differentiation and management of various fire types, such as wind-driven and drought-driven fires kondylatos_wildfire_2022 .

Under global warming el_garroussi_europe_2024 , in the coming years, we expect an increase in the frequency of unstable atmospheric conditions that can cause Pyrocumulonimbus (pyroCbs), i.e., storm clouds that generate their weather fronts and can make wildfire behavior unpredictable fromm_understanding_2022 . Despite the risk posed by pyroCbs, the conditions leading to their occurrence and evolution are still poorly understood, and their causal mechanisms are uncertain. AI, in combination with causal inference, can advance the detection, forecasting, and understanding of the drivers of pyroCb events salas2022identifying .

4.4 Floods: from detection to communication of risk

Studying floods is crucial as they are the most frequent and costly natural disasters, affecting millions annually and causing over $40 billion in damages worldwide each year Jonkman2005Global . Develo** novel methods for flood detection enhances EWSs, reducing fatalities by up to 40%, while effective risk communication ensures better preparedness and response, potentially saving thousands of lives.

In July 2021, intense rainfall in Germany, Belgium, France, and the UK triggered severe flash floods, particularly devastating the Ahr River region in Germany, claiming nearly 200200200200 lives cornwall2021europe ; mohr2023multi . Alerts were issued by the European Flood Awareness System (EFAS) and disseminated through an international warning system. However, the floods exposed significant flaws in these systems (Fig. 3d). Damage to river monitoring infrastructure impeded data accuracy, and although meteorological forecasts predicted the rainfall two days in advance, flood predictions for smaller basins lacked precision and failed to consider debris flow and morphodynamics, leading to underestimations of flood severity.

AI offers promising avenues for enhancing flood management systems. Advanced global meteorological forecasting models powered by AI can rapidly process large data ensembles, providing more accurate probabilistic estimates even during extreme weather events. Furthermore, AI techniques such as ML-accelerated computational fluid dynamics could address the computational challenges in hydro-morpho-dynamic modeling, allowing for more precise predictions of stream flow and flood levels, particularly in ungauged basins kochkov2021machine .

Additionally, AI can aid in calibrating non-contact video gauges, potentially more robust than traditional methods, as they are not directly affected by the water. AI can also guide forensic analysis to evaluate exposure and vulnerability, employing multi-modal approaches to refine geospatial models at a local scale. However, the effectiveness of these AI-driven models depends on detailed knowledge of the local terrain, including potential bottlenecks like bridges and channels, and accurate data about societal vulnerability to floods.

Yet, AI can also transform how warnings are issued to improve communication and response strategies. For instance, AI-generated maps and photorealistic visualizations based on digital elevation models can depict expected inundation areas and damages lutjens2023physicallyconsistent . Moreover, AI can generate easily understandable, language-based warnings, both written and auditory, tailored for diverse populations, including the visually impaired. An LLM-based chatbot feature could enhance interactivity, providing real-time, personalized responses to emergency inquiries (see examples in Fig. 3d).

5 Conclusions and Outlook

This review highlighted the significant potential of AI for analyzing and modeling extreme events while also detailing the main difficulties and prospects associated with this emerging field. Integrating AI into extreme event analysis faces several challenges, including data management issues like handling dynamic datasets, biases, and high dimensionality that complicate feature extraction. AI models also struggle with unclear statistical definitions of what is ‘extreme’. Furthermore, integrating AI with physical models poses substantial challenges yet offers promising opportunities for enhancing model accuracy and reliability. Trustworthiness concerns arise from the complexity and interpretability of ML models, the difficulty of generalizing across different contexts, and the quantification of uncertainty. Operational challenges include the complexity of AI outputs which hinder interpretation by non-experts, resistance to AI adoption due to concerns over reliability and fairness, and the need for frameworks that facilitate transparent and ethical integration of AI insights into decision-making processes.

The previous challenges compromise reproducibility and comparability of ML models in analyzing extreme events. These challenges are exacerbated by data scarcity, lack of transparency in model configurations, and the use of proprietary tools. Additionally, interdisciplinary differences hinder consistency and comparability. Effective solutions demand robust, transparent methodologies, inclusive data-sharing practices, and frameworks that support cross-disciplinary collaboration.

We overviewed and emphasized the importance of develo** operational, explainable, and trustworthy AI systems. Addressing these challenges requires a coordinated effort across disciplines involving AI researchers, environmental and climate scientists, field experts, and policymakers. This collaborative approach is crucial for advancing AI applications in extreme event analysis and ensuring that these technologies are adapted to real-world needs and constraints. From an operational perspective, adapting AI solutions to real-time data integration, model deployment, and resource allocation highlights the need for systems that can function within the operational frameworks of disaster management and risk mitigation. Besides, methodological improvements are still needed in model evaluation and benchmarking to alleviate issues like overfitting and enhance AI systems’ generalization capabilities.

Looking forward, there are significant areas for further exploration and improvement. These include the development of benchmarks specific to extreme events, enhanced integration of domain knowledge to improve data fusion and model training, and the creation of robust, scalable AI systems capable of adapting to the dynamic nature of extreme events. Recent LLMs harvest vast amounts of domain knowledge embedded in the literature, promising significant advances in communicating risk.

As we advance, the ultimate goal is to harness AI’s potential to substantially benefit society, particularly by enhancing our capacity to manage and respond to extreme events. Through dedicated research and collaborative innovation, AI can become a cornerstone in our strategy to understand and mitigate the impacts of these challenging and elusive phenomena.

References

  • \bibcommenthead
  • (1) Seneviratne, S.I., Zhang, X., Adnan, M., Badi, W., Dereczynski, C., Di Luca, A., Ghosh, S., Iskander, I., Kossin, J., Lewis, S., et al.: Weather and climate extreme events in a changing climate (chapter 11) (2021)
  • (2) Gonzalez-Calabuig, M., Cortés-Andrés, J., Williams, T., Zhang, M., Pellicer-Valero, O.J., Fernández-Torres, M.-Á., Camps-Valls, G.: The AIDE Toolbox: AI for Disentangling Extreme Events. IEEE Geoscience and Remote Sensing Magazine 12(3), 1–8 (2024). https://doi.org/10.1109/MGRS.2024.3382544
  • (3) Lam, R., Sanchez-Gonzalez, A., Willson, M., Wirnsberger, P., Fortunato, M., Alet, F., Ravuri, S., Ewalds, T., Eaton-Rosen, Z., Hu, W., Merose, A., Hoyer, S., Holland, G., Vinyals, O., Stott, J., Pritzel, A., Mohamed, S., Battaglia, P.: Learning skillful medium-range global weather forecasting. Science 382, 1416–1422 (2023)
  • (4) Ferchichi, A., Abbes, A.B., Barra, V., Farah, I.R.: Forecasting vegetation indices from spatio-temporal remotely sensed data using deep learning-based approaches: A systematic literature review. Ecological Informatics, 101552 (2022). https://doi.org/10.1016/j.ecoinf.2022.101552
  • (5) Ragone, F., Bouchet, F.: Rare event algorithm study of extreme warm summers and heatwaves over europe. Geophysical Research Letters 48(12), 2020–091197 (2021)
  • (6) Yokota, F., Thompson, K.M.: Value of information analysis in environmental health risk management decisions: past, present, and future. Risk analysis 24(3), 635–650 (2004)
  • (7) Salcedo-Sanz, S., Pérez-Aracil, J., Ascenso, G., Del Ser, J., Casillas-Pérez, D., Kadow, C., Fister, D., Barriopedro, D., García-Herrera, R., Giuliani, M., Castelletti, A.: Analysis, characterization, prediction, and attribution of extreme atmospheric events with machine learning and deep learning techniques: a review. Theoretical and Applied Climatology 155(1), 1–44 (2024). https://doi.org/10.1007/s00704-023-04571-5
  • (8) Gawlikowski, J., Tassi, C.R.N., Ali, M., Lee, J., Humt, M., Feng, J., Kruspe, A., Triebel, R., Jung, P., Roscher, R., Shahzad, M., Yang, W., Bamler, R., Zhu, X.X.: A survey of uncertainty in deep neural networks. Artificial Intelligence Review 56(Suppl 1), 1513–1589 (2023) arXiv:2107.03342. https://doi.org/10.1007/s10462-023-10562-9
  • (9) Harrington, L.J., Schleussner, C.F., Otto, F.E.L.: Quantifying uncertainty in aggregated climate change risk assessments. Nature Communications 12(1), 7140 (2021). https://doi.org/10.1038/s41467-021-27491-2
  • (10) Stott, P.A., Christidis, N., Otto, F.E.L., Sun, Y., Vanderlinden, J.P., van Oldenborgh, G.J., Vautard, R., von Storch, H., Walton, P., Yiou, P., Zwiers, F.W.: Attribution of extreme weather and climate-related events. Wiley Interdisciplinary Reviews: Climate Change 7(1), 23–41 (2016). https://doi.org/10.1002/wcc.380
  • (11) Madakumbura, G.D., Thackeray, C.W., Norris, J., Goldenson, N., Hall, A.: Anthropogenic influence on extreme precipitation over global land areas seen in multiple observational datasets. Nature Communications 12(1), 3944 (2021)
  • (12) Chattopadhyay, A., Nabizadeh, E., Hassanzadeh, P.: Analog forecasting of extreme-causing weather patterns using deep learning. Journal of Advances in Modeling Earth Systems 12(2), 2019–001958. https://doi.org/10.1029/2019MS001958
  • (13) Bommer, P.L., Kretschmer, M., Hedstrom, A., Bareeva, D., Hohne, M.M.-C.: Finding the right XAI method — A guide for the evaluation and ranking of explainable ai methods in climate science. Artificial Intelligence for the Earth Systems (2024). https://doi.org/10.1175/AIES-D-23-0074.1
  • (14) Hannart, A., Pearl, J., Otto, F.E.L., Naveau, P., Ghil, M.: Causal counterfactual theory for the attribution of weather and climate-related events. Bulletin of the American Meteorological Society 97, 99–110 (2016)
  • (15) Vaghefi, S.A., Stammbach, D., Muccione, V., Bingler, J., Ni, J., Kraus, M., Allen, S., Colesanti-Senni, C., Wekhof, T., Schimanski, T., et al.: ChatClimate: Grounding conversational AI in climate science. Communications Earth & Environment 4(1), 480 (2023)
  • (16) Ruff, L., Kauffmann, J.R., Vandermeulen, R.A., Montavon, G., Samek, W., Kloft, M., Dietterich, T.G., Muller, K.-R.: A unifying review of deep and shallow anomaly detection. Proceedings of the IEEE 109(5), 756–795 (2021)
  • (17) Han, Z., Zhao, J., Leung, H., Ma, K.F., Wang, W.: A review of deep learning models for time series prediction. IEEE Sensors Journal 21(6), 7833–7848 (2019)
  • (18) Zennaro, F., Furlan, E., Simeoni, C., Torresan, S., Aslan, S., Critto, A., Marcomini, A.: Exploring machine learning potential for climate change risk assessment. Earth-Science Reviews 220, 103752 (2021). https://doi.org/10.1016/j.earscirev.2021.103752
  • (19) Jones, R.L., Kharb, A., Tubeuf, S.: The untold story of missing data in disaster research: a systematic review of the empirical literature utilising the emergency events database (EM-DAT). Environmental Research Letters 18(10), 103006 (2023). https://doi.org/10.1088/1748-9326/acfd42
  • (20) Flach, M., Gans, F., Brenning, A., Denzler, J., Reichstein, M., Rodner, E., Bathiany, S., Bodesheim, P., Guanche, Y., Sippel, S., et al.: Multivariate anomaly detection for Earth observations: a comparison of algorithms and feature extraction techniques. Earth System Dynamics 8(3), 677–696 (2017)
  • (21) Prabhat, Kashinath, K., Mudigonda, M., Kim, S., Kapp-Schwoerer, L., Graubner, A., Karaismailoglu, E., von Kleist, L., Kurth, T., Greiner, A., et al.: ClimateNet: An expert-labelled open dataset and Deep Learning architecture for enabling high-precision analyses of extreme weather. Geoscientific Model Development Discussions 2020, 1–28 (2020)
  • (22) Racah, E., Beckham, C., Maharaj, T., Ebrahimi Kahou, S., Prabhat, M., Pal, C.: Extremeweather: A large-scale climate dataset for semi-supervised detection, localization, and understanding of extreme weather events. Advances in Neural Information Processing Systems (NeurIPS) 30 (2017)
  • (23) Guanche García, Y., Shadaydeh, M., Mahecha, M., Denzler, J.: Extreme anomaly event detection in biosphere using linear regression and a spatiotemporal mrf model. Natural Hazards 98(3), 849–867 (2018). https://doi.org/10.1007/s11069-018-3415-8
  • (24) Klehmet, K., Berg, P., Bozhinova, D., Crochemore, L., Du, Y., Pechlivanidis, I., Photiadou, C., Yang, W.: Robustness of hydrometeorological extremes in surrogated seasonal forecasts. International Journal of Climatology 44(5), 1725–1738 (2024). https://doi.org/10.1002/joc.8407
  • (25) Johnson, J.E., Laparra, V., Pérez-Suay, A., Mahecha, M., Camps-Valls, G.: Kernel methods and their derivatives: Concept and perspectives for the Earth system sciences. PLOS One 15(10), 0235885 (2020)
  • (26) Johnson, J.E., Laparra, V., Piles, M., Camps-Valls, G.: Gaussianizing the Earth: Multidimensional information measures for Earth data analysis. IEEE Geoscience and Remote Sensing Magazine 9(4), 191–208 (2021). https://doi.org/10.1109/MGRS.2021.3066260
  • (27) Vijverberg, S., Coumou, D.: The role of the Pacific Decadal Oscillation and ocean-atmosphere interactions in driving US temperature variability. npj Climate and Atmospheric Science 5(1), 18 (2022). https://doi.org/10.1038/s41612-022-00237-7
  • (28) Kladny, K.-R., Milanta, M., Mraz, O., Hufkens, K., Stocker, B.D.: Enhanced prediction of vegetation responses to extreme drought using deep learning and Earth observation data. Ecological Informatics, 102474 (2024)
  • (29) Bentivoglio, R., Isufi, E., Jonkman, S.N., Taormina, R.: Deep learning methods for flood map**: a review of existing applications and future research directions. Hydrology and Earth system sciences 26(16), 4345–4378 (2022)
  • (30) Belayneh, A., Adamowski, J., Khalil, B., Ozga-Zielinski, B.: Long-term SPI drought forecasting in the Awash River Basin in Ethiopia using wavelet neural network and wavelet support vector regression models. Journal of Hydrology 508, 418–429 (2014)
  • (31) Kondylatos, S., Prapas, I., Ronco, M., Papoutsis, I., Camps-Valls, G., Piles, M., Fernández-Torres, M.-A., Carvalhais, N.: Wildfire Danger Prediction and Understanding With Deep Learning. Geophysical Research Letters 49(17), 2022–099368 (2022). https://doi.org/10.1029/2022GL099368
  • (32) Nearing, G., Cohen, D., Dube, V., Gauch, M., Gilon, O., Harrigan, S., Hassidim, A., Klotz, D., Kratzert, F., Metzger, A., et al.: Global prediction of extreme floods in ungauged watersheds. Nature 627(8004), 559–563 (2024)
  • (33) Zhang, G., Wang, M., Liu, K.: Deep neural networks for global wildfire susceptibility modelling. Ecological Indicators 127, 107735 (2021)
  • (34) Miloshevich, G., Cozian, B., Abry, P., Borgnat, P., Bouchet, F.: Probabilistic forecasts of extreme heatwaves using convolutional neural networks in a regime of lack of data. Physical Review Fluids 8(4), 40501 (2023) arXiv:2208.00971. https://doi.org/10.1103/PhysRevFluids.8.040501
  • (35) Vo, T.Q., Kim, S.-H., Nguyen, D.H., Bae, D.-H.: LSTM-CM: a hybrid approach for natural drought prediction based on deep learning and climate models. Stochastic Environmental Research and Risk Assessment 37(6), 2035–2051 (2023)
  • (36) Shi, X.: Enabling smart dynamical downscaling of extreme precipitation events with machine learning. Geophysical Research Letters 47(19), 2020–090309 (2020)
  • (37) Callaghan, M., Schleussner, C.-F., Nath, S., et al.: Machine-learning-based evidence and attribution map** of 100,000 climate impact studies. Nature Climate Change 11, 966–972 (2021). https://doi.org/10.1038/s41558-021-01168-6
  • (38) Sutanto, S.J., van der Weert, M., Wanders, N., et al.: Moving from drought hazard to impact forecasts. Nature communications 10, 4945 (2019). https://doi.org/10.1038/s41467-019-12840-z
  • (39) Salakpi, E.E., Hurley, P.D., Muthoka, J.M., Barrett, A.B., Bowell, A., Oliver, S., Rowhani, P.: Forecasting vegetation condition with a Bayesian auto-regressive distributed lags (BARDL) model. Natural Hazards and Earth System Sciences 22(8), 2703–2723 (2022)
  • (40) Martinuzzi, F., Mahecha, M.D., Camps-Valls, G., Montero, D., Williams, T., Mora, K.: Learning extreme vegetation response to climate forcing: A comparison of recurrent neural network architectures. Nonlinear Processes in Geophysics 2024, 1–32 (2024)
  • (41) Ahmad, R., Yang, B., Ettlin, G., Berger, A., Rodríguez-Bocca, P.: A machine-learning based ConvLSTM architecture for NDVI forecasting. International Transactions in Operational Research 30(4), 2025–2048 (2023)
  • (42) Benson, V., Requena-Mesa, C., Robin, C., Alonso, L., Cortés, J., Gao, Z., Linscheid, N., Weynants, M., Reichstein, M.: Forecasting localized weather impacts on vegetation as seen from space with meteo-guided video prediction. In: CVPR 2024 (2024)
  • (43) Zhang, X., Chen, G., Cai, L., Jiao, H., Hua, J., Luo, X., Wei, X.: Impact assessments of typhoon Lekima on forest damages in Subtropical China using machine learning methods and Landsat 8 OLI imagery. Sustainability 13(9), 4893 (2021)
  • (44) Ronco, M., Tárraga, J.M., Muñoz, J., Piles, M., Marco, E.S., Wang, Q., Espinosa, M.T.M., Ponserre, S., Camps-Valls, G.: Exploring interactions between socioeconomic context and natural hazards on human population displacement. Nature Communications 14(1), 8004 (2023)
  • (45) Sodoge, J., Kuhlicke, C., de Brito, M.M.: Automatized spatio-temporal detection of drought impacts from newspaper articles using natural language processing and machine learning. Weather and Climate Extremes 41, 100574 (2023). https://doi.org/10.1016/j.wace.2023.100574
  • (46) Bostrom, A., Demuth, J.L., Wirz, C.D., Cains, M.G., Schumacher, A., Madlambayan, D., Bansal, A.S., Bearth, A., Chase, R., Crosman, K.M., Ebert-Uphoff, I., Gagne II, D.J., Guikema, S., Hoffman, R., Johnson, B.B., Kumler-Bonfanti, C., Lee, J.D., Lowe, A., McGovern, A., Przybylo, V., Radford, J.T., Roth, E., Sutter, C., Tissot, P., Roebber, P., Stewart, J.Q., White, M., Williams, J.K.: Trust and trustworthy artificial intelligence: A research agenda for AI in the environmental sciences. Risk Analysis. https://doi.org/10.1111/risa.14245
  • (47) Naveau, P., Hannart, A., Ribes, A.: Statistical methods for extreme event attribution in climate science. Annual Review of Statistics and Its Application 7, 89–110 (2020). https://doi.org/10.1146/annurev-statistics-031219-041314
  • (48) Otto, F.E.: Attribution of extreme events to climate change. Annual Review of Environment and Resources 48, 813–828 (2023)
  • (49) Pasini, A., Racca, P., Amendola, S., Cartocci, G., Cassardo, C.: Attribution of recent temperature behaviour reassessed by a neural-network method. Scientific reports 7(1), 17681 (2017)
  • (50) Barnes, E.A., Hurrell, J.W., Ebert-Uphoff, I., Anderson, C., Anderson, D.: Viewing forced climate patterns through an AI lens. Geophysical Research Letters 46(22), 13389–13398 (2019)
  • (51) Ghaffarian, S., Taghikhah, F.R., Maier, H.R.: Explainable artificial intelligence in disaster risk management: Achievements and prospective futures. International Journal of Disaster Risk Reduction 98, 104123 (2023). https://doi.org/10.1016/j.ijdrr.2023.104123
  • (52) Tuia, D., Schindler, K., Demir, B., Camps-Valls, G., Zhu, X.X., Kochupillai, M., Džeroski, S., van Rijn, J.N., Hoos, H.H., Frate, F.D., Datcu, M., Quiané-Ruiz, J.-A., Markl, V., Saux, B.L., Schneider, R.: Artificial intelligence to advance Earth observation: a perspective. IEEE Geoscience and Remote Sensing Magazine (2024)
  • (53) Schlund, M., Eyring, V., Camps-Valls, G., Friedlingstein, P., Gentine, P., Reichstein, M.: Constraining uncertainty in projected gross primary production with machine learning. Journal of Geophysical Research: Biogeosciences 125(11), 2019–005619 (2020)
  • (54) Srinivasan, R., Wang, L., Bulleid, J.: Machine learning-based climate time series anomaly detection using convolutional neural networks. Weather and Climate 40(1), 16–31 (2020)
  • (55) Dikshit, A., Pradhan, B., Assiri, M.E., Almazroui, M., Park, H.-J.: Solving transparency in drought forecasting using attention models. Science of The Total Environment 837, 155856 (2022). https://doi.org/10.1016/j.scitotenv.2022.155856
  • (56) Barnes, E.A., Barnes, R.J., Martin, Z.K., Rader, J.K.: This Looks Like That There: Interpretable Neural Networks for Image Tasks When Location Matters. Artificial Intelligence for the Earth Systems 1(3), 220001 (2022). https://doi.org/10.1175/AIES-D-22-0001.1
  • (57) Mamalakis, A., Barnes, E.A., Ebert-Uphoff, I.: Investigating the Fidelity of Explainable Artificial Intelligence Methods for Applications of Convolutional Neural Networks in Geoscience. Artificial Intelligence for the Earth Systems 1(4), 220012 (2022). https://doi.org/%****␣main.tex␣Line␣1500␣****10.1175/AIES-D-22-0012.1
  • (58) Pearl, J.: Causality: Models, Reasoning, and Inference, 2nd edn. MIT Press, Cambridge, MA (2017)
  • (59) Peters, J., Janzing, D., Schölkopf, B.: Elements of Causal Inference - Foundations and Learning Algorithms. Adaptive Computation and Machine Learning Series. MIT Press, Cambridge, MA (2017)
  • (60) Runge, J., Gerhardus, A., Varando, G., Eyring, V., Camps-Valls, G.: Causal inference for time series. Nature Reviews Earth and Environment 4(7), 487–505 (2023). https://doi.org/10.1038/s43017-023-00431-y
  • (61) Camps-Valls, G., Gerhardus, A., Ninad, U., Varando, G., Martius, G., Balaguer-Ballester, E., Vinuesa, R., Diaz, E., Zanna, L., Runge, J.: Discovering causal relations and equations from data. Physics Reports 1044, 1–68 (2023)
  • (62) Gnecco, N., Meinshausen, N., Peters, J., Engelke, S.: Causal discovery in heavy-tailed models. The Annals of Statistics (2019)
  • (63) Pasche, O.C., Chavez-Demoulin, V., Davison, A.C.: Causal modelling of heavy-tailed variables and confounders with application to river flow. Extremes 26, 573–594 (2021)
  • (64) Kiriliouk, A., Naveau, P.: Climate extreme event attribution using multivariate peaks-over-thresholds modeling and counterfactual theory. The Annals of Applied Statistics 14(3), 1342–1358 (2020). https://doi.org/10.1214/20-AOAS1355
  • (65) Ghanem, R., Higdon, D., Owhadi, H.: Handbook of Uncertainty Quantification. Springer, Cham, Switzerland (2017)
  • (66) Xu, L., Chen, N., Yang, C., Yu, H., Chen, Z.: Quantifying the uncertainty of precipitation forecasting using probabilistic deep learning. Hydrology and Earth System Sciences 26(11), 2923–2938 (2022). https://doi.org/10.5194/hess-26-2923-2022
  • (67) Bella, A., Ferri, C., Hernández-Orallo, J., Ramírez-Quintana, M.J.: Calibration of machine learning models. In: Handbook of Research on Machine Learning Applications and Trends: Algorithms, Methods, and Techniques, pp. 128–146. IGI Global, Hershey, PA (2010)
  • (68) Corps, I.M.: Libya Flooding: Situation Report #9 (2023)
  • (69) DWD: The weather in Germany 2021 (2021). https://www.dwd.de/EN/press/press_release/EN/2021/20210830_the_weather_in_germany_in_summer_2021_news.html
  • (70) Tojcic, I., Denamiel, C., Vilibic, I.: Performance of the Adriatic early warning system during the multi-meteotsunami event of 11–19 May 2020: an assessment using energy banners. Natural Hazards and Earth System Sciences (2021). https://doi.org/10.5194/nhess-21-2427-2021
  • (71) Yore, R., Walker, J.: Early Warning Systems and Evacuation: Rare and Extreme vs Frequent and Small-Scale Tropical Cyclones in the Philippines and Dominica. Disasters (2020). https://doi.org/10.1111/disa.12434
  • (72) Reichstein, M., Benson, V., Camps-Valls, G., Boran, H., Fearnley, C., Kornhuber, K., Rahaman, N., Schölkopf, B., Tárraga, J.M., Vinuesa, R., et al.: Early warning of complex climate risk with integrated artificial intelligence. Nature Communications (under evaluation) (2024)
  • (73) Tamamadin, M., Susandi, A., Pratama, A., Faisal, I., Wijaya, A., Firdaus, I.M., Kuntoro, W.S.: Automation process to support an information system on extreme weather warning. IOP Conference Series: Materials Science and Engineering (2020). https://doi.org/10.1088/1757-899X/803/1/012044
  • (74) AI, H.: Ethics Guidlines for trustworthy AI: High-level expert group on artificial intelligence. European Commission (2018). https://wayback.archive-it.org/12090/20201227221227/https://ec.europa.eu/digital-single-market/en/news/ethics-guidelines-trustworthy-ai
  • (75) Jobin, A., Ienca, M., Vayena, E.: The global landscape of AI ethics guidelines. Nature Machine Intelligence 1, 389–399 (2019)
  • (76) Kochupillai, M., Kahl, M., Schmitt, M., Taubenböck, H., Zhu, X.X.: Earth observation and artificial intelligence: Understanding emerging ethical issues and opportunities. IEEE Geoscience and Remote Sensing Magazine 10(4), 90–124 (2022)
  • (77) Macherera, M., Chimbari, M.J.: A review of studies on community based early warning systems. Journal of Disaster Risk Studies 8 (2016)
  • (78) Ramos, M.H., van Andel, S.J., Pappenberger, F.: Do probabilistic forecasts lead to better decisions? Hydrology and Earth System Sciences 17(6), 2219–2232 (2013). https://doi.org/10.5194/hess-17-2219-2013
  • (79) Giuliani, M., Pianosi, F., Castelletti, A.: Making the most of data: an information selection and assessment framework to improve water systems operations. Water Resources Research 51(11), 9073–9093 (2015). https://doi.org/%****␣main.tex␣Line␣1825␣****10.1002/2015WR017044
  • (80) Camps-Valls, G., Tuia, D., Zhu, X.X., Reichstein, M.: Deep Learning for the Earth Sciences: A Comprehensive Approach to Remote Sensing, Climate Science and Geosciences. John Wiley & Sons, Hoboken, NJ (2021)
  • (81) Reichstein, M., Camps-Valls, G., Stevens, B., Jung, M., Denzler, J., Carvalhais, N., Prabhat: Deep learning and process understanding for data-driven Earth system science. Nature 566, 195–204 (2019)
  • (82) Iles, C.E., Vautard, R., Strachan, J., Joussaume, S., Eggen, B.R., Hewitt, C.D.: The benefits of increasing resolution in global and regional climate simulations for european climate extremes. Geoscientific Model Development 13(11), 5583–5607 (2020)
  • (83) Cheng, S., Quilodrán-Casas, C., Ouala, S., Farchi, A., Liu, C., Tandeo, P., Fablet, R., Lucor, D., Iooss, B., Brajard, J., et al.: Machine learning with data assimilation and uncertainty quantification for dynamical systems: a review. IEEE/CAA Journal of Automatica Sinica 10(6), 1361–1387 (2023)
  • (84) Haynes, K., Lagerquist, R., McGraw, M., Musgrave, K., Ebert-Uphoff, I.: Creating and evaluating uncertainty estimates with neural networks for environmental-science applications. Artificial Intelligence for the Earth Systems 2(2) (2023). https://doi.org/10.1175/aies-d-22-0061.1
  • (85) Li, W., Pan, B., Xia, J., Duan, Q.: Convolutional neural network-based statistical post-processing of ensemble precipitation forecasts. Journal of Hydrology 605, 127301 (2022). https://doi.org/10.1016/j.jhydrol.2021.127301
  • (86) Farazmand, M., Sapsis, T.P.: Extreme events: Mechanisms and prediction. Applied Mechanics Reviews 71(5), 050801 (2019)
  • (87) West, H., Quinn, N., Horswell, M.: Remote sensing for drought monitoring & impact assessment: Progress, past challenges and future opportunities. Remote Sensing of Environment 232, 111291 (2019)
  • (88) Zargar, A., Sadiq, R., Naser, B., Khan, F.I.: A review of drought indices. Environmental Reviews 19(NA), 333–349 (2011)
  • (89) Jehanzaib, M., Bilal Idrees, M., Kim, D., Kim, T.-W.: Comprehensive evaluation of machine learning techniques for hydrological drought forecasting. Journal of Irrigation and Drainage Engineering 147(7), 04021022 (2021)
  • (90) Zhang, M., Fernández-Torres, M.A., Camps-Valls, G.: Domain knowledge-driven variational recurrent networks for drought monitoring. Remote Sensing of Environment (2024, under review)
  • (91) Dikshit, A., Pradhan, B.: Interpretable and explainable AI (XAI) model for spatial drought prediction. Science of the Total Environment 801, 149797 (2021)
  • (92) Barriopedro, D., García-Herrera, R., Ordóñez, C., Miralles, D., Salcedo-Sanz, S.: Heat waves: Physical understanding and scientific challenges. Reviews of Geophysics 61(2), 2022–000780
  • (93) Teng, H., Leung, R., Branstator, G., Lu, J., Ding, Q.: Warming pattern over the northern hemisphere midlatitudes in boreal summer 1979–2020. Journal of Climate 35(11), 3479–3494 (2022)
  • (94) Vautard, R., Cattiaux, J., Happé, T., Singh, J., Bonnet, R., Cassou, C., Coumou, D., D’Andrea, F., Faranda, D., Fischer, E., Ribes, A., Sippel, S., Yiou, P.: Heat extremes in western europe increasing faster than simulated due to atmospheric circulation trends. Nature Communications 14(1), 6803. https://doi.org/%****␣main.tex␣Line␣2075␣****10.1038/s41467-023-42143-3. Nature Publishing Group. Accessed 2024-04-30
  • (95) Jacques-Dumas, V., Ragone, F., Borgnat, P., Abry, P., Bouchet, F.: Deep learning-based extreme heatwave forecast. Frontiers in Climate 4 (2022)
  • (96) Fister, D., Pérez-Aracil, J., Peláez-Rodríguez, C., Del Ser, J., Salcedo-Sanz, S.: Accurate long-term air temperature prediction with machine learning models and data reduction techniques. Applied Soft Computing 136, 110118. https://doi.org/10.1016/j.asoc.2023.110118. Elsevier
  • (97) van Straaten, C., Whan, K., Coumou, D., van den Hurk, B., Schmeits, M.: Correcting subseasonal forecast errors with an explainable ann to understand misrepresented sources of predictability of european summer temperatures. Artificial Intelligence for the Earth Systems 2(3), 220047 (2023). https://doi.org/10.1175/AIES-D-22-0047.1
  • (98) Gao, Z., Shi, X., Wang, H., Zhu, Y., Wang, Y.B., Li, M., Yeung, D.-Y.: Earthformer: Exploring space-time transformers for Earth system forecasting. Advances in Neural Information Processing Systems 35, 25390–25403 (2022)
  • (99) Singh, J., Sippel, S., Fischer, E.M.: Circulation dampened heat extremes intensification over the midwest USA and amplified over western europe. Communications Earth & Environment 4(1), 1–9. Nature Publishing Group
  • (100) Jones, M.W., Abatzoglou, J.T., Veraverbeke, S., Andela, N., Lasslop, G., Forkel, M., Smith, A.J.P., Burton, C., Betts, R.A., van der Werf, G.R., Sitch, S., Canadell, J.G., Santín, C., Kolden, C., Doerr, S.H., Le Quéré, C.: Global and Regional Trends and Drivers of Fire Under Climate Change. Reviews of Geophysics 60(3), 2020–000726 (2022). https://doi.org/10.1029/2020RG000726
  • (101) El Garroussi, S., Di Giuseppe, F., Barnard, C., Wetterhall, F.: Europe faces up to tenfold increase in extreme fires in a warming climate. npj Climate and Atmospheric Science 7(1), 1–11 (2024). https://doi.org/10.1038/s41612-024-00575-8. Nature Publishing Group
  • (102) Zhang, G., Wang, M., Liu, K.: Forest Fire Susceptibility Modeling Using a Convolutional Neural Network for Yunnan Province of China. International Journal of Disaster Risk Science 10(3), 386–403 (2019). https://doi.org/10.1007/s13753-019-00233-1
  • (103) Li, F., Zhu, Q., Riley, W.J., Zhao, L., Xu, L., Yuan, K., Chen, M., Wu, H., Gui, Z., Gong, J., et al.: Attentionfire_v1. 0: interpretable machine learning fire model for burned-area predictions over tropics. Geoscientific Model Development 16(3), 869–884 (2023)
  • (104) Fromm, M., Servranckx, R., Stocks, B.J., Peterson, D.A.: Understanding the critical elements of the pyrocumulonimbus storm sparked by high-intensity wildland fire. Communications Earth & Environment 3(1), 1–7 (2022). https://doi.org/10.1038/s43247-022-00566-8. Nature Publishing Group
  • (105) Salas-Porras, E.D., Tazi, K., Braude, A., Okoh, D., Lamb, K.D., Watson-Parris, D., Harder, P., Meinert, N.: Identifying the causes of Pyrocumulonimbus (PyroCb). In: 2022 NeurIPS Workshop on Causal Machine Learning for Real-World Impact (2022). http://arxiv.longhoe.net/abs/2211.08883
  • (106) Jonkman, S.: Global perspectives on loss of human life caused by floods. Natural Hazards 34, 151–175 (2005). https://doi.org/10.1007/S11069-004-8891-3
  • (107) Cornwall, W.: Europe’s deadly floods leave scientists stunned. Science 373(6553), 372–373 (2021). https://doi.org/10.1126/science.373.6553.372
  • (108) Mohr, S., Ehret, U., Kunz, M., Ludwig, P., Caldas-Alvarez, A., Daniell, J.E., Ehmele, F., Feldmann, H., Franca, M.J., Gattke, C., et al.: A multi-disciplinary analysis of the exceptional flood event of July 2021 in central Europe–Part 1: Event description and analysis. Natural Hazards and Earth System Sciences 23, 525–551 (2023). https://doi.org/10.5194/nhess-23-525-2023
  • (109) Kochkov, D., Smith, J.A., Alieva, A., Wang, Q., Brenner, M.P., Hoyer, S.: Machine learning–accelerated computational fluid dynamics. Proceedings of the National Academy of Sciences 118(21), 2101784118 (2021). https://doi.org/10.1073/pnas.2101784118
  • (110) Lutjens, B., Leshchinskiy, B., Requena-Mesa, C., Chishtie, F., Díaz-Rodríguez, N., Boulais, O., Sankaranarayanan, A., Masson-Forsythe, M., Piña, A., Gal, Y., Raïssi, C., Lavin, A., Newman, D.: Physically-Consistent Generative Adversarial Networks for Coastal Flood Visualization (2023)