Search | arXiv e-print repository

AIFS - ECMWF's data-driven forecasting system

Authors: Simon Lang, Mihai Alexe, Matthew Chantry, Jesper Dramsch, Florian Pinault, Baudouin Raoult, Mariana C. A. Clare, Christian Lessig, Michael Maier-Gerber, Linus Magnusson, Zied Ben Bouallègue, Ana Prieto Nemesio, Peter D. Dueben, Andrew Brown, Florian Pappenberger, Florence Rabier

Abstract: Machine learning-based weather forecasting models have quickly emerged as a promising methodology for accurate medium-range global weather forecasting. Here, we introduce the Artificial Intelligence Forecasting System (AIFS), a data driven forecast model developed by the European Centre for Medium-Range Weather Forecasts (ECMWF). AIFS is based on a graph neural network (GNN) encoder and decoder, and a sliding window transformer processor, and is trained on ECMWF's ERA5 re-analysis and ECMWF's operational numerical weather prediction (NWP) analyses. It has a flexible and modular design and supports several levels of parallelism to enable training on high-resolution input data. AIFS forecast skill is assessed by comparing its forecasts to NWP analyses and direct observational data. We show that AIFS produces highly skilled forecasts for upper-air variables, surface weather parameters and tropical cyclone tracks. AIFS is run four times daily alongside ECMWF's physics-based NWP model and forecasts are available to the public under ECMWF's open data policy.

Submitted 3 June, 2024; originally announced June 2024.

arXiv:2308.15560 [pdf, other]

WeatherBench 2: A benchmark for the next generation of data-driven global weather models

Authors: Stephan Rasp, Stephan Hoyer, Alexander Merose, Ian Langmore, Peter Battaglia, Tyler Russel, Alvaro Sanchez-Gonzalez, Vivian Yang, Rob Carver, Shreya Agrawal, Matthew Chantry, Zied Ben Bouallegue, Peter Dueben, Carla Bromberg, Jared Sisk, Luke Barrington, Aaron Bell, Fei Sha

Abstract: WeatherBench 2 is an update to the global, medium-range (1-14 day) weather forecasting benchmark proposed by Rasp et al. (2020), designed with the aim to accelerate progress in data-driven weather modeling. WeatherBench 2 consists of an open-source evaluation framework, publicly available training, ground truth and baseline data as well as a continuously updated website with the latest metrics and state-of-the-art models: https://sites.research.google/weatherbench. This paper describes the design principles of the evaluation framework and presents results for current state-of-the-art physical and data-driven weather models. The metrics are based on established practices for evaluating weather forecasts at leading operational weather centers. We define a set of headline scores to provide an overview of model performance. In addition, we also discuss caveats in the current evaluation setup and challenges for the future of data-driven weather forecasting.

Submitted 26 January, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

arXiv:2004.06582 [pdf]

doi 10.1175/BAMS-D-19-0308.1

Statistical Postprocessing for Weather Forecasts -- Review, Challenges and Avenues in a Big Data World

Authors: Stéphane Vannitsem, John Bjørnar Bremnes, Jonathan Demaeyer, Gavin R. Evans, Jonathan Flowerdew, Stephan Hemri, Sebastian Lerch, Nigel Roberts, Susanne Theis, Aitor Atencia, Zied Ben Bouallègue, Jonas Bhend, Markus Dabernig, Lesley De Cruz, Leila Hieta, Olivier Mestre, Lionel Moret, Iris Odak Plenković, Maurice Schmeits, Maxime Taillardat, Joris Van den Bergh, Bert Van Schaeybroeck, Kirien Whan, Jussi Ylhaisi

Abstract: Statistical postprocessing techniques are nowadays key components of the forecasting suites in many National Meteorological Services (NMS), with for most of them, the objective of correcting the impact of different types of errors on the forecasts. The final aim is to provide optimal, automated, seamless forecasts for end users. Many techniques are now flourishing in the statistical, meteorological, climatological, hydrological, and engineering communities. The methods range in complexity from simple bias corrections to very sophisticated distribution-adjusting techniques that incorporate correlations among the prognostic variables. The paper is an attempt to summarize the main activities going on this area from theoretical developments to operational applications, with a focus on the current challenges and potential avenues in the field. Among these challenges is the shift in NMS towards running ensemble Numerical Weather Prediction (NWP) systems at the kilometer scale that produce very large datasets and require high-density high-quality observations; the necessity to preserve space time correlation of high-dimensional corrected fields; the need to reduce the impact of model changes affecting the parameters of the corrections; the necessity for techniques to merge different types of forecasts and ensembles with different behaviors; and finally the ability to transfer research on statistical postprocessing to operations. Potential new avenues will also be discussed.

Submitted 14 April, 2020; originally announced April 2020.

Comments: This work has been submitted to the Bulletin of the American Meteorological Society. Copyright in this work may be transferred without further notice

arXiv:2001.08712 [pdf, ps, other]

doi 10.1002/qj.3853

Statistical post-processing of heat index ensemble forecasts: is there a royal road?

Authors: Sándor Baran, Ágnes Baran, Florian Pappenberger, Zied Ben Bouallègue

Abstract: We investigate the effect of statistical post-processing on the probabilistic skill of discomfort index (DI) and indoor wet-bulb globe temperature (WBGTid) ensemble forecasts, both calculated from the corresponding forecasts of temperature and dew point temperature. Two different methodological approaches to calibration are compared. In the first case, we start with joint post-processing of the temperature and dew point forecasts and then create calibrated samples of DI and WBGTid using samples from the obtained bivariate predictive distributions. This approach is compared with direct post-processing of the heat index ensemble forecasts. For this purpose, a novel ensemble model output statistics model based on a generalized extreme value distribution is proposed. The predictive performance of both methods is tested on the operational temperature and dew point ensemble forecasts of the European Centre for Medium-Range Weather Forecasts and the corresponding forecasts of DI and WBGTid. For short lead times (up to day 6), both approaches significantly improve the forecast skill. Among the competing post-processing methods, direct calibration of heat indices exhibits the best predictive performance, very closely followed by the more general approach based on joint calibration of temperature and dew point temperature. Additionally, a machine learning approach is tested and shows comparable performance for the case when one is interested only in forecasting heat index warning level categories.

Submitted 27 January, 2020; v1 submitted 23 January, 2020; originally announced January 2020.

Comments: 29 pages, 12 figures

Journal ref: Quarterly Journal of the Royal Meteorological Society 146 (2020), no. 732, 3416-3434

arXiv:1811.05821 [pdf, other]

doi 10.1002/qj.3521

Statistical post-processing of dual-resolution ensemble forecasts

Authors: Sándor Baran, Martin Leutbecher, Marianna Szabó, Zied Ben Bouallègue

Abstract: The computational cost as well as the probabilistic skill of ensemble forecasts depends on the spatial resolution of the numerical weather prediction model and the ensemble size. Periodically, e.g. when more computational resources become available, it is appropriate to reassess the balance between resolution and ensemble size. Recently, it has been proposed to investigate this balance in the context of dual-resolution ensembles, which use members with two different resolutions to make probabilistic forecasts. This study investigates whether statistical post-processing of such dual-resolution ensemble forecasts changes the conclusions regarding the optimal dual-resolution configuration. Medium-range dual-resolution ensemble forecasts of 2-metre temperature have been calibrated using ensemble model output statistics. The forecasts are produced with ECMWF's Integrated Forecast System and have horizontal resolutions between 18 km and 45 km. The ensemble sizes range from 8 to 254 members. The forecasts are verified with SYNOP station data. Results show that score differences between various single and dual-resolution configurations are strongly reduced by statistical post-processing. Therefore, the benefit of some dual-resolution configurations over single resolution configurations appears to be less pronounced than for raw forecasts. Moreover, the ranking of the ensemble configurations can be affected by the statistical post-processing.

Submitted 14 November, 2018; originally announced November 2018.

Comments: 25 pages, 12 figures, 2 tables

Journal ref: Quarterly Journal of the Royal Meteorological Society 145 (2019), 1705-1720

arXiv:1511.05877 [pdf, ps, other]

doi 10.1175/MWR-D-15-0403.1

Generation of scenarios from calibrated ensemble forecasts with a dual ensemble copula coupling approach

Authors: Zied Ben Bouallegue, Tobias Heppelmann, Susanne E. Theis, Pierre Pinson

Abstract: Probabilistic forecasts in the form of ensemble of scenarios are required for complex decision making processes. Ensemble forecasting systems provide such products but the spatio-temporal structures of the forecast uncertainty is lost when statistical calibration of the ensemble forecasts is applied for each lead time and location independently. Non-parametric approaches allow the reconstruction of spatio-temporal joint probability distributions at a low computational cost. For example, the ensemble copula coupling (ECC) method rebuilds the multivariate aspect of the forecast from the original ensemble forecasts. Based on the assumption of error stationarity, parametric methods aim to fully describe the forecast dependence structures. In this study, the concept of ECC is combined with past data statistics in order to account for the autocorrelation of the forecast error. The new approach, called d-ECC, is applied to wind forecasts from the high resolution ensemble system COSMO-DE-EPS run operationally at the German weather service. Scenarios generated by ECC and d-ECC are compared and assessed in the form of time series by means of multivariate verification tools and in a product oriented framework. Verification results over a 3 month period show that the innovative method d-ECC outperforms or performs as well as ECC in all investigated aspects.

Submitted 3 June, 2016; v1 submitted 18 November, 2015; originally announced November 2015.

arXiv:1510.00535 [pdf, ps, other]

Assessment and added value estimation of an ensemble approach with a focus on global radiation forecasts

Authors: Zied Ben Bouallegue

Abstract: The assessment of the high-resolution ensemble weather prediction system COSMO-DE-EPS is achieved with the perspective of using it for renewable energy applications. The performance of the ensemble forecast is explored focusing on global radiation, the main weather variable affecting solar power production, and on quantile forecasts, key probabilistic products for the energy sector. First, the ability of the ensemble system to capture and resolve the observation variability is assessed. Secondly, the potential benefit of the ensemble forecasting strategy compared to a single forecast approach is quantitatively estimated. A new metric called ensemble added value is proposed, aiming at a fair comparison of an ensemble forecast with a single forecast, when optimized to the users' needs. Hourly mean forecasts are verified against pyranometer measurements over verification periods covering 2013. The results show in particular that the added value of the ensemble approach is season-dependent and increases with the forecast horizon.

Submitted 2 October, 2015; originally announced October 2015.

Journal ref: Ben Bouallegue Z, 2015: Assessment and added value estimation of an ensemble approach with a focus on global radiation forecasts. Mausam, 66, 541-550

arXiv:1504.04211 [pdf, ps, other]

doi 10.1002/qj.2624

Quantile forecast discrimination ability and value

Authors: Zied Ben Bouallegue, Pierre Pinson, Petra Friederichs

Abstract: While probabilistic forecast verification for categorical forecasts is well established, some of the existing concepts and methods have not found their equivalent for the case of continuous variables. New tools dedicated to the assessment of forecast discrimination ability and forecast value are introduced here, based on quantile forecasts being the base product for the continuous case (hence in a nonparametric framework). The relative user characteristic (RUC) curve and the quantile value plot allow analysing the performance of a forecast for a specific user in a decision-making framework. The RUC curve is designed as a user-based discrimination tool and the quantile value plot translates forecast discrimination ability in terms of economic value. The relationship between the overall value of a quantile forecast and the respective quantile skill score is also discussed. The application of these new verification approaches and tools is illustrated based on synthetic datasets, as well as for the case of global radiation forecasts from the high resolution ensemble COSMO-DE-EPS of the German Weather Service.

Submitted 16 April, 2015; originally announced April 2015.

Showing 1–8 of 8 results for author: Bouallegue, Z B