License: arXiv.org perpetual non-exclusive license
arXiv:2312.04416v1 [cs.LG] 07 Dec 2023

Monitoring Sustainable Global Development Along Shared Socioeconomic Pathways

Michelle W.L. Wan
University of Bristol, UK
[email protected]
Jeffrey N. Clark
University of Bristol, UK
[email protected]
Edward A. Small
University of Bristol, UK
[email protected]
Elena Fillola Mayoral
University of Bristol, UK
[email protected]
Raúl Santos-Rodríguez
University of Bristol, UK
[email protected]
Abstract

Sustainable global development is one of the most prevalent challenges facing the world today, hinging on the equilibrium between socioeconomic growth and environmental sustainability. We propose approaches to monitor and quantify sustainable development along the Shared Socioeconomic Pathways (SSPs), including mathematically derived scoring algorithms and machine learning methods. These integrate socioeconomic and environmental datasets to produce an interpretable metric for SSP alignment. An initial study demonstrates promising results, laying the groundwork for the application of different methods to the monitoring of sustainable global development.

1 Introduction

To address the ongoing climate crisis, global socioeconomic progress must be balanced with environmental sustainability. However, assessing a region’s overall development trajectory with this balance in mind poses a complex challenge. Progress is typically monitored with respect to warming limits of 1.5 °C and 2 °C using self-reported nationally determined contributions (NDCs) of emissions Lecocq et al. (2022). A study of European cities integrated environmental and socioeconomic datasets and applied machine learning techniques to assess emissions reduction efforts Hsu et al. (2022). Despite substantial literature on country-level mitigation pathways, not all countries are well represented Lecocq et al. (2022). In climate change mitigation research, reference scenarios are used to evaluate possible low-carbon strategies; policymakers and researchers tend to focus on an individual scenario, which can limit the scope of understanding Grant et al. (2020).

In 2017, the five Shared Socioeconomic Pathways (SSPs) were introduced: (1) Sustainability, (2) Middle of the Road, (3) Regional Rivalry, (4) Inequality, (5) Fossil-fueled Development Riahi et al. (2017); O’Neill et al. (2017). These characterise the ways in which socioeconomic factors and atmospheric emissions may change in the coming century; analysis in 2020 identified over 1370 studies utilising the SSPs O’Neill et al. (2020). SSP projection datasets for complex, and sometimes competing, socioeconomic and environmental features are available from the year 2015, projected ahead to 2100. While the SSPs are presented as potential pathways, rather than ideal pathways or targets, they can be used to contextualise current development trajectories. With this in mind, the SSP projections can be compared to observational data to evaluate the development sustainability of different regions relative to the established SSPs. With an increasing range of stakeholders using climate scenario information, it becomes ever more crucial to improve the communication of climate scenario research O’Neill et al. (2020). Thus, we propose the application of data science and machine learning methods to distill these existing datasets into a single, interpretable score. These methods will act as pivotal tools to assist decision-makers in addressing the climate crisis, and monitoring sustainable socioeconomic development.

2 Proposed methods

Here we propose the distillation of environmental and socioeconomic features into a single interpretable metric, to evaluate development against established scenarios. We propose initial investigation into two mathematical methods: norm induced measures, which simply quantify the difference between observations and SSPs, and TraCE (Trajectory Counterfactual Explanation) scores Clark et al. (2023), which treat SSPs as counterfactual instances. We also propose exploring machine learning methods to better capture complex relationships. Additionally, scores from independent methods can be ensembled, to quantify SSP alignment more robustly than any single approach. These methods can be applied at a range of spatial scales, from regional to global.
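As an illustration of the ensembling idea, one simple option is rank averaging, which sidesteps the fact that different methods may produce scores on different scales and with different orientations (lower-is-better versus higher-is-better). The sketch below is only one possible scheme and is not taken from this work; the function names and the `higher_is_better` flag are hypothetical, and the values in the usage note are invented.

```python
def ranks(scores, higher_is_better):
    """Rank SSP labels from 1 (best aligned) upward; ties are broken by label order."""
    ordered = sorted(
        scores,
        key=lambda label: (-scores[label] if higher_is_better else scores[label], label),
    )
    return {label: position for position, label in enumerate(ordered, start=1)}


def ensemble_mean_rank(method_scores, higher_is_better):
    """Average the per-method ranks of each SSP label (assumed scheme, not from the paper).

    method_scores:    {method name: {SSP label: score}}
    higher_is_better: {method name: bool}, orientation of each method's score
    """
    per_method = {
        method: ranks(scores, higher_is_better[method])
        for method, scores in method_scores.items()
    }
    labels = sorted(next(iter(method_scores.values())))
    return {
        label: sum(r[label] for r in per_method.values()) / len(per_method)
        for label in labels
    }
```

For example, calling `ensemble_mean_rank` with a lower-is-better method and a higher-is-better method that both favour the same pathway gives that pathway a mean rank of 1. Rank averaging discards the size of the gaps between scores, so a calibrated or weighted combination may be preferable where magnitudes matter.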

2.1 Norm induced measures

Given a norm such as the Euclidean norm $\lVert f\rVert_{2}=\sqrt{\int_{\Omega}f(x)^{2}\,dx}$, we consider the difference between ground truth $u_{i}$ and target SSP $v_{i}$ for each feature $i$ as the error $e_{i}=u_{i}-v_{i}$. The norm measures the average error between truth and target by finding $\lVert e_{i}\rVert$ over the domain of interest $\Omega_{i}$ and dividing by the time frame length (or the discrete number of points). This measure is summed over all features (with weight $w_{i}$ for each feature), and divided by the norm of the weight vector, $\lVert w\rVert_{2}$, to obtain a score:

$$S=\frac{1}{\lVert w\rVert_{2}}\sum_{i=1}^{n}w_{i}\lVert e_{i}\rVert_{2} \tag{1}$$

This method allows for varying resolutions between features without interpolation. Weights can be selected so that important features contribute more to the score than others. Under this measure (for strictly positive weights), $S=0\implies e=0$ and therefore $u=v$, meaning the SSP is followed perfectly. $S$ is therefore non-negative and unbounded above, with a larger value of $S$ indicating a larger error $e$ and therefore worse alignment with an SSP.

Preliminary results are presented in Section 3, with all features equally weighted.
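A minimal sketch of Equation (1) in discrete form is given below, assuming each feature is supplied as a pair of equal-length observed and projected series, and that features have already been brought to comparable scales. The function names and the dictionary layout are hypothetical choices for illustration, not the implementation used for the preliminary study.

```python
import math


def feature_error_norm(observed, projected):
    """Discrete L2 norm of e_i = u_i - v_i, divided by the number of points."""
    if len(observed) != len(projected):
        raise ValueError("observed and projected series must have equal length")
    squared = sum((u - v) ** 2 for u, v in zip(observed, projected))
    return math.sqrt(squared) / len(observed)


def norm_induced_score(observations, projections, weights=None):
    """Weighted sum of per-feature error norms, divided by ||w||_2 as in Equation (1).

    observations, projections: {feature name: list of values}; each feature
    may have its own length, so no interpolation between features is needed.
    weights: {feature name: weight}; defaults to equal weighting.
    """
    features = list(observations)
    if weights is None:
        weights = {name: 1.0 for name in features}
    w_norm = math.sqrt(sum(weights[name] ** 2 for name in features))
    total = sum(
        weights[name] * feature_error_norm(observations[name], projections[name])
        for name in features
    )
    return total / w_norm
```

Calling `norm_induced_score(obs, ssp)` once per SSP yields one score per pathway, with the lowest score marking the closest alignment. Because each feature is reduced to a single number before the weighted sum, a yearly series and a monthly series can be mixed freely.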

2.2 TraCE scores

TraCE Clark et al. (2023) outputs a score $S$ between $-1$ and $1$, derived from the combination of an angle metric $R_{1}(x_{t},x^{\prime}_{t})$ and a distance metric $R_{2}(x_{t},x^{\prime}_{t})$, weighted by $\lambda\in[0,1]$:

$$S(x_{t},x^{\prime}_{t})=\lambda R_{1}(x_{t},x^{\prime}_{t})+(1-\lambda)R_{2}(x_{t},x^{\prime}_{t}) \tag{2}$$

Positive values of $S$ indicate that a factual instance, $x_{t}$, representing historical observations, is moving towards a target, $x^{\prime}_{t}$, representing an SSP projection; negative values indicate movement away from the SSP. Target point selection and $\lambda$ are user-adjustable parameters to be informed by domain expertise. $\lambda$ can also be implemented as a learnable function from machine learning methods, or calibrated using the outputs of other scoring methods.

Informed by user priorities or a specific research question, we propose exploring input feature weighting, which can be assigned individually or by grouping features, for example into economic and environmental categories. An exploration of TraCE score presentation will identify the most useful summary statistics and visualisations for communicating results with stakeholders.

Section 3 presents a preliminary study, with $\lambda=0.9$, and all features weighted equally.
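To make the structure of Equation (2) concrete, the sketch below combines an angle term and a distance term for a single time step. The exact definitions of the two metrics are not given here, so the forms used (cosine similarity for the angle term, a bounded inverse distance for the distance term) are illustrative assumptions and may differ from those in Clark et al. (2023). All function names are hypothetical.

```python
import math


def _sub(a, b):
    return [x - y for x, y in zip(a, b)]


def _norm(v):
    return math.sqrt(sum(x * x for x in v))


def angle_term(x_prev, x_curr, target):
    """Assumed R1: cosine between the observed step and the direction to the target."""
    step = _sub(x_curr, x_prev)
    to_target = _sub(target, x_prev)
    denom = _norm(step) * _norm(to_target)
    if denom == 0.0:
        return 0.0  # no movement, or already at the target: direction undefined
    return sum(s * t for s, t in zip(step, to_target)) / denom


def distance_term(x_curr, target):
    """Assumed R2: closeness in (0, 1], equal to 1 when the target is reached."""
    return 1.0 / (1.0 + _norm(_sub(target, x_curr)))


def trace_like_score(x_prev, x_curr, target, lam=0.9):
    """Weighted combination with the same shape as Equation (2); result lies in [-1, 1]."""
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    return lam * angle_term(x_prev, x_curr, target) + (1.0 - lam) * distance_term(x_curr, target)
```

With these assumed forms, a step directly towards the target gives a positive score and a step directly away gives a negative one, matching the interpretation above. Averaging the per-step scores over a period would give one summary value per country and SSP, though the choice of summary statistic is left open here.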

2.3 Time series classification

Using supervised machine learning methods, time series classification algorithms can be trained on labelled SSP projection datasets. Observational time series data for a given region can then be classified against the different SSP labels, with the classification probability enabling a quantified comparison of a region’s alignment with each SSP in turn. Approaches of interest include deep learning architectures, such as Long Short-Term Memory (LSTM) networks Karim et al. (2018, 2019), and transformers Zerveas et al. (2021); Jiang et al. (2022).
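The idea of reading alignment from class probabilities can be illustrated without a deep learning model. The sketch below deliberately substitutes a much simpler nearest-prototype classifier: each SSP projection acts as a class prototype, and a softmax over negative Euclidean distances stands in for the probabilities an LSTM or transformer would output. It handles a single univariate series, and the function names and the `scale` parameter are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of real-valued scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def ssp_probabilities(observed, ssp_projections, scale=1.0):
    """Pseudo-probabilities of each SSP label for one observed series.

    observed:        list of observed values
    ssp_projections: {SSP label: projected values of the same length}
    scale:           larger values flatten the distribution (assumed knob)
    """
    labels = sorted(ssp_projections)
    distances = []
    for label in labels:
        projected = ssp_projections[label]
        if len(projected) != len(observed):
            raise ValueError("series lengths must match")
        distances.append(
            math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, projected)))
        )
    # Closer projections receive higher probability.
    return dict(zip(labels, softmax([-d / scale for d in distances])))
```

The returned dictionary sums to one across SSP labels, so the values can be compared directly. A trained classifier would replace the distance computation while keeping the same probability-per-SSP output.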

3 Preliminary study

Figure 1: Preliminary results for two of the proposed methods.
(a) Norm induced measure over time for Brazil, for the period 2015-2022. Lower scores indicate closer alignment with a Shared Socioeconomic Pathway (SSP).
(b) Average TraCE scores for eight countries, for the period 2015-2022. Higher scores indicate closer alignment with a Shared Socioeconomic Pathway (SSP).

An initial exploratory study leverages global data from 2015 to 2022. Five features from historical environmental (temperature, precipitation, methane) and socioeconomic (population, GDP) data are compared against corresponding SSP projections at each time point, to quantify country-level alignment with each SSP. By distilling alignment across these features into a single value, sustainable development trajectories can be quantified both within and between countries. Scores for a single regional time series are shown in Figure 1(a), as the overall norm induced measures for Brazil. With this method, alignment for Brazil is strongest with SSP5 (Fossil-fueled Development) and SSP1 (Sustainability). TraCE scores are shown in Figure 1(b) for multiple regions, with countries generally aligning most strongly with SSP5. Germany scores lower across all SSPs, with little variation between different pathways, while Italy instead aligns most highly with SSP3 (Regional Rivalry). Future work could investigate how alignment patterns may vary at different spatial scales, and how these may be explained by computing scores at the feature level.

4 Responsible implementation and impact

Here we propose the development of a framework to monitor sustainable development. With an interpretable approach to reconcile numerous and sometimes conflicting features, non-experts are able to quickly assess alignment with SSP scenarios. This is particularly useful because manual assessment of raw data becomes increasingly challenging as more features are included. These methods can therefore enable improved communication between stakeholder groups.

To ensure fully informed implementation, this work requires active engagement with experts across domains encompassing environmental, social, and economic sciences. Several factors will affect results, including the selection of data features and their weighting, and the model sources for SSP projection data. This in turn will affect the conclusions drawn about the trajectories of regions in alignment with the different SSPs. Reported results must therefore clearly discuss these considerations and the implications of choices made. Implementation must also explicitly acknowledge that these analyses do not attribute sole responsibility to specific regions for the SSP alignment outcomes. This is because the observed features used for monitoring one region’s alignment can be influenced by the actions of other regions.

We envision this work as a component of a broader toolkit for applications such as monitoring real-time trajectories, and emissions simulation experiments, providing an output metric to quantify SSP alignment. This will enable decision-makers to effectively monitor current development trajectories, and evaluate the impact of possible actions in the context of the climate crisis.

Acknowledgments and Disclosure of Funding

MWLW, JNC, and RSR are funded by the UKRI Turing AI Fellowship [grant number EP/V024817/1]. EAS is funded by the ARC Centre of Excellence for Automated Decision-Making and Society (project number CE200100005), funded by the Australian Government through the Australian Research Council. Part of this work was done within the University of Bristol’s Machine Learning and Computer Vision (MaVi) Summer Research Program 2023. EFM is funded by a Google PhD Fellowship.

References

  • Clark et al. [2023] J. N. Clark, E. A. Small, N. Keshtmand, M. W. L. Wan, E. F. Mayoral, E. Werner, C. P. Bourdeaux, and R. Santos-Rodriguez. TraCE: Trajectory counterfactual explanation scores. 2023. URL https://arxiv.org/abs/2309.15965.
  • Grant et al. [2020] N. Grant, A. Hawkes, T. Napp, and A. Gambhir. The appropriate use of reference scenarios in mitigation analysis. Nat. Clim. Chang., 10:605–610, 2020. doi: 10.1038/s41558-020-0826-9. URL https://doi.org/10.1038/s41558-020-0826-9.
  • Hsu et al. [2022] A. Hsu, X. Wang, J. Tan, W. Toh, and N. Goyal. Predicting European cities’ climate mitigation performance using machine learning. Nature Communications, 13(1):7487, 2022. doi: 10.1038/s41467-022-35108-5.
  • Jiang et al. [2022] H. Jiang, L. Liu, and C. Lian. Multi-modal fusion transformer for multivariate time series classification. In 2022 14th International Conference on Advanced Computational Intelligence (ICACI), pages 284–288, 2022. doi: 10.1109/ICACI55529.2022.9837525.
  • Karim et al. [2018] F. Karim, S. Majumdar, H. Darabi, and S. Chen. LSTM fully convolutional networks for time series classification. IEEE Access, 6:1662–1669, 2018. doi: 10.1109/ACCESS.2017.2779939.
  • Karim et al. [2019] F. Karim, S. Majumdar, H. Darabi, and S. Harford. Multivariate LSTM-FCNs for time series classification. Neural Networks, 116:237–245, 2019. doi: 10.1016/j.neunet.2019.04.014.
  • Lecocq et al. [2022] F. Lecocq, H. Winkler, J. P. Daka, S. Fu, J. S. Gerber, S. Kartha, V. Krey, H. Lofgren, T. Masui, R. Mathur, J. Portugal-Pereira, B. K. Sovacool, M. V. Vilariño, and N. Zhou. Mitigation and development pathways in the near- to mid-term. In P. R. Shukla, J. Skea, R. Slade, A. Al Khourdajie, R. van Diemen, D. McCollum, M. Pathak, S. Some, P. Vyas, R. Fradera, M. Belkacemi, A. Hasija, G. Lisboa, S. Luz, and J. Malley, editors, Climate Change 2022: Mitigation of Climate Change. Contribution of Working Group III to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change. Cambridge University Press, Cambridge, UK and New York, NY, USA, 2022. doi: 10.1017/9781009157926.006.
  • O’Neill et al. [2017] B. C. O’Neill, E. Kriegler, K. L. Ebi, E. Kemp-Benedict, K. Riahi, D. S. Rothman, B. J. van Ruijven, D. P. van Vuuren, J. Birkmann, K. Kok, M. Levy, and W. Solecki. The roads ahead: Narratives for shared socioeconomic pathways describing world futures in the 21st century. Global Environmental Change, 42:169–180, 2017. doi: 10.1016/j.gloenvcha.2015.01.004.
  • O’Neill et al. [2020] B. C. O’Neill, T. R. Carter, K. Ebi, et al. Achievements and needs for the climate change scenario framework. Nat. Clim. Chang., 10:1074–1084, 2020. doi: 10.1038/s41558-020-00952-0.
  • Riahi et al. [2017] K. Riahi, D. P. van Vuuren, E. Kriegler, J. Edmonds, B. C. O’Neill, S. Fujimori, N. Bauer, K. Calvin, R. Dellink, O. Fricko, W. Lutz, A. Popp, J. C. Cuaresma, S. KC, M. Leimbach, L. Jiang, T. Kram, S. Rao, J. Emmerling, K. Ebi, T. Hasegawa, P. Havlik, F. Humpenöder, L. A. D. Silva, S. Smith, E. Stehfest, V. Bosetti, J. Eom, D. Gernaat, T. Masui, J. Rogelj, J. Strefler, L. Drouet, V. Krey, G. Luderer, M. Harmsen, K. Takahashi, L. Baumstark, J. C. Doelman, M. Kainuma, Z. Klimont, G. Marangoni, H. Lotze-Campen, M. Obersteiner, A. Tabeau, and M. Tavoni. The shared socioeconomic pathways and their energy, land use, and greenhouse gas emissions implications: An overview. Global Environmental Change, 42:153–168, Jan 2017. doi: 10.1016/j.gloenvcha.2016.05.009.
  • Zerveas et al. [2021] G. Zerveas, S. Jayaraman, D. Patel, A. Bhamidipaty, and C. Eickhoff. A transformer-based framework for multivariate time series representation learning. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pages 2114–2124, 2021. doi: 10.1145/3447548.3467401.