Guidelines for the Creation of Analysis Ready Data
Authors:
Harriette Phillips,
Aiden Price,
Owen Forbes,
Claire Boulange,
Kerrie Mengersen,
Marketa Reeves,
Rebecca Glauert
Abstract:
Globally, there is an increased need for guidelines to produce high-quality data outputs for analysis. No framework currently exists that provides guidelines for a comprehensive approach to producing analysis ready data (ARD). Through critically reviewing and summarising current literature, this paper proposes such guidelines for the creation of ARD. The guidelines proposed in this paper inform te…
▽ More
Globally, there is an increased need for guidelines to produce high-quality data outputs for analysis. No framework currently exists that provides guidelines for a comprehensive approach to producing analysis ready data (ARD). Through critically reviewing and summarising current literature, this paper proposes such guidelines for the creation of ARD. The guidelines proposed in this paper inform ten steps in the generation of ARD: ethics, project documentation, data governance, data management, data storage, data discovery and collection, data cleaning, quality assurance, metadata, and data dictionary. These steps are illustrated through a substantive case study that aimed to create ARD for a digital spatial platform: the Australian Child and Youth Wellbeing Atlas (ACYWA).
△ Less
Submitted 29 April, 2024; v1 submitted 12 March, 2024;
originally announced March 2024.
Data-driven recommendations for enhancing real-time natural hazard warnings, communication, and response
Authors:
Kate R. Saunders,
Owen Forbes,
Jess K. Hopf,
Charlotte R. Patterson,
Sarah A. Vollert,
Kaitlyn Brown,
Raiha Browning,
Miguel Canizares,
Richard S. Cottrell,
Lanxi Li,
Catherine J. S. Kim,
Tace P. Stewart,
Connie Susilawati,
Xiang Y. Zhao,
Kate J. Helmstedt
Abstract:
The effectiveness and adequacy of natural hazard warnings hinges on the availability of data and its transformation into actionable knowledge for the public. Real-time warning communication and emergency response therefore need to be evaluated from a data science perspective. However, there are currently gaps between established data science best practices and their application in supporting natur…
▽ More
The effectiveness and adequacy of natural hazard warnings hinges on the availability of data and its transformation into actionable knowledge for the public. Real-time warning communication and emergency response therefore need to be evaluated from a data science perspective. However, there are currently gaps between established data science best practices and their application in supporting natural hazard warnings. This Perspective reviews existing data-driven approaches that underpin real-time warning communication and emergency response, highlighting limitations in hazard and impact forecasts. Four main themes for enhancing warnings are emphasised: (i) applying best-practice principles in visualising hazard forecasts, (ii) data opportunities for more effective impact forecasts, (iii) utilising data for more localised forecasts, and (iv) improving data-driven decision-making using uncertainty. Motivating examples are provided from the extensive flooding experienced in Australia in 2022. This Perspective shows the capacity for improving the efficacy of natural hazard warnings using data science, and the collaborative potential between the data science and natural hazards communities.
△ Less
Submitted 31 October, 2023;
originally announced November 2023.