FAIR Data Pipeline: provenance-driven data management for traceable scientific workflows
Authors:
Sonia Natalie Mitchell,
Andrew Lahiff,
Nathan Cummings,
Jonathan Hollocombe,
Bram Boskamp,
Ryan Field,
Dennis Reddyhoff,
Kristian Zarebski,
Antony Wilson,
Bruno Viola,
Martin Burke,
Blair Archibald,
Paul Bessell,
Richard Blackwell,
Lisa A Boden,
Alys Brett,
Sam Brett,
Ruth Dundas,
Jessica Enright,
Alejandra N. Gonzalez-Beltran,
Claire Harris,
Ian Hinder,
Christopher David Hughes,
Martin Knight,
Vino Mano
, et al. (13 additional authors not shown)
Abstract:
Modern epidemiological analyses to understand and combat the spread of disease depend critically on access to, and use of, data. Rapidly evolving data, such as data streams changing during a disease outbreak, are particularly challenging. Data management is further complicated by data being imprecisely identified when used. Public trust in policy decisions resulting from such analyses is easily da…
▽ More
Modern epidemiological analyses to understand and combat the spread of disease depend critically on access to, and use of, data. Rapidly evolving data, such as data streams changing during a disease outbreak, are particularly challenging. Data management is further complicated by data being imprecisely identified when used. Public trust in policy decisions resulting from such analyses is easily damaged and is often low, with cynicism arising where claims of "following the science" are made without accompanying evidence. Tracing the provenance of such decisions back through open software to primary data would clarify this evidence, enhancing the transparency of the decision-making process. Here, we demonstrate a Findable, Accessible, Interoperable and Reusable (FAIR) data pipeline developed during the COVID-19 pandemic that allows easy annotation of data as they are consumed by analyses, while tracing the provenance of scientific outputs back through the analytical source code to data sources. Such a tool provides a mechanism for the public, and fellow scientists, to better assess the trust that should be placed in scientific evidence, while allowing scientists to support policy-makers in openly justifying their decisions. We believe that tools such as this should be promoted for use across all areas of policy-facing research.
△ Less
Submitted 4 May, 2022; v1 submitted 13 October, 2021;
originally announced October 2021.
Revisiting a synthetic intracellular regulatory network that exhibits oscillations
Authors:
Jonathan Tyler,
Anne Shiu,
Jay Walton
Abstract:
In 2000, Elowitz and Leibler introduced the repressilator--a synthetic gene circuit with three genes that cyclically repress transcription of the next gene--as well as a corresponding mathematical model. Experimental data and model simulations exhibited oscillations in the protein concentrations across generations. In 2006, Müller \textit{et al.}\ generalized the model to an arbitrary number of ge…
▽ More
In 2000, Elowitz and Leibler introduced the repressilator--a synthetic gene circuit with three genes that cyclically repress transcription of the next gene--as well as a corresponding mathematical model. Experimental data and model simulations exhibited oscillations in the protein concentrations across generations. In 2006, Müller \textit{et al.}\ generalized the model to an arbitrary number of genes and analyzed the resulting dynamics. Their new model arose from five key assumptions, two of which are restrictive given current biological knowledge. Accordingly, we propose a new repressilator system that allows for general functions to model transcription, degradation, and translation. We prove that, with an odd number of genes, the new model has a unique steady state and the system converges to this steady state or to a periodic orbit. We also give a necessary and sufficient condition for stability of steady states when the number of genes is even and conjecture a condition for stability for an odd number. Finally, we derive a new rate function describing transcription that arises under more reasonable biological assumptions than the widely used single-step binding assumption. With this new transcription-rate function, we compare the model's amplitude and period with that of a model with the conventional transcription-rate function. Taken together, our results enhance our understanding of genetic regulation by repression.
△ Less
Submitted 31 December, 2018; v1 submitted 1 August, 2018;
originally announced August 2018.