-
Be Prospective, Not Retrospective: A Philosophy for Advancing Reproducibility in Modern Biological Research
Authors:
Griffin Chure
Abstract:
The ubiquity of computation in modern scientific research poses new challenges for reproducibility. While most journals now require that code and data be made available, the standards for organization, annotation, and validation remain lax, often making the data and code difficult to decipher or use in practice. I believe that this is because the documentation, collation, and validation of code and data are done only in retrospect. In this essay, I reflect on my experience contending with these challenges and present a philosophy for prioritizing reproducibility in modern biological research, where balancing computational analysis and wet-lab experiments is commonplace. Modern tools used in scientific workflows (such as GitHub repositories) lend themselves well to this philosophy, in which reproducibility begins at project inception, not completion. To that end, I present and provide a programming-language-agnostic template architecture that can be immediately copied and made bespoke to your next paper, whether your lab work is wet, dry, or somewhere in between.
Submitted 5 October, 2022;
originally announced October 2022.
-
The Anthropocene by the Numbers: A Quantitative Snapshot of Humanity's Influence on the Planet
Authors:
Griffin Chure,
Rachel A. Banks,
Avi I. Flamholz,
Nicholas S. Sarai,
Mason Kamb,
Ignacio Lopez-Gomez,
Yinon M. Bar-On,
Ron Milo,
Rob Phillips
Abstract:
The presence and action of humans on Earth have exerted a strong influence on the evolution of the planet over the past $\approx$ 10,000 years, the consequences of which are now becoming broadly evident. Despite a deluge of tightly focused and necessarily technical studies exploring each facet of "human impacts" on the planet, their integration into a complete picture of the human-Earth system lags far behind. Here, we quantify twelve dimensionless ratios that put the magnitude of human impacts in context, comparing the magnitude of anthropogenic processes to their natural analogues. These ratios capture the extent to which humans alter the terrestrial surface, hydrosphere, biosphere, atmosphere, and biogeochemistry of Earth. In almost all twelve cases, the impact of human processes rivals or exceeds that of their natural counterparts. The values and corresponding uncertainties for these impacts at global and regional resolution are drawn from the primary scientific literature, governmental and international databases, and industry reports. We present this synthesis of the current "state of affairs" as a graphical snapshot designed to be used as a reference. Furthermore, we establish a searchable database termed the Human Impacts Database (www.anthroponumbers.org) which houses all quantities reported here and many others with extensive curation and annotation. While necessarily incomplete, this work collates and contextualizes a set of essential numbers summarizing the broad impacts of human activities on Earth's atmosphere, land, water, and biota.
Submitted 23 January, 2021;
originally announced January 2021.
-
First-principles prediction of the information processing capacity of a simple genetic circuit
Authors:
Manuel Razo-Mejia,
Sarah Marzen,
Griffin Chure,
Rachel Taubman,
Muir Morrison,
Rob Phillips
Abstract:
Given the stochastic nature of gene expression, genetically identical cells exposed to the same environmental inputs will produce different outputs. This heterogeneity has been hypothesized to have consequences for how cells are able to survive in changing environments. Recent work has explored the use of information theory as a framework to understand the accuracy with which cells can ascertain the state of their surroundings. Yet the predictive power of these approaches is limited and has not been rigorously tested using precision measurements. To that end, we generate a minimal model for a simple genetic circuit in which all parameter values for the model come from independently published data sets. We then predict the information processing capacity of the genetic circuit for a suite of biophysical parameters such as protein copy number and protein-DNA affinity. We compare these parameter-free predictions with an experimental determination of protein expression distributions and the resulting information processing capacity of E. coli cells. We find that our minimal model captures the scaling of the cell-to-cell variability in the data and the inferred information processing capacity of our simple genetic circuit up to a systematic deviation.
Submitted 6 May, 2020;
originally announced May 2020.
-
The Energetics of Molecular Adaptation in Transcriptional Regulation
Authors:
Griffin Chure,
Manuel Razo-Mejia,
Nathan M. Belliveau,
Tal Einav,
Zofii Kaczmarek,
Stephanie L. Barnes,
Mitchell Lewis,
Rob Phillips
Abstract:
Mutation is a critical mechanism by which evolution explores the functional landscape of proteins. Despite our ability to experimentally inflict mutations at will, it remains difficult to link sequence-level perturbations to systems-level responses. Here, we present a framework centered on measuring changes in the free energy of the system to link individual mutations in an allosteric transcriptional repressor to the parameters which govern its response. We find the energetic effects of the mutations can be categorized into several classes which have characteristic curves as a function of the inducer concentration. We experimentally test these diagnostic predictions using the well-characterized LacI repressor of Escherichia coli, probing several mutations in the DNA binding and inducer binding domains. We find that the change in gene expression due to a point mutation can be captured by modifying only a subset of the model parameters that describe the respective domain of the wild-type protein. These parameters appear to be insulated, with mutations in the DNA binding domain altering only the DNA affinity and those in the inducer binding domain altering only the allosteric parameters. Changing these subsets of parameters tunes the free energy of the system in a way that is concordant with theoretical expectations. Finally, we show that the induction profiles and resulting free energies associated with pairwise double mutants can be predicted with quantitative accuracy given knowledge of the single mutants, providing an avenue for identifying and quantifying epistatic interactions.
Submitted 15 May, 2019;
originally announced May 2019.
-
Figure 1 Theory Meets Figure 2 Experiments in the Study of Gene Expression
Authors:
Rob Phillips,
Nathan M. Belliveau,
Griffin Chure,
Hernan G. Garcia,
Manuel Razo-Mejia,
Clarissa Scholes
Abstract:
It is tempting to believe that we now own the genome. The ability to read and re-write it at will has ushered in a stunning period in the history of science. Nonetheless, there is an Achilles heel exposed by all of the genomic data that has accrued: we still don't know how to interpret it. Many genes are subject to sophisticated programs of transcriptional regulation, mediated by DNA sequences that harbor binding sites for transcription factors which can up- or down-regulate gene expression depending upon environmental conditions. This gives rise to an input-output function describing how the level of expression depends upon the parameters of the regulated gene, for instance, on the number and type of binding sites in its regulatory sequence. In recent years, the ability to make precision measurements of expression, coupled with the ability to make increasingly sophisticated theoretical predictions, has enabled an explicit dialogue between theory and experiment that holds the promise of covering this genomic Achilles heel. The goal is to reach a predictive understanding of transcriptional regulation that makes it possible to calculate gene expression levels from DNA regulatory sequence. This review focuses on the canonical simple repression motif to ask how well the models that have been used to characterize it actually work. We consider a hierarchy of increasingly sophisticated experiments in which the minimal parameter set learned at one level is applied to make quantitative predictions at the next. We show that these careful quantitative dissections provide a template for a predictive understanding of the many more complex regulatory arrangements found across all domains of life.
Submitted 30 December, 2018;
originally announced December 2018.
-
Connecting the dots between mechanosensitive channel abundance, osmotic shock, and survival at single-cell resolution
Authors:
Griffin Chure,
Heun Jin Lee,
Rob Phillips
Abstract:
Rapid changes in extracellular osmolarity are one of many insults microbial cells face on a daily basis. To protect against such shocks, Escherichia coli and other microbes express several types of transmembrane channels which open and close in response to changes in membrane tension. In E. coli, one of the most abundant channels is the mechanosensitive channel of large conductance (MscL). While this channel has been heavily characterized through structural methods, electrophysiology, and theoretical modeling, our understanding of its physiological role in preventing cell death by alleviating high membrane tension remains tenuous. In this work, we examine the contribution of MscL alone to cell survival after osmotic shock at single-cell resolution using quantitative fluorescence microscopy. We conduct these experiments in an E. coli strain lacking all mechanosensitive channel genes save for MscL, whose expression is tuned across three orders of magnitude through modifications of the Shine-Dalgarno sequence. While theoretical models suggest that only a few MscL channels would be needed to alleviate even large changes in osmotic pressure, we find that between 500 and 700 channels per cell are needed to convey upwards of 80% survival. This number agrees with the average MscL copy number measured in wild-type E. coli cells through proteomic studies and quantitative Western blotting. Furthermore, we observe zero survival events in cells with fewer than 100 channels per cell. This work opens new questions concerning the contribution of other mechanosensitive channels to survival as well as regulation of their activity.
Submitted 3 June, 2018;
originally announced June 2018.
-
Tuning transcriptional regulation through signaling: A predictive theory of allosteric induction
Authors:
Manuel Razo-Mejia,
Stephanie L. Barnes,
Nathan M. Belliveau,
Griffin Chure,
Tal Einav,
Mitchell Lewis,
Rob Phillips
Abstract:
Allosteric regulation is found across all domains of life, yet we still lack simple, predictive theories that directly link the experimentally tunable parameters of a system to its input-output response. To that end, we present a general theory of allosteric transcriptional regulation using the Monod-Wyman-Changeux model. We rigorously test this model using the ubiquitous simple repression motif in bacteria by first predicting the behavior of strains that span a large range of repressor copy numbers and DNA binding strengths and then constructing and measuring their response. Our model not only accurately captures the induction profiles of these strains but also enables us to derive analytic expressions for key properties such as the dynamic range and $[EC_{50}]$. Finally, we derive an expression for the free energy of allosteric repressors which enables us to collapse our experimental data onto a single master curve that captures the diverse phenomenology of the induction profiles.
Submitted 21 June, 2017; v1 submitted 23 February, 2017;
originally announced February 2017.