-
The Anthropocene by the Numbers: A Quantitative Snapshot of Humanity's Influence on the Planet
Authors:
Griffin Chure,
Rachel A. Banks,
Avi I. Flamholz,
Nicholas S. Sarai,
Mason Kamb,
Ignacio Lopez-Gomez,
Yinon M. Bar-On,
Ron Milo,
Rob Phillips
Abstract:
The presence and action of humans on Earth has exerted a strong influence on the evolution of the planet over the past $\approx$ 10,000 years, the consequences of which are now becoming broadly evident. Despite a deluge of tightly-focused and necessarily technical studies exploring each facet of "human impacts" on the planet, their integration into a complete picture of the human-Earth system lags…
▽ More
The presence and action of humans on Earth has exerted a strong influence on the evolution of the planet over the past $\approx$ 10,000 years, the consequences of which are now becoming broadly evident. Despite a deluge of tightly-focused and necessarily technical studies exploring each facet of "human impacts" on the planet, their integration into a complete picture of the human-Earth system lags far behind. Here, we quantify twelve dimensionless ratios which put the magnitude of human impacts in context, comparing the magnitude of anthropogenic processes to their natural analogues. These ratios capture the extent to which humans alter the terrestrial surface, hydrosphere, biosphere, atmosphere, and biogeochemistry of Earth. In almost all twelve cases, the impact of human processes rivals or exceeds their natural counterparts. The values and corresponding uncertainties for these impacts at global and regional resolution are drawn from the primary scientific literature, governmental and international databases, and industry reports. We present this synthesis of the current "state of affairs" as a graphical snapshot designed to be used as a reference. Furthermore, we establish a searchable database termed the Human Impacts Database (www.anthroponumbers.org) which houses all quantities reported here and many others with extensive curation and annotation. While necessarily incomplete, this work collates and contextualizes a set of essential numbers summarizing the broad impacts of human activities on Earth's atmosphere, land, water, and biota.
△ Less
Submitted 23 January, 2021;
originally announced January 2021.
-
Quantitative clarification of key questions about COVID-19 epidemiology
Authors:
Yinon M. Bar-On,
Ron Sender,
Avi I. Flamholz,
Rob Phillips,
Ron Milo
Abstract:
Modeling the spread of COVID-19 is crucial for informing public health policy. All models for COVID-19 epidemiology rely on parameters describing the dynamics of the infection process. The meanings of epidemiological parameters like R_0, R_t, the "serial interval" and "generation interval" can be challenging to understand, especially as these and other parameters are conceptually overlap** and s…
▽ More
Modeling the spread of COVID-19 is crucial for informing public health policy. All models for COVID-19 epidemiology rely on parameters describing the dynamics of the infection process. The meanings of epidemiological parameters like R_0, R_t, the "serial interval" and "generation interval" can be challenging to understand, especially as these and other parameters are conceptually overlap** and sometimes confusingly named. Moreover, the procedures used to estimate these parameters make various assumptions and use different mathematical approaches that should be understood and accounted for when relying on parameter values and reporting them to the public. Here, we offer several insights regarding the derivation of commonly-reported epidemiological parameters, and describe how mitigation measures like lockdown are expected to affect their values. We aim to present these quantitative relationships in a manner that is accessible to the widest audience possible. We hope that better communicating the intricacies of epidemiological models will improve our collective understanding of their strengths and weaknesses, and will help avoid possible pitfalls when using them.
△ Less
Submitted 9 July, 2020;
originally announced July 2020.
-
A quantitative compendium of COVID-19 epidemiology
Authors:
Yinon M. Bar-On,
Ron Sender,
Avi I. Flamholz,
Rob Phillips,
Ron Milo
Abstract:
Accurate numbers are needed to understand and predict viral dynamics. Curation of high-quality literature values for the infectious period duration or household secondary attack rate, for example, is especially pressing currently because these numbers inform decisions about how and when to lockdown or reopen societies. We aim to provide a curated source for the key numbers that help us understand…
▽ More
Accurate numbers are needed to understand and predict viral dynamics. Curation of high-quality literature values for the infectious period duration or household secondary attack rate, for example, is especially pressing currently because these numbers inform decisions about how and when to lockdown or reopen societies. We aim to provide a curated source for the key numbers that help us understand the virus driving our current global crisis. This compendium focuses solely on COVID-19 epidemiology. The numbers reported in summary format are substantiated by annotated references. For each property, we provide a concise definition, description of measurement and inference methods, and associated caveats. We hope this compendium will make essential numbers more accessible and avoid common sources of confusion for the many newcomers to the field such as using the incubation period to denote and quantify the latent period or using the hospitalization duration for the infectiousness period duration. This document will be repeatedly updated and the community is invited to participate in improving it.
△ Less
Submitted 9 July, 2020; v1 submitted 1 June, 2020;
originally announced June 2020.
-
SARS-CoV-2 (COVID-19) by the numbers
Authors:
Yinon M. Bar-On,
Avi I. Flamholz,
Rob Phillips,
Ron Milo
Abstract:
The current SARS-CoV-2 pandemic is a harsh reminder of the fact that, whether in a single human host or a wave of infection across continents, viral dynamics is often a story about the numbers. In this snapshot, our aim is to provide a one-stop, curated graphical source for the key numbers that help us understand the virus driving our current global crisis. The discussion is framed around two broa…
▽ More
The current SARS-CoV-2 pandemic is a harsh reminder of the fact that, whether in a single human host or a wave of infection across continents, viral dynamics is often a story about the numbers. In this snapshot, our aim is to provide a one-stop, curated graphical source for the key numbers that help us understand the virus driving our current global crisis. The discussion is framed around two broad themes: 1) the biology of the virus itself and 2) the characteristics of the infection of a single human host. Our one-page summary provides the key numbers pertaining to SARS-CoV-2, based mostly on peer-reviewed literature. The numbers reported in summary format are substantiated by the annotated references below. Readers are urged to remember that much uncertainty remains and knowledge of this pandemic and the virus driving it is rapidly evolving. In the paragraphs below we provide "back of the envelope" calculations that exemplify the insights that can be gained from knowing some key numbers and using quantitative logic. These calculations serve to improve our intuition through sanity checks, but do not replace detailed epidemiological analysis.
△ Less
Submitted 30 March, 2020; v1 submitted 28 March, 2020;
originally announced March 2020.
-
Understanding the Dynamics and Optimizing the Performance of Chemostat Selection Experiments
Authors:
Aryeh Wides,
Ron Milo
Abstract:
A chemostat enables long-term, continuous, exponential-phase growth in an environment limited as prescribed by the researcher. It is thus a potent tool for laboratory evolution - selecting for strains with desired phenotypes. However, despite the apparently simple design governed by a limited set of rules, analysis of chemostat dynamics shows that they display counter-intuitive properties. For exa…
▽ More
A chemostat enables long-term, continuous, exponential-phase growth in an environment limited as prescribed by the researcher. It is thus a potent tool for laboratory evolution - selecting for strains with desired phenotypes. However, despite the apparently simple design governed by a limited set of rules, analysis of chemostat dynamics shows that they display counter-intuitive properties. For example, the concentration of limiting substrate in the chemostat is independent of the concentration in the influx and only dependent on the dilution rate and the strain parameters. Moreover, choosing optimal operational parameters (dilution rate, volume size, etc.) can be challenging. There are conflicting requirements in the experimental design, such as a need for relatively fast growth conditions for mutation accumulation on the one hand versus slow dilution for a large fitness advantage for mutants to take over the population quickly on the other.In this study, we provide analytic and computational tools to help understand and predict chemostat dynamics, and choose suitable operational parameters. We refer to five stages of the process: (A) parameter choice and setup, (B) basic steady state growth, (C) mutation, (D) single takeover and (E) successive takeovers. We present a qualitative and quantitative framework to answer the questions confronted in each of these stages. We provide a set of simulations which support the quantitative results, and a graphical user interface to give a hands-on opportunity to experience and visualize the analytic results. We detail conditions that produce ineffectual selection regimes, and find that when avoided, the selection time is relatively robust, and usually varies by less than an order of magnitude. Finally, we suggest rules of thumb to help ensure that the chosen parameters lead to effective selection and minimize the duration of the selection process.
△ Less
Submitted 1 June, 2018;
originally announced June 2018.
-
The Energetic Cost of Building a Virus
Authors:
Gita Mahmoudabadi,
Ron Milo,
Rob Phillips
Abstract:
Viruses are incapable of autonomous energy production. Although many experimental studies make it clear that viruses are parasitic entities that hijack the host's molecular resources, a detailed estimate for the energetic cost of viral synthesis is largely lacking. To quantify the energetic cost of viruses to their hosts, we enumerated the costs associated with two very distinct but representative…
▽ More
Viruses are incapable of autonomous energy production. Although many experimental studies make it clear that viruses are parasitic entities that hijack the host's molecular resources, a detailed estimate for the energetic cost of viral synthesis is largely lacking. To quantify the energetic cost of viruses to their hosts, we enumerated the costs associated with two very distinct but representative DNA and RNA viruses, namely T4 and influenza. We found that for these viruses, translation of viral proteins is the most energetically expensive process. Interestingly, the cost of building a T4 phage and a single influenza virus are nearly the same. Due to influenza's higher burst size, however, the overall cost of a T4 phage infection is only 2-3% of the cost of an influenza infection. The costs of these infections relative to their host's estimated energy budget during the infection reveal that a T4 infection consumes about a third of its host's energy budget, where as an influenza infection consumes only 1%. Building on our estimates for T4, we show how the energetic costs of double-stranded DNA viruses scale with virus size, revealing that the dominant cost of building a virus can switch from translation to genome replication above a critical virus size. Lastly, using our predictions for the energetic cost of viruses, we provide estimates for the strengths of selection and genetic drift acting on newly incorporated genetic elements in viral genomes, under conditions of energy limitation.
△ Less
Submitted 5 January, 2017;
originally announced January 2017.
-
The protein cost of metabolic fluxes: prediction from enzymatic rate laws and cost minimization
Authors:
Elad Noor,
Avi Flamholz,
Arren Bar-Even,
Dan Davidi,
Ron Milo,
Wolfram Liebermeister
Abstract:
Bacterial growth depends crucially on metabolic fluxes, which are limited by the cell's capacity to maintain metabolic enzymes. The necessary enzyme amount per unit flux is a major determinant of metabolic strategies both in evolution and bioengineering. It depends on enzyme parameters (such as kcat and KM constants), but also on metabolite concentrations. Moreover, similar amounts of different en…
▽ More
Bacterial growth depends crucially on metabolic fluxes, which are limited by the cell's capacity to maintain metabolic enzymes. The necessary enzyme amount per unit flux is a major determinant of metabolic strategies both in evolution and bioengineering. It depends on enzyme parameters (such as kcat and KM constants), but also on metabolite concentrations. Moreover, similar amounts of different enzymes might incur different costs for the cell, depending on enzyme-specific properties such as protein size and half-life. Here, we developed enzyme cost minimization (ECM), a scalable method for computing enzyme amounts that support a given metabolic flux at a minimal protein cost. The complex interplay of enzyme and metabolite concentrations, e.g. through thermodynamic driving forces and enzyme saturation, would make it hard to solve this optimization problem directly. By treating enzyme cost as a function of metabolite levels, we formulated ECM as a numerically tractable, convex optimization problem. Its tiered approach allows for building models at different levels of detail, depending on the amount of available data. Validating our method with measured metabolite and protein levels in E. coli central metabolism, we found typical prediction fold errors of 3.8 and 2.7, respectively, for the two kinds of data. ECM can be used to predict enzyme levels and protein cost in natural and engineered pathways, establishes a direct connection between protein cost and thermodynamics, and provides a physically plausible and computationally tractable way to include enzyme kinetics into constraint-based metabolic models, where kinetics have usually been ignored or oversimplified.
△ Less
Submitted 1 April, 2016;
originally announced April 2016.
-
Cross-species analysis traces adaptation of Rubisco towards optimality in a low dimensional landscape
Authors:
Yonatan Savir,
Elad Noor,
Ron Milo,
Tsvi Tlusty
Abstract:
Rubisco, probably the most abundant protein in the biosphere, performs an essential part in the process of carbon fixation through photosynthesis thus facilitating life on earth. Despite the significant effect that Rubisco has on the fitness of plants and other photosynthetic organisms, this enzyme is known to have a remarkably low catalytic rate and a tendency to confuse its substrate, carbon dio…
▽ More
Rubisco, probably the most abundant protein in the biosphere, performs an essential part in the process of carbon fixation through photosynthesis thus facilitating life on earth. Despite the significant effect that Rubisco has on the fitness of plants and other photosynthetic organisms, this enzyme is known to have a remarkably low catalytic rate and a tendency to confuse its substrate, carbon dioxide, with oxygen. This apparent inefficiency is puzzling and raises questions regarding the roles of evolution versus biochemical constraints in sha** Rubisco. Here we examine these questions by analyzing the measured kinetic parameters of Rubisco from various organisms in various environments. The analysis presented here suggests that the evolution of Rubisco is confined to an effectively one-dimensional landscape, which is manifested in simple power law correlations between its kinetic parameters. Within this one dimensional landscape, which may represent biochemical and structural constraints, Rubisco appears to be tuned to the intracellular environment in which it resides such that the net photosynthesis rate is nearly optimal. Our analysis indicates that the specificity of Rubisco is not the main determinant of its efficiency but rather the tradeoff between the carboxylation velocity and CO2 affinity. As a result, the presence of oxygen has only moderate effect on the optimal performance of Rubisco, which is determined mostly by the local CO2 concentration. Rubisco appears as an experimentally testable example for the evolution of proteins subject both to strong selection pressure and to biochemical constraints which strongly confine the evolutionary plasticity to a low dimensional landscape.
△ Less
Submitted 26 July, 2010;
originally announced July 2010.
-
Coarse-Graining and Self-Dissimilarity of Complex Networks
Authors:
Shalev Itzkovitz,
Reuven Levitt,
Nadav Kashtan,
Ron Milo,
Michael Itzkovitz,
Uri Alon
Abstract:
Can complex engineered and biological networks be coarse-grained into smaller and more understandable versions in which each node represents an entire pattern in the original network? To address this, we define coarse-graining units (CGU) as connectivity patterns which can serve as the nodes of a coarse-grained network, and present algorithms to detect them. We use this approach to systematicall…
▽ More
Can complex engineered and biological networks be coarse-grained into smaller and more understandable versions in which each node represents an entire pattern in the original network? To address this, we define coarse-graining units (CGU) as connectivity patterns which can serve as the nodes of a coarse-grained network, and present algorithms to detect them. We use this approach to systematically reverse-engineer electronic circuits, forming understandable high-level maps from incomprehensible transistor wiring: first, a coarse-grained version in which each node is a gate made of several transistors is established. Then, the coarse-grained network is itself coarse-grained, resulting in a high-level blueprint in which each node is a circuit-module made of multiple gates. We apply our approach also to a mammalian protein-signaling network, to find a simplified coarse-grained network with three main signaling channels that correspond to cross-interacting MAP-kinase cascades. We find that both biological and electronic networks are 'self-dissimilar', with different network motifs found at each level. The present approach can be used to simplify a wide variety of directed and nondirected, natural and designed networks.
△ Less
Submitted 18 October, 2004; v1 submitted 14 May, 2004;
originally announced May 2004.
-
Topological Generalizations of network motifs
Authors:
N. Kashtan,
S. Itzkovitz,
R. Milo,
U. Alon
Abstract:
Biological and technological networks contain patterns, termed network motifs, which occur far more often than in randomized networks. Network motifs were suggested to be elementary building blocks that carry out key functions in the network. It is of interest to understand how network motifs combine to form larger structures. To address this, we present a systematic approach to define 'motif ge…
▽ More
Biological and technological networks contain patterns, termed network motifs, which occur far more often than in randomized networks. Network motifs were suggested to be elementary building blocks that carry out key functions in the network. It is of interest to understand how network motifs combine to form larger structures. To address this, we present a systematic approach to define 'motif generalizations': families of motifs of different sizes that share a common architectural theme. To define motif generalizations, we first define 'roles' in a subgraph according to structural equivalence. For example, the feedforward loop triad, a motif in transcription, neuronal and some electronic networks, has three roles, an input node, an output node and an internal node. The roles are used to define possible generalizations of the motif. The feedforward loop can have three simple generalizations, based on replicating each of the three roles and their connections. We present algorithms for efficiently detecting motif generalizations. We find that the transcription networks of bacteria and yeast display only one of the three generalizations, the multi-output feedforward generalization. In contrast, the neuronal network of \emph{C. elegans} mainly displays the multi-input generalization. Forward-logic electronic circuits display a multi-input, multi-output hybrid. Thus, networks which share a common motif can have very different generalizations of that motif. Using mathematical modelling, we describe the information processing functions of the different motif generalizations in transcription, neuronal and electronic networks.
△ Less
Submitted 16 May, 2004; v1 submitted 15 December, 2003;
originally announced December 2003.
-
On the uniform generation of random graphs with prescribed degree sequences
Authors:
R. Milo,
N. Kashtan,
S. Itzkovitz,
M. E. J. Newman,
U. Alon
Abstract:
Random graphs with prescribed degree sequences have been widely used as a model of complex networks. Comparing an observed network to an ensemble of such graphs allows one to detect deviations from randomness in network properties. Here we briefly review two existing methods for the generation of random graphs with arbitrary degree sequences, which we call the ``switching'' and ``matching'' meth…
▽ More
Random graphs with prescribed degree sequences have been widely used as a model of complex networks. Comparing an observed network to an ensemble of such graphs allows one to detect deviations from randomness in network properties. Here we briefly review two existing methods for the generation of random graphs with arbitrary degree sequences, which we call the ``switching'' and ``matching'' methods, and present a new method based on the ``go with the winners'' Monte Carlo method. The matching method may suffer from nonuniform sampling, while the switching method has no general theoretical bound on its mixing time. The ``go with the winners'' method has neither of these drawbacks, but is slow. It can however be used to evaluate the reliability of the other two methods and, by doing this, we demonstrate that the deviations of the switching and matching algorithms under realistic conditions are small compared to the ``go with the winners'' algorithm. Because of its combination of speed and accuracy we recommend the use of the switching method for most calculations.
△ Less
Submitted 30 May, 2004; v1 submitted 1 December, 2003;
originally announced December 2003.
-
Subgraphs in random networks
Authors:
S. Itzkovitz,
R. Milo,
N. Kashtan,
G. Ziv,
U. Alon
Abstract:
Understanding the subgraph distribution in random networks is important for modelling complex systems. In classic Erdos networks, which exhibit a Poissonian degree distribution, the number of appearances of a subgraph G with n nodes and g edges scales with network size as \mean{G} ~ N^{n-g}. However, many natural networks have a non-Poissonian degree distribution. Here we present approximate equ…
▽ More
Understanding the subgraph distribution in random networks is important for modelling complex systems. In classic Erdos networks, which exhibit a Poissonian degree distribution, the number of appearances of a subgraph G with n nodes and g edges scales with network size as \mean{G} ~ N^{n-g}. However, many natural networks have a non-Poissonian degree distribution. Here we present approximate equations for the average number of subgraphs in an ensemble of random sparse directed networks, characterized by an arbitrary degree sequence. We find new scaling rules for the commonly occurring case of directed scale-free networks, in which the outgoing degree distribution scales as P(k) ~ k^{-γ}. Considering the power exponent of the degree distribution, γ, as a control parameter, we show that random networks exhibit transitions between three regimes. In each regime the subgraph number of appearances follows a different scaling law, \mean{G} ~ N^α, where α=n-g+s-1 for γ<2, α=n-g+s+1-γfor 2<γ<γ_c, and α=n-g for γ>γ_c, s is the maximal outdegree in the subgraph, and γ_c=s+1. We find that certain subgraphs appear much more frequently than in Erdos networks. These results are in very good agreement with numerical simulations. This has implications for detecting network motifs, subgraphs that occur in natural networks significantly more than in their randomized counterparts.
△ Less
Submitted 26 August, 2003; v1 submitted 19 February, 2003;
originally announced February 2003.