-
A picture guide to cancer progression and monotonic accumulation models: evolutionary assumptions, plausible interpretations, and alternative uses
Authors:
Ramon Diaz-Uriarte,
Iain G. Johnston
Abstract:
Cancer progression and monotonic accumulation models were developed to discover dependencies in the irreversible acquisition of binary traits from cross-sectional data. They have been used in computational oncology and virology but also in widely different problems such as malaria progression. These methods have been applied to predict future states of the system, identify routes of feature acquis…
▽ More
Cancer progression and monotonic accumulation models were developed to discover dependencies in the irreversible acquisition of binary traits from cross-sectional data. They have been used in computational oncology and virology but also in widely different problems such as malaria progression. These methods have been applied to predict future states of the system, identify routes of feature acquisition, and improve patient stratification, and they hold promise for evolutionary-based treatments. New methods continue to be developed.
But these methods have shortcomings, which are yet to be systematically critiqued, regarding key evolutionary assumptions and interpretations. After an overview of the available methods, we focus on why inferences might not be about the processes we intend. Using fitness landscapes, we highlight difficulties that arise from bulk sequencing and reciprocal sign epistasis, from conflating lines of descent, path of the maximum, and mutational profiles, and from ambiguous use of the idea of exclusivity. We examine how the previous concerns change when bulk sequencing is explicitly considered, and underline opportunities for addressing dependencies due to frequency-dependent selection. This review identifies major standing issues, and should encourage the use of these methods in other areas with a better alignment between entities and model assumptions.
△ Less
Submitted 30 June, 2024; v1 submitted 11 December, 2023;
originally announced December 2023.
-
Encounter networks from collective mitochondrial dynamics support the emergence of effective mtDNA genomes in plant cells
Authors:
Konstantinos Giannakis,
Joanna M. Chustecki,
Iain G. Johnston
Abstract:
Mitochondria in plant cells form strikingly dynamic populations of largely individual organelles. Each mitochondrion contains on average less than a full copy of the mitochondrial DNA (mtDNA) genome. Here, we asked whether mitochondrial dynamics may allow individual mitochondria to `collect' a full copy of the mtDNA genome over time, by facilitating exchange between individuals. Akin to trade on a…
▽ More
Mitochondria in plant cells form strikingly dynamic populations of largely individual organelles. Each mitochondrion contains on average less than a full copy of the mitochondrial DNA (mtDNA) genome. Here, we asked whether mitochondrial dynamics may allow individual mitochondria to `collect' a full copy of the mtDNA genome over time, by facilitating exchange between individuals. Akin to trade on a social network, exchange of mtDNA fragments across organelles may lead to the emergence of full `effective' genomes in individuals over time. We characterise the collective dynamics of mitochondria in \emph{Arabidopsis thaliana} hypocotyl cells using a recent approach combining single-cell timelapse microscopy, video analysis, and network science. We then use a quantitative model to predict the capacity for the sharing and accumulation of genetic information through the networks of encounters between mitochondria. We find that biological encounter networks are strikingly well predisposed to support the collection of full genomes over time, outperforming a range of other networks generated from theory and simulation. Using results from the coupon collector's problem, we show that the upper tail of the degree distribution is a key determinant of an encounter network's performance at this task and discuss how features of mitochondrial dynamics observed in biology facilitate the emergence of full effective genomes.
△ Less
Submitted 25 October, 2021;
originally announced October 2021.
-
Data-driven modelling and characterisation of task completion sequences in online courses
Authors:
Robert L. Peach,
Sam F. Greenbury,
Iain G. Johnston,
Sophia N. Yaliraki,
David Lefevre,
Mauricio Barahona
Abstract:
The intrinsic temporality of learning demands the adoption of methodologies capable of exploiting time-series information. In this study we leverage the sequence data framework and show how data-driven analysis of temporal sequences of task completion in online courses can be used to characterise personal and group learners' behaviors, and to identify critical tasks and course sessions in a given…
▽ More
The intrinsic temporality of learning demands the adoption of methodologies capable of exploiting time-series information. In this study we leverage the sequence data framework and show how data-driven analysis of temporal sequences of task completion in online courses can be used to characterise personal and group learners' behaviors, and to identify critical tasks and course sessions in a given course design. We also introduce a recently developed probabilistic Bayesian model to learn sequence trajectories of students and predict student performance. The application of our data-driven sequence-based analyses to data from learners undertaking an on-line Business Management course reveals distinct behaviors within the cohort of learners, identifying learners or groups of learners that deviate from the nominal order expected in the course. Using course grades a posteriori, we explore differences in behavior between high and low performing learners. We find that high performing learners follow the progression between weekly sessions more regularly than low performing learners, yet within each weekly session high performing learners are less tied to the nominal task order. We then model the sequences of high and low performance students using the probablistic Bayesian model and show that we can learn engagement behaviors associated with performance. We also show that the data sequence framework can be used for task centric analysis; we identify critical junctures and differences among types of tasks within the course design. We find that non-rote learning tasks, such as interactive tasks or discussion posts, are correlated with higher performance. We discuss the application of such analytical techniques as an aid to course design, intervention, and student supervision.
△ Less
Submitted 14 July, 2020;
originally announced July 2020.
-
Optimal strategies in the Fighting Fantasy gaming system: influencing stochastic dynamics by gambling with limited resource
Authors:
Iain G. Johnston
Abstract:
Fighting Fantasy is a popular recreational fantasy gaming system worldwide. Combat in this system progresses through a stochastic game involving a series of rounds, each of which may be won or lost. Each round, a limited resource (`luck') may be spent on a gamble to amplify the benefit from a win or mitigate the deficit from a loss. However, the success of this gamble depends on the amount of rema…
▽ More
Fighting Fantasy is a popular recreational fantasy gaming system worldwide. Combat in this system progresses through a stochastic game involving a series of rounds, each of which may be won or lost. Each round, a limited resource (`luck') may be spent on a gamble to amplify the benefit from a win or mitigate the deficit from a loss. However, the success of this gamble depends on the amount of remaining resource, and if the gamble is unsuccessful, benefits are reduced and deficits increased. Players thus dynamically choose to expend resource to attempt to influence the stochastic dynamics of the game, with diminishing probability of positive return. The identification of the optimal strategy for victory is a Markov decision problem that has not yet been solved. Here, we combine stochastic analysis and simulation with dynamic programming to characterise the dynamical behaviour of the system in the absence and presence of gambling policy. We derive a simple expression for the victory probability without luck-based strategy. We use a backward induction approach to solve the Bellman equation for the system and identify the optimal strategy for any given state during the game. The optimal control strategies can dramatically enhance success probabilities, but take detailed forms; we use stochastic simulation to approximate these optimal strategies with simple heuristics that can be practically employed. Our findings provide a roadmap to improving success in the games that millions of people play worldwide, and inform a class of resource allocation problems with diminishing returns in stochastic games.
△ Less
Submitted 24 February, 2020;
originally announced February 2020.
-
Intracellular Energy Variability Modulates Cellular Decision-Making Capacity
Authors:
Ryan Kerr,
Sara Jabbari,
Iain G. Johnston
Abstract:
Cells are able to generate phenotypic diversity both during development and in response to stressful and changing environments, aiding survival. The biologically and medically vital process of a cell assuming a functionally important fate from a range of phenotypic possibilities can be thought of as a cell decision. To make these decisions, a cell relies on energy dependent pathways of signalling…
▽ More
Cells are able to generate phenotypic diversity both during development and in response to stressful and changing environments, aiding survival. The biologically and medically vital process of a cell assuming a functionally important fate from a range of phenotypic possibilities can be thought of as a cell decision. To make these decisions, a cell relies on energy dependent pathways of signalling and expression. However, energy availability is often overlooked as a modulator of cellular decision-making. As cells can vary dramatically in energy availability, this limits our knowledge of how this key biological axis affects cell behaviour. Here, we consider the energy dependence of a highly generalisable decision-making regulatory network, and show that energy variability changes the sets of decisions a cell can make and the ease with which they can be made. Increasing intracellular energy levels can increase the number of stable phenotypes it can generate, corresponding to increased decision-making capacity. For this decision-making architecture, a cell with intracellular energy below a threshold is limited to a singular phenotype, potentially forcing the adoption of a specific cell fate. We suggest that common energetic differences between cells may explain some of the observed variability in cellular decision-making, and demonstrate the importance of considering energy levels in several diverse biological decision-making phenomena.
△ Less
Submitted 13 December, 2019;
originally announced December 2019.
-
HyperTraPS: Inferring probabilistic patterns of trait acquisition in evolutionary and disease progression pathways
Authors:
Sam F. Greenbury,
Mauricio Barahona,
Iain G. Johnston
Abstract:
The explosion of data throughout the biomedical sciences provides unprecedented opportunities to learn about the dynamics of evolution and disease progression, but harnessing these large and diverse datasets remains challenging. Here, we describe a highly generalisable statistical platform to infer the dynamic pathways by which many, potentially interacting, discrete traits are acquired or lost ov…
▽ More
The explosion of data throughout the biomedical sciences provides unprecedented opportunities to learn about the dynamics of evolution and disease progression, but harnessing these large and diverse datasets remains challenging. Here, we describe a highly generalisable statistical platform to infer the dynamic pathways by which many, potentially interacting, discrete traits are acquired or lost over time in biomedical systems. The platform uses HyperTraPS (hypercubic transition path sampling) to learn progression pathways from cross-sectional, longitudinal, or phylogenetically-linked data with unprecedented efficiency, readily distinguishing multiple competing pathways, and identifying the most parsimonious mechanisms underlying given observations. Its Bayesian structure quantifies uncertainty in pathway structure and allows interpretable predictions of behaviours, such as which symptom a patient will acquire next. We exploit the model's topology to provide visualisation tools for intuitive assessment of multiple, variable pathways. We apply the method to ovarian cancer progression and the evolution of multidrug resistance in tuberculosis, demonstrating its power to reveal previously undetected dynamic pathways.
△ Less
Submitted 28 November, 2019;
originally announced December 2019.
-
Mitochondrial network state scales mtDNA genetic dynamics
Authors:
Juvid Aryaman,
Charlotte Bowles,
Nick S. Jones,
Iain G. Johnston
Abstract:
Mitochondrial DNA (mtDNA) mutations cause severe congenital diseases but may also be associated with healthy aging. MtDNA is stochastically replicated and degraded, and exists within organelles which undergo dynamic fusion and fission. The role of the resulting mitochondrial networks in the time evolution of the cellular proportion of mutated mtDNA molecules (heteroplasmy), and cell-to-cell variab…
▽ More
Mitochondrial DNA (mtDNA) mutations cause severe congenital diseases but may also be associated with healthy aging. MtDNA is stochastically replicated and degraded, and exists within organelles which undergo dynamic fusion and fission. The role of the resulting mitochondrial networks in the time evolution of the cellular proportion of mutated mtDNA molecules (heteroplasmy), and cell-to-cell variability in heteroplasmy (heteroplasmy variance), remains incompletely understood. Heteroplasmy variance is particularly important since it modulates the number of pathological cells in a tissue. Here, we provide the first wide-reaching theoretical framework which bridges mitochondrial network and genetic states. We show that, under a range of conditions, the (genetic) rate of increase in heteroplasmy variance and de novo mutation are proportionally modulated by the (physical) fraction of unfused mitochondria, independently of the absolute fission-fusion rate. In the context of selective fusion, we show that intermediate fusion/fission ratios are optimal for the clearance of mtDNA mutants. Our findings imply that modulating network state, mitophagy rate and copy number to slow down heteroplasmy dynamics when mean heteroplasmy is low could have therapeutic advantages for mitochondrial disease and healthy aging.
△ Less
Submitted 3 July, 2019; v1 submitted 6 September, 2018;
originally announced September 2018.
-
Mitochondrial heterogeneity
Authors:
Juvid Aryaman,
Iain G. Johnston,
Nick S. Jones
Abstract:
Cell-to-cell heterogeneity drives a range of (patho)physiologically important phenomena, such as cell fate and chemotherapeutic resistance. The role of metabolism, and particularly mitochondria, is increasingly being recognised as an important explanatory factor in cell-to-cell heterogeneity. Most eukaryotic cells possess a population of mitochondria, in the sense that mitochondrial DNA (mtDNA) is…
▽ More
Cell-to-cell heterogeneity drives a range of (patho)physiologically important phenomena, such as cell fate and chemotherapeutic resistance. The role of metabolism, and particularly mitochondria, is increasingly being recognised as an important explanatory factor in cell-to-cell heterogeneity. Most eukaryotic cells possess a population of mitochondria, in the sense that mitochondrial DNA (mtDNA) is held in multiple copies per cell, where the sequence of each molecule can vary. Hence intra-cellular mitochondrial heterogeneity is possible, which can induce inter-cellular mitochondrial heterogeneity, and may drive aspects of cellular noise. In this review, we discuss sources of mitochondrial heterogeneity (variations between mitochondria in the same cell, and mitochondrial variations between supposedly identical cells) from both genetic and non-genetic perspectives, and mitochondrial genotype-phenotype links. We discuss the apparent homeostasis of mtDNA copy number, the observation of pervasive intra-cellular mtDNA mutation (we term `microheteroplasmy') and developments in the understanding of inter-cellular mtDNA mutation (`macroheteroplasmy'). We point to the relationship between mitochondrial supercomplexes, cristal structure, pH and cardiolipin as a potential amplifier of the mitochondrial genotype-phenotype link. We also discuss mitochondrial membrane potential and networks as sources of mitochondrial heterogeneity, and their influence upon the mitochondrial genome. Finally, we revisit the idea of mitochondrial complementation as a means of dampening mitochondrial genotype-phenotype links in light of recent experimental developments. The diverse sources of mitochondrial heterogeneity, as well as their increasingly recognised role in contributing to cellular heterogeneity, highlights the need for future single-cell mitochondrial measurements in the context of cellular noise studies.
△ Less
Submitted 18 December, 2018; v1 submitted 6 September, 2018;
originally announced September 2018.
-
Endless love: On the termination of a playground number game
Authors:
Iain G. Johnston
Abstract:
A simple and popular childhood game, `LOVES' or the `Love Calculator', involves an iterated rule applied to a string of digits and gives rise to surprisingly rich behaviour. Traditionally, players' names are used to set the initial conditions for an instance of the game: its behaviour for an exhaustive set of pairings of popular UK childrens' names, and for more general initial conditions, is exam…
▽ More
A simple and popular childhood game, `LOVES' or the `Love Calculator', involves an iterated rule applied to a string of digits and gives rise to surprisingly rich behaviour. Traditionally, players' names are used to set the initial conditions for an instance of the game: its behaviour for an exhaustive set of pairings of popular UK childrens' names, and for more general initial conditions, is examined. Convergence to a fixed outcome (the desired result) is not guaranteed, even for some plausible first name pairings. No pairs of top-50 common first names exhibit non-convergence, suggesting that it is rare in the playground; however, including surnames makes non-convergence more likely due to higher letter counts (for example, `Reese Witherspoon LOVES Calvin Harris'). Different game keywords (including from different languages) are also considered. An estimate for non-convergence propensity is derived: if the sum $m$ of digits in a string of length $w$ obeys $m > 18/(3/2)^{w-4}$, convergence is less likely. Pairs of top UK names with pairs of `O's and several `L's (for example, Chloe and Joseph, or Brooke and Scarlett) often attain high scores. When considering individual names playing with a range of partners, those with no `LOVES' letters score lowest, and names with intermediate (not simply the highest) letter counts often perform best, with Connor and Evie averaging the highest scores when played with other UK top names.
△ Less
Submitted 25 January, 2016;
originally announced February 2016.
-
Stochastic modelling, Bayesian inference, and new in vivo measurements elucidate the debated mtDNA bottleneck mechanism
Authors:
Iain G. Johnston,
Joerg P. Burgstaller,
Vitezslav Havlicek,
Thomas Kolbe,
Thomas Rulicke,
Gottfried Brem,
Jo Poulton,
Nick S. Jones
Abstract:
Dangerous damage to mitochondrial DNA (mtDNA) can be ameliorated during mammalian development through a highly debated mechanism called the mtDNA bottleneck. Uncertainty surrounding this process limits our ability to address inherited mtDNA diseases. We produce a new, physically motivated, generalisable theoretical model for mtDNA populations during development, allowing the first statistical comp…
▽ More
Dangerous damage to mitochondrial DNA (mtDNA) can be ameliorated during mammalian development through a highly debated mechanism called the mtDNA bottleneck. Uncertainty surrounding this process limits our ability to address inherited mtDNA diseases. We produce a new, physically motivated, generalisable theoretical model for mtDNA populations during development, allowing the first statistical comparison of proposed bottleneck mechanisms. Using approximate Bayesian computation and mouse data, we find most statistical support for a combination of binomial partitioning of mtDNAs at cell divisions and random mtDNA turnover, meaning that the debated exact magnitude of mtDNA copy number depletion is flexible. New experimental measurements from a wild-derived mtDNA pairing in mice confirm the theoretical predictions of this model. We analytically solve a mathematical description of this mechanism, computing probabilities of mtDNA disease onset, efficacy of clinical sampling strategies, and effects of potential dynamic interventions, thus develo** a quantitative and experimentally-supported stochastic theory of the bottleneck.
△ Less
Submitted 9 December, 2015;
originally announced December 2015.
-
Closed-form stochastic solutions for non-equilibrium dynamics and inheritance of cellular components over many cell divisions
Authors:
Iain G. Johnston,
Nick S. Jones
Abstract:
Stochastic dynamics govern many important processes in cellular biology, and an underlying theoretical approach describing these dynamics is desirable to address a wealth of questions in biology and medicine. Mathematical tools exist for treating several important examples of these stochastic processes, most notably gene expression, and random partitioning at single cell divisions or after a stead…
▽ More
Stochastic dynamics govern many important processes in cellular biology, and an underlying theoretical approach describing these dynamics is desirable to address a wealth of questions in biology and medicine. Mathematical tools exist for treating several important examples of these stochastic processes, most notably gene expression, and random partitioning at single cell divisions or after a steady state has been reached. Comparatively little work exists exploring different and specific ways that repeated cell divisions can lead to stochastic inheritance of unequilibrated cellular populations. Here we introduce a mathematical formalism to describe cellular agents that are subject to random creation, replication, and/or degradation, and are inherited according to a range of random dynamics at cell divisions. We obtain closed-form generating functions describing systems at any time after any number of cell divisions for binomial partitioning and divisions provoking a deterministic or random, subtractive or additive change in copy number, and show that these solutions agree exactly with stochastic simulation. We apply this general formalism to several example problems involving the dynamics of mitochondrial DNA (mtDNA) during development and organismal lifetimes.
△ Less
Submitted 25 January, 2015;
originally announced January 2015.
-
Explicit tracking of uncertainty increases the power of quantitative rule-of-thumb reasoning in cell biology
Authors:
Iain G. Johnston,
Benjamin C. Rickett,
Nick S. Jones
Abstract:
"Back-of-the-envelope" or "rule-of-thumb" calculations involving rough estimates of quantities play a central scientific role in develo** intuition about the structure and behaviour of physical systems, for example in so-called `Fermi problems' in the physical sciences. Such calculations can be used to powerfully and quantitatively reason about biological systems, particularly at the interface b…
▽ More
"Back-of-the-envelope" or "rule-of-thumb" calculations involving rough estimates of quantities play a central scientific role in develo** intuition about the structure and behaviour of physical systems, for example in so-called `Fermi problems' in the physical sciences. Such calculations can be used to powerfully and quantitatively reason about biological systems, particularly at the interface between physics and biology. However, substantial uncertainties are often associated with values in cell biology, and performing calculations without taking this uncertainty into account may limit the extent to which results can be interpreted for a given problem. We present a means to facilitate such calculations where uncertainties are explicitly tracked through the line of reasoning, and introduce a `probabilistic calculator' called Caladis, a web tool freely available at www.caladis.org, designed to perform this tracking. This approach allows users to perform more statistically robust calculations in cell biology despite having uncertain values, and to identify which quantities need to be measured more precisely in order to make confident statements, facilitating efficient experimental design. We illustrate the use of our tool for tracking uncertainty in several example biological calculations, showing that the results yield powerful and interpretable statistics on the quantities of interest. We also demonstrate that the outcomes of calculations may differ from point estimates when uncertainty is accurately tracked. An integral link between Caladis and the Bionumbers repository of biological quantities further facilitates the straightforward location, selection, and use of a wealth of experimental data in cell biological calculations.
△ Less
Submitted 4 December, 2014;
originally announced December 2014.
-
Phenotypic landscape inference reveals multiple evolutionary paths to C$_4$ photosynthesis
Authors:
Ben P. Williams,
Iain G. Johnston,
Sarah Covshoff,
Julian M. Hibberd
Abstract:
C$_4$ photosynthesis has independently evolved from the ancestral C$_3$ pathway in at least 60 plant lineages, but, as with other complex traits, how it evolved is unclear. Here we show that the polyphyletic appearance of C$_4$ photosynthesis is associated with diverse and flexible evolutionary paths that group into four major trajectories. We conducted a meta-analysis of 18 lineages containing sp…
▽ More
C$_4$ photosynthesis has independently evolved from the ancestral C$_3$ pathway in at least 60 plant lineages, but, as with other complex traits, how it evolved is unclear. Here we show that the polyphyletic appearance of C$_4$ photosynthesis is associated with diverse and flexible evolutionary paths that group into four major trajectories. We conducted a meta-analysis of 18 lineages containing species that use C$_3$, C$_4$, or intermediate C$_3$-C$_4$ forms of photosynthesis to parameterise a 16-dimensional phenotypic landscape. We then developed and experimentally verified a novel Bayesian approach based on a hidden Markov model that predicts how the C$_4$ phenotype evolved. The alternative evolutionary histories underlying the appearance of C$_4$ photosynthesis were determined by ancestral lineage and initial phenotypic alterations unrelated to photosynthesis. We conclude that the order of C$_4$ trait acquisition is flexible and driven by non-photosynthetic drivers. This flexibility will have facilitated the convergent evolution of this complex trait.
△ Less
Submitted 17 September, 2014;
originally announced September 2014.
-
Efficient parametric inference for stochastic biological systems with measured variability
Authors:
Iain G. Johnston
Abstract:
Stochastic systems in biology often exhibit substantial variability within and between cells. This variability, as well as having dramatic functional consequences, provides information about the underlying details of the system's behaviour. It is often desirable to infer properties of the parameters governing such systems given experimental observations of the mean and variance of observed quantit…
▽ More
Stochastic systems in biology often exhibit substantial variability within and between cells. This variability, as well as having dramatic functional consequences, provides information about the underlying details of the system's behaviour. It is often desirable to infer properties of the parameters governing such systems given experimental observations of the mean and variance of observed quantities. In some circumstances, analytic forms for the likelihood of these observations allow very efficient inference: we present these forms and demonstrate their usage. When likelihood functions are unavailable or difficult to calculate, we show that an implementation of approximate Bayesian computation (ABC) is a powerful tool for parametric inference in these systems. However, the calculations required to apply ABC to these systems can also be computationally expensive, relying on repeated stochastic simulations. We propose an ABC approach that cheaply eliminates unimportant regions of parameter space, by addressing computationally simple mean behaviour before explicitly simulating the more computationally demanding variance behaviour. We show that this approach leads to a substantial increase in speed when applied to synthetic and experimental datasets.
△ Less
Submitted 6 November, 2015; v1 submitted 31 March, 2014;
originally announced March 2014.
-
A tractable genotype-phenotype map for the self-assembly of protein quaternary structure
Authors:
Sam F. Greenbury,
Iain G. Johnston,
Ard A. Louis,
Sebastian E. Ahnert
Abstract:
The map** between biological genotypes and phenotypes is central to the study of biological evolution. Here we introduce a rich, intuitive, and biologically realistic genotype-phenotype (GP) map, that serves as a model of self-assembling biological structures, such as protein complexes, and remains computationally and analytically tractable. Our GP map arises naturally from the self-assembly of…
▽ More
The map** between biological genotypes and phenotypes is central to the study of biological evolution. Here we introduce a rich, intuitive, and biologically realistic genotype-phenotype (GP) map, that serves as a model of self-assembling biological structures, such as protein complexes, and remains computationally and analytically tractable. Our GP map arises naturally from the self-assembly of polyomino structures on a 2D lattice and exhibits a number of properties: $\textit{redundancy}$ (genotypes vastly outnumber phenotypes), $\textit{phenotype bias}$ (genotypic redundancy varies greatly between phenotypes), $\textit{genotype component disconnectivity}$ (phenotypes consist of disconnected mutational networks) and $\textit{shape space covering}$ (most phenotypes can be reached in a small number of mutations). We also show that the mutational robustness of phenotypes scales very roughly logarithmically with phenotype redundancy and is positively correlated with phenotypic evolvability. Although our GP map describes the assembly of disconnected objects, it shares many properties with other popular GP maps for connected units, such as models for RNA secondary structure or the HP lattice model for protein tertiary structure. The remarkable fact that these important properties similarly emerge from such different models suggests the possibility that universal features underlie a much wider class of biologically realistic GP maps.
△ Less
Submitted 2 November, 2013;
originally announced November 2013.
-
The chaos within: exploring noise in cellular biology
Authors:
Iain G. Johnston
Abstract:
Cellular biology exists embedded in a world dominated by random dynamics and chance. Many vital molecules and pieces of cellular machinery diffuse within cells, moving along random trajectories as they collide with the other biomolecular inhabitants of the cell. Cellular components may block each other's progress, be produced or degraded at random times, and become unevenly separated as cells grow…
▽ More
Cellular biology exists embedded in a world dominated by random dynamics and chance. Many vital molecules and pieces of cellular machinery diffuse within cells, moving along random trajectories as they collide with the other biomolecular inhabitants of the cell. Cellular components may block each other's progress, be produced or degraded at random times, and become unevenly separated as cells grow and divide. Cellular behaviour, including important features of stem cells, tumours and infectious bacteria, is profoundly influenced by the chaos which is the environment within the cell walls. Here we will look at some important causes and effects of randomness in cellular biology, and some ways in which researchers, helped by the vast amounts of data that are now flowing in, have made progress in describing the randomness of nature.
△ Less
Submitted 10 August, 2012;
originally announced August 2012.
-
Epistasis can lead to fragmented neutral spaces and contingency in evolution
Authors:
Steffen Schaper,
Iain G. Johnston,
Ard A. Louis
Abstract:
In evolution, the effects of a single deleterious mutation can sometimes be compensated for by a second mutation which recovers the original phenotype. Such epistatic interactions have implications for the structure of genome space - namely, that networks of genomes encoding the same phenotype may not be connected by single mutational moves. We use the folding of RNA sequences into secondary struc…
▽ More
In evolution, the effects of a single deleterious mutation can sometimes be compensated for by a second mutation which recovers the original phenotype. Such epistatic interactions have implications for the structure of genome space - namely, that networks of genomes encoding the same phenotype may not be connected by single mutational moves. We use the folding of RNA sequences into secondary structures as a model genotype-phenotype map and explore the neutral spaces corresponding to networks of genotypes with the same phenotype. In most of these networks, we find that it is not possible to connect all genotypes to one another by single point mutations. Instead, a network for a phenotypic structure with $n$ bonds typically fragments into at least $2^n$ neutral components, often of similar size. While components of the same network generate the same phenotype, they show important variations in their properties, most strikingly in their evolvability and mutational robustness. This heterogeneity implies contingency in the evolutionary process.
△ Less
Submitted 6 December, 2011; v1 submitted 4 August, 2011;
originally announced August 2011.
-
Mitochondrial Variability as a Source of Extrinsic Cellular Noise
Authors:
Iain G. Johnston,
Bernadett Gaal,
Ricardo Pires das Neves,
Tariq Enver,
Francisco J. Iborra,
Nick S. Jones
Abstract:
We present a study investigating the role of mitochondrial variability in generating noise in eukaryotic cells. Noise in cellular physiology plays an important role in many fundamental cellular processes, including transcription, translation, stem cell differentiation and response to medication, but the specific random influences that affect these processes have yet to be clearly elucidated. Here…
▽ More
We present a study investigating the role of mitochondrial variability in generating noise in eukaryotic cells. Noise in cellular physiology plays an important role in many fundamental cellular processes, including transcription, translation, stem cell differentiation and response to medication, but the specific random influences that affect these processes have yet to be clearly elucidated. Here we present a mechanism by which variability in mitochondrial volume and functionality, along with cell cycle dynamics, is linked to variability in transcription rate and hence has a profound effect on downstream cellular processes. Our model mechanism is supported by an appreciable volume of recent experimental evidence, and we present the results of several new experiments with which our model is also consistent. We find that noise due to mitochondrial variability can sometimes dominate over other extrinsic noise sources (such as cell cycle asynchronicity) and can significantly affect large-scale observable properties such as cell cycle length and gene expression levels. We also explore two recent regulatory network-based models for stem cell differentiation, and find that extrinsic noise in transcription rate causes appreciable variability in the behaviour of these model systems. These results suggest that mitochondrial and transcriptional variability may be an important mechanism influencing a large variety of cellular processes and properties.
△ Less
Submitted 1 December, 2011; v1 submitted 22 July, 2011;
originally announced July 2011.
-
Evolutionary Dynamics in a Simple Model of Self-Assembly
Authors:
Iain G. Johnston,
Sebastian A. Ahnert,
Jonathan P. K. Doye,
Ard A. Louis
Abstract:
We investigate the evolutionary dynamics of an idealised model for the robust self-assembly of two-dimensional structures called polyominoes. The model includes rules that encode interactions between sets of square tiles that drive the self-assembly process. The relationship between the model's rule set and its resulting self-assembled structure can be viewed as a genotype-phenotype map and incorp…
▽ More
We investigate the evolutionary dynamics of an idealised model for the robust self-assembly of two-dimensional structures called polyominoes. The model includes rules that encode interactions between sets of square tiles that drive the self-assembly process. The relationship between the model's rule set and its resulting self-assembled structure can be viewed as a genotype-phenotype map and incorporated into a genetic algorithm. The rule sets evolve under selection for specified target structures. The corresponding, complex fitness landscape generates rich evolutionary dynamics as a function of parameters such as the population size, search space size, mutation rate, and method of recombination. Furthermore, these systems are simple enough that in some cases the associated model genome space can be completely characterised, shedding light on how the evolutionary dynamics depends on the detailed structure of the fitness landscape. Finally, we apply the model to study the emergence of the preference for dihedral over cyclic symmetry observed for homomeric protein tetramers.
△ Less
Submitted 28 February, 2011;
originally announced February 2011.
-
The effect of scale-free topology on the robustness and evolvability of genetic regulatory networks
Authors:
Sam F. Greenbury,
Iain G. Johnston,
Matthew A. Smith,
Jonathan P. K. Doye,
Ard A. Louis
Abstract:
We investigate how scale-free (SF) and Erdos-Renyi (ER) topologies affect the interplay between evolvability and robustness of model gene regulatory networks with Boolean threshold dynamics. In agreement with Oikonomou and Cluzel (2006) we find that networks with SFin topologies, that is SF topology for incoming nodes and ER topology for outgoing nodes, are significantly more evolvable towards spe…
▽ More
We investigate how scale-free (SF) and Erdos-Renyi (ER) topologies affect the interplay between evolvability and robustness of model gene regulatory networks with Boolean threshold dynamics. In agreement with Oikonomou and Cluzel (2006) we find that networks with SFin topologies, that is SF topology for incoming nodes and ER topology for outgoing nodes, are significantly more evolvable towards specific oscillatory targets than networks with ER topology for both incoming and outgoing nodes. Similar results are found for networks with SFboth and SFout topologies. The functionality of the SFout topology, which most closely resembles the structure of biological gene networks (Babu et al., 2004), is compared to the ER topology in further detail through an extension to multiple target outputs, with either an oscillatory or a non-oscillatory nature. For multiple oscillatory targets of the same length, the differences between SFout and ER networks are enhanced, but for non-oscillatory targets both types of networks show fairly similar evolvability. We find that SF networks generate oscillations much more easily than ER networks do, and this may explain why SF networks are more evolvable than ER networks are for oscillatory phenotypes. In spite of their greater evolvability, we find that networks with SFout topologies are also more robust to mutations than ER networks. Furthermore, the SFout topologies are more robust to changes in initial conditions (environmental robustness). For both topologies, we find that once a population of networks has reached the target state, further neutral evolution can lead to an increase in both the mutational robustness and the environmental robustness to changes in initial conditions.
△ Less
Submitted 24 May, 2010;
originally announced May 2010.
-
Self-assembly, modularity and physical complexity
Authors:
S. E. Ahnert,
I. G. Johnston,
T. M. A. Fink,
J. P. K. Doye,
A. A. Louis
Abstract:
We present a quantitative measure of physical complexity, based on the amount of information required to build a given physical structure through self-assembly. Our procedure can be adapted to any given geometry, and thus to any given type of physical system. We illustrate our approach using self-assembling polyominoes, and demonstrate the breadth of its potential applications by quantifying the…
▽ More
We present a quantitative measure of physical complexity, based on the amount of information required to build a given physical structure through self-assembly. Our procedure can be adapted to any given geometry, and thus to any given type of physical system. We illustrate our approach using self-assembling polyominoes, and demonstrate the breadth of its potential applications by quantifying the physical complexity of molecules and protein complexes. This measure is particularly well suited for the detection of symmetry and modularity in the underlying structure, and allows for a quantitative definition of structural modularity. Furthermore we use our approach to show that symmetric and modular structures are favoured in biological self-assembly, for example of protein complexes. Lastly, we also introduce the notions of joint, mutual and conditional complexity, which provide a useful distance measure between physical structures.
△ Less
Submitted 12 January, 2010; v1 submitted 17 December, 2009;
originally announced December 2009.
-
Modelling the Self-Assembly of Virus Capsids
Authors:
Iain G. Johnston,
Ard A. Louis,
Jonathan P. K. Doye
Abstract:
We use computer simulations to study a model, first proposed by Wales [1], for the reversible and monodisperse self-assembly of simple icosahedral virus capsid structures. The success and efficiency of assembly as a function of thermodynamic and geometric factors can be qualitatively related to the potential energy landscape structure of the assembling system. Even though the model is strongly c…
▽ More
We use computer simulations to study a model, first proposed by Wales [1], for the reversible and monodisperse self-assembly of simple icosahedral virus capsid structures. The success and efficiency of assembly as a function of thermodynamic and geometric factors can be qualitatively related to the potential energy landscape structure of the assembling system. Even though the model is strongly coarse-grained, it exhibits a number of features also observed in experiments, such as sigmoidal assembly dynamics, hysteresis in capsid formation and numerous kinetic traps. We also investigate the effect of macromolecular crowding on the assembly dynamics. Crowding agents generally reduce capsid yields at optimal conditions for non-crowded assembly, but may increase yields for parameter regimes away from the optimum. Finally, we generalize the model to a larger triangulation number T = 3, and observe more complex assembly dynamics than that seen for the original T = 1 model.
△ Less
Submitted 10 October, 2009;
originally announced October 2009.
-
The self-assembly of DNA Holliday junctions studied with a minimal model
Authors:
Thomas E. Ouldridge,
Iain G. Johnston,
Ard A. Louis,
Jonathan P. K. Doye
Abstract:
In this paper, we explore the feasibility of using coarse-grained models to simulate the self-assembly of DNA nanostructures. We introduce a simple model of DNA where each nucleotide is represented by two interaction sites corresponding to the phosphate-sugar backbone and the base. Using this model, we are able to simulate the self-assembly of both DNA duplexes and Holliday junctions from single…
▽ More
In this paper, we explore the feasibility of using coarse-grained models to simulate the self-assembly of DNA nanostructures. We introduce a simple model of DNA where each nucleotide is represented by two interaction sites corresponding to the phosphate-sugar backbone and the base. Using this model, we are able to simulate the self-assembly of both DNA duplexes and Holliday junctions from single-stranded DNA. We find that assembly is most successful in the temperature window below the melting temperatures of the target structure and above the melting temperature of misbonded aggregates. Furthermore, in the case of the Holliday junction, we show how a hierarchical assembly mechanism reduces the possibility of becoming trapped in misbonded configurations. The model is also able to reproduce the relative melting temperatures of different structures accurately, and allows strand displacement to occur.
△ Less
Submitted 22 December, 2008; v1 submitted 21 July, 2008;
originally announced July 2008.