-
Concepts and methods for predicting viral evolution
Authors:
Matthijs Meijers,
Denis Ruchnewitz,
Jan Eberhardt,
Malancha Karmakar,
Marta Łuksza,
Michael Lässig
Abstract:
The seasonal human influenza virus undergoes rapid evolution, leading to significant changes in circulating viral strains from year to year. These changes are typically driven by adaptive mutations, particularly in the antigenic epitopes, the regions of the viral surface protein haemagglutinin targeted by human antibodies. Here we describe a consistent set of methods for data-driven predictive analysis of viral evolution. Our pipeline integrates four types of data: (1) sequence data of viral isolates collected on a worldwide scale, (2) epidemiological data on incidences, (3) antigenic characterization of circulating viruses, and (4) intrinsic viral phenotypes. From the combined analysis of these data, we obtain estimates of relative fitness for circulating strains and predictions of clade frequencies for periods of up to one year. Furthermore, we obtain comparative estimates of protection against future viral populations for candidate vaccine strains, providing a basis for pre-emptive vaccine strain selection. Continuously updated predictions obtained from the prediction pipeline for influenza and SARS-CoV-2 are available on the website https://previr.app.
Submitted 2 May, 2024; v1 submitted 19 March, 2024;
originally announced March 2024.
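A minimal sketch of how relative fitness estimates can translate into clade frequency predictions, assuming each clade grows exponentially at a constant rate over the prediction period. The function name, clade labels, and fitness values are hypothetical illustrations and are not taken from the paper's pipeline.

```python
import math

def predict_clade_frequencies(freqs, fitness, horizon_years):
    """Project clade frequencies forward under constant relative fitness.

    freqs:   dict clade -> current frequency (expected to sum to 1)
    fitness: dict clade -> growth rate per year (hypothetical values)
    """
    # Each clade's weight grows exponentially with its fitness.
    weights = {c: x * math.exp(fitness[c] * horizon_years)
               for c, x in freqs.items()}
    # Renormalise so that the predicted frequencies sum to 1.
    total = sum(weights.values())
    return {c: w / total for c, w in weights.items()}

# Hypothetical example: clade B has a fitness advantage of 2 per year.
current = {"A": 0.7, "B": 0.3}
rates = {"A": 0.0, "B": 2.0}
predicted = predict_clade_frequencies(current, rates, 1.0)
```

Only fitness differences between clades matter here: adding the same constant to every rate leaves the predicted frequencies unchanged.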
-
Nonequilibrium antigen recognition during infections and vaccinations
Authors:
Roberto Morán-Tovar,
Michael Lässig
Abstract:
The immune response to an acute primary infection is a coupled process of antigen proliferation, molecular recognition by naive B cells, and their subsequent proliferation and antibody shedding. This process contains a fundamental problem: the recognition of an exponentially time-dependent antigen signal. Here we show that B cells can efficiently recognise new antigens by a tuned kinetic proofreading mechanism, where the molecular recognition machinery is adapted to the complexity of the immune repertoire. This process produces potent, specific and fast recognition of antigens, maintaining a spectrum of genetically distinct B cell lineages as input for affinity maturation. We show that the proliferation-recognition dynamics of a primary infection is a generalised Luria-Delbrück process, akin to the dynamics of the classic fluctuation experiment. This map establishes a link between signal recognition dynamics and evolution. We derive the resulting statistics of the activated immune repertoire: antigen binding affinity, expected size, and frequency of active B cell clones are related by power laws, which define the class of generalised Luria-Delbrück processes. Their exponents depend on the antigen and B cell proliferation rate, the number of proofreading steps, and the lineage density of the naive repertoire. We extend the model to include spatio-temporal processes, including the diffusion-recognition dynamics of a vaccination. Empirical data of activated mouse immune repertoires are found to be consistent with activation involving about three proofreading steps. The model predicts key clinical characteristics of acute infections and vaccinations, including the emergence of elite neutralisers and the effects of immune ageing. More broadly, our results establish infections and vaccinations as a new probe into the global architecture and functional principles of immune repertoires.
Submitted 27 February, 2024; v1 submitted 5 April, 2023;
originally announced April 2023.
-
Vaccination shapes evolutionary trajectories of SARS-CoV-2
Authors:
Matthijs Meijers,
Denis Ruchnewitz,
Marta Łuksza,
Michael Lässig
Abstract:
The large-scale evolution of the SARS-CoV-2 virus has been marked by rapid turnover of genetic clades. New variants show intrinsic changes, notably increased transmissibility, as well as antigenic changes that reduce the cross-immunity induced by previous infections or vaccinations. How this functional variation shapes the global evolutionary dynamics has remained unclear. Here we show that selection induced by vaccination impacts on the recent antigenic evolution of SARS-CoV-2; other relevant forces include intrinsic selection and antigenic selection induced by previous infections. We obtain these results from a fitness model with intrinsic and antigenic fitness components. To infer model parameters, we combine time-resolved sequence data, epidemiological records, and cross-neutralisation assays. This model accurately captures the large-scale evolutionary dynamics of SARS-CoV-2 in multiple geographical regions. In particular, it quantifies how recent vaccinations and infections affect the speed of frequency shifts between viral variants. Our results show that timely neutralisation data can be harvested to identify hotspots of antigenic selection and to predict the impact of vaccination on viral evolution.
Submitted 19 July, 2022;
originally announced July 2022.
-
Adaptive ratchets and the evolution of molecular complexity
Authors:
Tom Röschinger,
Roberto Morán Tovar,
Simone Pompei,
Michael Lässig
Abstract:
Biological systems have evolved to amazingly complex states, yet we do not understand in general how evolution operates to generate increasing genetic and functional complexity. Molecular recognition sites are short genome segments or peptides binding a cognate recognition target of sufficient sequence similarity. Such sites are simple, ubiquitous modules of sequence information, cellular function, and evolution. Here we show that recognition sites, if coupled to a time-dependent target, can rapidly evolve to complex states with larger code length and smaller coding density than sites recognising a static target. The underlying fitness model contains selection for recognition, which depends on the sequence similarity between site and target, and a uniform cost per unit of code length. Site sequences are shown to evolve in a specific adaptive ratchet, which produces selection of different strength for code extensions and compressions. Ratchet evolution increases the adaptive width of evolved sites, accelerating the adaptation to moving targets and facilitating refinement and innovation of recognition functions. We apply these results to the recognition of fast-evolving antigens by the human immune system. Our analysis shows how molecular complexity can evolve as a collateral to selection for function in a dynamic environment.
Submitted 18 November, 2021;
originally announced November 2021.
-
Stochasticity of infectious outbreaks and consequences for optimal interventions
Authors:
Roberto Morán-Tovar,
Henning Gruell,
Florian Klein,
Michael Lässig
Abstract:
Global strategies to contain a pandemic, such as social distancing and protective measures, are designed to reduce the overall transmission rate between individuals. Despite such measures, essential institutions, including hospitals, schools, and food producing plants, remain focal points of local outbreaks. Here we develop a model for the stochastic outbreak dynamics in such local communities. We derive analytical expressions for the probability of containment of the outbreak, which is complementary to the probability of seeding a deterministically growing epidemic. This probability depends on the statistics of the intra-community contact network and the initial conditions, in particular, on the contact degree of patient zero. Based on this model, we suggest surveillance protocols by which individuals are tested proportionally to their degree in the contact network. We characterize the efficacy of contact-based protocols as a function of the epidemiological and the contact network parameters, and show numerically that such protocols outperform random testing.
Submitted 31 July, 2022; v1 submitted 2 August, 2021;
originally announced August 2021.
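The containment probability described in this abstract can be illustrated with a much simpler stand-in: a Galton-Watson branching process in which each case infects a Poisson-distributed number of others. This is an assumption made for illustration, not the contact-network model of the paper; the function name and parameter values are hypothetical. In this simplified setting, the probability that an outbreak seeded by one case dies out is the smallest solution q of q = exp(-R (1 - q)), where R is the mean number of secondary cases.

```python
import math

def containment_probability(r_mean, tol=1e-12, max_iter=10000):
    """Extinction probability of a Poisson branching process.

    r_mean: mean number of secondary infections per case (hypothetical input).
    Iterating from q = 0 converges to the smallest fixed point of
    q = exp(-r_mean * (1 - q)).
    """
    q = 0.0
    for _ in range(max_iter):
        q_next = math.exp(-r_mean * (1.0 - q))
        if abs(q_next - q) < tol:
            return q_next
        q = q_next
    return q  # best estimate if the tolerance was not reached

# Hypothetical values: one subcritical and one supercritical outbreak.
q_sub = containment_probability(0.5)
q_super = containment_probability(2.0)
```

For R below 1 the iteration approaches 1, meaning the outbreak is almost surely contained; for R above 1 a finite probability of a growing epidemic remains. With n independent initial cases, the containment probability in this sketch would be q to the power n.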
-
Antigenic waves of virus-immune co-evolution
Authors:
Jacopo Marchi,
Michael Lässig,
Aleksandra M. Walczak,
Thierry Mora
Abstract:
The evolution of many microbes and pathogens, including circulating viruses such as seasonal influenza, is driven by immune pressure from the host population. In turn, the immune systems of infected populations get updated, chasing viruses even further away. Quantitatively understanding how these dynamics result in observed patterns of rapid pathogen and immune adaptation is instrumental to epidemiological and evolutionary forecasting. Here we present a mathematical theory of co-evolution between immune systems and viruses in a finite-dimensional antigenic space, which describes the cross-reactivity of viral strains and immune systems primed by previous infections. We show the emergence of an antigenic wave that is pushed forward and canalized by cross-reactivity. We obtain analytical results for shape, speed, and angular diffusion of the wave. In particular, we show that viral-immune co-evolution generates a new emergent timescale, the persistence time of the wave's direction in antigenic space, which can be much longer than the coalescence time of the viral population. We compare these dynamics to the observed antigenic turnover of influenza strains, and we discuss how the dimensionality of antigenic space impacts on the predictability of the evolutionary dynamics. Our results provide a concrete and tractable framework to describe pathogen-host co-evolution.
Submitted 7 May, 2021; v1 submitted 20 February, 2021;
originally announced February 2021.
-
Predicting in vivo escape dynamics of HIV-1 from a broadly neutralizing antibody
Authors:
Matthijs Meijers,
Kanika Vanshylla,
Henning Gruell,
Florian Klein,
Michael Laessig
Abstract:
Broadly neutralizing antibodies are promising candidates for treatment and prevention of HIV-1 infections. Such antibodies can temporarily suppress viral load in infected individuals; however, the virus often rebounds by escape mutants that have evolved resistance. In this paper, we map an in vivo fitness landscape of HIV-1 interacting with broadly neutralizing antibodies, using data from a recent clinical trial. We identify two fitness factors, antibody dosage and viral load, that determine viral reproduction rates reproducibly across different hosts. The model successfully predicts the escape dynamics of HIV-1 in the course of an antibody treatment, including a characteristic frequency turnover between sensitive and resistant strains. This turnover is governed by a dosage-dependent fitness ranking, resulting from an evolutionary tradeoff between antibody resistance and its collateral cost in drug-free growth. Our analysis suggests resistance-cost tradeoff curves as a measure of antibody performance in the presence of resistance evolution.
Submitted 6 August, 2020;
originally announced August 2020.
-
Predicting trajectories and mechanisms of antibiotic resistance evolution
Authors:
Fernanda Pinheiro,
Omar Warsi,
Dan I. Andersson,
Michael Lässig
Abstract:
Bacteria evolve resistance to antibiotics by a multitude of mechanisms. A central, yet unsolved question is how resistance evolution affects cell growth at different drug levels. Here we develop a fitness model that predicts growth rates of common resistance mutants from their effects on cell metabolism. We map metabolic effects of resistance mutations in drug-free environments and under drug challenge; the resulting fitness trade-off defines a Pareto surface of resistance evolution. We predict evolutionary trajectories of dosage-dependent growth rates and resistance levels, as well as the prevalent resistance mechanism depending on drug and nutrient levels. These predictions are confirmed by empirical growth curves and genomic data of E. coli populations. Our results show that resistance evolution, by coupling major metabolic pathways, is strongly intertwined with systems biology and ecology of microbial populations.
Submitted 2 July, 2020;
originally announced July 2020.
-
Adaptive evolution of hybrid bacteria by horizontal gene transfer
Authors:
Jeffrey J. Power,
Fernanda Pinheiro,
Simone Pompei,
Viera Kovacova,
Melih Yüksel,
Isabel Rathmann,
Mona Förster,
Michael Lässig,
Berenike Maier
Abstract:
Horizontal gene transfer is an important factor in bacterial evolution that can act across species boundaries. Yet, we know little about rate and genomic targets of cross-lineage gene transfer, and about its effects on the recipient organism's physiology and fitness. Here, we address these questions in a parallel evolution experiment with two Bacillus subtilis lineages of 7% sequence divergence. We observe rapid evolution of hybrid organisms: gene transfer swaps ~12% of the core genome in just 200 generations, and 60% of core genes are replaced in at least one population. By genomics, transcriptomics, fitness assays, and statistical modeling, we show that transfer generates adaptive evolution and functional alterations in hybrids. Specifically, our experiments reveal a strong, repeatable fitness increase of evolved populations in the stationary growth phase. By genomic analysis of the transfer statistics across replicate populations, we infer that selection on HGT has a broad genetic basis: 40% of the observed transfers are adaptive. At the level of functional gene networks, we find signatures of negative and positive selection, consistent with hybrid incompatibilities and adaptive evolution of network functions. Our results suggest that gene transfer navigates a complex cross-lineage fitness landscape, bridging epistatic barriers along multiple high-fitness paths.
Submitted 23 April, 2020;
originally announced April 2020.
-
Multi-lineage evolution in viral populations driven by host immune systems
Authors:
Jacopo Marchi,
Michael Lässig,
Thierry Mora,
Aleksandra M. Walczak
Abstract:
Viruses evolve in the background of host immune systems that exert selective pressure and drive viral evolutionary trajectories. This interaction leads to different evolutionary patterns in antigenic space. Examples observed in nature include the effectively one-dimensional escape characteristic of influenza A and the prolonged coexistence of lineages in influenza B. Here we use an evolutionary model for viruses in the presence of immune host systems with finite memory to delineate parameter regimes of these patterns in a two-dimensional antigenic space. We find that for small effective mutation rates and mutation jump ranges, a single lineage is the only stable solution. Large effective mutation rates combined with large mutational jumps in antigenic space lead to multiple stably co-existing lineages over prolonged evolutionary periods. These results combined with observations from data constrain the parameter regimes for the adaptation of viruses, including influenza.
Submitted 18 June, 2019;
originally announced June 2019.
-
Survival of the simplest: the cost of complexity in microbial evolution
Authors:
Torsten Held,
Daniel Klemmer,
Michael Lässig
Abstract:
The evolution of microbial and viral organisms often generates clonal interference, a mode of competition between genetic clades within a population. In this paper, we show that interference strongly constrains the genetic and phenotypic complexity of evolving systems. Our analysis uses biophysically grounded evolutionary models for an organism's quantitative molecular phenotypes, such as fold stability and enzymatic activity of genes. We find a generic mode of asexual evolution called phenotypic interference with strong implications for systems biology: it couples the stability and function of individual genes to the population's global speed of evolution. This mode occurs over a wide range of evolutionary parameters appropriate for microbial populations. It generates selection against genome complexity, because the fitness cost of mutations increases faster than linearly with the number of genes. Recombination can generate a distinct mode of sexual evolution that eliminates the superlinear cost. We show that positive selection can drive a transition from asexual to facultative sexual evolution, providing a specific, biophysically grounded scenario for the evolution of sex. In a broader context, our analysis suggests that the systems biology of microbial organisms is strongly intertwined with their mode of evolution.
Submitted 22 March, 2018;
originally announced March 2018.
-
The asexual genome of Drosophila
Authors:
Stephan Schiffels,
Ville Mustonen,
Michael Lässig
Abstract:
The rate of recombination affects the mode of molecular evolution. In high-recombining sequence, the targets of selection are individual genetic loci; under low recombination, selection collectively acts on large, genetically linked genomic segments. Selection under linkage can induce clonal interference, a specific mode of evolution by competition of genetic clades within a population. This mode is well known in asexually evolving microbes, but has not been traced systematically in an obligate sexual organism. Here we show that the Drosophila genome is partitioned into two modes of evolution: a local interference regime with limited effects of genetic linkage, and an interference condensate with clonal competition. We map these modes by differences in mutation frequency spectra, and we show that the transition between them occurs at a threshold recombination rate that is predictable from genomic summary statistics. We find the interference condensate in segments of low-recombining sequence that are located primarily in chromosomal regions flanking the centromeres and cover about 20% of the Drosophila genome. Condensate regions have characteristics of asexual evolution that impact gene function: the efficacy of selection and the speed of evolution are lower and the genetic load is higher than in regions of local interference. Our results suggest that multicellular eukaryotes can harbor heterogeneous modes and tempi of evolution within one genome. We argue that this variation generates selection on genome architecture.
Submitted 29 November, 2017;
originally announced November 2017.
-
Pervasive adaptation of gene expression in Drosophila
Authors:
Armita Nourmohammad,
Joachim Rambeau,
Torsten Held,
Johannes Berg,
Michael Lassig
Abstract:
Gene expression levels are important molecular quantitative traits that link genotypes to molecular functions and fitness. In Drosophila, population-genetic studies in recent years have revealed substantial adaptive evolution at the genomic level. However, the evolutionary modes of gene expression have remained controversial. Here we present evidence that adaptation dominates the evolution of gene expression levels in flies. We show that 63% of the observed expression divergence across seven Drosophila species are adaptive changes driven by directional selection. Our results are derived from the variation of expression within species and the time-resolved divergence across a family of related species, using a new inference method for selection. We identify functional classes of adaptively regulated genes, as well as sex-specific adaptation occurring predominantly in males. Our analysis opens a new avenue to map system-wide selection on molecular quantitative traits independently of their genetic basis.
Submitted 2 April, 2015; v1 submitted 23 February, 2015;
originally announced February 2015.
-
Epidemiological and evolutionary analysis of the 2014 Ebola virus outbreak
Authors:
Marta Łuksza,
Trevor Bedford,
Michael Lässig
Abstract:
The 2014 epidemic of the Ebola virus is governed by a genetically diverse viral population. In the early Sierra Leone outbreak, a recent study has identified new mutations that generate genetically distinct sequence clades. Here we find evidence that major Sierra Leone clades have systematic differences in growth rate and reproduction number. If this growth heterogeneity remains stable, it will generate major shifts in clade frequencies and influence the overall epidemic dynamics on time scales within the current outbreak. Our method is based on simple summary statistics of clade growth, which can be inferred from genealogical trees with an underlying clade-specific birth-death model of the infection dynamics. This method can be used to perform real-time tracking of an evolving epidemic and identify emerging clades of epidemiological or evolutionary significance.
Submitted 6 November, 2014;
originally announced November 2014.
-
Rate and cost of adaptation in the Drosophila Genome
Authors:
Stephan Schiffels,
Michael Lässig,
Ville Mustonen
Abstract:
Recent studies have consistently inferred high rates of adaptive molecular evolution between Drosophila species. At the same time, the Drosophila genome evolves under different rates of recombination, which results in partial genetic linkage between alleles at neighboring genomic loci. Here we analyze how linkage correlations affect adaptive evolution. We develop a new inference method for adaptation that takes into account the effect on an allele at a focal site caused by neighboring deleterious alleles (background selection) and by neighboring adaptive substitutions (hitchhiking). Using complete genome sequence data and fine-scale recombination maps, we infer a highly heterogeneous scenario of adaptation in Drosophila. In high-recombining regions, about 50% of all amino acid substitutions are adaptive, together with about 20% of all substitutions in proximal intergenic regions. In low-recombining regions, only a small fraction of the amino acid substitutions are adaptive, while hitchhiking accounts for the majority of these changes. Hitchhiking of deleterious alleles generates a substantial collateral cost of adaptation, leading to a fitness decline of about 30/2N per gene and per million years in the lowest-recombining regions. Our results show how recombination shapes rate and efficacy of the adaptive dynamics in eukaryotic genomes.
Submitted 5 September, 2014;
originally announced September 2014.
-
Multiple-line inference of selection on quantitative traits
Authors:
Nico Riedel,
Bhavin S. Khatri,
Michael Lässig,
Johannes Berg
Abstract:
Trait differences between species may be attributable to natural selection. However, quantifying the strength of evidence for selection acting on a particular trait is a difficult task. Here we develop a population-genetic test for selection acting on a quantitative trait which is based on multiple-line crosses. We show that using multiple lines increases both the power and the scope of selection inference. First, a test based on three or more lines detects selection with strongly increased statistical significance, and we show explicitly how the sensitivity of the test depends on the number of lines. Second, a multiple-line test allows one to distinguish different lineage-specific selection scenarios. Our analytical results are complemented by extensive numerical simulations. We then apply the multiple-line test to QTL data on floral character traits in plant species of the Mimulus genus and on photoperiodic traits in different maize strains, where we find signatures of lineage-specific selection not seen in a two-line test.
Submitted 6 July, 2015; v1 submitted 7 May, 2014;
originally announced May 2014.
-
Adaptive evolution of molecular phenotypes
Authors:
Torsten Held,
Armita Nourmohammad,
Michael Lässig
Abstract:
Molecular phenotypes link genomic information with organismic functions, fitness, and evolution. Quantitative traits are complex phenotypes that depend on multiple genomic loci. In this paper, we study the adaptive evolution of a quantitative trait under time-dependent selection, which arises from environmental changes or through fitness interactions with other co-evolving phenotypes. We analyze a model of trait evolution under mutations and genetic drift in a single-peak fitness seascape. The fitness peak performs a constrained random walk in the trait amplitude, which determines the time-dependent trait optimum in a given population. We derive analytical expressions for the distribution of the time-dependent trait divergence between populations and of the trait diversity within populations. Based on this solution, we develop a method to infer adaptive evolution of quantitative traits. Specifically, we show that the ratio of the average trait divergence and the diversity is a universal function of evolutionary time, which predicts the stabilizing strength and the driving rate of the fitness seascape. From an information-theoretic point of view, this function measures the macro-evolutionary entropy in a population ensemble, which determines the predictability of the evolutionary process. Our solution also quantifies two key characteristics of adapting populations: the cumulative fitness flux, which measures the total amount of adaptation, and the adaptive load, which is the fitness cost due to a population's lag behind the fitness peak.
Submitted 7 March, 2014;
originally announced March 2014.
-
Universality and predictability in molecular quantitative genetics
Authors:
Armita Nourmohammad,
Torsten Held,
Michael Lässig
Abstract:
Molecular traits, such as gene expression levels or protein binding affinities, are increasingly accessible to quantitative measurement by modern high-throughput techniques. Such traits measure molecular functions and, from an evolutionary point of view, are important as targets of natural selection. We review recent developments in evolutionary theory and experiments that are expected to become building blocks of a quantitative genetics of molecular traits. We focus on universal evolutionary characteristics: these are largely independent of a trait's genetic basis, which is often at least partially unknown. We show that universal measurements can be used to infer selection on a quantitative trait, which determines its evolutionary mode of conservation or adaptation. Furthermore, universality is closely linked to predictability of trait evolution across lineages. We argue that universal trait statistics extends over a range of cellular scales and opens new avenues of quantitative evolutionary systems biology.
Submitted 14 November, 2013; v1 submitted 12 September, 2013;
originally announced September 2013.
-
Evolution of molecular phenotypes under stabilizing selection
Authors:
Armita Nourmohammad,
Stephan Schiffels,
Michael Laessig
Abstract:
Molecular phenotypes are important links between genomic information and organismic functions, fitness, and evolution. Complex phenotypes, which are also called quantitative traits, often depend on multiple genomic loci. Their evolution builds on genome evolution in a complicated way, which involves selection, genetic drift, mutations and recombination. Here we develop a coarse-grained evolutionary statistics for phenotypes, which decouples from details of the underlying genotypes. We derive approximate evolution equations for the distribution of phenotype values within and across populations. This dynamics covers evolutionary processes at high and low recombination rates, that is, it applies to sexual and asexual populations. In a fitness landscape with a single optimal phenotype value, the phenotypic diversity within populations and the divergence between populations reach evolutionary equilibria, which describe stabilizing selection. We compute the equilibrium distributions of both quantities analytically and we show that the ratio of mean divergence and diversity depends on the strength of selection in a universal way: it is largely independent of the phenotype's genomic encoding and of the recombination rate. This establishes a new method for the inference of selection on molecular phenotypes beyond the genome level. We discuss the implications of our findings for the predictability of evolutionary processes.
Submitted 16 January, 2013;
originally announced January 2013.
-
Formation of regulatory modules by local sequence duplication
Authors:
Armita Nourmohammad,
Michael Laessig
Abstract:
Turnover of regulatory sequence and function is an important part of molecular evolution. But what are the modes of sequence evolution leading to rapid formation and loss of regulatory sites? Here, we show that a large fraction of neighboring transcription factor binding sites in the fly genome have formed from a common sequence origin by local duplications. This mode of evolution is found to produce regulatory information: duplications can seed new sites in the neighborhood of existing sites. Duplicate seeds evolve subsequently by point mutations, often towards binding a different factor than their ancestral neighbor sites. These results are based on a statistical analysis of 346 cis-regulatory modules in the Drosophila melanogaster genome, and a comparison set of intergenic regulatory sequence in Saccharomyces cerevisiae. In fly regulatory modules, pairs of binding sites show significantly enhanced sequence similarity up to distances of about 50 bp. We analyze these data in terms of an evolutionary model with two distinct modes of site formation: (i) evolution from independent sequence origin and (ii) divergent evolution following duplication of a common ancestor sequence. Our results suggest that pervasive formation of binding sites by local sequence duplications distinguishes the complex regulatory architecture of higher eukaryotes from the simpler architecture of unicellular organisms.
Submitted 24 May, 2011;
originally announced May 2011.
-
Significance analysis and statistical mechanics: an application to clustering
Authors:
Marta Łuksza,
Michael Lässig,
Johannes Berg
Abstract:
This paper addresses the statistical significance of structures in random data: Given a set of vectors and a measure of mutual similarity, how likely does a subset of these vectors form a cluster with enhanced similarity among its elements? The computation of this cluster p-value for randomly distributed vectors is mapped onto a well-defined problem of statistical mechanics. We solve this problem analytically, establishing a connection between the physics of quenched disorder and multiple testing statistics in clustering and related problems. In an application to gene expression data, we find a remarkable link between the statistical significance of a cluster and the functional relationships between its genes.
Submitted 13 September, 2010;
originally announced September 2010.
-
From Protein Interactions to Functional Annotation: Graph Alignment in Herpes
Authors:
Michal Kolář,
Michael Lässig,
Johannes Berg
Abstract:
Sequence alignment forms the basis of many methods for functional annotation by phylogenetic comparison, but becomes unreliable in the 'twilight' regions of high sequence divergence and short gene length. Here we perform a cross-species comparison of two herpesviruses, VZV and KSHV, with a hybrid method called graph alignment. The method is based jointly on the similarity of protein interaction networks and on sequence similarity. In our alignment, we find open reading frames for which interaction similarity concurs with a low level of sequence similarity, thus confirming the evolutionary relationship. In addition, we find high levels of interaction similarity between open reading frames without any detectable sequence similarity. The functional predictions derived from this alignment are consistent with genomic position and gene expression data.
Submitted 9 July, 2007;
originally announced July 2007.
-
Bayesian analysis of biological networks: clusters, motifs, cross-species correlations
Authors:
Johannes Berg,
Michael Lässig
Abstract:
An important part of the analysis of bio-molecular networks is to detect different functional units. Different functions are reflected in a different evolutionary dynamics, and hence in different statistical characteristics of network parts. In this sense, the global statistics of a biological network, e.g., its connectivity distribution, provides a background, and local deviations from this background signal functional units. In the computational analysis of biological networks, we thus typically have to discriminate between different statistical models governing different parts of the dataset. The nature of these models depends on the biological question asked. We illustrate this rationale here with three examples: identification of functional parts as highly connected network clusters, finding network motifs, which occur in a similar form at different places in the network, and the analysis of cross-species network correlations, which reflect evolutionary dynamics between species.
Submitted 28 September, 2006;
originally announced September 2006.
-
Cross-species analysis of biological networks by Bayesian alignment
Authors:
Johannes Berg,
Michael Lässig
Abstract:
Complex interactions between genes or proteins contribute a substantial part to phenotypic evolution. Here we develop an evolutionarily grounded method for the cross-species analysis of interaction networks by alignment, which maps bona fide functional relationships between genes in different organisms. Network alignment is based on a scoring function measuring mutual similarities between networks, taking into account their interaction patterns as well as sequence similarities between their nodes. High-scoring alignments and optimal alignment parameters are inferred by a systematic Bayesian analysis. We apply this method to analyze the evolution of co-expression networks between human and mouse. We find evidence for significant conservation of gene expression clusters and give network-based predictions of gene function. We discuss examples where cross-species functional relationships between genes do not concur with sequence similarity.
Submitted 15 August, 2006; v1 submitted 20 April, 2006;
originally announced April 2006.
-
The Freezing of Random RNA
Authors:
Michael Lässig,
Kay Joerg Wiese
Abstract:
We study secondary structures of random RNA molecules by means of a renormalized field theory based on an expansion in the sequence disorder. We show that there is a continuous phase transition from a molten phase at higher temperatures to a low-temperature glass phase. The primary freezing occurs above the critical temperature, with local islands of stable folds forming within the molten phase. The size of these islands defines the correlation length of the transition. Our results include critical exponents at the transition and in the glass phase.
Submitted 18 April, 2006; v1 submitted 16 November, 2005;
originally announced November 2005.
-
Universality of Long-Range Correlations in Expansion-Randomization Systems
Authors:
Philipp W. Messer,
Michael Lassig,
Peter F. Arndt
Abstract:
We study the stochastic dynamics of sequences evolving by single site mutations, segmental duplications, deletions, and random insertions. These processes are relevant for the evolution of genomic DNA. They define a universality class of non-equilibrium 1D expansion-randomization systems with generic stationary long-range correlations in a regime of growing sequence length. We obtain explicitly the two-point correlation function of the sequence composition and the distribution function of the composition bias in sequences of finite length. The characteristic exponent $χ$ of these quantities is determined by the ratio of two effective rates, which are explicitly calculated for several specific sequence evolution dynamics of the universality class. Depending on the value of $χ$, we find two different scaling regimes, which are distinguished by the detectability of the initial composition bias. All analytic results are accurately verified by numerical simulations. We also discuss the non-stationary build-up and decay of correlations, as well as more complex evolutionary scenarios, where the rates of the processes vary in time. Our findings provide a possible example for the emergence of universality in molecular biology.
Submitted 22 September, 2005;
originally announced September 2005.
-
A minimal stochastic model for influenza evolution
Authors:
Francesca Tria,
Michael Laessig,
Luca Peliti,
Silvio Franz
Abstract:
We introduce and discuss a minimal individual-based model for influenza dynamics. The model takes into account the effects of specific immunization against viral strains, but also infectivity randomness and the presence of a short-lived strain transcending immunity recently suggested in the literature. We show by simulations that the resulting model exhibits substitution of viral strains along the years, but that their divergence remains bounded. We also show that dropping any of these features results in a drastically different behavior, leading either to the extinction of the disease, to the proliferation of the viral strains, or to their divergence.
Submitted 18 May, 2005;
originally announced May 2005.
-
Biodiversity in model ecosystems, II: Species assembly and food web structure
Authors:
Ugo Bastolla,
Michael Lässig,
Susanna C. Manrubia,
Angelo Valleriani
Abstract:
This is the second of two papers dedicated to the relationship between population models of competition and biodiversity. Here we consider species assembly models where the population dynamics is kept far from fixed points through the continuous introduction of new species, and generalize to such models the coexistence condition derived for systems at the fixed point. The ecological overlap between species with shared prey, which we define here, provides a quantitative measure of the effective interspecies competition and of the trophic network topology. We obtain distributions of the overlap from simulations of a new model based both on immigration and speciation, and show that they are in good agreement with those measured for three large natural food webs. As discussed in the first paper, rapid environmental fluctuations, interacting with the condition for coexistence of competing species, limit the maximal biodiversity that a trophic level can host. This horizontal limitation to biodiversity is here combined with either dissipation of energy or growth of fluctuations, which in our model limit the length of food webs in the vertical direction. These ingredients yield an effective model of food webs that produces a biodiversity profile with a maximum at an intermediate trophic level, in agreement with field studies.
Submitted 19 February, 2005;
originally announced February 2005.
-
Biodiversity in model ecosystems, I: Coexistence conditions for competing species
Authors:
Ugo Bastolla,
Michael Lässig,
Susanna C. Manrubia,
Angelo Valleriani
Abstract:
This is the first of two papers where we discuss the limits imposed by competition on the biodiversity of species communities. In this first paper we study the coexistence of competing species at the fixed point of population dynamic equations. For many simple models, this imposes a limit on the width of the productivity distribution, which is more severe the more diverse the ecosystem is (Chesson, 1994). Here we review and generalize this analysis, beyond the "mean-field"-like approximation of the competition matrix used in previous works, and extend it to structured food webs. In all cases analysed, we obtain qualitatively similar relations between biodiversity and competition: the narrower the productivity distribution is, the more species can stably coexist. We discuss how this result, considered together with environmental fluctuations, limits the maximal biodiversity that a trophic level can host.
Submitted 19 February, 2005;
originally announced February 2005.
-
A Solvable Sequence Evolution Model and Genomic Correlations
Authors:
Philipp W. Messer,
Peter F. Arndt,
Michael Lässig
Abstract:
We study a minimal model for genome evolution whose elementary processes are single site mutation, duplication and deletion of sequence regions and insertion of random segments. These processes are found to generate long-range correlations in the composition of letters as long as the sequence length is growing, i.e., the combined rates of duplications and insertions are higher than the deletion rate. For constant sequence length, on the other hand, all initial correlations decay exponentially. These results are obtained analytically and by simulations. They are compared with the long-range correlations observed in genomic DNA, and the implications for genome evolution are discussed.
Submitted 9 January, 2005;
originally announced January 2005.
-
Local graph alignment and motif search in biological networks
Authors:
Johannes Berg,
Michael Lässig
Abstract:
Interaction networks are of central importance in post-genomic molecular biology, with increasing amounts of data becoming available by high-throughput methods. Examples are gene regulatory networks or protein interaction maps. The main challenge in the analysis of these data is to read off biological functions from the topology of the network. Topological motifs, i.e., patterns occurring repeatedly at different positions in the network, have recently been identified as basic modules of molecular information processing. In this paper, we discuss motifs derived from families of mutually similar but not necessarily identical patterns. We establish a statistical model for the occurrence of such motifs, from which we derive a scoring function for their statistical significance. Based on this scoring function, we develop a search algorithm for topological motifs called graph alignment, a procedure with some analogies to sequence alignment. The algorithm is applied to the gene regulation network of E. coli.
Submitted 27 November, 2004; v1 submitted 13 August, 2003;
originally announced August 2003.
-
Modes of speciation in heterogeneous space
Authors:
Martin Rost,
Michael Lässig
Abstract:
Modes of speciation have been the subject of a century's debate. Traditionally, most speciations are believed to be caused by spatial separation of populations (allopatry). Recent observations (Meyer 1990, Schliewen 1994, Schliewen 2001, Rico 2002) and models (Maynard Smith 1966, Antonovics 1971, Dickinson 1973, Rosenzweig 1978, Turner 1995, Noest 1997, Geritz 1998, Kondrashov 1999, Dieckmann 1999, Doebeli 2000, Slatkin 1980) show that speciation can also take place in sympatry. We discuss a comprehensive model of coupled differentiation in phenotype, mating, and space, showing that spatial segregation can be an induced process following a sympatric differentiation. This is found to be a generic mechanism of adaptation to heterogeneous environments, for which we propose the term diapatric speciation (Greek). It explains the ubiquitous spatial patching of newly formed species, despite their sympatric origin (Schliewen 1994, Schliewen 2001, Rico 2002).
Submitted 15 July, 2003; v1 submitted 14 July, 2003;
originally announced July 2003.
-
Adaptive evolution of transcription factor binding sites
Authors:
Johannes Berg,
Stana Willmann,
Michael Lässig
Abstract:
The regulation of a gene depends on the binding of transcription factors to specific sites located in the regulatory region of the gene. The generation of these binding sites and of cooperativity between them are essential building blocks in the evolution of complex regulatory networks. We study a theoretical model for the sequence evolution of binding sites by point mutations. The approach is based on biophysical models for the binding of transcription factors to DNA. Hence we derive empirically grounded fitness landscapes, which enter a population genetics model including mutations, genetic drift, and selection. We show that the selection for factor binding generically leads to specific correlations between nucleotide frequencies at different positions of a binding site. We demonstrate the possibility of rapid adaptive evolution generating a new binding site for a given transcription factor by point mutations. The evolutionary time required is estimated in terms of the neutral (background) mutation rate, the selection coefficient, and the effective population size. The efficiency of binding site formation is seen to depend on two joint conditions: the binding site motif must be short enough and the promoter region must be long enough. These constraints on promoter architecture are indeed seen in eukaryotic systems. Furthermore, we analyse the adaptive evolution of genetic switches and of signal integration through binding cooperativity between different sites. Experimental tests of this picture involving the statistics of polymorphisms and phylogenies of sites are discussed.
Submitted 27 November, 2004; v1 submitted 29 January, 2003;
originally announced January 2003.
-
Evolutionary games and quasispecies
Authors:
M. Laessig,
L. Peliti,
F. Tria
Abstract:
We discuss a population of sequences subject to mutations and frequency-dependent selection, where the fitness of a sequence depends on the composition of the entire population. This type of dynamics is crucial to understand the evolution of genomic regulation. Mathematically, it takes the form of a reaction-diffusion problem that is nonlinear in the population state. In our model system, the fitness is determined by a simple mathematical game, the hawk-dove game. The stationary population distribution is found to be a quasispecies with properties different from those which hold in fixed fitness landscapes.
Submitted 10 February, 2003; v1 submitted 4 September, 2002;
originally announced September 2002.
-
Structure and evolution of protein interaction networks: A statistical model for link dynamics and gene duplications
Authors:
Johannes Berg,
Michael Lässig,
Andreas Wagner
Abstract:
The structure of molecular networks derives from dynamical processes on evolutionary time scales. For protein interaction networks, global statistical features of their structure can now be inferred consistently from several large-throughput datasets. Understanding the underlying evolutionary dynamics is crucial for discerning random parts of the network from biologically important properties shaped by natural selection. We present a detailed statistical analysis of the protein interactions in Saccharomyces cerevisiae based on several large-throughput datasets. Protein pairs resulting from gene duplications are used as tracers into the evolutionary past of the network.
From this analysis, we infer rate estimates for two key evolutionary processes shaping the network: (i) gene duplications and (ii) gain and loss of interactions through mutations in existing proteins, which are referred to as link dynamics. Importantly, the link dynamics is asymmetric, i.e., the evolutionary steps are mutations in just one of the binding partners. The link turnover is shown to be much faster than gene duplications. According to this model, the link dynamics is the dominant evolutionary force shaping the statistical structure of the network, while the slower gene duplication dynamics mainly affects its size. Specifically, the model predicts (i) a broad distribution of the connectivities (i.e., the number of binding partners of a protein) and (ii) correlations between the connectivities of interacting proteins.
Submitted 27 November, 2004; v1 submitted 30 July, 2002;
originally announced July 2002.
-
Quantum Game Theory
Authors:
Michael Lassig
Abstract:
A systematic theory is introduced that describes stochastic effects in game theory. In a biological context, such effects are relevant for the evolution of finite populations with frequency-dependent selection. They are characterized by quantum Nash equilibria, a generalization of the well-known Nash equilibrium points in classical game theory. The implications of this theory for biological systems are discussed in detail.
Submitted 6 June, 2002;
originally announced June 2002.
-
Correlated random networks
Authors:
Johannes Berg,
Michael Lässig
Abstract:
We develop a statistical theory of networks. A network is a set of vertices and links given by its adjacency matrix $\mathbf{c}$, and the relevant statistical ensembles are defined in terms of a partition function $Z=\sum_{\mathbf{c}} \exp[-\beta H(\mathbf{c})]$. The simplest cases are uncorrelated random networks such as the well-known Erdős-Rényi graphs. Here we study more general interactions $H(\mathbf{c})$ which lead to correlations, for example, between the connectivities of adjacent vertices. In particular, such correlations occur in optimized networks described by partition functions in the limit $\beta \to \infty$. They are argued to be a crucial signature of evolutionary design in biological networks.
Submitted 20 October, 2002; v1 submitted 28 May, 2002;
originally announced May 2002.
-
The shape of ecological networks
Authors:
Michael Lassig,
Ugo Bastolla,
Susanna C. Manrubia,
Angelo Valleriani
Abstract:
We study the statistics of ecosystems with a variable number of co-evolving species. The species interact in two ways: by prey-predator relationships and by direct competition with similar kinds. The interaction coefficients change slowly through successful adaptations and speciations. We treat them as quenched random variables. These interactions determine long-term topological features of the species network, which are found to agree with those of biological systems.
Submitted 16 January, 2001;
originally announced January 2001.
-
Diversity patterns from ecological models at dynamical equilibrium
Authors:
U. Bastolla,
M. Laessig,
S. Manrubia,
A. Valleriani
Abstract:
We study a dynamic model of ecosystems where immigration plays an essential role both in assembling the species community and in maintaining its biodiversity. This framework is particularly relevant for insular ecosystems. Population dynamics is represented either as an individual-based model or as a set of deterministic equations for population abundances. Local extinctions and immigrations balance in a statistically stationary state where biodiversity fluctuates around a constant mean value. At stationarity, biodiversity increases as a power law of the immigration rate. Our model yields almost power-law species-area relationships, with a range of effective exponents in agreement with that observed for the biodiversity of whole archipelagos. We also observe broad distributions for species abundances and species lifetimes and a small number of trophic levels, limited by the immigration rate. These results are rather robust with respect to a change of description level, as well as a change of population dynamic equations from prey-dependent to ratio-dependent.
Submitted 13 September, 2000;
originally announced September 2000.
-
Delocalization transitions of semi-flexible manifolds
Authors:
Ralf Bundschuh,
Michael Lassig
Abstract:
Semi-flexible manifolds such as fluid membranes or semi-flexible polymers undergo delocalization transitions if they are subject to attractive interactions. We study manifolds with short-ranged interactions by field-theoretic methods based on the operator product expansion of local interaction fields. We apply this approach to manifolds in a random potential. Randomness is always relevant for fluid membranes, while for semi-flexible polymers there is a first order transition to the strong coupling regime at a finite temperature.
Submitted 19 March, 1999; v1 submitted 16 February, 1999;
originally announced February 1999.
-
Optimizing Smith-Waterman alignments
Authors:
Rolf Olsen,
Terence Hwa,
Michael Lassig
Abstract:
Mutual correlation between segments of DNA or protein sequences can be detected by Smith-Waterman local alignments. We present a statistical analysis of alignment of such sequences, based on a recent scaling theory. A new fidelity measure is introduced and shown to capture the significance of the local alignment, i.e., the extent to which the correlated subsequences are correctly identified. It is demonstrated how the fidelity may be optimized in the space of penalty parameters using only the alignment score data of a single sequence pair.
Submitted 16 November, 1998;
originally announced November 1998.
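For orientation, the Smith-Waterman local alignment score mentioned in this abstract can be sketched as follows. This is a minimal illustration and not the authors' code: the function name `smith_waterman_score`, the linear gap penalty, and the default scoring values are assumptions chosen for the example, and the fidelity measure introduced in the paper is not implemented here.

```python
def smith_waterman_score(a, b, match=1, mismatch=-1, gap=-2):
    """Best local alignment score of strings a and b.

    Hypothetical helper for illustration: uses a linear gap penalty and
    keeps only two rows of the dynamic-programming table.
    """
    prev = [0] * (len(b) + 1)  # scores for the previous row of the table
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            curr[j] = max(
                0,                   # start a new local alignment here
                prev[j - 1] + pair,  # align a[i-1] with b[j-1]
                prev[j] + gap,       # gap in b
                curr[j - 1] + gap,   # gap in a
            )
            best = max(best, curr[j])
        prev = curr
    return best


if __name__ == "__main__":
    print(smith_waterman_score("TTACGTTT", "GGACGGG"))
```

The penalty parameters `mismatch` and `gap` are the kind of quantities the abstract refers to as the space of penalty parameters; the values used above are arbitrary.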
-
Dynamical Anomalies and Intermittency in Burgers Turbulence
Authors:
M. Lassig
Abstract:
We analyze the field theory of fully developed Burgers turbulence. Its key elements are shock fields, which characterize the singularity statistics of the velocity field. The shock fields enter an operator product expansion describing intermittency. The latter is found to be constrained by dynamical anomalies expressing finite dissipation in the inviscid limit. The link between dynamical anomalies and intermittency is argued to be important in a wider context of turbulence.
Submitted 6 March, 2000; v1 submitted 16 November, 1998;
originally announced November 1998.
-
On Growth, Disorder, and Field Theory
Authors:
Michael Lassig
Abstract:
This article reviews recent developments in statistical field theory far from equilibrium. It focuses on the Kardar-Parisi-Zhang equation of stochastic surface growth and its mathematical relatives, namely the stochastic Burgers equation in fluid mechanics and directed polymers in a medium with quenched disorder. At strong stochastic driving -- or at strong disorder, respectively -- these systems develop nonperturbative scale-invariance. Presumably exact values of the scaling exponents follow from a self-consistent asymptotic theory. This theory is based on the concept of an operator product expansion formed by the local scaling fields. The key difference to standard Lagrangian field theory is the appearance of a dangerous irrelevant coupling constant generating dynamical anomalies in the continuum limit.
Submitted 16 November, 1998; v1 submitted 26 June, 1998;
originally announced June 1998.
-
Scaling Laws and Similarity Detection in Sequence Alignment with Gaps
Authors:
Dirk Drasdo,
Terence Hwa,
Michael Lassig
Abstract:
We study the problem of similarity detection by sequence alignment with gaps, using a recently established theoretical framework based on the morphology of alignment paths. Alignments of sequences without mutual correlations are found to have scale-invariant statistics. This is the basis for a scaling theory of alignments of correlated sequences. Using a simple Markov model of evolution, we generate sequences with well-defined mutual correlations and quantify the fidelity of an alignment in an unambiguous way. The scaling theory predicts the dependence of the fidelity on the alignment parameters and on the statistical evolution parameters characterizing the sequence correlations. Specific criteria for the optimal choice of alignment parameters emerge from this theory. The results are verified by extensive numerical simulations.
Submitted 11 February, 1998;
originally announced February 1998.
-
Optimal Detection of Sequence Similarity by Local Alignment
Authors:
Terence Hwa,
Michael Lassig
Abstract:
The statistical properties of local alignment algorithms with gaps are analyzed theoretically for uncorrelated and correlated DNA sequences. In the vicinity of the log-linear phase transition, the statistics of alignment with gaps is shown to be characteristically different from that of gapless alignment. The optimal scores obtained for uncorrelated sequences obey certain robust scaling laws. Deviation from these scaling laws signals sequence homology, and can be used to guide the empirical selection of scoring parameters for the optimal detection of sequence similarities. This can be accomplished in a computationally efficient way by using a novel approach focusing on the score landscape. Furthermore, by assuming a few gross features characterizing the statistics of underlying sequence-sequence correlations, quantitative criteria are obtained for the choice of optimal scoring parameters: Optimal similarity detection is most likely to occur in a region close to the log side of the log-linear phase transition.
Submitted 12 February, 1998; v1 submitted 6 December, 1997;
originally announced December 1997.
-
Quantized Scaling of Growing Surfaces
Authors:
Michael Lassig
Abstract:
The Kardar-Parisi-Zhang universality class of stochastic surface growth is studied by exact field-theoretic methods. From previous numerical results, a few qualitative assumptions are inferred. In particular, height correlations should satisfy an operator product expansion and, unlike the correlations in a turbulent fluid, exhibit no multiscaling. These properties impose a quantization condition on the roughness exponent $χ$ and the dynamic exponent $z$. Hence the exact values $χ= 2/5, z = 8/5$ for two-dimensional and $χ= 2/7, z = 12/7$ for three-dimensional surfaces are derived.
Submitted 5 November, 1997;
originally announced November 1997.
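As a quick consistency check (the relation below is not stated in the abstract; it is the standard Kardar-Parisi-Zhang scaling relation that follows from Galilean invariance), the quoted exponent pairs both satisfy $χ + z = 2$:

```latex
\[
  \chi + z = 2:
  \qquad \tfrac{2}{5} + \tfrac{8}{5} = 2 \quad \text{(two-dimensional surfaces)},
  \qquad \tfrac{2}{7} + \tfrac{12}{7} = 2 \quad \text{(three-dimensional surfaces)}.
\]
```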
-
The Upper Critical Dimension of the KPZ Equation
Authors:
Michael Lassig,
Harald Kinzelbach
Abstract:
The strong-coupling regime of Kardar-Parisi-Zhang surface growth driven by short-ranged noise has an upper critical dimension $d_>$ less than or equal to four (where the dynamic exponent $z$ takes the value $z(d_>) = 2$). To derive this, we use the mapping onto directed polymers with quenched disorder. Two such polymers coupled by a small contact attraction are shown to form a bound state at all temperatures below the roughening temperature of a single polymer. Comparing the singularities of the localization length at and below the critical temperature yields $d_> \leq 4$.
Submitted 22 August, 1996;
originally announced August 1996.
-
Vicinal Surfaces and the Calogero-Sutherland Model
Authors:
Michael Lassig
Abstract:
A miscut (vicinal) crystal surface can be regarded as an array of meandering but non-crossing steps. Interactions between the steps are shown to induce a faceting transition of the surface between a homogeneous Luttinger liquid state and a low-temperature regime consisting of local step clusters in coexistence with ideal facets. This morphological transition is governed by a hitherto neglected critical line of the well-known Calogero-Sutherland model. Its exact solution yields expressions for measurable quantities that compare favorably with recent experiments on Si surfaces.
Submitted 21 February, 1996;
originally announced February 1996.
-
Directed polymers in high dimensions
Authors:
Ralf Bundschuh,
Michael Lassig
Abstract:
We study directed polymers subject to a quenched random potential in $d$ transversal dimensions. This system is closely related to the Kardar-Parisi-Zhang equation of nonlinear stochastic growth. By a careful analysis of the perturbation theory we show that physical quantities develop singular behavior as $d \to 4$. For example, the universal finite-size amplitude of the free energy at the roughening transition is proportional to $(4-d)^{1/2}$. This shows that the dimension $d=4$ plays a special role for this system and points towards $d=4$ as the upper critical dimension of the Kardar-Parisi-Zhang problem.
Submitted 8 February, 1996;
originally announced February 1996.
-
Similarity-Detection and Localization
Authors:
Terence Hwa,
Michael Lassig
Abstract:
The detection of similarities between long DNA and protein sequences is studied using concepts of statistical physics. It is shown that mutual similarities can be detected by sequence alignment methods only if their amount exceeds a threshold value. The onset of detection is a continuous phase transition which can be viewed as a localization-delocalization transition. The "fidelity" of the alignment is the order parameter of that transition; it leads to criteria for the selection of optimal alignment parameters.
Submitted 14 November, 1995;
originally announced November 1995.