-
A biological circuit to anticipate trend
Authors:
Steven A. Frank
Abstract:
Organisms gain by anticipating future changes in the environment. Those environmental changes often follow stochastic trends. The greater the slope of the trend, the more likely the trend's momentum carries the future trend in the same direction. This article presents a simple biological circuit that measures the momentum, providing a prediction about future trend. The circuit calculates the momen…
▽ More
Organisms gain by anticipating future changes in the environment. Those environmental changes often follow stochastic trends. The greater the slope of the trend, the more likely the trend's momentum carries the future trend in the same direction. This article presents a simple biological circuit that measures the momentum, providing a prediction about future trend. The circuit calculates the momentum by the difference between a short-term and a long-term exponential moving average. The time lengths of the two moving averages can be adjusted by changing the decay rates of state variables. Different time lengths for those averages trade off between errors caused by noise and errors caused by lags in predicting a change in the direction of the trend. Prior studies have emphasized circuits that make similar calculations about trends. However, those prior studies embedded their analyses in the details of particular applications, obscuring the simple generality and wide applicability of the approach. The model here contributes to the topic by clarifying the great simplicity and generality of anticipation for stochastic trends. This article also notes that, in financial analysis, the difference between moving averages is widely used to predict future trends in asset prices. The financial measure is called the moving average convergence-divergence (MACD) indicator. Connecting the biological problem to financial analysis opens the way for future studies in biology to exploit the variety of highly developed trend models in finance.
△ Less
Submitted 13 April, 2024;
originally announced April 2024.
-
Disease from opposing forces in regulatory control
Authors:
Steven A. Frank
Abstract:
Danger requires a strong rapid response. Speedy triggers are prone to false signals. False alarms can be costly, requiring strong negative regulators to oppose the initial triggers. Strongly opposed forces can easily be perturbed, leading to imbalance and disease. For example, immunity and fear response balance strong rapid triggers against widespread slow negative regulators. Diseases of immunity…
▽ More
Danger requires a strong rapid response. Speedy triggers are prone to false signals. False alarms can be costly, requiring strong negative regulators to oppose the initial triggers. Strongly opposed forces can easily be perturbed, leading to imbalance and disease. For example, immunity and fear response balance strong rapid triggers against widespread slow negative regulators. Diseases of immunity and behavior arise from imbalance. A different opposition of forces occurs in mammalian growth, which balances strong paternally expressed accelerators against maternally expressed suppressors. Diseases of overgrowth or undergrowth arise from imbalance. Other examples of opposing forces and disease include control of dopamine expression and male versus female favored traits.
△ Less
Submitted 30 September, 2023; v1 submitted 23 July, 2023;
originally announced July 2023.
-
Robustness and complexity
Authors:
Steven A. Frank
Abstract:
When a biological system robustly corrects component-level errors, the direct pressure on component performance declines. Components may become less reliable, maintain more genetic variability, or drift neutrally in design, creating the basis for new forms of organismal complexity. This article links the protection-decay dynamic to other aspects of robust and complex systems. Examples include the…
▽ More
When a biological system robustly corrects component-level errors, the direct pressure on component performance declines. Components may become less reliable, maintain more genetic variability, or drift neutrally in design, creating the basis for new forms of organismal complexity. This article links the protection-decay dynamic to other aspects of robust and complex systems. Examples include the hourglass pattern of biological development and Doyle's hourglass architecture for robustly complex systems in engineering. The deeply and densely connected wiring architecture in biology's cellular controls and in machine learning's computational neural networks provide another link. By unifying these seemingly different aspects into a unified framework, we gain a new perspective on robust and complex systems.
△ Less
Submitted 18 April, 2023;
originally announced April 2023.
-
Automatic differentiation and the optimization of differential equation models in biology
Authors:
Steven A. Frank
Abstract:
A computational revolution unleashed the power of artificial neural networks. At the heart of that revolution is automatic differentiation, which calculates the derivative of a performance measure relative to a large number of parameters. Differentiation enhances the discovery of improved performance in large models, an achievement that was previously difficult or impossible. Recently, a second co…
▽ More
A computational revolution unleashed the power of artificial neural networks. At the heart of that revolution is automatic differentiation, which calculates the derivative of a performance measure relative to a large number of parameters. Differentiation enhances the discovery of improved performance in large models, an achievement that was previously difficult or impossible. Recently, a second computational advance optimizes the temporal trajectories traced by differential equations. Optimization requires differentiating a measure of performance over a trajectory, such as the closeness of tracking the environment, with respect to the parameters of the differential equations. Because model trajectories are usually calculated numerically by multistep algorithms, such as Runge-Kutta, the automatic differentiation must be passed through the numerical algorithm. This article explains how such automatic differentiation of trajectories is achieved. It also discusses why such computational breakthroughs are likely to advance theoretical and statistical studies of biological problems, in which one can consider variables as dynamic paths over time and space. Many common problems arise between improving success in computational learning models over performance landscapes, improving evolutionary fitness over adaptive landscapes, and improving statistical fits to data over information landscapes.
△ Less
Submitted 11 October, 2022; v1 submitted 10 July, 2022;
originally announced July 2022.
-
Optimizing differential equations to fit data and predict outcomes
Authors:
Steven A. Frank
Abstract:
Many scientific problems focus on observed patterns of change or on how to design a system to achieve particular dynamics. Those problems often require fitting differential equation models to target trajectories. Fitting such models can be difficult because each evaluation of the fit must calculate the distance between the model and target patterns at numerous points along a trajectory. The gradie…
▽ More
Many scientific problems focus on observed patterns of change or on how to design a system to achieve particular dynamics. Those problems often require fitting differential equation models to target trajectories. Fitting such models can be difficult because each evaluation of the fit must calculate the distance between the model and target patterns at numerous points along a trajectory. The gradient of the fit with respect to the model parameters can be challenging. Recent technical advances in automatic differentiation through numerical differential equation solvers potentially change the fitting process into a relatively easy problem, opening up new possibilities to study dynamics. However, application of the new tools to real data may fail to achieve a good fit. This article illustrates how to overcome a variety of common challenges, using the classic ecological data for oscillations in hare and lynx populations. Models include simple ordinary differential equations (ODEs) and neural ordinary differential equations (NODEs), which use artificial neural networks to estimate the derivatives of differential equation systems. Comparing the fits obtained with ODEs versus NODEs, representing small and large parameter spaces, and changing the number of variable dimensions provide insight into the geometry of the observed and model trajectories. To analyze the quality of the models for predicting future observations, a Bayesian-inspired preconditioned stochastic gradient Langevin dynamics (pSGLD) calculation of the posterior distribution of predicted model trajectories clarifies the tendency for various models to underfit or overfit the data. Coupling fitted differential equation systems with pSGLD sampling provides a powerful way to study the properties of optimization surfaces, raising an analogy with mutation-selection dynamics on fitness landscapes.
△ Less
Submitted 16 April, 2022;
originally announced April 2022.
-
The fundamental equations of change in statistical ensembles and biological populations
Authors:
Steven A. Frank,
Frank J. Bruggeman
Abstract:
A recent article in Nature Physics unified key results from thermodynamics, statistics, and information theory. The unification arose from a general equation for the rate of change in the information content of a system. The general equation describes the change in the moments of an observable quantity over a probability distribution. One term in the equation describes the change in the probabilit…
▽ More
A recent article in Nature Physics unified key results from thermodynamics, statistics, and information theory. The unification arose from a general equation for the rate of change in the information content of a system. The general equation describes the change in the moments of an observable quantity over a probability distribution. One term in the equation describes the change in the probability distribution. The other term describes the change in the observable values for a given state. We show the equivalence of this general equation for moment dynamics with the widely known Price equation from evolutionary theory, named after George Price. We introduce the Price equation from its biological roots, review a mathematically abstract form of the equation, and discuss the potential for this equation to unify diverse mathematical theories from different disciplines. The new work in Nature Physics and many applications in biology show that this equation also provides the basis for deriving many novel theoretical results within each discipline.
△ Less
Submitted 27 October, 2020;
originally announced October 2020.
-
A Graph Neural Network Framework for Causal Inference in Brain Networks
Authors:
Simon Wein,
Wilhelm Malloni,
Ana Maria Tomé,
Sebastian M. Frank,
Gina-Isabelle Henze,
Stefan Wüst,
Mark W. Greenlee,
Elmar W. Lang
Abstract:
A central question in neuroscience is how self-organizing dynamic interactions in the brain emerge on their relatively static structural backbone. Due to the complexity of spatial and temporal dependencies between different brain areas, fully comprehending the interplay between structure and function is still challenging and an area of intense research. In this paper we present a graph neural netw…
▽ More
A central question in neuroscience is how self-organizing dynamic interactions in the brain emerge on their relatively static structural backbone. Due to the complexity of spatial and temporal dependencies between different brain areas, fully comprehending the interplay between structure and function is still challenging and an area of intense research. In this paper we present a graph neural network (GNN) framework, to describe functional interactions based on the structural anatomical layout. A GNN allows us to process graph-structured spatio-temporal signals, providing a possibility to combine structural information derived from diffusion tensor imaging (DTI) with temporal neural activity profiles, like observed in functional magnetic resonance imaging (fMRI). Moreover, dynamic interactions between different brain regions learned by this data-driven approach can provide a multi-modal measure of causal connectivity strength. We assess the proposed model's accuracy by evaluating its capabilities to replicate empirically observed neural activation profiles, and compare the performance to those of a vector auto regression (VAR), like typically used in Granger causality. We show that GNNs are able to capture long-term dependencies in data and also computationally scale up to the analysis of large-scale networks. Finally we confirm that features learned by a GNN can generalize across MRI scanner types and acquisition protocols, by demonstrating that the performance on small datasets can be improved by pre-training the GNN on data from an earlier and different study. We conclude that the proposed multi-modal GNN framework can provide a novel perspective on the structure-function relationship in the brain. Therewith this approach can be promising for the characterization of the information flow in brain networks.
△ Less
Submitted 14 October, 2020;
originally announced October 2020.
-
Recording and manipulation of vagus nerve electrical activity in chronically instrumented unanesthetized near term fetal sheep
Authors:
Aude Castel,
Patrick M. Burns,
Javier Benito,
Hai L. Liu,
Shikha Kuthiala,
Lucien D. Durosier,
Yael S. Frank,
Mingju Cao,
Marilène Paquet,
Gilles Fecteau,
André Desrochers,
Martin G. Frasch
Abstract:
Background: The chronically instrumented pregnant sheep has been used as a model of human fetal development and responses to pathophysiologic stimuli. This is due to the unique amenability of the unanesthetized fetal sheep to the surgical placement and maintenance of catheters and electrodes, allowing repetitive blood sampling, substance injection, recording of bioelectrical activity, application…
▽ More
Background: The chronically instrumented pregnant sheep has been used as a model of human fetal development and responses to pathophysiologic stimuli. This is due to the unique amenability of the unanesthetized fetal sheep to the surgical placement and maintenance of catheters and electrodes, allowing repetitive blood sampling, substance injection, recording of bioelectrical activity, application of electric stimulation and in vivo organ imaging. Recently, there has been growing interest in pleiotropic effects of vagus nerve stimulation (VNS) on various organ systems such as innate immunity, metabolism, and appetite control. There is no approach to study this in utero and corresponding physiological understanding is scarce. New Method: Based on our previous presentation of a stable chronically instrumented unanesthetized fetal sheep model, here we describe the surgical instrumentation procedure allowing successful implantation of a cervical uni- or bilateral VNS probe with or without vagotomy. Results: In a cohort of 53 animals, we present the changes in blood gas, metabolic, and inflammatory markers during the postoperative period. We detail the design of a VNS probe which also allows recording from the nerve. We also present an example of vagus electroneurogram (VENG) recorded from the VNS probe and an analytical approach to the data. Comparison with Existing Methods: This method represents the first implementation of VENG/VNS in a large pregnant mammalian organism. Conclusions: This study describes a new surgical procedure allowing to record and manipulate chronically the vagus nerve activity in an animal model of human pregnancy.
△ Less
Submitted 19 August, 2020;
originally announced August 2020.
-
Metabolic heat in microbial conflict and cooperation
Authors:
Steven A. Frank
Abstract:
Many microbes live in habitats below their optimum temperature. Retention of metabolic heat by aggregation or insulation would boost growth. Generation of excess metabolic heat may also provide benefit. A cell that makes excess metabolic heat pays the cost of production, whereas the benefit may be shared by neighbors within a zone of local heat capture. Metabolic heat as a shareable public good ra…
▽ More
Many microbes live in habitats below their optimum temperature. Retention of metabolic heat by aggregation or insulation would boost growth. Generation of excess metabolic heat may also provide benefit. A cell that makes excess metabolic heat pays the cost of production, whereas the benefit may be shared by neighbors within a zone of local heat capture. Metabolic heat as a shareable public good raises interesting questions about conflict and cooperation of heat production and capture. Metabolic heat may also be deployed as a weapon. Species with greater thermotolerance gain by raising local temperature to outcompete less thermotolerant taxa. Metabolic heat may provide defense against bacteriophage attack, by analogy with fever in vertebrates. This article outlines the theory of metabolic heat in microbial conflict and cooperation, presenting several predictions for future study.
△ Less
Submitted 30 July, 2020; v1 submitted 10 May, 2020;
originally announced May 2020.
-
The generalized Price equation: forces that change population statistics
Authors:
Steven A. Frank,
William Godsoe
Abstract:
The Price equation partitions the change in the expected value of a population measure. The first component describes the partial change caused by altered frequencies. The second component describes the partial change caused by altered measurements. In biology, frequency changes often associate with the direct effect of natural selection. Measure changes reflect processes during transmission that…
▽ More
The Price equation partitions the change in the expected value of a population measure. The first component describes the partial change caused by altered frequencies. The second component describes the partial change caused by altered measurements. In biology, frequency changes often associate with the direct effect of natural selection. Measure changes reflect processes during transmission that alter trait values. More broadly, the two components describe the direct forces that change population composition and the altered frame of reference that changes measured values. The classic Price equation is limited to population statistics that can expressed as the expected value of a measure. Many statistics cannot be expressed as expected values, such as the harmonic mean and the family of rescaled diversity measures. We generalize the Price equation to any population statistic that can be expressed as a function of frequencies and measurements. We obtain the generalized partition between the direct forces that cause frequency change and the altered frame of reference that changes measurements.
△ Less
Submitted 19 March, 2020;
originally announced March 2020.
-
The neonatal sepsis is diminished by cervical vagus nerve stimulation and tracked non-invasively by ECG: a preliminary report in the piglet model
Authors:
Aude Castel,
Patrick Burns,
Colin Wakefield,
Keven. J. Jean,
Yael S. Frank,
Mingju Cao,
Andre Desrochers,
Gilles Fecteau,
Christophe Faure,
Christophe L. Herry,
Martin G. Frasch
Abstract:
An electrocardiogram (ECG)-derived heart rate variability (HRV) index reliably tracks the inflammatory response induced by low-dose lipopolysaccharide (LPS) in near-term sheep fetuses. We evaluated the effect of vagus nerve stimulation (VNS) on vagus nerve electroneurogram (VENG) and the systemic inflammatory response induced by a high dose of LPS in neonatal piglets to mimic late-onset neonatal s…
▽ More
An electrocardiogram (ECG)-derived heart rate variability (HRV) index reliably tracks the inflammatory response induced by low-dose lipopolysaccharide (LPS) in near-term sheep fetuses. We evaluated the effect of vagus nerve stimulation (VNS) on vagus nerve electroneurogram (VENG) and the systemic inflammatory response induced by a high dose of LPS in neonatal piglets to mimic late-onset neonatal sepsis. We tested if our HRV inflammatory index tracks inflammation in piglets and its relationship to VENG. Following anesthesia, electrodes were attached to the left vagal nerve; ECG and blood pressure (BP) were recorded throughout the experiment. Following baseline, the piglets were administered LPS as 2mg/kg IV bolus. In the VNS treated piglet, the vagus nerve was stimulated for 10 minutes prior to and 10 min after the injection of LPS. In both groups, every 15 min post LPS, the arterial blood sample was drawn for blood gas, metabolites, and inflammatory cytokines. At the end of the experiment, the piglets were euthanized. BP and HRV measures were calculated. The piglets developed a potent inflammatory response to the LPS injection with TNF-alpha, IL-1beta, IL-6 and IL-8 peaking between 45 and 90 min post-injection. VNS diminished the LPS-induced systemic inflammatory response varying across the measured cytokines from two to ten-fold. The HRV index tracked accurately the temporal profile of cytokines and VENG changes. This novel model allows manipulating and tracking neonatal sepsis: The HRV inflammatory index 1) applies across species pre- and postnatally and 2) performs well at different degrees of sepsis (i.e., nanogram and milligram doses of LPS); 3) the present VNS paradigm effectively suppresses LPS-induced inflammation, even at high doses of LPS. The potential of early postnatal VNS to counteract sepsis and of HRV monitoring to early detect and track it deserve further study.
△ Less
Submitted 20 December, 2022; v1 submitted 10 February, 2020;
originally announced February 2020.
-
Invariance in ecological pattern
Authors:
Steven A. Frank,
Jordi Bascompte
Abstract:
The abundance of different species in a community often follows the log series distribution. Other ecological patterns also have simple forms. Why does the complexity and variability of ecological systems reduce to such simplicity? Common answers include maximum entropy, neutrality, and convergent outcome from different underlying biological processes. This article proposes a more general answer b…
▽ More
The abundance of different species in a community often follows the log series distribution. Other ecological patterns also have simple forms. Why does the complexity and variability of ecological systems reduce to such simplicity? Common answers include maximum entropy, neutrality, and convergent outcome from different underlying biological processes. This article proposes a more general answer based on the concept of invariance, the property by which a pattern remains the same after transformation. Invariance has a long tradition in physics. For example, general relativity emphasizes the need for the equations describing the laws of physics to have the same form in all frames of reference. By bringing this unifying invariance approach into ecology, we show that the log series pattern dominates when the consequences of processes acting on abundance are invariant to the addition or multiplication of abundance by a constant. The lognormal pattern dominates when the processes acting on net species growth rate obey rotational invariance (symmetry) with respect to the summing up of the individual component processes. Recognizing how these invariances connect pattern to process leads to a synthesis of previous approaches. First, invariance provides a simpler and more fundamental maximum entropy derivation of the log series distribution. Second, invariance provides a simple derivation of the key result from neutral theory: the log series at the metacommunity scale and a clearer form of the skewed lognormal at the local community scale. The invariance expressions are easy to understand because they uniquely describe the basic underlying components that shape pattern.
△ Less
Submitted 20 August, 2019; v1 submitted 17 June, 2019;
originally announced June 2019.
-
Simple unity among the fundamental equations of science
Authors:
Steven A. Frank
Abstract:
The Price equation describes the change in populations. Change concerns some value, such as biological fitness, information or physical work. The Price equation reveals universal aspects for the nature of change, independently of the meaning ascribed to values. By understanding those universal aspects, we can see more clearly why fundamental mathematical results in different disciplines often shar…
▽ More
The Price equation describes the change in populations. Change concerns some value, such as biological fitness, information or physical work. The Price equation reveals universal aspects for the nature of change, independently of the meaning ascribed to values. By understanding those universal aspects, we can see more clearly why fundamental mathematical results in different disciplines often share a common form. We can also interpret more clearly the meaning of key results within each discipline. For example, the mathematics of natural selection in biology has a form closely related to information theory and physical entropy. Does that mean that natural selection is about information or entropy? Or do natural selection, information and entropy arise as interpretations of a common underlying abstraction? The Price equation suggests the latter. The Price equation achieves its abstract generality by partitioning change into two terms. The first term naturally associates with the direct forces that cause change. The second term naturally associates with the changing frame of reference. In the Price equation's canonical form, total change remains zero because the conservation of total probability requires that all probabilities invariantly sum to one. Much of the shared common form for the mathematics of different disciplines may arise from that seemingly trivial invariance of total probability, which leads to the partitioning of total change into equal and opposite components of the direct forces and the changing frame of reference.
△ Less
Submitted 4 August, 2019; v1 submitted 29 March, 2019;
originally announced April 2019.
-
The common patterns of abundance: the log series and Zipf's law
Authors:
Steven A. Frank
Abstract:
In a language corpus, the probability that a word occurs $n$ times is often proportional to $1/n^2$. Assigning rank, $s$, to words according to their abundance, $\log s$ vs $\log n$ typically has a slope of minus one. That simple Zipf's law pattern also arises in the population sizes of cities, the sizes of corporations, and other patterns of abundance. By contrast, for the abundances of different…
▽ More
In a language corpus, the probability that a word occurs $n$ times is often proportional to $1/n^2$. Assigning rank, $s$, to words according to their abundance, $\log s$ vs $\log n$ typically has a slope of minus one. That simple Zipf's law pattern also arises in the population sizes of cities, the sizes of corporations, and other patterns of abundance. By contrast, for the abundances of different biological species, the probability of a population of size $n$ is typically proportional to $1/n$, declining exponentially for larger $n$, the log series pattern. This article shows that the differing patterns of Zipf's law and the log series arise as the opposing endpoints of a more general theory. The general theory follows from the generic form of all probability patterns as a consequence of conserved average values and the associated invariances of scale. To understand the common patterns of abundance, the generic form of probability distributions plus the conserved average abundance is sufficient. The general theory includes cases that are between the Zipf and log series endpoints, providing a broad framework for analyzing widely observed abundance patterns.
△ Less
Submitted 12 January, 2019; v1 submitted 23 December, 2018;
originally announced December 2018.
-
The Price equation program: simple invariances unify population dynamics, thermodynamics, probability, information and inference
Authors:
Steven A. Frank
Abstract:
The fundamental equations of various disciplines often seem to share the same basic structure. Natural selection increases information in the same way that Bayesian updating increases information. Thermodynamics and the forms of common probability distributions express maximum increase in entropy, which appears mathematically as loss of information. Physical mechanics follows paths of change that…
▽ More
The fundamental equations of various disciplines often seem to share the same basic structure. Natural selection increases information in the same way that Bayesian updating increases information. Thermodynamics and the forms of common probability distributions express maximum increase in entropy, which appears mathematically as loss of information. Physical mechanics follows paths of change that maximize Fisher information. The information expressions typically have analogous interpretations as the Newtonian balance between force and acceleration, representing a partition between direct causes of change and opposing changes in the frame of reference. This web of vague analogies hints at a deeper common mathematical structure. I suggest that the Price equation expresses that underlying universal structure. The abstract Price equation describes dynamics as the change between two sets. One component of dynamics expresses the change in the frequency of things, holding constant the values associated with things. The other component of dynamics expresses the change in the values of things, holding constant the frequency of things. The separation of frequency from value generalizes Shannon's separation of the frequency of symbols from the meaning of symbols in information theory. The Price equation's generalized separation of frequency and value reveals a few simple invariances that define universal geometric aspects of change. For example, the conservation of total frequency, although a trivial invariance by itself, creates a powerful constraint on the geometry of change. That constraint plus a few others seem to explain the common structural forms of the equations in different disciplines. From that abstract perspective, interpretations such as selection, information, entropy, force, acceleration, and physical work arise from the same underlying geometry expressed by the Price equation.
△ Less
Submitted 14 December, 2018; v1 submitted 22 October, 2018;
originally announced October 2018.
-
Lexical representation explains cortical entrainment during speech comprehension
Authors:
Stefan Frank,
**biao Yang
Abstract:
Results from a recent neuroimaging study on spoken sentence comprehension have been interpreted as evidence for cortical entrainment to hierarchical syntactic structure. We present a simple computational model that predicts the power spectra from this study, even though the model's linguistic knowledge is restricted to the lexical level, and word-level representations are not combined into higher-…
▽ More
Results from a recent neuroimaging study on spoken sentence comprehension have been interpreted as evidence for cortical entrainment to hierarchical syntactic structure. We present a simple computational model that predicts the power spectra from this study, even though the model's linguistic knowledge is restricted to the lexical level, and word-level representations are not combined into higher-level units (phrases or sentences). Hence, the cortical entrainment results can also be explained from the lexical properties of the stimuli, without recourse to hierarchical syntax.
△ Less
Submitted 10 January, 2018; v1 submitted 18 June, 2017;
originally announced June 2017.
-
Receptor uptake arrays for vitamin B12, siderophores and glycans shape bacterial communities
Authors:
Steven A. Frank
Abstract:
Molecular variants of vitamin B12, siderophores and glycans occur. To take up variant forms, bacteria may express an array of receptors. The gut microbe Bacteroides thetaiotaomicron has three different receptors to take up variants of vitamin B12 and 88 receptors to take up various glycans. The design of receptor arrays reflects key processes that shape cellular evolution. Competition may focus ea…
▽ More
Molecular variants of vitamin B12, siderophores and glycans occur. To take up variant forms, bacteria may express an array of receptors. The gut microbe Bacteroides thetaiotaomicron has three different receptors to take up variants of vitamin B12 and 88 receptors to take up various glycans. The design of receptor arrays reflects key processes that shape cellular evolution. Competition may focus each species on a subset of the available nutrient diversity. Some gut bacteria can take up only a narrow range of carbohydrates, whereas species such as B.~thetaiotaomicron can digest many different complex glycans. Comparison of different nutrients, habitats, and genomes provide opportunity to test hypotheses about the breadth of receptor arrays. Another important process concerns fluctuations in nutrient availability. Such fluctuations enhance the value of cellular sensors, which gain information about environmental availability and adjust receptor deployment. Bacteria often adjust receptor expression in response to fluctuations of particular carbohydrate food sources. Some species may adjust expression of uptake receptors for specific siderophores. How do cells use sensor information to control the response to fluctuations? That question about regulatory wiring relates to problems that arise in control theory and artificial intelligence. Control theory clarifies how to analyze environmental fluctuations in relation to the design of sensors and response systems. Recent advances in deep learning studies of artificial intelligence focus on the architecture of regulatory wiring and the ways in which complex control networks represent and classify environmental states. I emphasize the similar design problems that arise in cellular evolution, control theory, and artificial intelligence. I connect those broad concepts to testable hypotheses for bacterial uptake of B12, siderophores and glycans.
△ Less
Submitted 21 August, 2017; v1 submitted 31 May, 2017;
originally announced June 2017.
-
Puzzles in modern biology. IV. Neurodegeneration, localized origin and widespread decay
Authors:
Steven A. Frank
Abstract:
The motor neuron disease amyotrophic lateral sclerosis (ALS) typically begins with localized muscle weakness. Progressive, widespread paralysis often follows over a few years. Does the disease begin with local changes in a small piece of neural tissue and then spread? Or does neural decay happen independently across diverse spatial locations? The distinction matters, because local initiation may a…
▽ More
The motor neuron disease amyotrophic lateral sclerosis (ALS) typically begins with localized muscle weakness. Progressive, widespread paralysis often follows over a few years. Does the disease begin with local changes in a small piece of neural tissue and then spread? Or does neural decay happen independently across diverse spatial locations? The distinction matters, because local initiation may arise by local changes in a tissue microenvironment, by somatic mutation, or by various epigenetic or regulatory fluctuations in a few cells. A local trigger must be coupled with a mechanism for spread. By contrast, independent decay across spatial locations cannot begin by a local change, but must depend on some global predisposition or spatially distributed change that leads to approximately synchronous decay. This article outlines the conceptual frame by which one contrasts local triggers and spread versus parallel spatially distributed decay. Various neurodegenerative diseases differ in their mechanistic details, but all can usefully be understood as falling along a continuum of interacting local and global processes. Cancer provides an example of disease progression by local triggers and spatial spread, setting a conceptual basis for clarifying puzzles in neurodegeneration. Heart disease also has crucial interactions between global processes, such as circulating lipid levels, and local processes in the development of atherosclerotic plaques. The distinction between local and global processes helps to understand these various age-related diseases.
△ Less
Submitted 19 October, 2016;
originally announced October 2016.
-
Puzzles in modern biology. III. Two kinds of causality in age-related disease
Authors:
Steven A. Frank
Abstract:
The two primary causal dimensions of age-related disease are rate and function. Change in rate of disease development shifts the age of onset. Change in physiological function provides necessary steps in disease progression. A causal factor may alter the rate of physiological change, but that causal factor itself may have no direct physiological role. Alternatively, a causal factor may provide a n…
▽ More
The two primary causal dimensions of age-related disease are rate and function. Change in rate of disease development shifts the age of onset. Change in physiological function provides necessary steps in disease progression. A causal factor may alter the rate of physiological change, but that causal factor itself may have no direct physiological role. Alternatively, a causal factor may provide a necessary physiological function, but that causal factor itself may not alter the rate of disease onset. The rate-function duality provides the basis for solving puzzles of age-related disease. Causal factors of cancer illustrate the duality between rate processes of discovery, such as somatic mutation, and necessary physiological functions, such as invasive penetration across tissue barriers. Examples from cancer suggest general principles of age-related disease.
△ Less
Submitted 19 October, 2016;
originally announced October 2016.
-
Puzzles in modern biology. II. Language, cancer and the recursive processes of evolutionary innovation
Authors:
Steven A. Frank
Abstract:
Human language emerged abruptly. Diverse body forms evolved suddenly. Seed-bearing plants spread rapidly. How do complex evolutionary innovations arise so quickly? Resolving alternative claims remains difficult. The great events of the past happened a long time ago. Cancer provides a model to study evolutionary innovation. A tumor must evolve many novel traits to become an aggressive cancer. I use…
▽ More
Human language emerged abruptly. Diverse body forms evolved suddenly. Seed-bearing plants spread rapidly. How do complex evolutionary innovations arise so quickly? Resolving alternative claims remains difficult. The great events of the past happened a long time ago. Cancer provides a model to study evolutionary innovation. A tumor must evolve many novel traits to become an aggressive cancer. I use what we know or could study about cancer to describe the key processes of innovation. In general, evolutionary systems form a hierarchy of recursive processes. Those recursive processes determine the rates at which innovations are generated, spread and transmitted. I relate the recursive processes to abrupt evolutionary innovation.
△ Less
Submitted 9 September, 2016;
originally announced September 2016.
-
Puzzles in modern biology. I. Male sterility, failure reveals design
Authors:
Steven A. Frank
Abstract:
Many human males produce dysfunctional sperm. Various plants frequently abort pollen. Hybrid matings often produce sterile males. Widespread male sterility is puzzling. Natural selection prunes reproductive failure. Puzzling failure implies something that we do not understand about how organisms are designed. Solving the puzzle reveals the hidden processes of design.
Many human males produce dysfunctional sperm. Various plants frequently abort pollen. Hybrid matings often produce sterile males. Widespread male sterility is puzzling. Natural selection prunes reproductive failure. Puzzling failure implies something that we do not understand about how organisms are designed. Solving the puzzle reveals the hidden processes of design.
△ Less
Submitted 9 September, 2016;
originally announced September 2016.
-
Invariant death
Authors:
Steven A. Frank
Abstract:
In nematodes, environmental or physiological perturbations alter death's scaling of time. In human cancer, genetic perturbations alter death's curvature of time. Those changes in scale and curvature follow the constraining contours of death's invariant geometry. I show that the constraints arise from a fundamental extension to the theories of randomness, invariance and scale. A generalized Gompert…
▽ More
In nematodes, environmental or physiological perturbations alter death's scaling of time. In human cancer, genetic perturbations alter death's curvature of time. Those changes in scale and curvature follow the constraining contours of death's invariant geometry. I show that the constraints arise from a fundamental extension to the theories of randomness, invariance and scale. A generalized Gompertz law follows. The constraints imposed by the invariant Gompertz geometry explain the tendency of perturbations to stretch or bend death's scaling of time. Variability in death rate arises from a combination of constraining universal laws and particular biological processes.
△ Less
Submitted 25 August, 2016;
originally announced August 2016.
-
The invariances of power law size distributions
Authors:
Steven A. Frank
Abstract:
Size varies. Small things are typically more frequent than large things. The logarithm of frequency often declines linearly with the logarithm of size. That power law relation forms one of the common patterns of nature. Why does the complexity of nature reduce to such a simple pattern? Why do things as different as tree size and enzyme rate follow similarly simple patterns? Here I analyze such pat…
▽ More
Size varies. Small things are typically more frequent than large things. The logarithm of frequency often declines linearly with the logarithm of size. That power law relation forms one of the common patterns of nature. Why does the complexity of nature reduce to such a simple pattern? Why do things as different as tree size and enzyme rate follow similarly simple patterns? Here I analyze such patterns by their invariant properties. For example, a common pattern should not change when adding a constant value to all observations. That shift is essentially the renumbering of the points on a ruler without changing the metric information provided by the ruler. A ruler is shift invariant only when its scale is properly calibrated to the pattern being measured. Stretch invariance corresponds to the conservation of the total amount of something, such as the total biomass and consequently the average size. Rotational invariance corresponds to pattern that does not depend on the order in which underlying processes occur, for example, a scale that additively combines the component processes leading to observed values. I use tree size as an example to illustrate how the key invariances shape pattern. A simple interpretation of common pattern follows. That simple interpretation connects the normal distribution to a wide variety of other common patterns through the transformations of scale set by the fundamental invariances.
△ Less
Submitted 7 November, 2016; v1 submitted 17 April, 2016;
originally announced April 2016.
-
Commentary: The nature of cancer research
Authors:
Steven A. Frank
Abstract:
Cancer research reflects an implicit conflict. On the one hand, there is an overwhelming desire to control the disease. We all wish that. On the other hand, we would like to understand why cancer follows so many clearly defined yet puzzling patterns. Why is there such regularity in the rates of progression? Why do different tissues vary so much? There should, of course, be no conflict between cont…
▽ More
Cancer research reflects an implicit conflict. On the one hand, there is an overwhelming desire to control the disease. We all wish that. On the other hand, we would like to understand why cancer follows so many clearly defined yet puzzling patterns. Why is there such regularity in the rates of progression? Why do different tissues vary so much? There should, of course, be no conflict between control and understanding. But the history of cancer research seems to say that those different goals remain oddly estranged. Peto's 1977 article locates the seeds of this conflict most clearly. He describes what is still the most powerful theoretical perspective for analyzing the causes of cancer. He presents many key unsolved puzzles within that context. He also says why most cancer researchers are not interested in these fundamental issues. The subsequent decades of research grew around this rift, blindly, in the way that research disciplines often grow. Let us revisit Peto, almost 40 years ago. We can learn much about the current nature of cancer research.
△ Less
Submitted 24 June, 2015;
originally announced June 2015.
-
d'Alembert's direct and inertial forces acting on populations: the Price equation and the fundamental theorem of natural selection
Authors:
Steven A. Frank
Abstract:
I develop a framework for interpreting the forces that act on any population described by frequencies. The conservation of total frequency, or total probability, shapes the characteristics of force. I begin with Fisher's fundamental theorem of natural selection. That theorem partitions the total evolutionary change of a population into two components. The first component is the partial change caus…
▽ More
I develop a framework for interpreting the forces that act on any population described by frequencies. The conservation of total frequency, or total probability, shapes the characteristics of force. I begin with Fisher's fundamental theorem of natural selection. That theorem partitions the total evolutionary change of a population into two components. The first component is the partial change caused by the direct force of natural selection, holding constant all aspects of the environment. The second component is the partial change caused by the changing environment. I demonstrate that Fisher's partition of total change into the direct force of selection and the forces from the changing environmental frame of reference is identical to d'Alembert's principle of mechanics, which separates the work done by the direct forces from the work done by the inertial forces associated with the changing frame of reference. In d'Alembert's principle, there exist inertial forces from a change in the frame of reference that exactly balance the direct forces. I show that the conservation of total probability strongly shapes the form of the balance between the direct and inertial forces. I then use the strong results for conserved probability to obtain general results for the change in any system quantity, such as biological fitness or energy. Those general results derive from simple coordinate changes between frequencies and system quantities. Ultimately, d'Alembert's separation of direct and inertial forces provides deep conceptual insight into the interpretation of forces and the unification of disparate fields of study.
△ Less
Submitted 20 October, 2015; v1 submitted 24 June, 2015;
originally announced June 2015.
-
Universal expressions of population change by the Price equation: natural selection, information, and maximum entropy production
Authors:
Steven A. Frank
Abstract:
The Price equation shows the unity between the fundamental expressions of change in biology, in information and entropy descriptions of populations, and in aspects of thermodynamics. The Price equation partitions the change in the average value of a metric between two populations. A population may be composed of organisms or particles or any members of a set to which we can assign probabilities. A…
▽ More
The Price equation shows the unity between the fundamental expressions of change in biology, in information and entropy descriptions of populations, and in aspects of thermodynamics. The Price equation partitions the change in the average value of a metric between two populations. A population may be composed of organisms or particles or any members of a set to which we can assign probabilities. A metric may be biological fitness or physical energy or the output of an arbitrarily complicated function that assigns quantitative values to members of the population. The first part of the Price equation describes how directly applied forces change the probabilities assigned to members of the population when holding constant the metrical values of the members---a fixed metrical frame of reference. The second part describes how the metrical values change, altering the metrical frame of reference. In canonical examples, the direct forces balance the changing metrical frame of reference, leaving the average or total metrical values unchanged. In biology, relative reproductive success (fitness) remains invariant as a simple consequence of the conservation of total probability. In physics, systems often conserve total energy. Nonconservative metrics can be described by starting with conserved metrics, and then studying how coordinate transformations between conserved and nonconserved metrics alter the geometry of the dynamics and the aggregate values of populations. From this abstract perspective, key results from different subjects appear more simply as universal geometric principles for the dynamics of populations subject to the constraints of particular conserved quantities
△ Less
Submitted 19 May, 2017; v1 submitted 24 June, 2015;
originally announced June 2015.
-
The inductive theory of natural selection: summary and synthesis
Authors:
Steven A. Frank
Abstract:
The theory of natural selection has two forms. Deductive theory describes how populations change over time. One starts with an initial population and some rules for change. From those assumptions, one calculates the future state of the population. Deductive theory predicts how populations adapt to environmental challenge. Inductive theory describes the causes of change in populations. One starts w…
▽ More
The theory of natural selection has two forms. Deductive theory describes how populations change over time. One starts with an initial population and some rules for change. From those assumptions, one calculates the future state of the population. Deductive theory predicts how populations adapt to environmental challenge. Inductive theory describes the causes of change in populations. One starts with a given amount of change. One then assigns different parts of the total change to particular causes. Inductive theory analyzes alternative causal models for how populations have adapted to environmental challenge. This chapter emphasizes the inductive analysis of cause.
△ Less
Submitted 12 November, 2016; v1 submitted 3 December, 2014;
originally announced December 2014.
-
How to read probability distributions as statements about process
Authors:
Steven A. Frank
Abstract:
Probability distributions can be read as simple expressions of information. Each continuous probability distribution describes how information changes with magnitude. Once one learns to read a probability distribution as a measurement scale of information, opportunities arise to understand the processes that generate the commonly observed patterns. Probability expressions may be parsed into four c…
▽ More
Probability distributions can be read as simple expressions of information. Each continuous probability distribution describes how information changes with magnitude. Once one learns to read a probability distribution as a measurement scale of information, opportunities arise to understand the processes that generate the commonly observed patterns. Probability expressions may be parsed into four components: the dissipation of all information, except the preservation of average values, taken over the measurement scale that relates changes in observed values to changes in information, and the transformation from the underlying scale on which information dissipates to alternative scales on which probability pattern may be expressed. Information invariances set the commonly observed measurement scales and the relations between them. In particular, a measurement scale for information is defined by its invariance to specific transformations of underlying values into measurable outputs. Essentially all common distributions can be understood within this simple framework of information invariance and measurement scale.
△ Less
Submitted 18 November, 2014; v1 submitted 18 September, 2014;
originally announced September 2014.
-
Generative models versus underlying symmetries to explain biological pattern
Authors:
Steven A. Frank
Abstract:
Mathematical models play an increasingly important role in the interpretation of biological experiments. Studies often present a model that generates the observations, connecting hypothesized process to an observed pattern. Such generative models confirm the plausibility of an explanation and make testable hypotheses for further experiments. However, studies rarely consider the broad family of alt…
▽ More
Mathematical models play an increasingly important role in the interpretation of biological experiments. Studies often present a model that generates the observations, connecting hypothesized process to an observed pattern. Such generative models confirm the plausibility of an explanation and make testable hypotheses for further experiments. However, studies rarely consider the broad family of alternative models that match the same observed pattern. The symmetries that define the broad class of matching models are in fact the only aspects of information truly revealed by observed pattern. Commonly observed patterns derive from simple underlying symmetries. This article illustrates the problem by showing the symmetry associated with the observed rate of increase in fitness in a constant environment. That underlying symmetry reveals how each particular generative model defines a single example within the broad class of matching models. Further progress on the relation between pattern and process requires deeper consideration of the underlying symmetries.
△ Less
Submitted 18 April, 2014;
originally announced April 2014.
-
Microbial metabolism: optimal control of uptake versus synthesis
Authors:
Steven A. Frank
Abstract:
Microbes require several complex organic molecules for growth. A species may obtain a required factor by taking up molecules released by other species or by synthesizing the molecule. The patterns of uptake and synthesis set a flow of resources through the multiple species that create a microbial community. This article analyzes a simple mathematical model of the tradeoff between uptake and synthe…
▽ More
Microbes require several complex organic molecules for growth. A species may obtain a required factor by taking up molecules released by other species or by synthesizing the molecule. The patterns of uptake and synthesis set a flow of resources through the multiple species that create a microbial community. This article analyzes a simple mathematical model of the tradeoff between uptake and synthesis. Key factors include the influx rate from external sources relative to the outflux rate, the rate of internal decay within cells, and the cost of synthesis. Aspects of demography also matter, such as cellular birth and death rates, the expected time course of a local resource flow, and the associated lifespan of the local population. Spatial patterns of genetic variability and differentiation between populations may also strongly influence the evolution of metabolic regulatory controls of individual species and thus the structuring of microbial communities. The widespread use of optimality approaches in recent work on microbial metabolism has ignored demography and genetic structure.
△ Less
Submitted 25 February, 2014;
originally announced February 2014.
-
Input-output relations in biological systems: measurement, information and the Hill equation
Authors:
Steven A. Frank
Abstract:
Biological systems produce outputs in response to variable inputs. Input-output relations tend to follow a few regular patterns. For example, many chemical processes follow the S-shaped Hill equation relation between input concentrations and output concentrations. That Hill equation pattern contradicts the fundamental Michaelis-Menten theory of enzyme kinetics. I use the discrepancy between the ex…
▽ More
Biological systems produce outputs in response to variable inputs. Input-output relations tend to follow a few regular patterns. For example, many chemical processes follow the S-shaped Hill equation relation between input concentrations and output concentrations. That Hill equation pattern contradicts the fundamental Michaelis-Menten theory of enzyme kinetics. I use the discrepancy between the expected Michaelis-Menten process of enzyme kinetics and the widely observed Hill equation pattern of biological systems to explore the general properties of biological input-output relations. I start with the various processes that could explain the discrepancy between basic chemistry and biological pattern. I then expand the analysis to consider broader aspects that shape biological input-output relations. Key aspects include the input-output processing by component subsystems and how those components combine to determine the system's overall input-output relations. That aggregate structure often imposes strong regularity on underlying disorder. Aggregation imposes order by dissipating information as it flows through the components of a system. The dissipation of information may be evaluated by the analysis of measurement and precision, explaining why certain common scaling patterns arise so frequently in input-output relations. I discuss how aggregation, measurement and scale provide a framework for understanding the relations between pattern and process. The regularity imposed by those broader structural aspects sets the contours of variation in biology. Thus, biological design will also tend to follow those contours. Natural selection may act primarily to modulate system properties within those broad constraints.
△ Less
Submitted 10 December, 2013;
originally announced December 2013.
-
Natural selection. VII. History and interpretation of kin selection theory
Authors:
Steven A. Frank
Abstract:
Kin selection theory is a kind of causal analysis. The initial form of kin selection ascribed cause to costs, benefits, and genetic relatedness. The theory then slowly developed a deeper and more sophisticated approach to partitioning the causes of social evolution. Controversy followed because causal analysis inevitably attracts opposing views. It is always possible to separate total effects into…
▽ More
Kin selection theory is a kind of causal analysis. The initial form of kin selection ascribed cause to costs, benefits, and genetic relatedness. The theory then slowly developed a deeper and more sophisticated approach to partitioning the causes of social evolution. Controversy followed because causal analysis inevitably attracts opposing views. It is always possible to separate total effects into different component causes. Alternative causal schemes emphasize different aspects of a problem, reflecting the distinct goals, interests, and biases of different perspectives. For example, group selection is a particular causal scheme with certain advantages and significant limitations. Ultimately, to use kin selection theory to analyze natural patterns and to understand the history of debates over different approaches, one must follow the underlying history of causal analysis. This article describes the history of kin selection theory, with emphasis on how the causal perspective improved through the study of key patterns of natural history, such as dispersal and sex ratio, and through a unified approach to demographic and social processes. Independent historical developments in the multivariate analysis of quantitative traits merged with the causal analysis of social evolution by kin selection.
△ Less
Submitted 19 May, 2013;
originally announced May 2013.
-
Natural selection. VI. Partitioning the information in fitness and characters by path analysis
Authors:
Steven A. Frank
Abstract:
Three steps aid in the analysis of selection. First, describe phenotypes by their component causes. Components include genes, maternal effects, symbionts, and any other predictors of phenotype that are of interest. Second, describe fitness by its component causes, such as an individual's phenotype, its neighbors' phenotypes, resource availability, and so on. Third, put the predictors of phenotype…
▽ More
Three steps aid in the analysis of selection. First, describe phenotypes by their component causes. Components include genes, maternal effects, symbionts, and any other predictors of phenotype that are of interest. Second, describe fitness by its component causes, such as an individual's phenotype, its neighbors' phenotypes, resource availability, and so on. Third, put the predictors of phenotype and fitness into an exact equation for evolutionary change, providing a complete expression of selection and other evolutionary processes. The complete expression separates the distinct causal roles of the various hypothesized components of phenotypes and fitness. Traditionally, those components are given by the covariance, variance, and regression terms of evolutionary models. I show how to interpret those statistical expressions with respect to information theory. The resulting interpretation allows one to read the fundamental equations of selection and evolution as sentences that express how various causes lead to the accumulation of information by selection and the decay of information by other evolutionary processes. The interpretation in terms of information leads to a deeper understanding of selection and heritability, and a clearer sense of how to formulate causal hypotheses about evolutionary process. Kin selection appears as a particular type of causal analysis that partitions social effects into meaningful components.
△ Less
Submitted 22 January, 2013;
originally announced January 2013.
-
Natural selection. V. How to read the fundamental equations of evolutionary change in terms of information theory
Authors:
Steven A. Frank
Abstract:
The equations of evolutionary change by natural selection are commonly expressed in statistical terms. Fisher's fundamental theorem emphasizes the variance in fitness. Quantitative genetics expresses selection with covariances and regressions. Population genetic equations depend on genetic variances. How can we read those statistical expressions with respect to the meaning of natural selection? On…
▽ More
The equations of evolutionary change by natural selection are commonly expressed in statistical terms. Fisher's fundamental theorem emphasizes the variance in fitness. Quantitative genetics expresses selection with covariances and regressions. Population genetic equations depend on genetic variances. How can we read those statistical expressions with respect to the meaning of natural selection? One possibility is to relate the statistical expressions to the amount of information that populations accumulate by selection. However, the connection between selection and information theory has never been compelling. Here, I show the correct relations between statistical expressions for selection and information theory expressions for selection. Those relations link selection to the fundamental concepts of entropy and information in the theories of physics, statistics, and communication. We can now read the equations of selection in terms of their natural meaning. Selection causes populations to accumulate information about the environment.
△ Less
Submitted 16 November, 2012;
originally announced November 2012.
-
Natural selection. IV. The Price equation
Authors:
Steven A. Frank
Abstract:
The Price equation partitions total evolutionary change into two components. The first component provides an abstract expression of natural selection. The second component subsumes all other evolutionary processes, including changes during transmission. The natural selection component is often used in applications. Those applications attract widespread interest for their simplicity of expression a…
▽ More
The Price equation partitions total evolutionary change into two components. The first component provides an abstract expression of natural selection. The second component subsumes all other evolutionary processes, including changes during transmission. The natural selection component is often used in applications. Those applications attract widespread interest for their simplicity of expression and ease of interpretation. Those same applications attract widespread criticism by drop** the second component of evolutionary change and by leaving unspecified the detailed assumptions needed for a complete study of dynamics. Controversies over approximation and dynamics have nothing to do with the Price equation itself, which is simply a mathematical equivalence relation for total evolutionary change expressed in an alternative form. Disagreements about approach have to do with the tension between the relative valuation of abstract versus concrete analyses. The Price equation's greatest value has been on the abstract side, particularly the invariance relations that illuminate the understanding of natural selection. Those abstract insights lay the foundation for applications in terms of kin selection, information theory interpretations of natural selection, and partitions of causes by path analysis. I discuss recent critiques of the Price equation by Nowak and van Veelen.
△ Less
Submitted 6 April, 2012;
originally announced April 2012.
-
Demography and the tragedy of the commons
Authors:
Steven A. Frank
Abstract:
Individual success in group-structured populations has two components. First, an individual gains by outcompeting its neighbors for local resources. Second, an individual's share of group success must be weighted by the total productivity of the group. The essence of sociality arises from the tension between selfish gains against neighbors and the associated loss that selfishness imposes by degrad…
▽ More
Individual success in group-structured populations has two components. First, an individual gains by outcompeting its neighbors for local resources. Second, an individual's share of group success must be weighted by the total productivity of the group. The essence of sociality arises from the tension between selfish gains against neighbors and the associated loss that selfishness imposes by degrading the efficiency of the group. Without some force to modulate selfishness, the natural tendencies of self interest typically degrade group performance to the detriment of all. This is the tragedy of the commons. Kin selection provides the most widely discussed way in which the tragedy is overcome in biology. Kin selection arises from behavioral associations within groups caused either by genetical kinship or by other processes that correlate the behaviors of group members. Here, I emphasize demography as a second factor that may also modulate the tragedy of the commons and favor cooperative integration of groups. Each act of selfishness or cooperation in a group often influences group survival and fecundity over many subsequent generations. For example, a cooperative act early in the growth cycle of a colony may enhance the future size and survival of the colony. This time-dependent benefit can greatly increase the degree of cooperation favored by natural selection, providing another way in which to overcome the tragedy of the commons and enhance the integration of group behavior. I conclude that analyses of sociality must account for both the behavioral associations of kin selection theory and the demographic consequences of life history theory.
△ Less
Submitted 14 December, 2011;
originally announced December 2011.
-
A general model of the public goods dilemma
Authors:
Steven A. Frank
Abstract:
An individually costly act that benefits all group members is a public good. Natural selection favors individual contribution to public goods only when some benefit to the individual offsets the cost of contribution. Problems of sex ratio, parasite virulence, microbial metabolism, punishment of noncooperators, and nearly all aspects of sociality have been analyzed as public goods shaped by kin and…
▽ More
An individually costly act that benefits all group members is a public good. Natural selection favors individual contribution to public goods only when some benefit to the individual offsets the cost of contribution. Problems of sex ratio, parasite virulence, microbial metabolism, punishment of noncooperators, and nearly all aspects of sociality have been analyzed as public goods shaped by kin and group selection. Here, I develop two general aspects of the public goods problem that have received relatively little attention. First, variation in individual resources favors selfish individuals to vary their allocation to public goods. Those individuals better endowed contribute their excess resources to public benefit, whereas those individuals with fewer resources contribute less to the public good. Thus, purely selfish behavior causes individuals to stratify into upper classes that contribute greatly to public benefit and social cohesion and to lower classes that contribute little to the public good. Second, if group success absolutely requires production of the public good, then the pressure favoring production is relatively high. By contrast, if group success depends weakly on the public good, then the pressure favoring production is relatively weak. Stated in this way, it is obvious that the role of baseline success is important. However, discussions of public goods problems sometimes fail to emphasize this point sufficiently. The models here suggest simple tests for the roles of resource variation and baseline success. Given the widespread importance of public goods, better models and tests would greatly deepen our understanding of many processes in biology and sociality.
△ Less
Submitted 14 December, 2011;
originally announced December 2011.
-
Evolutionary foundations of cooperation and group cohesion
Authors:
Steven A. Frank
Abstract:
In biology, the evolution of increasingly cooperative groups has shaped the history of life. Genes collaborate in the control of cells; cells efficiently divide tasks to produce cohesive multicellular individuals; individual members of insect colonies cooperate in integrated societies. Biological cooperation provides a foundation on which to understand human behavior. Conceptually, the economics o…
▽ More
In biology, the evolution of increasingly cooperative groups has shaped the history of life. Genes collaborate in the control of cells; cells efficiently divide tasks to produce cohesive multicellular individuals; individual members of insect colonies cooperate in integrated societies. Biological cooperation provides a foundation on which to understand human behavior. Conceptually, the economics of efficient allocation and the game-like processes of strategy are well understood in biology; we find the same essential processes in many successful theories of human sociality. Historically, the trace of biological evolution informs in two ways. First, the evolutionary transformations in biological cooperation provide insight into how economic and strategic processes play out over time--a source of analogy that, when applied thoughtfully, aids analysis of human sociality. Second, humans arose from biological history--a factual account of the past that tells us much about the material basis of human behavior.
△ Less
Submitted 13 December, 2011;
originally announced December 2011.
-
Maladaptation and the paradox of robustness in evolution
Authors:
Steven A. Frank
Abstract:
Background. Organisms use a variety of mechanisms to protect themselves against perturbations. For example, repair mechanisms fix damage, feedback loops keep homeostatic systems at their setpoints, and biochemical filters distinguish signal from noise. Such buffering mechanisms are often discussed in terms of robustness, which may be measured by reduced sensitivity of performance to perturbations.…
▽ More
Background. Organisms use a variety of mechanisms to protect themselves against perturbations. For example, repair mechanisms fix damage, feedback loops keep homeostatic systems at their setpoints, and biochemical filters distinguish signal from noise. Such buffering mechanisms are often discussed in terms of robustness, which may be measured by reduced sensitivity of performance to perturbations. Methodology/Principal Findings. I use a mathematical model to analyze the evolutionary dynamics of robustness in order to understand aspects of organismal design by natural selection. I focus on two characters: one character performs an adaptive task; the other character buffers the performance of the first character against perturbations. Increased perturbations favor enhanced buffering and robustness, which in turn decreases sensitivity and reduces the intensity of natural selection on the adaptive character. Reduced selective pressure on the adaptive character often leads to a less costly, lower performance trait. Conclusions/Significance. The paradox of robustness arises from evolutionary dynamics: enhanced robustness causes an evolutionary reduction in the adaptive performance of the target character, leading to a degree of maladaptation compared to what could be achieved by natural selection in the absence of robustness mechanisms. Over evolutionary time, buffering traits may become layered on top of each other, while the underlying adaptive traits become replaced by cheaper, lower performance components. The paradox of robustness has widespread implications for understanding organismal design.
△ Less
Submitted 13 December, 2011;
originally announced December 2011.
-
Natural selection. III. Selection versus transmission and the levels of selection
Authors:
Steven A. Frank
Abstract:
George Williams defined an evolutionary unit as hereditary information for which the selection bias between competing units dominates the informational decay caused by imperfect transmission. In this article, I extend Williams' approach to show that the ratio of selection bias to transmission bias provides a unifying framework for diverse biological problems. Specific examples include Haldane and…
▽ More
George Williams defined an evolutionary unit as hereditary information for which the selection bias between competing units dominates the informational decay caused by imperfect transmission. In this article, I extend Williams' approach to show that the ratio of selection bias to transmission bias provides a unifying framework for diverse biological problems. Specific examples include Haldane and Lande's mutation-selection balance, Eigen's error threshold and quasispecies, Van Valen's clade selection, Price's multilevel formulation of group selection, Szathmary and Demeter's evolutionary origin of primitive cells, Levin and Bull's short-sighted evolution of HIV virulence, Frank's timescale analysis of microbial metabolism, and Maynard Smith and Szathmary's major transitions in evolution. The insights from these diverse applications lead to a deeper understanding of kin selection, group selection, multilevel evolutionary analysis, and the philosophical problems of evolutionary units and individuality.
△ Less
Submitted 12 December, 2011;
originally announced December 2011.
-
Natural selection. II. Developmental variability and evolutionary rate
Authors:
Steven A. Frank
Abstract:
In classical evolutionary theory, genetic variation provides the source of heritable phenotypic variation on which natural selection acts. Against this classical view, several theories have emphasized that developmental variability and learning enhance nonheritable phenotypic variation, which in turn can accelerate evolutionary response. In this paper, I show how developmental variability alters e…
▽ More
In classical evolutionary theory, genetic variation provides the source of heritable phenotypic variation on which natural selection acts. Against this classical view, several theories have emphasized that developmental variability and learning enhance nonheritable phenotypic variation, which in turn can accelerate evolutionary response. In this paper, I show how developmental variability alters evolutionary dynamics by smoothing the landscape that relates genotype to fitness. In a fitness landscape with multiple peaks and valleys, developmental variability can smooth the landscape to provide a directly increasing path of fitness to the highest peak. Developmental variability also allows initial survival of a genotype in response to novel or extreme environmental challenge, providing an opportunity for subsequent adaptation. This initial survival advantage arises from the way in which developmental variability smooths and broadens the fitness landscape. Ultimately, the synergism between developmental processes and genetic variation sets evolutionary rate.
△ Less
Submitted 26 September, 2011;
originally announced September 2011.
-
Natural selection. I. Variable environments and uncertain returns on investment
Authors:
Steven A. Frank
Abstract:
Many studies have analyzed how variability in reproductive success affects fitness. However, each study tends to focus on a particular problem, leaving unclear the overall structure of variability in populations. This fractured conceptual framework often causes particular applications to be incomplete or improperly analyzed. In this paper, I present a concise introduction to the two key aspects of…
▽ More
Many studies have analyzed how variability in reproductive success affects fitness. However, each study tends to focus on a particular problem, leaving unclear the overall structure of variability in populations. This fractured conceptual framework often causes particular applications to be incomplete or improperly analyzed. In this paper, I present a concise introduction to the two key aspects of the theory. First, all measures of fitness ultimately arise from the relative comparison of the reproductive success of individuals or genotypes with the average reproductive success in the population. That relative measure creates a diminishing relation between reproductive success and fitness. Diminishing returns reduce fitness in proportion to variability in reproductive success. The relative measurement of success also induces a frequency dependence that favors rare types. Second, variability in populations has a hierarchical structure. Variable success in different traits of an individual affects that individual's variation in reproduction. Correlation between different individuals' reproduction affects variation in the aggregate success of particular alleles across the population. One must consider the hierarchical structure of variability in relation to different consequences of temporal, spatial, and developmental variability. Although a complete analysis of variability has many separate parts, this simple framework allows one to see the structure of the whole and to place particular problems in their proper relation to the general theory. The biological understanding of relative success and the hierarchical structure of variability in populations may also contribute to a deeper economic theory of returns under uncertainty.
△ Less
Submitted 26 September, 2011;
originally announced September 2011.
-
Wright's adaptive landscape versus Fisher's fundamental theorem
Authors:
Steven A. Frank
Abstract:
Two giants of evolutionary theory, Sewall Wright and R. A. Fisher, fought bitterly for over thirty years. The Wright-Fisher controversy forms a cornerstone of the history and philosophy of biology. I argue that the standard interpretations of the Wright-Fisher controversy do not accurately represent the ideas and arguments of these two key historical figures. The usual account contrasts the major…
▽ More
Two giants of evolutionary theory, Sewall Wright and R. A. Fisher, fought bitterly for over thirty years. The Wright-Fisher controversy forms a cornerstone of the history and philosophy of biology. I argue that the standard interpretations of the Wright-Fisher controversy do not accurately represent the ideas and arguments of these two key historical figures. The usual account contrasts the major slogans attached to each name: Wright's adaptive landscape and shifting balance theory of evolution versus Fisher's fundamental theorem of natural selection. These alternative theories are in fact incommensurable. Wright's theory is a detailed dynamical model of evolutionary change in actual populations. Fisher's theory is an abstract invariance and conservation law that, like all physical laws, captures essential features of a system but does not account for all aspects of dynamics in real examples. This key contrast between embodied theories of real cases and abstract laws is missing from prior analyses of Wright versus Fisher. They never argued about this contrast. Instead, the issue at stake in their arguments concerned the actual dynamics of real populations. Both agreed that fluctuations of nonadditive (epistatic) gene combinations play a central role in evolution. Wright emphasized stochastic fluctuations of gene combinations in small, isolated populations. By contrast, Fisher believed that fluctuating selection in large populations was the main cause of fluctuation in nonadditive gene combinations. Close reading shows that widely cited views attributed to Fisher mostly come from what Wright said about Fisher, whereas Fisher's own writings clearly do not support such views.
△ Less
Submitted 17 February, 2011;
originally announced February 2011.
-
Measurement scale in maximum entropy models of species abundance
Authors:
Steven A. Frank
Abstract:
The consistency of the species abundance distribution across diverse communities has attracted widespread attention. In this paper, I argue that the consistency of pattern arises because diverse ecological mechanisms share a common symmetry with regard to measurement scale. By symmetry, I mean that different ecological processes preserve the same measure of information and lose all other informati…
▽ More
The consistency of the species abundance distribution across diverse communities has attracted widespread attention. In this paper, I argue that the consistency of pattern arises because diverse ecological mechanisms share a common symmetry with regard to measurement scale. By symmetry, I mean that different ecological processes preserve the same measure of information and lose all other information in the aggregation of various perturbations. I frame these explanations of symmetry, measurement, and aggregation in terms of a recently developed extension to the theory of maximum entropy. I show that the natural measurement scale for the species abundance distribution is log-linear: the information in observations at small population sizes scales logarithmically and, as population size increases, the scaling of information grades from logarithmic to linear. Such log-linear scaling leads naturally to a gamma distribution for species abundance, which matches well with the observed patterns. Much of the variation between samples can be explained by the magnitude at which the measurement scale grades from logarithmic to linear. This measurement approach can be applied to the similar problem of allelic diversity in population genetics and to a wide variety of other patterns in biology.
△ Less
Submitted 24 November, 2010;
originally announced November 2010.
-
A simple derivation and classification of common probability distributions based on information symmetry and measurement scale
Authors:
Steven A. Frank,
Eric Smith
Abstract:
Commonly observed patterns typically follow a few distinct families of probability distributions. Over one hundred years ago, Karl Pearson provided a systematic derivation and classification of the common continuous distributions. His approach was phenomenological: a differential equation that generated common distributions without any underlying conceptual basis for why common distributions have…
▽ More
Commonly observed patterns typically follow a few distinct families of probability distributions. Over one hundred years ago, Karl Pearson provided a systematic derivation and classification of the common continuous distributions. His approach was phenomenological: a differential equation that generated common distributions without any underlying conceptual basis for why common distributions have particular forms and what explains the familial relations. Pearson's system and its descendants remain the most popular systematic classification of probability distributions. Here, we unify the disparate forms of common distributions into a single system based on two meaningful and justifiable propositions. First, distributions follow maximum entropy subject to constraints, where maximum entropy is equivalent to minimum information. Second, different problems associate magnitude to information in different ways, an association we describe in terms of the relation between information invariance and measurement scale. Our framework relates the different continuous probability distributions through the variations in measurement scale that change each family of maximum entropy distributions into a distinct family.
△ Less
Submitted 11 October, 2010;
originally announced October 2010.
-
Measurement Invariance, Entropy, and Probability
Authors:
Steven A. Frank,
D. Eric Smith
Abstract:
We show that the natural scaling of measurement for a particular problem defines the most likely probability distribution of observations taken from that measurement scale. Our approach extends the method of maximum entropy to use measurement scale as a type of information constraint. We argue that a very common measurement scale is linear at small magnitudes grading into logarithmic at large ma…
▽ More
We show that the natural scaling of measurement for a particular problem defines the most likely probability distribution of observations taken from that measurement scale. Our approach extends the method of maximum entropy to use measurement scale as a type of information constraint. We argue that a very common measurement scale is linear at small magnitudes grading into logarithmic at large magnitudes, leading to observations that often follow Student's probability distribution which has a Gaussian shape for small fluctuations from the mean and a power law shape for large fluctuations from the mean. An inverse scaling often arises in which measures naturally grade from logarithmic to linear as one moves from small to large magnitudes, leading to observations that often follow a gamma probability distribution. A gamma distribution has a power law shape for small magnitudes and an exponential shape for large magnitudes. The two measurement scales are natural inverses connected by the Laplace integral transform. This inversion connects the two major scaling patterns commonly found in nature. We also show that superstatistics is a special case of an integral transform, and thus can be understood as a particular way in which to change the scale of measurement. Incorporating information about measurement scale into maximum entropy provides a general approach to the relations between measurement, information and probability.
△ Less
Submitted 26 February, 2010;
originally announced March 2010.
-
The common patterns of nature
Authors:
Steven A. Frank
Abstract:
We typically observe large-scale outcomes that arise from the interactions of many hidden, small-scale processes. Examples include age of disease onset, rates of amino acid substitutions, and composition of ecological communities. The macroscopic patterns in each problem often vary around a characteristic shape that can be generated by neutral processes. A neutral generative model assumes that e…
▽ More
We typically observe large-scale outcomes that arise from the interactions of many hidden, small-scale processes. Examples include age of disease onset, rates of amino acid substitutions, and composition of ecological communities. The macroscopic patterns in each problem often vary around a characteristic shape that can be generated by neutral processes. A neutral generative model assumes that each microscopic process follows unbiased stochastic fluctuations: random connections of network nodes; amino acid substitutions with no effect on fitness; species that arise or disappear from communities randomly. These neutral generative models often match common patterns of nature. In this paper, I present the theoretical background by which we can understand why these neutral generative models are so successful. I show how the classic patterns such as Poisson and Gaussian arise. Each classic pattern was often discovered by a simple neutral generative model. The neutral patterns share a special characteristic: they describe the patterns of nature that follow from simple constraints on information. For example, any aggregation of processes that preserves information only about the mean and variance attracts to the Gaussian pattern; any aggregation that preserves information only about the mean attracts to the exponential pattern; any aggregation that preserves information only about the geometric mean attracts to the power law pattern. I present an informational framework of the common patterns of nature based on the method of maximum entropy. This framework shows that each neutral generative model is a special case that helps to discover a particular set of informational constraints; those informational constraints define a much wider domain of non-neutral generative processes that attract to the same neutral pattern.
△ Less
Submitted 18 June, 2009;
originally announced June 2009.
-
Natural selection maximizes Fisher information
Authors:
Steven A. Frank
Abstract:
In biology, information flows from the environment to the genome by the process of natural selection. But it has not been clear precisely what sort of information metric properly describes natural selection. Here, I show that Fisher information arises as the intrinsic metric of natural selection and evolutionary dynamics. Maximizing the amount of Fisher information about the environment captured…
▽ More
In biology, information flows from the environment to the genome by the process of natural selection. But it has not been clear precisely what sort of information metric properly describes natural selection. Here, I show that Fisher information arises as the intrinsic metric of natural selection and evolutionary dynamics. Maximizing the amount of Fisher information about the environment captured by the population leads to Fisher's fundamental theorem of natural selection, the most profound statement about how natural selection influences evolutionary dynamics. I also show a relation between Fisher information and Shannon information (entropy) that may help to unify the correspondence between information and dynamics. Finally, I discuss possible connections between the fundamental role of Fisher information in statistics, biology, and other fields of science.
△ Less
Submitted 23 January, 2009;
originally announced January 2009.