-
Limit theorems for Randic index for Erdős-Renyi graphs
Authors:
Laura Eslava,
Sayle Sigarreta,
Arno Siri-Jegousse
Abstract:
We prove that the generalized Randic index over graphs following the Erdős-Renyi model, for both the sparse and dense regimes, is concentrated around its mean when the number of vertices tends to infinity.
We prove that the generalized Randic index over graphs following the Erdős-Renyi model, for both the sparse and dense regimes, is concentrated around its mean when the number of vertices tends to infinity.
△ Less
Submitted 17 May, 2024;
originally announced May 2024.
-
The Lamperti transformation in the infinite-dimensional setting and the genealogies of self-similar Markov processes
Authors:
Arno Siri-Jégousse,
Alejandro Hernández Wences
Abstract:
We propose a change in focus from the prevalent paradigm based on the branching property as a tool to analyze the structure of population models, to one based on the self-similarity property, which we also introduce for the first time in the setting of measure-valued processes. By extending the well-known Lamperti transformation for self-similar Markov processes to the Banach-valued case we are ab…
▽ More
We propose a change in focus from the prevalent paradigm based on the branching property as a tool to analyze the structure of population models, to one based on the self-similarity property, which we also introduce for the first time in the setting of measure-valued processes. By extending the well-known Lamperti transformation for self-similar Markov processes to the Banach-valued case we are able to generalize celebrated results in population genetics that describe the frequency-process of measure-valued stable branching processes in terms of the subfamily of Beta-Fleming-Viot processes. In our work we describe the frequency process of populations whose total size evolves as any positive self-similar Markov process in terms of general $Λ$-Fleming-Viot processes. Our results demonstrate the potential power of the self-similar perspective for the study of population models in which the reproduction dynamics of the individuals depend on the total population size, allowing for more complex and realistic models.
△ Less
Submitted 27 May, 2024; v1 submitted 16 May, 2024;
originally announced May 2024.
-
Exchangeable coalescents beyond the Cannings class
Authors:
Arno Siri-Jégousse,
Alejandro H. Wences
Abstract:
We propose a general framework for the study of the genealogy of neutral discrete-time populations. We remove the standard assumption of exchangeability of offspring distributions appearing in Cannings' models, and replace it by a less restrictive condition of non-heritability of reproductive success. We provide a general criterion for the weak convergence of their genealogies to $Ξ$-coalescents,…
▽ More
We propose a general framework for the study of the genealogy of neutral discrete-time populations. We remove the standard assumption of exchangeability of offspring distributions appearing in Cannings' models, and replace it by a less restrictive condition of non-heritability of reproductive success. We provide a general criterion for the weak convergence of their genealogies to $Ξ$-coalescents, and apply it to a simple parametrization of our scenario (which, under mild conditions, we also prove to essentially include the general case). We provide examples for such populations, including models with highly-asymmetric offspring distributions and populations undergoing random but recurrent bottlenecks. Finally we study the limit genealogy of a new exponential model which, as previously shown for related models and in spite of its built in (fitness) inheritance mechanism, can be brought into our setting.
△ Less
Submitted 17 April, 2024; v1 submitted 5 December, 2022;
originally announced December 2022.
-
Seed bank Cannings Graphs: How dormancy smoothes random genetic drift
Authors:
Adrián González Casanova,
Lizbeth Peñaloza,
Arno Siri-Jégousse
Abstract:
In this article, we introduce a random (directed) graph model for the simultaneous forwards and backwards description of a rather broad class of Cannings models with a seed bank mechanism. This provides a simple tool to establish a sampling duality in the finite population size, and obtain a path-wise embedding of the forward frequency process and the backward ancestral process. Further, it allows…
▽ More
In this article, we introduce a random (directed) graph model for the simultaneous forwards and backwards description of a rather broad class of Cannings models with a seed bank mechanism. This provides a simple tool to establish a sampling duality in the finite population size, and obtain a path-wise embedding of the forward frequency process and the backward ancestral process. Further, it allows the derivation of limit theorems that generalize celebrated results by Möhle to models with seed banks, and where it can be seen how the effect of seed banks affects the genealogies. The explicit graphical construction is a new tool to understand the subtle interplay of seed banks, reproduction and genetic drift in population genetics.
△ Less
Submitted 19 May, 2023; v1 submitted 11 October, 2022;
originally announced October 2022.
-
Asymptotics of the frequency spectrum for general Dirichlet Xi-coalescents
Authors:
Adrian Gonzalez Casanova,
Veronica Miro Pina,
Emmanuel Schertzer,
Arno Siri-Jegousse
Abstract:
In this work, we study general Dirichlet coalescents, which are a family of Xi-coalecents constructed from i.i.d mass partitions, and are an extension of the symmetric coalescent. This class of models is motivated by population models with recurrent demographic bottlenecks. We study the short time behavior of the multidimensional block counting process whose i-th component counts the number of blo…
▽ More
In this work, we study general Dirichlet coalescents, which are a family of Xi-coalecents constructed from i.i.d mass partitions, and are an extension of the symmetric coalescent. This class of models is motivated by population models with recurrent demographic bottlenecks. We study the short time behavior of the multidimensional block counting process whose i-th component counts the number of blocks of size i. Compared to standard coalescent models (such as the class of Lambda-coalescents coming down from infinity), our process has no deterministic speed of coming down from infinity. In particular, we prove that, under appropriate re-scaling, it converges to a stochastic process which is the unique solution of a martingale problem. We show that the multivariate Lamperti transform of this limiting process is a Markov Additive Process (MAP). This allows us to provide some asymptotics for the n-Site Frequency Spectrum, which is a statistic widely used in population genetics. In particular, the rescaled number of mutations converges to the exponential functional of a subordinator.
△ Less
Submitted 20 September, 2023; v1 submitted 27 September, 2022;
originally announced September 2022.
-
The shape of a seed bank tree
Authors:
Adrián González Casanova,
Lizbeth Peñaloza,
Arno Siri-Jégousse
Abstract:
We derive the asymptotic behavior of the total, active and inactive branch lengths of the seed bank coalescent, when the size of the initial sample grows to infinity. Those random variables have important applications for populations evolving under some seed bank effects, such as plants and bacteria, and for some cases of structured populations like metapopulations. The proof relies on the study o…
▽ More
We derive the asymptotic behavior of the total, active and inactive branch lengths of the seed bank coalescent, when the size of the initial sample grows to infinity. Those random variables have important applications for populations evolving under some seed bank effects, such as plants and bacteria, and for some cases of structured populations like metapopulations. The proof relies on the study of the tree at a stop** time corresponding to the first time that a deactivated lineage reactivates. We also give conditional sampling formulas for the random partition and we study the system at the time of the first deactivation of a lineage. All these results provide a good picture of the different regimes and behaviors of the block-counting process of the seed bank coalescent.
△ Less
Submitted 24 September, 2020; v1 submitted 13 January, 2020;
originally announced January 2020.
-
Site Frequency Spectrum of the Bolthausen-Sznitman Coalescent
Authors:
Götz Kersting,
Arno Siri-Jégousse,
Alejandro H. Wences
Abstract:
We derive explicit formulas for the two first moments of he site frequency spectrum $(SFS_{n,b})_{1\leq b\leq n-1}$ of the Bolthausen-Sznitman coalescent along with some precise and efficient approximations, even for small sample sizes $n$. These results provide new $L_2$-asymptotics for some values of $b=o(n)$. We also study the length of internal branches carrying $b>n/2$ individuals. In this ca…
▽ More
We derive explicit formulas for the two first moments of he site frequency spectrum $(SFS_{n,b})_{1\leq b\leq n-1}$ of the Bolthausen-Sznitman coalescent along with some precise and efficient approximations, even for small sample sizes $n$. These results provide new $L_2$-asymptotics for some values of $b=o(n)$. We also study the length of internal branches carrying $b>n/2$ individuals. In this case we obtain the distribution function and a convergence in law. Our results rely on the random recursive tree construction of the Bolthausen-Sznitman coalescent.
△ Less
Submitted 3 October, 2019;
originally announced October 2019.
-
The minimal observable clade size of exchangeable coalescents
Authors:
Fabian Freund,
Arno Siri-Jégousse
Abstract:
For $Λ$-$n$-coalescents with mutation, we analyse the size $O_n$ of the partition block of $i\in\{1,\ldots,n\}$ at the time where the first mutation appears on the tree that affects $i$ and is shared with any other $j\in\{1,\ldots,n\}$. We provide asymptotics of $O_n$ for $n\to\infty$ and a recursion for all moments of $O_n$ for finite $n$. This variable gives an upper bound for the minimal clade…
▽ More
For $Λ$-$n$-coalescents with mutation, we analyse the size $O_n$ of the partition block of $i\in\{1,\ldots,n\}$ at the time where the first mutation appears on the tree that affects $i$ and is shared with any other $j\in\{1,\ldots,n\}$. We provide asymptotics of $O_n$ for $n\to\infty$ and a recursion for all moments of $O_n$ for finite $n$. This variable gives an upper bound for the minimal clade size [2], which is not observable in real data. In applications to genetics, it has been shown to be useful to lower classification errors in genealogical model selection [10].
△ Less
Submitted 27 June, 2019;
originally announced June 2019.
-
The Symmetric Coalescent and Wright-Fisher models with bottlenecks
Authors:
Adrián González Casanova,
Verónica Miró Pina,
Arno Siri-Jégousse
Abstract:
We define a new class of $Ξ$-coalescents characterized by a possibly infinite measure over the non negative integers. We call them symmetric coalescents since they are the unique family of exchangeable coalescents satisfying a symmetry property on their coagulation rates: they are invariant under any transformation that consists in moving one element from one block to another without changing the…
▽ More
We define a new class of $Ξ$-coalescents characterized by a possibly infinite measure over the non negative integers. We call them symmetric coalescents since they are the unique family of exchangeable coalescents satisfying a symmetry property on their coagulation rates: they are invariant under any transformation that consists in moving one element from one block to another without changing the total number of blocks. We illustrate the diversity of behaviors of this family of processes by introducing and studying a one parameter subclass, the $(β,S)$-coalescents. We also embed this family in a larger class of $Ξ$-coalescents arising as the limit genealogies of Wright-Fisher models with bottlenecks. Some convergence results rely on a new Skorokhod type metric, that induces the Meyer-Zheng topology, which allows to study the scaling limit of non-markovian processes using standard techniques.
△ Less
Submitted 1 March, 2022; v1 submitted 13 March, 2019;
originally announced March 2019.
-
Phase-type distributions in population genetics
Authors:
Asger Hobolth,
Arno Siri-Jégousse,
Mogens Bladt
Abstract:
Probability modelling for DNA sequence evolution is well established and provides a rich framework for understanding genetic variation between samples of individuals from one or more populations. We show that both classical and more recent models for coalescence (with or without recombination) can be described in terms of the so-called phase-type theory, where complicated and tedious calculations…
▽ More
Probability modelling for DNA sequence evolution is well established and provides a rich framework for understanding genetic variation between samples of individuals from one or more populations. We show that both classical and more recent models for coalescence (with or without recombination) can be described in terms of the so-called phase-type theory, where complicated and tedious calculations are circumvented by the use of matrices. The application of phase-type theory consists of describing the stochastic model as a Markov model by appropriately setting up a state space and calculating the corresponding intensity and reward matrices. Formulae of interest are then expressed in terms of these aforementioned matrices. We illustrate this by a few examples calculating the mean, variance and even higher order moments of the site frequency spectrum in the multiple merger coalescent models, and by analysing the mean and variance for the number of segregating sites for multiple samples in the two-locus ancestral recombination graph. We believe that phase-type theory has great potential as a tool for analysing probability models in population genetics. The compact matrix notation is useful for clarification of current models, in particular their formal manipulation (calculation), but also for further development or extensions.
△ Less
Submitted 4 June, 2018;
originally announced June 2018.
-
The Nested Kingman Coalescent: Speed of Coming Down from Infinity
Authors:
Airam Blancas Benítez,
Tim Rogers,
Jason Schweinsberg,
Arno Siri-Jégousse
Abstract:
The nested Kingman coalescent describes the ancestral tree of a population undergoing neutral evolution at the level of individuals and at the level of species, simultaneously. We study the speed at which the number of lineages descends from infinity in this hierarchical coalescent process and prove the existence of an early-time phase during which the number of lineages at time $t$ decays as…
▽ More
The nested Kingman coalescent describes the ancestral tree of a population undergoing neutral evolution at the level of individuals and at the level of species, simultaneously. We study the speed at which the number of lineages descends from infinity in this hierarchical coalescent process and prove the existence of an early-time phase during which the number of lineages at time $t$ decays as $ 2γ/ct^2$, where $c$ is the ratio of the coalescence rates at the individual and species levels, and the constant $γ\approx 3.45$ is derived from a recursive distributional equation for the number of lineages contained within a species at a typical time.
△ Less
Submitted 23 March, 2018;
originally announced March 2018.
-
Trees within trees: Simple nested coalescents
Authors:
Airam Blancas,
Jean-Jil Duchamps,
Amaury Lambert,
Arno Siri-Jégousse
Abstract:
We consider the compact space of pairs of nested partitions of $\mathbb N$, where by analogy with models used in molecular evolution, we call "gene partition" the finer partition and "species partition" the coarser one. We introduce the class of nondecreasing processes valued in nested partitions, assumed Markovian and with exchangeable semigroup. These processes are said simple when each partitio…
▽ More
We consider the compact space of pairs of nested partitions of $\mathbb N$, where by analogy with models used in molecular evolution, we call "gene partition" the finer partition and "species partition" the coarser one. We introduce the class of nondecreasing processes valued in nested partitions, assumed Markovian and with exchangeable semigroup. These processes are said simple when each partition only undergoes one coalescence event at a time (but possibly the same time). Simple nested exchangeable coalescent (SNEC) processes can be seen as the extension of $Λ$-coalescents to nested partitions. We characterize the law of SNEC processes as follows. In the absence of gene coalescences, species blocks undergo $Λ$-coalescent type events and in the absence of species coalescences, gene blocks lying in the same species block undergo i.i.d. $Λ$-coalescents. Simultaneous coalescence of the gene and species partitions are governed by an intensity measure $ν_s$ on $(0,1]\times {\mathcal M}_1 ([0,1])$ providing the frequency of species merging and the law in which are drawn (independently) the frequencies of genes merging in each coalescing species block. As an application, we also study the conditions under which a SNEC process comes down from infinity.
△ Less
Submitted 25 September, 2018; v1 submitted 6 March, 2018;
originally announced March 2018.
-
Refracted Continuous-State Branching Processes: Self-regulating populations
Authors:
Antonio Murillo-Salas,
José Luis Pérez,
Arno Siri-Jégousse
Abstract:
We construct a modified continuous-state branching process whose Malthusian parameter is replaced by another when passing below a certain level. The construction is obtained via a Lamperti-like transform applied to a refracted Lévy process. Infinitesimal generator, probability of vanishing at infinity, of explosion and some path properties are also provided.
We construct a modified continuous-state branching process whose Malthusian parameter is replaced by another when passing below a certain level. The construction is obtained via a Lamperti-like transform applied to a refracted Lévy process. Infinitesimal generator, probability of vanishing at infinity, of explosion and some path properties are also provided.
△ Less
Submitted 29 November, 2016; v1 submitted 12 November, 2015;
originally announced November 2015.
-
Asymptotics of the minimal clade size and related functionals of certain beta-coalescents
Authors:
Arno Siri-Jégousse,
Linglong Yuan
Abstract:
This article shows the asymptotics of distributions of various functionals of the Beta$(2-α,α)$ $n$-coalescent process with $1<α<2$ when $n$ goes to infinity. This process is a Markov process taking {values} in the set of partitions of $\{1, \dots, n\}$, evolving from the intial value $\{1\},\cdots, \{n\}$ by merging (coalescing) blocks together into one and finally reaching the absorbing state…
▽ More
This article shows the asymptotics of distributions of various functionals of the Beta$(2-α,α)$ $n$-coalescent process with $1<α<2$ when $n$ goes to infinity. This process is a Markov process taking {values} in the set of partitions of $\{1, \dots, n\}$, evolving from the intial value $\{1\},\cdots, \{n\}$ by merging (coalescing) blocks together into one and finally reaching the absorbing state $\{1, \dots, n\}$. The minimal clade of $1$ is the block which contains $1$ at the time of coalescence of the singleton $\{1\}$. The limit size of the minimal clade of $1$ is provided. To this, we express it as a function of the coalescence time of $\{1\}$ and sizes of blocks at that time. Another quantity concerning the size of the largest block (at deterministic small time and at the coalescence time of $\{1\}$) is also studied.
△ Less
Submitted 25 March, 2014; v1 submitted 22 November, 2013;
originally announced November 2013.
-
Total internal and external lengths of the Bolthausen-Sznitman coalescent
Authors:
Götz Kersting,
Juan Carlos Pardo,
Arno Siri-Jégousse
Abstract:
In this paper, we study a weak law of large numbers for the total internal length of the Bolthausen-Szmitman coalescent. As a consequence, we obtain the weak limit law of the centered and rescaled total external length. The latter extends results obtained by Dhersin & Möhle \cite{DM12}. An application to population genetics dealing with the total number of mutations in the genealogical tree is als…
▽ More
In this paper, we study a weak law of large numbers for the total internal length of the Bolthausen-Szmitman coalescent. As a consequence, we obtain the weak limit law of the centered and rescaled total external length. The latter extends results obtained by Dhersin & Möhle \cite{DM12}. An application to population genetics dealing with the total number of mutations in the genealogical tree is also given.
△ Less
Submitted 6 February, 2013;
originally announced February 2013.
-
Minimal clade size in the Bolthausen-Sznitman coalescent
Authors:
Fabian Freund,
Arno Siri-Jégousse
Abstract:
This article shows the asymptotics of distribution and moments of the size $X_n$ of the minimal clade of a randomly chosen individual in a Bolthausen-Sznitman $n$-coalescent for $n\to\infty$. The Bolthausen-Sznitman $n$-coalescent is a Markov process taking states in the set of partitions of $\left\{1,\ldots,n\right\}$, where $1,\ldots,n$ are referred to as individuals. The minimal clade of an ind…
▽ More
This article shows the asymptotics of distribution and moments of the size $X_n$ of the minimal clade of a randomly chosen individual in a Bolthausen-Sznitman $n$-coalescent for $n\to\infty$. The Bolthausen-Sznitman $n$-coalescent is a Markov process taking states in the set of partitions of $\left\{1,\ldots,n\right\}$, where $1,\ldots,n$ are referred to as individuals. The minimal clade of an individual is the equivalence class the individual is in at the time of the first coalescence event this individual participates in.\\ The main tool used is the connection of the Bolthausen-Sznitman $n$-coalescent with random recursive trees introduced by Goldschmidt and Martin (see \cite{goldschmidtmartin}). This connection shows that $X_n-1$ is distributed as the number $M_n$ of all individuals not in the equivalence class of individual 1 shortly before the time of the last coalescence event. Both functionals are distributed like the size $RT_{n-1}$ of an uniformly chosen table in a standard Chinese restaurant process with $n-1$ customers.We give exact formulae for these distributions.\\ Using the asymptotics of $M_n$ shown by Goldschmidt and Martin in \cite{goldschmidtmartin}, we see $(\log n)^{-1}\log X_n$ converges in distribution to the uniform distribution on [0,1] for $n\to\infty$.\\ We provide the complimentary information that $\frac{\log n}{n^k}E(X_n^k)\to \frac{1}{k}$ for $n\to\infty$, which is also true for $M_n$ and $RT_n$.
△ Less
Submitted 6 March, 2013; v1 submitted 14 January, 2013;
originally announced January 2013.
-
On the length of an external branch in the Beta-coalescent
Authors:
Jean-Stephane Dhersin,
Fabian Freund,
Arno Siri-Jegousse,
Linglong Yuan
Abstract:
In this paper, we consider Beta$(2-α,α)$ (with $1<α<2$) and related $Λ$-coalescents. If $T^{(n)}$ denotes the length of an external branch of the $n$-coalescent, we prove the convergence of $n^{α-1}T^{(n)}$ when $n$ tends to $ \infty $, and give the limit. To this aim, we give asymptotics for the number $σ^{(n)}$ of collisions which occur in the $n$-coalescent until the end of the chosen external…
▽ More
In this paper, we consider Beta$(2-α,α)$ (with $1<α<2$) and related $Λ$-coalescents. If $T^{(n)}$ denotes the length of an external branch of the $n$-coalescent, we prove the convergence of $n^{α-1}T^{(n)}$ when $n$ tends to $ \infty $, and give the limit. To this aim, we give asymptotics for the number $σ^{(n)}$ of collisions which occur in the $n$-coalescent until the end of the chosen external branch, and for the block counting process associated with the $n$-coalescent.
△ Less
Submitted 19 January, 2012;
originally announced January 2012.
-
Asymptotic results on the length of coalescent trees
Authors:
Jean-François Delmas,
Jean-Stéphane Dhersin,
Arno Siri-Jegousse
Abstract:
We give the asymptotic distribution of the length of partial coalescent trees for Beta and related coalescents. This allows us to give the asymptotic distribution of the number of (neutral) mutations in the partial tree. This is a first step to study the asymptotic distribution of a natural estimator of DNA mutation rate for species with large families.
We give the asymptotic distribution of the length of partial coalescent trees for Beta and related coalescents. This allows us to give the asymptotic distribution of the number of (neutral) mutations in the partial tree. This is a first step to study the asymptotic distribution of a natural estimator of DNA mutation rate for species with large families.
△ Less
Submitted 1 June, 2007;
originally announced June 2007.