Skip to main content

Showing 1–31 of 31 results for author: Jenkins, P A

.
  1. arXiv:2406.16465  [pdf, ps, other

    math.PR q-bio.PE stat.CO

    Genealogical processes of non-neutral population models under rapid mutation

    Authors: Jere Koskela, Paul A. Jenkins, Adam M. Johansen, Dario Spano

    Abstract: We show that genealogical trees arising from a broad class of non-neutral models of population evolution converge to the Kingman coalescent under a suitable rescaling of time. As well as non-neutral biological evolution, our results apply to genetic algorithms encompassing the prominent class of sequential Monte Carlo (SMC) methods. The time rescaling we need differs slightly from that used in cla… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    MSC Class: 60J90; 65C35; 92D15

  2. arXiv:2312.17406  [pdf, ps, other

    math.PR q-bio.PE

    Sampling probabilities, diffusions, ancestral graphs, and duality under strong selection

    Authors: Martina Favero, Paul A. Jenkins

    Abstract: Wright-Fisher diffusions and their dual ancestral graphs occupy a central role in the study of allele frequency change and genealogical structure, and they provide expressions, explicit in some special cases but generally implicit, for the sampling probability, a crucial quantity in inference. Under a finite-allele mutation model, with possibly parent-dependent mutation, we consider the asymptotic… ▽ More

    Submitted 21 February, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: 41 pages

  3. arXiv:2309.16271  [pdf, other

    math.PR

    Excursion theory for the Wright-Fisher diffusion

    Authors: Paul A. Jenkins, Jere Koskela, Jaromir Sant, Dario Spano, Ivana Valentic

    Abstract: In this work, we develop excursion theory for the Wright-Fisher diffusion with recurrent mutation. Our construction is intermediate between the classical excursion theory where all excursions begin and end at a single point and the more general approach considering excursions of processes from general sets. Since the Wright-Fisher diffusion has two boundary points, it is natural to construct excur… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 19 pages, 3 figures

    MSC Class: 60J70; 60J60; 92D25; 60J55

  4. arXiv:2301.05459  [pdf, ps, other

    q-bio.PE math.PR q-bio.QM stat.AP stat.CO

    EWF : simulating exact paths of the Wright--Fisher diffusion

    Authors: Jaromir Sant, Paul A. Jenkins, Jere Koskela, Dario Spanò

    Abstract: The Wright--Fisher diffusion is important in population genetics in modelling the evolution of allele frequencies over time subject to the influence of biological phenomena such as selection, mutation, and genetic drift. Simulating paths of the process is challenging due to the form of the transition density. We present EWF, a robust and efficient sampler which returns exact draws for the diffusio… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

    Comments: 34 pages, 12 figures

    MSC Class: 92D25; 60J70; 65C99; 60J60

  5. arXiv:2212.07747  [pdf, other

    q-bio.PE math.PR math.ST

    An estimator for the recombination rate from a continuously observed diffusion of haplotype frequencies

    Authors: Robert C. Griffiths, Paul A. Jenkins

    Abstract: Recombination is a fundamental evolutionary force, but it is difficult to quantify because the effect of a recombination event on patterns of variation in a sample of genetic data can be hard to discern. Estimators for the recombination rate, which are usually based on the idea of integrating over the unobserved possible evolutionary histories of a sample, can therefore be noisy. Here we consider… ▽ More

    Submitted 4 May, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: 28 pages, 3 figures

    MSC Class: 92D15 (Primary) 62M05 (Secondary)

  6. arXiv:2110.05356  [pdf, ps, other

    math.PR q-bio.PE stat.CO

    Weak Convergence of Non-neutral Genealogies to Kingman's Coalescent

    Authors: Suzie Brown, Paul A. Jenkins, Adam M. Johansen, Jere Koskela

    Abstract: Interacting particle systems undergoing repeated mutation and selection steps model genetic evolution, and also describe a broad class of sequential Monte Carlo methods. The genealogical tree embedded into the system is important in both applications. Under neutrality, when fitnesses of particles are independent from those of their parents, rescaled genealogies are known to converge to Kingman's c… ▽ More

    Submitted 19 April, 2023; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: 37 pages, 1 figure

    MSC Class: 60J90; 65C35; 92D15

  7. arXiv:2106.05820  [pdf, other

    stat.ME stat.AP stat.CO

    Flexible Bayesian inference for diffusion processes using splines

    Authors: Paul A. Jenkins, Murray Pollock, Gareth O. Roberts

    Abstract: We introduce a flexible method to simultaneously infer both the drift and volatility functions of a discretely observed scalar diffusion. We introduce spline bases to represent these functions and develop a Markov chain Monte Carlo algorithm to infer, a posteriori, the coefficients of these functions in the spline basis. A key innovation is that we use spline bases to model transformed versions of… ▽ More

    Submitted 29 September, 2023; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: 24 pages, 8 figures

    MSC Class: 65C05; 60H35; 60J60; 62G08; 65D07

  8. arXiv:2012.10316  [pdf, other

    math.PR

    Diffusion Limits at Small Times for Coalescent Processes with Mutation and Selection

    Authors: Philip A. Hanson, Paul A. Jenkins, Jere Koskela, Dario Spanò

    Abstract: The Ancestral Selection Graph (ASG) is an important genealogical process which extends the well-known Kingman coalescent to incorporate natural selection. We show that the number of lineages of the ASG with and without mutation is asymptotic to $2/t$ as $t\to 0$, in agreement with the limiting behaviour of the Kingman coalescent. We couple these processes on the same probability space using a Pois… ▽ More

    Submitted 22 December, 2020; v1 submitted 18 December, 2020; originally announced December 2020.

    Comments: 22 pages, 1 figure

    MSC Class: Primary 60J90; 60F05; secondary 60J80

  9. KwARG: Parsimonious reconstruction of ancestral recombination graphs with recurrent mutation

    Authors: Anastasia Ignatieva, Rune B. Lyngsø, Paul A. Jenkins, Jotun Hein

    Abstract: The reconstruction of possible histories given a sample of genetic data in the presence of recombination and recurrent mutation is a challenging problem, but can provide key insights into the evolution of a population. We present KwARG, which implements a parsimony-based greedy heuristic algorithm for finding plausible genealogical histories (ancestral recombination graphs) that are minimal or nea… ▽ More

    Submitted 13 May, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: 18 pages, 12 figures; accepted for publication in Bioinformatics

  10. arXiv:2009.10440  [pdf, other

    stat.CO math.PR

    The computational cost of blocking for sampling discretely observed diffusions

    Authors: Marcin Mider, Paul A. Jenkins, Murray Pollock, Gareth O. Roberts

    Abstract: Many approaches for conducting Bayesian inference on discretely observed diffusions involve imputing diffusion bridges between observations. This can be computationally challenging in settings in which the temporal horizon between subsequent observations is large, due to the poor scaling of algorithms for simulating bridges as observation distance increases. It is common in practical settings to u… ▽ More

    Submitted 6 April, 2022; v1 submitted 22 September, 2020; originally announced September 2020.

    Comments: 15 pages, 3 figures

    MSC Class: 60H35 (Primary); 60J22; 60J60; 60J65 (Secondary)

  11. arXiv:2007.00096  [pdf, ps, other

    stat.CO math.PR stat.ME

    Simple conditions for convergence of sequential Monte Carlo genealogies with applications

    Authors: Suzie Brown, Paul A. Jenkins, Adam M. Johansen, Jere Koskela

    Abstract: We present simple conditions under which the limiting genealogical process associated with a class of interacting particle systems with non-neutral selection mechanisms, as the number of particles grows, is a time-rescaled Kingman coalescent. Sequential Monte Carlo algorithms are popular methods for approximating integrals in problems such as non-linear filtering and smoothing which employ this ty… ▽ More

    Submitted 7 December, 2020; v1 submitted 30 June, 2020; originally announced July 2020.

    Comments: 22 pages, 1 figure

    MSC Class: 60J90; 60J95; 65C05; 65C35

  12. arXiv:2001.03527  [pdf, ps, other

    math.PR math.ST q-bio.PE

    Convergence of Likelihood Ratios and Estimators for Selection in non-neutral Wright-Fisher Diffusions

    Authors: Jaromir Sant, Paul A. Jenkins, Jere Koskela, Dario Spano

    Abstract: A number of discrete time, finite population size models in genetics describing the dynamics of allele frequencies are known to converge (subject to suitable scaling) to a diffusion process in the infinite population limit, termed the Wright-Fisher diffusion. In this article we show that the diffusion is ergodic uniformly in the selection and mutation parameters, and that the measures induced by t… ▽ More

    Submitted 13 September, 2021; v1 submitted 10 January, 2020; originally announced January 2020.

    MSC Class: 92D10 60J60 60J70

  13. arXiv:1912.04861  [pdf, other

    q-bio.PE math.PR

    A characterisation of the reconstructed birth-death process through time rescaling

    Authors: Anastasia Ignatieva, Jotun Hein, Paul A. Jenkins

    Abstract: The dynamics of a population exhibiting exponential growth can be modelled as a birth-death process, which naturally captures the stochastic variation in population size over time. In this article, we consider a supercritical birth-death process, started at a random time in the past, and conditioned to have n sampled individuals at the present. The genealogy of individuals sampled at the present t… ▽ More

    Submitted 6 May, 2020; v1 submitted 10 December, 2019; originally announced December 2019.

    Comments: 32 pages, 5 figures

  14. arXiv:1903.10184  [pdf, other

    stat.ME stat.CO

    Simulating bridges using confluent diffusions

    Authors: Paul A. Jenkins, Murray Pollock, Gareth O. Roberts, Michael Sørensen

    Abstract: Diffusions are a fundamental class of models in many fields, including finance, engineering, and biology. Simulating diffusions is challenging as their sample paths are infinite-dimensional and their transition functions are typically intractable. In statistical settings such as parameter inference for discretely observed diffusions, we require simulation techniques for diffusions conditioned on h… ▽ More

    Submitted 10 June, 2021; v1 submitted 25 March, 2019; originally announced March 2019.

    Comments: Significant revision of prior submission, with an improved methodology which is far broader in its applicability. Updated author listing. 19 pages, 5 figures

    MSC Class: 65C05; 60H35; 60J60

  15. arXiv:1804.07065  [pdf, other

    stat.ME math.PR math.ST

    Bayesian nonparametric analysis of Kingman's coalescent

    Authors: Stefano Favaro, Shui Feng, Paul A. Jenkins

    Abstract: Kingman's coalescent is one of the most popular models in population genetics. It describes the genealogy of a population whose genetic composition evolves in time according to the Wright-Fisher model, or suitable approximations of it belonging to the broad class of Fleming-Viot processes. Ancestral inference under Kingman's coalescent has had much attention in the literature, both in practical da… ▽ More

    Submitted 19 April, 2018; originally announced April 2018.

    Comments: 37 pages, 2 figures. To appear in Annales de l'Institut Henri Poincaré - Probabilités et Statistiques

    MSC Class: 62C10 (Primary) 62M05 (Secondary)

  16. arXiv:1804.01811  [pdf, other

    math.ST q-bio.PE stat.CO

    Asymptotic genealogies of interacting particle systems with an application to sequential Monte Carlo

    Authors: Jere Koskela, Paul A. Jenkins, Adam M. Johansen, Dario Spano

    Abstract: We study weighted particle systems in which new generations are resampled from current particles with probabilities proportional to their weights. This covers a broad class of sequential Monte Carlo (SMC) methods, widely-used in applied statistics and cognate disciplines. We consider the genealogical tree embedded into such particle systems, and identify conditions, as well as an appropriate time-… ▽ More

    Submitted 16 July, 2021; v1 submitted 5 April, 2018; originally announced April 2018.

    Comments: 28 pages, 1 figure. An earlier version of this manuscript contained an error, which we have been able to correct and in so doing give a stronger result under cleaner conditions. v7: Added several technical lemmas which make the overall argument more explicit

    MSC Class: Primary 60E15; secondary 60G99; 62E20

    Journal ref: Annals of Statistics 48(1):560-583, 2020

  17. arXiv:1802.06153  [pdf, other

    cs.LG q-bio.PE stat.ML

    A Likelihood-Free Inference Framework for Population Genetic Data using Exchangeable Neural Networks

    Authors: Jeffrey Chan, Valerio Perrone, Jeffrey P. Spence, Paul A. Jenkins, Sara Mathieson, Yun S. Song

    Abstract: An explosion of high-throughput DNA sequencing in the past decade has led to a surge of interest in population-scale inference with whole-genome data. Recent work in population genetics has centered on designing inference methods for relatively simple model classes, and few scalable general-purpose inference techniques exist for more realistic, complex models. To achieve this, two inferential chal… ▽ More

    Submitted 5 November, 2018; v1 submitted 16 February, 2018; originally announced February 2018.

    Comments: 9 pages, 8 figures

  18. arXiv:1703.00208  [pdf, other

    math.PR

    Wright-Fisher diffusion bridges

    Authors: Robert Griffiths, Paul A. Jenkins, Dario Spanò

    Abstract: {\bf Abstract} The trajectory of the frequency of an allele which begins at $x$ at time $0$ and is known to have frequency $z$ at time $T$ can be modelled by the bridge process of the Wright-Fisher diffusion. Bridges when $x=z=0$ are particularly interesting because they model the trajectory of the frequency of an allele which appears at a time, then is lost by random drift or mutation after a tim… ▽ More

    Submitted 21 August, 2017; v1 submitted 1 March, 2017; originally announced March 2017.

    MSC Class: 92D15; 60J60; 97K60

  19. arXiv:1612.01872  [pdf, other

    stat.CO math.PR

    Simulation from quasi-stationary distributions on reducible state spaces

    Authors: Adam Griffin, Paul A. Jenkins, Gareth O. Roberts, Simon E. F. Spencer

    Abstract: Quasi-stationary distributions (QSDs)arise from stochastic processes that exhibit transient equilibrium behaviour on the way to absorption QSDs are often mathematically intractable and even drawing samples from them is not straightforward. In this paper the framework of Sequential Monte Carlo samplers is utilized to simulate QSDs and several novel resampling techniques are proposed to accommodate… ▽ More

    Submitted 17 January, 2017; v1 submitted 6 December, 2016; originally announced December 2016.

    Comments: 30 pages, 9 Figures

    MSC Class: 60J27; 62G09

  20. arXiv:1611.07460  [pdf, other

    stat.ML

    Poisson Random Fields for Dynamic Feature Models

    Authors: Valerio Perrone, Paul A. Jenkins, Dario Spano, Yee Whye Teh

    Abstract: We present the Wright-Fisher Indian buffet process (WF-IBP), a probabilistic model for time-dependent data assumed to have been generated by an unknown number of latent features. This model is suitable as a prior in Bayesian nonparametric feature allocation models in which the features underlying the observed data exhibit a dependency structure over time. More specifically, we establish a new fram… ▽ More

    Submitted 22 November, 2016; originally announced November 2016.

  21. A coalescent dual process for a Wright-Fisher diffusion with recombination and its application to haplotype partitioning

    Authors: Robert C. Griffiths, Paul A. Jenkins, Sabin Lessard

    Abstract: Duality plays an important role in population genetics. It can relate results from forwards-in-time models of allele frequency evolution with those of backwards-in-time genealogical models; a well known example is the duality between the Wright-Fisher diffusion for genetic drift and its genealogical counterpart, the coalescent. There have been a number of articles extending this relationship to in… ▽ More

    Submitted 8 August, 2019; v1 submitted 14 April, 2016; originally announced April 2016.

    Comments: This version corrects typographical errors in equations (25), (26), (27), (B.3), (B.4). 39 pages, 3 figures

    Journal ref: Theoretical Population Biology, 112: 126-138 (2016)

  22. arXiv:1603.02834  [pdf, other

    stat.CO math.PR q-bio.PE stat.ME

    Inference and rare event simulation for stopped Markov processes via reverse-time sequential Monte Carlo

    Authors: Jere Koskela, Dario Spano, Paul A. Jenkins

    Abstract: We present a sequential Monte Carlo algorithm for Markov chain trajectories with proposals constructed in reverse time, which is advantageous when paths are conditioned to end in a rare set. The reverse time proposal distribution is constructed by approximating the ratio of Green's functions in Nagasawa's formula. Conditioning arguments can be used to interpret these ratios as low-dimensional cond… ▽ More

    Submitted 2 January, 2017; v1 submitted 9 March, 2016; originally announced March 2016.

    Comments: 21 pages, 6 figures

    MSC Class: Primary: 62M05; Secondary: 60J20; 60J22

    Journal ref: Statistics and Computing 28(1):131-144, 2018

  23. arXiv:1512.00982  [pdf, other

    stat.ME math.PR math.ST q-bio.PE stat.CO

    Bayesian non-parametric inference for $Λ$-coalescents: consistency and a parametric method

    Authors: Jere Koskela, Paul A. Jenkins, Dario Spanò

    Abstract: We investigate Bayesian non-parametric inference of the $Λ$-measure of $Λ$-coalescent processes with recurrent mutation, parametrised by probability measures on the unit interval. We give verifiable criteria on the prior for posterior consistency when observations form a time series, and prove that any non-trivial prior is inconsistent when all observations are contemporaneous. We then show that t… ▽ More

    Submitted 23 January, 2017; v1 submitted 3 December, 2015; originally announced December 2015.

    Comments: 28 pages, 3 figures

    MSC Class: Primary: 62M05; Secondary: 62G05; 92D15

    Journal ref: Bernoulli 24(3):2122-2153, 2018

  24. arXiv:1506.06998  [pdf, other

    stat.ME math.PR q-bio.PE stat.CO

    Exact simulation of the Wright-Fisher diffusion

    Authors: Paul A. Jenkins, Dario Spano

    Abstract: The Wright-Fisher family of diffusion processes is a widely used class of evolutionary models. However, simulation is difficult because there is no known closed-form formula for its transition function. In this article we demonstrate that it is in fact possible to simulate exactly from a broad class of Wright-Fisher diffusion processes and their bridges. For those diffusions corresponding to rever… ▽ More

    Submitted 29 September, 2023; v1 submitted 23 June, 2015; originally announced June 2015.

    Comments: 36 pages, 2 figure, 2 tables. This version corrects minor errors in the statements of Propositions 6 and 7

    Report number: CRiSM Working Paper 14-27 MSC Class: 65C05 (Primary); 60H35; 60J60; 92D15 (Secondary)

    Journal ref: Annals of Applied Probability 27(3):1478-1509 (2017)

  25. arXiv:1506.04709  [pdf, ps, other

    math.ST math.PR

    Consistency of Bayesian nonparametric inference for discretely observed jump diffusions

    Authors: Jere Koskela, Dario Spano, Paul A. Jenkins

    Abstract: We introduce verifiable criteria for weak posterior consistency of identifiable Bayesian nonparametric inference for jump diffusions with unit diffusion coefficient and uniformly Lipschitz drift and jump coefficients in arbitrary dimension. The criteria are expressed in terms of coefficients of the SDEs describing the process, and do not depend on intractable quantities such as transition densitie… ▽ More

    Submitted 14 September, 2018; v1 submitted 15 June, 2015; originally announced June 2015.

    Comments: 20 pages

    MSC Class: 62G20 (Primary) 60J25; 62M05 (Secondary)

    Journal ref: Bernoulli 25(3):2183-2205, 2019

  26. arXiv:1405.6863  [pdf, ps, other

    math.PR q-bio.PE

    Tractable diffusion and coalescent processes for weakly correlated loci

    Authors: Paul A. Jenkins, Paul Fearnhead, Yun S. Song

    Abstract: Widely used models in genetics include the Wright-Fisher diffusion and its moment dual, Kingman's coalescent. Each has a multilocus extension but under neither extension is the sampling distribution available in closed-form, and their computation is extremely difficult. In this paper we derive two new multilocus population genetic models, one a diffusion and the other a coalescent process, which a… ▽ More

    Submitted 4 March, 2015; v1 submitted 27 May, 2014; originally announced May 2014.

    Comments: 34 pages, 1 figure

    MSC Class: 92D15 (Primary) 65C50; 92D10 (Secondary)

    Journal ref: Electronic Journal of Probability, Vol. 20 (2015) Article 58

  27. arXiv:1311.5777  [pdf, other

    stat.ME math.PR stat.CO

    Exact simulation of the sample paths of a diffusion with a finite entrance boundary

    Authors: Paul A. Jenkins

    Abstract: Diffusion processes arise in many fields, and so simulating the path of a diffusion is an important problem. It is usually necessary to make some sort of approximation via model-discretization, but a recently introduced class of algorithms, known as the exact algorithm and based on retrospective rejection sampling ideas, obviate the need for such discretization. In this paper I extend the exact al… ▽ More

    Submitted 22 November, 2013; originally announced November 2013.

    Comments: 19 pages, 1 figure

  28. arXiv:1311.5699  [pdf, other

    math.PR q-bio.PE stat.CO

    Computational inference beyond Kingman's coalescent

    Authors: Jere Koskela, Paul A. Jenkins, Dario Spano

    Abstract: Full likelihood inference under Kingman's coalescent is a computationally challenging problem to which importance sampling (IS) and the product of approximate conditionals (PAC) method have been applied successfully. Both methods can be expressed in terms of families of intractable conditional sampling distributions (CSDs), and rely on principled approximations for accurate inference. Recently, mo… ▽ More

    Submitted 16 December, 2015; v1 submitted 22 November, 2013; originally announced November 2013.

    Comments: 20 pages, 5 figures

    MSC Class: 60G09 (Primary); 92D25; 93E10 (Secondary)

    Journal ref: J. Appl. Probab. 52(2), p. 519-537, 2015

  29. General triallelic frequency spectrum under demographic models with variable population size

    Authors: Paul A. Jenkins, Jonas W. Mueller, Yun S. Song

    Abstract: It is becoming routine to obtain datasets on DNA sequence variation across several thousands of chromosomes, providing unprecedented opportunity to infer the underlying biological and demographic forces. Such data make it vital to study summary statistics which offer enough compression to be tractable, while preserving a great deal of information. One well-studied summary is the site frequency spe… ▽ More

    Submitted 25 November, 2013; v1 submitted 12 October, 2013; originally announced October 2013.

    Comments: 29 pages, 11 figures (main text) + 6 pages, 3 figures (Supporting Information)

    Journal ref: Genetics, Vol. 196, No. 1 (2014) 295-311

  30. arXiv:1107.3897  [pdf, ps, other

    math.PR q-bio.PE

    Padé approximants and exact two-locus sampling distributions

    Authors: Paul A. Jenkins, Yun S. Song

    Abstract: For population genetics models with recombination, obtaining an exact, analytic sampling distribution has remained a challenging open problem for several decades. Recently, a new perspective based on asymptotic series has been introduced to make progress on this problem. Specifically, closed-form expressions have been derived for the first few terms in an asymptotic expansion of the two-locus samp… ▽ More

    Submitted 2 May, 2012; v1 submitted 20 July, 2011; originally announced July 2011.

    Comments: Published in at http://dx.doi.org/10.1214/11-AAP780 the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AAP-AAP780

    Journal ref: Annals of Applied Probability 2012, Vol. 22, No. 2, 576-607

  31. An asymptotic sampling formula for the coalescent with Recombination

    Authors: Paul A. Jenkins, Yun S. Song

    Abstract: Ewens sampling formula (ESF) is a one-parameter family of probability distributions with a number of intriguing combinatorial connections. This elegant closed-form formula first arose in biology as the stationary probability distribution of a sample configuration at one locus under the infinite-alleles model of mutation. Since its discovery in the early 1970s, the ESF has been used in various biol… ▽ More

    Submitted 15 October, 2010; originally announced October 2010.

    Comments: Published in at http://dx.doi.org/10.1214/09-AAP646 the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AAP-AAP646

    Journal ref: Annals of Applied Probability 2010, Vol. 20, No. 3, 1005-1028