Skip to main content

Showing 1–23 of 23 results for author: Lemey, P

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2402.11657  [pdf, other

    q-bio.PE q-bio.GN q-bio.QM

    On the importance of assessing topological convergence in Bayesian phylogenetic inference

    Authors: Marius Brusselmans, Luiz Max Carvalho, Samuel L. Hong, Jiansi Gao, Frederick A. Matsen IV, Andrew Rambaut, Philippe Lemey, Marc A. Suchard, Gytis Dudas, Guy Baele

    Abstract: Modern phylogenetics research is often performed within a Bayesian framework, using sampling algorithms such as Markov chain Monte Carlo (MCMC) to approximate the posterior distribution. These algorithms require careful evaluation of the quality of the generated samples. Within the field of phylogenetics, one frequently adopted diagnostic approach is to evaluate the effective sample size (ESS) and… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

  2. arXiv:2303.13642  [pdf, other

    q-bio.PE stat.CO

    Random-effects substitution models for phylogenetics via scalable gradient approximations

    Authors: Andrew F. Magee, Andrew J. Holbrook, Jonathan E. Pekar, Itzue W. Caviedes-Solis, Fredrick A. Matsen IV, Guy Baele, Joel O. Wertheim, Xiang Ji, Philippe Lemey, Marc A. Suchard

    Abstract: Phylogenetic and discrete-trait evolutionary inference depend heavily on an appropriate characterization of the underlying character substitution process. In this paper, we present random-effects substitution models that extend common continuous-time Markov chain models into a richer class of processes capable of capturing a wider variety of substitution dynamics. As these random-effects substitut… ▽ More

    Submitted 25 September, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

  3. arXiv:2303.04390  [pdf, other

    stat.CO q-bio.PE

    Many-core algorithms for high-dimensional gradients on phylogenetic trees

    Authors: Karthik Gangavarapu, Xiang Ji, Guy Baele, Mathieu Fourment, Philippe Lemey, Frederick A. Matsen IV, Marc A. Suchard

    Abstract: The rapid growth in genomic pathogen data spurs the need for efficient inference techniques, such as Hamiltonian Monte Carlo (HMC) in a Bayesian framework, to estimate parameters of these phylogenetic models where the dimensions of the parameters increase with the number of sequences $N$. HMC requires repeated calculation of the gradient of the data log-likelihood with respect to (wrt) all branch-… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

  4. arXiv:2201.07291  [pdf, other

    stat.ME q-bio.PE stat.CO

    Accelerating Bayesian inference of dependency between complex biological traits

    Authors: Zhenyu Zhang, Akihiko Nishimura, Nídia S. Trovão, Joshua L. Cherry, Andrew J. Holbrook, Xiang Ji, Philippe Lemey, Marc A. Suchard

    Abstract: Inferring dependencies between complex biological traits while accounting for evolutionary relationships between specimens is of great scientific interest yet remains infeasible when trait and specimen counts grow large. The state-of-the-art approach uses a phylogenetic multivariate probit model to accommodate binary and continuous traits via a latent variable framework, and utilizes an efficient… ▽ More

    Submitted 7 September, 2022; v1 submitted 18 January, 2022; originally announced January 2022.

    Comments: 39 pages, 5 figures, 3 tables

  5. arXiv:2110.13298  [pdf, other

    q-bio.PE stat.CO

    Scalable Bayesian divergence time estimation with ratio transformations

    Authors: Xiang Ji, Alexander A. Fisher, Shuo Su, Jeffrey L. Thorne, Barney Potter, Philippe Lemey, Guy Baele, Marc A. Suchard

    Abstract: Divergence time estimation is crucial to provide temporal signals for dating biologically important events, from species divergence to viral transmissions in space and time. With the advent of high-throughput sequencing, recent Bayesian phylogenetic studies have analyzed hundreds to thousands of sequences. Such large-scale analyses challenge divergence time reconstruction by requiring inference on… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: 34 pages, 6 figures

  6. arXiv:2107.01246  [pdf, other

    q-bio.PE stat.AP stat.ME

    Principled, practical, flexible, fast: a new approach to phylogenetic factor analysis

    Authors: Gabriel W. Hassler, Brigida Gallone, Leandro Aristide, William L. Allen, Max R. Tolkoff, Andrew J. Holbrook, Guy Baele, Philippe Lemey, Marc A. Suchard

    Abstract: Biological phenotypes are products of complex evolutionary processes in which selective forces influence multiple biological trait measurements in unknown ways. Phylogenetic factor analysis disentangles these relationships across the evolutionary history of a group of organisms. Scientists seeking to employ this modeling framework confront numerous modeling and implementation decisions, the detail… ▽ More

    Submitted 2 July, 2021; originally announced July 2021.

    Comments: 27 pages, 7 figures, 1 table

  7. arXiv:2105.07119  [pdf, other

    stat.ME q-bio.PE

    Shrinkage-based random local clocks with scalable inference

    Authors: Alexander A. Fisher, Xiang Ji, Akihiko Nishimura, Philippe Lemey, Marc A. Suchard

    Abstract: Local clock models propose that the rate of molecular evolution is constant within phylogenetic sub-trees. Current local clock inference procedures scale poorly to large taxa problems, impose model misspecification, or require a priori knowledge of the existence and location of clocks. To overcome these challenges, we present an autocorrelated, Bayesian model of heritable clock rate evolution that… ▽ More

    Submitted 14 May, 2021; originally announced May 2021.

    Comments: 24 pages, 6 figures

  8. arXiv:2003.10336  [pdf, other

    stat.AP q-bio.PE

    Efficient Bayesian Inference of General Gaussian Models on Large Phylogenetic Trees

    Authors: Paul Bastide, Lam Si Tung Ho, Guy Baele, Philippe Lemey, Marc A Suchard

    Abstract: Phylogenetic comparative methods correct for shared evolutionary history among a set of non-independent organisms by modeling sample traits as arising from a diffusion process along on the branches of a possibly unknown history. To incorporate such uncertainty, we present a scalable Bayesian inference framework under a general Gaussian trait evolution model that exploits Hamiltonian Monte Carlo (H… ▽ More

    Submitted 29 September, 2020; v1 submitted 23 March, 2020; originally announced March 2020.

  9. arXiv:2002.00245  [pdf, other

    q-bio.PE stat.ME

    Online Bayesian phylodynamic inference in BEAST with application to epidemic reconstruction

    Authors: Mandev S. Gill, Philippe Lemey, Marc A. Suchard, Andrew Rambaut, Guy Baele

    Abstract: Reconstructing pathogen dynamics from genetic data as they become available during an outbreak or epidemic represents an important statistical scenario in which observations arrive sequentially in time and one is interested in performing inference in an 'online' fashion. Widely-used Bayesian phylogenetic inference packages are not set up for this purpose, generally requiring one to recompute trees… ▽ More

    Submitted 1 February, 2020; originally announced February 2020.

    Comments: 20 pages, 3 figures

  10. arXiv:1912.09185  [pdf, other

    stat.ME q-bio.PE stat.CO

    Large-scale inference of correlation among mixed-type biological traits with phylogenetic multivariate probit models

    Authors: Zhenyu Zhang, Akihiko Nishimura, Paul Bastide, Xiang Ji, Rebecca P. Payne, Philip Goulder, Philippe Lemey, Marc A. Suchard

    Abstract: Inferring concerted changes among biological traits along an evolutionary history remains an important yet challenging problem. Besides adjusting for spurious correlation induced from the shared history, the task also requires sufficient flexibility and computational efficiency to incorporate multiple continuous and discrete traits as data size increases. To accomplish this, we jointly model mixed… ▽ More

    Submitted 23 September, 2020; v1 submitted 19 December, 2019; originally announced December 2019.

    Comments: 24 pages, 6 figures, 2 tables. Version accepted by Annals of Applied Statistics

  11. arXiv:1906.05136  [pdf, other

    q-bio.PE stat.ME

    Markov-modulated continuous-time Markov chains to identify site- and branch-specific evolutionary variation

    Authors: Guy Baele, Mandev S. Gill, Philippe Lemey, Marc A. Suchard

    Abstract: Markov models of character substitution on phylogenies form the foundation of phylogenetic inference frameworks. Early models made the simplifying assumption that the substitution process is homogeneous over time and across sites in the molecular sequence alignment. While standard practice adopts extensions that accommodate heterogeneity of substitution rates across sites, heterogeneity in the pro… ▽ More

    Submitted 12 June, 2019; originally announced June 2019.

    Comments: 30 pages, 8 figures

  12. arXiv:1906.04834  [pdf, other

    q-bio.PE stat.ME

    Relaxed random walks at scale

    Authors: Alexander A. Fisher, Xiang Ji, Philippe Lemey, Marc A. Suchard

    Abstract: Relaxed random walk (RRW) models of trait evolution introduce branch-specific rate multipliers to modulate the variance of a standard Brownian diffusion process along a phylogeny and more accurately model overdispersed biological data. Increased taxonomic sampling challenges inference under RRWs as the number of unknown parameters grows with the number of taxa. To solve this problem, we present a… ▽ More

    Submitted 14 November, 2019; v1 submitted 11 June, 2019; originally announced June 2019.

    Comments: 18 pages, 4 figures

  13. arXiv:1905.12146  [pdf, other

    stat.CO q-bio.PE stat.ME

    Gradients do grow on trees: a linear-time ${\cal O}\hspace{-0.2em}\left( N \right)$-dimensional gradient for statistical phylogenetics

    Authors: Xiang Ji, Zhenyu Zhang, Andrew Holbrook, Akihiko Nishimura, Guy Baele, Andrew Rambaut, Philippe Lemey, Marc A. Suchard

    Abstract: Calculation of the log-likelihood stands as the computational bottleneck for many statistical phylogenetic algorithms. Even worse is its gradient evaluation, often used to target regions of high probability. Order ${\cal O}\hspace{-0.2em}\left( N \right)$-dimensional gradient calculations based on the standard pruning algorithm require ${\cal O}\hspace{-0.2em}\left( N^2 \right)$ operations where N… ▽ More

    Submitted 28 May, 2019; originally announced May 2019.

  14. arXiv:1711.06299  [pdf, ps, other

    cs.LG cs.AI q-bio.PE

    Bayesian Best-Arm Identification for Selecting Influenza Mitigation Strategies

    Authors: Pieter Libin, Timothy Verstraeten, Diederik M. Roijers, Jelena Grujic, Kristof Theys, Philippe Lemey, Ann Nowé

    Abstract: Pandemic influenza has the epidemic potential to kill millions of people. While various preventive measures exist (i.a., vaccination and school closures), deciding on strategies that lead to their most effective and efficient use remains challenging. To this end, individual-based epidemiological models are essential to assist decision makers in determining the best strategy to curb epidemic spread… ▽ More

    Submitted 15 June, 2018; v1 submitted 16 November, 2017; originally announced November 2017.

  15. arXiv:1601.05078  [pdf, other

    stat.ME q-bio.PE

    Understanding Past Population Dynamics: Bayesian Coalescent-Based Modeling with Covariates

    Authors: Mandev S. Gill, Philippe Lemey, Shannon N. Bennett, Roman Biek, Marc A. Suchard

    Abstract: Effective population size characterizes the genetic variability in a population and is a parameter of paramount importance in population genetics. Kingman's coalescent process enables inference of past population dynamics directly from molecular sequence data, and researchers have developed a number of flexible coalescent-based models for Bayesian nonparametric estimation of the effective populati… ▽ More

    Submitted 19 January, 2016; originally announced January 2016.

    Comments: 31 pages, 6 figures

  16. arXiv:1512.07948  [pdf, other

    q-bio.PE stat.ME

    A Relaxed Drift Diffusion Model for Phylogenetic Trait Evolution

    Authors: Mandev S. Gill, Lam Si Tung Ho, Guy Baele, Philippe Lemey, Marc A. Suchard

    Abstract: Understanding the processes that give rise to quantitative measurements associated with molecular sequence data remains an important issue in statistical phylogenetics. Examples of such measurements include geographic coordinates in the context of phylogeography and phenotypic traits in the context of comparative studies. A popular approach is to model the evolution of continuously varying traits… ▽ More

    Submitted 29 December, 2015; v1 submitted 24 December, 2015; originally announced December 2015.

    Comments: 35 pages, 3 figures, 5 tables. Changed from double-spaced to single-spaced

  17. arXiv:1505.01105  [pdf, other

    q-bio.PE

    Spatio-temporal Dynamics of Foot-and-Mouth Disease Virus in South America

    Authors: Luiz Max Carvalho, Nuno Rodrigues Faria, Andres M. Perez, Marc A. Suchard, Philippe Lemey, Waldemir de Castro Silveira, Andrew Rambaut, Guy Baele

    Abstract: Although foot-and-mouth disease virus (FMDV) incidence has decreased in South America over the last years, the pathogen still circulates in the region and the risk of re-emergence in previously FMDV-free areas is a veterinary public health concern. In this paper we merge environmental, epidemiological and genetic data to reconstruct spatiotemporal patterns and determinants of FMDV serotypes A and… ▽ More

    Submitted 1 June, 2015; v1 submitted 5 May, 2015; originally announced May 2015.

    Comments: 21 pages, 5 figures, sumitted to Virus Evolution (http://ve.oxfordjournals.org/). Updated affiliations list

  18. arXiv:1410.1263  [pdf, other

    stat.ME q-bio.GN q-bio.PE

    Synonymous and Nonsynonymous Distances Help Untangle Convergent Evolution and Recombination

    Authors: Peter B. Chi, Sujay Chattopadhyay, Philippe Lemey, Evgeni V. Sokurenko, Vladimir N. Minin

    Abstract: When estimating a phylogeny from a multiple sequence alignment, researchers often assume the absence of recombination. However, if recombination is present, then tree estimation and all downstream analyses will be impacted, because different segments of the sequence alignment support different phylogenies. Similarly, convergent selective pressures at the molecular level can also lead to phylogenet… ▽ More

    Submitted 7 October, 2014; v1 submitted 6 October, 2014; originally announced October 2014.

    Comments: 21 pages, 8 figures, updated abstract

  19. arXiv:1406.3863  [pdf, ps, other

    q-bio.PE stat.AP stat.ME

    Assessing phenotypic correlation through the multivariate phylogenetic latent liability model

    Authors: Gabriela B. Cybis, Janet S. Sinsheimer, Trevor Bedford, Alison E. Mather, Philippe Lemey, Marc A. Suchard

    Abstract: Understanding which phenotypic traits are consistently correlated throughout evolution is a highly pertinent problem in modern evolutionary biology. Here, we propose a multivariate phylogenetic latent liability model for assessing the correlation between multiple types of data, while simultaneously controlling for their unknown shared evolutionary history informed through molecular sequences. The… ▽ More

    Submitted 16 September, 2015; v1 submitted 15 June, 2014; originally announced June 2014.

    Comments: Published at http://dx.doi.org/10.1214/15-AOAS821 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS821

    Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 2, 969-991

  20. arXiv:1312.4699  [pdf, other

    q-bio.PE

    piBUSS: a parallel BEAST/BEAGLE utility for sequence simulation under complex evolutionary scenarios

    Authors: Filip Bielejec, Philippe Lemey, Luiz Max Carvalho, Guy Baele, Andrew Rambaut, Marc A. Suchard

    Abstract: Background: Simulated nucleotide or amino acid sequences are frequently used to assess the performance of phylogenetic reconstruction methods. BEAST, a Bayesian statistical framework that focuses on reconstructing time-calibrated molecular evolutionary processes, supports a wide array of evolutionary models, but lacked matching machinery for simulation of character evolution along phylogenies. R… ▽ More

    Submitted 17 December, 2013; originally announced December 2013.

    Comments: 13 pages, 2 figures, 1 table

  21. arXiv:1309.3075  [pdf, other

    q-bio.PE stat.CO

    Inferring Heterogeneous Evolutionary Processes Through Time: from sequence substitution to phylogeography

    Authors: Filip Bielejec, Philippe Lemey, Guy Baele, Andrew Rambaut, Marc A Suchard

    Abstract: Molecular phylogenetic and phylogeographic reconstructions generally assume time-homogeneous substitution processes. Motivated by computational convenience, this assumption sacrifices biological realism and offers little opportunity to uncover the temporal dynamics in evolutionary histories. Here, we extend and generalize an evolutionary approach that relaxes the time-homogeneous process assumptio… ▽ More

    Submitted 12 September, 2013; originally announced September 2013.

    Comments: 30 pages, 6 figure, 3 tables

  22. arXiv:1304.3637  [pdf, other

    q-bio.PE q-bio.QM stat.AP

    Integrating influenza antigenic dynamics with molecular evolution

    Authors: Trevor Bedford, Marc A. Suchard, Philippe Lemey, Gytis Dudas, Victoria Gregory, Alan J. Hay, John W. McCauley, Colin A. Russell, Derek J. Smith, Andrew Rambaut

    Abstract: Influenza viruses undergo continual antigenic evolution allowing mutant viruses to evade host immunity acquired to previous virus strains. Antigenic phenotype is often assessed through pairwise measurement of cross-reactivity between influenza strains using the hemagglutination inhibition (HI) assay. Here, we extend previous approaches to antigenic cartography, and simultaneously characterize anti… ▽ More

    Submitted 19 December, 2013; v1 submitted 12 April, 2013; originally announced April 2013.

    Comments: 32 pages, 13 figures, 2 tables

  23. arXiv:1210.5877  [pdf, other

    q-bio.PE

    The seasonal flight of influenza: a unified framework for spatiotemporal hypothesis testing

    Authors: Philippe Lemey, Andrew Rambaut, Trevor Bedford, Nuno R. Faria, Filip Bielejec, Guy Baele, Colin A. Russell, Derek J. Smith, Oliver G. Pybus, Dirk Brockmann, Marc A. Suchard

    Abstract: Global mobility flow data are at the heart of spatial epidemiological models used to predict infectious disease behavior but this wealth of data on human mobility has been largely neglected by reconstructions of pathogen evolutionary dynamics using viral genetic data. Although stochastic models of viral evolution may potentially be informed by such data, a major challenge lies in deciding which mo… ▽ More

    Submitted 22 October, 2012; originally announced October 2012.

    Comments: 16 pages, 4 figures, 3 tables