Skip to main content

Showing 1–29 of 29 results for author: Sumner, J G

.
  1. The ancient Operational Code is embedded in the amino acid substitution matrix and aaRS phylogenies

    Authors: Julia A. Shore, Barbara R. Holland, Jeremy G. Sumner, Kay Nieselt, Peter R. Wills

    Abstract: The underlying structure of the canonical amino acid substitution matrix (aaSM) is examined by considering stepwise improvements in the differential recognition of amino acids according to their chemical properties during the branching history of the two aminoacyl-tRNA synthetase (aaRS) superfamilies. The evolutionary expansion of the genetic code is described by a simple parameterization of the a… ▽ More

    Submitted 10 December, 2019; originally announced December 2019.

    Journal ref: Journal of molecular evolution, 1-15 (2019)

  2. Systematics and symmetry in molecular phylogenetic modelling: perspectives from physics

    Authors: Peter D Jarvis, Jeremy G Sumner

    Abstract: The aim of this review is to present and analyze the probabilistic models of mathematical phylogenetics which have been intensively used in recent years in biology as the cornerstone of attempts to infer and reconstruct the ancestral relationships between species. We outline the development of theoretical phylogenetics, from the earliest studies based on morphological characters, through to the us… ▽ More

    Submitted 15 September, 2018; v1 submitted 9 September, 2018; originally announced September 2018.

    Comments: 51 pages, LaTeX, 3 figures. Minor clarifications added and typos corrected

  3. arXiv:1804.11249  [pdf, other

    q-bio.PE math.RA math.ST

    The impracticalities of multiplicatively-closed codon models: a retreat to linear alternatives

    Authors: Julia A. Shore, Jeremy G. Sumner, Barbara R. Holland

    Abstract: A matrix Lie algebra is a linear space of matrices closed under the operation $ [A, B] = AB-BA $. The "Lie closure" of a set of matrices is the smallest matrix Lie algebra which contains the set. In the context of Markov chain theory, if a set of rate matrices form a Lie algebra, their corresponding Markov matrices are closed under matrix multiplication; this has been found to be a useful property… ▽ More

    Submitted 5 August, 2020; v1 submitted 25 April, 2018; originally announced April 2018.

    Journal ref: Journal of Mathematical Biology, 1-25 (2020)

  4. arXiv:1709.05079  [pdf, other

    q-bio.PE q-bio.QM

    Exploring the consequences of lack of closure in codon models

    Authors: Michael D. Woodhams, Jeremy G. Sumner, David A. Liberles, Michael A. Charleston, Barbara R. Holland

    Abstract: Models of codon evolution are commonly used to identify positive selection. Positive selection is typically a heterogeneous process, i.e., it acts on some branches of the evolutionary tree and not others. Previous work on DNA models showed that when evolution occurs under a heterogeneous process it is important to consider the property of model closure, because non-closed models can give biased es… ▽ More

    Submitted 15 September, 2017; originally announced September 2017.

    Comments: 15 pages; 3 figures; 5 tables

  5. arXiv:1709.04548  [pdf, ps, other

    q-bio.PE math.ST q-bio.QM

    Distinguishing between convergent evolution and violation of the molecular clock

    Authors: Jonathan D. Mitchell, Jeremy G. Sumner, Barbara R. Holland

    Abstract: We give a non-technical introduction to convergence-divergence models, a new modeling approach for phylogenetic data that allows for the usual divergence of species post speciation but also allows for species to converge, i.e. become more similar over time. By examining the $3$-taxon case in some detail we illustrate that phylogeneticists have been "spoiled" in the sense of not having to think abo… ▽ More

    Submitted 13 September, 2017; originally announced September 2017.

    Comments: 12 pages, 3 figures

  6. arXiv:1709.00520  [pdf, ps, other

    math.GR q-bio.PE q-bio.QM

    Lie-Markov models derived from finite semigroups

    Authors: Jeremy G. Sumner, Michael D. Woodhams

    Abstract: We present and explore a general method for deriving a Lie-Markov model from a finite semigroup. If the degree of the semigroup is $k$, the resulting model is a continuous-time Markov chain on $k$ states and, as a consequence of the product rule in the semigroup, satisfies the property of multiplicative closure. This means that the product of any two probability substitution matrices taken from th… ▽ More

    Submitted 1 September, 2017; originally announced September 2017.

    Comments: 17 pages

  7. arXiv:1704.01418  [pdf, ps, other

    math.GR math.ST q-bio.PE q-bio.QM

    Multiplicatively closed Markov models must form Lie algebras

    Authors: Jeremy G Sumner

    Abstract: We prove that the probability substitution matrices obtained from a continuous-time Markov chain form a multiplicatively closed set if and only if the rate matrices associated to the chain form a linear space spanning a Lie algebra. The key original contribution we make is to overcome an obstruction, due to the presence of inequalities that are unavoidable in the probabilistic application, that pr… ▽ More

    Submitted 31 August, 2017; v1 submitted 3 April, 2017; originally announced April 2017.

    Comments: v2: 6 pages. Minimality condition included in Property 0 to close gap in the proof of main result. To appear in the ANZIAM Journal

  8. arXiv:1612.06035  [pdf, ps, other

    q-bio.PE math.GR math.PR math.RT q-bio.QM

    A representation-theoretic approach to the calculation of evolutionary distance in bacteria

    Authors: Jeremy G Sumner, Peter D Jarvis, Andrew R Francis

    Abstract: In the context of bacteria and models of their evolution under genome rearrangement, we explore a novel application of group representation theory to the inference of evolutionary history. Our contribution is to show, in a very general maximum likelihood setting, how to use elementary matrix algebra to sidestep intractable combinatorial computations and convert the problem into one of eigenvalue e… ▽ More

    Submitted 18 December, 2016; originally announced December 2016.

    Comments: 13 pages

  9. arXiv:1608.04761  [pdf, other

    q-bio.QM math.AG math.RT q-bio.PE

    Develo** a statistically powerful measure for quartet tree inference using phylogenetic identities and Markov invariants

    Authors: Jeremy G Sumner, Amelia Taylor, Barbara R Holland, Peter D Jarvis

    Abstract: Recently there has been renewed interest in phylogenetic inference methods based on phylogenetic invariants, alongside the related Markov invariants. Broadly speaking, both these approaches give rise to polynomial functions of sequence site patterns that, in expectation value, either vanish for particular evolutionary trees (in the case of phylogenetic invariants) or have well understood transform… ▽ More

    Submitted 29 March, 2017; v1 submitted 16 August, 2016; originally announced August 2016.

    Comments: 27 pages; 5 figures (now colour); 7 tables. Updated in line with reviewer comments

  10. arXiv:1602.07780  [pdf, ps, other

    q-bio.PE math.AG math.RT q-bio.QM

    Dimensional reduction for the general Markov model on phylogenetic trees

    Authors: Jeremy G Sumner

    Abstract: We present a method of dimensional reduction for the general Markov model of sequence evolution on a phylogenetic tree. We show that taking certain linear combinations of the associated random variables (site pattern counts) reduces the dimensionality of the model from exponential in the number of extant taxa, to quadratic in the number of taxa, while retaining the ability to statistically identif… ▽ More

    Submitted 27 November, 2016; v1 submitted 24 February, 2016; originally announced February 2016.

    Comments: 17 pages, 3 figures. v4: Substantial revision. Additional motivations for transformation rules and more details in proofs of the main results are provided

  11. arXiv:1412.1525  [pdf, ps, other

    q-bio.PE

    A new hierarchy of phylogenetic models consistent with heterogeneous substitution rates

    Authors: Michael D. Woodhams, Jesús Fernández-Sánchez, Jeremy G. Sumner

    Abstract: When the process underlying DNA substitutions varies across evolutionary history, the standard Markov models underlying standard phylogenetic methods are mathematically inconsistent. The most prominent example is the general time reversible model (GTR) together with some, but not all, of its submodels. To rectify this deficiency, Lie Markov models have been developed as the class of models that ar… ▽ More

    Submitted 3 December, 2014; originally announced December 2014.

    Comments: 20 pages. Supplementary files available via email

  12. arXiv:1307.5574  [pdf, ps, other

    q-bio.PE math.RT math.ST q-bio.QM

    Matrix group structure and Markov invariants in the strand symmetric phylogenetic substitution model

    Authors: Peter D Jarvis, Jeremy G Sumner

    Abstract: We consider the continuous-time presentation of the strand symmetric phylogenetic substitution model (in which rate parameters are unchanged under nucleotide permutations given by Watson-Crick base conjugation). Algebraic analysis of the model's underlying structure as a matrix group leads to a change of basis where the rate generator matrix is given by a two-part block decomposition. We apply rep… ▽ More

    Submitted 28 October, 2014; v1 submitted 21 July, 2013; originally announced July 2013.

    Comments: v2: Major revision now includes explicit forms for quadratic and cubic Markov invariants

    Journal ref: J Mathematical Biology (Online First, Dec 2015)

  13. arXiv:1212.5311  [pdf, ps, other

    math.ST q-bio.PE

    Lie geometry of 2x2 Markov matrices

    Authors: Jeremy G. Sumner

    Abstract: In recent work discussing model choice for continuous-time Markov chains, we have argued that it is important that the Markov matrices that define the model are closed under matrix multiplication (Sumner 2012a, 2012b). The primary requirement is then that the associated set of rate matrices form a Lie algebra. For the generic case, this connection to Lie theory seems to have first been made by Joh… ▽ More

    Submitted 20 December, 2012; originally announced December 2012.

    Comments: 5 pages, 2 figures

  14. arXiv:1212.3888  [pdf, ps, other

    q-bio.PE math.ST

    A tensorial approach to the inversion of group-based phylogenetic models

    Authors: Jeremy G. Sumner, Peter D. Jarvis, Barbara R. Holland

    Abstract: Using a tensorial approach, we show how to construct a one-one correspondence between pattern probabilities and edge parameters for any group-based model. This is a generalisation of the "Hadamard conjugation" and is equivalent to standard results that use Fourier analysis. In our derivation we focus on the connections to group representation theory and emphasize that the inversion is possible bec… ▽ More

    Submitted 17 December, 2012; originally announced December 2012.

    Comments: 24 pages, 2 figures

  15. arXiv:1211.3461  [pdf, ps, other

    math.AG math.GR math.ST q-bio.PE

    Tensor Rank, Invariants, Inequalities, and Applications

    Authors: Elizabeth S. Allman, Peter D. Jarvis, John A. Rhodes, Jeremy G. Sumner

    Abstract: Though algebraic geometry over $\mathbb C$ is often used to describe the closure of the tensors of a given size and complex rank, this variety includes tensors of both smaller and larger rank. Here we focus on the $n\times n\times n$ tensors of rank $n$ over $\mathbb C$, which has as a dense subset the orbit of a single tensor under a natural group action. We construct polynomial invariants under… ▽ More

    Submitted 14 November, 2012; originally announced November 2012.

    Comments: 31 pages, 1 figure

    MSC Class: 15A72; 14P10

  16. arXiv:1206.1401  [pdf, ps, other

    q-bio.PE math.GR math.ST

    Lie Markov models with purine/pyrimidine symmetry

    Authors: Jesús Fernández-Sánchez, Jeremy G. Sumner, Peter D. Jarvis, Michael D. Woodhams

    Abstract: Continuous-time Markov chains are a standard tool in phylogenetic inference. If homogeneity is assumed, the chain is formulated by specifying time-independent rates of substitutions between states in the chain. In applications, there are usually extra constraints on the rates, depending on the situation. If a model is formulated in this way, it is possible to generalise it and allow for an inhomog… ▽ More

    Submitted 25 June, 2013; v1 submitted 7 June, 2012; originally announced June 2012.

    Comments: 32 pages

  17. arXiv:1205.5433  [pdf, ps, other

    q-bio.QM math.GR math.ST q-bio.PE quant-ph

    Adventures in Invariant Theory

    Authors: P. D. Jarvis, J. G. Sumner

    Abstract: We provide an introduction to enumerating and constructing invariants of group representations via character methods. The problem is contextualised via two case studies arising from our recent work: entanglement measures, for characterising the structure of state spaces for composite quantum systems; and Markov invariants, a robust alternative to parameter-estimation intensive methods of statistic… ▽ More

    Submitted 23 July, 2013; v1 submitted 23 May, 2012; originally announced May 2012.

    Comments: 12 pp, includes supplementary discussion of examples

    Journal ref: ANZIAM J. 56 (2014) 105-115

  18. arXiv:1204.4762  [pdf, other

    q-bio.QM math.ST q-bio.PE

    Low-parameter phylogenetic estimation under the general Markov model

    Authors: Barbara R. Holland, Peter D. Jarvis, Jeremy G. Sumner

    Abstract: In their 2008 and 2009 papers, Sumner and colleagues introduced the "squangles" - a small set of Markov invariants for phylogenetic quartets. The squangles are consistent with the general Markov model (GM) and can be used to infer quartets without the need to explicitly estimate all parameters. As GM is inhomogeneous and hence non-stationary, the squangles are expected to perform well compared to… ▽ More

    Submitted 20 April, 2012; originally announced April 2012.

    Comments: 22 pages, 6 figures

  19. arXiv:1012.5165  [pdf, ps, other

    q-bio.PE q-bio.QM

    The algebra of the general Markov model on phylogenetic trees and networks

    Authors: J. G. Sumner, B. H. Holland, P. D. Jarvis

    Abstract: It is known that the Kimura 3ST model of sequence evolution on phylogenetic trees can be extended quite naturally to arbitrary split systems. However, this extension relies heavily on mathematical peculiarities of the K3ST model, and providing an analogous augmentation of the general Markov model has thus far been elusive. In this paper we rectify this shortcoming by showing how to extend the gene… ▽ More

    Submitted 23 December, 2010; originally announced December 2010.

    Comments: 17 pages, 5 figures

    Journal ref: Bull. Math. Biol., 74(2012), 858-880

  20. arXiv:1008.1121  [pdf, ps, other

    q-bio.QM

    Markov invariants for phylogenetic rate matrices derived from embedded submodels

    Authors: P. D. Jarvis, J. G. Sumner

    Abstract: We consider novel phylogenetic models with rate matrices that arise via the embedding of a progenitor model on a small number of character states, into a target model on a larger number of character states. Adapting representation-theoretic results from recent investigations of Markov invariants for the general rate matrix model, we give a prescription for identifying and counting Markov invariant… ▽ More

    Submitted 6 August, 2010; originally announced August 2010.

    Comments: 16 pages, 1 figure, 1 appendix

  21. arXiv:0809.3070  [pdf, ps, other

    q-bio.QM q-bio.PE

    Markov invariants and the isotropy subgroup of a quartet tree

    Authors: J G Sumner, P D Jarvis

    Abstract: The purpose of this article is to show how the isotropy subgroup of leaf permutations on binary trees can be used to systematically identify tree-informative invariants relevant to models of phylogenetic evolution. In the quartet case, we give an explicit construction of the full set of representations and describe their properties. We apply these results directly to Markov invariants, thereby e… ▽ More

    Submitted 28 January, 2009; v1 submitted 18 September, 2008; originally announced September 2008.

    Comments: 18 pages, sequel to "Markov invariants, plethysms and phylogenetics" (arXiv:0711.3503v3) v2: In press for Journal of Theoretical Biology; extended introduction and other minor improvements in response to reviewers comments

    Journal ref: J. Theor. Biol., 258:302--310, 2009

  22. arXiv:0807.3387  [pdf, ps, other

    q-bio.QM cs.DS q-bio.PE

    Phylogenetic estimation with partial likelihood tensors

    Authors: J. G. Sumner, M. A. Charleston

    Abstract: We present an alternative method for calculating likelihoods in molecular phylogenetics. Our method is based on partial likelihood tensors, which are generalizations of partial likelihood vectors, as used in Felsenstein's approach. Exploiting a lexicographic sorting and partial likelihood tensors, it is possible to obtain significant computational savings. We show this on a range of simulated da… ▽ More

    Submitted 22 July, 2008; originally announced July 2008.

    Comments: 20 pages, 7 figures, 3 tables

  23. arXiv:0711.3503  [pdf, ps, other

    q-bio.PE math-ph q-bio.QM

    Markov invariants, plethysms, and phylogenetics (the long version)

    Authors: J. G. Sumner, M. A. Charleston, L. S. Jermiin, P. D. Jarvis

    Abstract: We explore model based techniques of phylogenetic tree inference exercising Markov invariants. Markov invariants are group invariant polynomials and are distinct from what is known in the literature as phylogenetic invariants, although we establish a commonality in some special cases. We show that the simplest Markov invariant forms the foundation of the Log-Det distance measure. We take as our… ▽ More

    Submitted 22 July, 2008; v1 submitted 22 November, 2007; originally announced November 2007.

    Comments: 39 pages, 10 figures, 2 tables, 3 appendices. Long arxiv version includes extended introduction, subsection on mixed-weight invariants, 3rd appendix on K3ST model and a more relaxed pace with additional discussion throughout. "Short version" is to appear in Journal of Theoretical Biology. v4: Sequence length in simulation was corrected from N=1000 to N=10000

    Journal ref: J. Theor. Biol., 253:601--615, 2008

  24. arXiv:0710.3210  [pdf, ps, other

    q-bio.QM q-bio.PE

    Entanglement, Invariants, and Phylogenetics

    Authors: J G Sumner

    Abstract: This thesis develops and expands upon known techniques of mathematical physics relevant to the analysis of the popular Markov model of phylogenetic trees required in biology to reconstruct the evolutionary relationships of taxonomic units from biomolecular sequence data. The techniques of mathematical physics are plethora and have been developed for some time. The Markov model of phylogenetics a… ▽ More

    Submitted 16 October, 2007; originally announced October 2007.

    Comments: PhD thesis

  25. arXiv:q-bio/0510035  [pdf, ps, other

    q-bio.PE

    Using the tangle: a consistent construction of phylogenetic distance matrices for quartets

    Authors: J G Sumner, P D Jarvis

    Abstract: Distance based algorithms are a common technique in the construction of phylogenetic trees from taxonomic sequence data. The first step in the implementation of these algorithms is the calculation of a pairwise distance matrix to give a measure of the evolutionary change between any pair of the extant taxa. A standard technique is to use the log det formula to construct pairwise distances from a… ▽ More

    Submitted 29 March, 2006; v1 submitted 17 October, 2005; originally announced October 2005.

    Comments: 18 Pges. Submitted to Mathematical Biosciences

  26. arXiv:q-bio/0411047  [pdf, ps, other

    q-bio.PE physics.bio-ph q-bio.QM

    Path integral formulation and Feynman rules for phylogenetic branching models

    Authors: P. D. Jarvis, J. D. Bashford, J. G. Sumner

    Abstract: A dynamical picture of phylogenetic evolution is given in terms of Markov models on a state space, comprising joint probability distributions for character types of taxonomic classes. Phylogenetic branching is a process which augments the number of taxa under consideration, and hence the rank of the underlying joint probability state tensor. We point out the combinatorial necessity for a second-… ▽ More

    Submitted 13 October, 2005; v1 submitted 27 November, 2004; originally announced November 2004.

    Comments: 25 pages LaTeX, uses pstricks. Appendix added deriving Feynman rules, appropriate text and title changes, figure for Feynman rules added

  27. arXiv:q-bio/0402007  [pdf, ps, other

    q-bio.PE

    Entanglement Invariants and Phylogenetic Branching

    Authors: J. G. Sumner, P. D. Jarvis

    Abstract: It is possible to consider stochastic models of sequence evolution in phylogenetics in the context of a dynamical tensor description inspired from physics. Approaching the problem in this framework allows for the well developed methods of mathematical physics to be exploited in the biological arena. We present the tensor description of the homogeneous continuous time Markov chain model of phylog… ▽ More

    Submitted 30 November, 2004; v1 submitted 3 February, 2004; originally announced February 2004.

    Comments: 21 pages, 3 Figures. Accepted for publication in Journal of Mathematical Biology

    Report number: UTAS-PHYS-04-01

  28. arXiv:q-bio/0310037  [pdf, ps, other

    q-bio.PE math.ST q-bio.QM

    U(1)xU(1)xU(1) symmetry of the Kimura 3ST model and phylogenetic branching processes

    Authors: J. D. Bashford, P. D. Jarvis, J. G. Sumner, M. A. Steel

    Abstract: An analysis of the Kimura 3ST model of DNA sequence evolution is given on the basis of its continuous Lie symmetries. The rate matrix commutes with a U(1)xU(1)xU(1) phase subgroup of the group GL(4) of 4x4x4 invertible complex matrices acting on a linear space spanned by the 4 nucleic acid base letters. The diagonal `branching operator' representing speciation is defined, and shown to intertwine… ▽ More

    Submitted 2 November, 2003; v1 submitted 30 October, 2003; originally announced October 2003.

    Comments: 9 pages, LaTeX, uses amsmath

    Report number: UTAS-PHYS-03-07

  29. arXiv:hep-th/0207262  [pdf, ps, other

    hep-th

    Polar decomposition of a Dirac spinor

    Authors: J. G. Sumner, P. D. Jarvis

    Abstract: Local decompositions of a Dirac spinor into `charged' and `real' pieces psi(x) = M(x) chi(x) are considered. chi(x) is a Majorana spinor, and M(x) a suitable Dirac-algebra valued field. Specific examples of the decomposition in 2+1 dimensions are developed, along with kinematical implications, and constraints on the component fields within M(x) sufficient to encompass the correct degree of freed… ▽ More

    Submitted 30 July, 2002; originally announced July 2002.

    Comments: 12 pages, LaTeX

    Report number: UTAS-PHYS-02-02