Showing 1–50 of 60 results for author: Kelk, S

  1. arXiv:2403.10868  [pdf, other]

    cs.DS

    Approximation Ratio of the Min-Degree Greedy Algorithm for Maximum Independent Set on Interval and Chordal Graphs

    Authors: Steven Chaplick, Martin Frohn, Steven Kelk, Johann Lottermoser, Matus Mihalak

    Abstract: In this article we prove that the minimum-degree greedy algorithm, with adversarial tie-breaking, is a $(2/3)$-approximation for the Maximum Independent Set problem on interval graphs. We show that this is tight, even on unit interval graphs of maximum degree 3. We show that on chordal graphs, the greedy algorithm is a $(1/2)$-approximation and that this is again tight. These results contrast with…

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 11 pages, 2 figures, submitted to journal

  2. arXiv:2309.01110  [pdf, other]

    cs.DS q-bio.PE

    Relaxed Agreement Forests

    Authors: Virginia Aardevol Martinez, Steven Chaplick, Steven Kelk, Ruben Meuwese, Matus Mihalak, Georgios Stamoulis

    Abstract: There are multiple factors which can cause the phylogenetic inference process to produce two or more conflicting hypotheses of the evolutionary history of a set X of biological entities. That is: phylogenetic trees with the same set of leaf labels X but with distinct topologies. This leads naturally to the goal of quantifying the difference between two such trees T_1 and T_2. Here we introduce the…

    Submitted 3 September, 2023; originally announced September 2023.

    Comments: 14 pages plus appendix

  3. arXiv:2307.12176  [pdf, ps, other]

    q-bio.PE cs.DS math.CO

    Agreement forests of caterpillar trees: complexity, kernelization and branching

    Authors: Steven Kelk, Ruben Meuwese

    Abstract: Given a set $X$ of species, a phylogenetic tree is an unrooted binary tree whose leaves are bijectively labelled by $X$. Such trees can be used to show the way species evolve over time. One way of understanding how topologically different two phylogenetic trees are, is to construct a minimum-size agreement forest: a partition of $X$ into the smallest number of blocks, such that the blocks induce h…

    Submitted 31 August, 2023; v1 submitted 22 July, 2023; originally announced July 2023.

    Comments: 33 pages, 15 figures

  4. arXiv:2302.10662  [pdf, other]

    math.CO cs.DS q-bio.PE

    Snakes and Ladders: a Treewidth Story

    Authors: Steven Chaplick, Steven Kelk, Ruben Meuwese, Matus Mihalak, Georgios Stamoulis

    Abstract: Let $G$ be an undirected graph. We say that $G$ contains a ladder of length $k$ if the $2 \times (k+1)$ grid graph is an induced subgraph of $G$ that is only connected to the rest of $G$ via its four cornerpoints. We prove that if all the ladders contained in $G$ are reduced to length 4, the treewidth remains unchanged (and that this bound is tight). Our result indicates that, when computing the t…

    Submitted 30 January, 2024; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: Compared to the earlier arXiv/WG version we have added analytical (as opposed to empirical) tightness bounds, and an extended discussion. See also Authors note 2 at the end of the introduction about earlier work in this area by Marchand et al

  5. arXiv:2206.04451  [pdf, other]

    cs.DS math.CO q-bio.PE

    Deep kernelization for the Tree Bisection and Reconnect (TBR) distance in phylogenetics

    Authors: Steven Kelk, Simone Linz, Ruben Meuwese

    Abstract: We describe a kernel of size 9k-8 for the NP-hard problem of computing the Tree Bisection and Reconnect (TBR) distance k between two unrooted binary phylogenetic trees. We achieve this by extending the existing portfolio of reduction rules with three novel new reduction rules. Two of the rules are based on the idea of topologically transforming the trees in a distance-preserving way in order to gu…

    Submitted 20 September, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 38 pages. In this version a figure has been added, some references have been added, some small typo's have been fixed and the introduction and conclusion have been slightly extended. Submitted for journal review

  6. Cyclic generators and an improved linear kernel for the rooted subtree prune and regraft distance

    Authors: Steven Kelk, Simone Linz, Ruben Meuwese

    Abstract: The rooted subtree prune and regraft (rSPR) distance between two rooted binary phylogenetic trees is a well-studied measure of topological dissimilarity that is NP-hard to compute. Here we describe an improved linear kernel for the problem. In particular, we show that if the classical subtree and chain reduction rules are augmented with a modified type of chain reduction rule, the resulting trees…

    Submitted 20 February, 2022; originally announced February 2022.

  7. arXiv:2111.12632  [pdf, ps, other]

    math.CO q-bio.PE

    Convex characters, algorithms and matchings

    Authors: Steven Kelk, Ruben Meuwese, Stephan Wagner

    Abstract: Phylogenetic trees are used to model evolution: leaves are labelled to represent contemporary species ("taxa") and interior vertices represent extinct ancestors. Informally, convex characters are measurements on the contemporary species in which the subset of species (both contemporary and extinct) that share a given state, form a connected subtree. In \cite{KelkS17} it was shown how to efficientl…

    Submitted 24 November, 2021; originally announced November 2021.

    Comments: 23 pages, supplementary material can be found at https://github.com/skelk2001/legal-matchings

  8. arXiv:2107.10871  [pdf, ps, other]

    math.CO q-bio.PE

    Sharp upper and lower bounds on a restricted class of convex characters

    Authors: Steven Kelk, Ruben Meuwese

    Abstract: Let $\mathcal{T}$ be an unrooted binary tree with $n$ distinctly labelled leaves. Deriving its name from the field of phylogenetics, a convex character on $\mathcal{T}$ is simply a partition of the leaves such that the minimal spanning subtrees induced by the blocks of the partition are mutually disjoint. In earlier work Kelk and Stamoulis (Advances in Applied Mathematics 84 (2017), pp. 34--46) de…

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: 12 pages, 5 figures

  9. Reflections on kernelizing and computing unrooted agreement forests

    Authors: Rim van Wersch, Steven Kelk, Simone Linz, Georgios Stamoulis

    Abstract: Phylogenetic trees are leaf-labelled trees used to model the evolution of species. Here we explore the practical impact of kernelization (i.e. data reduction) on the NP-hard problem of computing the TBR distance between two unrooted binary phylogenetic trees. This problem is better-known in the literature as the maximum agreement forest problem, where the goal is to partition the two trees into a…

    Submitted 5 October, 2021; v1 submitted 14 December, 2020; originally announced December 2020.

    Comments: Updated version. New figures and analysis. Accepted for journal publication

  10. New FPT algorithms for finding the temporal hybridization number for sets of phylogenetic trees

    Authors: Sander Borst, Leo van Iersel, Mark Jones, Steven Kelk

    Abstract: We study the problem of finding a temporal hybridization network for a set of phylogenetic trees that minimizes the number of reticulations. First, we introduce an FPT algorithm for this problem on an arbitrary set of $m$ binary trees with $n$ leaves each with a running time of $O(5^k\cdot n\cdot m)$, where $k$ is the minimum temporal hybridization number. We also present the concept of temporal d…

    Submitted 27 July, 2020; originally announced July 2020.

  11. arXiv:2004.02298  [pdf, ps, other]

    cs.DS

    Maximum parsimony distance on phylogenetic trees: a linear kernel and constant factor approximation algorithm

    Authors: Mark Jones, Steven Kelk, Leen Stougie

    Abstract: Maximum parsimony distance is a measure used to quantify the dissimilarity of two unrooted phylogenetic trees. It is NP-hard to compute, and very few positive algorithmic results are known due to its complex combinatorial structure. Here we address this shortcoming by showing that the problem is fixed parameter tractable. We do this by establishing a linear kernel i.e., that after applying certain…

    Submitted 5 April, 2020; originally announced April 2020.

    Comments: 27 pages, 7 figures

  12. arXiv:1910.11041  [pdf, ps, other]

    q-bio.PE cs.DS

    Cutting an alignment with Ockham's razor

    Authors: Mark Jones, Philippe Gambette, Leo van Iersel, Remie Janssen, Steven Kelk, Fabio Pardi, Celine Scornavacca

    Abstract: In this article, we investigate different parsimony-based approaches towards finding recombination breakpoints in a multiple sequence alignment. This recombination detection task is crucial in order to avoid errors in evolutionary analyses caused by mixing together portions of sequences which had a different evolution history. Following an overview of the field of recombination detection, we formu…

    Submitted 24 October, 2019; originally announced October 2019.

  13. arXiv:1905.13516  [pdf, other]

    cs.AI

    Foundations of Digital Archæoludology

    Authors: Cameron Browne, Dennis J. N. J. Soemers, Éric Piette, Matthew Stephenson, Michael Conrad, Walter Crist, Thierry Depaulis, Eddie Duggan, Fred Horn, Steven Kelk, Simon M. Lucas, João Pedro Neto, David Parlett, Abdallah Saffidine, Ulrich Schädler, Jorge Nuno Silva, Alex de Voogt, Mark H. M. Winands

    Abstract: Digital Archaeoludology (DAL) is a new field of study involving the analysis and reconstruction of ancient games from incomplete descriptions and archaeological evidence using modern computational techniques. The aim is to provide digital tools and methods to help game historians and other researchers better understand traditional games, their development throughout recorded human history, and the…

    Submitted 31 May, 2019; originally announced May 2019.

    Comments: Report on Dagstuhl Research Meeting. Authored/edited by all participants. Appendices by Thierry Depaulis

  14. New reduction rules for the tree bisection and reconnection distance

    Authors: Steven Kelk, Simone Linz

    Abstract: Recently it was shown that, if the subtree and chain reduction rules have been applied exhaustively to two unrooted phylogenetic trees, the reduced trees will have at most 15k-9 taxa where k is the TBR (Tree Bisection and Reconnection) distance between the two trees, and that this bound is tight. Here we propose five new reduction rules and show that these further reduce the bound to 11k-9. The ne…

    Submitted 14 June, 2020; v1 submitted 4 May, 2019; originally announced May 2019.

    Comments: Accepted for journal publication. This version contains extra figures. Keywords: fixed-parameter tractability, tree bisection and reconnection, generator, kernelization, agreement forest, phylogenetic network, phylogenetic tree, hybridization number

    Journal ref: Annals of Combinatorics, 24:475-502, 2020

  15. arXiv:1901.09187  [pdf, other]

    cs.LG q-bio.QM stat.ML

    Discovery of Important Subsequences in Electrocardiogram Beats Using the Nearest Neighbour Algorithm

    Authors: Ricards Marcinkevics, Steven Kelk, Carlo Galuzzi, Berthold Stegemann

    Abstract: The classification of time series data is a well-studied problem with numerous practical applications, such as medical diagnosis and speech recognition. A popular and effective approach is to classify new time series in the same way as their nearest neighbours, whereby proximity is defined using Dynamic Time Warping (DTW) distance, a measure analogous to sequence alignment in bioinformatics. Howev…

    Submitted 26 January, 2019; originally announced January 2019.

  16. arXiv:1811.06892  [pdf, other]

    cs.DS math.CO q-bio.PE

    A tight kernel for computing the tree bisection and reconnection distance between two phylogenetic trees

    Authors: Steven Kelk, Simone Linz

    Abstract: In 2001 Allen and Steel showed that, if subtree and chain reduction rules have been applied to two unrooted phylogenetic trees, the reduced trees will have at most 28k taxa where k is the TBR (Tree Bisection and Reconnection) distance between the two trees. Here we reanalyse Allen and Steel's kernelization algorithm and prove that the reduced instances will in fact have at most 15k-9 taxa. Moreove…

    Submitted 16 July, 2019; v1 submitted 16 November, 2018; originally announced November 2018.

    Comments: One figure added, two small typos fixed. This version to appear in SIDMA (SIAM Journal on Discrete Mathematics)

    Journal ref: SIAM Journal on Discrete Mathematics, 33:1556-1574, 2019

  17. arXiv:1809.00907  [pdf, other]

    cs.DS

    Treewidth of display graphs: bounds, brambles and applications

    Authors: Remie Janssen, Mark Jones, Steven Kelk, Georgios Stamoulis, Taoyang Wu

    Abstract: Phylogenetic trees and networks are leaf-labelled graphs used to model evolution. Display graphs are created by identifying common leaf labels in two or more phylogenetic trees or networks. The treewidth of such graphs is bounded as a function of many common dissimilarity measures between phylogenetic trees and this has been leveraged in fixed parameter tractability results. Here we further elucid…

    Submitted 4 September, 2018; originally announced September 2018.

  18. arXiv:1804.07232  [pdf, other]

    cs.DM math.CO q-bio.PE

    A third strike against perfect phylogeny

    Authors: Leo van Iersel, Mark Jones, Steven Kelk

    Abstract: Perfect phylogenies are fundamental in the study of evolutionary trees because they capture the situation when each evolutionary trait emerges only once in history; if such events are believed to be rare, then by Occam's Razor such parsimonious trees are preferable as a hypothesis of evolution. A classical result states that 2-state characters permit a perfect phylogeny precisely if each subset of…

    Submitted 14 January, 2019; v1 submitted 19 April, 2018; originally announced April 2018.

    Comments: This article has been accepted for publication in Systematic Biology Published by Oxford University Press

  19. Integrality Gaps for Colorful Matchings

    Authors: Steven Kelk, Georgios Stamoulis

    Abstract: We study the integrality gap of the natural linear programming relaxation for the \textit{Bounded Color Matching} (BCM) problem. We provide several families of instances and establish lower bounds on their integrality gaps and we study how the Sherali-Adams "lift-and-project" technique behaves on these instances. We complement these results by showing that if we exclude certain simple sub-structur…

    Submitted 12 January, 2019; v1 submitted 24 January, 2018; originally announced January 2018.

    Comments: 25 pages, 3 Figures

  20. Deciding the existence of a cherry-picking sequence is hard on two trees

    Authors: Janosch Döcker, Leo van Iersel, Steven Kelk, Simone Linz

    Abstract: Here we show that deciding whether two rooted binary phylogenetic trees on the same set of taxa permit a cherry-picking sequence, a special type of elimination order on the taxa, is NP-complete. This improves on an earlier result which proved hardness for eight or more trees. Via a known equivalence between cherry-picking sequences and temporal phylogenetic networks, our result proves that it is N…

    Submitted 25 January, 2019; v1 submitted 8 December, 2017; originally announced December 2017.

    Comments: Fixed some tiny things. Accepted for journal publication

    Journal ref: Discrete Applied Mathematics, 260:131-143, 2019

  21. arXiv:1707.03648  [pdf, ps, other]

    q-bio.PE cs.DS

    Finding the most parsimonious or likely tree in a network with respect to an alignment

    Authors: Steven Kelk, Fabio Pardi, Celine Scornavacca, Leo van Iersel

    Abstract: Phylogenetic networks are often constructed by merging multiple conflicting phylogenetic signals into a directed acyclic graph. It is interesting to explore whether a network constructed in this way induces biologically-relevant phylogenetic signals that were not present in the input. Here we show that, given a multiple alignment A for a set of taxa X and a rooted phylogenetic network N whose leav…

    Submitted 12 July, 2017; originally announced July 2017.

  22. arXiv:1703.10840  [pdf, other]

    cs.DM cs.DS q-bio.PE

    Treewidth distance on phylogenetic trees

    Authors: Steven Kelk, Georgios Stamoulis, Taoyang Wu

    Abstract: In this article we study the treewidth of the \emph{display graph}, an auxiliary graph structure obtained from the fusion of phylogenetic (i.e., evolutionary) trees at their leaves. Earlier work has shown that the treewidth of the display graph is bounded if the trees are in some formal sense topologically similar. Here we further expand upon this relationship. We analyse a number of reduction rul…

    Submitted 31 March, 2017; originally announced March 2017.

    Comments: 29 pages, 12 figures

  23. arXiv:1609.00544  [pdf, ps, other]

    cs.DS

    On unrooted and root-uncertain variants of several well-known phylogenetic network problems

    Authors: Leo van Iersel, Steven Kelk, Georgios Stamoulis, Leen Stougie, Olivier Boes

    Abstract: The hybridization number problem requires us to embed a set of binary rooted phylogenetic trees into a binary rooted phylogenetic network such that the number of nodes with indegree two is minimized. However, from a biological point of view accurately inferring the root location in a phylogenetic tree is notoriously difficult and poor root placement can artificially inflate the hybridization numbe…

    Submitted 2 September, 2016; originally announced September 2016.

    Comments: 28 pages, 8 Figures

  24. Do branch lengths help to locate a tree in a phylogenetic network?

    Authors: Philippe Gambette, Leo van Iersel, Steven Kelk, Fabio Pardi, Celine Scornavacca

    Abstract: Phylogenetic networks are increasingly used in evolutionary biology to represent the history of species that have undergone reticulate events such as horizontal gene transfer, hybrid speciation and recombination. One of the most fundamental questions that arise in this context is whether the evolution of a gene with one copy in all species can be explained by a given network. In mathematical terms…

    Submitted 21 July, 2016; originally announced July 2016.

  25. arXiv:1512.07459  [pdf, other]

    q-bio.PE cs.DS

    Reduction rules for the maximum parsimony distance on phylogenetic trees

    Authors: Steven Kelk, Mareike Fischer, Vincent Moulton, Taoyang Wu

    Abstract: In phylogenetics, distances are often used to measure the incongruence between a pair of phylogenetic trees that are reconstructed by different methods or using different regions of genome. Motivated by the maximum parsimony principle in tree inference, we recently introduced the maximum parsimony (MP) distance, which enjoys various attractive properties due to its connection with several other we…

    Submitted 7 July, 2016; v1 submitted 23 December, 2015; originally announced December 2015.

    Comments: Added material on graph minors, MSOL and treewidth

  26. arXiv:1511.08056  [pdf, ps, other]

    math.CO

    On the challenge of reconstructing level-1 phylogenetic networks from triplets and clusters

    Authors: P. Gambette, K. T. Huber, S. Kelk

    Abstract: Phylogenetic networks have gained prominence over the years due to their ability to represent complex non-treelike evolutionary events such as recombination or hybridization. Popular combinatorial objects used to construct them are triplet systems and cluster systems, the motivation being that any network $N$ induces a triplet system $\mathcal R(N)$ and a softwired cluster system $\mathcal S(N)$.…

    Submitted 18 October, 2016; v1 submitted 25 November, 2015; originally announced November 2015.

  27. arXiv:1508.02598  [pdf, ps, other]

    q-bio.PE cs.DS math.CO

    A note on convex characters, Fibonacci numbers and exponential-time algorithms

    Authors: Steven Kelk, Georgios Stamoulis

    Abstract: Phylogenetic trees are used to model evolution: leaves are labelled to represent contemporary species ("taxa") and interior vertices represent extinct ancestors. Informally, convex characters are measurements on the contemporary species in which the subset of species (both contemporary and extinct) that share a given state, form a connected subtree. Given an unrooted, binary phylogenetic tree T on…

    Submitted 27 July, 2016; v1 submitted 11 August, 2015; originally announced August 2015.

    Comments: added a significant number of new results to the previous version (on dynamic programming, g-spectra and so on)

  28. arXiv:1506.06404  [pdf, other]

    q-bio.PE cs.DS

    A linear bound on the number of states in optimal convex characters for maximum parsimony distance

    Authors: Olivier Boes, Mareike Fischer, Steven Kelk

    Abstract: Given two phylogenetic trees on the same set of taxa X, the maximum parsimony distance d_MP is defined as the maximum, ranging over all characters c on X, of the absolute difference in parsimony score induced by c on the two trees. In this note we prove that for binary trees there exists a character achieving this maximum that is convex on one of the trees (i.e. the parsimony score induced on that…

    Submitted 21 June, 2015; originally announced June 2015.

  29. arXiv:1503.00368  [pdf, ps, other]

    cs.DS cs.CE cs.LO q-bio.PE

    Phylogenetic incongruence through the lens of Monadic Second Order logic

    Authors: Steven Kelk, Leo van Iersel, Celine Scornavacca

    Abstract: Within the field of phylogenetics there is growing interest in measures for summarising the dissimilarity, or 'incongruence', of two or more phylogenetic trees. Many of these measures are NP-hard to compute and this has stimulated a considerable volume of research into fixed parameter tractable algorithms. In this article we use Monadic Second Order logic (MSOL) to give alternative, compact proofs…

    Submitted 1 March, 2015; originally announced March 2015.

  30. arXiv:1412.4076  [pdf, ps, other]

    q-bio.PE cs.CE cs.DS math.CO

    On the complexity of computing MP distance between binary phylogenetic trees

    Authors: Steven Kelk, Mareike Fischer

    Abstract: Within the field of phylogenetics there is great interest in distance measures to quantify the dissimilarity of two trees. Recently, a new distance measure has been proposed: the Maximum Parsimony (MP) distance. This is based on the difference of the parsimony scores of a single character on both trees under consideration, and the goal is to find the character which maximizes this difference. Here…

    Submitted 18 January, 2015; v1 submitted 12 December, 2014; originally announced December 2014.

    Comments: 37 pages, 8 figures

    MSC Class: 05C15; 05C35; 90C35; 92D15

  31. arXiv:1410.7004  [pdf, other]

    q-bio.PE

    Exact reconciliation of undated trees

    Authors: Leo van Iersel, Celine Scornavacca, Steven Kelk

    Abstract: Reconciliation methods aim at recovering macro evolutionary events and at localizing them in the species history, by observing discrepancies between gene family trees and species trees. In this article we introduce an Integer Linear Programming (ILP) approach for the NP-hard problem of computing a most parsimonious time-consistent reconciliation of a gene tree with a species tree when dating infor…

    Submitted 26 October, 2014; originally announced October 2014.

  32. arXiv:1410.2371  [pdf, other]

    cs.CC q-bio.PE

    Satisfying ternary permutation constraints by multiple linear orders or phylogenetic trees

    Authors: Leo van Iersel, Steven Kelk, Nela Lekic, Simone Linz

    Abstract: A ternary permutation constraint satisfaction problem (CSP) is specified by a subset Pi of the symmetric group S_3. An instance of such a problem consists of a set of variables V and a set of constraints C, where each constraint is an ordered triple of distinct elements from V. The goal is to construct a linear order alpha on V such that, for each constraint (a,b,c) in C, the ordering of a,b,c ind…

    Submitted 9 October, 2014; originally announced October 2014.

  33. arXiv:1402.7224  [pdf, other]

    cs.DM cs.DS

    On low treewidth graphs and supertrees

    Authors: Alexander Grigoriev, Steven Kelk, Nela Lekic

    Abstract: Compatibility of unrooted phylogenetic trees is a well studied problem in phylogenetics. It asks to determine whether for a set of k input trees there exists a larger tree (called a supertree) that contains the topologies of all k input trees. When any such supertree exists we call the instance compatible and otherwise incompatible. It is known that the problem is NP-hard and FPT, although a const…

    Submitted 28 February, 2014; originally announced February 2014.

  34. arXiv:1402.2136  [pdf, other]

    cs.DS

    Hybridization Number on Three Rooted Binary Trees is EPT

    Authors: Leo van Iersel, Steven Kelk, Nela Lekić, Chris Whidden, Norbert Zeh

    Abstract: Phylogenetic networks are leaf-labelled directed acyclic graphs that are used to describe non-treelike evolutionary histories and are thus a generalization of phylogenetic trees. The hybridization number of a phylogenetic network is the sum of all indegrees minus the number of nodes plus one. The Hybridization Number problem takes as input a collection of phylogenetic trees and asks to construct a…

    Submitted 31 May, 2016; v1 submitted 10 February, 2014; originally announced February 2014.

  35. arXiv:1402.1553  [pdf, other]

    q-bio.PE math.CO

    On the Maximum Parsimony distance between phylogenetic trees

    Authors: Mareike Fischer, Steven Kelk

    Abstract: Within the field of phylogenetics there is great interest in distance measures to quantify the dissimilarity of two trees. Here, based on an idea of Bruen and Bryant, we propose and analyze a new distance measure: the Maximum Parsimony (MP) distance. This is based on the difference of the parsimony scores of a single character on both trees under consideration, and the goal is to find the characte…

    Submitted 7 February, 2014; originally announced February 2014.

    Comments: 30 pages, 6 figures

  36. arXiv:1312.1255  [pdf, other]

    q-bio.PE math.CO

    A short note on exponential-time algorithms for hybridization number

    Authors: Leo van Iersel, Steven Kelk, Nela Lekic, Leen Stougie

    Abstract: In this short note we prove that, given two (not necessarily binary) rooted phylogenetic trees T_1, T_2 on the same set of taxa X, where |X|=n, the hybridization number of T_1 and T_2 can be computed in time O^{*}(2^n) i.e. O(2^{n} poly(n)). The result also means that a Maximum Acyclic Agreement Forest (MAAF) can be computed within the same time bound.

    Submitted 4 December, 2013; originally announced December 2013.

  37. arXiv:1311.4045  [pdf, other]

    cs.DM q-bio.PE

    Kernelizations for the hybridization number problem on multiple nonbinary trees

    Authors: Leo van Iersel, Steven Kelk, Celine Scornavacca

    Abstract: Given a finite set $X$, a collection $\mathcal{T}$ of rooted phylogenetic trees on $X$ and an integer $k$, the Hybridization Number problem asks if there exists a phylogenetic network on $X$ that displays all trees from $\mathcal{T}$ and has reticulation number at most $k$. We show two kernelization algorithms for Hybridization Number, with kernel sizes $4k(5k)^t$ and $20k^2(Δ^+-1)$ respectively,…

    Submitted 22 March, 2016; v1 submitted 16 November, 2013; originally announced November 2013.

  38. arXiv:1310.6844  [pdf, other]

    q-bio.PE

    Fighting network space: it is time for an SQL-type language to filter phylogenetic networks

    Authors: Steven Kelk, Simone Linz, David A. Morrison

    Abstract: The search space of rooted phylogenetic trees is vast and a major research focus of recent decades has been the development of algorithms to effectively navigate this space. However this space is tiny when compared with the space of rooted phylogenetic networks, and navigating this enlarged space remains a poorly understood problem. This, and the difficulty of biologically interpreting such networ…

    Submitted 25 October, 2013; originally announced October 2013.

    Comments: opinion piece

  39. arXiv:1302.2430  [pdf, other]

    q-bio.PE math.CO

    On Computing the Maximum Parsimony Score of a Phylogenetic Network

    Authors: Mareike Fischer, Leo van Iersel, Steven Kelk, Celine Scornavacca

    Abstract: Phylogenetic networks are used to display the relationship of different species whose evolution is not treelike, which is the case, for instance, in the presence of hybridization events or horizontal gene transfers. Tree inference methods such as Maximum Parsimony need to be modified in order to be applicable to networks. In this paper, we discuss two different definitions of Maximum Parsimony on…

    Submitted 1 May, 2014; v1 submitted 11 February, 2013; originally announced February 2013.

    ACM Class: G.2; F.2

  40. arXiv:1210.3211  [pdf, other]

    math.CO cs.DM q-bio.PE

    Approximation algorithms for nonbinary agreement forests

    Authors: Leo van Iersel, Steven Kelk, Nela Lekić, Leen Stougie

    Abstract: Given two rooted phylogenetic trees on the same set of taxa X, the Maximum Agreement Forest problem (MAF) asks to find a forest that is, in a certain sense, common to both trees and has a minimum number of components. The Maximum Acyclic Agreement Forest problem (MAAF) has the additional restriction that the components of the forest cannot have conflicting ancestral relations in the input trees. T…

    Submitted 23 December, 2012; v1 submitted 11 October, 2012; originally announced October 2012.

    Comments: Note that this version contains significantly more results than the previous versions. Submitted for journal publication

  41. arXiv:1207.7034  [pdf, ps, other]

    q-bio.PE cs.CC

    Towards the fixed parameter tractability of constructing minimal phylogenetic networks from arbitrary sets of nonbinary trees

    Authors: Steven Kelk, Celine Scornavacca

    Abstract: It has remained an open question for some time whether, given a set of not necessarily binary (i.e. "nonbinary") trees T on a set of taxa X, it is possible to determine in time f(r).poly(m) whether there exists a phylogenetic network that displays all the trees in T, where r refers to the reticulation number of the network and m=|X|+|T|. Here we show that this holds if one or both of the following…

    Submitted 2 August, 2012; v1 submitted 30 July, 2012; originally announced July 2012.

    Comments: have fixed a number of small typo's etc

  42. arXiv:1207.6090  [pdf, other]

    q-bio.QM math.CO q-bio.PE

    A simple fixed parameter tractable algorithm for computing the hybridization number of two (not necessarily binary) trees

    Authors: Teresa Piovesan, Steven Kelk

    Abstract: Here we present a new fixed parameter tractable algorithm to compute the hybridization number r of two rooted, not necessarily binary phylogenetic trees on taxon set X in time (6^r.r!).poly(n), where n=|X|. The novelty of this approach is its use of terminals, which are maximal elements of a natural partial order on X, and several insights from the softwired clusters literature. This yields a sur…

    Submitted 25 July, 2012; originally announced July 2012.

  43. arXiv:1205.3417  [pdf, other]

    q-bio.PE

    A practical approximation algorithm for solving massive instances of hybridization number for binary and nonbinary trees

    Authors: Leo van Iersel, Steven Kelk, Nela Lekić, Celine Scornavacca

    Abstract: Reticulate events play an important role in determining evolutionary relationships. The problem of computing the minimum number of such events to explain discordance between two phylogenetic trees is a hard computational problem. Even for binary trees, exact solvers struggle to solve instances with reticulation number larger than 40-50. Here we present CycleKiller and NonbinaryCycleKiller, the fir…

    Submitted 1 May, 2014; v1 submitted 15 May, 2012; originally announced May 2012.

  44. arXiv:1112.5359  [pdf, other]

    math.CO cs.DS q-bio.QM

    Cycle killer... qu'est-ce que c'est? On the comparative approximability of hybridization number and directed feedback vertex set

    Authors: Steven Kelk, Leo van Iersel, Nela Lekic, Simone Linz, Celine Scornavacca, Leen Stougie

    Abstract: We show that the problem of computing the hybridization number of two rooted binary phylogenetic trees on the same set of taxa X has a constant factor polynomial-time approximation if and only if the problem of computing a minimum-size feedback vertex set in a directed graph (DFVS) has a constant factor polynomial-time approximation. The latter problem, which asks for a minimum number of vertices…

    Submitted 22 December, 2011; originally announced December 2011.

  45. arXiv:1108.4642  [pdf, other]

    q-bio.PE cs.DS

    A note on efficient computation of hybridization number via softwired clusters

    Authors: Steven Kelk

    Abstract: Here we present a new fixed parameter tractable algorithm to compute the hybridization number r of two rooted binary phylogenetic trees on taxon set X in time (6r)^r.poly(n), where n=|X|. The novelty of this approach is that it avoids the use of Maximum Acyclic Agreement Forests (MAAFs) and instead exploits the equivalence of the problem with a related problem from the softwired clusters literatur…

    Submitted 23 August, 2011; originally announced August 2011.

  46. arXiv:1108.3653  [pdf, other]

    cs.CC q-bio.PE

    Constructing minimal phylogenetic networks from softwired clusters is fixed parameter tractable

    Authors: Steven Kelk, Celine Scornavacca

    Abstract: Here we show that, given a set of clusters C on a set of taxa X, where |X|=n, it is possible to determine in time f(k).poly(n) whether there exists a level-<= k network (i.e. a network where each biconnected component has reticulation number at most k) that represents all the clusters in C in the softwired sense, and if so to construct such a network. This extends a polynomial time result from "On…

    Submitted 18 August, 2011; originally announced August 2011.

    Comments: Submitted

  47. arXiv:1103.1834  [pdf, other]

    q-bio.PE cs.SI physics.soc-ph

    On the elusiveness of clusters

    Authors: Steven Kelk, Celine Scornavacca, Leo van Iersel

    Abstract: Rooted phylogenetic networks are often used to represent conflicting phylogenetic signals. Given a set of clusters, a network is said to represent these clusters in the "softwired" sense if, for each cluster in the input set, at least one tree embedded in the network contains that cluster. Motivated by parsimony we might wish to construct such a network using as few reticulations as possible, or m…

    Submitted 9 March, 2011; originally announced March 2011.

    Comments: Submitted

  48. arXiv:1004.5332  [pdf, other]

    q-bio.PE

    When two trees go to war

    Authors: Leo van Iersel, Steven Kelk

    Abstract: Rooted phylogenetic networks are often constructed by combining trees, clusters, triplets or characters into a single network that in some well-defined sense simultaneously represents them all. We review these four models and investigate how they are related. In general, the model chosen influences the minimum number of reticulation events required. However, when one obtains the input data from tw…

    Submitted 29 April, 2010; originally announced April 2010.

  49. arXiv:0912.4502  [pdf, ps, other]

    q-bio.PE

    A short note on the tractability of constructing phylogenetic networks from clusters

    Authors: Leo van Iersel, Steven Kelk

    Abstract: In [2] it was proven that the Cass algorithm is a polynomial-time algorithm for constructing level<=2 networks from clusters. Here we demonstrate, for each k>=0, a polynomial-time algorithm for constructing level-k phylogenetic networks from clusters. Unlike Cass the algorithm scheme given here is only of theoretical interest. It does, however, strengthen the hope that efficient polynomial-time…

    Submitted 22 December, 2009; originally announced December 2009.

  50. arXiv:0910.4067  [pdf, ps, other]

    q-bio.PE

    A Practical Algorithm for Reconstructing Level-1 Phylogenetic Networks

    Authors: Katharina T. Huber, Leo van Iersel, Steven Kelk, Radoslaw Suchecki

    Abstract: Recently much attention has been devoted to the construction of phylogenetic networks which generalize phylogenetic trees in order to accommodate complex evolutionary processes. Here we present an efficient, practical algorithm for reconstructing level-1 phylogenetic networks - a type of network slightly more general than a phylogenetic tree - from triplets. Our algorithm has been made publicly…

    Submitted 21 October, 2009; originally announced October 2009.