-
How quickly can you pack short paths? Engineering a search-tree algorithm for disjoint s-t paths of bounded length
Authors:
Michael Kiran Huber
Abstract:
We study the Short Path Packing problem which asks, given a graph $G$, integers $k$ and $\ell$, and vertices $s$ and $t$, whether there exist $k$ pairwise internally vertex-disjoint $s$-$t$ paths of length at most $\ell$. The problem has been proven to be NP-hard and fixed-parameter tractable parameterized by $k$ and $\ell$. Most previous research on this problem has been theoretical with limited practical implementations. We present an exact FPT-algorithm based on a search-tree approach in combination with greedy localization. While its worst-case runtime complexity of $(k\cdot \ell^2)^{k\cdot \ell}\cdot n^{O(1)}$ is larger than the state of the art, the nature of search-tree algorithms allows for a broad range of potential optimizations. We exploit this potential by presenting techniques for input preprocessing, early detection of trivial and infeasible instances, and strategic selection of promising subproblems. Those approaches were implemented and heavily tested on a large dataset of diverse graphs. The results show that our heuristic improvements are very effective and that for the majority of instances, we can achieve fast runtimes.
Submitted 16 April, 2024;
originally announced April 2024.
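To make the problem statement in the abstract above concrete, here is a minimal brute-force sketch of the Short Path Packing decision problem. It is not the search-tree algorithm with greedy localization described in the paper; it simply enumerates all simple $s$-$t$ paths with at most $\ell$ edges and backtracks over them. The function names and the adjacency-dictionary input format are assumptions made for illustration, and the graph is assumed to be simple and undirected.

```python
def short_paths(adj, s, t, max_len):
    """Yield every simple s-t path that uses at most max_len edges."""
    stack = [(s, [s])]
    while stack:
        v, path = stack.pop()
        if v == t:
            yield path
            continue
        if len(path) - 1 == max_len:
            continue  # path already has max_len edges, cannot be extended
        for w in adj[v]:
            if w not in path:
                stack.append((w, path + [w]))


def has_short_path_packing(adj, s, t, k, max_len):
    """Return True if k pairwise internally vertex-disjoint s-t paths
    of length at most max_len exist (exhaustive search, exponential time)."""
    # Only the internal vertices matter for disjointness.
    internals = [frozenset(p[1:-1]) for p in short_paths(adj, s, t, max_len)]

    def extend(start, used, remaining):
        if remaining == 0:
            return True
        for i in range(start, len(internals)):
            if internals[i].isdisjoint(used):
                if extend(i + 1, used | internals[i], remaining - 1):
                    return True
        return False

    return extend(0, frozenset(), k)


# Hypothetical example: paths s-a-t, s-b-t (length 2) and s-c-d-t (length 3).
example = {
    "s": {"a", "b", "c"},
    "a": {"s", "t"},
    "b": {"s", "t"},
    "c": {"s", "d"},
    "d": {"c", "t"},
    "t": {"a", "b", "d"},
}
```

On this example, three disjoint paths exist only once the length bound is raised from 2 to 3, which shows how the two parameters $k$ and $\ell$ interact.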
-
Magnetospheric Venus Space Explorers (MVSE) Mission: A Proposal for Understanding the Dynamics of Induced Magnetospheres
Authors:
Roland Albers,
Henrik Andrews,
Gabriele Boccacci,
Vasco D. C Pires,
Sunny Laddha,
Ville Lundén,
Nadim Maraqten,
João Matias,
Eva Krämer,
Leonard Schulz,
Ines Terraza Palanca,
Daniel Teubenbacher,
Claire Baskevitch,
Francesca Covella,
Luca Cressa,
Juan Garrido Moreno,
Jana Gillmayr,
Joshua Hollowood,
Kilian Huber,
Viktoria Kutnohorsky,
Sofia Lennerstrand,
Adel Malatinszky,
Davide Manzini,
Manuel Maurer,
Daiana Maria Alessandra Nidele, et al. (5 additional authors not shown)
Abstract:
Induced magnetospheres form around planetary bodies with atmospheres through the interaction of the solar wind with their ionosphere. Induced magnetospheres are highly dependent on the solar wind conditions and have only been studied with single spacecraft missions in the past. This gap in knowledge could be addressed by a multi-spacecraft plasma mission, optimized for studying global spatial and temporal variations in the magnetospheric system around Venus, which hosts the most prominent example of an induced magnetosphere in our solar system. The MVSE mission comprises four satellites, of which three are identical scientific spacecraft, carrying the same suite of instruments probing different regions of the induced magnetosphere and the solar wind simultaneously. The fourth spacecraft is the transfer vehicle which acts as a relay satellite for communications at Venus. In this way, changes in the solar wind conditions and extreme solar events can be observed, and their effects can be quantified as they propagate through the Venusian induced magnetosphere. Additionally, energy transfer in the Venusian induced magnetosphere can be investigated. The scientific payload includes instrumentation to measure the magnetic field, electric field, and ion-electron velocity distributions. This study presents the scientific motivation for the mission as well as requirements and the resulting mission design. Concretely, a mission timeline along with a complete spacecraft design, including mass, power, communication, propulsion and thermal budgets, are given. This mission was initially conceived at the Alpbach Summer School 2022 and refined during a week-long study at ESA's Concurrent Design Facility in Redu, Belgium.
Submitted 12 December, 2023;
originally announced December 2023.
-
Phylogenetic trees defined by at most three characters
Authors:
Katharina T. Huber,
Simone Linz,
Vincent Moulton,
Charles Semple
Abstract:
In evolutionary biology, phylogenetic trees are commonly inferred from a set of characters (partitions) of a collection of biological entities (e.g., species or individuals in a population). Such characters naturally arise from molecular sequences or morphological data. Interestingly, it has been known for some time that any binary phylogenetic tree can be (convexly) defined by a set of at most four characters, and that there are binary phylogenetic trees for which three characters are not enough. Thus, it is of interest to characterise those phylogenetic trees that are defined by a set of at most three characters. In this paper, we provide such a characterisation, in particular proving that a binary phylogenetic tree $T$ is defined by a set of at most three characters precisely if $T$ has no internal subtree isomorphic to a certain tree.
Submitted 15 November, 2023;
originally announced November 2023.
-
Is this network proper forest-based?
Authors:
Katharina T. Huber,
Leo van Iersel,
Vincent Moulton,
Guillaume Scholz
Abstract:
In evolutionary biology, networks are becoming increasingly used to represent evolutionary histories for species that have undergone non-treelike or reticulate evolution. Such networks are essentially directed acyclic graphs with a leaf set that corresponds to a collection of species, and in which non-leaf vertices with indegree 1 correspond to speciation events and vertices with indegree greater than 1 correspond to reticulate events such as gene transfer. Recently forest-based networks have been introduced, which are essentially (multi-rooted) networks that can be formed by adding some arcs to a collection of phylogenetic trees (or phylogenetic forest), where each arc is added in such a way that its ends always lie in two different trees in the forest. In this paper, we consider the complexity of deciding whether or not a given network is proper forest-based, that is, whether it can be formed by adding arcs to some underlying phylogenetic forest which contains the same number of trees as there are roots in the network. More specifically, we show that it can be decided in polynomial time whether or not a binary, tree-child network with $m \ge 2$ roots is proper forest-based in case $m=2$, but that this problem is NP-complete for $m\ge 3$. We also give a fixed parameter tractable (FPT) algorithm for deciding whether or not a network in which every vertex has indegree at most 2 is proper forest-based. A key element in proving our results is a new characterization for when a network with $m$ roots is proper forest-based which is given in terms of the existence of certain $m$-colorings of the vertices of the network.
Submitted 22 August, 2023;
originally announced August 2023.
-
Shared ancestry graphs and symbolic arboreal maps
Authors:
Katharina T. Huber,
Vincent Moulton,
Guillaume E. Scholz
Abstract:
A network $N$ on a finite set $X$, $|X|\geq 2$, is a connected directed acyclic graph with leaf set $X$ in which every root in $N$ has outdegree at least 2 and no vertex in $N$ has indegree and outdegree equal to 1; $N$ is arboreal if the underlying unrooted, undirected graph of $N$ is a tree. Networks are of interest in evolutionary biology since they are used, for example, to represent the evolutionary history of a set $X$ of species whose ancestors have exchanged genes in the past. For $M$ some arbitrary set of symbols, $d:{X \choose 2} \to M \cup \{\odot\}$ is a symbolic arboreal map if there exists some arboreal network $N$ whose vertices with outdegree two or more are labelled by elements in $M$ and so that $d(\{x,y\})$, $\{x,y\} \in {X \choose 2}$, is equal to the label of the least common ancestor of $x$ and $y$ in $N$ if this exists and $\odot$ else. Important examples of symbolic arboreal maps include the symbolic ultrametrics, which arise in areas such as game theory, phylogenetics and cograph theory. In this paper we show that a map $d:{X \choose 2} \to M \cup \{\odot\}$ is a symbolic arboreal map if and only if $d$ satisfies certain 3- and 4-point conditions and the graph with vertex set $X$ and edge set consisting of those pairs $\{x,y\} \in {X \choose 2}$ with $d(\{x,y\}) \neq \odot$ is Ptolemaic. To do this, we introduce and prove a key theorem concerning the shared ancestry graph for a network $N$ on $X$, where this is the graph with vertex set $X$ and edge set consisting of those $\{x,y\} \in {X \choose 2}$ such that $x$ and $y$ share a common ancestor in $N$. In particular, we show that for any connected graph $G$ with vertex set $X$ and edge clique cover $K$ in which there are no two distinct sets in $K$ with one a subset of the other, there is some network with $|K|$ roots and leaf set $X$ whose shared ancestry graph is $G$.
Submitted 11 August, 2023;
originally announced August 2023.
-
Cherry picking in forests: A new characterization for the unrooted hybrid number of two phylogenetic trees
Authors:
Katharina T. Huber,
Simone Linz,
Vincent Moulton
Abstract:
Phylogenetic networks are a special type of graph which generalize phylogenetic trees and that are used to model non-treelike evolutionary processes such as recombination and hybridization. In this paper, we consider {\em unrooted} phylogenetic networks, i.e. simple, connected graphs $\mathcal{N}=(V,E)$ with leaf set $X$, for $X$ some set of species, in which every internal vertex in $\mathcal{N}$ has degree three. One approach used to construct such phylogenetic networks is to take as input a collection $\mathcal{P}$ of phylogenetic trees and to look for a network $\mathcal{N}$ that contains each tree in $\mathcal{P}$ and that minimizes the quantity $r(\mathcal{N}) = |E|-(|V|-1)$ over all such networks. Such a network always exists, and the quantity $r(\mathcal{N})$ for an optimal network $\mathcal{N}$ is called the hybrid number of $\mathcal{P}$. In this paper, we give a new characterization for the hybrid number in case $\mathcal{P}$ consists of two trees. This characterization is given in terms of a cherry picking sequence for the two trees, although to prove that our characterization holds we need to define the sequence more generally for two forests. Cherry picking sequences have been intensively studied for collections of rooted phylogenetic trees, but our new sequences are the first variant of this concept that can be applied in the unrooted setting. Since the hybrid number of two trees is equal to the well-known tree bisection and reconnection distance between the two trees, our new characterization also provides an alternative way to understand this important tree distance.
Submitted 23 February, 2024; v1 submitted 15 December, 2022;
originally announced December 2022.
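The quantity $r(\mathcal{N}) = |E|-(|V|-1)$ from the abstract above is simple to evaluate for a given network. The following sketch computes it from an edge list; the function name and the input format are assumptions chosen for illustration, and the code does not verify that the graph is connected or that internal vertices have degree three.

```python
def reticulation_number(edges):
    """Compute r(N) = |E| - (|V| - 1) for a graph given as a list of
    undirected edges (pairs of vertex labels). Connectivity is assumed,
    not checked."""
    vertices = {v for edge in edges for v in edge}
    return len(edges) - (len(vertices) - 1)


# Hypothetical examples on the leaf set {a, b, c, d}.
# An unrooted quartet tree: r = 5 - (6 - 1) = 0.
quartet_tree = [("a", "u"), ("b", "u"), ("u", "v"), ("v", "c"), ("v", "d")]

# A 4-cycle of internal vertices, each carrying one leaf: r = 8 - (8 - 1) = 1.
one_cycle_network = [
    ("u1", "u2"), ("u2", "u3"), ("u3", "u4"), ("u4", "u1"),
    ("u1", "a"), ("u2", "b"), ("u3", "c"), ("u4", "d"),
]
```

A tree always gives $r = 0$, and each additional independent cycle raises the value by one, which is why minimizing $r(\mathcal{N})$ favors the most tree-like network containing the input trees.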
-
SPRINT: A fast, new software tool for reconstructing the evolutionary past of polyploid datasets
Authors:
Liam J. Maher,
Taoyang Wu,
Katharina T. Huber
Abstract:
Polyploidization is an important evolutionary process which affects organisms ranging from plants to fish and fungi. The signal left behind by it is in the form of a species' ploidy level (number of complete chromosome sets found in a cell) which is inherently non-treelike. Currently available tools for reconstructing the evolutionary past of a polyploid dataset generally start with a multi-labelled tree obtained for a dataset of interest and then derive a (phylogenetic) network from that tree in some way that reflects that past by interpreting the network's vertices of indegree at least two as polyploidization events. Since obtaining such a tree can be computationally expensive, it is paramount to have alternative approaches available that allow one to shed light on the reticulate evolutionary past of a polyploid dataset. SPRINT aims to reconstruct the evolutionary past of a polyploid dataset in terms of a binary network which realises the dataset's ploidy profile (vector of ploidy levels of the dataset's taxa) and requires the fewest number of polyploidization events. It does this by representing the ploidy level of a species x in terms of the number of directed paths from the root of the network to the leaf of the network labelled by x. SPRINT is distributed on GitHub: https://github.com/lmaher1/SPRINT.
Submitted 9 November, 2022;
originally announced November 2022.
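The path-counting idea in the abstract above can be sketched in a few lines. This is not SPRINT's code or interface; it only illustrates how a ploidy profile can be read off a given rooted network by counting directed root-to-leaf paths, assuming the ploidy level is taken to be exactly that count. The function name, the dictionary-of-children input format, and the example network are hypothetical.

```python
from functools import lru_cache


def ploidy_profile(children, root):
    """Map each leaf of a rooted directed acyclic network to the number of
    directed paths from the root to that leaf.

    `children` maps a vertex to a list of its children; a child listed
    twice is treated as two parallel arcs."""
    parents = {}
    for u, kids in children.items():
        for w in kids:
            parents.setdefault(w, []).append(u)

    @lru_cache(maxsize=None)
    def paths_to(v):
        # Paths into v are the paths into its parents, one per incoming arc.
        if v == root:
            return 1
        return sum(paths_to(p) for p in parents.get(v, []))

    leaves = [v for v in parents if not children.get(v)]
    return {x: paths_to(x) for x in leaves}


# Hypothetical network: h has indegree two (one polyploidization event),
# so the leaf x below h is reached by two root-to-leaf paths.
network = {"r": ["u", "v"], "u": ["h", "y"], "v": ["h", "z"], "h": ["x"]}
```

Here `ploidy_profile(network, "r")` assigns 2 to `x` and 1 to both `y` and `z`, so a single vertex of indegree two is enough to realise this profile.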
-
Injective split systems
Authors:
M. Hellmuth,
K. T. Huber,
V. Moulton,
G. E. Scholz,
P. F. Stadler
Abstract:
A split system $\mathcal S$ on a finite set $X$, $|X|\ge3$, is a set of bipartitions or splits of $X$ which contains all splits of the form $\{x,X-\{x\}\}$, $x \in X$. To any such split system $\mathcal S$ we can associate the Buneman graph $\mathcal B(\mathcal S)$ which is essentially a median graph with leaf-set $X$ that displays the splits in $\mathcal S$. In this paper, we consider properties of injective split systems, that is, split systems $\mathcal S$ with the property that $\mathrm{med}_{\mathcal B(\mathcal S)}(Y) \neq \mathrm{med}_{\mathcal B(\mathcal S)}(Y')$ for any 3-subsets $Y,Y'$ in $X$, where $\mathrm{med}_{\mathcal B(\mathcal S)}(Y)$ denotes the median in $\mathcal B(\mathcal S)$ of the three elements in $Y$ considered as leaves in $\mathcal B(\mathcal S)$. In particular, we show that for any set $X$ there always exists an injective split system on $X$, and we also give a characterization for when a split system is injective. We also consider how complex the Buneman graph $\mathcal B(\mathcal S)$ needs to become in order for a split system $\mathcal S$ on $X$ to be injective. We do this by introducing a quantity for $|X|$ which we call the injective dimension for $|X|$, as well as two related quantities, called the injective 2-split and the rooted-injective dimension. We derive some upper and lower bounds for all three of these dimensions and also prove that some of these bounds are tight. An underlying motivation for studying injective split systems is that they can be used to obtain a natural generalization of symbolic tree maps. An important consequence of our results is that any three-way symbolic map on $X$ can be represented using Buneman graphs.
Submitted 8 November, 2022;
originally announced November 2022.
-
Autopolyploidy, allopolyploidy, and phylogenetic networks with horizontal arcs
Authors:
Katharina T. Huber,
Liam J. Maher
Abstract:
Polyploidization is an evolutionary process by which a species acquires multiple copies of its complete set of chromosomes. The reticulate nature of the signal left behind by it means that phylogenetic networks offer themselves as a framework to reconstruct the evolutionary past of species affected by it. The main strategy for doing this is to first construct a so-called multiple-labelled tree and to then somehow derive such a network from it. The following question therefore arises: How much can be said about that past if such a tree is not readily available? By viewing a polyploid dataset as a certain vector which we call a ploidy (level) profile we show that, among other results, there always exists a phylogenetic network in the form of a beaded phylogenetic tree with additional arcs that realizes a given ploidy profile. Intriguingly, the two end vertices of almost all of these additional arcs can be interpreted as having co-existed in time, thereby adding biological realism to our network, a feature that is, in general, not enjoyed by phylogenetic networks. In addition, we show that our network may be viewed as a generator of ploidy profile space, a novel concept similar to phylogenetic tree space that we introduce to be able to compare phylogenetic networks that realize one and the same ploidy profile. We illustrate our findings in terms of a publicly available Viola dataset.
Submitted 19 February, 2023; v1 submitted 11 October, 2022;
originally announced October 2022.
-
Forest-based networks
Authors:
Katharina T. Huber,
Vincent Moulton,
Guillaume E. Scholz
Abstract:
In evolutionary studies it is common to use phylogenetic trees to represent the evolutionary history of a set of species. However, in case the transfer of genes or other genetic information between the species or their ancestors has occurred in the past, a tree may not provide a complete picture of their history. In such cases, tree-based phylogenetic networks can provide a useful, more refined representation of the species' evolution. Such a network is essentially a phylogenetic tree with some arcs added between the tree edges so as to represent reticulate events such as gene transfer. Even so, this model does not permit the representation of evolutionary scenarios where reticulate events have taken place between different subfamilies or lineages of species. To represent such scenarios, in this paper we introduce the notion of a forest-based phylogenetic network, that is, a collection of leaf-disjoint phylogenetic trees on a set of species with arcs added between the edges of distinct trees within the collection. Forest-based networks include the recently introduced class of overlaid species forests which are used to model introgression. As we shall see, even though the definition of forest-based networks is closely related to that of tree-based networks, they lead to new mathematical theory which complements that of tree-based networks. As well as studying the relationship of forest-based networks with other classes of phylogenetic networks, such as tree-child networks and universal tree-based networks, we present some characterizations of some special classes of forest-based networks. We expect that our results will be useful for developing new models and algorithms to understand reticulate evolution, such as gene transfer between collections of bacteria that live in different environments.
Submitted 14 February, 2022;
originally announced February 2022.
-
The space of equidistant phylogenetic cactuses
Authors:
Katharina T. Huber,
Vincent Moulton,
Megan Owen,
Andreas Spillner,
Katherine St. John
Abstract:
We introduce and investigate the space of \emph{equidistant} $X$-\emph{cactuses}. These are rooted, arc weighted, phylogenetic networks with leaf set $X$, where $X$ is a finite set of species, and all leaves have the same distance from the root. The space contains as a subset the space of ultrametric trees on $X$ that was introduced by Gavryushkin and Drummond. We show that equidistant-cactus space is a CAT(0)-metric space which implies, for example, that there are unique geodesic paths between points. As a key step to proving this, we present a combinatorial result concerning \emph{ranked} rooted $X$-cactuses. In particular, we show that such networks can be encoded in terms of a pairwise compatibility condition arising from a poset of collections of pairs of subsets of $X$ that satisfy certain set-theoretic properties. As a corollary, we also obtain an encoding of ranked, rooted $X$-trees in terms of partitions of $X$, which provides an alternative proof that the space of ultrametric trees on $X$ is CAT(0). As with spaces of phylogenetic trees, we expect that our results should provide the basis for and new directions in performing statistical analyses for collections of phylogenetic networks with arc lengths.
Submitted 11 November, 2021;
originally announced November 2021.
-
Diversities and the Generalized Circumradius
Authors:
David Bryant,
Katharina T. Huber,
Vincent Moulton,
Paul F. Tupper
Abstract:
The generalized circumradius of a set of points $A \subseteq \mathbb{R}^d$ with respect to a convex body $K$ equals the minimum value of $\lambda \geq 0$ such that $A$ is contained in a translate of $\lambda K$. Each choice of $K$ gives a different function on the set of bounded subsets of $\mathbb{R}^d$; we characterize which functions can arise in this way. Our characterization draws on the theory of diversities, a recently introduced generalization of metrics from functions on pairs to functions on finite subsets. We additionally investigate functions which arise by restricting the generalized circumradius to a finite subset of $\mathbb{R}^d$. We obtain elegant characterizations in the case that $K$ is a simplex or parallelotope.
Submitted 31 January, 2023; v1 submitted 25 October, 2021;
originally announced October 2021.
-
Encoding and ordering X-cactuses
Authors:
Andrew Francis,
Katharina T. Huber,
Vincent Moulton,
Taoyang Wu
Abstract:
Phylogenetic networks are a generalization of evolutionary or phylogenetic trees that are commonly used to represent the evolution of species which cross with one another. A special type of phylogenetic network is an {\em $X$-cactus}, which is essentially a cactus graph in which all vertices with degree less than three are labelled by at least one element from a set $X$ of species. In this paper, we present a way to {\em encode} $X$-cactuses in terms of certain collections of partitions of $X$ that naturally arise from $X$-cactuses. Using this encoding, we also introduce a partial order on the set of $X$-cactuses (up to isomorphism), and derive some structural properties of the resulting partially ordered set. This includes an analysis of some properties of its least upper and greatest lower bounds. Our results not only extend some fundamental properties of phylogenetic trees to $X$-cactuses, but also provide a new approach to solving topical problems in phylogenetic network theory such as deriving consensus networks.
Submitted 7 September, 2021;
originally announced September 2021.
-
Phylogenetic consensus networks: Computing a consensus of 1-nested phylogenetic networks
Authors:
Katharina T. Huber,
Vincent Moulton,
Andreas Spillner
Abstract:
An important and well-studied problem in phylogenetics is to compute a \emph{consensus tree} so as to summarize the common features within a collection of rooted phylogenetic trees, all of whose leaf-sets are bijectively labeled by the same set $X$ of species. More recently, however, it has become of interest to find a consensus for a collection of more general, rooted directed acyclic graphs all of whose sink-sets are bijectively labeled by $X$, so-called rooted \emph{phylogenetic networks}. These networks are used to analyse the evolution of species that cross with one another, such as plants and viruses. In this paper, we introduce an algorithm for computing a consensus for a collection of so-called 1-\emph{nested} phylogenetic networks. Our approach builds on a previous result by Roselló et al. that describes an encoding for any 1-nested phylogenetic network in terms of a collection of ordered pairs of subsets of $X$. More specifically, we characterize those collections of ordered pairs that arise as the encoding of some 1-nested phylogenetic network, and then use this characterization to compute a \emph{consensus network} for a collection of $t$ 1-nested networks in $O(t|X|^2+|X|^3)$ time. Applying our algorithm to a collection of phylogenetic trees yields the well-known majority rule consensus tree. Our approach leads to several new directions for future work, and we expect that it should provide a useful new tool to help understand complex evolutionary scenarios.
Submitted 20 July, 2021;
originally announced July 2021.
-
The hybrid number of a ploidy profile
Authors:
Katharina T. Huber,
Liam J. Maher
Abstract:
Polyploidization, whereby an organism inherits multiple copies of the genome of their parents, is an important evolutionary event that has been observed in plants and animals. One way to study such events is in terms of the ploidy number of the species that make up a dataset of interest. It is therefore natural to ask: How much information about the evolutionary past of the set of species that form a dataset can be gleaned from the ploidy numbers of the species? To help answer this question, we introduce and study the novel concept of a ploidy profile which allows us to formalize it in terms of a multiplicity vector indexed by the species the dataset is comprised of. Using the framework of a phylogenetic network, we present a closed formula for computing the hybrid number (i.e. the minimal number of polyploidization events required to explain a ploidy profile) of a large class of ploidy profiles. This formula relies on the construction of a certain phylogenetic network from the simplification sequence of a ploidy profile and the hybrid number of the ploidy profile with which this construction is initialized. Both of them can be computed easily in case the ploidy numbers that make up the ploidy profile are not too large. To help illustrate the applicability of our approach, we apply it to a simplified version of a publicly available Viola dataset.
Submitted 11 August, 2022; v1 submitted 26 January, 2021;
originally announced January 2021.
-
Level-$2$ networks from shortest and longest distances
Authors:
Katharina T. Huber,
Leo van Iersel,
Remie Janssen,
Mark Jones,
Vincent Moulton,
Yukihiro Murakami
Abstract:
Recently it was shown that a certain class of phylogenetic networks, called level-$2$ networks, cannot be reconstructed from their associated distance matrices. In this paper, we show that they can be reconstructed from their induced shortest and longest distance matrices. That is, if two level-$2$ networks induce the same shortest and longest distance matrices, then they must be isomorphic. We further show that level-$2$ networks are reconstructible from their shortest distance matrices if and only if they do not contain a subgraph from a family of graphs. A generator of a network is the graph obtained by deleting all pendant subtrees and suppressing degree-$2$ vertices. We also show that networks with a leaf on every generator side are reconstructible from their induced shortest distance matrix, regardless of level.
△ Less
Submitted 11 August, 2021; v1 submitted 21 January, 2021;
originally announced January 2021.
-
From Kepler's Laws to Newtonian Motion and the Direction Angle of Hamilton's Hodograph
Authors:
Klaus Huber
Abstract:
In this contribution it is shown that the path from Kepler's results to Newtonian motion can be remarkably short and simple. Following this path we also give a straightforward computation of the direction angle of Hamilton's Hodograph. Then we show how the speed as a function of the direction angle can be expressed and inverted elegantly using elliptic functions.
Submitted 13 January, 2021;
originally announced January 2021.
-
Overlaid species forests
Authors:
K. T. Huber,
V. Moulton,
G. E. Scholz
Abstract:
Introgression is an evolutionary process in which genes or other types of genetic material are introduced into a genome. It is an important evolutionary process that can, for example, play a fundamental role in speciation. Recently the concept of an overlaid species forest was introduced to represent introgression histories. Basically this approach takes a putative gene history in the form of a phylogenetic gene tree and tries to overlay this onto a forest which usually consists of a collection of lineage trees for the species of interest. The result is a network called an overlaid species forest in which genes jump or introgress between lineages. In this paper we study properties of overlaid species forests, showing that they have various connections with models for lateral gene transfer, maximum parsimony, and unfolding of phylogenetic networks. In particular, we show that a certain algorithm called OSF-BUILDER for constructing overlaid species forests is guaranteed to produce a special type of overlaid species forest with a minimum number of introgressions, and we provide some characterizations for networks that can arise from overlaid species forests. We expect that these results will be useful in developing new methods for representing introgression histories, a growing area of interest in phylogenetics.
Submitted 25 June, 2020;
originally announced June 2020.
-
Weakly displaying trees in temporal tree-child network
Authors:
Katharina T. Huber,
Simone Linz,
Vincent Moulton
Abstract:
Recently there has been considerable interest in the problem of finding a phylogenetic network with a minimum number of reticulation vertices which displays a given set of phylogenetic trees, that is, a network with minimum hybrid number. Even so, for certain evolutionary scenarios insisting that a network displays the set of trees can be an overly restrictive assumption. In this paper, we consider the less restrictive notion of displaying called weakly displaying and, in particular, a special case of this which we call rigidly displaying. We characterize when two trees can be rigidly displayed by a temporal tree-child network in terms of fork-picking sequences, a concept that is closely related to that of cherry-picking sequences. We also show that, in case it exists, the rigid hybrid number for two phylogenetic trees is given by a minimum weight fork-picking sequence for the trees, and that the rigid hybrid number can be quite different from the related beaded- and temporal-hybrid numbers.
Submitted 3 April, 2020;
originally announced April 2020.
-
Recognizing and realizing cactus metrics
Authors:
Momoko Hayamizu,
Katharina T. Huber,
Vincent Moulton,
Yukihiro Murakami
Abstract:
The problem of realizing finite metric spaces in terms of weighted graphs has many applications. For example, the mathematical and computational properties of metrics that can be realized by trees have been well-studied and such research has laid the foundation of the reconstruction of phylogenetic trees from evolutionary distances. However, as trees may be too restrictive to accurately represent real-world data or phenomena, it is important to understand the relationship between more general graphs and distances. In this paper, we introduce a new type of metric called a cactus metric, that is, a metric that can be realized by a cactus graph. We show that, just as with tree metrics, a cactus metric has a unique optimal realization. In addition, we describe an algorithm that can recognize whether or not a metric is a cactus metric and, if so, compute its optimal realization in $O(n^3)$ time, where $n$ is the number of points in the space.
Submitted 7 February, 2020; v1 submitted 5 August, 2019;
originally announced August 2019.
-
Orienting undirected phylogenetic networks
Authors:
Katharina T. Huber,
Leo van Iersel,
Remie Janssen,
Mark Jones,
Vincent Moulton,
Yukihiro Murakami,
Charles Semple
Abstract:
This paper studies the relationship between undirected (unrooted) and directed (rooted) phylogenetic networks. We describe a polynomial-time algorithm for deciding whether an undirected nonbinary phylogenetic network, given the locations of the root and reticulation vertices, can be oriented as a directed nonbinary phylogenetic network. Moreover, we characterize when this is possible and show that, in such instances, the resulting directed nonbinary phylogenetic network is unique. In addition, without being given the location of the root and the reticulation vertices, we describe an algorithm for deciding whether an undirected binary phylogenetic network $N$ can be oriented as a directed binary phylogenetic network of a certain class. The algorithm is fixed-parameter tractable (FPT) when the parameter is the level of $N$ and is applicable to classes of directed phylogenetic networks that satisfy certain conditions. As an example, we show that the well-studied class of binary tree-child networks satisfies these conditions.
Submitted 29 September, 2023; v1 submitted 18 June, 2019;
originally announced June 2019.
-
Reconciling Event-Labeled Gene Trees with MUL-trees and Species Networks
Authors:
Marc Hellmuth,
Katharina T. Huber,
Vincent Moulton
Abstract:
Phylogenomics commonly aims to construct evolutionary trees from genomic sequence information. One way to approach this problem is to first estimate event-labeled gene trees (i.e., rooted trees whose non-leaf vertices are labeled by speciation or gene duplication events), and to then look for a species tree which can be reconciled with this tree through a \emph{reconciliation map} between the trees. In practice, however, it can happen that there is no such map from a given event-labeled tree to \emph{any} species tree. An important situation where this might arise is where the species evolution is better represented by a \emph{network} instead of a tree. In this paper, we therefore consider the problem of reconciling event-labeled trees with species networks. In particular, we prove that any event-labeled gene tree can be reconciled with some network and that, under certain mild assumptions on the gene tree, the network can even be assumed to be multi-arc free. To prove this result, we show that we can always reconcile the gene tree with some multi-labeled (MUL-)tree, which can then be "folded up" to produce the desired reconciliation and network. In addition, we study the interplay between reconciliation maps from event-labeled gene trees to MUL-trees and networks. Our results could be useful for understanding how genomes have evolved after undergoing complex evolutionary events such as polyploidy.
Submitted 8 May, 2019; v1 submitted 21 December, 2018;
originally announced December 2018.
-
Joint cluster reconstructions: Combining free-form lensing and X-rays
Authors:
Korbinian Huber,
Celine Tchernin,
Julian Merten,
Stefan Hilbert,
Matthias Bartelmann
Abstract:
Galaxy clusters provide a multitude of observational data across wavelengths and their structure and morphology are of considerable interest in cosmology as well as astrophysics. We develop a framework that allows the combination of lensing and non-lensing observations in a free-form and mesh-free approach to infer the projected mass distribution of individual galaxy clusters. This method can be used to test common assumptions on the morphology of clusters in parametric models. We make use of the lensing reconstruction code SaWLens2 and expand its capabilities by incorporating an estimate of the projected gravitational potential based on X-ray data, which are deprojected using the local Richardson-Lucy method and used to infer the Newtonian potential of the cluster. We also discuss how potentially arising numerical artefacts can be treated. We demonstrate the feasibility of our method on a simplified mock NFW halo and on a cluster from a realistic hydrodynamical simulation and show how the combination of X-ray and weak lensing data can affect a free-form reconstruction, improving the accuracy in the central region in some cases by a factor of two.
Submitted 7 July, 2019; v1 submitted 19 December, 2018;
originally announced December 2018.
-
Phylogenetic networks that are their own fold-ups
Authors:
Katharina T. Huber,
Guillaume E. Scholz
Abstract:
Phylogenetic networks are becoming of increasing interest to evolutionary biologists due to their ability to capture complex non-treelike evolutionary processes. From a combinatorial point of view, such networks are certain types of rooted directed acyclic graphs whose leaves are labelled by, for example, species. A number of mathematically interesting classes of phylogenetic networks are known. These include the biologically relevant class of stable phylogenetic networks whose members are defined via certain "fold-up" and "un-fold" operations that link them with concepts arising within the theory of, for example, graph fibrations. Despite this exciting link, the structural complexity of stable phylogenetic networks is still relatively poorly understood. Employing the popular tree-based, reticulation-visible, and tree-child properties which allow one to gauge this complexity in one way or another, we provide novel characterizations for when a stable phylogenetic network satisfies any one of these three properties.
Submitted 18 October, 2019; v1 submitted 5 April, 2018;
originally announced April 2018.
-
The polytopal structure of the tight-span of a totally split-decomposable metric
Authors:
K. T. Huber,
J. Koolen,
V. Moulton
Abstract:
The tight-span of a finite metric space is a polytopal complex that has appeared in several areas of mathematics. In this paper we determine the polytopal structure of the tight-span of a totally split-decomposable (finite) metric. Totally split-decomposable metrics are a generalization of tree-metrics and have importance within phylogenetics. In previous work, we showed that the cells of the tight-span of such a metric are zonotopes that are polytope isomorphic to either hypercubes or rhombic dodecahedra. Here, we extend these results and show that the tight-span of a totally split-decomposable metric can be broken up into a canonical collection of polytopal complexes whose polytopal structures can be directly determined from the metric. This allows us to also completely determine the polytopal structure of the tight-span of a totally split-decomposable metric in a very direct way. We anticipate that our improved understanding of this structure may ultimately lead to improved techniques for phylogenetic inference.
Submitted 5 April, 2018;
originally announced April 2018.
-
The complexity of comparing multiply-labelled trees by extending phylogenetic-tree metrics
Authors:
Manuel Lafond,
Nadia El-Mabrouk,
Katharina T. Huber,
Vincent Moulton
Abstract:
A multilabelled tree (or MUL-tree) is a rooted tree in which every leaf is labelled by an element from some set, but in which more than one leaf may be labelled by the same element of that set. In phylogenetics, such trees are used in biogeographical studies, to study the evolution of gene families, and also within approaches to construct phylogenetic networks. A multilabelled tree in which no leaf-labels are repeated is called a phylogenetic tree, and one in which every label is the same is also known as a tree-shape. In this paper, we consider the complexity of computing metrics on MUL-trees that are obtained by extending metrics on phylogenetic trees. In particular, by restricting our attention to tree-shapes, we show that computing the metric extension on MUL-trees is NP-complete for two well-known metrics on phylogenetic trees, namely, the path-difference and Robinson-Foulds distances. We also show that the extension of the Robinson-Foulds distance is fixed-parameter tractable with respect to the distance parameter. The path-difference complexity result allows us to also answer an open problem concerning the complexity of solving the quadratic assignment problem for two matrices that are a Robinson similarity and a Robinson dissimilarity, which we show to be NP-complete. We conclude by considering the extension of the maximum agreement subtree (MAST) distance on phylogenetic trees to MUL-trees. Although this extension can be computed in polynomial time, we show that computing its natural generalization to more than two MUL-trees is NP-complete, although fixed-parameter tractable in the maximum degree when the number of given trees is bounded.
Submitted 15 March, 2018;
originally announced March 2018.
-
Reconstruction of the two-dimensional gravitational potential of galaxy clusters from X-ray and Sunyaev-Zel'dovich measurements
Authors:
C. Tchernin,
M. Bartelmann,
K. Huber,
A. Dekel,
G. Hurier,
C. L. Majer,
S. Meyer,
E. Zinger,
D. Eckert,
M. Meneghetti,
J. Merten
Abstract:
The mass of galaxy clusters is not a direct observable; nonetheless, it is commonly used to probe cosmological models. Based on the combination of all main cluster observables, that is, the X-ray emission, the thermal Sunyaev-Zel'dovich (SZ) signal, the velocity dispersion of the cluster galaxies, and gravitational lensing, the gravitational potential of galaxy clusters can be jointly reconstructed. We derive the two main ingredients required for this joint reconstruction: the potentials individually reconstructed from the observables and their covariance matrices, which act as a weight in the joint reconstruction. We show here the method to derive these quantities. The result of the joint reconstruction applied to a real cluster will be discussed in a forthcoming paper. We apply the Richardson-Lucy deprojection algorithm to data on a two-dimensional (2D) grid. We first test the 2D deprojection algorithm on a $β$-profile. Assuming hydrostatic equilibrium, we further reconstruct the gravitational potential of a simulated galaxy cluster based on synthetic SZ and X-ray data. We then reconstruct the projected gravitational potential of the massive and dynamically active cluster Abell 2142, based on the X-ray observations collected with XMM-Newton and the SZ observations from the Planck satellite. Finally, we compute the covariance matrix of the projected reconstructed potential of the cluster Abell 2142 based on the X-ray measurements collected with XMM-Newton. The gravitational potentials of the simulated cluster recovered from synthetic X-ray and SZ data are consistent, even though the potential reconstructed from X-rays shows larger deviations from the true potential. Regarding Abell 2142, the projected gravitational cluster potentials recovered from SZ and X-ray data reproduce well the projected potential inferred from gravitational-lensing observations. (abridged)
Submitted 20 February, 2018;
originally announced February 2018.
-
Recovering tree-child networks from shortest inter-taxa distance information
Authors:
Magnus Bordewich,
Katharina T Huber,
Vincent Moulton,
Charles Semple
Abstract:
Phylogenetic networks are a type of leaf-labelled, acyclic, directed graph used by biologists to represent the evolutionary history of species whose past includes reticulation events. A phylogenetic network is tree-child if each non-leaf vertex is the parent of a tree vertex or a leaf. Up to a certain equivalence, it has been recently shown that, under two different types of weightings, edge-weighted tree-child networks are determined by their collection of distances between each pair of taxa. However, the size of these collections can be exponential in the size of the taxa set. In this paper, we show that, if we ignore redundant edges, the same results are obtained with only a quadratic number of inter-taxa distances by using the shortest distance between each pair of taxa. The proofs are constructive and give cubic-time algorithms in the size of the taxa sets for building such weighted networks.
Submitted 23 November, 2017;
originally announced November 2017.
-
Quarnet inference rules for level-1 networks
Authors:
Katharine T. Huber,
Vincent Moulton,
Charles Semple,
Taoyang Wu
Abstract:
An important problem in phylogenetics is the construction of phylogenetic trees. One way to approach this problem, known as the supertree method, involves inferring a phylogenetic tree with leaves consisting of a set $X$ of species from a collection of trees, each having leaf-set some subset of $X$. In the 1980s, characterizations in terms of certain inference rules were given for when a collection of 4-leaved trees, one for each 4-element subset of $X$, can all be simultaneously displayed by a single supertree with leaf-set $X$. Recently, it has become of interest to extend such results to phylogenetic networks. These are a generalization of phylogenetic trees which can be used to represent reticulate evolution (where species can come together to form a new species). It has been shown that a certain type of phylogenetic network, called a level-1 network, can essentially be constructed from 4-leaved trees. However, the problem of providing appropriate inference rules for such networks remains unresolved. Here we show that by considering 4-leaved networks, called quarnets, as opposed to 4-leaved trees, it is possible to provide such rules. In particular, we show that these rules can be used to characterize when a collection of quarnets, one for each 4-element subset of $X$, can all be simultaneously displayed by a level-1 network with leaf-set $X$. The rules are an intriguing mixture of tree inference rules, and an inference rule for building up a cyclic ordering of $X$ from orderings on subsets of $X$ of size 4. This opens up several new directions of research for inferring phylogenetic networks from smaller ones, which could yield new algorithms for solving the supernetwork problem in phylogenetics.
Submitted 17 November, 2017;
originally announced November 2017.
-
Phylogenetic flexibility via Hall-type inequalities and submodularity
Authors:
Katharina T. Huber,
Vincent Moulton,
Mike Steel
Abstract:
Given a collection $τ$ of subsets of a finite set $X$, we say that $τ$ is {\em phylogenetically flexible} if, for any collection $R$ of rooted phylogenetic trees whose leaf sets comprise the collection $τ$, $R$ is compatible (i.e. there is a rooted phylogenetic $X$--tree that displays each tree in $R$). We show that $τ$ is phylogenetically flexible if and only if it satisfies a Hall-type inequality condition of being `slim'. Using submodularity arguments, we show that there is a polynomial-time algorithm for determining whether or not $τ$ is slim. This `slim' condition reduces to a simpler inequality in the case where all of the sets in $τ$ have size 3, a property we call `thin'. Thin sets were recently shown to be equivalent to the existence of an (unrooted) tree for which the median function provides an injective mapping to its vertex set; we show here that the unrooted tree in this representation can always be chosen to be a caterpillar tree. We also characterise when a collection $τ$ of subsets of size 2 is thin (in terms of the flexibility of total orders rather than phylogenies) and show that this holds if and only if an associated bipartite graph is a forest. The significance of our results for phylogenetics is in providing precise and efficiently verifiable conditions under which supertree methods that require consistent inputs of trees can be applied to any input trees on given subsets of species.
Submitted 11 January, 2018; v1 submitted 24 October, 2017;
originally announced October 2017.
-
Three-way symbolic tree-maps and ultrametrics
Authors:
Katharina T. Huber,
Vincent Moulton,
Guillaume E. Scholz
Abstract:
Three-way dissimilarities are a generalization of (two-way) dissimilarities which can be used to indicate the lack of homogeneity or resemblance between any three objects. Such maps have applications in cluster analysis, and have been used in areas such as psychology and phylogenetics, where three-way data tables can arise. Special examples of such dissimilarities are three-way tree-metrics and ultrametrics, which arise from leaf-labelled trees with edges labelled by positive real numbers. Here we consider three-way maps which arise from leaf-labelled trees where instead the interior vertices are labelled by an arbitrary set of values. For unrooted trees we call such maps three-way symbolic tree-maps; for rooted trees we call them three-way symbolic ultrametrics since they can be considered as a generalization of the (two-way) symbolic ultrametrics of Böcker and Dress. We show that, as with two- and three-way tree-metrics and ultrametrics, three-way symbolic tree-maps and ultrametrics can be characterized via certain $k$-point conditions. In the unrooted case, our characterization is mathematically equivalent to one presented by Gurvich for a certain class of edge-labelled hypergraphs. We also show that it can be decided whether or not an arbitrary three-way symbolic map is a tree-map or a symbolic ultrametric using a triplet-based approach that relies on the so-called BUILD algorithm for deciding when a set of 3-leaved trees or triplets can be displayed by a single tree. We envisage that our results will be useful in developing new approaches and algorithms for understanding 3-way data, especially within the area of phylogenetics.
Submitted 18 October, 2017; v1 submitted 25 July, 2017;
originally announced July 2017.
-
Combinatorial properties of triplet covers for binary trees
Authors:
Stefan Gruenewald,
Katharina T. Huber,
Vincent Moulton,
Mike Steel
Abstract:
It is a classical result that an unrooted tree $T$ having positive real-valued edge lengths and no vertices of degree two can be reconstructed from the induced distance between each pair of leaves. Moreover, if each non-leaf vertex of $T$ has degree 3 then the number of distance values required is linear in the number of leaves. A canonical candidate for such a set of pairs of leaves in $T$ is the following: for each non-leaf vertex $v$, choose a leaf in each of the three components of $T-v$, group these three leaves into three pairs, and take the union of this set over all choices of $v$. This forms a so-called 'triplet cover' for $T$. In the first part of this paper we answer an open question (from 2012) by showing that the induced leaf-to-leaf distances for any triplet cover for $T$ uniquely determine $T$ and its edge lengths. We then investigate the finer combinatorial properties of triplet covers. In particular, we describe the structure of triplet covers that satisfy one or more of the following properties of being minimal, 'sparse', and 'shellable'.
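The triplet-cover construction described in this abstract can be sketched in a few lines of Python. This is an illustrative sketch only, not code from the paper: the adjacency-dictionary representation, the function names `leaves_of_component` and `triplet_cover`, and the rule of picking the smallest leaf label in each component are all assumptions made for the example. Any other choice of one leaf per component would give another valid triplet cover.

```python
from itertools import combinations


def leaves_of_component(tree, start, removed):
    """Leaves in the component of `tree` minus `removed` that contains `start`."""
    seen, stack, leaves = {removed, start}, [start], []
    while stack:
        node = stack.pop()
        if len(tree[node]) == 1:  # degree-1 vertices are the leaves of the tree
            leaves.append(node)
        stack.extend(n for n in tree[node] if n not in seen)
        seen.update(tree[node])
    return leaves


def triplet_cover(tree, pick=min):
    """Build one triplet cover of a tree whose non-leaf vertices have degree 3.

    For each non-leaf vertex v, one leaf is picked (hypothetical rule: `pick`)
    in each of the three components of tree - v, and the three pairs formed
    by those leaves are added to the cover.
    """
    cover = set()
    for v, neighbours in tree.items():
        if len(neighbours) != 3:
            continue  # skip leaves
        chosen = [pick(leaves_of_component(tree, n, v)) for n in neighbours]
        cover.update(frozenset(pair) for pair in combinations(chosen, 2))
    return cover


# Toy quartet tree with cherries {a, b} and {c, d} joined by the edge u-w.
quartet = {
    "a": ["u"], "b": ["u"], "c": ["w"], "d": ["w"],
    "u": ["a", "b", "w"], "w": ["c", "d", "u"],
}
cover = triplet_cover(quartet)
```

On the quartet tree the cover contains five of the six possible leaf pairs, which illustrates that a triplet cover need not contain every pair of leaves.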
Submitted 25 July, 2017;
originally announced July 2017.
-
Geometric medians in reconciliation spaces
Authors:
Katharina T. Huber,
Vincent Moulton,
Marie-France Sagot,
Blerina Sinaimeri
Abstract:
In evolutionary biology, it is common to study how various entities evolve together, for example, how parasites coevolve with their host, or genes with their species. Coevolution is commonly modelled by considering certain maps or reconciliations from one evolutionary tree $P$ to another $H$, all of which induce the same map $φ$ between the leaf-sets of $P$ and $H$ (corresponding to present-day associations). Recently, there has been much interest in studying spaces of reconciliations, which arise by defining some metric $d$ on the set $Rec(P,H,φ)$ of all possible reconciliations between $P$ and $H$.
In this paper, we study the following question: How do we compute a geometric median for a given subset $Ψ$ of $Rec(P,H,φ)$ relative to $d$, i.e. an element $ψ_{med} \in Rec(P,H,φ)$ such that $$ \sum_{ψ' \in Ψ} d(ψ_{med},ψ') \le \sum_{ψ' \in Ψ} d(ψ,ψ') $$ holds for all $ψ\in Rec(P,H,φ)$? For a model where so-called host-switches or transfers are not allowed, and for a commonly used metric $d$ called the edit-distance, we show that although the cardinality of $Rec(P,H,φ)$ can be super-exponential, it is still possible to compute a geometric median for a set $Ψ$ in $Rec(P,H,φ)$ in polynomial time. We expect that this result could be useful for computing a summary or consensus for a set of reconciliations (e.g. for a set of suboptimal reconciliations).
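The defining inequality for a geometric median can be made concrete with a small brute-force sketch in Python. This is not the polynomial-time method of the paper: it enumerates every candidate, which is infeasible when the candidate space is super-exponential. The function name `geometric_median`, the use of binary tuples as stand-ins for reconciliations, and the Hamming distance as a stand-in for the edit-distance are all assumptions made for illustration.

```python
from itertools import product


def geometric_median(candidates, subset, dist):
    """Return a candidate minimising the summed distance to the elements of `subset`."""
    return min(candidates, key=lambda c: sum(dist(c, s) for s in subset))


def hamming(x, y):
    """Number of positions in which two equal-length tuples differ."""
    return sum(a != b for a, b in zip(x, y))


# Toy stand-in for Rec(P, H, φ): all binary tuples of length 3.
space = list(product((0, 1), repeat=3))
# Toy stand-in for the subset Ψ.
psi = [(0, 0, 0), (0, 0, 1), (0, 1, 1)]

median = geometric_median(space, psi, hamming)
```

Here the minimiser is searched for in the whole space rather than only in the subset, matching the definition above, in which $ψ_{med}$ ranges over all of $Rec(P,H,φ)$.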
Submitted 3 July, 2017;
originally announced July 2017.
-
From event labeled gene trees to species trees
Authors:
Maribel Hernandez-Rosales,
Marc Hellmuth,
Nicolas Wieseke,
Katharina T. Huber,
Vincent Moulton,
Peter F. Stadler
Abstract:
Background: Tree reconciliation problems have long been studied in phylogenetics. A particular variant of the reconciliation problem for a gene tree T and a species tree S assumes that for each interior vertex x of T it is known whether x represents a speciation or a duplication. This problem appears in the context of analyzing orthology data.
Results: We show that S is a species tree for T if and only if S displays all rooted triples of T that have three distinct species as their leaves and are rooted in a speciation vertex. A valid reconciliation map can then be found in polynomial time. Simulated data shows that the event-labeled gene trees convey a large amount of information on underlying species trees, even for a large percentage of losses.
Conclusions: The knowledge of event labels in a gene tree strongly constrains the possible species tree and, for a given species tree, also the possible reconciliation maps. Nevertheless, many degrees of freedom remain in the space of feasible solutions. In order to disambiguate the alternative solutions additional external constraints as well as optimization criteria could be employed.
Submitted 10 May, 2017;
originally announced May 2017.
-
Tree-based unrooted phylogenetic networks
Authors:
Andrew Francis,
Katharina Huber,
Vincent Moulton
Abstract:
Phylogenetic networks are a generalization of phylogenetic trees that are used to represent non-tree-like evolutionary histories that arise in organisms such as plants and bacteria, or uncertainty in evolutionary histories. An \emph{unrooted} phylogenetic network on a nonempty, finite set $X$ of taxa, or \emph{network}, is a connected graph in which every vertex has degree 1 or 3 and whose leaf-set is $X$. It is called a \emph{phylogenetic tree} if the underlying graph is a tree. In this paper we consider properties of \emph{tree-based networks}, that is, networks that can be constructed by adding edges into a phylogenetic tree. We show that although they have some properties in common with their rooted analogues, which have recently drawn much attention in the literature, they have some striking differences in terms of both their structural and computational properties. We expect that our results could eventually have applications to, for example, detecting horizontal gene transfer or hybridization, which are important factors in the evolution of many organisms.
Submitted 7 December, 2017; v1 submitted 6 April, 2017;
originally announced April 2017.
-
Bounds for phylogenetic network space metrics
Authors:
Andrew Francis,
Katharina Huber,
Vincent Moulton,
Taoyang Wu
Abstract:
Phylogenetic networks are a generalization of phylogenetic trees that allow for representation of reticulate evolution. Recently, a space of unrooted phylogenetic networks was introduced, where such a network is a connected graph in which every vertex has degree 1 or 3 and whose leaf-set is a fixed set $X$ of taxa. This space, denoted $\mathcal{N}(X)$, is defined in terms of two operations on networks -- the nearest neighbor interchange and triangle operations -- which can be used to transform any network with leaf set $X$ into any other network with that leaf set. In particular, it gives rise to a metric $d$ on $\mathcal N(X)$ which is given by the smallest number of operations required to transform one network in $\mathcal N(X)$ into another in $\mathcal N(X)$. The metric generalizes the well-known NNI-metric on phylogenetic trees which has been intensively studied in the literature. In this paper, we derive a bound for the metric $d$ as well as a related metric $d_{N\!N\!I}$ which arises when restricting $d$ to the subset of $\mathcal{N}(X)$ consisting of all networks with $2(|X|-1+i)$ vertices, $i \ge 1$. We also introduce two new metrics on networks -- the SPR and TBR metrics -- which generalize the metrics on phylogenetic trees with the same name and give bounds for these new metrics. We expect our results to eventually have applications to the development and understanding of network search algorithms.
Submitted 8 March, 2017; v1 submitted 18 February, 2017;
originally announced February 2017.
-
Discovery of the secondary eclipse of HAT-P-11 b
Authors:
K. F. Huber,
S. Czesla,
J. H. M. M. Schmitt
Abstract:
We report the detection of the secondary eclipse of HAT-P-11 b, a Neptune-sized planet orbiting an active K4 dwarf. Using all available short-cadence data of the Kepler mission, we derive refined planetary ephemerides, increasing their precision by more than an order of magnitude. Our simultaneous primary and secondary transit modeling results in improved transit and orbital parameters. In particular, the precise timing of the secondary eclipse allows us to pin down the orbital eccentricity to $0.26459_{-0.00048}^{+0.00069}$. The secondary eclipse depth of $6.09_{-1.11}^{+1.12}$ ppm corresponds to a $5.5σ$ detection and results in a geometric albedo of $0.39\pm0.07$ for HAT-P-11 b, close to Neptune's value, which may indicate further resemblances between these two bodies. Due to the substantial orbital eccentricity, the planetary equilibrium temperature is expected to change significantly with orbital position and ought to vary between $630^\circ$ K and $950^\circ$ K, depending on the details of heat redistribution in the atmosphere of HAT-P-11 b.
Submitted 1 November, 2016;
originally announced November 2016.
-
Minimum triplet covers of binary phylogenetic $X$-trees
Authors:
Katharina T. Huber,
Vincent Moulton,
Mike Steel
Abstract:
Trees with labelled leaves and with all other vertices of degree three play an important role in systematic biology and other areas of classification. A classical combinatorial result ensures that such trees can be uniquely reconstructed from the distances between the leaves (when the edges are given any strictly positive lengths). Moreover, a linear number of these pairwise distance values suffices to determine both the tree and its edge lengths. A natural set of pairs of leaves is provided by any `triplet cover' of the tree (based on the fact that each non-leaf vertex is the median vertex of three leaves). In this paper we describe a number of new results concerning triplet covers of minimum size. In particular, we characterize such covers in terms of an associated graph being a 2-tree. Also, we show that minimum triplet covers are `shellable' and thereby provide a set of pairs for which the inter-leaf distance values will uniquely determine the underlying tree and its associated branch lengths.
Submitted 30 January, 2017; v1 submitted 23 October, 2016;
originally announced October 2016.
-
Beyond representing orthology relations by trees
Authors:
K. T. Huber,
G. E. Scholz
Abstract:
Reconstructing the evolutionary past of a family of genes is an important aspect of many genomic studies. To help with this, simple operations on a set of sequences called orthology relations may be employed. In addition to being interesting from a practical point of view, they are also attractive from a theoretical perspective in that, e.g., a characterization is known for when such a relation is representable by a certain type of phylogenetic tree. For an orthology relation inferred from real biological data, however, it is generally too much to hope that it satisfies that characterization. Rather than trying to correct the data in some way or another, which has its own drawbacks, we propose as an alternative to represent an orthology relation $δ$ in terms of a structure more general than a phylogenetic tree called a phylogenetic network. To compute such a network in the form of a level-1 representation for $δ$, we introduce the novel {\sc Network-Popping} algorithm, which has several attractive properties. In addition, we characterize orthology relations $δ$ on some set $X$ that have a level-1 representation in terms of eight natural properties for $δ$ as well as in terms of level-1 representations of orthology relations on certain subsets of $X$.
Submitted 15 March, 2016;
originally announced March 2016.
-
Transforming phylogenetic networks: Moving beyond tree space
Authors:
Katharina T. Huber,
Vincent Moulton,
Taoyang Wu
Abstract:
Phylogenetic networks are a generalization of phylogenetic trees that are used to represent reticulate evolution. Unrooted phylogenetic networks form a special class of such networks, which naturally generalize unrooted phylogenetic trees. In this paper we define two operations on unrooted phylogenetic networks, one of which is a generalization of the well-known nearest-neighbor interchange (NNI) operation on phylogenetic trees. We show that any unrooted phylogenetic network can be transformed into any other such network using only these operations. This generalizes the well-known fact that any phylogenetic tree can be transformed into any other such tree using only NNI operations. It also allows us to define a generalization of tree space and to define some new metrics on unrooted phylogenetic networks. To prove our main results, we employ some fascinating new connections between phylogenetic networks and cubic graphs that we have recently discovered. Our results should be useful in developing new strategies to search for optimal phylogenetic networks, a topic that has recently generated some interest in the literature, as well as for providing new ways to compare networks.
Submitted 8 January, 2016;
originally announced January 2016.
-
Bridging the gap between rooted and unrooted phylogenetic networks
Authors:
Philippe Gambette,
Katharina T. Huber,
Guillaume E. Scholz
Abstract:
The need for structures capable of accommodating complex evolutionary signals such as those found in, for example, wheat has fueled research into phylogenetic networks. Such structures generalize the standard phylogenetic tree model by also allowing cycles and have been introduced in rooted and unrooted form. In contrast to phylogenetic trees, however, surprisingly little is known about the interplay between both types thus hampering our ability to make much needed progress for rooted phylogenetic networks by drawing on insights from their much better understood unrooted counterparts. Unrooted phylogenetic networks are underpinned by split systems and by focusing on them we establish a first link between both types. More precisely, we develop a link between 1-nested phylogenetic networks which are examples of rooted phylogenetic networks and the well-studied median networks (aka Buneman graph) which are examples of unrooted phylogenetic networks. In particular, we show that not only can a 1-nested network be obtained from a median network but also that that network is, in a well-defined sense, optimal. Along the way, we characterize circular split systems in terms of the novel $\mathcal I$-intersection closure of a split system and establish the 1-nested analogue of the fundamental "Splits Equivalence Theorem" for phylogenetic trees.
Submitted 26 November, 2015;
originally announced November 2015.
-
On the challenge of reconstructing level-1 phylogenetic networks from triplets and clusters
Authors:
P. Gambette,
K. T. Huber,
S. Kelk
Abstract:
Phylogenetic networks have gained prominence over the years due to their ability to represent complex non-treelike evolutionary events such as recombination or hybridization. Popular combinatorial objects used to construct them are triplet systems and cluster systems, the motivation being that any network $N$ induces a triplet system $\mathcal R(N)$ and a softwired cluster system $\mathcal S(N)$. Since in real-world studies it cannot be guaranteed that all triplets/softwired clusters induced by a network are available it is of particular interest to understand whether subsets of $\mathcal R(N)$ or $\mathcal S(N)$ allow one to uniquely reconstruct the underlying network $N$. Here we show that even within the highly restricted yet biologically interesting space of level-1 phylogenetic networks it is not always possible to uniquely reconstruct a level-1 network $N$ even when all triplets in $\mathcal R(N)$ or all clusters in $\mathcal S(N)$ are available. On the positive side, we introduce a reasonably large subclass of level-1 networks the members of which are uniquely determined by their induced triplet/softwired cluster systems. Along the way, we also establish various enumerative results, both positive and negative, including results which show that certain special subclasses of level-1 networks $N$ can be uniquely reconstructed from proper subsets of $\mathcal R(N)$ and $\mathcal S(N)$. We anticipate these results to be of use in the design of, for example, algorithms for phylogenetic network inference.
Submitted 18 October, 2016; v1 submitted 25 November, 2015;
originally announced November 2015.
-
How do starspots influence the transit timing variations of exoplanets? Simulations of individual and consecutive transits
Authors:
P. Ioannidis,
K. F. Huber,
J. H. M. M. Schmitt
Abstract:
Transit timing variations (TTVs) of exoplanets are normally interpreted as the consequence of gravitational interaction with additional bodies in the system. However, TTVs can also be caused by deformations of the system transits by starspots, which might thus pose a serious complication in their interpretation. We therefore simulate transit light curves deformed by spot-crossing events for different properties of the stellar surface and the planet, such as starspot position, limb darkening, planetary period, and impact parameter. Mid-transit times determined from these simulations can be significantly shifted with respect to the input values; these shifts cannot be larger than ~1% of the transit duration and depend most strongly on the longitudinal position of the spot during the transit and the transit duration. Consequently, TTVs with amplitudes larger than the above limit are very unlikely to be caused by starspots. We also investigate whether TTVs from sequences of consecutive transits with spot-crossing anomalies can be misinterpreted as the result of an additional body in the system. We use the Generalized Lomb-Scargle periodogram to search for periods in TTVs and conclude that low amplitude TTVs with statistically significant periods around active stars are the most problematic cases. In those cases where the photometric precision is high enough to inspect the transit shapes for deformations, it should be possible to identify TTVs caused by starspots. However, especially for light curves with a low transit signal-to-noise ratio (TSNR $\lesssim$ 15), it becomes quite difficult to reliably decide whether these periods come from starspots, from physical companions in the system, or from random noise artifacts.
Submitted 12 October, 2015;
originally announced October 2015.
-
Folding and unfolding phylogenetic trees and networks
Authors:
Katharina T. Huber,
Vincent Moulton,
Mike Steel,
Taoyang Wu
Abstract:
Phylogenetic networks are rooted, labelled directed acyclic graphs which are commonly used to represent reticulate evolution. There is a close relationship between phylogenetic networks and multi-labelled trees (MUL-trees). Indeed, any phylogenetic network $N$ can be 'unfolded' to obtain a MUL-tree $U(N)$ and, conversely, a MUL-tree $T$ can in certain circumstances be 'folded' to obtain a phylogenetic network $F(T)$ that exhibits $T$. In this paper, we study properties of the operations $U$ and $F$ in more detail. In particular, we introduce the class of stable networks, phylogenetic networks $N$ for which $F(U(N))$ is isomorphic to $N$, characterise such networks, and show that they are related to the well-known class of tree-sibling networks. We also explore how the concept of displaying a tree in a network $N$ can be related to displaying the tree in the MUL-tree $U(N)$. To do this, we develop a phylogenetic analogue of graph fibrations. This allows us to view $U(N)$ as the analogue of the universal cover of a digraph, and to establish a close connection between displaying trees in $U(N)$ and reconciling phylogenetic trees with networks.
Submitted 14 June, 2015;
originally announced June 2015.
-
Reconstructing phylogenetic level-1 networks from nondense binet and trinet sets
Authors:
Katharina Huber,
Leo van Iersel,
Vincent Moulton,
Celine Scornavacca,
Taoyang Wu
Abstract:
Binets and trinets are phylogenetic networks with two and three leaves, respectively. Here we consider the problem of deciding if there exists a binary level-1 phylogenetic network displaying a given set $\mathcal{T}$ of binary binets or trinets over a set $X$ of taxa, and constructing such a network whenever it exists. We show that this is NP-hard for trinets but polynomial-time solvable for binets. Moreover, we show that the problem is still polynomial-time solvable for inputs consisting of binets and trinets as long as the cycles in the trinets have size three. Finally, we present an $O(3^{|X|} poly(|X|))$ time algorithm for general sets of binets and trinets. The latter two algorithms generalise to instances containing level-1 networks with arbitrarily many leaves, and thus provide some of the first supernetwork algorithms for computing networks from a set of rooted phylogenetic networks.
Submitted 25 November, 2014;
originally announced November 2014.
-
A multiwavelength study of the hierarchical triple HD 181068: A test bed for studying star-planet-interaction?
Authors:
S. Czesla,
K. F. Huber,
P. C. Schneider,
J. H. M. M. Schmitt
Abstract:
HD 181068 is the only compact, triply eclipsing, hierarchical triple system containing a giant star known to date. With its central, highly-active G-type giant orbited by a close pair of main-sequence dwarfs, the system is ideal to study tidal interactions. We carried out a multiwavelength study to characterize the magnetic activity of the HD 181068 system. To this end, we obtained in- and out-of-eclipse X-ray snapshots with XMM-Newton and an optical spectrum, which we analyzed along with the Kepler light-curve. The primary giant shows strong quiescent X-ray emission at a level of 2e31 ergs, an S-index of 0.41 +/- 0.01, and marked white-light flares releasing up to 6e38 erg in the Kepler-band. During the second X-ray observation, we found a three-times elevated -- yet decaying -- level of X-ray emission, which might be due to an X-ray flare. The high level of magnetic activity is compatible with the previously reported absence of solar-like oscillations in the giant, whose atmosphere, however, undergoes tidally-induced oscillations imposed by the changing configuration of the dwarf-binary. We found that the driving force exciting these oscillations is comparable to the disturbances produced by a typical hot Jupiter, making the system a potential test bed to study the effects of tidal interactions also present in planetary systems.
Submitted 13 August, 2014;
originally announced August 2014.
-
Representing Partitions on Trees
Authors:
Katharina T. Huber,
Vincent Moulton,
Charles Semple,
Taoyang Wu
Abstract:
In evolutionary biology, biologists often face the problem of constructing a phylogenetic tree on a set $X$ of species from a multiset $Π$ of partitions corresponding to various attributes of these species. One approach that is used to solve this problem is to try instead to associate a tree (or even a network) to the multiset $Σ_Π$ consisting of all those bipartitions $\{A,X-A\}$ with $A$ a part of some partition in $Π$. The rationale behind this approach is that a phylogenetic tree with leaf set $X$ can be uniquely represented by the set of bipartitions of $X$ induced by its edges. Motivated by these considerations, given a multiset $Σ$ of bipartitions corresponding to a phylogenetic tree on $X$, in this paper we introduce and study the set $P(Σ)$ consisting of those multisets of partitions $Π$ of $X$ with $Σ_Π=Σ$. More specifically, we characterize when $P(Σ)$ is non-empty, and also identify some partitions in $P(Σ)$ that are of maximum and minimum size. We also show that it is NP-complete to decide when $P(Σ)$ is non-empty in case $Σ$ is an arbitrary multiset of bipartitions of $X$. Ultimately, we hope that by gaining a better understanding of the mapping that takes an arbitrary partition system $Π$ to the multiset $Σ_Π$, we will obtain new insights into the use of median networks and, more generally, split-networks to visualize sets of partitions.
Submitted 9 May, 2014;
originally announced May 2014.
-
Absolute cross sections for photoionization of Xe$^{q+}$ ions (1 $\le$ q $\le$ 5) at the 3d ionization threshold
Authors:
S. Schippers,
S. Ricz,
T. Buhr,
A. Borovik Jr.,
J. Hellhund,
K. Holste,
K. Huber,
H. -J. Schäfer,
D. Schury,
S. Klumpp,
K. Mertens,
M. Martins,
R. Flesch,
G. Ulrich,
E. Rühl,
T. Jahnke,
J. Lower,
D. Metz,
L. P. H. Schmidt,
M. Schöffler,
J. B. Williams,
L. Glaser,
F. Scholz,
J. Seltmann,
J. Viefhaus
, et al. (4 additional authors not shown)
Abstract:
The photon-ion merged-beams technique has been employed at the new Photon-Ion spectrometer at PETRA III (PIPE) for measuring multiple photoionization of Xe$^{q+}$ (q=1-5) ions. Total ionization cross sections have been obtained on an absolute scale for the dominant ionization reactions of the type hν + Xe$^{q+}$ $\to$ Xe$^{r+}$ + (r-q) e$^-$ with product charge states q+2 $\le$ r $\le$ q+5. Prominent ionization features are observed in the photon-energy range 650-750 eV, which are associated with excitation or ionization of an inner-shell 3d electron. Single-configuration Dirac-Fock calculations agree quantitatively with the experimental cross sections for non-resonant photoabsorption, but fail to reproduce all details of the measured ionization resonance structures.
Submitted 11 April, 2014;
originally announced April 2014.
-
Characterizing Block Graphs in Terms of their Vertex-Induced Partitions
Authors:
A. Dress,
K. T. Huber,
J. Koolen,
V. Moulton,
A. Spillner
Abstract:
Given a finite connected simple graph $G=(V,E)$ with vertex set $V$ and edge set $E\subseteq \binom{V}{2}$, we will show that
$1.$ the (necessarily unique) smallest block graph with vertex set $V$ whose edge set contains $E$ is uniquely determined by the $V$-indexed family ${\bf P}_G:=\big(π_0(G^{(v)})\big)_{v \in V}$ of the various partitions $π_0(G^{(v)})$ of the set $V$ into the set of connected components of the graph $G^{(v)}:=(V,\{e\in E: v\notin e\})$,
$2.$ the edge set of this block graph coincides with the set of all $2$-subsets $\{u,v\}$ of $V$ for which $u$ and $v$ are, for all $w\in V-\{u,v\}$, contained in the same connected component of $G^{(w)}$,
$3.$ and an arbitrary $V$-indexed family ${\bf P}=({\bf p}_v)_{v \in V}$ of partitions ${\bf p}_v$ of the set $V$ is of the form ${\bf P}={\bf P}_G$ for some connected simple graph $G=(V,E)$ with vertex set $V$ as above if and only if, for any two distinct elements $u,v\in V$, the union of the set in ${\bf p}_v$ that contains $u$ and the set in ${\bf p}_u$ that contains $v$ coincides with the set $V$, and $\{v\}\in {\bf p}_v$ holds for all $v \in V$.
As well as being of inherent interest to the theory of block graphs, these facts are also useful in the analysis of compatible decompositions and block realizations of finite metric spaces.
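The second statement above translates directly into a small computation. The following is a minimal sketch, assuming the graph is given as a vertex list and a list of vertex pairs; the function names `components_without` and `block_graph_edges` are hypothetical and not taken from the paper.

```python
from itertools import combinations


def components_without(vertices, edges, w):
    """Label each vertex by its connected component in G^(w),
    i.e. the graph on V that keeps only the edges not containing w."""
    adj = {v: set() for v in vertices}
    for a, b in edges:
        if w not in (a, b):
            adj[a].add(b)
            adj[b].add(a)
    label = {}
    for start in vertices:
        if start in label:
            continue
        label[start] = start
        stack = [start]
        while stack:  # depth-first search from `start`
            x = stack.pop()
            for y in adj[x]:
                if y not in label:
                    label[y] = start
                    stack.append(y)
    return label


def block_graph_edges(vertices, edges):
    """Edge set from statement 2: {u, v} is an edge iff u and v lie in the
    same component of G^(w) for every w other than u and v."""
    vertices = list(vertices)
    labels = {w: components_without(vertices, edges, w) for w in vertices}
    result = set()
    for u, v in combinations(vertices, 2):
        if all(labels[w][u] == labels[w][v] for w in vertices if w not in (u, v)):
            result.add(frozenset((u, v)))
    return result


path = block_graph_edges("abc", [("a", "b"), ("b", "c")])
cycle = block_graph_edges([1, 2, 3, 4], [(1, 2), (2, 3), (3, 4), (4, 1)])
```

A path is already a block graph, so `path` contains exactly its two edges, whereas the 4-cycle is a single 2-connected block and `cycle` contains all six pairs. The sketch assumes the input graph is connected, as in the statement.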
Submitted 28 February, 2014; v1 submitted 18 February, 2014;
originally announced February 2014.
-
Distinguished minimal topological lassos
Authors:
Katharina T. Huber,
George Kettleborough
Abstract:
A classical result in distance-based tree reconstruction characterizes when, for a distance $D$ on some finite set $X$, there exists a uniquely determined dendrogram on $X$ (essentially a rooted tree $T=(V,E)$ with leaf set $X$ and no degree two vertices but possibly the root and an edge weighting $ω:E\to \mathbb R_{\geq 0}$) such that the distance $D_{(T,ω)}$ induced by $(T,ω)$ on $X$ is $D$. Moreover, algorithms that quickly reconstruct $(T,ω)$ from $D$ in this case are known. However, in many areas where dendrograms are being constructed, such as Computational Biology, not all distances on $X$ are always available, implying that the sought-after dendrogram need no longer be uniquely determined by the available distances with regard to the topology of the underlying tree, the edge-weighting, or both. To better understand the structural properties a set $\mathcal{L}\subseteq {X\choose 2}$ has to satisfy to overcome this problem, various types of lassos have been introduced. Here, we focus on the question of when a lasso uniquely determines the topology of a dendrogram's underlying tree, that is, it is a topological lasso for that tree. We show that any set-inclusion minimal topological lasso for such a tree $T$ can be transformed into a 'distinguished' minimal topological lasso $\mathcal{L}$ for $T$, that is, the graph $(X,\mathcal{L})$ is a claw-free block graph. Furthermore, we characterize such lassos in terms of the novel concept of a cluster marker map for $T$ and present results concerning the heritability of such lassos in the context of the subtree and supertree problems.
Submitted 12 August, 2013;
originally announced August 2013.