-
When is a set of phylogenetic trees displayed by a normal network?
Authors:
Magnus Bordewich,
Simone Linz,
Charles Semple
Abstract:
A normal network is uniquely determined by the set of phylogenetic trees that it displays. Given a set $\mathcal{P}$ of rooted binary phylogenetic trees, this paper presents a polynomial-time algorithm that reconstructs the unique binary normal network whose set of displayed binary trees is $\mathcal{P}$, if such a network exists. Additionally, we show that any two rooted phylogenetic trees can be…
▽ More
A normal network is uniquely determined by the set of phylogenetic trees that it displays. Given a set $\mathcal{P}$ of rooted binary phylogenetic trees, this paper presents a polynomial-time algorithm that reconstructs the unique binary normal network whose set of displayed binary trees is $\mathcal{P}$, if such a network exists. Additionally, we show that any two rooted phylogenetic trees can be displayed by a normal network and show that this result does not extend to more than two trees. This is in contrast to tree-child networks where it has been previously shown that any collection of rooted phylogenetic trees can be displayed by a tree-child network. Lastly, we introduce a type of cherry-picking sequence that characterises when a collection $\mathcal{P}$ of rooted phylogenetic trees can be displayed by a normal network and, further, characterise the minimum number of reticulations needed over all normal networks that display $\mathcal{P}$. We then exploit these sequences to show that, for all $n\ge 3$, there exist two rooted binary phylogenetic trees on $n$ leaves that can be displayed by a tree-child network with a single reticulation, but cannot be displayed by a normal network with less than $n-2$ reticulations.
△ Less
Submitted 9 July, 2024;
originally announced July 2024.
-
Quantifying the difference between phylogenetic diversity and diversity indices
Authors:
Magnus Bordewich,
Charles Semple
Abstract:
Phylogenetic diversity is a popular measure for quantifying the biodiversity of a collection $Y$ of species, while phylogenetic diversity indices provide a way to apportion phylogenetic diversity to individual species. Typically, for some specific diversity index, the phylogenetic diversity of $Y$ is not equal to the sum of the diversity indices of the species in $Y.$ In this paper, we investigate…
▽ More
Phylogenetic diversity is a popular measure for quantifying the biodiversity of a collection $Y$ of species, while phylogenetic diversity indices provide a way to apportion phylogenetic diversity to individual species. Typically, for some specific diversity index, the phylogenetic diversity of $Y$ is not equal to the sum of the diversity indices of the species in $Y.$ In this paper, we investigate the extent of this difference for two commonly-used indices: Fair Proportion and Equal Splits. In particular, we determine the maximum value of this difference under various instances including when the associated rooted phylogenetic tree is allowed to vary across all root phylogenetic trees with the same leaf set and whose edge lengths are constrained by either their total sum or their maximum value.
△ Less
Submitted 20 April, 2023;
originally announced April 2023.
-
Evaluating Gaussian Grasp Maps for Generative Gras** Models
Authors:
William Prew,
Toby P. Breckon,
Magnus Bordewich,
Ulrik Beierholm
Abstract:
Generalising robotic gras** to previously unseen objects is a key task in general robotic manipulation. The current method for training many antipodal generative gras** models rely on a binary ground truth grasp map generated from the centre thirds of correctly labelled grasp rectangles. However, these binary maps do not accurately reflect the positions in which a robotic arm can correctly gra…
▽ More
Generalising robotic gras** to previously unseen objects is a key task in general robotic manipulation. The current method for training many antipodal generative gras** models rely on a binary ground truth grasp map generated from the centre thirds of correctly labelled grasp rectangles. However, these binary maps do not accurately reflect the positions in which a robotic arm can correctly grasp a given object. We propose a continuous Gaussian representation of annotated grasps to generate ground truth training data which achieves a higher success rate on a simulated robotic gras** benchmark. Three modern generative gras** networks are trained with either binary or Gaussian grasp maps, along with recent advancements from the robotic gras** literature, such as discretisation of grasp angles into bins and an attentional loss function. Despite negligible difference according to the standard rectangle metric, Gaussian maps better reproduce the training data and therefore improve success rates when tested on the same simulated robot arm by avoiding collisions with the object: achieving 87.94\% accuracy. Furthermore, the best performing model is shown to operate with a high success rate when transferred to a real robotic arm, at high inference speeds, without the need for transfer learning. The system is then shown to be capable of performing grasps on an antagonistic physical object dataset benchmark.
△ Less
Submitted 1 June, 2022;
originally announced June 2022.
-
On the Complexity of Optimising Variants of Phylogenetic Diversity on Phylogenetic Networks
Authors:
Magnus Bordewich,
Charles Semple,
Kristina Wicke
Abstract:
Phylogenetic Diversity (PD) is a prominent quantitative measure of the biodiversity of a collection of present-day species (taxa). This measure is based on the evolutionary distance among the species in the collection. Loosely speaking, if $\mathcal{T}$ is a rooted phylogenetic tree whose leaf set $X$ represents a set of species and whose edges have real-valued lengths (weights), then the PD score…
▽ More
Phylogenetic Diversity (PD) is a prominent quantitative measure of the biodiversity of a collection of present-day species (taxa). This measure is based on the evolutionary distance among the species in the collection. Loosely speaking, if $\mathcal{T}$ is a rooted phylogenetic tree whose leaf set $X$ represents a set of species and whose edges have real-valued lengths (weights), then the PD score of a subset $S$ of $X$ is the sum of the weights of the edges of the minimal subtree of $\mathcal{T}$ connecting the species in $S$. In this paper, we define several natural variants of the PD score for a subset of taxa which are related by a known rooted phylogenetic network. Under these variants, we explore, for a positive integer $k$, the computational complexity of determining the maximum PD score over all subsets of taxa of size $k$ when the input is restricted to different classes of rooted phylogenetic networks
△ Less
Submitted 16 July, 2021;
originally announced July 2021.
-
Improving Robotic Gras** on Monocular Images Via Multi-Task Learning and Positional Loss
Authors:
William Prew,
Toby Breckon,
Magnus Bordewich,
Ulrik Beierholm
Abstract:
In this paper, we introduce two methods of improving real-time object gras** performance from monocular colour images in an end-to-end CNN architecture. The first is the addition of an auxiliary task during model training (multi-task learning). Our multi-task CNN model improves gras** performance from a baseline average of 72.04% to 78.14% on the large Jacquard gras** dataset when performing…
▽ More
In this paper, we introduce two methods of improving real-time object gras** performance from monocular colour images in an end-to-end CNN architecture. The first is the addition of an auxiliary task during model training (multi-task learning). Our multi-task CNN model improves gras** performance from a baseline average of 72.04% to 78.14% on the large Jacquard gras** dataset when performing a supplementary depth reconstruction task. The second is introducing a positional loss function that emphasises loss per pixel for secondary parameters (gripper angle and width) only on points of an object where a successful grasp can take place. This increases performance from a baseline average of 72.04% to 78.92% as well as reducing the number of training epochs required. These methods can be also performed in tandem resulting in a further performance increase to 79.12% while maintaining sufficient inference speed to afford real-time grasp processing.
△ Less
Submitted 5 November, 2020;
originally announced November 2020.
-
On the maximum agreement subtree conjecture for balanced trees
Authors:
Magnus Bordewich,
Simone Linz,
Megan Owen,
Katherine St. John,
Charles Semple,
Kristina Wicke
Abstract:
We give a counterexample to the conjecture of Martin and Thatte that two balanced rooted binary leaf-labelled trees on $n$ leaves have a maximum agreement subtree (MAST) of size at least $n^{\frac{1}{2}}$. In particular, we show that for any $c>0$, there exist two balanced rooted binary leaf-labelled trees on $n$ leaves such that any MAST for these two trees has size less than $c n^{\frac{1}{2}}$.…
▽ More
We give a counterexample to the conjecture of Martin and Thatte that two balanced rooted binary leaf-labelled trees on $n$ leaves have a maximum agreement subtree (MAST) of size at least $n^{\frac{1}{2}}$. In particular, we show that for any $c>0$, there exist two balanced rooted binary leaf-labelled trees on $n$ leaves such that any MAST for these two trees has size less than $c n^{\frac{1}{2}}$. We also improve the lower bound of the size of such a MAST to $n^{\frac{1}{6}}$.
△ Less
Submitted 15 May, 2020;
originally announced May 2020.
-
Recovering tree-child networks from shortest inter-taxa distance information
Authors:
Magnus Bordewich,
Katharina T Huber,
Vincent Moulton,
Charles Semple
Abstract:
Phylogenetic networks are a type of leaf-labelled, acyclic, directed graph used by biologists to represent the evolutionary history of species whose past includes reticulation events. A phylogenetic network is tree-child if each non-leaf vertex is the parent of a tree vertex or a leaf. Up to a certain equivalence, it has been recently shown that, under two different types of weightings, edge-weigh…
▽ More
Phylogenetic networks are a type of leaf-labelled, acyclic, directed graph used by biologists to represent the evolutionary history of species whose past includes reticulation events. A phylogenetic network is tree-child if each non-leaf vertex is the parent of a tree vertex or a leaf. Up to a certain equivalence, it has been recently shown that, under two different types of weightings, edge-weighted tree-child networks are determined by their collection of distances between each pair of taxa. However, the size of these collections can be exponential in the size of the taxa set. In this paper, we show that, if we ignore redundant edges, the same results are obtained with only a quadratic number of inter-taxa distances by using the shortest distance between each pair of taxa. The proofs are constructive and give cubic-time algorithms in the size of the taxa sets for building such weighted networks.
△ Less
Submitted 23 November, 2017;
originally announced November 2017.
-
A universal tree-based network with the minimum number of reticulations
Authors:
Magnus Bordewich,
Charles Semple
Abstract:
A tree-based network $\mathcal N$ on $X$ is universal if every rooted binary phylogenetic $X$-tree is a base tree for $\mathcal N$. Hayamizu and, independently, Zhang constructively showed that, for all positive integers $n$, there exists an universal tree-based network on $n$ leaves. For all $n$, Hayamizu's construction contains $Θ(n!)$ reticulations, while Zhang's construction contains $Θ(n^2)$…
▽ More
A tree-based network $\mathcal N$ on $X$ is universal if every rooted binary phylogenetic $X$-tree is a base tree for $\mathcal N$. Hayamizu and, independently, Zhang constructively showed that, for all positive integers $n$, there exists an universal tree-based network on $n$ leaves. For all $n$, Hayamizu's construction contains $Θ(n!)$ reticulations, while Zhang's construction contains $Θ(n^2)$ reticulations. A simple counting argument shows that an universal tree-based network has $Ω(n\log n)$ reticulations. With this in mind, Hayamizu as well as Steel posed the problem of determining whether or not such networks exists with $O(n\log n)$ reticulations. In this paper, we show that, for all $n$, there exists an universal tree-based network on $n$ leaves with $O(n\log n)$ reticulations.
△ Less
Submitted 21 December, 2017; v1 submitted 25 July, 2017;
originally announced July 2017.
-
On the information content of discrete phylogenetic characters
Authors:
Magnus Bordewich,
Ina Maria Deutschmann,
Mareike Fischer,
Elisa Kasbohm,
Charles Semple,
Mike Steel
Abstract:
Phylogenetic inference aims to reconstruct the evolutionary relationships of different species based on genetic (or other) data. Discrete characters are a particular type of data, which contain information on how the species should be grouped together. However, it has long been known that some characters contain more information than others. For instance, a character that assigns the same state to…
▽ More
Phylogenetic inference aims to reconstruct the evolutionary relationships of different species based on genetic (or other) data. Discrete characters are a particular type of data, which contain information on how the species should be grouped together. However, it has long been known that some characters contain more information than others. For instance, a character that assigns the same state to each species groups all of them together and so provides no insight into the relationships of the species considered. At the other extreme, a character that assigns a different state to each species also conveys no phylogenetic signal. In this manuscript, we study a natural combinatorial measure of the information content of an individual character and analyse properties of characters that provide the maximum phylogenetic information, particularly, the number of states such a character uses and how the different states have to be distributed among the species or taxa of the phylogenetic tree.
△ Less
Submitted 19 December, 2017; v1 submitted 14 March, 2017;
originally announced March 2017.
-
Reticulation-visible networks
Authors:
Magnus Bordewich,
Charles Semple
Abstract:
Let $X$ be a finite set, $\mathcal N$ be a reticulation-visible network on $X$, and $\mathcal T$ be a rooted binary phylogenetic tree. We show that there is a polynomial-time algorithm for deciding whether or not $\mathcal N$ displays $\mathcal T$. Furthermore, for all $|X|\ge 1$, we show that $\mathcal N$ has at most $8|X|-7$ vertices in total and at most $3|X|-3$ reticulation vertices, and that…
▽ More
Let $X$ be a finite set, $\mathcal N$ be a reticulation-visible network on $X$, and $\mathcal T$ be a rooted binary phylogenetic tree. We show that there is a polynomial-time algorithm for deciding whether or not $\mathcal N$ displays $\mathcal T$. Furthermore, for all $|X|\ge 1$, we show that $\mathcal N$ has at most $8|X|-7$ vertices in total and at most $3|X|-3$ reticulation vertices, and that these upper bounds are sharp.
△ Less
Submitted 24 June, 2017; v1 submitted 21 August, 2015;
originally announced August 2015.
-
Mixing of the Glauber dynamics for the ferromagnetic Potts model
Authors:
Magnus Bordewich,
Catherine Greenhill,
Viresh Patel
Abstract:
We present several results on the mixing time of the Glauber dynamics for sampling from the Gibbs distribution in the ferromagnetic Potts model. At a fixed temperature and interaction strength, we study the interplay between the maximum degree ($Δ$) of the underlying graph and the number of colours or spins ($q$) in determining whether the dynamics mixes rapidly or not. We find a lower bound $L$ o…
▽ More
We present several results on the mixing time of the Glauber dynamics for sampling from the Gibbs distribution in the ferromagnetic Potts model. At a fixed temperature and interaction strength, we study the interplay between the maximum degree ($Δ$) of the underlying graph and the number of colours or spins ($q$) in determining whether the dynamics mixes rapidly or not. We find a lower bound $L$ on the number of colours such that Glauber dynamics is rapidly mixing if at least $L$ colours are used. We give a closely-matching upper bound $U$ on the number of colours such that with probability that tends to 1, the Glauber dynamics mixes slowly on random $Δ$-regular graphs when at most $U$ colours are used. We show that our bounds can be improved if we restrict attention to certain types of graphs of maximum degree $Δ$, e.g. toroidal grids for $Δ= 4$.
△ Less
Submitted 4 June, 2014; v1 submitted 3 May, 2013;
originally announced May 2013.
-
Rapid mixing of subset Glauber dynamics on graphs of bounded tree-width
Authors:
Magnus Bordewich,
Ross J. Kang
Abstract:
Motivated by the `subgraphs world' view of the ferromagnetic Ising model, we develop a general approach to studying mixing times of Glauber dynamics based on subset expansion expressions for a class of graph polynomials. With a canonical paths argument, we demonstrate that the chains defined within this framework mix rapidly upon graphs of bounded tree-width. This extends known results on rapid mi…
▽ More
Motivated by the `subgraphs world' view of the ferromagnetic Ising model, we develop a general approach to studying mixing times of Glauber dynamics based on subset expansion expressions for a class of graph polynomials. With a canonical paths argument, we demonstrate that the chains defined within this framework mix rapidly upon graphs of bounded tree-width. This extends known results on rapid mixing for the Tutte polynomial, the adjacency-rank ($R_2$-)polynomial and the interlace polynomial.
△ Less
Submitted 17 February, 2011;
originally announced February 2011.
-
Approximate Counting and Quantum Computation
Authors:
M. Bordewich,
M. Freedman,
L. Lovász,
D. Welsh
Abstract:
Motivated by the result that an `approximate' evaluation of the Jones polynomial of a braid at a $5^{th}$ root of unity can be used to simulate the quantum part of any algorithm in the quantum complexity class BQP, and results relating BQP to the counting class GapP, we introduce a form of additive approximation which can be used to simulate a function in BQP. We show that all functions in the c…
▽ More
Motivated by the result that an `approximate' evaluation of the Jones polynomial of a braid at a $5^{th}$ root of unity can be used to simulate the quantum part of any algorithm in the quantum complexity class BQP, and results relating BQP to the counting class GapP, we introduce a form of additive approximation which can be used to simulate a function in BQP. We show that all functions in the classes #P and GapP have such an approximation scheme under certain natural normalisations. However we are unable to determine whether the particular functions we are motivated by, such as the above evaluation of the Jones polynomial, can be approximated in this way. We close with some open problems motivated by this work.
△ Less
Submitted 14 August, 2009;
originally announced August 2009.
-
Metric Construction, Stop** Times and Path Coupling
Authors:
Magnus Bordewich,
Martin Dyer,
Marek Karpinski
Abstract:
In this paper we examine the importance of the choice of metric in path coupling, and the relationship of this to \emph{stop** time analysis}. We give strong evidence that stop** time analysis is no more powerful than standard path coupling. In particular, we prove a stronger theorem for path coupling with stop** times, using a metric which allows us to restrict analysis to standard one-st…
▽ More
In this paper we examine the importance of the choice of metric in path coupling, and the relationship of this to \emph{stop** time analysis}. We give strong evidence that stop** time analysis is no more powerful than standard path coupling. In particular, we prove a stronger theorem for path coupling with stop** times, using a metric which allows us to restrict analysis to standard one-step path coupling. This approach provides insight for the design of non-standard metrics giving improvements in the analysis of specific problems.
We give illustrative applications to hypergraph independent sets and SAT instances, hypergraph colourings and colourings of bipartite graphs.
△ Less
Submitted 22 November, 2005; v1 submitted 8 November, 2005;
originally announced November 2005.
-
Path Coupling Using Stop** Times and Counting Independent Sets and Colourings in Hypergraphs
Authors:
Magnus Bordewich,
Martin Dyer,
Marek Karpinski
Abstract:
We give a new method for analysing the mixing time of a Markov chain using path coupling with stop** times. We apply this approach to two hypergraph problems. We show that the Glauber dynamics for independent sets in a hypergraph mixes rapidly as long as the maximum degree Delta of a vertex and the minimum size m of an edge satisfy m>= 2Delta+1. We also show that the Glauber dynamics for prope…
▽ More
We give a new method for analysing the mixing time of a Markov chain using path coupling with stop** times. We apply this approach to two hypergraph problems. We show that the Glauber dynamics for independent sets in a hypergraph mixes rapidly as long as the maximum degree Delta of a vertex and the minimum size m of an edge satisfy m>= 2Delta+1. We also show that the Glauber dynamics for proper q-colourings of a hypergraph mixes rapidly if m>= 4 and q > Delta, and if m=3 and q>=1.65Delta. We give related results on the hardness of exact and approximate counting for both problems.
△ Less
Submitted 2 April, 2005; v1 submitted 6 January, 2005;
originally announced January 2005.