-
Geometry of rational quasi-independence models as toric fiber products
Authors:
Jane Ivy Coons,
Heather A. Harrington,
Niharika Chakrabarty Paul
Abstract:
We investigate the geometry of a family of log-linear statistical models called quasi-independence models. The toric fiber product is useful for understanding the geometry of parameter inference in these models because the maximum likelihood degree is multiplicative under the TFP. We define the coordinate toric fiber product, or cTFP, and give necessary and sufficient conditions under which a quas…
▽ More
We investigate the geometry of a family of log-linear statistical models called quasi-independence models. The toric fiber product is useful for understanding the geometry of parameter inference in these models because the maximum likelihood degree is multiplicative under the TFP. We define the coordinate toric fiber product, or cTFP, and give necessary and sufficient conditions under which a quasi-independence model is a cTFP of lower-order models. We show that the vanishing ideal of every 2-way quasi-independence model with ML-degree 1 can be realized as an iterated toric fiber product of linear ideals. We also classify which Lawrence lifts of 2-way quasi-independence models are cTFPs and give a necessary condition under which a $k$-way model has ML-degree 1 using its facial submodels.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Quiver Laplacians and Feature Selection
Authors:
Otto Sumray,
Heather A. Harrington,
Vidit Nanda
Abstract:
The challenge of selecting the most relevant features of a given dataset arises ubiquitously in data analysis and dimensionality reduction. However, features found to be of high importance for the entire dataset may not be relevant to subsets of interest, and vice versa. Given a feature selector and a fixed decomposition of the data into subsets, we describe a method for identifying selected featu…
▽ More
The challenge of selecting the most relevant features of a given dataset arises ubiquitously in data analysis and dimensionality reduction. However, features found to be of high importance for the entire dataset may not be relevant to subsets of interest, and vice versa. Given a feature selector and a fixed decomposition of the data into subsets, we describe a method for identifying selected features which are compatible with the decomposition into subsets. We achieve this by re-framing the problem of finding compatible features to one of finding sections of a suitable quiver representation. In order to approximate such sections, we then introduce a Laplacian operator for quiver representations valued in Hilbert spaces. We provide explicit bounds on how the spectrum of a quiver Laplacian changes when the representation and the underlying quiver are modified in certain natural ways. Finally, we apply this machinery to the study of peak-calling algorithms which measure chromatin accessibility in single-cell data. We demonstrate that eigenvectors of the associated quiver Laplacian yield locally and globally compatible features.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
Absolute concentration robustness: Algebra and geometry
Authors:
Luis David García Puente,
Elizabeth Gross,
Heather A Harrington,
Matthew Johnston,
Nicolette Meshkat,
Mercedes Pérez Millán,
Anne Shiu
Abstract:
Motivated by the question of how biological systems maintain homeostasis in changing environments, Shinar and Feinberg introduced in 2010 the concept of absolute concentration robustness (ACR). A biochemical system exhibits ACR in some species if the steady-state value of that species does not depend on initial conditions. Thus, a system with ACR can maintain a constant level of one species even a…
▽ More
Motivated by the question of how biological systems maintain homeostasis in changing environments, Shinar and Feinberg introduced in 2010 the concept of absolute concentration robustness (ACR). A biochemical system exhibits ACR in some species if the steady-state value of that species does not depend on initial conditions. Thus, a system with ACR can maintain a constant level of one species even as the environment changes. Despite a great deal of interest in ACR in recent years, the following basic question remains open: How can we determine quickly whether a given biochemical system has ACR? Although various approaches to this problem have been proposed, we show that they are incomplete. Accordingly, we present new methods for deciding ACR, which harness computational algebra. We illustrate our results on several biochemical signaling networks.
△ Less
Submitted 29 December, 2023;
originally announced January 2024.
-
Active shape control by plants in dynamic environments
Authors:
Hadrien Oliveri,
Derek E. Moulton,
Heather A. Harrington,
Alain Goriely
Abstract:
Plants are a paradigm for active shape control in response to stimuli. For instance, it is well-known that a tilted plant will eventually straighten vertically, demonstrating the influence of both an external stimulus, gravity, and an internal stimulus, proprioception. These effects can be modulated when a potted plant is additionally rotated along the plant's axis, as in a rotating clinostat, lea…
▽ More
Plants are a paradigm for active shape control in response to stimuli. For instance, it is well-known that a tilted plant will eventually straighten vertically, demonstrating the influence of both an external stimulus, gravity, and an internal stimulus, proprioception. These effects can be modulated when a potted plant is additionally rotated along the plant's axis, as in a rotating clinostat, leading to intricate shapes. We use a morphoelastic model for the response of growing plants to study the joint effect of both stimuli at all rotation speeds. In the absence of rotation, we identify a universal planar shape towards which all shoots eventually converge. With rotation, we demonstrate the existence of a stable family of three-dimensional dynamic equilibria where the plant axis is fixed in space. Further, the effect of axial growth is to induce steady behaviors, such as solitary waves. Overall, this study offers new insight into the complex out-of-equilibrium dynamics of a plant in three dimensions and further establishes that internal stimuli in active materials are key for robust shape control.
△ Less
Submitted 16 September, 2023;
originally announced September 2023.
-
Topological fingerprints for audio identification
Authors:
Wojciech Reise,
Ximena Fernández,
Maria Dominguez,
Heather A. Harrington,
Mariano Beguerisse-Díaz
Abstract:
We present a topological audio fingerprinting approach for robustly identifying duplicate audio tracks. Our method applies persistent homology on local spectral decompositions of audio signals, using filtered cubical complexes computed from mel-spectrograms. By encoding the audio content in terms of local Betti curves, our topological audio fingerprints enable accurate detection of time-aligned au…
▽ More
We present a topological audio fingerprinting approach for robustly identifying duplicate audio tracks. Our method applies persistent homology on local spectral decompositions of audio signals, using filtered cubical complexes computed from mel-spectrograms. By encoding the audio content in terms of local Betti curves, our topological audio fingerprints enable accurate detection of time-aligned audio matchings. Experimental results demonstrate the accuracy of our algorithm in the detection of tracks with the same audio content, even when subjected to various obfuscations. Our approach outperforms existing methods in scenarios involving topological distortions, such as time stretching and pitch shifting.
△ Less
Submitted 7 September, 2023;
originally announced September 2023.
-
Relational persistent homology for multispecies data with application to the tumor microenvironment
Authors:
Bernadette J. Stolz,
Jagdeep Dhesi,
Joshua A. Bull,
Heather A. Harrington,
Helen M. Byrne,
Iris H. R. Yoon
Abstract:
Topological data analysis (TDA) is an active field of mathematics for quantifying shape in complex data. Standard methods in TDA such as persistent homology (PH) are typically focused on the analysis of data consisting of a single entity (e.g., cells or molecular species). However, state-of-the-art data collection techniques now generate exquisitely detailed multispecies data, prompting a need for…
▽ More
Topological data analysis (TDA) is an active field of mathematics for quantifying shape in complex data. Standard methods in TDA such as persistent homology (PH) are typically focused on the analysis of data consisting of a single entity (e.g., cells or molecular species). However, state-of-the-art data collection techniques now generate exquisitely detailed multispecies data, prompting a need for methods that can examine and quantify the relations among them. Such heterogeneous data types arise in many contexts, ranging from biomedical imaging, geospatial analysis, to species ecology. Here, we propose two methods for encoding spatial relations among different data types that are based on Dowker complexes and Witness complexes. We apply the methods to synthetic multispecies data of a tumor microenvironment and analyze topological features that capture relations between different cell types, e.g., blood vessels, macrophages, tumor cells, and necrotic cells. We demonstrate that relational topological features can extract biological insight, including the dominant immune cell phenotype (an important predictor of patient prognosis) and the parameter regimes of a data-generating model. The methods provide a quantitative perspective on the relational analysis of multispecies spatial data, overcome the limits of traditional PH, and are readily computable.
△ Less
Submitted 12 September, 2023; v1 submitted 11 August, 2023;
originally announced August 2023.
-
Topological classification of tumour-immune interactions and dynamics
Authors:
**gjie Yang,
Heidi Fang,
Jagdeep Dhesi,
Iris H. R. Yoon,
Joshua A. Bull,
Helen M. Byrne,
Heather A. Harrington,
Gillian Grindstaff
Abstract:
The complex and dynamic crosstalk between tumour and immune cells results in tumours that can exhibit distinct qualitative behaviours - elimination, equilibrium, and escape - and intricate spatial patterns, yet share similar cell configurations in the early stages. We offer a topological approach to analyse time series of spatial data of cell locations (including tumour cells and macrophages) in o…
▽ More
The complex and dynamic crosstalk between tumour and immune cells results in tumours that can exhibit distinct qualitative behaviours - elimination, equilibrium, and escape - and intricate spatial patterns, yet share similar cell configurations in the early stages. We offer a topological approach to analyse time series of spatial data of cell locations (including tumour cells and macrophages) in order to predict malignant behaviour. We propose four topological vectorisations specialised to such cell data: persistence images of Vietoris-Rips and radial filtrations at static time points, and persistence images for zigzag filtrations and persistence vineyards varying in time. To demonstrate the approach, synthetic data are generated from an agent-based model with varying parameters. We compare the performance of topological summaries in predicting - with logistic regression at various time steps - whether tumour niches surrounding blood vessels are present at the end of the simulation, as a proxy for metastasis (i.e., tumour escape). We find that both static and time-dependent methods accurately identify perivascular niche formation, significantly earlier than simpler markers such as the number of tumour cells and the macrophage phenotype ratio. We find additionally that dimension 0 persistence applied to macrophage data, representing multi-scale clusters of the spatial arrangement of macrophages, performs best at this classification task at early time steps, prior to full tumour development, and performs even better when time-dependent data are included; in contrast, topological measures capturing the shape of the tumour, such as tortuosity and punctures in the cell arrangement, perform best at intermediate and later stages. The logistic regression coefficients reveal detailed shape differences between the classes.
△ Less
Submitted 9 August, 2023;
originally announced August 2023.
-
Detecting Temporal shape changes with the Euler Characteristic Transform
Authors:
Lewis Marsh,
Felix Y. Zhou,
Xiao Qin,
Xin Lu,
Helen M. Byrne,
Heather A. Harrington
Abstract:
Organoids are multi-cellular structures which are cultured in vitro from stem cells to resemble specific organs (e.g., brain, liver) in their three-dimensional composition. Dynamic changes in the shape and composition of these model systems can be used to understand the effect of mutations and treatments in health and disease. In this paper, we propose a new technique in the field of topological d…
▽ More
Organoids are multi-cellular structures which are cultured in vitro from stem cells to resemble specific organs (e.g., brain, liver) in their three-dimensional composition. Dynamic changes in the shape and composition of these model systems can be used to understand the effect of mutations and treatments in health and disease. In this paper, we propose a new technique in the field of topological data analysis for DEtecting Temporal shape changes with the Euler Characteristic Transform (DETECT). DETECT is a rotationally invariant signature of dynamically changing shapes. We demonstrate our method on a data set of segmented videos of mouse small intestine organoid experiments and show that it outperforms classical shape descriptors. We verify our method on a synthetic organoid data set and illustrate how it generalises to 3D. We conclude that DETECT offers rigorous quantification of organoids and opens up computationally scalable methods for distinguishing different growth regimes and assessing treatment effects.
△ Less
Submitted 22 December, 2022; v1 submitted 21 December, 2022;
originally announced December 2022.
-
Topological Data Analysis Detects Percolation Thresholds in Arctic Melt-Pond Evolution
Authors:
Wilfred Offord,
Michael Coughlan,
Ian J. Hewitt,
Heather A. Harrington,
Gillian Grindstaff
Abstract:
During the summer melt period, ponds form on the surface of Arctic sea ice as it melts, with important consequences for ice evolution and marine ecology. Due to the ice-albedo feedback, these melt ponds experience uneven heat absorption, and exhibit complex patterns, which has motivated the development of modelling and data analysis to understand their particular dynamics. We provide a multiscale…
▽ More
During the summer melt period, ponds form on the surface of Arctic sea ice as it melts, with important consequences for ice evolution and marine ecology. Due to the ice-albedo feedback, these melt ponds experience uneven heat absorption, and exhibit complex patterns, which has motivated the development of modelling and data analysis to understand their particular dynamics. We provide a multiscale shape analysis using tools from computational algebraic topology, simultaneously capturing convexity, proximity, integrity, and feature size complementing existing single-scale quantification. Of particular interest in modelling the ponds is a percolation threshold at which local pond structure begins merging into macroscopic features. This percolation threshold has previously been observed using fractal dimension techniques. The signed Euclidean distance transform (SEDT) is a topological encoding of heterogeneous shape in binary images, and has been previously applied to porous media for percolation as well as other material behaviours. Here we adapt the SEDT for Arctic melt pond data to give a rich characterization and computation of shape, quantifying overall melt pond development in several complementary ways, and from which classical percolation and dimension results can be extracted. This orientation-invariant topological approach distinguishes different dynamical network models of melt pond evolution of varying complexity.
△ Less
Submitted 15 December, 2022;
originally announced December 2022.
-
Multiscale topology classifies and quantifies cell types in subcellular spatial transcriptomics
Authors:
Katherine Benjamin,
Aneesha Bhandari,
Zhouchun Shang,
Yanan Xing,
Yanru An,
Nannan Zhang,
Yong Hou,
Ulrike Tillmann,
Katherine R. Bull,
Heather A. Harrington
Abstract:
Spatial transcriptomics has the potential to transform our understanding of RNA expression in tissues. Classical array-based technologies produce multiple-cell-scale measurements requiring deconvolution to recover single cell information. However, rapid advances in subcellular measurement of RNA expression at whole-transcriptome depth necessitate a fundamentally different approach. To integrate si…
▽ More
Spatial transcriptomics has the potential to transform our understanding of RNA expression in tissues. Classical array-based technologies produce multiple-cell-scale measurements requiring deconvolution to recover single cell information. However, rapid advances in subcellular measurement of RNA expression at whole-transcriptome depth necessitate a fundamentally different approach. To integrate single-cell RNA-seq data with nanoscale spatial transcriptomics, we present a topological method for automatic cell type identification (TopACT). Unlike popular decomposition approaches to multicellular resolution data, TopACT is able to pinpoint the spatial locations of individual sparsely dispersed cells without prior knowledge of cell boundaries. Pairing TopACT with multiparameter persistent homology landscapes predicts immune cells forming a peripheral ring structure within kidney glomeruli in a murine model of lupus nephritis, which we experimentally validate with immunofluorescent imaging. The proposed topological data analysis unifies multiple biological scales, from subcellular gene expression to multicellular tissue organization.
△ Less
Submitted 13 December, 2022;
originally announced December 2022.
-
Algebraic network reconstruction of discrete dynamical systems
Authors:
Heather A. Harrington,
Mike Stillman,
Alan Veliz-Cuba
Abstract:
We present a computational algebra solution to reverse engineering the network structure of discrete dynamical systems from data. We use monomial ideals to determine dependencies between variables that encode constraints on the possible wiring diagrams underlying the process generating the discrete-time, continuous-space data. Our work assumes that each variable is either monotone increasing or de…
▽ More
We present a computational algebra solution to reverse engineering the network structure of discrete dynamical systems from data. We use monomial ideals to determine dependencies between variables that encode constraints on the possible wiring diagrams underlying the process generating the discrete-time, continuous-space data. Our work assumes that each variable is either monotone increasing or decreasing. We prove that with enough data, even in the presence of small noise, our method can reconstruct the correct unique wiring diagram.
△ Less
Submitted 5 December, 2022;
originally announced December 2022.
-
Stability of topological descriptors for neuronal morphology
Authors:
David Beers,
Heather A. Harrington,
Alain Goriely
Abstract:
The topological morphology descriptor of a neuron is a multiset of intervals associated to the shape of the neuron represented as a tree. In practice, topological morphology descriptors are vectorized using persistence images, which can help classify and characterize the morphology of broad groups of neurons. We study the stability of topological morphology descriptors under small changes to neuro…
▽ More
The topological morphology descriptor of a neuron is a multiset of intervals associated to the shape of the neuron represented as a tree. In practice, topological morphology descriptors are vectorized using persistence images, which can help classify and characterize the morphology of broad groups of neurons. We study the stability of topological morphology descriptors under small changes to neuronal morphology. We show that the persistence diagram arising from the topological morphology descriptor of a neuron is stable for the 1-Wasserstein distance against a range of perturbations to the tree. These results guarantee that persistence images of topological morphology descriptors are stable against the same set of perturbations and reliable.
△ Less
Submitted 16 November, 2022;
originally announced November 2022.
-
Grounded persistent path homology: a stable, topological descriptor for weighted digraphs
Authors:
Thomas Chaplin,
Heather A. Harrington,
Ulrike Tillmann
Abstract:
Weighted digraphs are used to model a variety of natural systems and can exhibit interesting structure across a range of scales. In order to understand and compare these systems, we require stable, interpretable, multiscale descriptors. To this end, we propose grounded persistent path homology (GrPPH) - a new, functorial, topological descriptor that describes the structure of an edge-weighted digr…
▽ More
Weighted digraphs are used to model a variety of natural systems and can exhibit interesting structure across a range of scales. In order to understand and compare these systems, we require stable, interpretable, multiscale descriptors. To this end, we propose grounded persistent path homology (GrPPH) - a new, functorial, topological descriptor that describes the structure of an edge-weighted digraph via a persistence barcode. We show there is a choice of circuit basis for the graph which yields geometrically interpretable representatives for the features in the barcode. Moreover, we show the barcode is stable, in bottleneck distance, to both numerical and structural perturbations. GrPPH arises from a flexible framework, parametrised by a choice of digraph chain complex and a choice of filtration; for completeness, we also investigate replacing the path homology complex, used in GrPPH, by the directed flag complex.
△ Less
Submitted 20 October, 2022;
originally announced October 2022.
-
Hypergraphs for multiscale cycles in structured data
Authors:
Agnese Barbensi,
Iris H. R. Yoon,
Christian Degnbol Madsen,
Deborah O. Ajayi,
Michael P. H. Stumpf,
Heather A. Harrington
Abstract:
Scientific data has been growing in both size and complexity across the modern physical, engineering, life and social sciences. Spatial structure, for example, is a hallmark of many of the most important real-world complex systems, but its analysis is fraught with statistical challenges. Topological data analysis can provide a powerful computational window on complex systems. Here we present a fra…
▽ More
Scientific data has been growing in both size and complexity across the modern physical, engineering, life and social sciences. Spatial structure, for example, is a hallmark of many of the most important real-world complex systems, but its analysis is fraught with statistical challenges. Topological data analysis can provide a powerful computational window on complex systems. Here we present a framework to extend and interpret persistent homology summaries to analyse spatial data across multiple scales. We introduce hyperTDA, a topological pipeline that unifies local (e.g. geodesic) and global (e.g. Euclidean) metrics without losing spatial information, even in the presence of noise. Homology generators offer an elegant and flexible description of spatial structures and can capture the information computed by persistent homology in an interpretable way. Here the information computed by persistent homology is transformed into a weighted hypergraph, where hyperedges correspond to homology generators. We consider different choices of generators (e.g. matroid or minimal) and find that centrality and community detection are robust to either choice. We compare hyperTDA to existing geometric measures and validate its robustness to noise. We demonstrate the power of computing higher-order topological structures on spatial curves arising frequently in ecology, biophysics, and biology, but also in high-dimensional financial datasets. We find that hyperTDA can select between synthetic trajectories from the landmark 2020 AnDi challenge and quantifies movements of different animal species, even when data is limited.
△ Less
Submitted 14 October, 2022;
originally announced October 2022.
-
Zigzag persistence for coral reef resilience using a stochastic spatial model
Authors:
Robert A. McDonald,
Rosanna Neuhausler,
Martin Robinson,
Laurel G. Larsen,
Heather A. Harrington,
Maria Bruna
Abstract:
A complex interplay between species governs the evolution of spatial patterns in ecology. An open problem in the biological sciences is characterising spatio-temporal data and understanding how changes at the local scale affect global dynamics/behaviour. Here, we extend a well-studied temporal mathematical model of coral reef dynamics to include stochastic and spatial interactions and generate dat…
▽ More
A complex interplay between species governs the evolution of spatial patterns in ecology. An open problem in the biological sciences is characterising spatio-temporal data and understanding how changes at the local scale affect global dynamics/behaviour. Here, we extend a well-studied temporal mathematical model of coral reef dynamics to include stochastic and spatial interactions and generate data to study different ecological scenarios. We present descriptors to characterise patterns in heterogeneous spatio-temporal data surpassing spatially averaged measures. We apply these descriptors to simulated coral data and demonstrate the utility of two topological data analysis techniques--persistent homology and zigzag persistence--for characterising mechanisms of reef resilience. We show that the introduction of local competition between species leads to the appearance of coral clusters in the reef. We use our analyses to distinguish temporal dynamics stemming from different initial configurations of coral, showing that the neighbourhood composition of coral sites determines their long-term survival. Using zigzag persistence, we determine which spatial configurations protect coral from extinction in different environments. Finally, we apply this toolkit of multi-scale methods to empirical coral reef data, which distinguish spatio-temporal reef dynamics in different locations, and demonstrate the applicability to a range of datasets.
△ Less
Submitted 12 August, 2023; v1 submitted 19 September, 2022;
originally announced September 2022.
-
Brain Chains as Topological Signatures for Alzheimer's Disease
Authors:
Christian Goodbrake,
David Beers,
Travis B. Thompson,
Heather A. Harrington,
Alain Goriely
Abstract:
We propose a topological framework to study the evolution of Alzheimer's disease, the most common neurodegenerative disease. The modeling of this disease starts with the representation of the brain connectivity as a graph and the seeding of a toxic protein in a specific region represented by a vertex. Over time, the accumulation of toxic proteins at vertices and their propagation along edges are m…
▽ More
We propose a topological framework to study the evolution of Alzheimer's disease, the most common neurodegenerative disease. The modeling of this disease starts with the representation of the brain connectivity as a graph and the seeding of a toxic protein in a specific region represented by a vertex. Over time, the accumulation of toxic proteins at vertices and their propagation along edges are modeled by a dynamical system on this graph. These dynamics provide an order on the edges of the graph according to the damage created by high concentrations of proteins. This sequence of edges defines a filtration of the graph. We consider different filtrations given by different disease seeding locations. To study this filtration we propose a new combinatorial and topological method. A filtration defines a maximal chain in the partially ordered set of spanning subgraphs ordered by inclusion. To identify similar graphs, and define a topological signature, we quotient this poset by graph homotopy equivalence, which gives maximal chains in a smaller poset. We provide an algorithm to compute this direct quotient without computing all subgraphs and then propose bounds on the total number of graphs up to homotopy equivalence. To compare the maximal chains generated by this method, we extend Kendall's $d_K$ metric for permutations to more general graded posets and establish bounds for this metric. We then demonstrate the utility of this framework on actual brain graphs by studying the dynamics of tau proteins on the structural connectome. {We show that the proposed topological brain chain equivalence classes distinguish different simulated subtypes of Alzheimer's disease.
△ Less
Submitted 18 September, 2023; v1 submitted 22 August, 2022;
originally announced August 2022.
-
Multiscale methods for signal selection in single-cell data
Authors:
Renee S. Hoekzema,
Lewis Marsh,
Otto Sumray,
Thomas M. Carroll,
Xin Lu,
Helen M. Byrne,
Heather A. Harrington
Abstract:
Analysis of single-cell transcriptomics often relies on clustering cells and then performing differential gene expression (DGE) to identify genes that vary between these clusters. These discrete analyses successfully determine cell types and markers; however, continuous variation within and between cell types may not be detected. We propose three topologically motivated mathematical methods for un…
▽ More
Analysis of single-cell transcriptomics often relies on clustering cells and then performing differential gene expression (DGE) to identify genes that vary between these clusters. These discrete analyses successfully determine cell types and markers; however, continuous variation within and between cell types may not be detected. We propose three topologically motivated mathematical methods for unsupervised feature selection that consider discrete and continuous transcriptional patterns on an equal footing across multiple scales simultaneously. Eigenscores ($\text{eig}_i$) rank signals or genes based on their correspondence to low-frequency intrinsic patterning in the data using the spectral decomposition of the Laplacian graph. The multiscale Laplacian score (MLS) is an unsupervised method for locating relevant scales in data and selecting the genes that are coherently expressed at these respective scales. The persistent Rayleigh quotient (PRQ) takes data equipped with a filtration, allowing the separation of genes with different roles in a bifurcation process (e.g., pseudo-time). We demonstrate the utility of these techniques by applying them to published single-cell transcriptomics data sets. The methods validate previously identified genes and detect additional biologically meaningful genes with coherent expression patterns. By studying the interaction between gene signals and the geometry of the underlying space, the three methods give multidimensional rankings of the genes and visualisation of relationships between them.
△ Less
Submitted 6 October, 2022; v1 submitted 15 June, 2022;
originally announced June 2022.
-
Barcodes distinguish morphology of neuronal tauopathy
Authors:
David Beers,
Despoina Goniotaki,
Diane P. Hanger,
Alain Goriely,
Heather A. Harrington
Abstract:
The geometry of neurons is known to be important for their functions. Hence, neurons are often classified by their morphology. Two recent methods, persistent homology and the topological morphology descriptor, assign a morphology descriptor called a barcode to a neuron equipped with a given function, such as the Euclidean distance from the root of the neuron. These barcodes can be converted into m…
▽ More
The geometry of neurons is known to be important for their functions. Hence, neurons are often classified by their morphology. Two recent methods, persistent homology and the topological morphology descriptor, assign a morphology descriptor called a barcode to a neuron equipped with a given function, such as the Euclidean distance from the root of the neuron. These barcodes can be converted into matrices called persistence images, which can then be averaged across groups. We show that when the defining function is the path length from the root, both the topological morphology descriptor and persistent homology are equivalent. We further show that persistence images arising from the path length procedure provide an interpretable summary of neuronal morphology. We introduce {topological morphology functions}, a class of functions similar to Sholl functions, that can be recovered from the associated topological morphology descriptor. To demonstrate this topological approach, we compare healthy cortical and hippocampal mouse neurons to those affected by progressive tauopathy. We find a significant difference in the morphology of healthy neurons and those with a tauopathy at a postsymptomatic age. We use persistence images to conclude that the diseased group tends to have neurons with shorter branches as well as fewer branches far from the soma.
△ Less
Submitted 7 April, 2022;
originally announced April 2022.
-
Homology of homologous knotted proteins
Authors:
Katherine Benjamin,
Lamisah Mukta,
Gabriel Moryoussef,
Christopher Uren,
Heather A. Harrington,
Ulrike Tillmann,
Agnese Barbensi
Abstract:
Quantification and classification of protein structures, such as knotted proteins, often requires noise-free and complete data. Here we develop a mathematical pipeline that systematically analyzes protein structures. We showcase this geometric framework on proteins forming open-ended trefoil knots, and we demonstrate that the mathematical tool, persistent homology, faithfully represents their stru…
▽ More
Quantification and classification of protein structures, such as knotted proteins, often requires noise-free and complete data. Here we develop a mathematical pipeline that systematically analyzes protein structures. We showcase this geometric framework on proteins forming open-ended trefoil knots, and we demonstrate that the mathematical tool, persistent homology, faithfully represents their structural homology. This topological pipeline identifies important geometric features of protein entanglement and clusters the space of trefoil proteins according to their depth. Persistence landscapes quantify the topological difference between a family of knotted and unknotted proteins in the same structural homology class. This difference is localized and interpreted geometrically with recent advancements in systematic computation of homology generators. The topological and geometric quantification we find is robust to noisy input data, which demonstrates the potential of this approach in contexts where standard knot theoretic tools fail.
△ Less
Submitted 19 January, 2022;
originally announced January 2022.
-
Algebra, Geometry and Topology of ERK Kinetics
Authors:
Lewis Marsh,
Emilie Dufresne,
Helen M. Byrne,
Heather A. Harrington
Abstract:
The MEK/ERK signalling pathway is involved in cell division, cell specialisation, survival and cell death. Here we study a polynomial dynamical system describing the dynamics of MEK/ERK proposed by Yeung et al. with their experimental setup, data and known biological information. The experimental dataset is a time-course of ERK measurements in different phosphorylation states following activation…
▽ More
The MEK/ERK signalling pathway is involved in cell division, cell specialisation, survival and cell death. Here we study a polynomial dynamical system describing the dynamics of MEK/ERK proposed by Yeung et al. with their experimental setup, data and known biological information. The experimental dataset is a time-course of ERK measurements in different phosphorylation states following activation of either wild-type MEK or MEK mutations associated with cancer or developmental defects. We demonstrate how methods from computational algebraic geometry, differential algebra, Bayesian statistics and computational algebraic topology can inform the model reduction, identification and parameter inference of MEK variants, respectively. Throughout, we show how this algebraic viewpoint offers a rigorous and systematic analysis of such models.
△ Less
Submitted 1 December, 2021;
originally announced December 2021.
-
Differential elimination for dynamical models via projections with applications to structural identifiability
Authors:
Ruiwen Dong,
Christian Goodbrake,
Heather A Harrington,
Gleb Pogudin
Abstract:
Elimination of unknowns in a system of differential equations is often required when analysing (possibly nonlinear) dynamical systems models, where only a subset of variables are observable. One such analysis, identifiability, often relies on computing input-output relations via differential algebraic elimination. Determining identifiability, a natural prerequisite for meaningful parameter estimat…
▽ More
Elimination of unknowns in a system of differential equations is often required when analysing (possibly nonlinear) dynamical systems models, where only a subset of variables are observable. One such analysis, identifiability, often relies on computing input-output relations via differential algebraic elimination. Determining identifiability, a natural prerequisite for meaningful parameter estimation, is often prohibitively expensive for medium to large systems due to the computationally expensive task of elimination.
We propose an algorithm that computes a description of the set of differential-algebraic relations between the input and output variables of a dynamical system model. The resulting algorithm outperforms general-purpose software for differential elimination on a set of benchmark models from literature.
We use the designed elimination algorithm to build a new randomized algorithm for assessing structural identifiability of a parameter in a parametric model. A parameter is said to be identifiable if its value can be uniquely determined from input-output data assuming the absence of noise and sufficiently exciting inputs. Our new algorithm allows the identification of models that could not be tackled before.
Our implementation is publicly available as a Julia package at https://github.com/SciML/StructuralIdentifiability.jl.
△ Less
Submitted 23 November, 2022; v1 submitted 1 November, 2021;
originally announced November 2021.
-
Topological Approximate Bayesian Computation for Parameter Inference of an Angiogenesis Model
Authors:
Thomas Thorne,
Paul D. W. Kirk,
Heather A. Harrington
Abstract:
Inferring the parameters of models describing biological systems is an important problem in the reverse engineering of the mechanisms underlying these systems. Much work has focused on parameter inference of stochastic and ordinary differential equation models using Approximate Bayesian Computation (ABC). While there is some recent work on inference in spatial models, this remains an open problem.…
▽ More
Inferring the parameters of models describing biological systems is an important problem in the reverse engineering of the mechanisms underlying these systems. Much work has focused on parameter inference of stochastic and ordinary differential equation models using Approximate Bayesian Computation (ABC). While there is some recent work on inference in spatial models, this remains an open problem. Simultaneously, advances in topological data analysis (TDA), a field of computational mathematics, have enabled spatial patterns in data to be characterised. Here we focus on recent work using topological data analysis to study different regimes of parameter space for a well-studied model of angiogenesis. We propose a method for combining TDA with ABC to infer parameters in the Anderson-Chaplain model of angiogenesis. We demonstrate that this topological approach outperforms ABC approaches that use simpler statistics based on spatial features of the data. This is a first step towards a general framework of spatial parameter inference for biological systems, for which there may be a variety of filtrations, vectorisations, and summary statistics to be considered. All code used to produce our results is available as a Snakemake workflow.
△ Less
Submitted 8 November, 2021; v1 submitted 26 August, 2021;
originally announced August 2021.
-
What are higher-order networks?
Authors:
Christian Bick,
Elizabeth Gross,
Heather A. Harrington,
Michael T. Schaub
Abstract:
Network-based modeling of complex systems and data using the language of graphs has become an essential topic across a range of different disciplines. Arguably, this graph-based perspective derives its success from the relative simplicity of graphs: A graph consists of nothing more than a set of vertices and a set of edges, describing relationships between pairs of such vertices. This simple combi…
▽ More
Network-based modeling of complex systems and data using the language of graphs has become an essential topic across a range of different disciplines. Arguably, this graph-based perspective derives its success from the relative simplicity of graphs: A graph consists of nothing more than a set of vertices and a set of edges, describing relationships between pairs of such vertices. This simple combinatorial structure makes graphs interpretable and flexible modeling tools. The simplicity of graphs as system models, however, has been scrutinized in the literature recently. Specifically, it has been argued from a variety of different angles that there is a need for higher-order networks, which go beyond the paradigm of modeling pairwise relationships, as encapsulated by graphs. In this survey article we take stock of these recent developments. Our goals are to clarify (i) what higher-order networks are, (ii) why these are interesting objects of study, and (iii) how they can be used in applications.
△ Less
Submitted 4 July, 2022; v1 submitted 20 April, 2021;
originally announced April 2021.
-
Principal Components along Quiver Representations
Authors:
Anna Seigal,
Heather A. Harrington,
Vidit Nanda
Abstract:
Quiver representations arise naturally in many areas across mathematics. Here we describe an algorithm for calculating the vector space of sections, or compatible assignments of vectors to vertices, of any finite-dimensional representation of a finite quiver. Consequently, we are able to define and compute principal components with respect to quiver representations. These principal components are…
▽ More
Quiver representations arise naturally in many areas across mathematics. Here we describe an algorithm for calculating the vector space of sections, or compatible assignments of vectors to vertices, of any finite-dimensional representation of a finite quiver. Consequently, we are able to define and compute principal components with respect to quiver representations. These principal components are solutions to constrained optimisation problems defined over the space of sections, and are eigenvectors of an associated matrix pencil.
△ Less
Submitted 24 November, 2021; v1 submitted 21 April, 2021;
originally announced April 2021.
-
Topological data analysis distinguishes parameter regimes in the Anderson-Chaplain model of angiogenesis
Authors:
John T. Nardini,
Bernadette J. Stolz,
Kevin B. Flores,
Heather A. Harrington,
Helen M. Byrne
Abstract:
Angiogenesis is the process by which blood vessels form from pre-existing vessels. It plays a key role in many biological processes, including embryonic development and wound healing, and contributes to many diseases including cancer and rheumatoid arthritis. The structure of the resulting vessel networks determines their ability to deliver nutrients and remove waste products from biological tissu…
▽ More
Angiogenesis is the process by which blood vessels form from pre-existing vessels. It plays a key role in many biological processes, including embryonic development and wound healing, and contributes to many diseases including cancer and rheumatoid arthritis. The structure of the resulting vessel networks determines their ability to deliver nutrients and remove waste products from biological tissues. Here we simulate the Anderson-Chaplain model of angiogenesis at different parameter values and quantify the vessel architectures of the resulting synthetic data. Specifically, we propose a topological data analysis (TDA) pipeline for systematic analysis of the model. TDA is a vibrant and relatively new field of computational mathematics for studying the shape of data. We compute topological and standard descriptors of model simulations generated by different parameter values. We show that TDA of model simulation data stratifies parameter space into regions with similar vessel morphology. The methodologies proposed here are widely applicable to other synthetic and experimental data including wound healing, development, and plant biology.
△ Less
Submitted 22 April, 2021; v1 submitted 2 January, 2021;
originally announced January 2021.
-
Multiscale Topology Characterises Dynamic Tumour Vascular Networks
Authors:
Bernadette J. Stolz,
Jakob Kaeppler,
Bostjan Markelc,
Franziska Mech,
Florian Lipsmeier,
Ruth J. Muschel,
Helen M. Byrne,
Heather A. Harrington
Abstract:
Advances in imaging techniques enable high resolution 3D visualisation of vascular networks over time and reveal abnormal structural features such as twists and loops, and their quantification is an active area of research. Here we showcase how topological data analysis (TDA), the mathematical field that studies `shape' of data, can characterise the geometric, spatial and temporal organisation of…
▽ More
Advances in imaging techniques enable high resolution 3D visualisation of vascular networks over time and reveal abnormal structural features such as twists and loops, and their quantification is an active area of research. Here we showcase how topological data analysis (TDA), the mathematical field that studies `shape' of data, can characterise the geometric, spatial and temporal organisation of vascular networks. We propose two topological lenses to study vasculature, which capture inherent multi-scale features and vessel connectivity, and surpass the single scale analysis of existing methods. We analyse images collected using intravital and ultramicroscopy modalities and quantify spatio-temporal variation of twists, loops, and avascular regions (voids) in 3D vascular networks. This topological approach validates and quantifies known qualitative trends such as dynamic changes in tortuosity and loops in response to antibodies that modulate vessel sprouting; furthermore, it quantifies the effect of radiotherapy on vessel architecture.
△ Less
Submitted 26 April, 2022; v1 submitted 19 August, 2020;
originally announced August 2020.
-
Grid diagrams as tools to investigate knot spaces and topoisomerase-mediated simplification of DNA topology
Authors:
Agnese Barbensi,
Daniele Celoria,
Heather A. Harrington,
Andrzej Stasiak,
Dorothy Buck
Abstract:
Grid diagrams with their relatively simple mathematical formalism provide a convenient way to generate and model projections of various knots. It has been an open question whether these 2D diagrams can be used to model a complex 3D process such as the topoisomerase-mediated preferential unknotting of DNA molecules. We model here topoisomerase-mediated passages of double-stranded DNA segments throu…
▽ More
Grid diagrams with their relatively simple mathematical formalism provide a convenient way to generate and model projections of various knots. It has been an open question whether these 2D diagrams can be used to model a complex 3D process such as the topoisomerase-mediated preferential unknotting of DNA molecules. We model here topoisomerase-mediated passages of double-stranded DNA segments through each other using the formalism of grid diagrams. We show that this grid diagram-based modelling approach captures the essence of the preferential unknotting mechanism, based on topoisomerase selectivity of hooked DNA juxtapositions as the sites of intersegmental passages. We show that grid diagram-based approach provide an important, new and computationally convenient framework for investigating entanglement in biopolymers.
△ Less
Submitted 12 September, 2019;
originally announced September 2019.
-
Geometric anomaly detection in data
Authors:
Bernadette J Stolz,
Jared Tanner,
Heather A Harrington,
Vidit Nanda
Abstract:
This paper describes the systematic application of local topological methods for detecting interfaces and related anomalies in complicated high-dimensional data. By examining the topology of small regions around each point, one can optimally stratify a given dataset into clusters, each of which is in turn well-approximable by a suitable submanifold of the ambient space. Since these approximating s…
▽ More
This paper describes the systematic application of local topological methods for detecting interfaces and related anomalies in complicated high-dimensional data. By examining the topology of small regions around each point, one can optimally stratify a given dataset into clusters, each of which is in turn well-approximable by a suitable submanifold of the ambient space. Since these approximating submanifolds might have different dimensions, we are able to detect non-manifold like singular regions in data even when none of the data points have been sampled from those singularities. We showcase this method by identifying the intersection of two surfaces in the 24-dimensional space of cyclo-octane conformations, and by locating all the self-intersections of a Henneberg minimal surface immersed in 3-dimensional space. Due to the local nature of the required topological computations, the algorithmic burden of performing such data stratification is readily distributable across several processors.
△ Less
Submitted 25 August, 2019;
originally announced August 2019.
-
Topological Methods for Characterising Spatial Networks: A Case Study in Tumour Vasculature
Authors:
Helen M Byrne,
Heather A Harrington,
Ruth Muschel,
Gesine Reinert,
Bernadette J Stolz,
Ulrike Tillmann
Abstract:
Understanding how the spatial structure of blood vessel networks relates to their function in healthy and abnormal biological tissues could improve diagnosis and treatment for diseases such as cancer. New imaging techniques can generate multiple, high-resolution images of the same tissue region, and show how vessel networks evolve during disease onset and treatment. Such experimental advances have…
▽ More
Understanding how the spatial structure of blood vessel networks relates to their function in healthy and abnormal biological tissues could improve diagnosis and treatment for diseases such as cancer. New imaging techniques can generate multiple, high-resolution images of the same tissue region, and show how vessel networks evolve during disease onset and treatment. Such experimental advances have created an exciting opportunity for discovering new links between vessel structure and disease through the development of mathematical tools that can analyse these rich datasets. Here we explain how topological data analysis (TDA) can be used to study vessel network structures. TDA is a growing field in the mathematical and computational sciences, that consists of algorithmic methods for identifying global and multi-scale structures in high-dimensional data sets that may be noisy and incomplete. TDA has identified the effect of ageing on vessel networks in the brain and more recently proposed to study blood flow and stenosis. Here we present preliminary work which shows how TDA of spatial network structure can be used to characterise tumour vasculature.
△ Less
Submitted 19 July, 2019;
originally announced July 2019.
-
Double branched covers of knotoids
Authors:
Agnese Barbensi,
Dorothy Buck,
Heather A. Harrington,
Marc Lackenby
Abstract:
By using double branched covers, we prove that there is a 1-1 correspondence between the set of knotoids in the 2-sphere, up to orientation reversion and rotation, and knots with a strong inversion, up to conjugacy. This correspondence allows us to study knotoids through tools and invariants coming from knot theory. In particular, concepts from geometrisation generalise to knotoids, allowing us to…
▽ More
By using double branched covers, we prove that there is a 1-1 correspondence between the set of knotoids in the 2-sphere, up to orientation reversion and rotation, and knots with a strong inversion, up to conjugacy. This correspondence allows us to study knotoids through tools and invariants coming from knot theory. In particular, concepts from geometrisation generalise to knotoids, allowing us to characterise invertibility and other properties in the hyperbolic case. Moreover, with our construction we are able to detect both the trivial knotoid in the 2-sphere and the trivial planar knotoid.
△ Less
Submitted 25 September, 2019; v1 submitted 22 November, 2018;
originally announced November 2018.
-
Coloured Noise from Stochastic Inflows in Reaction-Diffusion Systems
Authors:
Michael F Adamer,
Heather A Harrington,
Eamonn A Gaffney,
Thomas E Woolley
Abstract:
In this paper we present a framework for investigating coloured noise in reaction-diffusion systems. We start by considering a deterministic reaction-diffusion equation and show how external forcing can cause temporally correlated or coloured noise. Here, the main source of external noise is considered to be fluctuations in the parameter values representing the inflow of particles to the system. F…
▽ More
In this paper we present a framework for investigating coloured noise in reaction-diffusion systems. We start by considering a deterministic reaction-diffusion equation and show how external forcing can cause temporally correlated or coloured noise. Here, the main source of external noise is considered to be fluctuations in the parameter values representing the inflow of particles to the system. First, we determine which reaction systems, driven by extrinsic noise, can admit only one steady state, so that effects, such as stochastic switching, are precluded from our analysis. To analyse the steady state behaviour of reaction systems, even if the parameter values are changing, necessitates a parameter-free approach, which has been central to algebraic analysis in chemical reaction network theory. To identify suitable models we use tools from real algebraic geometry that link the network structure to its dynamical properties. We then make a connection to internal noise models and show how power spectral methods can be used to predict stochastically driven patterns in systems with coloured noise. In simple cases we show that the power spectrum of the coloured noise process and the power spectrum of the reaction-diffusion system modelled with white noise multiply to give the power spectrum of the coloured noise reaction-diffusion system.
△ Less
Submitted 30 November, 2018; v1 submitted 30 October, 2018;
originally announced October 2018.
-
On Some Configurations of Oppositely Charged Trapped Vortices in the Plane
Authors:
Emilie Dufresne,
Heather A Harrington,
Panayotis G Kevrekidis,
Paolo Tripoli
Abstract:
Our aim in the present work is to identify all the possible standing wave configurations involving few vortices of different charges in an atomic Bose-Einstein condensate (BEC). In this effort, we deploy the use of a computational algebra approach in order to identify stationary multi-vortex states with up to 6 vortices. The use of invariants and symmetries enables deducing a set of equations in e…
▽ More
Our aim in the present work is to identify all the possible standing wave configurations involving few vortices of different charges in an atomic Bose-Einstein condensate (BEC). In this effort, we deploy the use of a computational algebra approach in order to identify stationary multi-vortex states with up to 6 vortices. The use of invariants and symmetries enables deducing a set of equations in elementary symmetric polynomials, which can then be fully solved via computational algebra packages within Maple. We retrieve a number of previously identified configurations, including collinear ones and polygonal (e.g. quadrupolar and hexagonal) ones. However, importantly, we also retrieve a configuration with 4 positive charges and 2 negative ones which is unprecedented, to the best of our knowledge, in BEC studies. We corroborate these predictions via numerical computations in the fully two-dimensional PDE system of the Gross-Pitaevskii type which characterizes the BEC at the mean-field level.
△ Less
Submitted 26 October, 2018;
originally announced October 2018.
-
Joining and decomposing reaction networks
Authors:
Elizabeth Gross,
Heather A Harrington,
Nicolette Meshkat,
Anne Shiu
Abstract:
In systems and synthetic biology, much research has focused on the behavior and design of single pathways, while, more recently, experimental efforts have focused on how cross-talk (coupling two or more pathways) or inhibiting molecular function (isolating one part of the pathway) affects systems-level behavior. However, the theory for tackling these larger systems in general has lagged behind. He…
▽ More
In systems and synthetic biology, much research has focused on the behavior and design of single pathways, while, more recently, experimental efforts have focused on how cross-talk (coupling two or more pathways) or inhibiting molecular function (isolating one part of the pathway) affects systems-level behavior. However, the theory for tackling these larger systems in general has lagged behind. Here, we analyze how joining networks (e.g., cross-talk) or decomposing networks (e.g., inhibition or knock-outs) affects three properties that reaction networks may possess---identifiability (recoverability of parameter values from data), steady-state invariants (relationships among species concentrations at steady state, used in model selection), and multistationarity (capacity for multiple steady states, which correspond to multiple cell decisions). Specifically, we prove results that clarify, for a network obtained by joining two smaller networks, how properties of the smaller networks can be inferred from or can imply similar properties of the original network. Our proofs use techniques from computational algebraic geometry, including elimination theory and differential algebra.
△ Less
Submitted 14 August, 2019; v1 submitted 12 October, 2018;
originally announced October 2018.
-
Topological Data Analysis of Task-Based fMRI Data from Experiments on Schizophrenia
Authors:
Bernadette J. Stolz,
Tegan Emerson,
Satu Nahkuri,
Mason A. Porter,
Heather A. Harrington
Abstract:
We use methods from computational algebraic topology to study functional brain networks, in which nodes represent brain regions and weighted edges encode the similarity of fMRI time series from each region. With these tools, which allow one to characterize topological invariants such as loops in high-dimensional data, we are able to gain understanding into low-dimensional structures in networks in…
▽ More
We use methods from computational algebraic topology to study functional brain networks, in which nodes represent brain regions and weighted edges encode the similarity of fMRI time series from each region. With these tools, which allow one to characterize topological invariants such as loops in high-dimensional data, we are able to gain understanding into low-dimensional structures in networks in a way that complements traditional approaches that are based on pairwise interactions. In the present paper, we use persistent homology to analyze networks that we construct from task-based fMRI data from schizophrenia patients, healthy controls, and healthy siblings of schizophrenia patients. We thereby explore the persistence of topological structures such as loops at different scales in these networks. We use persistence landscapes and persistence images to create output summaries from our persistent-homology calculations, and we study the persistence landscapes and images using $k$-means clustering and community detection. Based on our analysis of persistence landscapes, we find that the members of the sibling cohort have topological features (specifically, their 1-dimensional loops) that are distinct from the other two cohorts. From the persistence images, we are able to distinguish all three subject groups and to determine the brain regions in the loops (with four or more edges) that allow us to make these distinctions.
△ Less
Submitted 25 August, 2020; v1 submitted 22 September, 2018;
originally announced September 2018.
-
Linear compartmental models: input-output equations and operations that preserve identifiability
Authors:
Elizabeth Gross,
Heather A. Harrington,
Nicolette Meshkat,
Anne Shiu
Abstract:
This work focuses on the question of how identifiability of a mathematical model, that is, whether parameters can be recovered from data, is related to identifiability of its submodels. We look specifically at linear compartmental models and investigate when identifiability is preserved after adding or removing model components. In particular, we examine whether identifiability is preserved when a…
▽ More
This work focuses on the question of how identifiability of a mathematical model, that is, whether parameters can be recovered from data, is related to identifiability of its submodels. We look specifically at linear compartmental models and investigate when identifiability is preserved after adding or removing model components. In particular, we examine whether identifiability is preserved when an input, output, edge, or leak is added or deleted. Our approach, via differential algebra, is to analyze specific input-output equations of a model and the Jacobian of the associated coefficient map. We clarify a prior determinantal formula for these equations, and then use it to prove that, under some hypotheses, a model's input-output equations can be understood in terms of certain submodels we call "output-reachable". Our proofs use algebraic and combinatorial techniques.
△ Less
Submitted 24 May, 2019; v1 submitted 1 August, 2018;
originally announced August 2018.
-
Topological data analysis of continuum percolation with disks
Authors:
Leo Speidel,
Heather A. Harrington,
S. Jonathan Chapman,
Mason A. Porter
Abstract:
We study continuum percolation with disks, a variant of continuum percolation in two-dimensional Euclidean space, by applying tools from topological data analysis. We interpret each realization of continuum percolation with disks as a topological subspace of $[0,1]^2$ and investigate its topological features across many realizations. We apply persistent homology to investigate topological changes…
▽ More
We study continuum percolation with disks, a variant of continuum percolation in two-dimensional Euclidean space, by applying tools from topological data analysis. We interpret each realization of continuum percolation with disks as a topological subspace of $[0,1]^2$ and investigate its topological features across many realizations. We apply persistent homology to investigate topological changes as we vary the number and radius of disks. We observe evidence that the longest persisting invariant is born at or near the percolation transition.
△ Less
Submitted 20 April, 2018;
originally announced April 2018.
-
Sampling real algebraic varieties for topological data analysis
Authors:
Emilie Dufresne,
Parker B. Edwards,
Heather A. Harrington,
Jonathan D. Hauenstein
Abstract:
Topological data analysis (TDA) provides a growing body of tools for computing geometric and topological information about spaces from a finite sample of points. We present a new adaptive algorithm for finding provably dense samples of points on real algebraic varieties given a set of defining polynomials. The algorithm utilizes methods from numerical algebraic geometry to give formal guarantees a…
▽ More
Topological data analysis (TDA) provides a growing body of tools for computing geometric and topological information about spaces from a finite sample of points. We present a new adaptive algorithm for finding provably dense samples of points on real algebraic varieties given a set of defining polynomials. The algorithm utilizes methods from numerical algebraic geometry to give formal guarantees about the density of the sampling and it also employs geometric heuristics to reduce the size of the sample. As TDA methods consume significant computational resources that scale poorly in the number of sample points, our sampling minimization makes applying TDA methods more feasible. We provide a software package that implements the algorithm and also demonstrate the implementation with several examples.
△ Less
Submitted 18 October, 2018; v1 submitted 21 February, 2018;
originally announced February 2018.
-
Stratifying multiparameter persistent homology
Authors:
Heather A. Harrington,
Nina Otter,
Hal Schenck,
Ulrike Tillmann
Abstract:
A fundamental tool in topological data analysis is persistent homology, which allows extraction of information from complex datasets in a robust way. Persistent homology assigns a module over a principal ideal domain to a one-parameter family of spaces obtained from the data. In applications data often depend on several parameters, and in this case one is interested in studying the persistent homo…
▽ More
A fundamental tool in topological data analysis is persistent homology, which allows extraction of information from complex datasets in a robust way. Persistent homology assigns a module over a principal ideal domain to a one-parameter family of spaces obtained from the data. In applications data often depend on several parameters, and in this case one is interested in studying the persistent homology of a multiparameter family of spaces associated to the data. While the theory of persistent homology for one-parameter families is well-understood, the situation for multiparameter families is more delicate. Following Carlsson and Zomorodian we recast the problem in the setting of multigraded algebra, and we propose multigraded Hilbert series, multigraded associated primes and local cohomology as invariants for studying multiparameter persistent homology. Multigraded associated primes provide a stratification of the region where a multigraded module does not vanish, while multigraded Hilbert series and local cohomology give a measure of the size of components of the module supported on different strata. These invariants generalize in a suitable sense the invariant for the one-parameter case.
△ Less
Submitted 18 June, 2019; v1 submitted 24 August, 2017;
originally announced August 2017.
-
Graph-Facilitated Resonant Mode Counting in Stochastic Interaction Networks
Authors:
Michael F Adamer,
Thomas E Woolley,
Heather A Harrington
Abstract:
Oscillations in a stochastic dynamical system, whose deterministic counterpart has a stable steady state, are a widely reported phenomenon. Traditional methods of finding parameter regimes for stochastically-driven resonances are, however, cumbersome for any but the smallest networks. In this letter we show by example of the Brusselator how to use real root counting algorithms and graph theoretic…
▽ More
Oscillations in a stochastic dynamical system, whose deterministic counterpart has a stable steady state, are a widely reported phenomenon. Traditional methods of finding parameter regimes for stochastically-driven resonances are, however, cumbersome for any but the smallest networks. In this letter we show by example of the Brusselator how to use real root counting algorithms and graph theoretic tools to efficiently determine the number of resonant modes and parameter ranges for stochastic oscillations. We argue that stochastic resonance is a network property by showing that resonant modes only depend on the squared Jacobian matrix $J^2$ , unlike deterministic oscillations which are determined by $J$. By using graph theoretic tools, analysis of stochastic behaviour for larger networks is simplified and chemical reaction networks with multiple resonant modes can be identified easily.
△ Less
Submitted 28 February, 2017;
originally announced February 2017.
-
Tensor clustering with algebraic constraints gives interpretable groups of crosstalk mechanisms in breast cancer
Authors:
Anna Seigal,
Mariano Beguerisse-Díaz,
Birgit Schoeberl,
Mario Niepel,
Heather A. Harrington
Abstract:
We introduce a tensor-based clustering method to extract sparse, low-dimensional structure from high-dimensional, multi-indexed datasets. This framework is designed to enable detection of clusters of data in the presence of structural requirements which we encode as algebraic constraints in a linear program. Our clustering method is general and can be tailored to a variety of applications in scien…
▽ More
We introduce a tensor-based clustering method to extract sparse, low-dimensional structure from high-dimensional, multi-indexed datasets. This framework is designed to enable detection of clusters of data in the presence of structural requirements which we encode as algebraic constraints in a linear program. Our clustering method is general and can be tailored to a variety of applications in science and industry. We illustrate our method on a collection of experiments measuring the response of genetically diverse breast cancer cell lines to an array of ligands. Each experiment consists of a cell line-ligand combination, and contains time-course measurements of the early-signalling kinases MAPK and AKT at two different ligand dose levels. By imposing appropriate structural constraints and respecting the multi-indexed structure of the data, the analysis of clusters can be optimized for biological interpretation and therapeutic understanding. We then perform a systematic, large-scale exploration of mechanistic models of MAPK-AKT crosstalk for each cluster. This analysis allows us to quantify the heterogeneity of breast cancer cell subtypes, and leads to hypotheses about the signalling mechanisms that mediate the response of the cell lines to ligands.
△ Less
Submitted 8 February, 2019; v1 submitted 23 December, 2016;
originally announced December 2016.
-
The Topological "Shape" of Brexit
Authors:
Bernadette J. Stolz,
Heather A. Harrington,
Mason A. Porter
Abstract:
Persistent homology is a method from computational algebraic topology that can be used to study the "shape" of data. We illustrate two filtrations --- the weight rank clique filtration and the Vietoris--Rips (VR) filtration --- that are commonly used in persistent homology, and we apply these filtrations to a pair of data sets that are both related to the 2016 European Union "Brexit" referendum in…
▽ More
Persistent homology is a method from computational algebraic topology that can be used to study the "shape" of data. We illustrate two filtrations --- the weight rank clique filtration and the Vietoris--Rips (VR) filtration --- that are commonly used in persistent homology, and we apply these filtrations to a pair of data sets that are both related to the 2016 European Union "Brexit" referendum in the United Kingdom. These examples consider a topical situation and give useful illustrations of the strengths and weaknesses of these methods.
△ Less
Submitted 15 September, 2016;
originally announced October 2016.
-
The Role of the Hes1 Crosstalk Hub in Notch-Wnt Interactions of the Intestinal Crypt
Authors:
Sophie K. Kay,
Heather A. Harrington,
Sarah Shepherd,
Keith Brennan,
Trevor Dale,
James M. Osborne,
David J. Gavaghan,
Helen M. Byrne
Abstract:
The Notch pathway plays a vital role in determining whether cells in the intestinal epithelium adopt a secretory or an absorptive phenotype. Cell fate specification is coordinated via Notch's interaction with the canonical Wnt pathway. Here, we propose a new mathematical model of the Notch and Wnt pathways, in which the Hes1 promoter acts as a hub for pathway crosstalk. Computational simulations o…
▽ More
The Notch pathway plays a vital role in determining whether cells in the intestinal epithelium adopt a secretory or an absorptive phenotype. Cell fate specification is coordinated via Notch's interaction with the canonical Wnt pathway. Here, we propose a new mathematical model of the Notch and Wnt pathways, in which the Hes1 promoter acts as a hub for pathway crosstalk. Computational simulations of the model can assist in understanding how healthy intestinal tissue is maintained, and predict the likely consequences of biochemical knockouts upon cell fate selection processes. Chemical reaction network theory (CRNT) is a powerful, generalised framework which assesses the capacity of our model for monostability or multistability, by analysing properties of the underlying network structure without recourse to specific parameter values or functional forms for reaction rates. CRNT highlights the role of beta-catenin in stabilising the Notch pathway and dam** oscillations, demonstrating that Wnt-mediated actions on the Hes1 promoter can induce dynamical transitions in the Notch system, from multistability to monostability. Time-dependent model simulations of cell pairs reveal the stabilising influence of Wnt upon the Notch pathway, in which beta-catenin- and Dsh-mediated action on the Hes1 promoter are key in sha** the subcellular dynamics. Where Notch-mediated transcription of Hes1 dominates, there is Notch oscillation and maintenance of fate flexibility; Wnt-mediated transcription of Hes1 favours bistability akin to cell fate selection. Cells could therefore regulate the proportion of Wnt- and Notch-mediated control of the Hes1 promoter to coordinate the timing of cell fate selection as they migrate through the intestinal epithelium and are subject to reduced Wnt stimuli.
△ Less
Submitted 22 August, 2016;
originally announced August 2016.
-
The geometry of sloppiness
Authors:
Emilie Dufresne,
Heather A. Harrington,
Dhruva V. Raman
Abstract:
The use of mathematical models in the sciences often involves the estimation of unknown parameter values from data. Sloppiness provides information about the uncertainty of this task. In this paper, we develop a precise mathematical foundation for sloppiness and define rigorously its key concepts, such as `model manifold', in relation to concepts of structural identifiability. We redefine sloppine…
▽ More
The use of mathematical models in the sciences often involves the estimation of unknown parameter values from data. Sloppiness provides information about the uncertainty of this task. In this paper, we develop a precise mathematical foundation for sloppiness and define rigorously its key concepts, such as `model manifold', in relation to concepts of structural identifiability. We redefine sloppiness conceptually as a comparison between the premetric on parameter space induced by measurement noise and a reference metric. This opens up the possibility of alternative quantification of sloppiness, beyond the standard use of the Fisher Information Matrix, which assumes that parameter space is equipped with the usual Euclidean metric and the measurement error is infinitesimal. Applications include parametric statistical models, explicit time dependent models, and ordinary differential equation models.
△ Less
Submitted 14 March, 2018; v1 submitted 19 August, 2016;
originally announced August 2016.
-
Persistent homology of time-dependent functional networks constructed from coupled time series
Authors:
Bernadette J. Stolz,
Heather A. Harrington,
Mason A. Porter
Abstract:
We use topological data analysis to study "functional networks" that we construct from time-series data from both experimental and synthetic sources. We use persistent homology with a weight rank clique filtration to gain insights into these functional networks, and we use persistence landscapes to interpret our results. Our first example uses time-series output from networks of coupled Kuramoto o…
▽ More
We use topological data analysis to study "functional networks" that we construct from time-series data from both experimental and synthetic sources. We use persistent homology with a weight rank clique filtration to gain insights into these functional networks, and we use persistence landscapes to interpret our results. Our first example uses time-series output from networks of coupled Kuramoto oscillators. Our second example consists of biological data in the form of functional magnetic resonance imaging (fMRI) data that was acquired from human subjects during a simple motor-learning task in which subjects were monitored on three days in a five-day period. With these examples, we demonstrate that (1) using persistent homology to study functional networks provides fascinating insights into their properties and (2) the position of the features in a filtration can sometimes play a more vital role than persistence in the interpretation of topological features, even though conventionally the latter is used to distinguish between signal and noise. We find that persistent homology can detect differences in synchronization patterns in our data sets over time, giving insight both on changes in community structure in the networks and on increased synchronization between brain regions that form loops in a functional network during motor learning. For the motor-learning data, persistence landscapes also reveal that on average the majority of changes in the network loops take place on the second of the three days of the learning process.
△ Less
Submitted 3 December, 2016; v1 submitted 2 May, 2016;
originally announced May 2016.
-
Decomposing the parameter space of biological networks via a numerical discriminant approach
Authors:
Heather A. Harrington,
Dhagash Mehta,
Helen M. Byrne,
Jonathan D. Hauenstein
Abstract:
Many systems in biology, physics and engineering can be described by systems of ordinary differential equation containing many parameters. When studying the dynamic behavior of these large, nonlinear systems, it is useful to identify and characterize the steady-state solutions as the model parameters vary, a technically challenging problem in a high-dimensional parameter landscape. Rather than sim…
▽ More
Many systems in biology, physics and engineering can be described by systems of ordinary differential equation containing many parameters. When studying the dynamic behavior of these large, nonlinear systems, it is useful to identify and characterize the steady-state solutions as the model parameters vary, a technically challenging problem in a high-dimensional parameter landscape. Rather than simply determining the number and stability of steady-states at distinct points in parameter space, we decompose the parameter space into finitely many regions, the steady-state solutions being consistent within each distinct region. From a computational algebraic viewpoint, the boundary of these regions is contained in the discriminant locus. We develop global and local numerical algorithms for constructing the discriminant locus and classifying the parameter landscape. We showcase our numerical approaches by applying them to molecular and cell-network models.
△ Less
Submitted 9 April, 2016;
originally announced April 2016.
-
Differential Algebra for Model Comparison
Authors:
Heather A. Harrington,
Kenneth L. Ho,
Nicolette Meshkat
Abstract:
We present a method for rejecting competing models from noisy time-course data that does not rely on parameter inference. First we characterize ordinary differential equation models in only measurable variables using differential algebra elimination. Next we extract additional information from the given data using Gaussian Process Regression (GPR) and then transform the differential invariants. We…
▽ More
We present a method for rejecting competing models from noisy time-course data that does not rely on parameter inference. First we characterize ordinary differential equation models in only measurable variables using differential algebra elimination. Next we extract additional information from the given data using Gaussian Process Regression (GPR) and then transform the differential invariants. We develop a test using linear algebra and statistics to reject transformed models with the given data in a parameter-free manner. This algorithm exploits the information about transients that is encoded in the model's structure. We demonstrate the power of this approach by discriminating between different models from mathematical biology.
△ Less
Submitted 31 March, 2016;
originally announced March 2016.
-
Geometric combinatorics and computational molecular biology: branching polytopes for RNA sequences
Authors:
Elizabeth Drellich,
Andrew Gainer-Dewar,
Heather A. Harrington,
Qijun He,
Christine Heitsch,
Svetlana Poznanović
Abstract:
Questions in computational molecular biology generate various discrete optimization problems, such as DNA sequence alignment and RNA secondary structure prediction. However, the optimal solutions are fundamentally dependent on the parameters used in the objective functions. The goal of a parametric analysis is to elucidate such dependencies, especially as they pertain to the accuracy and robustnes…
▽ More
Questions in computational molecular biology generate various discrete optimization problems, such as DNA sequence alignment and RNA secondary structure prediction. However, the optimal solutions are fundamentally dependent on the parameters used in the objective functions. The goal of a parametric analysis is to elucidate such dependencies, especially as they pertain to the accuracy and robustness of the optimal solutions. Techniques from geometric combinatorics, including polytopes and their normal fans, have been used previously to give parametric analyses of simple models for DNA sequence alignment and RNA branching configurations. Here, we present a new computational framework, and proof-of-principle results, which give the first complete parametric analysis of the branching portion of the nearest neighbor thermodynamic model for secondary structure prediction for real RNA sequences.
△ Less
Submitted 16 June, 2016; v1 submitted 14 September, 2015;
originally announced September 2015.
-
Reduction of dimension for nonlinear dynamical systems
Authors:
Heather A. Harrington,
Robert A. Van Gorder
Abstract:
We consider reduction of dimension for nonlinear dynamical systems. We demonstrate that in some cases, one can reduce a nonlinear system of equations into a single equation for one of the state variables, and this can be useful for computing the solution when using a variety of analytical approaches. In the case where this reduction is possible, we employ differential elimination to obtain the red…
▽ More
We consider reduction of dimension for nonlinear dynamical systems. We demonstrate that in some cases, one can reduce a nonlinear system of equations into a single equation for one of the state variables, and this can be useful for computing the solution when using a variety of analytical approaches. In the case where this reduction is possible, we employ differential elimination to obtain the reduced system. While analytical, the approach is algorithmic, and is implemented in symbolic software such as {\sc MAPLE} or {\sc SageMath}. In other cases, the reduction cannot be performed strictly in terms of differential operators, and one obtains integro-differential operators, which may still be useful. In either case, one can use the reduced equation to both approximate solutions for the state variables and perform chaos diagnostics more efficiently than could be done for the original higher-dimensional system, as well as to construct Lyapunov functions which help in the large-time study of the state variables. A number of chaotic and hyperchaotic dynamical systems are used as examples in order to motivate the approach.
△ Less
Submitted 24 August, 2015;
originally announced August 2015.
-
Numerical algebraic geometry for model selection and its application to the life sciences
Authors:
Elizabeth Gross,
Brent Davis,
Kenneth L. Ho,
Daniel J. Bates,
Heather A. Harrington
Abstract:
Researchers working with mathematical models are often confronted by the related problems of parameter estimation, model validation, and model selection. These are all optimization problems, well-known to be challenging due to non-linearity, non-convexity and multiple local optima. Furthermore, the challenges are compounded when only partial data is available. Here, we consider polynomial models (…
▽ More
Researchers working with mathematical models are often confronted by the related problems of parameter estimation, model validation, and model selection. These are all optimization problems, well-known to be challenging due to non-linearity, non-convexity and multiple local optima. Furthermore, the challenges are compounded when only partial data is available. Here, we consider polynomial models (e.g., mass-action chemical reaction networks at steady state) and describe a framework for their analysis based on optimization using numerical algebraic geometry. Specifically, we use probability-one polynomial homotopy continuation methods to compute all critical points of the objective function, then filter to recover the global optima. Our approach exploits the geometric structures relating models and data, and we demonstrate its utility on examples from cell signaling, synthetic biology, and epidemiology.
△ Less
Submitted 1 April, 2016; v1 submitted 15 July, 2015;
originally announced July 2015.
-
A roadmap for the computation of persistent homology
Authors:
Nina Otter,
Mason A. Porter,
Ulrike Tillmann,
Peter Grindrod,
Heather A. Harrington
Abstract:
Persistent homology (PH) is a method used in topological data analysis (TDA) to study qualitative features of data that persist across multiple scales. It is robust to perturbations of input data, independent of dimensions and coordinates, and provides a compact representation of the qualitative features of the input. The computation of PH is an open area with numerous important and fascinating ch…
▽ More
Persistent homology (PH) is a method used in topological data analysis (TDA) to study qualitative features of data that persist across multiple scales. It is robust to perturbations of input data, independent of dimensions and coordinates, and provides a compact representation of the qualitative features of the input. The computation of PH is an open area with numerous important and fascinating challenges. The field of PH computation is evolving rapidly, and new algorithms and software implementations are being updated and released at a rapid pace. The purposes of our article are to (1) introduce theory and computational methods for PH to a broad range of computational scientists and (2) provide benchmarks of state-of-the-art implementations for the computation of PH. We give a friendly introduction to PH, navigate the pipeline for the computation of PH with an eye towards applications, and use a range of synthetic and real-world data sets to evaluate currently available open-source implementations for the computation of PH. Based on our benchmarking, we indicate which algorithms and implementations are best suited to different types of data sets. In an accompanying tutorial, we provide guidelines for the computation of PH. We make publicly available all scripts that we wrote for the tutorial, and we make available the processed version of the data sets used in the benchmarking.
△ Less
Submitted 12 September, 2017; v1 submitted 29 June, 2015;
originally announced June 2015.