Search | arXiv e-print repository

An Intrinsic Approach to Scalar-Curvature Estimation for Point Clouds

Authors: Abigail Hickok, Andrew J. Blumberg

Abstract: We introduce an intrinsic estimator for the scalar curvature of a data set presented as a finite metric space. Our estimator depends only on the metric structure of the data and not on an embedding in $\mathbb{R}^n$. We show that the estimator is consistent in the sense that for points sampled from a probability measure on a compact Riemannian manifold, the estimator converges to the scalar curvat… ▽ More We introduce an intrinsic estimator for the scalar curvature of a data set presented as a finite metric space. Our estimator depends only on the metric structure of the data and not on an embedding in $\mathbb{R}^n$. We show that the estimator is consistent in the sense that for points sampled from a probability measure on a compact Riemannian manifold, the estimator converges to the scalar curvature as the number of points increases. To justify its use in applications, we show that the estimator is stable with respect to perturbations of the metric structure, e.g., noise in the sample or error estimating the intrinsic metric. We validate our estimator experimentally on synthetic data that is sampled from manifolds with specified curvature. △ Less

Submitted 4 August, 2023; originally announced August 2023.

Comments: 37 pages, 5 figures

arXiv:2210.06424 [pdf, other]

Computing Persistence Diagram Bundles

Authors: Abigail Hickok

Abstract: Persistence diagram (PD) bundles, a generalization of vineyards, were introduced as a way to study the persistent homology of a set of filtrations parameterized by a topological space $B$. In this paper, we present an algorithm for computing piecewise-linear PD bundles, a wide class that includes many of the PD bundles that one may encounter in practice. Full details are given for the case in whic… ▽ More Persistence diagram (PD) bundles, a generalization of vineyards, were introduced as a way to study the persistent homology of a set of filtrations parameterized by a topological space $B$. In this paper, we present an algorithm for computing piecewise-linear PD bundles, a wide class that includes many of the PD bundles that one may encounter in practice. Full details are given for the case in which $B$ is a triangulated surface, and we outline the generalization to higher dimensions and other cases. △ Less

Submitted 19 September, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

Comments: 21 pages, 8 figures. Expository changes throughout

MSC Class: 55N31; 55R99; 65D99

arXiv:2210.05124 [pdf, other]

Persistence Diagram Bundles: A Multidimensional Generalization of Vineyards

Authors: Abigail Hickok

Abstract: I introduce the concept of a persistence diagram (PD) bundle, which is the space of PDs for a fibered filtration function (a set $\{f_p: \mathcal{K}^p \to \mathbb{R}\}_{p \in B}$ of filtrations that is parameterized by a topological space $B$). Special cases include vineyards, the persistent homology transform, and fibered barcodes for multiparameter persistence modules. I prove that if $B$ is a s… ▽ More I introduce the concept of a persistence diagram (PD) bundle, which is the space of PDs for a fibered filtration function (a set $\{f_p: \mathcal{K}^p \to \mathbb{R}\}_{p \in B}$ of filtrations that is parameterized by a topological space $B$). Special cases include vineyards, the persistent homology transform, and fibered barcodes for multiparameter persistence modules. I prove that if $B$ is a smooth compact manifold, then for a generic fibered filtration function, $B$ is stratified such that within each stratum $Y \subseteq B$, there is a single PD "template" (a list of "birth" and "death" simplices) that can be used to obtain the PD for the filtration $f_p$ for any $p \in Y$. If $B$ is compact, then there are finitely many strata, so the PD bundle for a generic fibered filtration on $B$ is determined by the persistent homology at finitely many points in $B$. I also show that not every local section can be extended to a global section (a continuous map $s$ from $B$ to the total space $E$ of PDs such that $s(p) \in \textrm{PD}(f_p)$ for all $p \in B$). Consequently, a PD bundle is not necessarily the union of "vines" $γ: B \to E$; this is unlike a vineyard. When there is a stratification as described above, I construct a cellular sheaf that stores sufficient data to construct sections and determine whether a given local section can be extended to a global section. △ Less

Submitted 11 August, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

Comments: 40 pages, 8 figures. Substantial mathematical additions and expository changes throughout

MSC Class: 55N31; 55R99; 57N80

arXiv:2206.04834 [pdf, other]

Persistent Homology for Resource Coverage: A Case Study of Access to Polling Sites

Authors: Abigail Hickok, Benjamin Jarman, Michael Johnson, Jiajie Luo, Mason A. Porter

Abstract: It is important to choose the geographical distributions of public resources in a fair and equitable manner. However, it is complicated to quantify the equity of such a distribution; important factors include distances to resource sites, availability of transportation, and ease of travel. We use persistent homology, which is a tool from topological data analysis, to study the effective availabilit… ▽ More It is important to choose the geographical distributions of public resources in a fair and equitable manner. However, it is complicated to quantify the equity of such a distribution; important factors include distances to resource sites, availability of transportation, and ease of travel. We use persistent homology, which is a tool from topological data analysis, to study the effective availability and coverage of polling sites. The information from persistent homology allows us to infer holes in the distribution of polling sites. We analyze and compare the coverage of polling sites in Los Angeles County and five cities (Atlanta, Chicago, Jacksonville, New York City, and Salt Lake City), and we conclude that computation of persistent homology appears to be a reasonable approach to analyzing resource coverage. △ Less

Submitted 11 August, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

Comments: revised version

MSC Class: 55N31; 91D20; 91B18

arXiv:2112.03334 [pdf, other]

A Family of Density-Scaled Filtered Complexes

Authors: Abigail Hickok

Abstract: We develop novel methods for using persistent homology to infer the homology of an unknown Riemannian manifold $(M, g)$ from a point cloud sampled from an arbitrary smooth probability density function. Standard distance-based filtered complexes, such as the Čech complex, often have trouble distinguishing noise from features that are simply small. We address this problem by defining a family of "de… ▽ More We develop novel methods for using persistent homology to infer the homology of an unknown Riemannian manifold $(M, g)$ from a point cloud sampled from an arbitrary smooth probability density function. Standard distance-based filtered complexes, such as the Čech complex, often have trouble distinguishing noise from features that are simply small. We address this problem by defining a family of "density-scaled filtered complexes" that includes a density-scaled Čech complex and a density-scaled Vietoris--Rips complex. We show that the density-scaled Čech complex is homotopy-equivalent to $M$ for filtration values in an interval whose starting point converges to $0$ in probability as the number of points $N \to \infty$ and whose ending point approaches infinity as $N \to \infty$. By contrast, the standard Čech complex may only be homotopy-equivalent to $M$ for a very small range of filtration values. The density-scaled filtered complexes also have the property that they are invariant under conformal transformations, such as scaling. We implement a filtered complex $\widehat{DVR}$ that approximates the density-scaled Vietoris--Rips complex, and we empirically test the performance of our implementation. As examples, we use $\widehat{DVR}$ to identify clusters that have different densities, and we apply $\widehat{DVR}$ to a time-delay embedding of the Lorenz dynamical system. Our implementation is stable (under conditions that are almost surely satisfied) and designed to handle outliers in the point cloud that do not lie on $M$. △ Less

Submitted 5 January, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

Comments: submitted; minor revisions

MSC Class: 55N31 (Primary) 60B99; 53Z50 (Secondary)

arXiv:2107.09188 [pdf, other]

Analysis of Spatial and Spatiotemporal Anomalies Using Persistent Homology: Case Studies with COVID-19 Data

Authors: Abigail Hickok, Deanna Needell, Mason A. Porter

Abstract: We develop a method for analyzing spatial and spatiotemporal anomalies in geospatial data using topological data analysis (TDA). To do this, we use persistent homology (PH), which allows one to algorithmically detect geometric voids in a data set and quantify the persistence of such voids. We construct an efficient filtered simplicial complex (FSC) such that the voids in our FSC are in one-to-one… ▽ More We develop a method for analyzing spatial and spatiotemporal anomalies in geospatial data using topological data analysis (TDA). To do this, we use persistent homology (PH), which allows one to algorithmically detect geometric voids in a data set and quantify the persistence of such voids. We construct an efficient filtered simplicial complex (FSC) such that the voids in our FSC are in one-to-one correspondence with the anomalies. Our approach goes beyond simply identifying anomalies; it also encodes information about the relationships between anomalies. We use vineyards, which one can interpret as time-varying persistence diagrams (which are an approach for visualizing PH), to track how the locations of the anomalies change with time. We conduct two case studies using spatially heterogeneous COVID-19 data. First, we examine vaccination rates in New York City by zip code at a single point in time. Second, we study a year-long data set of COVID-19 case rates in neighborhoods of the city of Los Angeles. △ Less

Submitted 24 February, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

Comments: revised version

MSC Class: 55N31; 68T09; 92D30

arXiv:2104.00720 [pdf, other]

Topological Data Analysis of Spatial Systems

Authors: Michelle Feng, Abigail Hickok, Mason A. Porter

Abstract: In this chapter, we discuss applications of topological data analysis (TDA) to spatial systems. We briefly review the recently proposed level-set construction of filtered simplicial complexes, and we then examine persistent homology in two cases studies: street networks in Shanghai and hotspots of COVID-19 infections. We then summarize our results and provide an outlook on TDA in spatial systems. In this chapter, we discuss applications of topological data analysis (TDA) to spatial systems. We briefly review the recently proposed level-set construction of filtered simplicial complexes, and we then examine persistent homology in two cases studies: street networks in Shanghai and hotspots of COVID-19 infections. We then summarize our results and provide an outlook on TDA in spatial systems. △ Less

Submitted 1 April, 2021; originally announced April 2021.

Comments: draft of book chapter

arXiv:2102.06825 [pdf, other]

A Bounded-Confidence Model of Opinion Dynamics on Hypergraphs

Authors: Abigail Hickok, Yacoub Kureh, Heather Z. Brooks, Michelle Feng, Mason A. Porter

Abstract: People's opinions evolve over time as they interact with their friends, family, colleagues, and others. In the study of opinion dynamics on networks, one often encodes interactions between people in the form of dyadic relationships, but many social interactions in real life are polyadic (i.e., they involve three or more people). In this paper, we extend an asynchronous bounded-confidence model (BC… ▽ More People's opinions evolve over time as they interact with their friends, family, colleagues, and others. In the study of opinion dynamics on networks, one often encodes interactions between people in the form of dyadic relationships, but many social interactions in real life are polyadic (i.e., they involve three or more people). In this paper, we extend an asynchronous bounded-confidence model (BCM) on graphs, in which nodes are connected pairwise by edges, to an asynchronous BCM on hypergraphs, in which arbitrarily many nodes can be connected by a single hyperedge. We show that our hypergraph BCM converges to consensus under a wide range of initial conditions for the opinions of the nodes, including for non-uniform and asymmetric initial opinion distributions. We also show that, under suitable conditions, echo chambers can form on hypergraphs with community structure. We demonstrate that the opinions of individuals can sometimes jump from one opinion cluster to another in a single time step, a phenomenon (which we call ``opinion jum**'') that is not possible in standard dyadic BCMs. Additionally, we observe that there is a phase transition in the convergence time on {a complete hypergraph} when the variance $σ^2$ of the initial opinion distribution equals the confidence bound $c$. We prove that the convergence time grows at least exponentially fast with the number of nodes when $σ^2 > c$ and the initial opinions are normally distributed. Therefore, to determine the convergence properties of our hypergraph BCM when the variance and the number of hyperedges are both large, it is necessary to use analytical methods instead of relying only on Monte Carlo simulations. △ Less

Submitted 9 August, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

Comments: revised version

arXiv:2004.07036 [pdf]

Connecting the Dots: Discovering the "Shape" of Data

Authors: Michelle Feng, Abigail Hickok, Yacoub H. Kureh, Mason A. Porter, Chad M. Topaz

Abstract: Scientists use a mathematical subject called 'topology' to study the shapes of objects. An important part of topology is counting the numbers of pieces and holes in objects, and people use this information to group objects into different types. For example, a doughnut has the same number of holes and the same number of pieces as a teacup with one handle, but it is different from a ball. In studies… ▽ More Scientists use a mathematical subject called 'topology' to study the shapes of objects. An important part of topology is counting the numbers of pieces and holes in objects, and people use this information to group objects into different types. For example, a doughnut has the same number of holes and the same number of pieces as a teacup with one handle, but it is different from a ball. In studies that resemble activities like "connect the dots", scientists use ideas from topology to study the shape of data. Data can take many possible forms: a picture made of dots, a large collection of numbers from a scientific experiment, or something else. The approach in these studies is called 'topological data analysis', and it has been used to study the branching structures of veins in leaves, how people vote in elections, flight patterns in models of bird flocking, and more. Scientists can take data on the way veins branch on leaves and use topological data analysis to divide the leaves into different groups and discover patterns that may otherwise be hard to find. △ Less

Submitted 8 September, 2020; v1 submitted 13 April, 2020; originally announced April 2020.

Comments: This article, which is under review in Frontiers for Young Minds, to introduce young readers (of ages roughly 12--14) to topological data analysis. We would appreciate receiving feedback from you and your children

Showing 1–9 of 9 results for author: Hickok, A