-
Computing $p$-presentation distances is hard
Authors:
Håvard Bakke Bjerkevik,
Magnus Bakke Botnan
Abstract:
Recently, $p$-presentation distances for $p\in [1,\infty]$ were introduced for merge trees and multiparameter persistence modules as more sensitive variations of the respective interleaving distances ($p=\infty$). It is well-known that computing the interleaving distance is NP-hard in both cases. We extend this result by showing that computing the $p$-presentation distance is NP-hard for all…
▽ More
Recently, $p$-presentation distances for $p\in [1,\infty]$ were introduced for merge trees and multiparameter persistence modules as more sensitive variations of the respective interleaving distances ($p=\infty$). It is well-known that computing the interleaving distance is NP-hard in both cases. We extend this result by showing that computing the $p$-presentation distance is NP-hard for all $p\in [1,\infty)$ for both merge trees and $t$-parameter persistence modules for any $t\geq 2$. Though the details differ, both proofs follow the same novel strategy, suggesting that our approach can be adapted to proving the NP-hardness of other distances based on sums or $p$-norms.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
Stable Vectorization of Multiparameter Persistent Homology using Signed Barcodes as Measures
Authors:
David Loiseaux,
Luis Scoccola,
Mathieu Carrière,
Magnus Bakke Botnan,
Steve Oudot
Abstract:
Persistent homology (PH) provides topological descriptors for geometric data, such as weighted graphs, which are interpretable, stable to perturbations, and invariant under, e.g., relabeling. Most applications of PH focus on the one-parameter case -- where the descriptors summarize the changes in topology of data as it is filtered by a single quantity of interest -- and there is now a wide array o…
▽ More
Persistent homology (PH) provides topological descriptors for geometric data, such as weighted graphs, which are interpretable, stable to perturbations, and invariant under, e.g., relabeling. Most applications of PH focus on the one-parameter case -- where the descriptors summarize the changes in topology of data as it is filtered by a single quantity of interest -- and there is now a wide array of methods enabling the use of one-parameter PH descriptors in data science, which rely on the stable vectorization of these descriptors as elements of a Hilbert space. Although the multiparameter PH (MPH) of data that is filtered by several quantities of interest encodes much richer information than its one-parameter counterpart, the scarceness of stability results for MPH descriptors has so far limited the available options for the stable vectorization of MPH. In this paper, we aim to bring together the best of both worlds by showing how the interpretation of signed barcodes -- a recent family of MPH descriptors -- as signed measures leads to natural extensions of vectorization strategies from one parameter to multiple parameters. The resulting feature vectors are easy to define and to compute, and provably stable. While, as a proof of concept, we focus on simple choices of signed barcodes and vectorizations, we already see notable performance improvements when comparing our feature vectors to state-of-the-art topology-based methods on various types of data.
△ Less
Submitted 7 February, 2024; v1 submitted 6 June, 2023;
originally announced June 2023.
-
On the bottleneck stability of rank decompositions of multi-parameter persistence modules
Authors:
Magnus Bakke Botnan,
Steffen Oppermann,
Steve Oudot,
Luis Scoccola
Abstract:
A significant part of modern topological data analysis is concerned with the design and study of algebraic invariants of poset representations -- often referred to as multi-parameter persistence modules. One such invariant is the minimal rank decomposition, which encodes the ranks of all the structure morphisms of the persistence module by a single ordered pair of rectangle-decomposable modules, i…
▽ More
A significant part of modern topological data analysis is concerned with the design and study of algebraic invariants of poset representations -- often referred to as multi-parameter persistence modules. One such invariant is the minimal rank decomposition, which encodes the ranks of all the structure morphisms of the persistence module by a single ordered pair of rectangle-decomposable modules, interpreted as a signed barcode. This signed barcode generalizes the concept of persistence barcode from one-parameter persistence to any number of parameters, raising the question of its bottleneck stability. We show in this paper that the minimal rank decomposition is not stable under the natural notion of signed bottleneck matching between signed barcodes. We remedy this by turning our focus to the rank exact decomposition, a related signed barcode induced by the minimal projective resolution of the module relative to the so-called rank exact structure, which we prove to be bottleneck stable under signed matchings. As part of our proof, we obtain two intermediate results of independent interest: we compute the global dimension of the rank exact structure on the category of finitely presentable multi-parameter persistence modules, and we prove a bottleneck stability result for hook-decomposable modules. We also give a bound for the size of the rank exact decomposition that is polynomial in the size of the usual minimal projective resolution, we prove a universality result for the dissimilarity function induced by the notion of signed matching, and we compute, in the two-parameter case, the global dimension of a different exact structure related to the upsets of the indexing poset. This set of results combines concepts from topological data analysis and from the representation theory of posets, and we believe is relevant to both areas.
△ Less
Submitted 5 March, 2024; v1 submitted 30 July, 2022;
originally announced August 2022.
-
An Introduction to Multiparameter Persistence
Authors:
Magnus Bakke Botnan,
Michael Lesnick
Abstract:
In topological data analysis (TDA), one often studies the shape of data by constructing a filtered topological space, whose structure is then examined using persistent homology. However, a single filtered space often does not adequately capture the structure of interest in the data, and one is led to consider multiparameter persistence, which associates to the data a space equipped with a multipar…
▽ More
In topological data analysis (TDA), one often studies the shape of data by constructing a filtered topological space, whose structure is then examined using persistent homology. However, a single filtered space often does not adequately capture the structure of interest in the data, and one is led to consider multiparameter persistence, which associates to the data a space equipped with a multiparameter filtration. Multiparameter persistence has become one of the most active areas of research within TDA, with exciting progress on several fronts. In this article, we introduce multiparameter persistence and survey some of this recent progress, with a focus on ideas likely to lead to practical applications in the near future.
△ Less
Submitted 13 March, 2023; v1 submitted 27 March, 2022;
originally announced March 2022.
-
Structure and Interleavings of Relative Interlevel Set Cohomology
Authors:
Ulrich Bauer,
Magnus Bakke Botnan,
Benedikt Fluhr
Abstract:
The relative interlevel set cohomology (RISC) is an invariant of real-valued continuous functions closely related to the Mayer--Vietoris pyramid introduced by Carlsson, de Silva, and Morozov. As such, the relative interlevel set cohomology is a parametrization of the cohomology vector spaces of all open interlevel sets relative complements of closed interlevel sets. We provide a structure theorem,…
▽ More
The relative interlevel set cohomology (RISC) is an invariant of real-valued continuous functions closely related to the Mayer--Vietoris pyramid introduced by Carlsson, de Silva, and Morozov. As such, the relative interlevel set cohomology is a parametrization of the cohomology vector spaces of all open interlevel sets relative complements of closed interlevel sets. We provide a structure theorem, which applies to the RISC of real-valued continuous functions whose open interlevel sets have finite-dimensional cohomology in each degree. Moreover, we show this tameness assumption is in some sense equivalent to $q$-tameness as introduced by Chazal, de Silva, Glisse, and Oudot. Furthermore, we provide the notion of an interleaving for RISC and we show that it is stable in the sense that any space with two functions that are $δ$-close induces a $δ$-interleaving of the corresponding relative interlevel set cohomologies. Finally, we provide an elementary form of quantitative homotopy invariance for RISC.
△ Less
Submitted 30 May, 2022; v1 submitted 20 August, 2021;
originally announced August 2021.
-
Signed Barcodes for Multi-Parameter Persistence via Rank Decompositions and Rank-Exact Resolutions
Authors:
Magnus Bakke Botnan,
Steffen Oppermann,
Steve Oudot
Abstract:
In this paper we introduce the signed barcode, a new visual representation of the global structure of the rank invariant of a multi-parameter persistence module or, more generally, of a poset representation. Like its unsigned counterpart in one-parameter persistence, the signed barcode encodes the rank invariant as a $\mathbb{Z}$-linear combination of rank invariants of indicator modules supported…
▽ More
In this paper we introduce the signed barcode, a new visual representation of the global structure of the rank invariant of a multi-parameter persistence module or, more generally, of a poset representation. Like its unsigned counterpart in one-parameter persistence, the signed barcode encodes the rank invariant as a $\mathbb{Z}$-linear combination of rank invariants of indicator modules supported on segments in the poset. It can also be enriched to encode the generalized rank invariant as a $\mathbb{Z}$-linear combination of generalized rank invariants in fixed classes of interval modules. In the paper we develop the theory behind these rank invariant decompositions, showing under what conditions they exist and are unique -- so the signed barcode is canonically defined. We also connect them to the line of work on generalized persistence diagrams via Möbius inversions, deriving explicit formulas to compute a rank decomposition and its associated signed barcode. Finally, we show that, similarly to its unsigned counterpart, the signed barcode has its roots in algebra, coming from a projective resolution of the module in some exact category. To complete the picture, we show some experimental results that illustrate the contribution of the signed barcode in the exploration of multi-parameter persistence modules.
△ Less
Submitted 29 March, 2022; v1 submitted 14 July, 2021;
originally announced July 2021.
-
Universal Distances for Extended Persistence
Authors:
Ulrich Bauer,
Magnus Bakke Botnan,
Benedikt Fluhr
Abstract:
The extended persistence diagram is an invariant of piecewise linear functions, which is known to be stable under perturbations of functions with respect to the bottleneck distance as introduced by Cohen-Steiner, Edelsbrunner, and Harer. We address the question of universality, which asks for the largest possible stable distance on extended persistence diagrams, showing that a more discriminative…
▽ More
The extended persistence diagram is an invariant of piecewise linear functions, which is known to be stable under perturbations of functions with respect to the bottleneck distance as introduced by Cohen-Steiner, Edelsbrunner, and Harer. We address the question of universality, which asks for the largest possible stable distance on extended persistence diagrams, showing that a more discriminative variant of the bottleneck distance is universal. Our result applies more generally to settings where persistence diagrams are considered only up to a certain degree. We achieve our results by establishing a functorial construction and several characteristic properties of relative interlevel set homology, which mirror the classical Eilenberg--Steenrod axioms. Finally, we contrast the bottleneck distance with the interleaving distance of sheaves on the real line by showing that the latter is not intrinsic, let alone universal. This particular result has the further implication that the interleaving distance of Reeb graphs is not intrinsic either.
△ Less
Submitted 11 March, 2024; v1 submitted 3 July, 2020;
originally announced July 2020.
-
A Relative Theory of Interleavings
Authors:
Magnus Bakke Botnan,
Justin Curry,
Elizabeth Munch
Abstract:
The interleaving distance, although originally developed for persistent homology, has been generalized to measure the distance between functors modeled on many posets or even small categories. Existing theories require that such a poset have a superlinear family of translations or a similar structure. However, many posets of interest to topological data analysis, such as zig-zag posets and the fac…
▽ More
The interleaving distance, although originally developed for persistent homology, has been generalized to measure the distance between functors modeled on many posets or even small categories. Existing theories require that such a poset have a superlinear family of translations or a similar structure. However, many posets of interest to topological data analysis, such as zig-zag posets and the face relation poset of a cell-complex, do not admit interesting translations, and consequently don't admit a nice theory of interleavings. In this paper we show how one can side-step this limitation by providing a general theory where one maps to a poset that does admit interesting translations, such as the lattice of down sets, and then defines interleavings relative to this map. Part of our theory includes a rigorous notion of discretization or "pixelization" of poset modules, which in turn we use for interleaving inference. We provide an approximation condition that in the setting of lattices gives rise to two possible pixelizations, both of which are guaranteed to be close in the interleaving distance. Finally, we conclude by considering interleaving inference for cosheaves over a metric space and give an explicit description of interleavings over a grid structure on Euclidean space.
△ Less
Submitted 29 April, 2020;
originally announced April 2020.
-
On rectangle-decomposable 2-parameter persistence modules
Authors:
Magnus Bakke Botnan,
Vadim Lebovici,
Steve Oudot
Abstract:
This paper addresses two questions: (a) can we identify a sensible class of 2-parameter persistence modules on which the rank invariant is complete? (b) can we determine efficiently whether a given 2-parameter persistence module belongs to this class? We provide positive answers to both questions, and our class of interest is that of rectangle-decomposable modules. Our contributions include: on th…
▽ More
This paper addresses two questions: (a) can we identify a sensible class of 2-parameter persistence modules on which the rank invariant is complete? (b) can we determine efficiently whether a given 2-parameter persistence module belongs to this class? We provide positive answers to both questions, and our class of interest is that of rectangle-decomposable modules. Our contributions include: on the one hand, a proof that the rank invariant is complete on rectangle-decomposable modules, together with an inclusion-exclusion formula for counting the multiplicities of the summands; on the other hand, algorithms to check whether a module induced in homology by a bifiltration is rectangle-decomposable, and to decompose it in the affirmative, with a better complexity than state-of-the-art decomposition methods for general 2-parameter persistence modules. Our algorithms are backed up by a new structure theorem, whereby a 2-parameter persistence module is rectangle-decomposable if, and only if, its restrictions to squares are. This local characterization is key to the efficiency of our algorithms, and it generalizes previous conditions derived for the smaller class of block-decomposable modules. It also admits an algebraic formulation that turns out to be a weaker version of the one for block-decomposability. By contrast, we show that general interval-decomposability does not admit such a local characterization, even when locality is understood in a broad sense. Our analysis focuses on the case of modules indexed over finite grids, the more general cases are left as future work.
△ Less
Submitted 4 February, 2022; v1 submitted 20 February, 2020;
originally announced February 2020.
-
Computing the interleaving distance is NP-hard
Authors:
Håvard Bakke Bjerkevik,
Magnus Bakke Botnan,
Michael Kerber
Abstract:
We show that computing the interleaving distance between two multi-graded persistence modules is NP-hard. More precisely, we show that deciding whether two modules are $1$-interleaved is NP-complete, already for bigraded, interval decomposable modules. Our proof is based on previous work showing that a constrained matrix invertibility problem can be reduced to the interleaving distance computation…
▽ More
We show that computing the interleaving distance between two multi-graded persistence modules is NP-hard. More precisely, we show that deciding whether two modules are $1$-interleaved is NP-complete, already for bigraded, interval decomposable modules. Our proof is based on previous work showing that a constrained matrix invertibility problem can be reduced to the interleaving distance computation of a special type of persistence modules. We show that this matrix invertibility problem is NP-complete. We also give a slight improvement of the above reduction, showing that also the approximation of the interleaving distance is NP-hard for any approximation factor smaller than $3$. Additionally, we obtain corresponding hardness results for the case that the modules are indecomposable, and in the setting of one-sided stability. Furthermore, we show that checking for injections (resp. surjections) between persistence modules is NP-hard. In conjunction with earlier results from computational algebra this gives a complete characterization of the computational complexity of one-sided stability. Lastly, we show that it is in general NP-hard to approximate distances induced by noise systems within a factor of 2.
△ Less
Submitted 9 October, 2019; v1 submitted 22 November, 2018;
originally announced November 2018.
-
Computational Complexity of the Interleaving Distance
Authors:
Håvard Bakke Bjerkevik,
Magnus Bakke Botnan
Abstract:
The interleaving distance is arguably the most prominent distance measure in topological data analysis. In this paper, we provide bounds on the computational complexity of determining the interleaving distance in several settings. We show that the interleaving distance is NP-hard to compute for persistence modules valued in the category of vector spaces. In the specific setting of multidimensional…
▽ More
The interleaving distance is arguably the most prominent distance measure in topological data analysis. In this paper, we provide bounds on the computational complexity of determining the interleaving distance in several settings. We show that the interleaving distance is NP-hard to compute for persistence modules valued in the category of vector spaces. In the specific setting of multidimensional persistent homology we show that the problem is at least as hard as a matrix invertibility problem. Furthermore, this allows us to conclude that the interleaving distance of interval decomposable modules depends on the characteristic of the field. Persistence modules valued in the category of sets are also studied. As a corollary, we obtain that the isomorphism problem for Reeb graphs is graph isomorphism complete.
△ Less
Submitted 30 April, 2018; v1 submitted 12 December, 2017;
originally announced December 2017.
-
Algebraic Stability of Zigzag Persistence Modules
Authors:
Magnus Bakke Botnan,
Michael Lesnick
Abstract:
The stability theorem for persistent homology is a central result in topological data analysis. While the original formulation of the result concerns the persistence barcodes of $\mathbb{R}$-valued functions, the result was later cast in a more general algebraic form, in the language of \emph{persistence modules} and \emph{interleavings}. In this paper, we establish an analogue of this algebraic s…
▽ More
The stability theorem for persistent homology is a central result in topological data analysis. While the original formulation of the result concerns the persistence barcodes of $\mathbb{R}$-valued functions, the result was later cast in a more general algebraic form, in the language of \emph{persistence modules} and \emph{interleavings}. In this paper, we establish an analogue of this algebraic stability theorem for zigzag persistence modules. To do so, we functorially extend each zigzag persistence module to a two-dimensional persistence module, and establish an algebraic stability theorem for these extensions. One part of our argument yields a stability result for free two-dimensional persistence modules.
As an application of our main theorem, we strengthen a result of Bauer et al. on the stability of the persistent homology of Reeb graphs. Our main result also yields an alternative proof of the stability theorem for level set persistent homology of Carlsson et al.
△ Less
Submitted 6 April, 2017; v1 submitted 3 April, 2016;
originally announced April 2016.
-
Approximating Persistent Homology in Euclidean Space Through Collapses
Authors:
Magnus Bakke Botnan,
Gard Spreemann
Abstract:
The Čech complex is one of the most widely used tools in applied algebraic topology. Unfortunately, due to the inclusive nature of the Čech filtration, the number of simplices grows exponentially in the number of input points. A practical consequence is that computations may have to terminate at smaller scales than what the application calls for.
In this paper we propose two methods to approxima…
▽ More
The Čech complex is one of the most widely used tools in applied algebraic topology. Unfortunately, due to the inclusive nature of the Čech filtration, the number of simplices grows exponentially in the number of input points. A practical consequence is that computations may have to terminate at smaller scales than what the application calls for.
In this paper we propose two methods to approximate the Čech persistence module. Both are constructed on the level of spaces, i.e. as sequences of simplicial complexes induced by nerves. We also show how the bottleneck distance between such persistence modules can be understood by how tightly they are sandwiched on the level of spaces. In turn, this implies the correctness of our approximation methods.
Finally, we implement our methods and apply them to some example point clouds in Euclidean space.
△ Less
Submitted 20 August, 2014; v1 submitted 3 March, 2014;
originally announced March 2014.