-
Graphcode: Learning from multiparameter persistent homology using graph neural networks
Authors:
Michael Kerber,
Florian Russold
Abstract:
We introduce graphcodes, a novel multi-scale summary of the topological properties of a dataset that is based on the well-established theory of persistent homology. Graphcodes handle datasets that are filtered along two real-valued scale parameters. Such multi-parameter topological summaries are usually based on complicated theoretical foundations and difficult to compute; in contrast, graphcodes yield an informative and interpretable summary and can be computed as efficiently as one-parameter summaries. Moreover, a graphcode is simply an embedded graph and can therefore be readily integrated in machine learning pipelines using graph neural networks. We describe such a pipeline and demonstrate that graphcodes achieve better classification accuracy than state-of-the-art approaches on various datasets.
Submitted 23 May, 2024;
originally announced May 2024.
-
Experimental realisation of a universal inverse-design magnonic device
Authors:
Noura Zenbaa,
Claas Abert,
Fabian Majcen,
Michael Kerber,
Rostyslav O. Serha,
Sebastian Knauer,
Qi Wang,
Thomas Schrefl,
Dieter Suess,
Andrii V. Chumak
Abstract:
In the field of magnonics, which uses magnons, the quanta of spin waves, for energy-efficient data processing, significant progress has been made leveraging the capabilities of the inverse design concept. This approach involves defining a desired functionality and employing a feedback-loop algorithm to optimise the device design. In this study, we present the first experimental demonstration of a reconfigurable, lithography-free, and simulation-free inverse-design device capable of implementing various RF components. The device features a square array of independent direct current loops that generate a complex reconfigurable magnetic medium atop a Yttrium-Iron-Garnet (YIG) rectangular film for data processing in the gigahertz range. Showcasing its versatility, the device addresses inverse problems using two algorithms to create RF notch filters and demultiplexers. Additionally, the device holds promise for binary, reservoir, and neuromorphic computing applications.
Submitted 3 July, 2024; v1 submitted 26 March, 2024;
originally announced March 2024.
-
Probabilistic Analysis of Multiparameter Persistence Decompositions
Authors:
Ángel Javier Alonso,
Michael Kerber,
Primoz Skraba
Abstract:
Multiparameter persistence modules can be uniquely decomposed into indecomposable summands. Among these indecomposables, intervals stand out for their simplicity, making them preferable for their ease of interpretation in practical applications and their computational efficiency. Empirical observations indicate that modules that decompose into only intervals are rare. To support this observation, we show that for numerous common multiparameter constructions, such as density- or degree-Rips bifiltrations, and across a general category of point samples, the probability of the homology-induced persistence module decomposing into intervals goes to zero as the sample size goes to infinity.
Submitted 18 March, 2024;
originally announced March 2024.
-
Delaunay Bifiltrations of Functions on Point Clouds
Authors:
Ángel Javier Alonso,
Michael Kerber,
Tung Lam,
Michael Lesnick
Abstract:
The Delaunay filtration $\mathcal{D}_{\bullet}(X)$ of a point cloud $X\subset \mathbb{R}^d$ is a central tool of computational topology. Its use is justified by the topological equivalence of $\mathcal{D}_{\bullet}(X)$ and the offset (i.e., union-of-balls) filtration of $X$. Given a function $γ: X \to \mathbb{R}$, we introduce a Delaunay bifiltration $\mathcal{DC}_{\bullet}(γ)$ that satisfies an analogous topological equivalence, ensuring that $\mathcal{DC}_{\bullet}(γ)$ topologically encodes the offset filtrations of all sublevel sets of $γ$, as well as the topological relations between them. $\mathcal{DC}_{\bullet}(γ)$ is of size $O(|X|^{\lceil\frac{d+1}{2}\rceil})$, which for $d$ odd matches the worst-case size of $\mathcal{D}_{\bullet}(X)$. Adapting the Bowyer-Watson algorithm for computing Delaunay triangulations, we give a simple, practical algorithm to compute $\mathcal{DC}_{\bullet}(γ)$ in time $O(|X|^{\lceil \frac{d}{2}\rceil +1})$. Our implementation, based on CGAL, computes $\mathcal{DC}_{\bullet}(γ)$ with modest overhead compared to computing $\mathcal{D}_{\bullet}(X)$, and handles tens of thousands of points in $\mathbb{R}^3$ within seconds.
Submitted 24 October, 2023;
originally announced October 2023.
-
Topological Data Analysis in smart manufacturing
Authors:
Martin Uray,
Barbara Giunti,
Michael Kerber,
Stefan Huber
Abstract:
Topological Data Analysis (TDA) is a discipline that applies algebraic topology techniques to analyze complex, multi-dimensional data. Although it is a relatively new field, TDA has been widely and successfully applied across various domains, such as medicine, materials science, and biology. This survey provides an overview of the state of the art of TDA within a dynamic and promising application area: industrial manufacturing and production, particularly within the Industry 4.0 context. We have conducted a rigorous and reproducible literature search focusing on TDA applications in industrial production and manufacturing settings. The identified works are categorized based on their application areas within the manufacturing process and the types of input data. We highlight the principal advantages of TDA tools in this context, address the challenges encountered and the future potential of the field. Furthermore, we identify TDA methods that are currently underexploited in specific industrial areas and discuss how their application could be beneficial, with the aim of stimulating further research in this field. This work seeks to bridge the theoretical advancements in TDA with the practical needs of industrial production. Our goal is to serve as a guide for practitioners and researchers applying TDA in industrial production and manufacturing systems. We advocate for the untapped potential of TDA in this domain and encourage continued exploration and research.
Submitted 17 May, 2024; v1 submitted 13 October, 2023;
originally announced October 2023.
-
The Localized Union-of-Balls Bifiltration
Authors:
Michael Kerber,
Matthias Söls
Abstract:
We propose an extension of the classical union-of-balls filtration of persistent homology: fixing a point $q$, we focus our attention on a ball centered at $q$ whose radius is controlled by a second scale parameter. We discuss an absolute variant, where the union is just restricted to the $q$-ball, and a relative variant where the homology of the $q$-ball relative to its boundary is considered. Interestingly, these natural constructions lead to bifiltered simplicial complexes which are not $k$-critical for any finite $k$. Nevertheless, we demonstrate that these bifiltrations can be computed exactly and efficiently, and we provide a prototypical implementation using the CGAL library. We also argue that some of the recent algorithmic advances for $2$-parameter persistence (which usually assume $k$-criticality for some finite $k$) carry over to the $\infty$-critical case.
Submitted 13 March, 2023;
originally announced March 2023.
-
Sparse Higher Order Čech Filtrations
Authors:
Mickaël Buchet,
Bianca B. Dornelas,
Michael Kerber
Abstract:
For a finite set of balls of radius $r$, the $k$-fold cover is the space covered by at least $k$ balls. Fixing the ball centers and varying the radius, we obtain a nested sequence of spaces that is called the $k$-fold filtration of the centers. For $k=1$, the construction is the union-of-balls filtration that is popular in topological data analysis. For larger $k$, it yields a cleaner shape reconstruction in the presence of outliers. We contribute a sparsification algorithm to approximate the topology of the $k$-fold filtration. Our method is a combination and adaptation of several techniques from the well-studied case $k=1$, resulting in a sparsification of linear size that can be computed in expected near-linear time with respect to the number of input points. Our method also extends to the multicover bifiltration, composed of the $k$-fold filtrations for several values of $k$, with the same size and complexity bounds.
Submitted 17 May, 2023; v1 submitted 12 March, 2023;
originally announced March 2023.
-
Decomposition of zero-dimensional persistence modules via rooted subsets
Authors:
Ángel Javier Alonso,
Michael Kerber
Abstract:
We study the decomposition of zero-dimensional persistence modules, viewed as functors valued in the category of vector spaces factorizing through sets. Instead of working directly at the level of vector spaces, we take a step back and first study the decomposition problem at the level of sets.
This approach allows us to define the combinatorial notion of rooted subsets. In the case of a filtered metric space $M$, rooted subsets relate the clustering behavior of the points of $M$ with the decomposition of the associated persistence module. In particular, we can identify intervals in such a decomposition quickly. In addition, rooted subsets can be understood as a generalization of the elder rule, and are also related to the notion of constant conqueror of Cai, Kim, Mémoli and Wang. As an application, we give a lower bound on the number of intervals that we can expect in the decomposition of zero-dimensional persistence modules of a density-Rips filtration in Euclidean space: in the limit, and under very general circumstances, we can expect that at least 25% of the indecomposable summands are interval modules.
Submitted 10 March, 2023;
originally announced March 2023.
-
Keeping it sparse: Computing Persistent Homology revisited
Authors:
Ulrich Bauer,
Talha Bin Masood,
Barbara Giunti,
Guillaume Houry,
Michael Kerber,
Abhishek Rathod
Abstract:
In this work, we study several variants of matrix reduction via Gaussian elimination that try to keep the reduced matrix sparse. The motivation comes from the growing field of topological data analysis where matrix reduction is the major subroutine to compute barcodes, the main invariant therein. We propose two novel variants of the standard algorithm, called swap and retrospective reductions. We test them on a large collection of data against other known variants to compare their efficiency, and we find that sometimes they provide a considerable speed-up. We also present novel output-sensitive bounds for the retrospective variant which better explain the discrepancy between the cubic worst-case complexity bound and the almost linear practical behavior of matrix reduction. Finally, we provide several constructions on which one of the variants performs strictly better than the others.
Submitted 13 June, 2024; v1 submitted 16 November, 2022;
originally announced November 2022.
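For readers unfamiliar with the matrix reduction that the abstract above takes as its baseline, the following is a minimal sketch of the textbook left-to-right column reduction over Z/2. It is not the swap or retrospective variant proposed in the paper; the function name `reduce_boundary_matrix` and the sparse representation (each column as a set of row indices) are assumptions chosen for illustration.

```python
def reduce_boundary_matrix(columns):
    """Textbook left-to-right column reduction over Z/2 (illustrative sketch).

    columns: list of sets; columns[j] holds the row indices of the non-zero
    entries of column j of a filtered boundary matrix.
    Returns the reduced columns and the list of (pivot row, column) pairs.
    """
    reduced = [set(c) for c in columns]   # work on a copy
    pivot_owner = {}                      # pivot row index -> column that owns it
    pairs = []
    for j, col in enumerate(reduced):
        # Add earlier columns (mod 2) while the lowest non-zero entry collides.
        while col and max(col) in pivot_owner:
            col ^= reduced[pivot_owner[max(col)]]
        if col:
            pivot_owner[max(col)] = j
            pairs.append((max(col), j))
    return reduced, pairs


# Filtered triangle: vertices 0, 1, 2; edges 3 = {0,1}, 4 = {1,2}, 5 = {0,2};
# the 2-simplex 6 has boundary {3, 4, 5}.
triangle = [set(), set(), set(), {0, 1}, {1, 2}, {0, 2}, {3, 4, 5}]
reduced, pairs = reduce_boundary_matrix(triangle)
```

In this small example the column of edge 5 reduces to zero (it closes a cycle), and that cycle is later paired with the triangle in column 6. The total number of non-zero entries left in `reduced` is the fill-in that sparsity-oriented variants try to keep small.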
-
Filtration-Domination in Bifiltered Graphs
Authors:
Ángel Javier Alonso,
Michael Kerber,
Siddharth Pritam
Abstract:
Bifiltered graphs are a versatile tool for modelling relations between data points across multiple grades of a two-dimensional scale. They are especially popular in topological data analysis, where the homological properties of the induced clique complexes are studied. To reduce the large size of these clique complexes, we identify filtration-dominated edges of the graph, whose removal preserves the relevant topological properties. We give two algorithms to detect filtration-dominated edges in a bifiltered graph and analyze their complexity. These two algorithms work directly on the bifiltered graph, without first extracting the clique complexes, which are generally much bigger. We present an extensive experimental evaluation which shows that in most cases, more than 90% of the edges can be removed. In turn, we demonstrate that this often leads to a substantial speedup, and a reduction in the memory usage, of the computational pipeline of multiparameter topological data analysis.
Submitted 10 November, 2022;
originally announced November 2022.
-
A Unified View on the Functorial Nerve Theorem and its Variations
Authors:
Ulrich Bauer,
Michael Kerber,
Fabian Roll,
Alexander Rolle
Abstract:
The nerve theorem is a basic result of algebraic topology that plays a central role in computational and applied aspects of the subject. In topological data analysis, one often needs a nerve theorem that is functorial in an appropriate sense, and furthermore one often needs a nerve theorem for closed covers as well as for open covers. While the techniques for proving such functorial nerve theorems have long been available, there is unfortunately no general-purpose, explicit treatment of this topic in the literature. We address this by proving a variety of functorial nerve theorems. First, we show how one can use elementary techniques to prove nerve theorems for covers by closed convex sets in Euclidean space, and for covers of a simplicial complex by subcomplexes. Then, we establish a more general, "unified" nerve theorem that subsumes many of the variants, using standard techniques from abstract homotopy theory.
Submitted 1 June, 2023; v1 submitted 7 March, 2022;
originally announced March 2022.
-
Asymptotic Improvements on the Exact Matching Distance for 2-parameter Persistence
Authors:
Håvard Bakke Bjerkevik,
Michael Kerber
Abstract:
In the field of topological data analysis, persistence modules are used to express geometrical features of data sets. The matching distance $d_\mathcal{M}$ measures the difference between $2$-parameter persistence modules by taking the maximum bottleneck distance between $1$-parameter slices of the modules. The previous best algorithm to compute $d_\mathcal{M}$ exactly runs in $O(n^{8+ω})$ time using $O(n^4)$ space, where $n$ is the number of generators and relations of the modules and $ω$ is the matrix multiplication constant. We improve significantly on this by describing an algorithm with expected running time $O(n^5 \log^3 n)$ and using $O(n^2)$ space. We first solve the decision problem $d_\mathcal{M}\leq λ$ for a constant $λ$ in $O(n^5\log n)$ time by traversing a line arrangement in the dual plane, where each point represents a slice. Then we lift the line arrangement to a plane arrangement in $\mathbb{R}^3$ whose vertices represent possible values for $d_\mathcal{M}$, and use a randomized incremental method to search through the vertices and find $d_\mathcal{M}$. The expected running time of this algorithm is $O((n^4+T(n))\log^2 n)$, where $T(n)$ is an upper bound for the complexity of deciding if $d_\mathcal{M}\leq λ$. Moreover, we show how to compute the matching distance using only linear space, at the price of a much worse time complexity.
Submitted 7 December, 2023; v1 submitted 19 November, 2021;
originally announced November 2021.
-
Expected Complexity of Persistent Homology Computation via Matrix Reduction
Authors:
Barbara Giunti,
Guillaume Houry,
Michael Kerber,
Matthias Söls
Abstract:
We study the algorithmic complexity of computing persistent homology of a randomly generated filtration. Specifically, we prove upper bounds for the average fill-in (number of non-zero entries) of the boundary matrix on Čech, Vietoris--Rips and Erdős--Rényi filtrations after matrix reduction. Our bounds show that the reduced matrix is expected to be significantly sparser than the general worst-case bound predicts. Our method is based on previous results on the expected Betti numbers of the corresponding complexes. We establish a link between these results and the fill-in of the boundary matrix. In the $1$-dimensional case, our bound for Čech and Vietoris--Rips complexes is asymptotically tight up to a logarithmic factor. We also provide an Erdős--Rényi filtration realising the worst case.
Submitted 8 March, 2024; v1 submitted 3 November, 2021;
originally announced November 2021.
-
Compression for 2-Parameter Persistent Homology
Authors:
Ulderico Fugacci,
Michael Kerber,
Alexander Rolle
Abstract:
Compression aims to reduce the size of an input, while maintaining its relevant properties. For multi-parameter persistent homology, compression is a necessary step in any computational pipeline, since standard constructions lead to large inputs, and computational tasks in this area tend to be expensive. We propose two compression methods for chain complexes of free 2-parameter persistence modules. The first method extends the multi-chunk algorithm for one-parameter persistent homology, returning the smallest chain complex among all the ones quasi-isomorphic to the input. The second method produces minimal presentations of the homology of the input; it is based on an algorithm of Lesnick and Wright, but incorporates several improvements that lead to substantial performance gains. The two methods are complementary, and can be combined to compute minimal presentations for complexes with millions of generators in a few seconds. The methods have been implemented, and the software is publicly available. We report on experimental evaluations, which demonstrate substantial improvements in performance compared to previously available compression strategies.
Submitted 16 August, 2022; v1 submitted 22 July, 2021;
originally announced July 2021.
-
Improved Approximate Rips Filtrations with Shifted Integer Lattices and Cubical Complexes
Authors:
Aruni Choudhary,
Michael Kerber,
Sharath Raghvendra
Abstract:
Rips complexes are important structures for analyzing topological features of metric spaces. Unfortunately, generating these complexes is expensive because of a combinatorial explosion in the complex size. For $n$ points in $\mathbb{R}^d$, we present a scheme to construct a $2$-approximation of the filtration of the Rips complex in the $L_\infty$-norm, which extends to a $2d^{0.25}$-approximation in the Euclidean case. The $k$-skeleton of the resulting approximation has a total size of $n2^{O(d\log k +d)}$. The scheme is based on the integer lattice and simplicial complexes based on the barycentric subdivision of the $d$-cube.
We extend our result to use cubical complexes in place of simplicial complexes by introducing cubical maps between complexes. We get the same approximation guarantee as the simplicial case, while reducing the total size of the approximation to only $n2^{O(d)}$ (cubical) cells.
There are two novel techniques that we use in this paper. The first is the use of acyclic carriers for proving our approximation result. In our application, these are maps which relate the Rips complex and the approximation in a relatively simple manner and greatly reduce the complexity of showing the approximation guarantee. The second technique is what we refer to as scale balancing, which is a simple trick to improve the approximation ratio under certain conditions.
Submitted 11 May, 2021;
originally announced May 2021.
-
Structure-dynamics relationships in cryogenically deformed bulk metallic glass
Authors:
Florian Spieckermann,
Daniel Şopu,
Viktor Soprunyuk,
Michael B. Kerber,
Jozef Bednarčík,
Alexander Schökel,
Amir Rezvan,
Sergey Ketov,
Baran Sarac,
Erhard Schafler,
Jürgen Eckert
Abstract:
The atomistic mechanisms occurring during the processes of aging and rejuvenation in glassy materials involve very small structural rearrangements that are extremely difficult to capture experimentally. Here we use in-situ X-ray diffraction to investigate the structural rearrangements during annealing from 77 K up to the crystallization temperature in Cu44Zr44Al8Hf2Co2 bulk metallic glass rejuvenated by high pressure torsion performed at cryogenic temperatures and at room temperature. Using a measure of the configurational entropy calculated from the X-ray pair correlation function, the structural footprint of the deformation-induced rejuvenation in bulk metallic glass is revealed. With synchrotron radiation, temperature and time resolutions comparable to calorimetric experiments are possible. This opens hitherto unavailable experimental possibilities, making it possible to unambiguously correlate changes in atomic configuration and structure with calorimetrically observed signals and to attribute those to changes of the dynamic and vibrational relaxations (α-, β- and γ-transition) in glassy materials. The results suggest that the structural footprint of the β-transition is related to entropic relaxation with characteristics of a first-order transition. Dynamic mechanical analysis data show that in the range of the β-transition, non-reversible structural rearrangements are preferentially activated. The low-temperature γ-transition mostly triggers reversible deformations and shows a change of slope in the entropic footprint, suggesting second-order characteristics.
Submitted 11 January, 2022; v1 submitted 19 April, 2021;
originally announced April 2021.
-
Computing the Multicover Bifiltration
Authors:
René Corbet,
Michael Kerber,
Michael Lesnick,
Georg Osang
Abstract:
Given a finite set $A\subset\mathbb{R}^d$, let Cov$_{r,k}$ denote the set of all points within distance $r$ to at least $k$ points of $A$. Allowing $r$ and $k$ to vary, we obtain a 2-parameter family of spaces that grow larger when $r$ increases or $k$ decreases, called the multicover bifiltration. Motivated by the problem of computing the homology of this bifiltration, we introduce two closely related combinatorial bifiltrations, one polyhedral and the other simplicial, which are both topologically equivalent to the multicover bifiltration and far smaller than a Čech-based model considered in prior work of Sheehy. Our polyhedral construction is a bifiltration of the rhomboid tiling of Edelsbrunner and Osang, and can be efficiently computed using a variant of an algorithm given by these authors. Using an implementation for dimensions 2 and 3, we provide experimental results. Our simplicial construction is useful for understanding the polyhedral construction and proving its correctness.
Submitted 13 April, 2022; v1 submitted 13 March, 2021;
originally announced March 2021.
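The definition of Cov$_{r,k}$ in the abstract above can be made concrete with a short membership test. This is only a sketch of the definition, not of the rhomboid-tiling construction from the paper; the function names `in_multicover` and `entry_radius` are hypothetical, and "within distance $r$" is assumed to mean Euclidean distance at most $r$.

```python
import math


def in_multicover(x, points, r, k):
    """Return True if x lies in Cov_{r,k}: at least k points of A within distance r.

    Assumes Euclidean distance and closed balls (distance <= r).
    """
    close = sum(1 for a in points if math.dist(x, a) <= r)
    return close >= k


def entry_radius(x, points, k):
    """Smallest r with x in Cov_{r,k}: the distance from x to its k-th nearest point."""
    dists = sorted(math.dist(x, a) for a in points)
    return dists[k - 1]


A = [(0.0, 0.0), (2.0, 0.0), (0.0, 2.0)]
x = (0.0, 0.0)
```

With these inputs, `x` belongs to Cov$_{1,1}$ but not to Cov$_{1,2}$, and it enters the 2-fold cover at radius 2. This illustrates the monotonicity stated in the abstract: the sets grow when $r$ increases or $k$ decreases.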
-
Fast Minimal Presentations of Bi-graded Persistence Modules
Authors:
Michael Kerber,
Alexander Rolle
Abstract:
Multi-parameter persistent homology is a recent branch of topological data analysis. In this area, data sets are investigated through the lens of homology with respect to two or more scale parameters. The high computational cost of many algorithms calls for a preprocessing step to reduce the input size. In general, a minimal presentation is the smallest possible representation of a persistence module. Lesnick and Wright recently proposed an algorithm (the LW-algorithm) for computing minimal presentations based on matrix reduction. In this work, we propose, implement and benchmark several improvements over the LW-algorithm. Most notably, we propose the use of priority queues to avoid extensive scanning of the matrix columns, which constitutes the computational bottleneck in the LW-algorithm, and we combine their algorithm with ideas from the multi-parameter chunk algorithm by Fugacci and Kerber. Our extensive experiments show that our algorithm outperforms the LW-algorithm and computes the minimal presentation for data sets with millions of simplices within a few seconds. Our software is publicly available.
Submitted 29 October, 2020;
originally announced October 2020.
-
Applying the Isabelle Insider Framework to Airplane Security
Authors:
Florian Kammüller,
Manfred Kerber
Abstract:
Avionics is one of the fields in which verification methods have been pioneered and have brought a new level of reliability to systems used in safety-critical environments. Tragedies like the 2015 insider attack on a German airplane, in which all 150 people on board died, show that safety and security crucially depend not only on the proper functioning of systems but also on the way humans interact with the systems. Policies are a way to describe how humans should behave in their interactions with technical systems; formal reasoning about such policies requires integrating the human factor into the verification process. In this paper, we report on our work on using logical modelling and analysis of infrastructure models and policies with actors to scrutinize security policies in the presence of insiders. We model insider attacks on airplanes in the Isabelle Insider framework. This application motivates the use of an extension of the framework with Kripke structures and the temporal logic CTL to enable reasoning on dynamic system states. Furthermore, we illustrate that Isabelle modelling and invariant reasoning reveal subtle security assumptions. We summarize by providing a methodology for the development of policies that satisfy stated properties.
Submitted 26 March, 2020;
originally announced March 2020.
-
Efficient Approximation of the Matching Distance for 2-parameter persistence
Authors:
Michael Kerber,
Arnur Nigmetov
Abstract:
The matching distance is a computationally tractable topological measure to compare multi-filtered simplicial complexes. We design efficient algorithms for approximating the matching distance of two bi-filtered complexes to any desired precision $ε>0$. Our approach is based on a quad-tree refinement strategy introduced by Biasotti et al., but we recast their approach entirely in geometric terms. This point of view leads to several novel observations resulting in a practically faster algorithm. We demonstrate this speed-up by experimental comparison and provide our code in a public repository which provides the first efficient publicly available implementation of the matching distance.
Submitted 31 March, 2020; v1 submitted 12 December, 2019;
originally announced December 2019.
-
Topology-Preserving Terrain Simplification
Authors:
Ulderico Fugacci,
Michael Kerber,
Hugo Manet
Abstract:
We give necessary and sufficient criteria for elementary operations in a two-dimensional terrain to preserve the persistent homology induced by the height function. These operations are edge flips and removals of interior vertices, re-triangulating the link of the removed vertex. This problem is motivated by topological terrain simplification, which means removing as many critical vertices of a terrain as possible while maintaining geometric closeness to the original surface. Existing methods manage to reduce the maximal possible number of critical vertices, but thereby increase the number of regular vertices. Our method can be used to post-process a simplified terrain, drastically reducing its size and preserving its favorable properties.
Submitted 11 September, 2020; v1 submitted 6 December, 2019;
originally announced December 2019.
-
Metric Spaces with Expensive Distances
Authors:
Michael Kerber,
Arnur Nigmetov
Abstract:
In algorithms for finite metric spaces, it is common to assume that the distance between two points can be computed in constant time, and complexity bounds are expressed only in terms of the number of points of the metric space. We introduce a different model where we assume that the computation of a single distance is an expensive operation and consequently, the goal is to minimize the number of such distance queries. This model is motivated by metric spaces that appear in the context of topological data analysis.
We consider two standard operations on metric spaces, namely the construction of a $1+\varepsilon$-spanner and the computation of an approximate nearest neighbor for a given query point. In both cases, we partially explore the metric space through distance queries and infer lower and upper bounds for yet unexplored distances through triangle inequality. For spanners, we evaluate several exploration strategies through extensive experimental evaluation. For approximate nearest neighbors, we prove that our strategy returns an approximate nearest neighbor after a logarithmic number of distance queries.
Submitted 25 January, 2019;
originally announced January 2019.
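The bounding step mentioned in the abstract above (inferring lower and upper bounds for unexplored distances from the triangle inequality) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `triangle_bounds`, the dictionary input format, and the cubic-time all-pairs strategy are assumptions chosen for brevity.

```python
def triangle_bounds(n, known):
    """Bound all pairwise distances among n points from a few queried ones.

    `known` maps index pairs (i, j) to already-queried distances.
    Returns (lower, upper), two n-by-n matrices of bounds.
    Hypothetical helper for illustration only.
    """
    inf = float("inf")
    upper = [[0.0 if i == j else inf for j in range(n)] for i in range(n)]
    lower = [[0.0] * n for _ in range(n)]
    for (i, j), d in known.items():
        upper[i][j] = upper[j][i] = d
        lower[i][j] = lower[j][i] = d

    # Upper bounds: length of the shortest path through queried distances
    # (Floyd-Warshall), valid because d(i, j) <= d(i, k) + d(k, j).
    for k in range(n):
        for i in range(n):
            for j in range(n):
                via_k = upper[i][k] + upper[k][j]
                if via_k < upper[i][j]:
                    upper[i][j] = via_k

    # Lower bounds: for a queried pair (a, b),
    # d(a, b) <= d(a, i) + d(i, j) + d(j, b), hence
    # d(i, j) >= d(a, b) - upper(a, i) - upper(b, j), and symmetrically.
    for (a, b), d in known.items():
        for i in range(n):
            for j in range(n):
                cand = max(d - upper[a][i] - upper[b][j],
                           d - upper[a][j] - upper[b][i])
                if cand > lower[i][j]:
                    lower[i][j] = cand
    return lower, upper
```

For three points on a line at positions 0, 3 and 10, querying only the two distances from the first point already confines the unqueried distance between the other two to the interval [7, 13]. An exploration strategy would decide which distance to query next based on such intervals.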
-
Exact computation of the matching distance on 2-parameter persistence modules
Authors:
Michael Kerber,
Michael Lesnick,
Steve Oudot
Abstract:
The matching distance is a pseudometric on multi-parameter persistence modules, defined in terms of the weighted bottleneck distance on the restriction of the modules to affine lines. It is known that this distance is stable in a reasonable sense, and can be efficiently approximated, which makes it a promising tool for practical applications. In this work, we show that in the 2-parameter setting, the matching distance can be computed exactly in polynomial time. Our approach subdivides the space of affine lines into regions, via a line arrangement. In each region, the matching distance restricts to a simple analytic function, whose maximum is easily computed. As a byproduct, our analysis establishes that the matching distance is a rational number, if the bigrades of the input modules are rational.
Submitted 28 May, 2019; v1 submitted 21 December, 2018;
originally announced December 2018.
-
Chunk Reduction for Multi-Parameter Persistent Homology
Authors:
Ulderico Fugacci,
Michael Kerber
Abstract:
The extension of persistent homology to multi-parameter setups is an algorithmic challenge. Since most computation tasks scale badly with the size of the input complex, an important pre-processing step consists of simplifying the input while maintaining the homological information. We present an algorithm that drastically reduces the size of an input. Our approach is an extension of the chunk algorithm for persistent homology (Bauer et al., Topological Methods in Data Analysis and Visualization III, 2014). We show that our construction produces the smallest multi-filtered chain complex among all the complexes quasi-isomorphic to the input, improving on the guarantees of previous work in the context of discrete Morse theory. Our algorithm also offers an immediate parallelization scheme in shared memory. Already its sequential version compares favorably with existing simplification schemes, as we show by experimental evaluation.
Submitted 18 March, 2019; v1 submitted 20 December, 2018;
originally announced December 2018.
-
Topology and Geometry of Gaussian random fields I: on Betti Numbers, Euler characteristic and Minkowski functionals
Authors:
Pratyush Pranav,
Rien van de Weygaert,
Gert Vegter,
Bernard J. T. Jones,
Robert J. Adler,
Job Feldbrugge,
Changbom Park,
Thomas Buchert,
Michael Kerber
Abstract:
This study presents a numerical analysis of the topology of a set of cosmologically interesting three-dimensional Gaussian random fields in terms of their Betti numbers $β_0$, $β_1$ and $β_2$. We show that Betti numbers entail a considerably richer characterization of the topology of the primordial density field. Of particular interest is that Betti numbers specify which topological features - islands, cavities or tunnels - define its spatial structure.
A principal characteristic of Gaussian fields is that the three Betti numbers dominate the topology at different density ranges. At extreme density levels, the topology is dominated by a single class of features. At low levels this is a \emph{Swiss-cheese-like} topology, dominated by isolated cavities; at high levels, a predominantly \emph{Meatball-like} topology of isolated objects. At moderate density levels, two Betti numbers define a more \emph{Sponge-like} topology. At mean density, the topology even needs three Betti numbers, quantifying a field consisting of several disconnected complexes, not of one connected and percolating overdensity.
A {\it second} important aspect of Betti number statistics is that they are sensitive to the power spectrum. It reveals a monotonic trend in which at a moderate density range a lower spectral index corresponds to a considerably higher (relative) population of cavities and islands.
We also assess the level of complementary information that Betti numbers represent, in addition to conventional measures such as Minkowski functionals. To this end, we include an extensive description of the Gaussian Kinematic Formula (GKF), which represents a major theoretical underpinning for this discussion.
Submitted 27 February, 2019; v1 submitted 18 December, 2018;
originally announced December 2018.
-
Improved Topological Approximations by Digitization
Authors:
Aruni Choudhary,
Michael Kerber,
Sharath Raghvendra
Abstract:
Čech complexes are useful simplicial complexes for computing and analyzing topological features of data that lies in Euclidean space. Unfortunately, computing these complexes becomes prohibitively expensive for large-sized data sets even for medium-to-low dimensional data. We present an approximation scheme for $(1+ε)$-approximating the topological information of the Čech complexes for $n$ points in $\mathbb{R}^d$, for $ε\in(0,1]$. Our approximation has a total size of $n\left(\frac{1}{ε}\right)^{O(d)}$ for constant dimension $d$, improving all the currently available $(1+ε)$-approximation schemes of simplicial filtrations in Euclidean space. Perhaps counter-intuitively, we arrive at our result by adding $n\left(\frac{1}{ε}\right)^{O(d)}$ additional sample points to the input. We achieve a bound that is independent of the spread of the point set by pre-identifying the scales at which the Čech complexes change and sampling accordingly.
Submitted 12 December, 2018;
originally announced December 2018.
-
Computing the interleaving distance is NP-hard
Authors:
Håvard Bakke Bjerkevik,
Magnus Bakke Botnan,
Michael Kerber
Abstract:
We show that computing the interleaving distance between two multi-graded persistence modules is NP-hard. More precisely, we show that deciding whether two modules are $1$-interleaved is NP-complete, already for bigraded, interval decomposable modules. Our proof is based on previous work showing that a constrained matrix invertibility problem can be reduced to the interleaving distance computation of a special type of persistence modules. We show that this matrix invertibility problem is NP-complete. We also give a slight improvement of the above reduction, showing that also the approximation of the interleaving distance is NP-hard for any approximation factor smaller than $3$. Additionally, we obtain corresponding hardness results for the case that the modules are indecomposable, and in the setting of one-sided stability. Furthermore, we show that checking for injections (resp. surjections) between persistence modules is NP-hard. In conjunction with earlier results from computational algebra this gives a complete characterization of the computational complexity of one-sided stability. Lastly, we show that it is in general NP-hard to approximate distances induced by noise systems within a factor of 2.
Submitted 9 October, 2019; v1 submitted 22 November, 2018;
originally announced November 2018.
-
A Kernel for Multi-Parameter Persistent Homology
Authors:
René Corbet,
Ulderico Fugacci,
Michael Kerber,
Claudia Landi,
Bei Wang
Abstract:
Topological data analysis and its main method, persistent homology, provide a toolkit for computing topological information of high-dimensional and noisy data sets. Kernels for one-parameter persistent homology have been established to connect persistent homology with machine learning techniques. We contribute a kernel construction for multi-parameter persistence by integrating a one-parameter kernel weighted along straight lines. We prove that our kernel is stable and efficiently computable, which establishes a theoretical connection between topological data analysis and machine learning for multivariate data analysis.
Submitted 5 June, 2019; v1 submitted 26 September, 2018;
originally announced September 2018.
-
The Representation Theorem of Persistent Homology Revisited and Generalized
Authors:
René Corbet,
Michael Kerber
Abstract:
The Representation Theorem by Zomorodian and Carlsson has been the starting point of the study of persistent homology under the lens of algebraic representation theory. In this work, we give a more accurate statement of the original theorem and provide a complete and self-contained proof. Furthermore, we generalize the statement from the case of linear sequences of $R$-modules to $R$-modules indexed over more general monoids. This generalization subsumes the Representation Theorem of multidimensional persistence as a special case.
Submitted 7 June, 2018; v1 submitted 27 July, 2017;
originally announced July 2017.
-
Improved Approximate Rips Filtrations with Shifted Integer Lattices
Authors:
Aruni Choudhary,
Michael Kerber,
Sharath Raghvendra
Abstract:
Rips complexes are important structures for analyzing topological features of metric spaces. Unfortunately, generating these complexes constitutes an expensive task because of a combinatorial explosion in the complex size. For $n$ points in $\mathbb{R}^d$, we present a scheme to construct a $3\sqrt{2}$-approximation of the multi-scale filtration of the $L_\infty$-Rips complex, which extends to a $O(d^{0.25})$-approximation of the Rips filtration for the Euclidean case. The $k$-skeleton of the resulting approximation has a total size of $n2^{O(d\log k)}$. The scheme is based on the integer lattice and on the barycentric subdivision of the $d$-cube.
Submitted 22 June, 2017;
originally announced June 2017.
-
Reading the orbital angular momentum of light using plasmonic nanoantennas
Authors:
R. M. Kerber,
J. M. Fitzgerald,
D. E. Reiter,
S. S. Oh,
O. Hess
Abstract:
Orbital angular momentum of light has recently been recognized as a new degree of freedom to encode information in quantum communication using light pulses. Methods to extract this information include reversing the process by which such twisted light was created in the first place or interference with other beams. Here, we propose an alternative new way to directly read out the extra information encoded in twisted light using plasmonic nanoantennas by converting the information about the orbital angular momentum of light into spectral information using bright and dark modes. Exemplarily considering rotation-symmetrical nanorod nanoantennas we show that their scattering cross-section is sensitive to the value of the orbital angular momentum combined with the polarisation of an incident twisted light beam. Explaining the twist-dependence of the excited modes with a new analytical model our results pave the way to twisted light nanoplasmonics, which is of central importance for future on-chip communication using orbital angular momentum of light.
Submitted 18 April, 2017;
originally announced April 2017.
-
Barcodes of Towers and a Streaming Algorithm for Persistent Homology
Authors:
Michael Kerber,
Hannah Schreiber
Abstract:
A tower is a sequence of simplicial complexes connected by simplicial maps. We show how to compute a filtration, a sequence of nested simplicial complexes, with the same persistent barcode as the tower. Our approach is based on the coning strategy by Dey et al. (SoCG 2014). We show that a variant of this approach yields a filtration that is asymptotically only marginally larger than the tower and can be efficiently computed by a streaming algorithm, both in theory and in practice. Furthermore, we show that our approach can be combined with a streaming algorithm to compute the barcode of the tower via matrix reduction. The space complexity of the algorithm does not depend on the length of the tower, but on the maximal size of any subcomplex within the tower.
Submitted 12 October, 2017; v1 submitted 9 January, 2017;
originally announced January 2017.
-
Constrained Triangulations, Volumes of Polytopes, and Unit Equations
Authors:
Michael Kerber,
Robert Tichy,
Mario Weitzer
Abstract:
Given a polytope $\mathcal{P}$ in $\mathbb{R}^d$ and a subset $U$ of its vertices, is there a triangulation of $\mathcal{P}$ using $d$-simplices that all contain $U$? We answer this question by proving an equivalent and easy-to-check combinatorial criterion for the facets of $\mathcal{P}$. Our proof relates triangulations of $\mathcal{P}$ to triangulations of its "shadow", a projection to a lower-dimensional space determined by $U$. In particular, we obtain a formula relating the volume of $\mathcal{P}$ with the volume of its shadow. This leads to an exact formula for the volume of a polytope arising in the theory of unit equations.
Submitted 8 March, 2018; v1 submitted 16 September, 2016;
originally announced September 2016.
-
The Topology of the Cosmic Web in Terms of Persistent Betti Numbers
Authors:
Pratyush Pranav,
Herbert Edelsbrunner,
Rien van de Weygaert,
Gert Vegter,
Michael Kerber,
Bernard J. T. Jones,
Mathijs Wintraecken
Abstract:
We introduce a multiscale topological description of the Megaparsec weblike cosmic matter distribution. Betti numbers and topological persistence offer a powerful means of describing the rich connectivity structure of the cosmic web and of its multiscale arrangement of matter and galaxies. Emanating from algebraic topology and Morse theory, Betti numbers and persistence diagrams represent an extension and deepening of the cosmologically familiar topological genus measure, and the related geometric Minkowski functionals. In addition to a description of the mathematical background, this study presents the computational procedure for computing Betti numbers and persistence diagrams for density field filtrations. The field may be computed starting from a discrete spatial distribution of galaxies or simulation particles. The main emphasis of this study concerns an extensive and systematic exploration of the imprint of different weblike morphologies and different levels of multiscale clustering in the corresponding computed Betti numbers and persistence diagrams. To this end, we use Voronoi clustering models as templates for a rich variety of weblike configurations, and the fractal-like Soneira-Peebles models exemplify a range of multiscale configurations. We have identified the clear imprint of cluster nodes, filaments, walls, and voids in persistence diagrams, along with that of the nested hierarchy of structures in multiscale point distributions. We conclude by outlining the potential of persistent topology for understanding the connectivity structure of the cosmic web, in large simulations of cosmic structure formation and in the challenging context of the observed galaxy distribution in large galaxy surveys.
Submitted 31 January, 2017; v1 submitted 16 August, 2016;
originally announced August 2016.
-
Geometry Helps to Compare Persistence Diagrams
Authors:
Michael Kerber,
Dmitriy Morozov,
Arnur Nigmetov
Abstract:
Exploiting geometric structure to improve the asymptotic complexity of discrete assignment problems is a well-studied subject. In contrast, the practical advantages of using geometry for such problems have not been explored. We implement geometric variants of the Hopcroft--Karp algorithm for bottleneck matching (based on previous work by Efrat et al.) and of the auction algorithm by Bertsekas for Wasserstein distance computation. Both implementations use k-d trees to replace a linear scan with a geometric proximity query. Our interest in this problem stems from the desire to compute distances between persistence diagrams, a problem that comes up frequently in topological data analysis. We show that our geometric matching algorithms lead to a substantial performance gain, both in running time and in memory consumption, over their purely combinatorial counterparts. Moreover, our implementation significantly outperforms the only other implementation available for comparing persistence diagrams.
Submitted 10 June, 2016;
originally announced June 2016.
-
Homotopy equivalence between Voronoi medusa and Delaunay medusa
Authors:
Siddharth Pritam,
Michael Kerber
Abstract:
We trace movements of certain points in space-time along their corresponding continuous paths. We partition the space at every moment of time using alpha-complexes; the Voronoi medusa is then the collection or union of restricted Voronoi cells at every moment in time. We can imagine it as a four-dimensional structure formed when three-dimensional restricted Voronoi cells sweep continuously through the extra dimension of time. Similarly, the Delaunay medusa is the collection of the corresponding Delaunay triangulations at each moment in time. In this article we prove that these two structures are homotopy equivalent.
Submitted 18 April, 2016; v1 submitted 12 April, 2016;
originally announced April 2016.
-
An Introduction to Mechanized Reasoning
Authors:
Manfred Kerber,
Christoph Lange,
Colin Rowat
Abstract:
Mechanized reasoning uses computers to verify proofs and to help discover new theorems. Computer scientists have applied mechanized reasoning to economic problems but -- to date -- this work has not yet been properly presented in economics journals. We introduce mechanized reasoning to economists in three ways. First, we introduce mechanized reasoning in general, describing both the techniques and their successful applications. Second, we explain how mechanized reasoning has been applied to economic problems, concentrating on the two domains that have attracted the most attention: social choice theory and auction theory. Finally, we present a detailed example of mechanized reasoning in practice by means of a proof of Vickrey's familiar theorem on second-price auctions.
Submitted 10 August, 2016; v1 submitted 8 March, 2016;
originally announced March 2016.
-
Polynomial-Sized Topological Approximations Using The Permutahedron
Authors:
Aruni Choudhary,
Michael Kerber,
Sharath Raghvendra
Abstract:
Classical methods to model topological properties of point clouds, such as the Vietoris-Rips complex, suffer from the combinatorial explosion of complex sizes. We propose a novel technique to approximate a multi-scale filtration of the Rips complex with improved bounds for size: precisely, for $n$ points in $\mathbb{R}^d$, we obtain a $O(d)$-approximation with at most $n2^{O(d \log k)}$ simplices of dimension $k$ or lower. In conjunction with dimension reduction techniques, our approach yields a $O(\mathrm{polylog} (n))$-approximation of size $n^{O(1)}$ for Rips filtrations on arbitrary metric spaces. This result stems from high-dimensional lattice geometry and exploits properties of the permutahedral lattice, a well-studied structure in discrete geometry.
Building on the same geometric concept, we also present a lower bound result on the size of an approximate filtration: we construct a point set for which every $(1+ε)$-approximation of the Čech filtration has to contain $n^{Ω(\log\log n)}$ features, provided that $ε<\frac{1}{\log^{1+c} n}$ for $c\in(0,1)$.
Submitted 1 April, 2016; v1 submitted 12 January, 2016;
originally announced January 2016.
-
Semi-dynamic connectivity in the plane
Authors:
Sergio Cabello,
Michael Kerber
Abstract:
Motivated by a path planning problem we consider the following procedure. Assume that we have two points $s$ and $t$ in the plane and take $\mathcal{K}=\emptyset$. At each step we add to $\mathcal{K}$ a compact convex set that contains neither $s$ nor $t$. The procedure terminates when the sets in $\mathcal{K}$ separate $s$ and $t$. We show how to add one set to $\mathcal{K}$ in $O(1+kα(n))$ amortized time plus the time needed to find all sets of $\mathcal{K}$ intersecting the newly added set, where $n$ is the cardinality of $\mathcal{K}$, $k$ is the number of sets in $\mathcal{K}$ intersecting the newly added set, and $α(\cdot)$ is the inverse of the Ackermann function.
Submitted 12 February, 2015;
originally announced February 2015.
-
Budget Imbalance Criteria for Auctions: A Formalized Theorem
Authors:
Marco B. Caminati,
Manfred Kerber,
Colin Rowat
Abstract:
We present an original theorem in auction theory: it specifies general conditions under which the sum of the payments of all bidders is necessarily not identically zero, and more generally not constant. Moreover, it explicitly supplies a construction for a finite minimal set of possible bids on which such a sum is not constant. In particular, this theorem applies to the important case of a second-price Vickrey auction, where it reduces to a basic result of which a novel proof is given. To enhance the confidence in this new theorem, it has been formalized in Isabelle/HOL: the main results and definitions of the formal proof are reproduced here in common mathematical language, and are accompanied by an informal discussion about the underlying ideas.
Submitted 7 November, 2014;
originally announced December 2014.
-
The Offset Filtration of Convex Objects
Authors:
Dan Halperin,
Michael Kerber,
Doron Shaharabani
Abstract:
We consider offsets of a union of convex objects. We aim for a filtration, a sequence of nested cell complexes, that captures the topological evolution of the offsets for increasing radii. We describe methods to compute a filtration based on the Voronoi partition with respect to the given convex objects. We prove that, in two and three dimensions, the size of the filtration is proportional to the size of the Voronoi diagram. Our algorithm runs in $Θ(n \log{n})$ time in the $2$-dimensional case and in expected time $O(n^{3 + ε})$, for any $ε> 0$, in the $3$-dimensional case. Our approach is inspired by alpha-complexes for point sets, but requires more involved machinery and analysis, primarily since Voronoi regions of general convex objects do not form a good cover. We show by experiments that our approach results in a similarly fast and topologically more stable method for computing a filtration compared to approximating the input by point samples.
Submitted 23 January, 2015; v1 submitted 23 July, 2014;
originally announced July 2014.
-
Approximation and Streaming Algorithms for Projective Clustering via Random Projections
Authors:
Michael Kerber,
Sharath Raghvendra
Abstract:
Let $P$ be a set of $n$ points in $\mathbb{R}^d$. In the projective clustering problem, given $k, q$ and norm $ρ\in [1,\infty]$, we have to compute a set $\mathcal{F}$ of $k$ $q$-dimensional flats such that $(\sum_{p\in P}d(p, \mathcal{F})^ρ)^{1/ρ}$ is minimized; here $d(p, \mathcal{F})$ represents the (Euclidean) distance of $p$ to the closest flat in $\mathcal{F}$. We let $f_k^q(P,ρ)$ denote the minimal value and interpret $f_k^q(P,\infty)$ to be $\max_{r\in P}d(r, \mathcal{F})$. When $q=0$ and $ρ=1$, $2$, or $\infty$, the problem corresponds to the $k$-median, $k$-means, and $k$-center clustering problems, respectively.
For every $0 < ε< 1$, $S\subset P$ and $ρ\ge 1$, we show that the orthogonal projection of $P$ onto a randomly chosen flat of dimension $O(((q+1)^2\log(1/ε)/ε^3) \log n)$ will $ε$-approximate $f_1^q(S,ρ)$. This result combines the concepts of geometric coresets and subspace embeddings based on the Johnson-Lindenstrauss Lemma. As a consequence, an orthogonal projection of $P$ to an $O(((q+1)^2 \log ((q+1)/ε)/ε^3) \log n)$ dimensional randomly chosen subspace $ε$-approximates projective clusterings for every $k$ and $ρ$ simultaneously. Note that the dimension of this subspace is independent of the number of clusters~$k$.
Using this dimension reduction result, we obtain new approximation and streaming algorithms for projective clustering problems. For example, given a stream of $n$ points, we show how to compute an $ε$-approximate projective clustering for every $k$ and $ρ$ simultaneously using only $O((n+d)((q+1)^2\log ((q+1)/ε))/ε^3 \log n)$ space. Compared to standard streaming algorithms with $Ω(kd)$ space requirement, our approach is a significant improvement when the number of input points and their dimensions are of the same order of magnitude.
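To make the objective concrete, the following minimal sketch evaluates $(\sum_{p\in P}d(p, \mathcal{F})^ρ)^{1/ρ}$ in the special case $q=0$, where every flat is a single point (a cluster center). The function name `clustering_cost` and the sample points are illustrative assumptions and do not come from the paper; the random-projection step itself is not shown.

```python
import math

def clustering_cost(points, centers, rho):
    """Cost of a set of 0-dimensional flats (centers) under the rho-norm.

    Each point is charged its Euclidean distance to the closest center.
    rho may be math.inf, in which case the cost is the largest distance.
    """
    dists = [min(math.dist(p, c) for c in centers) for p in points]
    if math.isinf(rho):
        return max(dists)
    return sum(d ** rho for d in dists) ** (1.0 / rho)

# Hypothetical toy input: three collinear points and one center at the origin.
points = [(0.0, 0.0), (3.0, 4.0), (6.0, 8.0)]
centers = [(0.0, 0.0)]

cost_sum = clustering_cost(points, centers, 1)         # sum of distances
cost_sq = clustering_cost(points, centers, 2)          # root of summed squares
cost_max = clustering_cost(points, centers, math.inf)  # largest distance
```

The three calls correspond to the $ρ=1$, $ρ=2$ and $ρ=\infty$ cases named in the abstract, here for $k=1$.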
Submitted 2 June, 2015; v1 submitted 8 July, 2014;
originally announced July 2014.
-
Local Doubling Dimension of Point Sets
Authors:
Aruni Choudhary,
Michael Kerber
Abstract:
We introduce the notion of t-restricted doubling dimension of a point set in Euclidean space as the local intrinsic dimension up to scale t. In many applications information is only relevant for a fixed range of scales. We present an algorithm to construct a hierarchical net-tree up to scale t which we denote as the net-forest. We present a method based on Locality Sensitive Hashing to compute all near neighbours of points within a certain distance. Our construction of the net-forest is probabilistic, and we guarantee that with high probability, the net-forest is supplemented with the correct neighbouring information. We apply our net-forest construction scheme to create an approximate Čech complex up to a fixed scale; its complexity depends on the local intrinsic dimension up to that scale.
Submitted 18 June, 2014;
originally announced June 2014.
-
Set Theory or Higher Order Logic to Represent Auction Concepts in Isabelle?
Authors:
Marco B. Caminati,
Manfred Kerber,
Christoph Lange,
Colin Rowat
Abstract:
When faced with the question of how to represent properties in a formal proof system, any user has to make design decisions. We have proved three of the theorems from Maskin's 2004 survey article on Auction Theory using the Isabelle/HOL system, and we have produced verified code for combinatorial Vickrey auctions. A fundamental question in this was how to represent some basic concepts: since set theory is available inside Isabelle/HOL, when introducing new definitions there is often the issue of balancing the amount of set-theoretical objects and of objects expressed using entities which are more typical of higher order logic such as functions or lists. Likewise, a user often has to answer the question of whether to use a constructive or a non-constructive definition. Such decisions have consequences for the proof development and the usability of the formalization. For instance, sets are usually closer to the representation that economists would use and recognize, while the other objects are closer to the extraction of computational content. In this paper we give examples of the advantages and disadvantages for these approaches and their relationships. In addition, we present the corresponding Isabelle library of definitions and theorems, most prominently those dealing with relations and quotients.
Submitted 1 June, 2014;
originally announced June 2014.
-
Sphere Packing with Limited Overlap
Authors:
Mabel Iglesias-Ham,
Michael Kerber,
Caroline Uhler
Abstract:
The classical sphere packing problem asks for the best (infinite) arrangement of non-overlapping unit balls which cover as much space as possible. We define a generalized version of the problem, where we allow each ball a limited amount of overlap with other balls. We study two natural choices of overlap measures and obtain the optimal lattice packings in a parameterized family of lattices which contains the FCC, BCC, and integer lattice.
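As a baseline for the three lattices named above, the sketch below computes their classical packing densities in three dimensions, that is, with no overlap allowed. It does not implement the paper's overlap measures. The helper name `packing_density` and the normalisation to a unit cubic cell are assumptions made for illustration.

```python
import math

def packing_density(min_distance, cell_volume):
    """Density of a 3D lattice packing with non-overlapping equal balls.

    min_distance: shortest distance between two lattice points
    cell_volume:  volume of space per lattice point
    """
    radius = min_distance / 2.0  # largest radius without overlap
    ball_volume = (4.0 / 3.0) * math.pi * radius ** 3
    return ball_volume / cell_volume

# Unit cubic cell: 1 lattice point (integer), 2 (BCC) or 4 (FCC) per cell.
density_integer = packing_density(1.0, 1.0)
density_bcc = packing_density(math.sqrt(3.0) / 2.0, 1.0 / 2.0)
density_fcc = packing_density(math.sqrt(2.0) / 2.0, 1.0 / 4.0)
```

These evaluate to the well-known values $π/6$, $π\sqrt{3}/8$ and $π/(3\sqrt{2})$, roughly 0.52, 0.68 and 0.74.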
Submitted 2 January, 2014;
originally announced January 2014.
-
Distributed computation of persistent homology
Authors:
Ulrich Bauer,
Michael Kerber,
Jan Reininghaus
Abstract:
Persistent homology is a popular and powerful tool for capturing topological features of data. Advances in algorithms for computing persistent homology have reduced the computation time drastically -- as long as the algorithm does not exhaust the available memory. Following up on a recently presented parallel method for persistence computation on shared memory systems, we demonstrate that a simple adaption of the standard reduction algorithm leads to a variant for distributed systems. Our algorithmic design ensures that the data is distributed over the nodes without redundancy; this permits the computation of much larger instances than on a single machine. Moreover, we observe that the parallelism at least compensates for the overhead caused by communication between nodes, and often even speeds up the computation compared to sequential and even parallel shared memory algorithms. In our experiments, we were able to compute the persistent homology of filtrations with more than a billion (10^9) elements within seconds on a cluster with 32 nodes using less than 10GB of memory per node.
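For readers unfamiliar with the standard reduction algorithm mentioned above, here is a minimal sequential sketch over $\mathbb{Z}/2$. It is the textbook single-machine version, not the distributed variant of the paper. The function name `reduce_boundary_matrix` and the representation of columns as sets of row indices are assumptions made for illustration.

```python
def reduce_boundary_matrix(columns):
    """Standard left-to-right column reduction over Z/2.

    columns[j] holds the row indices of the nonzero entries of column j
    of the boundary matrix, with cells indexed in filtration order.
    Returns the persistence pairs (birth_index, death_index).
    """
    reduced = [set(col) for col in columns]
    pivot_owner = {}  # lowest row index (pivot) -> column that owns it
    pairs = []
    for j, col in enumerate(reduced):
        # Add earlier columns while the pivot collides with an owned one.
        while col and max(col) in pivot_owner:
            col ^= reduced[pivot_owner[max(col)]]  # addition mod 2
        if col:
            pivot = max(col)
            pivot_owner[pivot] = j
            pairs.append((pivot, j))
    return pairs

# Hypothetical filtration of a filled triangle:
# vertices 0, 1, 2; edges 3 = {0,1}, 4 = {1,2}, 5 = {0,2}; triangle 6.
boundary = [[], [], [], [0, 1], [1, 2], [0, 2], [3, 4, 5]]
pairs = reduce_boundary_matrix(boundary)
```

In this toy input, edges 3 and 4 merge components born at vertices 1 and 2, and edge 5 creates a cycle that the triangle fills, so the result is `[(1, 3), (2, 4), (5, 6)]`; vertex 0 stays unpaired.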
Submitted 2 October, 2013;
originally announced October 2013.
-
Proving soundness of combinatorial Vickrey auctions and generating verified executable code
Authors:
Marco B. Caminati,
Manfred Kerber,
Christoph Lange,
Colin Rowat
Abstract:
Using mechanised reasoning we prove that combinatorial Vickrey auctions are soundly specified in that they associate a unique outcome (allocation and transfers) to any valid input (bids). Having done so, we auto-generate verified executable code from the formally defined auction. This removes a source of error in implementing the auction design. We intend to use formal methods to verify new auction designs. Here, our contribution is to introduce and demonstrate the use of formal methods for auction verification in the familiar setting of a well-known auction.
Submitted 2 September, 2013; v1 submitted 8 August, 2013;
originally announced August 2013.
-
Approximate Cech Complexes in Low and High Dimensions
Authors:
Michael Kerber,
R. Sharathkumar
Abstract:
Čech complexes reveal valuable topological information about point sets at a certain scale in arbitrary dimensions, but the sheer size of these complexes limits their practical impact. While recent work introduced approximation techniques for filtrations of (Vietoris-)Rips complexes, a coarser version of Čech complexes, we propose the approximation of Čech filtrations directly.
For a fixed-dimensional point set $S$, we present an approximation of the Čech filtration of $S$ by a sequence of complexes of size linear in the number of points. We generalize well-separated pair decompositions (WSPD) to well-separated simplicial decompositions (WSSD) in which every simplex defined on $S$ is covered by some element of the WSSD. We give an efficient algorithm to compute a linear-sized WSSD in fixed-dimensional spaces. Using a WSSD, we then present a linear-sized approximation of the filtration of the Čech complex of $S$.
We also present a generalization of the known fact that the Rips complex approximates the Čech complex by a factor of $\sqrt{2}$. We define a class of complexes that interpolate between Čech and Rips complexes and that, given any parameter $ε > 0$, approximate the Čech complex by a factor $(1+ε)$. Our complex can be represented by roughly $O(n^{\lceil 1/2ε\rceil})$ simplices without any hidden dependence on the ambient dimension of the point set. Our results are based on an interesting link between the Čech complex and coresets for the minimum enclosing ball of high-dimensional point sets. As a consequence of our analysis, we show improved bounds on coresets that approximate the radius of the minimum enclosing ball.
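For orientation, one common way to state the known $\sqrt{2}$ relation is the interleaving below. The notation is an assumption made for illustration and may differ from the paper's: $\mathcal{C}_r(S)$ contains a simplex when the balls of radius $r$ around its vertices share a common point, and $\mathcal{R}_r(S)$ contains a simplex when all of its vertices are pairwise within distance $2r$.

```latex
\[
  \mathcal{C}_r(S) \;\subseteq\; \mathcal{R}_r(S) \;\subseteq\; \mathcal{C}_{\sqrt{2}\,r}(S)
  \qquad \text{for all } r \ge 0 .
\]
```

The first inclusion holds because balls of radius $r$ with a common point have centers at pairwise distance at most $2r$. The second follows from Jung's theorem: a Euclidean point set of diameter at most $2r$ lies in a ball of radius less than $\sqrt{2}\,r$, in any dimension.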
Submitted 11 July, 2013;
originally announced July 2013.
-
The ForMaRE Project - Formal Mathematical Reasoning in Economics
Authors:
Christoph Lange,
Colin Rowat,
Manfred Kerber
Abstract:
The ForMaRE project applies formal mathematical reasoning to economics. We seek to increase confidence in economics' theoretical results, to aid in discovering new results, and to foster interest in formal methods, i.e. computer-aided reasoning, within economics. To formal methods, we seek to contribute user experience feedback from new audiences, as well as new challenge problems. In the first project year, we continued earlier game theory studies but then focused on auctions, where we are building a toolbox of formalisations, and have started to study matching and financial risk.
In parallel to conducting research that connects economics and formal methods, we organise events and provide infrastructure to connect both communities, from fostering mutual awareness to targeted matchmaking. These efforts extend beyond economics, towards generally enabling domain experts to use mechanised reasoning.
Submitted 18 May, 2013; v1 submitted 18 March, 2013;
originally announced March 2013.
-
A Qualitative Comparison of the Suitability of Four Theorem Provers for Basic Auction Theory
Authors:
Christoph Lange,
Marco B. Caminati,
Manfred Kerber,
Till Mossakowski,
Colin Rowat,
Makarius Wenzel,
Wolfgang Windsteiger
Abstract:
Novel auction schemes are constantly being designed. Their design has significant consequences for the allocation of goods and the revenues generated. But how to tell whether a new design has the desired properties, such as efficiency, i.e. allocating goods to those bidders who value them most? We say: by formal, machine-checked proofs. We investigated the suitability of the Isabelle, Theorema, Mizar, and Hets/CASL/TPTP theorem provers for reproducing a key result of auction theory: Vickrey's 1961 theorem on the properties of second-price auctions. Based on our formalisation experience, taking an auction designer's perspective, we give recommendations on what system to use for formalising auctions, and outline further steps towards a complete auction theory toolbox.
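As a reminder of the object being formalised, the following is a minimal sketch of a single-good second-price (Vickrey) auction: the highest bidder wins and pays the highest bid among the other bidders. The function name `second_price_auction` and the tie-breaking rule (lowest index wins) are assumptions for illustration; this is not the verified code or any of the formalisations discussed in the paper.

```python
def second_price_auction(bids):
    """Return (winner_index, price) for a sealed-bid second-price auction.

    bids: list of non-negative numbers, one per bidder.
    Ties are broken in favour of the lowest index (an arbitrary choice).
    """
    if len(bids) < 2:
        raise ValueError("need at least two bidders")
    # max() returns the first maximal element, so ties go to the lowest index.
    winner = max(range(len(bids)), key=lambda i: bids[i])
    # The winner pays the highest bid among all other bidders.
    price = max(b for i, b in enumerate(bids) if i != winner)
    return winner, price

outcome = second_price_auction([10, 25, 18])
```

With the sample bids, bidder 1 wins and pays 18, the second-highest bid.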
Submitted 23 May, 2013; v1 submitted 18 March, 2013;
originally announced March 2013.