Skip to main content

Showing 1–50 of 51 results for author: Rieck, B

.
  1. arXiv:2402.09529  [pdf, other

    cs.LG math.AT

    The Manifold Density Function: An Intrinsic Method for the Validation of Manifold Learning

    Authors: Benjamin Holmgren, Eli Quist, Jordan Schupbach, Brittany Terese Fasy, Bastian Rieck

    Abstract: We introduce the manifold density function, which is an intrinsic method to validate manifold learning techniques. Our approach adapts and extends Ripley's $K$-function, and categorizes in an unsupervised setting the extent to which an output of a manifold learning algorithm captures the structure of a latent manifold. Our manifold density function generalizes to broad classes of Riemannian manifo… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: 24 pages, 6 figures

    MSC Class: 57Z25 ACM Class: I.5.2

  2. arXiv:2402.08871  [pdf, other

    cs.LG stat.ML

    Position: Topological Deep Learning is the New Frontier for Relational Learning

    Authors: Theodore Papamarkou, Tolga Birdal, Michael Bronstein, Gunnar Carlsson, Justin Curry, Yue Gao, Mustafa Hajij, Roland Kwitt, Pietro Liò, Paolo Di Lorenzo, Vasileios Maroulas, Nina Miolane, Farzana Nasrin, Karthikeyan Natesan Ramamurthy, Bastian Rieck, Simone Scardapane, Michael T. Schaub, Petar Veličković, Bei Wang, Yusu Wang, Guo-Wei Wei, Ghada Zamzmi

    Abstract: Topological deep learning (TDL) is a rapidly evolving field that uses topological features to understand and design deep learning models. This paper posits that TDL is the new frontier for relational learning. TDL may complement graph representation learning and geometric deep learning by incorporating topological concepts, and can thus provide a natural choice for various machine learning setting… ▽ More

    Submitted 30 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  3. arXiv:2402.01514  [pdf, other

    cs.LG math.AT stat.ML

    Map** the Multiverse of Latent Representations

    Authors: Jeremy Wayland, Corinna Coupette, Bastian Rieck

    Abstract: Echoing recent calls to counter reliability and robustness concerns in machine learning via multiverse analysis, we present PRESTO, a principled framework for map** the multiverse of machine-learning models that rely on latent representations. Although such models enjoy widespread adoption, the variability in their embeddings remains poorly understood, resulting in unnecessary complexity and unt… ▽ More

    Submitted 1 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted at ICML 2024

  4. arXiv:2312.08515  [pdf, other

    cs.LG math.AT

    Simplicial Representation Learning with Neural $k$-Forms

    Authors: Kelly Maggs, Celia Hacker, Bastian Rieck

    Abstract: Geometric deep learning extends deep learning to incorporate information about the geometry and topology data, especially in complex domains like graphs. Despite the popularity of message passing in this field, it has limitations such as the need for graph rewiring, ambiguity in interpreting data, and over-smoothing. In this paper, we take a different approach, focusing on leveraging geometric inf… ▽ More

    Submitted 15 March, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Accepted at ICLR 2024 (https://openreview.net/forum?id=Djw0XhjHZb)

  5. arXiv:2311.16054  [pdf, other

    cs.LG math.GT stat.ML

    Metric Space Magnitude for Evaluating the Diversity of Latent Representations

    Authors: Katharina Limbeck, Rayna Andreeva, Rik Sarkar, Bastian Rieck

    Abstract: The magnitude of a metric space is a novel invariant that provides a measure of the 'effective size' of a space across multiple scales, while also capturing numerous geometrical properties, such as curvature, density, or entropy. We develop a family of magnitude-based measures of the intrinsic diversity of latent representations, formalising a novel notion of dissimilarity between magnitude functi… ▽ More

    Submitted 21 June, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

  6. arXiv:2310.07630  [pdf, other

    cs.LG

    Differentiable Euler Characteristic Transforms for Shape Classification

    Authors: Ernst Roell, Bastian Rieck

    Abstract: The Euler Characteristic Transform (ECT) has proven to be a powerful representation, combining geometrical and topological characteristics of shapes and graphs. However, the ECT was hitherto unable to learn task-specific representations. We overcome this issue and develop a novel computational layer that enables learning the ECT in an end-to-end fashion. Our method, the Differentiable Euler Charac… ▽ More

    Submitted 19 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: Accepted at ICLR 2024 (https://openreview.net/forum?id=MO632iPq3I)

  7. arXiv:2309.03616  [pdf, other

    cs.LG

    Filtration Surfaces for Dynamic Graph Classification

    Authors: Franz Srambical, Bastian Rieck

    Abstract: Existing approaches for classifying dynamic graphs either lift graph kernels to the temporal domain, or use graph neural networks (GNNs). However, current baselines have scalability issues, cannot handle a changing node set, or do not take edge weight information into account. We propose filtration surfaces, a novel method that is scalable and flexible, to alleviate said restrictions. We experimen… ▽ More

    Submitted 21 October, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

  8. arXiv:2307.14025  [pdf, other

    cs.LG cs.CV eess.IV q-bio.QM stat.ML

    Topologically Regularized Multiple Instance Learning to Harness Data Scarcity

    Authors: Salome Kazeminia, Carsten Marr, Bastian Rieck

    Abstract: In biomedical data analysis, Multiple Instance Learning (MIL) models have emerged as a powerful tool to classify patients' microscopy samples. However, the data-intensive requirement of these models poses a significant challenge in scenarios with scarce data availability, e.g., in rare diseases. We introduce a topological regularization term to MIL to mitigate this challenge. It provides a shape-p… ▽ More

    Submitted 11 March, 2024; v1 submitted 26 July, 2023; originally announced July 2023.

  9. arXiv:2306.00586  [pdf, other

    cs.LG cs.CY

    Evaluating the "Learning on Graphs" Conference Experience

    Authors: Bastian Rieck, Corinna Coupette

    Abstract: With machine learning conferences growing ever larger, and reviewing processes becoming increasingly elaborate, more data-driven insights into their workings are required. In this report, we present the results of a survey accompanying the first "Learning on Graphs" (LoG) Conference. The survey was directed to evaluate the submission and review process from different perspectives, including author… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  10. arXiv:2305.19303  [pdf, other

    physics.chem-ph cs.LG

    MAGNet: Motif-Agnostic Generation of Molecules from Shapes

    Authors: Leon Hetzel, Johanna Sommer, Bastian Rieck, Fabian Theis, Stephan Günnemann

    Abstract: Recent advances in machine learning for molecules exhibit great potential for facilitating drug discovery from in silico predictions. Most models for molecule generation rely on the decomposition of molecules into frequently occurring substructures (motifs), from which they generate novel compounds. While motif representations greatly aid in learning molecular distributions, such methods struggle… ▽ More

    Submitted 7 November, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  11. arXiv:2305.05611  [pdf, other

    cs.LG math.GT stat.ML

    Metric Space Magnitude and Generalisation in Neural Networks

    Authors: Rayna Andreeva, Katharina Limbeck, Bastian Rieck, Rik Sarkar

    Abstract: Deep learning models have seen significant successes in numerous applications, but their inner workings remain elusive. The purpose of this work is to quantify the learning process of deep neural networks through the lens of a novel topological invariant called magnitude. Magnitude is an isometry invariant; its properties are an active area of research as it encodes many known invariants of a metr… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

  12. arXiv:2304.12417  [pdf, other

    cs.DL cs.DB math.AT

    DONUT -- Creation, Development, and Opportunities of a Database

    Authors: Barbara Giunti, Jānis Lazovskis, Bastian Rieck

    Abstract: DONUT is a database of papers about practical, real-world uses of Topological Data Analysis (TDA). Its original seed was planted in a group chat formed during the HIM Spring School on Applied and Computational Algebraic Topology in April 2017. This document describes the creation, curation, and maintenance process of the database.

    Submitted 24 April, 2023; originally announced April 2023.

  13. arXiv:2303.05286  [pdf, other

    cs.LG q-bio.QM

    Euler Characteristic Transform Based Topological Loss for Reconstructing 3D Images from Single 2D Slices

    Authors: Kalyan Varma Nadimpalli, Amit Chattopadhyay, Bastian Rieck

    Abstract: The computer vision task of reconstructing 3D images, i.e., shapes, from their single 2D image slices is extremely challenging, more so in the regime of limited data. Deep learning models typically optimize geometric loss functions, which may lead to poor reconstructions as they ignore the structural properties of the shape. To tackle this, we propose a novel topological loss function based on the… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: e-print

  14. arXiv:2302.09826  [pdf, other

    cs.LG math.AT stat.ML

    On the Expressivity of Persistent Homology in Graph Learning

    Authors: Rubén Ballester, Bastian Rieck

    Abstract: Persistent homology, a technique from computational topology, has recently shown strong empirical performance in the context of graph classification. Being able to capture long range graph properties via higher-order topological features, such as cycles of arbitrary length, in combination with multi-scale topological descriptors, has improved predictive performance for data sets with prominent top… ▽ More

    Submitted 3 June, 2024; v1 submitted 20 February, 2023; originally announced February 2023.

    MSC Class: 55N31 (Primary) 62R40; 68T09 (Secondary)

  15. arXiv:2301.12906  [pdf, other

    cs.LG math.AT stat.ML

    Curvature Filtrations for Graph Generative Model Evaluation

    Authors: Joshua Southern, Jeremy Wayland, Michael Bronstein, Bastian Rieck

    Abstract: Graph generative model evaluation necessitates understanding differences between graphs on the distributional level. This entails being able to harness salient attributes of graphs in an efficient manner. Curvature constitutes one such property that has recently proved its utility in characterising graphs. Its expressive properties, stability, and practical utility in model evaluation remain large… ▽ More

    Submitted 26 October, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: Accepted at the 37th Conference on Neural Information Processing Systems (NeurIPS) 2023

  16. arXiv:2210.12048  [pdf, other

    cs.LG cs.SI stat.ML

    Ollivier-Ricci Curvature for Hypergraphs: A Unified Framework

    Authors: Corinna Coupette, Sebastian Dalleiger, Bastian Rieck

    Abstract: Bridging geometry and topology, curvature is a powerful and expressive invariant. While the utility of curvature has been theoretically and empirically confirmed in the context of manifolds and graphs, its generalization to the emerging domain of hypergraphs has remained largely unexplored. On graphs, the Ollivier-Ricci curvature measures differences between random walks via Wasserstein distances,… ▽ More

    Submitted 6 April, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: Accepted at ICLR 2023 (https://openreview.net/forum?id=sPCKNl5qDps)

    MSC Class: 68R10

  17. arXiv:2210.00069  [pdf, other

    cs.LG cs.AI math.AT stat.ML

    Topological Singularity Detection at Multiple Scales

    Authors: Julius von Rohrscheidt, Bastian Rieck

    Abstract: The manifold hypothesis, which assumes that data lies on or close to an unknown manifold of low intrinsic dimension, is a staple of modern machine learning research. However, recent work has shown that real-world data exhibits distinct non-manifold structures, i.e. singularities, that can lead to erroneous findings. Detecting such singularities is therefore crucial as a precursor to interpolation… ▽ More

    Submitted 14 June, 2023; v1 submitted 30 September, 2022; originally announced October 2022.

    Comments: Accepted at the International Conference on Machine Learning (ICML) 2023; camera-ready version

    MSC Class: 55N31 (Primary); 32S50 (Secondary)

  18. arXiv:2208.14125  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    A Diffusion Model Predicts 3D Shapes from 2D Microscopy Images

    Authors: Dominik J. E. Waibel, Ernst Röell, Bastian Rieck, Raja Giryes, Carsten Marr

    Abstract: Diffusion models are a special type of generative model, capable of synthesising new data from a learnt distribution. We introduce DISPR, a diffusion-based model for solving the inverse problem of three-dimensional (3D) cell shape prediction from two-dimensional (2D) single cell microscopy images. Using the 2D microscopy image as a prior, DISPR is conditioned to predict realistic 3D shape reconstr… ▽ More

    Submitted 14 March, 2023; v1 submitted 30 August, 2022; originally announced August 2022.

    MSC Class: 68-06

  19. arXiv:2206.08252  [pdf, other

    cs.LG stat.ML

    On the Surprising Behaviour of node2vec

    Authors: Celia Hacker, Bastian Rieck

    Abstract: Graph embedding techniques are a staple of modern graph learning research. When using embeddings for downstream tasks such as classification, information about their stability and robustness, i.e., their susceptibility to sources of noise, stochastic effects, or specific parameter choices, becomes increasingly important. As one of the most prominent graph embedding schemes, we focus on node2vec an… ▽ More

    Submitted 19 August, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: ICML 2022 Workshop on Topology, Algebra, and Geometry in Machine Learning (Camera-Ready Version)

  20. arXiv:2206.08225  [pdf, other

    cs.LG cs.CL cs.CY cs.SI

    All the World's a (Hyper)Graph: A Data Drama

    Authors: Corinna Coupette, Jilles Vreeken, Bastian Rieck

    Abstract: We introduce Hyperbard, a dataset of diverse relational data representations derived from Shakespeare's plays. Our representations range from simple graphs capturing character co-occurrence in single scenes to hypergraphs encoding complex communication settings and character contributions as hyperedges with edge-specific node weights. By making multiple intuitive representations readily available… ▽ More

    Submitted 6 December, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: This is the full version of our paper; an abridged version appears in Digital Scholarship in the Humanities. Landing page for code and data: https://hyperbard.net/

  21. arXiv:2206.07729  [pdf, other

    cs.LG

    Taxonomy of Benchmarks in Graph Representation Learning

    Authors: Renming Liu, Semih Cantürk, Frederik Wenkel, Sarah McGuire, Xinyi Wang, Anna Little, Leslie O'Bray, Michael Perlmutter, Bastian Rieck, Matthew Hirn, Guy Wolf, Ladislav Rampášek

    Abstract: Graph Neural Networks (GNNs) extend the success of neural networks to graph-structured data by accounting for their intrinsic geometry. While extensive research has been done on develo** GNN models with superior performance according to a collection of graph representation learning benchmarks, it is currently not well understood what aspects of a given model are probed by them. For example, to w… ▽ More

    Submitted 30 November, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: In Proceedings of the First Learning on Graphs Conference (LoG 2022)

  22. arXiv:2206.03977  [pdf, other

    cs.LG

    Diffusion Curvature for Estimating Local Curvature in High Dimensional Data

    Authors: Dhananjay Bhaskar, Kincaid MacDonald, Oluwadamilola Fasina, Dawson Thomas, Bastian Rieck, Ian Adelstein, Smita Krishnaswamy

    Abstract: We introduce a new intrinsic measure of local curvature on point-cloud data called diffusion curvature. Our measure uses the framework of diffusion maps, including the data diffusion operator, to structure point cloud data and define local curvature based on the laziness of a random walk starting at a point or region of the data. We show that this laziness directly relates to volume comparison res… ▽ More

    Submitted 8 June, 2022; originally announced June 2022.

    Journal ref: Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022)

  23. arXiv:2203.14860  [pdf, other

    cs.LG stat.ML

    Time-inhomogeneous diffusion geometry and topology

    Authors: Guillaume Huguet, Alexander Tong, Bastian Rieck, Jessie Huang, Manik Kuchroo, Matthew Hirn, Guy Wolf, Smita Krishnaswamy

    Abstract: Diffusion condensation is a dynamic process that yields a sequence of multiscale data representations that aim to encode meaningful abstractions. It has proven effective for manifold learning, denoising, clustering, and visualization of high-dimensional data. Diffusion condensation is constructed as a time-inhomogeneous process where each step first computes and then applies a diffusion operator t… ▽ More

    Submitted 5 January, 2023; v1 submitted 28 March, 2022; originally announced March 2022.

  24. arXiv:2203.01703  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    Capturing Shape Information with Multi-Scale Topological Loss Terms for 3D Reconstruction

    Authors: Dominik J. E. Waibel, Scott Atwell, Matthias Meier, Carsten Marr, Bastian Rieck

    Abstract: Reconstructing 3D objects from 2D images is both challenging for our brains and machine learning algorithms. To support this spatial reasoning task, contextual information about the overall shape of an object is critical. However, such information is not captured by established loss terms (e.g. Dice loss). We propose to complement geometrical shape information by including multi-scale topological… ▽ More

    Submitted 16 September, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: Accepted at the 25th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI)

  25. arXiv:2202.08070  [pdf, other

    cs.LG stat.ML

    On Measuring Excess Capacity in Neural Networks

    Authors: Florian Graf, Sebastian Zeng, Bastian Rieck, Marc Niethammer, Roland Kwitt

    Abstract: We study the excess capacity of deep networks in the context of supervised classification. That is, given a capacity measure of the underlying hypothesis class - in our case, empirical Rademacher complexity - to what extent can we (a priori) constrain this class while retaining an empirical error on a par with the unconstrained regime? To assess excess capacity in modern architectures (such as res… ▽ More

    Submitted 19 January, 2023; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: Updated to Neurips 2022 camera-ready version

  26. arXiv:2112.09992  [pdf, other

    cs.LG cs.DS cs.NE stat.ML

    Weisfeiler and Leman go Machine Learning: The Story so far

    Authors: Christopher Morris, Yaron Lipman, Haggai Maron, Bastian Rieck, Nils M. Kriege, Martin Grohe, Matthias Fey, Karsten Borgwardt

    Abstract: In recent years, algorithms and neural architectures based on the Weisfeiler--Leman algorithm, a well-known heuristic for the graph isomorphism problem, have emerged as a powerful tool for machine learning with graphs and relational data. Here, we give a comprehensive overview of the algorithm's use in a machine-learning setting, focusing on the supervised regime. We discuss the theoretical backgr… ▽ More

    Submitted 13 July, 2023; v1 submitted 18 December, 2021; originally announced December 2021.

    Comments: Accepted at JMLR

  27. arXiv:2111.08701  [pdf, other

    eess.IV cs.LG

    Interpretability Aware Model Training to Improve Robustness against Out-of-Distribution Magnetic Resonance Images in Alzheimer's Disease Classification

    Authors: Merel Kuijs, Catherine R. Jutzeler, Bastian Rieck, Sarah C. Brüningk

    Abstract: Owing to its pristine soft-tissue contrast and high resolution, structural magnetic resonance imaging (MRI) is widely applied in neurology, making it a valuable data source for image-based machine learning (ML) and deep learning applications. The physical nature of MRI acquisition and reconstruction, however, causes variations in image intensity, resolution, and signal-to-noise ratio. Since ML mod… ▽ More

    Submitted 14 November, 2021; originally announced November 2021.

    Comments: Machine Learning for Health (ML4H) - Extended Abstract

  28. arXiv:2110.15188  [pdf, other

    cs.LG cs.CV math.AT

    The magnitude vector of images

    Authors: Michael F. Adamer, Edward De Brouwer, Leslie O'Bray, Bastian Rieck

    Abstract: The magnitude of a finite metric space has recently emerged as a novel invariant quantity, allowing to measure the effective size of a metric space. Despite encouraging first results demonstrating the descriptive abilities of the magnitude, such as being able to detect the boundary of a metric space, the potential use cases of magnitude remain under-explored. In this work, we investigate the prope… ▽ More

    Submitted 7 October, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

  29. arXiv:2110.14809  [pdf, other

    cs.LG

    Towards a Taxonomy of Graph Learning Datasets

    Authors: Renming Liu, Semih Cantürk, Frederik Wenkel, Dylan Sandfelder, Devin Kreuzer, Anna Little, Sarah McGuire, Leslie O'Bray, Michael Perlmutter, Bastian Rieck, Matthew Hirn, Guy Wolf, Ladislav Rampášek

    Abstract: Graph neural networks (GNNs) have attracted much attention due to their ability to leverage the intrinsic geometries of the underlying data. Although many different types of GNN models have been developed, with many benchmarking procedures to demonstrate the superiority of one GNN model over the others, there is a lack of systematic understanding of the underlying benchmarking datasets, and what a… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: in Data-Centric AI Workshop at NeurIPS 2021

  30. arXiv:2107.05230  [pdf, other

    cs.LG

    Predicting sepsis in multi-site, multi-national intensive care cohorts using deep learning

    Authors: Michael Moor, Nicolas Bennet, Drago Plecko, Max Horn, Bastian Rieck, Nicolai Meinshausen, Peter Bühlmann, Karsten Borgwardt

    Abstract: Despite decades of clinical research, sepsis remains a global public health crisis with high mortality, and morbidity. Currently, when sepsis is detected and the underlying pathogen is identified, organ damage may have already progressed to irreversible stages. Effective sepsis management is therefore highly time-sensitive. By systematically analysing trends in the plethora of clinical data availa… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

  31. arXiv:2106.01098  [pdf, other

    cs.LG cs.SI stat.ML

    Evaluation Metrics for Graph Generative Models: Problems, Pitfalls, and Practical Solutions

    Authors: Leslie O'Bray, Max Horn, Bastian Rieck, Karsten Borgwardt

    Abstract: Graph generative models are a highly active branch of machine learning. Given the steady development of new models of ever-increasing complexity, it is necessary to provide a principled way to evaluate and compare them. In this paper, we enumerate the desirable criteria for such a comparison metric and provide an overview of the status quo of graph generative model comparison in use today, which p… ▽ More

    Submitted 18 March, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: Accepted as a Spotlight presentation at ICLR 2022

  32. arXiv:2104.12235  [pdf, other

    math.OC cs.DS

    Basic Analysis of Bin-Packing Heuristics

    Authors: Bastian Rieck

    Abstract: The bin-packing problem continues to remain relevant in numerous application areas. This technical report discusses the empirical performance of different bin-packing heuristics for certain test problems.

    Submitted 25 April, 2021; originally announced April 2021.

  33. arXiv:2102.07835  [pdf, other

    cs.LG math.AT stat.ML

    Topological Graph Neural Networks

    Authors: Max Horn, Edward De Brouwer, Michael Moor, Yves Moreau, Bastian Rieck, Karsten Borgwardt

    Abstract: Graph neural networks (GNNs) are a powerful architecture for tackling graph learning tasks, yet have been shown to be oblivious to eminent substructures such as cycles. We present TOGL, a novel layer that incorporates global topological information of a graph using persistent homology. TOGL can be easily integrated into any type of GNN and is strictly more expressive (in terms the Weisfeiler--Lehm… ▽ More

    Submitted 17 March, 2022; v1 submitted 15 February, 2021; originally announced February 2021.

    Journal ref: Tenth International Conference on Learning Representations (ICLR), 2022

  34. arXiv:2102.00485  [pdf, other

    cs.LG stat.ML

    Exploring the Geometry and Topology of Neural Network Loss Landscapes

    Authors: Stefan Horoi, Jessie Huang, Bastian Rieck, Guillaume Lajoie, Guy Wolf, Smita Krishnaswamy

    Abstract: Recent work has established clear links between the generalization performance of trained neural networks and the geometry of their loss landscape near the local minima to which they converge. This suggests that qualitative and quantitative examination of the loss landscape geometry could yield insights about neural network generalization performance during training. To this end, researchers have… ▽ More

    Submitted 26 January, 2022; v1 submitted 31 January, 2021; originally announced February 2021.

    Comments: Accepted at the 20th Symposium on Intelligent Data Analysis (IDA) 2022

  35. arXiv:2011.11070  [pdf, other

    q-bio.GN cs.LG

    Topological Data Analysis of copy number alterations in cancer

    Authors: Stefan Groha, Caroline Weis, Alexander Gusev, Bastian Rieck

    Abstract: Identifying subgroups and properties of cancer biopsy samples is a crucial step towards obtaining precise diagnoses and being able to perform personalized treatment of cancer patients. Recent data collections provide a comprehensive characterization of cancer cell data, including genetic data on copy number alterations (CNAs). We explore the potential to capture information contained in cancer gen… ▽ More

    Submitted 22 April, 2021; v1 submitted 22 November, 2020; originally announced November 2020.

  36. arXiv:2011.06531  [pdf, other

    cs.LG eess.IV stat.ML

    Image analysis for Alzheimer's disease prediction: Embracing pathological hallmarks for model architecture design

    Authors: Sarah C. Brüningk, Felix Hensel, Catherine R. Jutzeler, Bastian Rieck

    Abstract: Alzheimer's disease (AD) is associated with local (e.g. brain tissue atrophy) and global brain changes (loss of cerebral connectivity), which can be detected by high-resolution structural magnetic resonance imaging. Conventionally, these changes and their relation to AD are investigated independently. Here, we introduce a novel, highly-scalable approach that simultaneously captures… ▽ More

    Submitted 10 May, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

    Comments: 8 pages, 1 figure, Machine Learning for Health (ML4H) at NeurIPS 2020 - Extended Abstract

  37. arXiv:2011.03854  [pdf, other

    cs.LG stat.ML

    Graph Kernels: State-of-the-Art and Future Challenges

    Authors: Karsten Borgwardt, Elisabetta Ghisu, Felipe Llinares-López, Leslie O'Bray, Bastian Rieck

    Abstract: Graph-structured data are an integral part of many application domains, including chemoinformatics, computational biology, neuroimaging, and social network analysis. Over the last two decades, numerous graph kernels, i.e. kernel functions between graphs, have been proposed to solve the problem of assessing the similarity between graphs, thereby making it possible to perform predictions in both cla… ▽ More

    Submitted 10 November, 2020; v1 submitted 7 November, 2020; originally announced November 2020.

    Comments: Accepted by Foundations and Trends in Machine Learning, 2020

  38. arXiv:2009.06116  [pdf, other

    cs.CV cs.DB cs.DL cs.LG eess.IV

    Accelerating COVID-19 Differential Diagnosis with Explainable Ultrasound Image Analysis

    Authors: Jannis Born, Nina Wiedemann, Gabriel Brändle, Charlotte Buhre, Bastian Rieck, Karsten Borgwardt

    Abstract: Controlling the COVID-19 pandemic largely hinges upon the existence of fast, safe, and highly-available diagnostic tools. Ultrasound, in contrast to CT or X-Ray, has many practical advantages and can serve as a globally-applicable first-line examination technique. We provide the largest publicly available lung ultrasound (US) dataset for COVID-19 consisting of 106 videos from three classes (COVID-… ▽ More

    Submitted 13 September, 2020; originally announced September 2020.

    Comments: 8 pages, 4 figures

    Journal ref: Applied Sciences 2021 (special issue on: "Fighting COVID-19: Emerging Techniques and Aid Systems for Prevention, Forecasting and Diagnosis")

  39. arXiv:2006.07882  [pdf, other

    q-bio.NC cs.LG eess.IV math.AT stat.ML

    Uncovering the Topology of Time-Varying fMRI Data using Cubical Persistence

    Authors: Bastian Rieck, Tristan Yates, Christian Bock, Karsten Borgwardt, Guy Wolf, Nicholas Turk-Browne, Smita Krishnaswamy

    Abstract: Functional magnetic resonance imaging (fMRI) is a crucial technology for gaining insights into cognitive processes in humans. Data amassed from fMRI measurements result in volumetric data sets that vary over time. However, analysing such data presents a challenge due to the large degree of noise and person-to-person variation in how information is represented in the brain. To address this challeng… ▽ More

    Submitted 22 October, 2020; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: Accepted at the Conference on Neural Information Processing Systems (NeurIPS) 2020; camera-ready version

  40. arXiv:2005.12359  [pdf, other

    cs.LG stat.ML

    Path Imputation Strategies for Signature Models of Irregular Time Series

    Authors: Michael Moor, Max Horn, Christian Bock, Karsten Borgwardt, Bastian Rieck

    Abstract: The signature transform is a 'universal nonlinearity' on the space of continuous vector-valued paths, and has received attention for use in machine learning on time series. However, real-world temporal data is typically observed at discrete points in time, and must first be transformed into a continuous path before signature techniques can be applied. We make this step explicit by characterising i… ▽ More

    Submitted 6 June, 2020; v1 submitted 25 May, 2020; originally announced May 2020.

  41. arXiv:1909.12064  [pdf, other

    cs.LG stat.ML

    Set Functions for Time Series

    Authors: Max Horn, Michael Moor, Christian Bock, Bastian Rieck, Karsten Borgwardt

    Abstract: Despite the eminent successes of deep neural networks, many architectures are often hard to transfer to irregularly-sampled and asynchronous time series that commonly occur in real-world datasets, especially in healthcare applications. This paper proposes a novel approach for classifying irregularly-sampled time series with unaligned measurements, focusing on high scalability and data efficiency.… ▽ More

    Submitted 14 September, 2020; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: Accepted at the International Conference on Machine Learning (ICML) 2020

  42. Topological Machine Learning with Persistence Indicator Functions

    Authors: Bastian Rieck, Filip Sadlo, Heike Leitte

    Abstract: Techniques from computational topology, in particular persistent homology, are becoming increasingly relevant for data analysis. Their stable metrics permit the use of many distance-based data analysis methods, such as multidimensional scaling, while providing a firm theoretical ground. Many modern machine learning algorithms, however, are based on kernels. This paper presents persistence indicato… ▽ More

    Submitted 31 July, 2019; originally announced July 2019.

    Comments: Topology-based Methods in Visualization 2017

  43. Hierarchies and Ranks for Persistence Pairs

    Authors: Bastian Rieck, Filip Sadlo, Heike Leitte

    Abstract: We develop a novel hierarchy for zero-dimensional persistence pairs, i.e., connected components, which is capable of capturing more fine-grained spatial relations between persistence pairs. Our work is motivated by a lack of spatial relationships between features in persistence diagrams, leading to a limited expressive power. We build upon a recently-introduced hierarchy of pairs in persistence di… ▽ More

    Submitted 31 July, 2019; originally announced July 2019.

    Comments: Topology-based Methods in Visualization 2017

  44. Persistence Concepts for 2D Skeleton Evolution Analysis

    Authors: Bastian Rieck, Filip Sadlo, Heike Leitte

    Abstract: In this work, we present concepts for the analysis of the evolution of two-dimensional skeletons. By introducing novel persistence concepts, we are able to reduce typical temporal incoherence, and provide insight in skeleton dynamics. We exemplify our approach by means of a simulation of viscous fingering---a highly dynamic process whose analysis is a hot topic in porous media research.

    Submitted 31 July, 2019; originally announced July 2019.

    Comments: Topology-based Methods in Visualization 2017

  45. Persistent Intersection Homology for the Analysis of Discrete Data

    Authors: Bastian Rieck, Markus Banagl, Filip Sadlo, Heike Leitte

    Abstract: Topological data analysis is becoming increasingly relevant to support the analysis of unstructured data sets. A common assumption in data analysis is that the data set is a sample---not necessarily a uniform one---of some high-dimensional manifold. In such cases, persistent homology can be successfully employed to extract features, remove noise, and compare data sets. The underlying problems in s… ▽ More

    Submitted 31 July, 2019; originally announced July 2019.

    Comments: Topology-based Methods in Visualization 2017

  46. arXiv:1906.01277  [pdf, other

    cs.LG q-bio.MN stat.ML

    Wasserstein Weisfeiler-Lehman Graph Kernels

    Authors: Matteo Togninalli, Elisabetta Ghisu, Felipe Llinares-López, Bastian Rieck, Karsten Borgwardt

    Abstract: Most graph kernels are an instance of the class of $\mathcal{R}$-Convolution kernels, which measure the similarity of objects by comparing their substructures. Despite their empirical success, most graph kernels use a naive aggregation of the final set of substructures, usually a sum or average, thereby potentially discarding valuable information about the distribution of individual components. Fu… ▽ More

    Submitted 30 October, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: Accepted as a Spotlight talk at the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019)

  47. arXiv:1906.00722  [pdf, other

    cs.LG math.AT stat.ML

    Topological Autoencoders

    Authors: Michael Moor, Max Horn, Bastian Rieck, Karsten Borgwardt

    Abstract: We propose a novel approach for preserving topological structures of the input space in latent representations of autoencoders. Using persistent homology, a technique from topological data analysis, we calculate topological signatures of both the input and latent space to derive a topological loss term. Under weak theoretical assumptions, we construct this loss in a differentiable manner, such tha… ▽ More

    Submitted 31 May, 2021; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: Accepted at the International Conference on Machine Learning (ICML) 2020; camera-ready version

  48. arXiv:1905.10996  [pdf, other

    cs.LG math.AT stat.ML

    Graph Filtration Learning

    Authors: Christoph D. Hofer, Florian Graf, Bastian Rieck, Marc Niethammer, Roland Kwitt

    Abstract: We propose an approach to learning with graph-structured data in the problem domain of graph classification. In particular, we present a novel type of readout operation to aggregate node features into a graph-level representation. To this end, we leverage persistent homology computed via a real-valued, learnable, filter function. We establish the theoretical foundation for differentiating through… ▽ More

    Submitted 17 May, 2021; v1 submitted 27 May, 2019; originally announced May 2019.

  49. arXiv:1904.07990  [pdf

    cs.LG stat.AP stat.ML

    Machine learning for early prediction of circulatory failure in the intensive care unit

    Authors: Stephanie L. Hyland, Martin Faltys, Matthias Hüser, Xinrui Lyu, Thomas Gumbsch, Cristóbal Esteban, Christian Bock, Max Horn, Michael Moor, Bastian Rieck, Marc Zimmermann, Dean Bodenham, Karsten Borgwardt, Gunnar Rätsch, Tobias M. Merz

    Abstract: Intensive care clinicians are presented with large quantities of patient information and measurements from a multitude of monitoring systems. The limited ability of humans to process such complex information hinders physicians to readily recognize and act on early signs of patient deterioration. We used machine learning to develop an early warning system for circulatory failure based on a high-res… ▽ More

    Submitted 19 April, 2019; v1 submitted 16 April, 2019; originally announced April 2019.

    Comments: 5 main figures, 1 main table, 13 supplementary figures, 5 supplementary tables; 250ppi images

  50. arXiv:1902.01659  [pdf, other

    cs.LG stat.AP stat.ML

    Early Recognition of Sepsis with Gaussian Process Temporal Convolutional Networks and Dynamic Time War**

    Authors: Michael Moor, Max Horn, Bastian Rieck, Damian Roqueiro, Karsten Borgwardt

    Abstract: Sepsis is a life-threatening host response to infection associated with high mortality, morbidity, and health costs. Its management is highly time-sensitive since each hour of delayed treatment increases mortality due to irreversible organ damage. Meanwhile, despite decades of clinical research, robust biomarkers for sepsis are missing. Therefore, detecting sepsis early by utilizing the affluence… ▽ More

    Submitted 15 October, 2020; v1 submitted 5 February, 2019; originally announced February 2019.

    Comments: Accepted at the Machine Learning for Healthcare 2019 Conference (MLHC). Camera-ready version