Search | arXiv e-print repository

Neural Persistence Dynamics

Authors: Sebastian Zeng, Florian Graf, Martin Uray, Stefan Huber, Roland Kwitt

Abstract: We consider the problem of learning the dynamics in the topology of time-evolving point clouds, the prevalent spatiotemporal model for systems exhibiting collective behavior, such as swarms of insects and birds or particles in physics. In such systems, patterns emerge from (local) interactions among self-propelled entities. While several well-understood governing equations for motion and interacti… ▽ More We consider the problem of learning the dynamics in the topology of time-evolving point clouds, the prevalent spatiotemporal model for systems exhibiting collective behavior, such as swarms of insects and birds or particles in physics. In such systems, patterns emerge from (local) interactions among self-propelled entities. While several well-understood governing equations for motion and interaction exist, they are difficult to fit to data due to the often large number of entities and missing correspondences between the observation times, which may also not be equidistant. To evade such confounding factors, we investigate collective behavior from a \textit{topological perspective}, but instead of summarizing entire observation sequences (as in prior work), we propose learning a latent dynamical model from topological features \textit{per time point}. The latter is then used to formulate a downstream regression task to predict the parametrization of some a priori specified governing equation. We implement this idea based on a latent ODE learned from vectorized (static) persistence diagrams and show that this modeling choice is justified by a combination of recent stability results for persistent homology. Various (ablation) experiments not only demonstrate the relevance of each individual model component, but provide compelling empirical evidence that our proposed model -- \textit{neural persistence dynamics} -- substantially outperforms the state-of-the-art across a diverse set of parameter regression tasks. △ Less

Submitted 24 May, 2024; originally announced May 2024.

arXiv:2404.17791 [pdf, other]

doi 10.1109/TRO.2024.3420799

HIPer: A Human-Inspired Scene Perception Model for Multifunctional Mobile Robots

Authors: Florenz Graf, Jochen Lindermayr, Birgit Graf, Werner Kraus, Marco F. Huber

Abstract: Taking over arbitrary tasks like humans do with a mobile service robot in open-world settings requires a holistic scene perception for decision-making and high-level control. This paper presents a human-inspired scene perception model to minimize the gap between human and robotic capabilities. The approach takes over fundamental neuroscience concepts, such as a triplet perception split into recogn… ▽ More Taking over arbitrary tasks like humans do with a mobile service robot in open-world settings requires a holistic scene perception for decision-making and high-level control. This paper presents a human-inspired scene perception model to minimize the gap between human and robotic capabilities. The approach takes over fundamental neuroscience concepts, such as a triplet perception split into recognition, knowledge representation, and knowledge interpretation. A recognition system splits the background and foreground to integrate exchangeable image-based object detectors and SLAM, a multi-layer knowledge base represents scene information in a hierarchical structure and offers interfaces for high-level control, and knowledge interpretation methods deploy spatio-temporal scene analysis and perceptual learning for self-adjustment. A single-setting ablation study is used to evaluate the impact of each component on the overall performance for a fetch-and-carry scenario in two simulated and one real-world environment. △ Less

Submitted 27 April, 2024; originally announced April 2024.

Report number: IEEE T-RO 24-0146

Journal ref: 2024 IEEE Transactions on Robotics (T-RO)

arXiv:2312.06292 [pdf, other]

HoLLiE C -- A Multifunctional Bimanual Mobile Robot Supporting Versatile Care Applications

Authors: Lea Steffen, Martin Schulze, Christian Eichmann, Robin Koch, Andreas Hermann, Rosa Frietsch Mussulin, Friedrich Graaf, Robert Wilbrandt, Marvin Große Besselmann, Arne Roennau, Rüdiger Dillmann

Abstract: Care robotics as a research field has developed a lot in recent years, driven by the rapidly increasing need for it. However, these technologies are mostly limited to a very concrete and usually relatively simple use case. The bimanual robot House of Living Labs intelligent Escort (HoLLiE) includes an omnidirectional mobile platform. This paper presents how HoLLiE is adapted, by flexible software… ▽ More Care robotics as a research field has developed a lot in recent years, driven by the rapidly increasing need for it. However, these technologies are mostly limited to a very concrete and usually relatively simple use case. The bimanual robot House of Living Labs intelligent Escort (HoLLiE) includes an omnidirectional mobile platform. This paper presents how HoLLiE is adapted, by flexible software and hardware modules, for different care applications. The design goal of HoLLiE was to be human-like but abstract enough to ensure a high level of acceptance, which is very advantageous for its use in hospitals. After a short retrospect of previous generations of HoLLiE, it is highlighted how the current version is equipped with a variety of additional sensors and actuators to allow a wide range of possible applications. Then, the software stack of HoLLiE is depicted, with the focus on navigation and force sensitive intention recognition. △ Less

Submitted 11 December, 2023; originally announced December 2023.

Comments: 18th international conference on Intelligent Autonomous Systems (IAS18 - 2023)

arXiv:2306.16248 [pdf, other]

Latent SDEs on Homogeneous Spaces

Authors: Sebastian Zeng, Florian Graf, Roland Kwitt

Abstract: We consider the problem of variational Bayesian inference in a latent variable model where a (possibly complex) observed stochastic process is governed by the solution of a latent stochastic differential equation (SDE). Motivated by the challenges that arise when trying to learn an (almost arbitrary) latent neural SDE from data, such as efficient gradient computation, we take a step back and study… ▽ More We consider the problem of variational Bayesian inference in a latent variable model where a (possibly complex) observed stochastic process is governed by the solution of a latent stochastic differential equation (SDE). Motivated by the challenges that arise when trying to learn an (almost arbitrary) latent neural SDE from data, such as efficient gradient computation, we take a step back and study a specific subclass instead. In our case, the SDE evolves on a homogeneous latent space and is induced by stochastic dynamics of the corresponding (matrix) Lie group. In learning problems, SDEs on the unit n-sphere are arguably the most relevant incarnation of this setup. Notably, for variational inference, the sphere not only facilitates using a truly uninformative prior, but we also obtain a particularly simple and intuitive expression for the Kullback-Leibler divergence between the approximate posterior and prior process in the evidence lower bound. Experiments demonstrate that a latent SDE of the proposed type can be learned efficiently by means of an existing one-step geometric Euler-Maruyama scheme. Despite restricting ourselves to a less rich class of SDEs, we achieve competitive or even state-of-the-art results on various time series interpolation/classification problems. △ Less

Submitted 21 February, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

Comments: v3: updated experiments with results using the public source code (commit bc6edd1)

Journal ref: NeurIPS 2023

arXiv:2211.01441 [pdf, other]

eXplainable AI for Quantum Machine Learning

Authors: Patrick Steinmüller, Tobias Schulz, Ferdinand Graf, Daniel Herr

Abstract: Parametrized Quantum Circuits (PQCs) enable a novel method for machine learning (ML). However, from a computational point of view they present a challenge to existing eXplainable AI (xAI) methods. On the one hand, measurements on quantum circuits introduce probabilistic errors which impact the convergence of these methods. On the other hand, the phase space of a quantum circuit expands exponential… ▽ More Parametrized Quantum Circuits (PQCs) enable a novel method for machine learning (ML). However, from a computational point of view they present a challenge to existing eXplainable AI (xAI) methods. On the one hand, measurements on quantum circuits introduce probabilistic errors which impact the convergence of these methods. On the other hand, the phase space of a quantum circuit expands exponentially with the number of qubits, complicating efforts to execute xAI methods in polynomial time. In this paper we will discuss the performance of established xAI methods, such as Baseline SHAP and Integrated Gradients. Using the internal mechanics of PQCs we study ways to speed up their computation. △ Less

Submitted 2 November, 2022; originally announced November 2022.

arXiv:2202.08070 [pdf, other]

On Measuring Excess Capacity in Neural Networks

Authors: Florian Graf, Sebastian Zeng, Bastian Rieck, Marc Niethammer, Roland Kwitt

Abstract: We study the excess capacity of deep networks in the context of supervised classification. That is, given a capacity measure of the underlying hypothesis class - in our case, empirical Rademacher complexity - to what extent can we (a priori) constrain this class while retaining an empirical error on a par with the unconstrained regime? To assess excess capacity in modern architectures (such as res… ▽ More We study the excess capacity of deep networks in the context of supervised classification. That is, given a capacity measure of the underlying hypothesis class - in our case, empirical Rademacher complexity - to what extent can we (a priori) constrain this class while retaining an empirical error on a par with the unconstrained regime? To assess excess capacity in modern architectures (such as residual networks), we extend and unify prior Rademacher complexity bounds to accommodate function composition and addition, as well as the structure of convolutions. The capacity-driving terms in our bounds are the Lipschitz constants of the layers and an (2, 1) group norm distance to the initializations of the convolution weights. Experiments on benchmark datasets of varying task difficulty indicate that (1) there is a substantial amount of excess capacity per task, and (2) capacity can be kept at a surprisingly similar level across tasks. Overall, this suggests a notion of compressibility with respect to weight norms, complementary to classic compression via weight pruning. Source code is available at https://github.com/rkwitt/excess_capacity. △ Less

Submitted 19 January, 2023; v1 submitted 16 February, 2022; originally announced February 2022.

Comments: Updated to Neurips 2022 camera-ready version

arXiv:2107.09031 [pdf, other]

Topological Attention for Time Series Forecasting

Authors: Sebastian Zeng, Florian Graf, Christoph Hofer, Roland Kwitt

Abstract: The problem of (point) forecasting $ \textit{univariate} $ time series is considered. Most approaches, ranging from traditional statistical methods to recent learning-based techniques with neural networks, directly operate on raw time series observations. As an extension, we study whether $\textit{local topological properties}$, as captured via persistent homology, can serve as a reliable signal t… ▽ More The problem of (point) forecasting $ \textit{univariate} $ time series is considered. Most approaches, ranging from traditional statistical methods to recent learning-based techniques with neural networks, directly operate on raw time series observations. As an extension, we study whether $\textit{local topological properties}$, as captured via persistent homology, can serve as a reliable signal that provides complementary information for learning to forecast. To this end, we propose $\textit{topological attention}$, which allows attending to local topological features within a time horizon of historical data. Our approach easily integrates into existing end-to-end trainable forecasting models, such as $\texttt{N-BEATS}$, and in combination with the latter exhibits state-of-the-art performance on the large-scale M4 benchmark dataset of 100,000 diverse time series from different domains. Ablation experiments, as well as a comparison to a broad range of forecasting methods in a setting where only a single time series is available for training, corroborate the beneficial nature of including local topological information through an attention mechanism. △ Less

Submitted 19 July, 2021; originally announced July 2021.

arXiv:2102.08817 [pdf, other]

Dissecting Supervised Contrastive Learning

Authors: Florian Graf, Christoph D. Hofer, Marc Niethammer, Roland Kwitt

Abstract: Minimizing cross-entropy over the softmax scores of a linear map composed with a high-capacity encoder is arguably the most popular choice for training neural networks on supervised learning tasks. However, recent works show that one can directly optimize the encoder instead, to obtain equally (or even more) discriminative representations via a supervised variant of a contrastive objective. In thi… ▽ More Minimizing cross-entropy over the softmax scores of a linear map composed with a high-capacity encoder is arguably the most popular choice for training neural networks on supervised learning tasks. However, recent works show that one can directly optimize the encoder instead, to obtain equally (or even more) discriminative representations via a supervised variant of a contrastive objective. In this work, we address the question whether there are fundamental differences in the sought-for representation geometry in the output space of the encoder at minimal loss. Specifically, we prove, under mild assumptions, that both losses attain their minimum once the representations of each class collapse to the vertices of a regular simplex, inscribed in a hypersphere. We provide empirical evidence that this configuration is attained in practice and that reaching a close-to-optimal state typically indicates good generalization performance. Yet, the two losses show remarkably different optimization behavior. The number of iterations required to perfectly fit to data scales superlinearly with the amount of randomly flipped labels for the supervised contrastive loss. This is in contrast to the approximately linear scaling previously reported for networks trained with cross-entropy. △ Less

Submitted 2 March, 2023; v1 submitted 17 February, 2021; originally announced February 2021.

Comments: v4 updates: - updated appendix section S1.3 - this includes fixing an oversight in the proofs (Lemma 1 missed an equality condition, which now appears in Lemma 2) - improved figure quality

Journal ref: Proceedings of the 38th International Conference on Machine Learning, PMLR 139:3821-3830, 2021

arXiv:2002.04805 [pdf, other]

Topologically Densified Distributions

Authors: Christoph D. Hofer, Florian Graf, Marc Niethammer, Roland Kwitt

Abstract: We study regularization in the context of small sample-size learning with over-parameterized neural networks. Specifically, we shift focus from architectural properties, such as norms on the network weights, to properties of the internal representations before a linear classifier. Specifically, we impose a topological constraint on samples drawn from the probability measure induced in that space.… ▽ More We study regularization in the context of small sample-size learning with over-parameterized neural networks. Specifically, we shift focus from architectural properties, such as norms on the network weights, to properties of the internal representations before a linear classifier. Specifically, we impose a topological constraint on samples drawn from the probability measure induced in that space. This provably leads to mass concentration effects around the representations of training instances, i.e., a property beneficial for generalization. By leveraging previous work to impose topological constraints in a neural network setting, we provide empirical evidence (across various vision benchmarks) to support our claim for better generalization. △ Less

Submitted 17 May, 2021; v1 submitted 12 February, 2020; originally announced February 2020.

arXiv:1905.10996 [pdf, other]

Graph Filtration Learning

Authors: Christoph D. Hofer, Florian Graf, Bastian Rieck, Marc Niethammer, Roland Kwitt

Abstract: We propose an approach to learning with graph-structured data in the problem domain of graph classification. In particular, we present a novel type of readout operation to aggregate node features into a graph-level representation. To this end, we leverage persistent homology computed via a real-valued, learnable, filter function. We establish the theoretical foundation for differentiating through… ▽ More We propose an approach to learning with graph-structured data in the problem domain of graph classification. In particular, we present a novel type of readout operation to aggregate node features into a graph-level representation. To this end, we leverage persistent homology computed via a real-valued, learnable, filter function. We establish the theoretical foundation for differentiating through the persistent homology computation. Empirically, we show that this type of readout operation compares favorably to previous techniques, especially when the graph connectivity structure is informative for the learning problem. △ Less

Submitted 17 May, 2021; v1 submitted 27 May, 2019; originally announced May 2019.

arXiv:1905.01065 [pdf, other]

doi 10.1109/RO-MAN46459.2019.8956405

MobiKa - Low-Cost Mobile Robot for Human-Robot Interaction

Authors: Florenz Graf, Çağatay Odabaşı, Theo Jacobs, Birgit Graf, Thomas Födisch

Abstract: One way to allow elderly people to stay longer in their homes is to use of service robots to support them with everyday tasks. With this goal, we design, develop and evaluate a low-cost mobile robot to communicate with elderly people. The main idea is to create an affordable communication assistant robot which is optimized for multimodal Human-Robot Interaction (HRI). Our robot can navigate autono… ▽ More One way to allow elderly people to stay longer in their homes is to use of service robots to support them with everyday tasks. With this goal, we design, develop and evaluate a low-cost mobile robot to communicate with elderly people. The main idea is to create an affordable communication assistant robot which is optimized for multimodal Human-Robot Interaction (HRI). Our robot can navigate autonomously through dynamic environments using a new algorithm to calculate poses for approaching persons. The robot was tested in a real life scenario in an elderly care home. △ Less

Submitted 3 May, 2019; originally announced May 2019.

Journal ref: IEEE RO-MAN 2019: Responsible Robotics and AI for the Real World. 28th IEEE International Conference on Robot and Human Interactive Communication. October 14-18, 2019, New Delhi, India. Piscataway, NJ, USA : IEEE Press, 2019, 6 S

arXiv:1810.08385 [pdf, other]

doi 10.1021/acsphotonics.8b01454

Achiral, Helicity Preserving, and Resonant Structures for Enhanced Sensing of Chiral Molecules

Authors: Florian Graf, Joshua Feis, Xavier Garcia-Santiago, Martin Wegener, Carsten Rockstuhl, Ivan Fernandez-Corbaton

Abstract: We derive a set of design requirements that lead to structures suitable for molecular circular dichroism (CD) enhancement. Achirality of the structure and two suitably selected sequentially incident beams of opposite helicity ensures that the CD signal only depends on the chiral absorption properties of the molecules, and not on the achiral ones. Under this condition, a helicity preserving structu… ▽ More We derive a set of design requirements that lead to structures suitable for molecular circular dichroism (CD) enhancement. Achirality of the structure and two suitably selected sequentially incident beams of opposite helicity ensures that the CD signal only depends on the chiral absorption properties of the molecules, and not on the achiral ones. Under this condition, a helicity preserving structure, which prevents the coupling of the two polarization handednesses, maximizes the enhancement of the CD signal for a given ability of the structure to enhance the field. When the achirality and helicity preservation requirements are met, the enhancement of the CD signal is directly related to the enhancement of the field. Next, we design an exemplary structure following the requirements. The considered system is a planar array of silicon cylinders under normally incident plane-wave illumination. Full-wave numerical calculations show that the enhancement of the transmission CD signal is between 6.5 and 3.75 for interaction lengths between 1.25 and 3 times the height of the cylinders. △ Less

Submitted 24 February, 2020; v1 submitted 19 October, 2018; originally announced October 2018.

Comments: This document is the unedited Authors version of a Submitted Work that was subsequently accepted for publication in ACS Photonics, copyright American Chemical Society after peer review. To access the final edited and published work see 10.1021/acsphotonics.8b01454. The corrections published in 10.1021/acsphotonics.0c00113 are included in this arxiv document

Journal ref: ACS Photonics 2019, 6, 2, 482-491 Publication Date:January 28, 2019

arXiv:1302.3900 [pdf, other]

doi 10.1109/TIP.2005.846030

Robust Image Segmentation in Low Depth Of Field Images

Authors: Franz Graf, Hans-Peter Kriegel, Michael Weiler

Abstract: In photography, low depth of field (DOF) is an important technique to emphasize the object of interest (OOI) within an image. Thus, low DOF images are widely used in the application area of macro, portrait or sports photography. When viewing a low DOF image, the viewer implicitly concentrates on the regions that are sharper regions of the image and thus segments the image into regions of interest… ▽ More In photography, low depth of field (DOF) is an important technique to emphasize the object of interest (OOI) within an image. Thus, low DOF images are widely used in the application area of macro, portrait or sports photography. When viewing a low DOF image, the viewer implicitly concentrates on the regions that are sharper regions of the image and thus segments the image into regions of interest and non regions of interest which has a major impact on the perception of the image. Thus, a robust algorithm for the fully automatic detection of the OOI in low DOF images provides valuable information for subsequent image processing and image retrieval. In this paper we propose a robust and parameterless algorithm for the fully automatic segmentation of low DOF images. We compare our method with three similar methods and show the superior robustness even though our algorithm does not require any parameters to be set by hand. The experiments are conducted on a real world data set with high and low DOF images. △ Less

Submitted 15 February, 2013; originally announced February 2013.

Comments: Extended Version of the short paper published in "Robust Image Segmentation in Low Depth Of Field Images", IEEE International Conference on Image Processing 2011 (ICIP). The paper contains a lot more details about the algorithm and more evaluation

Journal ref: Extended Version of "Robust Image Segmentation in Low Depth Of Field Images", IEEE International Conference on Image Processing 2011 (ICIP)

arXiv:1105.0830 [pdf, other]

Maximum Gain Round Trips with Cost Constraints

Authors: Franz Graf, Hans-Peter Kriegel, Matthias Schubert

Abstract: Searching for optimal ways in a network is an important task in multiple application areas such as social networks, co-citation graphs or road networks. In the majority of applications, each edge in a network is associated with a certain cost and an optimal way minimizes the cost while fulfilling a certain property, e.g connecting a start and a destination node. In this paper, we want to extend pu… ▽ More Searching for optimal ways in a network is an important task in multiple application areas such as social networks, co-citation graphs or road networks. In the majority of applications, each edge in a network is associated with a certain cost and an optimal way minimizes the cost while fulfilling a certain property, e.g connecting a start and a destination node. In this paper, we want to extend pure cost networks to so-called cost-gain networks. In this type of network, each edge is additionally associated with a certain gain. Thus, a way having a certain cost additionally provides a certain gain. In the following, we will discuss the problem of finding ways providing maximal gain while costing less than a certain budget. An application for this type of problem is the round trip problem of a traveler: Given a certain amount of time, which is the best round trip traversing the most scenic landscape or visiting the most important sights? In the following, we distinguish two cases of the problem. The first does not control any redundant edges and the second allows a more sophisticated handling of edges occurring more than once. To answer the maximum round trip queries on a given graph data set, we propose unidirectional and bidirectional search algorithms. Both types of algorithms are tested for the use case named above on real world spatial networks. △ Less

Submitted 5 May, 2011; v1 submitted 4 May, 2011; originally announced May 2011.

Showing 1–14 of 14 results for author: Graf, F