Showing 1–29 of 29 results for author: Strodthoff, N

  1. arXiv:2405.15871

    cs.LG cs.AI stat.ML

    CausalConceptTS: Causal Attributions for Time Series Classification using High Fidelity Diffusion Models

    Authors: Juan Miguel Lopez Alcaraz, Nils Strodthoff

    Abstract: Despite the excelling performance of machine learning models, understanding the decisions of machine learning models remains a long-standing goal. While commonly used attribution methods in explainable AI attempt to address this issue, they typically rely on associational rather than causal relationships. In this study, within the context of time series classification, we introduce a novel framewo…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 17 pages, 8 figures. Source code under https://github.com/AI4HealthUOL/CausalConceptTS

  2. arXiv:2402.17779

    eess.SP cs.LG

    Assessing the importance of long-range correlations for deep-learning-based sleep staging

    Authors: Tiezhi Wang, Nils Strodthoff

    Abstract: This study aims to elucidate the significance of long-range correlations for deep-learning-based sleep staging. It is centered around S4Sleep(TS), a recently proposed model for automated sleep staging. This model utilizes electroencephalography (EEG) as raw time series input and relies on structured state space sequence (S4) models as essential model component. Although the model already surpasses…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 3 pages, 1 figure, Accepted at Workshop Biosignals, 28.2.-1.3.2024, Göttingen, Germany

  3. arXiv:2401.06654

    cs.CV cs.AI cs.LG

    Decoupling Pixel Flipping and Occlusion Strategy for Consistent XAI Benchmarks

    Authors: Stefan Blücher, Johanna Vielhaben, Nils Strodthoff

    Abstract: Feature removal is a central building block for eXplainable AI (XAI), both for occlusion-based explanations (Shapley values) as well as their evaluation (pixel flipping, PF). However, occlusion strategies can vary significantly from simple mean replacement up to inpainting with state-of-the-art diffusion models. This ambiguity limits the usefulness of occlusion-based approaches. For example, PF be…

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: 28 pages, 8 figures

  4. arXiv:2312.11050

    eess.SP cs.LG stat.ML

    Prospects for AI-Enhanced ECG as a Unified Screening Tool for Cardiac and Non-Cardiac Conditions -- An Explorative Study in Emergency Care

    Authors: Nils Strodthoff, Juan Miguel Lopez Alcaraz, Wilhelm Haverkamp

    Abstract: Current deep learning algorithms designed for automatic ECG analysis have exhibited notable accuracy. However, akin to traditional electrocardiography, they tend to be narrowly focused and typically address a singular diagnostic condition. In this exploratory study, we specifically investigate the capability of a single model to predict a diverse range of both cardiac and non-cardiac discharge dia…

    Submitted 13 May, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: Accepted version EHJDH. 30 pages, 6 figures, code available under https://github.com/AI4HealthUOL/ECG-MIMIC

  5. arXiv:2310.07463

    eess.SP cs.LG stat.ML

    Using explainable AI to investigate electrocardiogram changes during healthy aging -- from expert features to raw signals

    Authors: Gabriel Ott, Yannik Schaubelt, Juan Miguel Lopez Alcaraz, Wilhelm Haverkamp, Nils Strodthoff

    Abstract: Cardiovascular diseases remain the leading global cause of mortality. Age is an important covariate whose effect is most easily investigated in a healthy cohort to properly distinguish the former from disease-related changes. Traditionally, most of such insights have been drawn from the analysis of electrocardiogram (ECG) feature changes in individuals as they age. However, these features, while i…

    Submitted 22 April, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: Accepted version by PLOS ONE. 10 pages, 5 figures, code available under https://github.com/AI4HealthUOL/ECG-aging. Publication under https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0302024

  6. arXiv:2310.06715

    cs.LG eess.SP stat.ML

    S4Sleep: Elucidating the design space of deep-learning-based sleep stage classification models

    Authors: Tiezhi Wang, Nils Strodthoff

    Abstract: Scoring sleep stages in polysomnography recordings is a time-consuming task plagued by significant inter-rater variability. Therefore, it stands to benefit from the application of machine learning algorithms. While many algorithms have been proposed for this purpose, certain critical architectural decisions have not received systematic exploration. In this study, we meticulously investigate these…

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 11 pages, 1 figure, code available at https://github.com/AI4HealthUOL/s4sleep

  7. Insights Into the Inner Workings of Transformer Models for Protein Function Prediction

    Authors: Markus Wenzel, Erik Grüner, Nils Strodthoff

    Abstract: Motivation: We explored how explainable artificial intelligence (XAI) can help to shed light into the inner workings of neural networks for protein function prediction, by extending the widely used XAI method of integrated gradients such that latent representations inside of transformer models, which were finetuned to Gene Ontology term and Enzyme Commission number prediction, can be inspected too…

    Submitted 9 February, 2024; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: 26 pages, 12 figures, 5 tables, source code available at https://github.com/markuswenzel/xai-proteins

    Journal ref: Bioinformatics (2024) btae031

  8. arXiv:2308.15291

    eess.SP cs.LG stat.ML

    Towards quantitative precision for ECG analysis: Leveraging state space models, self-supervision and patient metadata

    Authors: Temesgen Mehari, Nils Strodthoff

    Abstract: Deep learning has emerged as the preferred modeling approach for automatic ECG analysis. In this study, we investigate three elements aimed at improving the quantitative accuracy of such systems. These components consistently enhance performance beyond the existing state-of-the-art, which is predominantly based on convolutional models. Firstly, we explore more expressive architectures by exploitin…

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: extended version of arXiv:2211.07579

  9. Explaining Deep Learning for ECG Analysis: Building Blocks for Auditing and Knowledge Discovery

    Authors: Patrick Wagner, Temesgen Mehari, Wilhelm Haverkamp, Nils Strodthoff

    Abstract: Deep neural networks have become increasingly popular for analyzing ECG data because of their ability to accurately identify cardiac conditions and hidden clinical factors. However, the lack of transparency due to the black box nature of these models is a common concern. To address this issue, explainable AI (XAI) methods can be employed. In this study, we present a comprehensive analysis of post-…

    Submitted 2 July, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

    Journal ref: Computers in Biology and Medicine, Vol. 176, June 2024, 108525

  10. arXiv:2304.02577

    physics.med-ph cs.LG eess.SP

    ECG Feature Importance Rankings: Cardiologists vs. Algorithms

    Authors: Temesgen Mehari, Ashish Sundar, Alen Bosnjakovic, Peter Harris, Steven E. Williams, Axel Loewe, Olaf Doessel, Claudia Nagel, Nils Strodthoff, Philip J. Aston

    Abstract: Feature importance methods promise to provide a ranking of features according to importance for a given classification task. A wide range of methods exist but their rankings often disagree and they are inherently difficult to evaluate due to a lack of ground truth beyond synthetic datasets. In this work, we put feature importance methods to the test on real-world data in the domain of cardiology,…

    Submitted 5 April, 2023; originally announced April 2023.

  11. arXiv:2301.11911

    cs.LG cs.AI stat.ML

    Multi-dimensional concept discovery (MCD): A unifying framework with completeness guarantees

    Authors: Johanna Vielhaben, Stefan Blücher, Nils Strodthoff

    Abstract: The completeness axiom renders the explanation of a post-hoc XAI method only locally faithful to the model, i.e. for a single decision. For the trustworthy application of XAI, in particular for high-stake decisions, a more global model understanding is required. Recently, concept-based methods have been proposed, which are however not guaranteed to be bound to the actual model reasoning. To circum…

    Submitted 18 June, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: v2: Version published by Transactions on Machine Learning Research in 2023 (TMLR ISSN 2835-8856) https://openreview.net/forum?id=KxBQPz7HKh. 25 pages, 11 figures. This work builds on an earlier manuscript (arXiv:2203.06043) and crucially extends it. Code is available at https://github.com/jvielhaben/MCD-XAI

    Journal ref: Version published by Transactions on Machine Learning Research in 2023 (TMLR ISSN 2835-8856) https://openreview.net/forum?id=KxBQPz7HKh

  12. arXiv:2301.08227

    eess.SP cs.LG stat.ML

    Diffusion-based Conditional ECG Generation with Structured State Space Models

    Authors: Juan Miguel Lopez Alcaraz, Nils Strodthoff

    Abstract: Synthetic data generation is a promising solution to address privacy issues with the distribution of sensitive health data. Recently, diffusion models have set new standards for generative models for different data modalities. Also very recently, structured state space models emerged as a powerful modeling paradigm to capture long-term dependencies in time series. We put forward SSSD-ECG, as the c…

    Submitted 15 June, 2023; v1 submitted 19 January, 2023; originally announced January 2023.

    Comments: 12 pages, 9 figures. Accepted version by Computers in Biology and Medicine in 2023 under https://doi.org/10.1016/j.compbiomed.2023.107115. Source code under https://github.com/AI4HealthUOL/SSSD-ECG

    Journal ref: Computers in Biology and Medicine 163 (2023) 107115

  13. arXiv:2211.07579

    cs.LG eess.SP stat.ML

    Advancing the State-of-the-Art for ECG Analysis through Structured State Space Models

    Authors: Temesgen Mehari, Nils Strodthoff

    Abstract: The field of deep-learning-based ECG analysis has been largely dominated by convolutional architectures. This work explores the prospects of applying the recently introduced structured state space models (SSMs) as a particularly promising approach due to its ability to capture long-term dependencies in time series. We demonstrate that this approach leads to significant improvements over the curren…

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2022, November 28th, 2022, New Orleans, United States & Virtual, http://www.ml4h.cc, 6+5 pages

  14. arXiv:2208.09399

    cs.LG stat.ML

    Diffusion-based Time Series Imputation and Forecasting with Structured State Space Models

    Authors: Juan Miguel Lopez Alcaraz, Nils Strodthoff

    Abstract: The imputation of missing values represents a significant obstacle for many real-world data analysis pipelines. Here, we focus on time series data and put forward SSSD, an imputation model that relies on two emerging technologies, (conditional) diffusion models as state-of-the-art generative models and structured state space models as internal model architecture, which are particularly suited to c…

    Submitted 6 May, 2023; v1 submitted 19 August, 2022; originally announced August 2022.

    Comments: V3: Updated results for the solar dataset. 36 pages, 13 figures. Version published by Transactions on Machine Learning Research in 2022 (TMLR ISSN 2835-8856) https://openreview.net/forum?id=hHiIbk7ApW. Source code under https://github.com/AI4HealthUOL/SSSD

    Journal ref: Version published by Transactions on Machine Learning Research in 2022 (TMLR ISSN 2835-8856) https://openreview.net/forum?id=hHiIbk7ApW

  15. arXiv:2204.05044

    eess.IV cs.LG stat.ML

    From Modern CNNs to Vision Transformers: Assessing the Performance, Robustness, and Classification Strategies of Deep Learning Models in Histopathology

    Authors: Maximilian Springenberg, Annika Frommholz, Markus Wenzel, Eva Weicken, Jackie Ma, Nils Strodthoff

    Abstract: While machine learning is currently transforming the field of histopathology, the domain lacks a comprehensive evaluation of state-of-the-art models based on essential but complementary quality requirements beyond a mere classification accuracy. In order to fill this gap, we developed a new methodology to extensively evaluate a wide range of classification models, including recent vision transform…

    Submitted 9 May, 2023; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: 14 pages, 6 figures, v2: version accepted by Medical Image Analysis, code available under https://github.com/hhi-aml/histobenchmark

  16. arXiv:2203.06043

    cs.LG cs.AI stat.ML

    Sparse Subspace Clustering for Concept Discovery (SSCCD)

    Authors: Johanna Vielhaben, Stefan Blücher, Nils Strodthoff

    Abstract: Concepts are key building blocks of higher level human understanding. Explainable AI (XAI) methods have shown tremendous progress in recent years, however, local attribution methods do not allow to identify coherent model behavior across samples and therefore miss this essential component. In this work, we study concept-based explanations and put forward a new definition of concepts as low-dimensi…

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: 24 pages, 24 figures, code will be made publicly available

  17. arXiv:2106.13497

    cs.CV

    On the Robustness of Pretraining and Self-Supervision for a Deep Learning-based Analysis of Diabetic Retinopathy

    Authors: Vignesh Srinivasan, Nils Strodthoff, Jackie Ma, Alexander Binder, Klaus-Robert Müller, Wojciech Samek

    Abstract: There is an increasing number of medical use-cases where classification algorithms based on deep neural networks reach performance levels that are competitive with human medical experts. To alleviate the challenges of small dataset sizes, these systems often rely on pretraining. In this work, we aim to assess the broader implications of these approaches. For diabetic retinopathy grading as exempla…

    Submitted 25 June, 2021; originally announced June 2021.

  18. arXiv:2104.08237

    q-bio.QM cs.LG

    Predicting the Binding of SARS-CoV-2 Peptides to the Major Histocompatibility Complex with Recurrent Neural Networks

    Authors: Johanna Vielhaben, Markus Wenzel, Eva Weicken, Nils Strodthoff

    Abstract: Predicting the binding of viral peptides to the major histocompatibility complex with machine learning can potentially extend the computational immunology toolkit for vaccine development, and serve as a key component in the fight against a pandemic. In this work, we adapt and extend USMPep, a recently proposed, conceptually simple prediction algorithm based on recurrent neural networks. Most notab…

    Submitted 16 April, 2021; originally announced April 2021.

    Comments: Accepted at ICLR 2021 Workshop: Machine Learning for Preventing and Combating Pandemics; code available at https://github.com/nstrodt/USMPep

  19. Self-supervised representation learning from 12-lead ECG data

    Authors: Temesgen Mehari, Nils Strodthoff

    Abstract: Clinical 12-lead electrocardiography (ECG) is one of the most widely encountered kinds of biosignals. Despite the increased availability of public ECG datasets, label scarcity remains a central challenge in the field. Self-supervised learning represents a promising way to alleviate this issue. In this work, we put forward the first comprehensive assessment of self-supervised representation learnin…

    Submitted 4 January, 2022; v1 submitted 23 March, 2021; originally announced March 2021.

    Comments: 15 pages, 12 figures, matches published version, code available under https://github.com/hhi-aml/ecg-selfsupervised

    Journal ref: Comput. Biol. Med. 141 (2022) 105114

  20. PredDiff: Explanations and Interactions from Conditional Expectations

    Authors: Stefan Blücher, Johanna Vielhaben, Nils Strodthoff

    Abstract: PredDiff is a model-agnostic, local attribution method that is firmly rooted in probability theory. Its simple intuition is to measure prediction changes while marginalizing features. In this work, we clarify properties of PredDiff and its close connection to Shapley values. We stress important differences between classification and regression, which require a specific treatment within both formal…

    Submitted 8 September, 2022; v1 submitted 26 February, 2021; originally announced February 2021.

    Comments: 35 pages, 20 Figures, accepted journal version, code available at https://github.com/AI4HealthUOL/preddiff-interactions

    Journal ref: Artificial Intelligence 312 (2022) 103774

  21. arXiv:2012.10264

    cond-mat.stat-mech cs.LG stat.ML

    Generative Neural Samplers for the Quantum Heisenberg Chain

    Authors: Johanna Vielhaben, Nils Strodthoff

    Abstract: Generative neural samplers offer a complementary approach to Monte Carlo methods for problems in statistical physics and quantum field theory. This work tests the ability of generative neural samplers to estimate observables for real-world low-dimensional spin systems. It maps out how autoregressive models can sample configurations of a quantum Heisenberg chain via a classical approximation based…

    Submitted 18 December, 2020; originally announced December 2020.

    Comments: 10 figures

    Journal ref: Phys. Rev. E 103, 063304 (2021)

  22. arXiv:2010.09622

    eess.IV cs.LG eess.SP

    Inferring respiratory and circulatory parameters from electrical impedance tomography with deep recurrent models

    Authors: Nils Strodthoff, Claas Strodthoff, Tobias Becher, Norbert Weiler, Inéz Frerichs

    Abstract: Electrical impedance tomography (EIT) is a noninvasive imaging modality that allows a continuous assessment of changes in regional bioimpedance of different organs. One of its most common biomedical applications is monitoring regional ventilation distribution in critically ill patients treated in intensive care units. In this work, we put forward a proof-of-principle study that demonstrates how on…

    Submitted 19 October, 2020; originally announced October 2020.

    Comments: 6 pages, 3 figures

  23. arXiv:2004.13701

    cs.LG stat.ML

    Deep Learning for ECG Analysis: Benchmarks and Insights from PTB-XL

    Authors: Nils Strodthoff, Patrick Wagner, Tobias Schaeffter, Wojciech Samek

    Abstract: Electrocardiography is a very common, non-invasive diagnostic procedure and its interpretation is increasingly supported by automatic interpretation algorithms. The progress in the field of automatic ECG interpretation has up to now been hampered by a lack of appropriate datasets for training as well as a lack of well-defined evaluation procedures to ensure comparability of different algorithms. T…

    Submitted 28 April, 2020; originally announced April 2020.

    Comments: 12 pages, 8 figures

  24. arXiv:2003.01504

    hep-lat cs.LG physics.comp-ph

    Towards Novel Insights in Lattice Field Theory with Explainable Machine Learning

    Authors: Stefan Bluecher, Lukas Kades, Jan M. Pawlowski, Nils Strodthoff, Julian M. Urban

    Abstract: Machine learning has the potential to aid our understanding of phase structures in lattice quantum field theories through the statistical analysis of Monte Carlo samples. Available algorithms, in particular those based on deep learning, often demonstrate remarkable performance in the search for previously unidentified features, but tend to lack transparency if applied naively. To address these sho…

    Submitted 18 May, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: 13 pages, 11 figures

    Journal ref: Phys. Rev. D 101, 094507 (2020)

  25. arXiv:1910.13496

    cond-mat.stat-mech cs.LG stat.ML

    Asymptotically unbiased estimation of physical observables with neural samplers

    Authors: Kim A. Nicoli, Shinichi Nakajima, Nils Strodthoff, Wojciech Samek, Klaus-Robert Müller, Pan Kessel

    Abstract: We propose a general framework for the estimation of observables with generative neural samplers focusing on modern deep generative neural networks that provide an exact sampling probability. In this framework, we present asymptotically unbiased estimators for generic observables, including those that explicitly depend on the partition function such as free energy or entropy, and derive correspond…

    Submitted 13 February, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: 5 figures

    Journal ref: Phys. Rev. E 101, 023304 (2020)

  26. Achieving Generalizable Robustness of Deep Neural Networks by Stability Training

    Authors: Jan Laermann, Wojciech Samek, Nils Strodthoff

    Abstract: We study the recently introduced stability training as a general-purpose method to increase the robustness of deep neural networks against input perturbations. In particular, we explore its use as an alternative to data augmentation and validate its performance against a number of distortion types and transformations including adversarial examples. In our image classification experiments using Ima…

    Submitted 12 November, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: 18 pages, 25 figures; Camera-ready version

    Journal ref: DAGM GCPR 2019. Lecture Notes in Computer Science, vol. 11824, 360-373, 2019

  27. arXiv:1903.11048

    cond-mat.stat-mech cs.LG stat.ML

    Comment on "Solving Statistical Mechanics Using VANs": Introducing saVANt - VANs Enhanced by Importance and MCMC Sampling

    Authors: Kim Nicoli, Pan Kessel, Nils Strodthoff, Wojciech Samek, Klaus-Robert Müller, Shinichi Nakajima

    Abstract: In this comment on "Solving Statistical Mechanics Using Variational Autoregressive Networks" by Wu et al., we propose a subtle yet powerful modification of their approach. We show that the inherent sampling error of their method can be corrected by using neural network-based MCMC or importance sampling which leads to asymptotically unbiased estimators for physical quantities. This modification is…

    Submitted 26 March, 2019; originally announced March 2019.

    Comments: 6 pages, 4 figures

  28. Enhanced Machine Learning Techniques for Early HARQ Feedback Prediction in 5G

    Authors: Nils Strodthoff, Barış Göktepe, Thomas Schierl, Cornelius Hellge, Wojciech Samek

    Abstract: We investigate Early Hybrid Automatic Repeat reQuest (E-HARQ) feedback schemes enhanced by machine learning techniques as a path towards ultra-reliable and low-latency communication (URLLC). To this end, we propose machine learning methods to predict the outcome of the decoding process ahead of the end of the transmission. We discuss different input features and classification algorithms ranging f…

    Submitted 25 October, 2019; v1 submitted 27 July, 2018; originally announced July 2018.

    Comments: 14 pages, 15 figures; accepted version

    Journal ref: IEEE JSAC 37 (2019), no. 11, 2573-2587

  29. arXiv:1806.07385

    cs.CY cs.LG stat.ML

    Detecting and interpreting myocardial infarction using fully convolutional neural networks

    Authors: Nils Strodthoff, Claas Strodthoff

    Abstract: Objective: We aim to provide an algorithm for the detection of myocardial infarction that operates directly on ECG data without any preprocessing and to investigate its decision criteria. Approach: We train an ensemble of fully convolutional neural networks on the PTB ECG dataset and apply state-of-the-art attribution methods. Main results: Our classifier reaches 93.3% sensitivity and 89.7% specif…

    Submitted 5 February, 2019; v1 submitted 18 June, 2018; originally announced June 2018.

    Comments: 11 pages, 4 figures

    Journal ref: Physiological Measurement, vol. 40, no. 1, p. 015001, 2019