Showing 1–50 of 116 results for author: Samek, W

Searching in archive cs.
  1. arXiv:2405.13462  [pdf, other]

    cs.AI cs.CY

    Blockchain and Artificial Intelligence: Synergies and Conflicts

    Authors: Leon Witt, Armando Teles Fortes, Kentaroh Toyoda, Wojciech Samek, Dan Li

    Abstract: Blockchain technology and Artificial Intelligence (AI) have emerged as transformative forces in their respective domains. This paper explores synergies and challenges between these two technologies. Our research analyses the biggest projects combining blockchain and AI, based on market capitalization, and derives a novel framework to categorize contemporary and future use cases. Despite the theore…

    Submitted 22 May, 2024; originally announced May 2024.

  2. arXiv:2404.10433  [pdf, other]

    cs.CV cs.AI cs.LG

    Explainable concept mappings of MRI: Revealing the mechanisms underlying deep learning-based brain disease classification

    Authors: Christian Tinauer, Anna Damulina, Maximilian Sackl, Martin Soellradl, Reduan Achtibat, Maximilian Dreyer, Frederik Pahde, Sebastian Lapuschkin, Reinhold Schmidt, Stefan Ropele, Wojciech Samek, Christian Langkammer

    Abstract: Motivation. While recent studies show high accuracy in the classification of Alzheimer's disease using deep neural networks, the underlying learned concepts have not been investigated. Goals. To systematically identify changes in brain regions through concepts learned by the deep neural network for model validation. Approach. Using quantitative R2* maps we separated Alzheimer's patients (n=117…

    Submitted 16 April, 2024; originally announced April 2024.

  3. arXiv:2404.09601  [pdf, other]

    cs.LG cs.AI cs.CV

    Reactive Model Correction: Mitigating Harm to Task-Relevant Features via Conditional Bias Suppression

    Authors: Dilyara Bareeva, Maximilian Dreyer, Frederik Pahde, Wojciech Samek, Sebastian Lapuschkin

    Abstract: Deep Neural Networks are prone to learning and relying on spurious correlations in the training data, which, for high-risk applications, can have fatal consequences. Various approaches to suppress model reliance on harmful features have been proposed that can be applied post-hoc without additional training. Whereas those methods can be applied with efficiency, they also tend to harm model performa…

    Submitted 15 April, 2024; originally announced April 2024.

  4. arXiv:2404.06453  [pdf, other]

    cs.CV cs.AI cs.LG

    PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits

    Authors: Maximilian Dreyer, Erblina Purelku, Johanna Vielhaben, Wojciech Samek, Sebastian Lapuschkin

    Abstract: The field of mechanistic interpretability aims to study the role of individual neurons in Deep Neural Networks. Single neurons, however, have the capability to act polysemantically and encode for multiple (unrelated) features, which renders their interpretation difficult. We present a method for disentangling polysemanticity of any Deep Neural Network by decomposing a polysemantic neuron into mult…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 14 pages (4 pages manuscript, 2 pages references, 8 pages appendix)

  5. arXiv:2402.13914  [pdf, other]

    cs.AI cs.CR cs.LG

    Position: Explain to Question not to Justify

    Authors: Przemyslaw Biecek, Wojciech Samek

    Abstract: Explainable Artificial Intelligence (XAI) is a young but very promising field of research. Unfortunately, the progress in this field is currently slowed down by divergent and incompatible goals. We separate various threads tangled within the area of XAI into two complementary cultures of human/value-oriented explanations (BLUE XAI) and model/validation-oriented explanations (RED XAI). This positio…

    Submitted 28 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  6. arXiv:2402.12118  [pdf, other]

    cs.LG cs.AI

    DualView: Data Attribution from the Dual Perspective

    Authors: Galip Ümit Yolcu, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: Local data attribution (or influence estimation) techniques aim at estimating the impact that individual data points seen during training have on particular predictions of an already trained Machine Learning model during test time. Previous methods either do not perform well consistently across different evaluation criteria from literature, are characterized by a high computational demand, or suff…

    Submitted 19 February, 2024; originally announced February 2024.

  7. arXiv:2402.05602  [pdf, other]

    cs.CL cs.AI cs.CV cs.LG

    AttnLRP: Attention-Aware Layer-Wise Relevance Propagation for Transformers

    Authors: Reduan Achtibat, Sayed Mohammad Vakilzadeh Hatefi, Maximilian Dreyer, Aakriti Jain, Thomas Wiegand, Sebastian Lapuschkin, Wojciech Samek

    Abstract: Large Language Models are prone to biased predictions and hallucinations, underlining the paramount importance of understanding their model-internal reasoning process. However, achieving faithful attributions for the entirety of a black-box transformer model and maintaining computational efficiency is an unsolved challenge. By extending the Layer-wise Relevance Propagation attribution method to ha…

    Submitted 10 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  8. arXiv:2401.17441  [pdf, other]

    cs.LG cs.AI stat.ML

    Explaining Predictive Uncertainty by Exposing Second-Order Effects

    Authors: Florian Bley, Sebastian Lapuschkin, Wojciech Samek, Grégoire Montavon

    Abstract: Explainable AI has brought transparency into complex ML blackboxes, enabling, in particular, to identify which features these models use for their predictions. So far, the question of explaining predictive uncertainty, i.e. why a model 'doubts', has been scarcely studied. Our investigation reveals that predictive uncertainty is dominated by second-order effects, involving single features or produc…

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: 12 pages + supplement

  9. arXiv:2311.16681  [pdf, other]

    cs.CV cs.AI

    Understanding the (Extra-)Ordinary: Validating Deep Model Decisions with Prototypical Concept-based Explanations

    Authors: Maximilian Dreyer, Reduan Achtibat, Wojciech Samek, Sebastian Lapuschkin

    Abstract: Ensuring both transparency and safety is critical when deploying Deep Neural Networks (DNNs) in high-risk applications, such as medicine. The field of explainable AI (XAI) has proposed various methods to comprehend the decision-making processes of opaque DNNs. However, only few XAI methods are suitable of ensuring safety in practice as they heavily rely on repeated labor-intensive and possibly bia…

    Submitted 29 April, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: 39 pages (8 pages manuscript, 3 pages references, 28 pages appendix)

  10. arXiv:2311.13028  [pdf, other]

    cs.LG cs.AI cs.DC eess.SP

    DMLR: Data-centric Machine Learning Research -- Past, Present and Future

    Authors: Luis Oala, Manil Maskey, Lilith Bat-Leah, Alicia Parrish, Nezihe Merve Gürel, Tzu-Sheng Kuo, Yang Liu, Rotem Dror, Danilo Brajovic, Xiaozhe Yao, Max Bartolo, William A Gaviria Rojas, Ryan Hileman, Rainier Aliment, Michael W. Mahoney, Meg Risdal, Matthew Lease, Wojciech Samek, Debojyoti Dutta, Curtis G Northcutt, Cody Coleman, Braden Hancock, Bernard Koch, Girmaw Abebe Tadesse, Bojan Karlaš, et al. (13 additional authors not shown)

    Abstract: Drawing from discussions at the inaugural DMLR workshop at ICML 2023 and meetings prior, in this report we outline the relevance of community engagement and infrastructure development for the creation of next-generation public datasets that will advance machine learning science. We chart a path forward as a collective effort to sustain the creation and maintenance of these datasets and methods tow…

    Submitted 1 June, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: Published in the Journal of Data-centric Machine Learning Research (DMLR) at https://data.mlr.press/assets/pdf/v01-5.pdf

  11. Explainable Artificial Intelligence (XAI) 2.0: A Manifesto of Open Challenges and Interdisciplinary Research Directions

    Authors: Luca Longo, Mario Brcic, Federico Cabitza, Jaesik Choi, Roberto Confalonieri, Javier Del Ser, Riccardo Guidotti, Yoichi Hayashi, Francisco Herrera, Andreas Holzinger, Richard Jiang, Hassan Khosravi, Freddy Lecue, Gianclaudio Malgieri, Andrés Páez, Wojciech Samek, Johannes Schneider, Timo Speith, Simone Stumpf

    Abstract: As systems based on opaque Artificial Intelligence (AI) continue to flourish in diverse real-world applications, understanding these black box models has become paramount. In response, Explainable AI (XAI) has emerged as a field of research with practical and ethical benefits across various domains. This paper not only highlights the advancements in XAI and its application in real-world scenarios…

    Submitted 30 October, 2023; originally announced October 2023.

    ACM Class: F.2.0; H.1.2; I.2; I.2.6; K.4; K.5

    Journal ref: Information Fusion 2024

  12. arXiv:2310.17638  [pdf, other]

    cs.LG stat.ML

    Generative Fractional Diffusion Models

    Authors: Gabriel Nobis, Maximilian Springenberg, Marco Aversa, Michael Detzel, Rembert Daems, Roderick Murray-Smith, Shinichi Nakajima, Sebastian Lapuschkin, Stefano Ermon, Tolga Birdal, Manfred Opper, Christoph Knochenhauer, Luis Oala, Wojciech Samek

    Abstract: We introduce the first continuous-time score-based generative model that leverages fractional diffusion processes for its underlying dynamics. Although diffusion models have excelled at capturing data distributions, they still suffer from various limitations such as slow convergence, mode-collapse on imbalanced data, and lack of diversity. These issues are partially linked to the use of light-tail…

    Submitted 24 June, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    ACM Class: I.2.4; F.4.1; G.3

  13. Human-Centered Evaluation of XAI Methods

    Authors: Karam Dawoud, Wojciech Samek, Peter Eisert, Sebastian Lapuschkin, Sebastian Bosse

    Abstract: In the ever-evolving field of Artificial Intelligence, a critical challenge has been to decipher the decision-making processes within the so-called "black boxes" in deep learning. Over recent years, a plethora of methods have emerged, dedicated to explaining decisions across diverse tasks. Particularly in tasks like image classification, these methods typically identify and emphasize the pivotal p…

    Submitted 16 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Journal ref: ICDMW (2023) 912-921

  14. arXiv:2308.12053  [pdf, other]

    cs.LG cs.AI cs.NE

    Layer-wise Feedback Propagation

    Authors: Leander Weber, Jim Berend, Alexander Binder, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: In this paper, we present Layer-wise Feedback Propagation (LFP), a novel training approach for neural-network-like predictors that utilizes explainability, specifically Layer-wise Relevance Propagation (LRP), to assign rewards to individual connections based on their respective contributions to solving a given task. This differs from traditional gradient descent, which updates parameters towards an…

    Submitted 23 August, 2023; originally announced August 2023.

    MSC Class: 68T05

  15. arXiv:2308.09437  [pdf, other]

    cs.LG cs.AI cs.CV cs.CY

    From Hope to Safety: Unlearning Biases of Deep Models via Gradient Penalization in Latent Space

    Authors: Maximilian Dreyer, Frederik Pahde, Christopher J. Anders, Wojciech Samek, Sebastian Lapuschkin

    Abstract: Deep Neural Networks are prone to learning spurious correlations embedded in the training data, leading to potentially biased predictions. This poses risks when deploying these models for high-stake decision-making, such as in medical applications. Current methods for post-hoc model correction either require input-level annotations which are only possible for spatially localized biases, or augment…

    Submitted 18 December, 2023; v1 submitted 18 August, 2023; originally announced August 2023.

    Comments: 35 pages (9 pages manuscript, 2 pages references, 24 pages appendix)

  16. arXiv:2306.13384  [pdf, other]

    eess.IV cs.CV cs.LG

    DiffInfinite: Large Mask-Image Synthesis via Parallel Random Patch Diffusion in Histopathology

    Authors: Marco Aversa, Gabriel Nobis, Miriam Hägele, Kai Standvoss, Mihaela Chirica, Roderick Murray-Smith, Ahmed Alaa, Lukas Ruff, Daniela Ivanova, Wojciech Samek, Frederick Klauschen, Bruno Sanguinetti, Luis Oala

    Abstract: We present DiffInfinite, a hierarchical diffusion model that generates arbitrarily large histological images while preserving long-range correlation structural information. Our approach first generates synthetic segmentation masks, subsequently used as conditions for the high-fidelity generative diffusion process. The proposed sampling method can be scaled up to any desired image size while only r…

    Submitted 25 October, 2023; v1 submitted 23 June, 2023; originally announced June 2023.

  17. arXiv:2304.14019  [pdf, other]

    cs.SD cs.CV cs.LG eess.AS

    XAI-based Comparison of Input Representations for Audio Event Classification

    Authors: Annika Frommholz, Fabian Seipel, Sebastian Lapuschkin, Wojciech Samek, Johanna Vielhaben

    Abstract: Deep neural networks are a promising tool for Audio Event Classification. In contrast to other data like natural images, there are many sensible and non-obvious representations for audio data, which could serve as input to these models. Due to their black-box nature, the effect of different input representations has so far mostly been investigated by measuring classification performance. In this w…

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: 7 pages, 4 figures

  18. arXiv:2303.12641  [pdf, other]

    cs.CV cs.AI

    Reveal to Revise: An Explainable AI Life Cycle for Iterative Bias Correction of Deep Models

    Authors: Frederik Pahde, Maximilian Dreyer, Wojciech Samek, Sebastian Lapuschkin

    Abstract: State-of-the-art machine learning models often learn spurious correlations embedded in the training data. This poses risks when deploying these models for high-stake decision-making, such as in medical applications like skin cancer detection. To tackle this problem, we propose Reveal to Revise (R2R), a framework entailing the entire eXplainable Artificial Intelligence (XAI) life cycle, enabling pr…

    Submitted 27 March, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

  19. arXiv:2303.06365  [pdf, other]

    cs.LG cs.AI cs.CV

    Explainable AI for Time Series via Virtual Inspection Layers

    Authors: Johanna Vielhaben, Sebastian Lapuschkin, Grégoire Montavon, Wojciech Samek

    Abstract: The field of eXplainable Artificial Intelligence (XAI) has greatly advanced in recent years, but progress has mainly been made in computer vision and natural language processing. For time series, where the input is often not interpretable, only limited research on XAI is available. In this work, we put forward a virtual inspection layer, that transforms the time series to an interpretable represen…

    Submitted 11 March, 2023; originally announced March 2023.

    Comments: 13 pages, 7 figures

  20. arXiv:2303.04689  [pdf, other]

    cs.IR cs.CR cs.LG

    A Privacy Preserving System for Movie Recommendations Using Federated Learning

    Authors: David Neumann, Andreas Lutz, Karsten Müller, Wojciech Samek

    Abstract: Recommender systems have become ubiquitous in the past years. They solve the tyranny of choice problem faced by many users, and are utilized by many online businesses to drive engagement and sales. Besides other criticisms, like creating filter bubbles within social networks, recommender systems are often reproved for collecting considerable amounts of personal data. However, to personalize recomm…

    Submitted 16 May, 2024; v1 submitted 7 March, 2023; originally announced March 2023.

    Comments: Accepted for publication in the ACM Transactions on Recommender Systems (TORS) Special Issue on Trustworthy Recommender Systems

  21. arXiv:2302.07265  [pdf, other]

    cs.LG cs.AI

    The Meta-Evaluation Problem in Explainable AI: Identifying Reliable Estimators with MetaQuantus

    Authors: Anna Hedström, Philine Bommer, Kristoffer K. Wickstrøm, Wojciech Samek, Sebastian Lapuschkin, Marina M. -C. Höhne

    Abstract: One of the unsolved challenges in the field of Explainable AI (XAI) is determining how to most reliably estimate the quality of an explanation method in the absence of ground truth explanation labels. Resolving this issue is of utmost importance as the evaluation outcomes generated by competing evaluation methods (or ''quality estimators''), which aim at measuring the same property of an explanati…

    Submitted 19 July, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 35 pages, 15 figures, 5 tables

    Journal ref: Transactions on Machine Learning Research, Volume 2023, (2023), ISSN: 2835-8856

  22. arXiv:2211.17174  [pdf, other]

    cs.CV cs.AI cs.LG

    Optimizing Explanations by Network Canonization and Hyperparameter Search

    Authors: Frederik Pahde, Galip Ümit Yolcu, Alexander Binder, Wojciech Samek, Sebastian Lapuschkin

    Abstract: Explainable AI (XAI) is slowly becoming a key component for many AI applications. Rule-based and modified backpropagation XAI approaches however often face challenges when being applied to modern model architectures including innovative layer building blocks, which is caused by two reasons. Firstly, the high flexibility of rule-based XAI methods leads to numerous potential parameterizations. Secon…

    Submitted 27 March, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

  23. Explaining machine learning models for age classification in human gait analysis

    Authors: Djordje Slijepcevic, Fabian Horst, Marvin Simak, Sebastian Lapuschkin, Anna-Maria Raberger, Wojciech Samek, Christian Breiteneder, Wolfgang I. Schöllhorn, Matthias Zeppelzauer, Brian Horsak

    Abstract: Machine learning (ML) models have proven effective in classifying gait analysis data, e.g., binary classification of young vs. older adults. ML models, however, lack in providing human understandable explanations for their predictions. This "black-box" behavior impedes the understanding of which input features the model predictions are based on. We investigated an Explainable Artificial Intelligen…

    Submitted 16 October, 2022; originally announced November 2022.

    Comments: 3 pages, 1 figure

    Journal ref: Gait & Posture 97 (Supplement 1) (2022) 252-253

  24. Explaining automated gender classification of human gait

    Authors: Fabian Horst, Djordje Slijepcevic, Matthias Zeppelzauer, Anna-Maria Raberger, Sebastian Lapuschkin, Wojciech Samek, Wolfgang I. Schöllhorn, Christian Breiteneder, Brian Horsak

    Abstract: State-of-the-art machine learning (ML) models are highly effective in classifying gait analysis data, however, they lack in providing explanations for their predictions. This "black-box" characteristic makes it impossible to understand on which input patterns, ML models base their predictions. The present study investigates whether Explainable Artificial Intelligence methods, i.e., Layer-wise Rele…

    Submitted 16 October, 2022; originally announced November 2022.

    Comments: 3 pages, 1 figure

    Journal ref: Gait & Posture 81 (Supplement 1) (2020) 159-160

  25. arXiv:2211.12486  [pdf, other]

    cs.LG cs.CV

    Shortcomings of Top-Down Randomization-Based Sanity Checks for Evaluations of Deep Neural Network Explanations

    Authors: Alexander Binder, Leander Weber, Sebastian Lapuschkin, Grégoire Montavon, Klaus-Robert Müller, Wojciech Samek

    Abstract: While the evaluation of explanations is an important step towards trustworthy models, it needs to be done carefully, and the employed metrics need to be well-understood. Specifically model randomization testing is often overestimated and regarded as a sole criterion for selecting or discarding certain explanation methods. To address shortcomings of this test, we start by observing an experimental…

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: 23 pages

  26. arXiv:2211.11426  [pdf, other]

    cs.CV cs.AI cs.LG

    Revealing Hidden Context Bias in Segmentation and Object Detection through Concept-specific Explanations

    Authors: Maximilian Dreyer, Reduan Achtibat, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: Applying traditional post-hoc attribution methods to segmentation or object detection predictors offers only limited insights, as the obtained feature attribution maps at input level typically resemble the models' predicted segmentation mask or bounding box. In this work, we address the need for more informative explanations for these predictors by proposing the post-hoc eXplainable Artificial Int…

    Submitted 21 November, 2022; originally announced November 2022.

  27. arXiv:2211.02578  [pdf]

    cs.LG cs.AI cs.CV

    Data Models for Dataset Drift Controls in Machine Learning With Optical Images

    Authors: Luis Oala, Marco Aversa, Gabriel Nobis, Kurt Willis, Yoan Neuenschwander, Michèle Buck, Christian Matek, Jerome Extermann, Enrico Pomarico, Wojciech Samek, Roderick Murray-Smith, Christoph Clausen, Bruno Sanguinetti

    Abstract: Camera images are ubiquitous in machine learning research. They also play a central role in the delivery of important services spanning medicine and environmental surveying. However, the application of machine learning models in these domains has been limited because of robustness concerns. A primary failure mode are performance drops due to differences between the training and deployment data. Wh…

    Submitted 7 May, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: Published as a journal paper in the Transactions on Machine Learning Research 2023 (TMLR) available at https://openreview.net/forum?id=I4IkGmgFJz

  28. From Attribution Maps to Human-Understandable Explanations through Concept Relevance Propagation

    Authors: Reduan Achtibat, Maximilian Dreyer, Ilona Eisenbraun, Sebastian Bosse, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: The field of eXplainable Artificial Intelligence (XAI) aims to bring transparency to today's powerful but opaque deep learning models. While local XAI methods explain individual predictions in form of attribution maps, thereby identifying where important features occur (but not providing information about what they represent), global explanation techniques visualize what concepts a model has gener…

    Submitted 6 January, 2024; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: 87 pages (13 pages manuscript, 8 pages references, 66 pages appendix) 63 figures (6 in manuscript, 57 in appendix) 3 tables (in appendix)

    Journal ref: Nature Machine Intelligence (year 2023, volume 5, pages 1006-1019)

  29. arXiv:2205.14960  [pdf, other]

    cs.LG cs.DC

    FedAUXfdp: Differentially Private One-Shot Federated Distillation

    Authors: Haley Hoech, Roman Rischke, Karsten Müller, Wojciech Samek

    Abstract: Federated learning suffers in the case of non-iid local datasets, i.e., when the distributions of the clients' data are heterogeneous. One promising approach to this challenge is the recently proposed method FedAUX, an augmentation of federated distillation with robust results on even highly heterogeneous client data. FedAUX is a partially $(ε, δ)$-differentially private method, insofar as the cli…

    Submitted 21 June, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

  30. arXiv:2205.07855  [pdf, other]

    cs.LG cs.AI cs.DC

    Decentral and Incentivized Federated Learning Frameworks: A Systematic Literature Review

    Authors: Leon Witt, Mathis Heyer, Kentaroh Toyoda, Wojciech Samek, Dan Li

    Abstract: The advent of Federated Learning (FL) has ignited a new paradigm for parallel and confidential decentralized Machine Learning (ML) with the potential of utilizing the computational power of a vast number of IoT, mobile and edge devices without data leaving the respective device, ensuring privacy by design. Yet, in order to scale this new paradigm beyond small groups of already entrusted entities t…

    Submitted 5 December, 2022; v1 submitted 7 May, 2022; originally announced May 2022.

    Comments: submitted to IEEE IOTJ

  31. arXiv:2205.01929  [pdf, other]

    cs.LG

    Explain to Not Forget: Defending Against Catastrophic Forgetting with XAI

    Authors: Sami Ede, Serop Baghdadlian, Leander Weber, An Nguyen, Dario Zanca, Wojciech Samek, Sebastian Lapuschkin

    Abstract: The ability to continuously process and retain new information like we do naturally as humans is a feat that is highly sought after when training neural networks. Unfortunately, the traditional optimization algorithms often require large amounts of data available during training time and updates wrt. new data are difficult after the training process has been completed. In fact, when new data or ta…

    Submitted 22 June, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: 14 pages including appendix, 5 figures, 2 tables, 1 algorithm listing. v2 update increases figure readability, updates Fig 5 caption, adds our collaborators Dario and An as co-authors v3 brings the preprint in line with the final version accepted for peer-reviewed publication at CD-MAKE 2022. v4 metadata update

  32. arXiv:2204.04424  [pdf, other]

    cs.LG cs.AI cs.CV cs.DC

    Adaptive Differential Filters for Fast and Communication-Efficient Federated Learning

    Authors: Daniel Becking, Heiner Kirchhoffer, Gerhard Tech, Paul Haase, Karsten Müller, Heiko Schwarz, Wojciech Samek

    Abstract: Federated learning (FL) scenarios inherently generate a large communication overhead by frequently transmitting neural network updates between clients and server. To minimize the communication cost, introducing sparsity in conjunction with differential updates is a commonly used technique. However, sparse model updates can slow down convergence speed or unintentionally skip certain update aspects,…

    Submitted 9 April, 2022; originally announced April 2022.

    Comments: CVPR 2022 FedVision Workshop (CVPRW), 12 pages, 5 figures, 2 tables, supplementary material

  33. arXiv:2203.08008  [pdf, other]

    cs.LG

    Beyond Explaining: Opportunities and Challenges of XAI-Based Model Improvement

    Authors: Leander Weber, Sebastian Lapuschkin, Alexander Binder, Wojciech Samek

    Abstract: Explainable Artificial Intelligence (XAI) is an emerging research field bringing transparency to highly complex and opaque machine learning (ML) models. Despite the development of a multitude of methods to explain the decisions of black-box classifiers in recent years, these tools are seldomly used beyond visualization purposes. Only recently, researchers have started to employ explanations in pra…

    Submitted 15 March, 2022; originally announced March 2022.

  34. arXiv:2202.06861  [pdf, other]

    cs.LG

    Quantus: An Explainable AI Toolkit for Responsible Evaluation of Neural Network Explanations and Beyond

    Authors: Anna Hedström, Leander Weber, Dilyara Bareeva, Daniel Krakowczyk, Franz Motzkus, Wojciech Samek, Sebastian Lapuschkin, Marina M. -C. Höhne

    Abstract: The evaluation of explanation methods is a research topic that has not yet been explored deeply, however, since explainability is supposed to strengthen trust in artificial intelligence, it is necessary to systematically review and compare explanation methods in order to confirm their correctness. Until now, no tool with focus on XAI evaluation exists that exhaustively and speedily allows research…

    Submitted 27 April, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: 4 pages, 1 figure, 1 table

    Journal ref: Journal of Machine Learning Research, Vol. 24 (2023) 1-11

  35. arXiv:2202.03482  [pdf, other]

    cs.CV cs.AI cs.LG

    Navigating Neural Space: Revisiting Concept Activation Vectors to Overcome Directional Divergence

    Authors: Frederik Pahde, Maximilian Dreyer, Leander Weber, Moritz Weckbecker, Christopher J. Anders, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: With a growing interest in understanding neural network prediction strategies, Concept Activation Vectors (CAVs) have emerged as a popular tool for modeling human-understandable concepts in the latent space. Commonly, CAVs are computed by leveraging linear classifiers optimizing the separability of latent representations of samples with and without a given concept. However, in this paper we show t…

    Submitted 5 February, 2024; v1 submitted 7 February, 2022; originally announced February 2022.

  36. arXiv:2112.11407  [pdf, other]

    cs.LG cs.AI stat.ML

    Toward Explainable AI for Regression Models

    Authors: Simon Letzgus, Patrick Wagner, Jonas Lederer, Wojciech Samek, Klaus-Robert Müller, Gregoire Montavon

    Abstract: In addition to the impressive predictive power of machine learning (ML) models, more recently, explanation methods have emerged that enable an interpretation of complex non-linear learning models such as deep neural networks. Gaining a better understanding is especially important e.g. for safety-critical ML applications or medical diagnostics etc. While such Explainable AI (XAI) techniques have re…

    Submitted 17 January, 2023; v1 submitted 21 December, 2021; originally announced December 2021.

    Comments: 17 pages, 10 figures, published; changes: 1. references to code and xai-regression.org added (p. 1/2, end of introduction), 2. adjustment of sign-error in restructuring section (p. 8, just above Fig. 4)

    Journal ref: IEEE Signal Processing Magazine (Volume: 39, Issue: 4, July 2022) 40-58

  37. arXiv:2111.01562  [pdf, other]

    q-bio.NC cs.LG

    Evaluating deep transfer learning for whole-brain cognitive decoding

    Authors: Armin W. Thomas, Ulman Lindenberger, Wojciech Samek, Klaus-Robert Müller

    Abstract: Research in many fields has shown that transfer learning (TL) is well-suited to improve the performance of deep learning (DL) models in datasets with small numbers of samples. This empirical success has triggered interest in the application of TL to cognitive decoding analyses with functional neuroimaging data. Here, we systematically evaluate TL for the application of DL models to the decoding of…

    Submitted 1 November, 2021; originally announced November 2021.

  38. ECQ$^{\text{x}}$: Explainability-Driven Quantization for Low-Bit and Sparse DNNs

    Authors: Daniel Becking, Maximilian Dreyer, Wojciech Samek, Karsten Müller, Sebastian Lapuschkin

    Abstract: The remarkable success of deep neural networks (DNNs) in various applications is accompanied by a significant increase in network parameters and arithmetic operations. Such increases in memory and computational demands make deep learning prohibitive for resource-constrained hardware platforms such as mobile devices. Recent efforts aim to reduce these overheads, while preserving model performance a…

    Submitted 16 February, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: 22 pages, 10 figures, 1 table

    Journal ref: xxAI - Beyond Explainable AI, Lecture Notes in Computer Science (LNAI Vol. 13200), Springer International Publishing, 2022

  39. arXiv:2106.14265  [pdf, other]

    cs.LG cs.CR cs.DC

    Reward-Based 1-bit Compressed Federated Distillation on Blockchain

    Authors: Leon Witt, Usama Zafar, KuoYeh Shen, Felix Sattler, Dan Li, Wojciech Samek

    Abstract: The recent advent of various forms of Federated Knowledge Distillation (FD) paves the way for a new generation of robust and communication-efficient Federated Learning (FL), where mere soft-labels are aggregated, rather than whole gradients of Deep Neural Networks (DNN) as done in previous FL schemes. This security-per-design approach in combination with increasingly performant Internet of Things…

    Submitted 27 June, 2021; originally announced June 2021.

  40. arXiv:2106.13497  [pdf, other]

    cs.CV

    On the Robustness of Pretraining and Self-Supervision for a Deep Learning-based Analysis of Diabetic Retinopathy

    Authors: Vignesh Srinivasan, Nils Strodthoff, Jackie Ma, Alexander Binder, Klaus-Robert Müller, Wojciech Samek

    Abstract: There is an increasing number of medical use-cases where classification algorithms based on deep neural networks reach performance levels that are competitive with human medical experts. To alleviate the challenges of small dataset sizes, these systems often rely on pretraining. In this work, we aim to assess the broader implications of these approaches. For diabetic retinopathy grading as exempla…

    Submitted 25 June, 2021; originally announced June 2021.

  41. arXiv:2106.13200  [pdf, other]

    cs.LG

    Software for Dataset-wide XAI: From Local Explanations to Global Insights with Zennit, CoRelAy, and ViRelAy

    Authors: Christopher J. Anders, David Neumann, Wojciech Samek, Klaus-Robert Müller, Sebastian Lapuschkin

    Abstract: Deep Neural Networks (DNNs) are known to be strong predictors, but their prediction strategies can rarely be understood. With recent advances in Explainable Artificial Intelligence (XAI), approaches are available to explore the reasoning behind those complex models' predictions. Among post-hoc attribution methods, Layer-wise Relevance Propagation (LRP) shows high performance. For deeper quantitati…

    Submitted 28 February, 2023; v1 submitted 24 June, 2021; originally announced June 2021.

    Comments: 20 pages, 6 figures, 2 listings, 1 table

  42. arXiv:2102.02514  [pdf, other]

    cs.LG cs.DC stat.ML

    FedAUX: Leveraging Unlabeled Auxiliary Data in Federated Learning

    Authors: Felix Sattler, Tim Korjakow, Roman Rischke, Wojciech Samek

    Abstract: Federated Distillation (FD) is a popular novel algorithmic paradigm for Federated Learning, which achieves training performance competitive to prior parameter averaging based methods, while additionally allowing the clients to train different model architectures, by distilling the client predictions on an unlabeled auxiliary set of data into a student model. In this work we propose FedAUX, an exte…

    Submitted 4 February, 2021; originally announced February 2021.

  43. arXiv:2012.11331  [pdf, other]

    cs.AR cs.LG

    FantastIC4: A Hardware-Software Co-Design Approach for Efficiently Running 4bit-Compact Multilayer Perceptrons

    Authors: Simon Wiedemann, Suhas Shivapakash, Pablo Wiedemann, Daniel Becking, Wojciech Samek, Friedel Gerfers, Thomas Wiegand

    Abstract: With the growing demand for deploying deep learning models to the "edge", it is paramount to develop techniques that allow executing state-of-the-art models within very tight and limited resource constraints. In this work we propose a software-hardware optimization paradigm for obtaining a highly efficient execution engine of deep neural networks (DNNs) that are based on fully-connected layers. O…

    Submitted 17 December, 2020; originally announced December 2020.

  44. arXiv:2012.00632  [pdf, other]

    cs.LG cs.AI stat.ML

    Communication-Efficient Federated Distillation

    Authors: Felix Sattler, Arturo Marban, Roman Rischke, Wojciech Samek

    Abstract: Communication constraints are one of the major challenges preventing the wide-spread adoption of Federated Learning systems. Recently, Federated Distillation (FD), a new algorithmic paradigm for Federated Learning with fundamentally different communication properties, emerged. FD methods leverage ensemble distillation techniques and exchange model outputs, presented as soft labels on an unlabeled…

    Submitted 1 December, 2020; originally announced December 2020.

  45. arXiv:2009.11732  [pdf, other]

    cs.LG cs.AI stat.ML

    A Unifying Review of Deep and Shallow Anomaly Detection

    Authors: Lukas Ruff, Jacob R. Kauffmann, Robert A. Vandermeulen, Grégoire Montavon, Wojciech Samek, Marius Kloft, Thomas G. Dietterich, Klaus-Robert Müller

    Abstract: Deep learning approaches to anomaly detection have recently improved the state of the art in detection performance on complex datasets such as large collections of images or text. These results have sparked a renewed interest in the anomaly detection problem and led to the introduction of a great variety of new methods. With the emergence of numerous such methods, including approaches based on gen…

    Submitted 8 February, 2021; v1 submitted 24 September, 2020; originally announced September 2020.

    Comments: 40 pages; accepted for publication in the Proceedings of the IEEE

    Journal ref: Proceedings of the IEEE (2021) 1-40

  46. arXiv:2008.13723  [pdf, other]

    cs.LG stat.ML

    Langevin Cooling for Domain Translation

    Authors: Vignesh Srinivasan, Klaus-Robert Müller, Wojciech Samek, Shinichi Nakajima

    Abstract: Domain translation is the task of finding correspondence between two domains. Several Deep Neural Network (DNN) models, e.g., CycleGAN and cross-lingual language models, have shown remarkable successes on this task under the unsupervised setting---the mappings between the domains are learned from two independent sets of training data in both domains (without paired samples). However, those methods…

    Submitted 31 August, 2020; originally announced August 2020.

  47. arXiv:2007.08790  [pdf, other]

    cs.CV cs.LG

    Explanation-Guided Training for Cross-Domain Few-Shot Classification

    Authors: Jiamei Sun, Sebastian Lapuschkin, Wojciech Samek, Yunqing Zhao, Ngai-Man Cheung, Alexander Binder

    Abstract: Cross-domain few-shot classification task (CD-FSC) combines few-shot classification with the requirement to generalize across domains represented by datasets. This setup faces challenges originating from the limited labeled data in each class and, additionally, from the domain shift between training and test sets. In this paper, we introduce a novel training approach for existing FSC models. It le…

    Submitted 9 December, 2020; v1 submitted 17 July, 2020; originally announced July 2020.

    Journal ref: Proceedings of the 25th International Conference on Pattern Recognition 2021

  48. arXiv:2006.08368  [pdf]

    cs.CY eess.SP

    Sensor Artificial Intelligence and its Application to Space Systems -- A White Paper

    Authors: Anko Börner, Heinz-Wilhelm Hübers, Odej Kao, Florian Schmidt, Sören Becker, Joachim Denzler, Daniel Matolin, David Haber, Sergio Lucia, Wojciech Samek, Rudolph Triebel, Sascha Eichstädt, Felix Biessmann, Anna Kruspe, Peter Jung, Manon Kok, Guillermo Gallego, Ralf Berger

    Abstract: Information and communication technologies have accompanied our everyday life for years. A steadily increasing number of computers, cameras, mobile devices, etc. generate more and more data, but at the same time we realize that the data can only partially be analyzed with classical approaches. The research and development of methods based on artificial intelligence (AI) made enormous progress in t…

    Submitted 9 June, 2020; originally announced June 2020.

    Comments: 4 pages. 1st Workshop on Sensor Artificial Intelligence, Apr. 2020, Berlin, Germany

  49. arXiv:2006.07767  [pdf, ps, other]

    cs.LG stat.ML

    MixMOOD: A systematic approach to class distribution mismatch in semi-supervised learning using deep dataset dissimilarity measures

    Authors: Saul Calderon-Ramirez, Luis Oala, Jordina Torrents-Barrena, Shengxiang Yang, Armaghan Moemeni, Wojciech Samek, Miguel A. Molina-Cabello

    Abstract: In this work, we propose MixMOOD - a systematic approach to mitigate the effect of class distribution mismatch in semi-supervised deep learning (SSDL) with MixMatch. This work is divided into two components: (i) an extensive out of distribution (OOD) ablation test bed for SSDL and (ii) a quantitative unlabelled dataset selection heuristic referred to as MixMOOD. In the first part, we analyze the sensi…

    Submitted 13 June, 2020; originally announced June 2020.

    Comments: The first two authors made equal contribution

    ACM Class: I.5.2

  50. arXiv:2004.13701  [pdf, other]

    cs.LG stat.ML

    Deep Learning for ECG Analysis: Benchmarks and Insights from PTB-XL

    Authors: Nils Strodthoff, Patrick Wagner, Tobias Schaeffter, Wojciech Samek

    Abstract: Electrocardiography is a very common, non-invasive diagnostic procedure and its interpretation is increasingly supported by automatic interpretation algorithms. The progress in the field of automatic ECG interpretation has up to now been hampered by a lack of appropriate datasets for training as well as a lack of well-defined evaluation procedures to ensure comparability of different algorithms. T…

    Submitted 28 April, 2020; originally announced April 2020.

    Comments: 12 pages, 8 figures