Showing 1–12 of 12 results for author: Anders, C J

Searching in archive cs.
  1. arXiv:2406.06150

    cs.LG quant-ph

    Physics-Informed Bayesian Optimization of Variational Quantum Circuits

    Authors: Kim A. Nicoli, Christopher J. Anders, Lena Funcke, Tobias Hartung, Karl Jansen, Stefan Kühn, Klaus-Robert Müller, Paolo Stornati, Pan Kessel, Shinichi Nakajima

    Abstract: In this paper, we propose a novel and powerful method to harness Bayesian optimization for Variational Quantum Eigensolvers (VQEs) -- a hybrid quantum-classical protocol used to approximate the ground state of a quantum Hamiltonian. Specifically, we derive a VQE-kernel which incorporates important prior information about quantum circuits: the kernel feature map of the VQE-kernel exactly matches th…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 36 pages, 17 figures, 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  2. arXiv:2310.01011

    cs.AI

    Towards Fixing Clever-Hans Predictors with Counterfactual Knowledge Distillation

    Authors: Sidney Bender, Christopher J. Anders, Pattarawatt Chormai, Heike Marxfeld, Jan Herrmann, Grégoire Montavon

    Abstract: This paper introduces a novel technique called counterfactual knowledge distillation (CFKD) to detect and remove reliance on confounders in deep learning models with the help of human expert feedback. Confounders are spurious features that models tend to rely on, which can result in unexpected errors in regulated or safety-critical domains. The paper highlights the benefit of CFKD in such domains…

    Submitted 3 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

  3. arXiv:2308.09437

    cs.LG cs.AI cs.CV cs.CY

    From Hope to Safety: Unlearning Biases of Deep Models via Gradient Penalization in Latent Space

    Authors: Maximilian Dreyer, Frederik Pahde, Christopher J. Anders, Wojciech Samek, Sebastian Lapuschkin

    Abstract: Deep Neural Networks are prone to learning spurious correlations embedded in the training data, leading to potentially biased predictions. This poses risks when deploying these models for high-stake decision-making, such as in medical applications. Current methods for post-hoc model correction either require input-level annotations which are only possible for spatially localized biases, or augment…

    Submitted 18 December, 2023; v1 submitted 18 August, 2023; originally announced August 2023.

    Comments: 35 pages (9 pages manuscript, 2 pages references, 24 pages appendix)

  4. arXiv:2302.14082

    hep-lat cs.LG physics.comp-ph

    Detecting and Mitigating Mode-Collapse for Flow-based Sampling of Lattice Field Theories

    Authors: Kim A. Nicoli, Christopher J. Anders, Tobias Hartung, Karl Jansen, Pan Kessel, Shinichi Nakajima

    Abstract: We study the consequences of mode-collapse of normalizing flows in the context of lattice field theory. Normalizing flows allow for independent sampling. For this reason, it is hoped that they can avoid the tunneling problem of local-update MCMC algorithms for multi-modal distributions. In this work, we first point out that the tunneling problem is also present for normalizing flows but is shifted…

    Submitted 3 November, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: 10 pages, 7 figures, 6 pages of supplement material

  5. arXiv:2202.03482

    cs.CV cs.AI cs.LG

    Navigating Neural Space: Revisiting Concept Activation Vectors to Overcome Directional Divergence

    Authors: Frederik Pahde, Maximilian Dreyer, Leander Weber, Moritz Weckbecker, Christopher J. Anders, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: With a growing interest in understanding neural network prediction strategies, Concept Activation Vectors (CAVs) have emerged as a popular tool for modeling human-understandable concepts in the latent space. Commonly, CAVs are computed by leveraging linear classifiers optimizing the separability of latent representations of samples with and without a given concept. However, in this paper we show t…

    Submitted 5 February, 2024; v1 submitted 7 February, 2022; originally announced February 2022.

  6. arXiv:2106.13200

    cs.LG

    Software for Dataset-wide XAI: From Local Explanations to Global Insights with Zennit, CoRelAy, and ViRelAy

    Authors: Christopher J. Anders, David Neumann, Wojciech Samek, Klaus-Robert Müller, Sebastian Lapuschkin

    Abstract: Deep Neural Networks (DNNs) are known to be strong predictors, but their prediction strategies can rarely be understood. With recent advances in Explainable Artificial Intelligence (XAI), approaches are available to explore the reasoning behind those complex models' predictions. Among post-hoc attribution methods, Layer-wise Relevance Propagation (LRP) shows high performance. For deeper quantitati…

    Submitted 28 February, 2023; v1 submitted 24 June, 2021; originally announced June 2021.

    Comments: 20 pages, 6 figures, 2 listings, 1 table

  7. arXiv:2012.10425

    cs.LG

    Towards Robust Explanations for Deep Neural Networks

    Authors: Ann-Kathrin Dombrowski, Christopher J. Anders, Klaus-Robert Müller, Pan Kessel

    Abstract: Explanation methods shed light on the decision process of black-box classifiers such as deep neural networks. But their usefulness can be compromised because they are susceptible to manipulations. With this work, we aim to enhance the resilience of explanations. We develop a unified theoretical framework for deriving bounds on the maximal manipulability of a model. Based on these theoretical insig…

    Submitted 18 December, 2020; originally announced December 2020.

  8. arXiv:2007.09969

    cs.LG stat.ML

    Fairwashing Explanations with Off-Manifold Detergent

    Authors: Christopher J. Anders, Plamen Pasliev, Ann-Kathrin Dombrowski, Klaus-Robert Müller, Pan Kessel

    Abstract: Explanation methods promise to make black-box classifiers more transparent. As a result, it is hoped that they can act as proof for a sensible, fair and trustworthy decision-making process of the algorithm and thereby increase its acceptance by the end-users. In this paper, we show both theoretically and experimentally that these hopes are presently unfounded. Specifically, we show that, for any c…

    Submitted 20 July, 2020; originally announced July 2020.

    Comments: 22 pages with 43 figures, to be published in ICML 2020

  9. arXiv:2007.07115

    hep-lat cs.LG physics.comp-ph

    Estimation of Thermodynamic Observables in Lattice Field Theories with Deep Generative Models

    Authors: Kim A. Nicoli, Christopher J. Anders, Lena Funcke, Tobias Hartung, Karl Jansen, Pan Kessel, Shinichi Nakajima, Paolo Stornati

    Abstract: In this work, we demonstrate that applying deep generative machine learning models for lattice field theory is a promising route for solving problems where Markov Chain Monte Carlo (MCMC) methods are problematic. More specifically, we show that generative models can be used to estimate the absolute value of the free energy, which is in contrast to existing MCMC-based methods which are limited to o…

    Submitted 5 January, 2021; v1 submitted 14 July, 2020; originally announced July 2020.

    Comments: 8 figures

    Journal ref: Phys. Rev. Lett. 126, 032001 (2021)

  10. arXiv:2003.07631

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Explaining Deep Neural Networks and Beyond: A Review of Methods and Applications

    Authors: Wojciech Samek, Grégoire Montavon, Sebastian Lapuschkin, Christopher J. Anders, Klaus-Robert Müller

    Abstract: With the broader and highly successful usage of machine learning in industry and the sciences, there has been a growing demand for Explainable AI. Interpretability and explanation methods for gaining a better understanding about the problem solving abilities and strategies of nonlinear Machine Learning, in particular, deep neural networks, are therefore receiving increased attention. In this work…

    Submitted 25 February, 2021; v1 submitted 17 March, 2020; originally announced March 2020.

    Comments: 30 pages, 20 figures

  11. arXiv:1912.11425

    cs.CV cs.LG cs.NE eess.IV

    Finding and Removing Clever Hans: Using Explanation Methods to Debug and Improve Deep Models

    Authors: Christopher J. Anders, Leander Weber, David Neumann, Wojciech Samek, Klaus-Robert Müller, Sebastian Lapuschkin

    Abstract: Contemporary learning models for computer vision are typically trained on very large (benchmark) datasets with millions of samples. These may, however, contain biases, artifacts, or errors that have gone unnoticed and are exploitable by the model. In the worst case, the trained model does not learn a valid and generalizable strategy to solve the problem it was trained for, and becomes a 'Clever-Ha…

    Submitted 18 December, 2020; v1 submitted 22 December, 2019; originally announced December 2019.

    Comments: 47 pages, 21 figures

  12. arXiv:1906.07983

    stat.ML cs.CR cs.LG

    Explanations can be manipulated and geometry is to blame

    Authors: Ann-Kathrin Dombrowski, Maximilian Alber, Christopher J. Anders, Marcel Ackermann, Klaus-Robert Müller, Pan Kessel

    Abstract: Explanation methods aim to make neural networks more trustworthy and interpretable. In this paper, we demonstrate a property of explanation methods which is disconcerting for both of these purposes. Namely, we show that explanations can be manipulated arbitrarily by applying visually hardly perceptible perturbations to the input that keep the network's output approximately constant. We establish t…

    Submitted 25 September, 2019; v1 submitted 19 June, 2019; originally announced June 2019.