Skip to main content

Showing 1–4 of 4 results for author: Bareeva, D

.
  1. arXiv:2404.09601  [pdf, other

    cs.LG cs.AI cs.CV

    Reactive Model Correction: Mitigating Harm to Task-Relevant Features via Conditional Bias Suppression

    Authors: Dilyara Bareeva, Maximilian Dreyer, Frederik Pahde, Wojciech Samek, Sebastian Lapuschkin

    Abstract: Deep Neural Networks are prone to learning and relying on spurious correlations in the training data, which, for high-risk applications, can have fatal consequences. Various approaches to suppress model reliance on harmful features have been proposed that can be applied post-hoc without additional training. Whereas those methods can be applied with efficiency, they also tend to harm model performa… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  2. arXiv:2401.06122  [pdf, other

    cs.LG cs.AI cs.CV

    Manipulating Feature Visualizations with Gradient Slingshots

    Authors: Dilyara Bareeva, Marina M. -C. Höhne, Alexander Warnecke, Lukas Pirch, Klaus-Robert Müller, Konrad Rieck, Kirill Bykov

    Abstract: Deep Neural Networks (DNNs) are capable of learning complex and versatile representations, however, the semantic nature of the learned concepts remains unknown. A common method used to explain the concepts learned by DNNs is Activation Maximization (AM), which generates a synthetic input signal that maximally activates a particular neuron in the network. In this paper, we investigate the vulnerabi… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  3. arXiv:2303.00652  [pdf, other

    cs.LG cs.AI

    Finding the right XAI method -- A Guide for the Evaluation and Ranking of Explainable AI Methods in Climate Science

    Authors: Philine Bommer, Marlene Kretschmer, Anna Hedström, Dilyara Bareeva, Marina M. -C. Höhne

    Abstract: Explainable artificial intelligence (XAI) methods shed light on the predictions of machine learning algorithms. Several different approaches exist and have already been applied in climate science. However, usually missing ground truth explanations complicate their evaluation and comparison, subsequently impeding the choice of the XAI method. Therefore, in this work, we introduce XAI evaluation in… ▽ More

    Submitted 22 March, 2024; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: 19 pages, 10 figure, accepted at AIES journal by AMS

  4. arXiv:2202.06861  [pdf, other

    cs.LG

    Quantus: An Explainable AI Toolkit for Responsible Evaluation of Neural Network Explanations and Beyond

    Authors: Anna Hedström, Leander Weber, Dilyara Bareeva, Daniel Krakowczyk, Franz Motzkus, Wojciech Samek, Sebastian Lapuschkin, Marina M. -C. Höhne

    Abstract: The evaluation of explanation methods is a research topic that has not yet been explored deeply, however, since explainability is supposed to strengthen trust in artificial intelligence, it is necessary to systematically review and compare explanation methods in order to confirm their correctness. Until now, no tool with focus on XAI evaluation exists that exhaustively and speedily allows research… ▽ More

    Submitted 27 April, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: 4 pages, 1 figure, 1 table

    Journal ref: Journal of Machine Learning Research, Vol. 24 (2023) 1-11